Approach
First, we introduce an efficient test-phase computation process in which the network parameters are quantized. Second, we show that a better quantization can be learned by directly minimizing the estimation error of each layer’s response, rather than the error of the parameters themselves.
- Quantizing the Fully Connected Layer
- Quantization with Error Correction
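The test-phase computation for a quantized fully connected layer can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation. It assumes a product-quantization scheme: the input dimension is split into subspaces, each weight sub-vector is replaced by its nearest codeword from a per-subspace codebook learned with plain k-means, and the layer response is then approximated by table lookups. The function names (`kmeans`, `quantize_fc`, `quantized_forward`) and all parameter names are hypothetical. The error-correction step is not implemented here.

```python
import numpy as np


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means. Returns (centroids, assignments)."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distances between every point and every centroid: (n, k).
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        for j in range(k):
            members = points[assign == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    # Final assignment, consistent with the final centroids.
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)


def quantize_fc(W, num_subspaces, num_codewords):
    """Quantize a weight matrix W of shape (d_in, d_out).

    Returns codebooks of shape (M, K, sub_dim) and integer codes of
    shape (M, d_out), where M = num_subspaces and K = num_codewords.
    """
    d_in, d_out = W.shape
    assert d_in % num_subspaces == 0, "d_in must be divisible by num_subspaces"
    sub_dim = d_in // num_subspaces
    codebooks, codes = [], []
    for m in range(num_subspaces):
        # One sub-vector per output unit in this subspace: (d_out, sub_dim).
        block = W[m * sub_dim:(m + 1) * sub_dim, :].T
        centroids, assign = kmeans(block, num_codewords)
        codebooks.append(centroids)
        codes.append(assign)
    return np.stack(codebooks), np.stack(codes)


def quantized_forward(x, codebooks, codes):
    """Approximate x @ W using only the codebooks and codes."""
    M, K, sub_dim = codebooks.shape
    x_sub = x.reshape(M, sub_dim)
    # Lookup table: table[m, k] = <x_sub[m], codebooks[m, k]>.
    table = np.einsum("md,mkd->mk", x_sub, codebooks)
    # Each output sums one looked-up entry per subspace.
    return table[np.arange(M)[:, None], codes].sum(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((64, 128))
    x = rng.standard_normal(64)
    codebooks, codes = quantize_fc(W, num_subspaces=8, num_codewords=16)
    y_exact = x @ W
    y_approx = quantized_forward(x, codebooks, codes)
    # Response estimation error of this layer for one input.
    print(np.linalg.norm(y_exact - y_approx))
```

In this sketch the codebooks are fit to the weights alone, so k-means minimizes the weight reconstruction error. The printed quantity is the response estimation error, which is what the error-corrected variant would target directly when learning the codebooks.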
Experiment
References:
Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng. Quantized Convolutional Neural Networks for Mobile Devices. CVPR, 2016.