Abstract
In this paper, we propose to train convolutional neural networks (CNNs) with both binarized weights and activations, leading to quantized models specifically for mobile devices with limited power capacity and computation resources. By assuming the same architecture to full-precision networks, previous works on quantizing CNNs seek to preserve the floating-point information using a set of discrete values, which we call value approximation. However, we take a novel "structure approximation" view for quantization--it is very likely that a different architecture may be better for best performance. In particular, we propose a "network decomposition" strategy, named Group-Net, in which we divide the network into groups. In this way, each full-precision group can be effectively reconstructed by aggregating a set of homogeneous binary branches. In addition, we learn effect connections among groups to improve the representational capability. Moreover, the proposed Group-Net shows strong generalization to other tasks. For instance, we extend Group-Net for highly accurate semantic segmentation by embedding rich context into the binary structure. Experiments on both classification and semantic segmentation tasks demonstrate the superior performance of the proposed methods over various popular architectures. In particular, we outperform the previous best binary neural networks in terms of accuracy and huge computation saving.
Original language | English |
---|---|
Title of host publication | Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 |
Editors | Abhinav Gupta, Derek Hoiem, Gang Hua, Zhuowen Tu |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 413-422 |
Number of pages | 10 |
ISBN (Electronic) | 9781728132938 |
ISBN (Print) | 9781728132945 |
DOIs | |
Publication status | Published - 2019 |
Externally published | Yes |
Event | IEEE Conference on Computer Vision and Pattern Recognition 2019 - Long Beach, United States of America Duration: 16 Jun 2019 → 20 Jun 2019 Conference number: 32nd http://cvpr2019.thecvf.com/ https://ieeexplore.ieee.org/xpl/conhome/8938205/proceeding (Proceedings) |
Conference
Conference | IEEE Conference on Computer Vision and Pattern Recognition 2019 |
---|---|
Abbreviated title | CVPR 2019 |
Country/Territory | United States of America |
City | Long Beach |
Period | 16/06/19 → 20/06/19 |
Internet address |
Keywords
- Deep Learning
- Grouping and Shape
- Representation Learning
- Segmentation