Transform Quantization for CNN Compression

Name: Transform Quantization for CNN Compression
Start: 2021-06-28T19:00:00-04:00
End: 2021-06-28T20:00:00-04:00
Location: Virtual: https://events.vtools.ieee.org/m/274143

June 28, 2021 @ 7:00 pm - 8:00 pm

In this work, we compress convolutional neural network (CNN) weights post-training via transform quantization. Previous CNN quantization techniques tend to ignore the joint statistics of weights and activations, producing sub-optimal CNN performance at a given quantization bit-rate, or consider their joint statistics during training only and do not facilitate efficient compression of already trained CNN models. We optimally transform (decorrelate) and quantize the weights post-training using a rate-distortion framework to improve compression at any given quantization bit-rate. Transform quantization unifies quantization and dimensionality reduction (decorrelation) techniques in a single framework to facilitate low bit-rate compression of CNNs and efficient inference in the transform domain. We first introduce a theory of rate and distortion for CNN quantization, and pose optimum quantization as a rate-distortion optimization problem. We then show that this problem can be solved using optimal bit-depth allocation following decorrelation by the optimal End-to-end Learned Transform (ELT) we derive in this paper. Experiments demonstrate that transform quantization advances the state of the art in CNN compression in both retrained and non-retrained quantization scenarios. In particular, we find that transform quantization with retraining is able to compress CNN models such as AlexNet, ResNet and DenseNet to very low bit-rates (1-2 bits).

This talk is based on joint published work with Zhe Wang, David Taubman and Bernd Girod. Preprint is available at https://arxiv.org/abs/2009.01174.

Speaker(s): Dr. Sean I. Young,

Virtual: https://events.vtools.ieee.org/m/274143

Details

Date:: June 28, 2021
Time:: 7:00 pm - 8:00 pm
Website:: https://events.vtools.ieee.org/m/274143

Organizer

: ivan_bajic@ieee_org
Email: ivan_bajic@ieee_org

Venue

: Virtual: https://events.vtools.ieee.org/m/274143
: Virtual: https://events.vtools.ieee.org/m/274143 + Google Map

IEEE Montreal Section

Transform Quantization for CNN Compression

Details

Organizer

Venue