On the impact of the parametrization of deep convolutional neural networks on post-training quantization
Houache, Aujol, Traonmilin
This paper introduces novel theoretical approximation bounds for the output of quantized neural networks, with a focus on convolutional neural networks (CNN). By considering layerwise parametrization and focusing on the quantization of weights, we provide bounds that gain several orders of magnitude compared to state-of-the-art results on classical deep convolutional neural networks such as MobileNetV2 or ResNets. These gains are achieved by improving the behaviour of the approximation bounds with respect to the depth parameter, which has the most impact on the approximation error induced by quantization. To complement our theoretical result, we provide a numerical exploration of our bounds on MobileNetV2 and ResNets.
academic
On the impact of the parametrization of deep convolutional neural networks on post-training quantization
This paper introduces novel theoretical approximation bounds for the output of quantized neural networks, with particular focus on convolutional neural networks (CNNs). By considering layer-wise parametrization and focusing on weight quantization, the authors provide bounds that achieve several orders of magnitude improvement over existing state-of-the-art results on classical deep convolutional neural networks (such as MobileNetV2 or ResNets). These improvements are achieved through enhanced approximation bounds with respect to the depth parameter, which has the most significant impact on approximation errors induced by quantization. To complement the theoretical results, the authors provide numerical exploration on MobileNetV2 and ResNets.
Core Problem: When deploying deep neural networks in resource-constrained environments, quantization techniques introduce performance degradation. Theoretical bounds are needed to quantify this degradation.
Significance:
Growing demand for neural network deployment on mobile devices and embedded systems
Tighter Approximation Bounds: Improves the NL² factor from Gonon et al. to ∑ᴸₗ₌₁Nₗ₋₁, simplifying to NL for constant-width networks
Relaxed Norm Constraints: Allows arbitrary positive values for rₗ (operator norm of layer l), making results applicable to networks with smaller parameter norms
Improved Geometric Mean Term: Replaces maximum parameter norm r with rmean, providing less pessimistic estimates
Convolutional Network Specialization: Provides specialized bounds for convolutional structures, considering only filter size and channel count
Practical Validation: Verifies theoretical improvements on classical pre-trained CNN models, demonstrating several orders of magnitude improvement
Gonon, A., et al. (2023). Approximation speed of quantized vs. unquantized relu neural networks and beyond. IEEE Transactions on Information Theory.
Nagel, M., et al. (2020). Up or down? adaptive rounding for post-training quantization. ICML.
Sandler, M., et al. (2018). Mobilenetv2: Inverted residuals and linear bottlenecks. CVPR.
He, K., et al. (2016). Deep residual learning for image recognition. CVPR.
Summary: This paper achieves important progress in theoretical analysis of neural network quantization. Through more refined layer-wise analysis and geometric mean strategies, it significantly improves existing approximation bounds. While bounds remain relatively conservative, the orders of magnitude improvement and relaxed constraints provide important theoretical value and practical significance.