TL;DR
This paper introduces a meta-learning approach that dynamically generates CNN weights for varying input resolutions, enabling scalable and efficient image classification with improved accuracy and adaptability across different resource settings.
Contribution
The proposed method uses meta learners and privatized Batch Normalization to adapt CNNs to arbitrary input sizes, enhancing flexibility and performance over traditional fixed-resolution models.
Findings
Achieves better accuracy-efficiency trade-off on ImageNet
Enables dynamic resolution adaptation for resource-constrained environments
Outperforms individually trained models across various input scales
Abstract
Convolutional Neural Networks (CNNs) do not have a predictable recognition behavior with respect to the input resolution change. This prevents the feasibility of deployment on different input image resolutions for a specific model. To achieve efficient and flexible image classification at runtime, we employ meta learners to generate convolutional weights of main networks for various input scales and maintain privatized Batch Normalization layers per scale. For improved training performance, we further utilize knowledge distillation on the fly over model predictions based on different input resolutions. The learned meta network could dynamically parameterize main networks to act on input images of arbitrary size with consistently better accuracy compared to individually trained models. Extensive experiments on the ImageNet demonstrate that our method achieves an improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation · Batch Normalization
