TL;DR
This paper introduces a dual complementary CNN approach combined with a memory component to significantly reduce inference energy consumption on devices like Jetson Nano, while maintaining high accuracy.
Contribution
The paper presents a novel dual CNN collaboration method with a memory component to lower energy use during inference, improving efficiency over traditional single CNN models.
Findings
Achieved up to 85.8% energy reduction on Jetson Nano.
Memory component reduces redundant computations for identical inputs.
Maintains high accuracy despite significant energy savings.
Abstract
Energy efficiency of Convolutional Neural Networks (CNNs) has become an important area of research, with various strategies being developed to minimize the power consumption of these models. Previous efforts, including techniques like model pruning, quantization, and hardware optimization, have made significant strides in this direction. However, there remains a need for more effective on device AI solutions that balance energy efficiency with model performance. In this paper, we propose a novel approach to reduce the energy requirements of inference of CNNs. Our methodology employs two small Complementary CNNs that collaborate with each other by covering each other's "weaknesses" in predictions. If the confidence for a prediction of the first CNN is considered low, the second CNN is invoked with the aim of producing a higher confidence prediction. This dual-CNN setup significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
