CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David, Brooks

TL;DR
CAMEL introduces a co-designed AI model and hardware system utilizing embedded DRAM to enable efficient, low-power on-device training with reduced memory and energy consumption, while maintaining high accuracy.
Contribution
It proposes reversible DNN architectures and a CAMEL training engine that together reduce data lifetime, memory usage, and energy consumption for on-device learning.
Findings
2.5x faster training process
2.8x energy savings
Maintains high training accuracy
Abstract
On-device learning allows AI models to adapt to user data, thereby enhancing service quality on edge platforms. However, training AI on resource-limited devices poses significant challenges due to the demanding computing workload and the substantial memory consumption and data access required by deep neural networks (DNNs). To address these issues, we propose utilizing embedded dynamic random-access memory (eDRAM) as the primary storage medium for transient training data. In comparison to static random-access memory (SRAM), eDRAM provides higher storage density and lower leakage power, resulting in reduced access cost and power leakage. Nevertheless, to maintain the integrity of the stored data, periodic power-hungry refresh operations could potentially degrade system performance. To minimize the occurrence of expensive eDRAM refresh operations, it is beneficial to shorten the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Ferroelectric and Negative Capacitance Devices
Methodstravel james
