TL;DR
This paper discusses the challenges in designing hardware for machine learning applications, focusing on energy efficiency, flexibility, and data movement constraints, and explores solutions across architecture, algorithms, circuits, and emerging technologies.
Contribution
It provides a comprehensive overview of hardware design challenges for machine learning and proposes multi-level solutions from architecture to advanced technologies.
Findings
Identifies key hardware constraints in ML applications.
Proposes hardware-friendly algorithms and architectures.
Highlights potential of emerging technologies for ML hardware.
Abstract
Machine learning plays a critical role in extracting meaningful information out of the zetabytes of sensor data collected every day. For some applications, the goal is to analyze and understand the data to identify trends (e.g., surveillance, portable/wearable electronics); in other applications, the goal is to take immediate action based the data (e.g., robotics/drones, self-driving cars, smart Internet of Things). For many of these applications, local embedded processing near the sensor is preferred over the cloud due to privacy or latency concerns, or limitations in the communication bandwidth. However, at the sensor there are often stringent constraints on energy consumption and cost in addition to throughput and accuracy requirements. Furthermore, flexibility is often required such that the processing can be adapted for different applications or environments (e.g., update the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
