AI Tax: The Hidden Cost of AI Data Center Applications
Daniel Richins, Dharmisha Doshi, Matthew Blackmore, Aswathy, Thulaseedharan Nair, Neha Pathapati, Ankit Patel, Brainard Daguman, Daniel, Dobrijalowski, Ramesh Illikkal, Kevin Long, David Zimmerman, Vijay Janapa, Reddi

TL;DR
This paper investigates the hidden costs of deploying AI workloads in edge data centers, revealing that AI acceleration can strain infrastructure and proposing a specialized design to reduce total cost of ownership.
Contribution
It highlights the overlooked infrastructure costs of AI acceleration and demonstrates a purpose-built edge data center design that lowers TCO by 15%.
Findings
AI workloads impose bottlenecks on storage and network bandwidth.
Specialized edge data center design reduces TCO by 15%.
Holistic analysis reveals infrastructure stresses not apparent in isolated AI algorithm evaluations.
Abstract
Artificial intelligence and machine learning are experiencing widespread adoption in industry and academia. This has been driven by rapid advances in the applications and accuracy of AI through increasingly complex algorithms and models; this, in turn, has spurred research into specialized hardware AI accelerators. Given the rapid pace of advances, it is easy to forget that they are often developed and evaluated in a vacuum without considering the full application environment. This paper emphasizes the need for a holistic, end-to-end analysis of AI workloads and reveals the "AI tax." We deploy and characterize Face Recognition in an edge data center. The application is an AI-centric edge video analytics application built using popular open source infrastructure and ML tools. Despite using state-of-the-art AI and ML algorithms, the application relies heavily on pre-and post-processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · IoT and Edge/Fog Computing · Advanced Memory and Neural Computing
