CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization
Irene Wang, Newsha Ardalani, Mostafa Elhoushi, Daniel Jiang, Samuel Hsia, Ekin Sumbul, Divya Mahajan, Carole-Jean Wu, Bilge Acun

TL;DR
This paper presents heirname, a pioneering framework for co-optimizing Transformer models and hardware accelerators with a focus on reducing carbon footprint, balancing sustainability with performance.
Contribution
It introduces the first framework integrating operational and embodied carbon considerations into Transformer and hardware co-design, enabling more sustainable AI development.
Findings
Potential to reduce total carbon emissions by up to 30%
Maintains accuracy and latency while optimizing for carbon efficiency
Demonstrates effectiveness across various Transformer models
Abstract
Machine learning solutions are rapidly adopted to enable a variety of key use cases, from conversational AI assistants to scientific discovery. This growing adoption is expected to increase the associated lifecycle carbon footprint, including both \emph{operational carbon} from training and inference and \emph{embodied carbon} from AI hardware manufacturing. We introduce \ourframework -- the first carbon-aware co-optimization framework for Transformer-based models and hardware accelerators. By integrating both operational and embodied carbon into early-stage design space exploration, \ourframework enables sustainability-driven model architecture and hardware accelerator co-design that reveals fundamentally different trade-offs than latency- or energy-centric approaches. Evaluated across a range of Transformer models, \ourframework consistently demonstrates the potential to reduce total…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGreen IT and Sustainability · Advanced Neural Network Applications · Machine Learning in Materials Science
MethodsContrastive Language-Image Pre-training
