Physical Foundation Models: Fixed hardware implementations of large-scale neural networks
Logan G Wright, Tianyu Wang, Tatsuhiro Onodera, and Peter L. McMahon

TL;DR
The paper proposes Physical Foundation Models (PFMs), hardware implementations of large neural networks realized directly through physical design, promising significant energy efficiency and scalability improvements.
Contribution
It introduces the concept of PFMs, advocating for hardware that embodies neural networks physically, enabling orders-of-magnitude gains in efficiency and scalability over traditional digital hardware.
Findings
PFMs could drastically reduce energy consumption of large models.
Optical and nanoelectronic platforms are promising for implementing PFMs.
Scaling PFMs to trillion-parameter sizes is discussed with potential physical realizations.
Abstract
Foundation models are deep neural networks (such as GPT-5, Gemini~3, and Opus~4) trained on large datasets that can perform diverse downstream tasks -- text and code generation, question answering, summarization, image classification, and so on. The philosophy of foundation models is to put effort into a single, large (-parameter) general-purpose model that can be adapted to many downstream tasks with no or minimal additional training. We argue that the rise of foundation models presents an opportunity for hardware engineers: in contrast to when different models were used for different tasks, it now makes sense to build special-purpose, fixed hardware implementations of neural networks, manufactured and released at the roughly 1-year cadence of major new foundation-model versions. Beyond conventional digital-electronic inference hardware with read-only weight memory, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
