Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

Logan G Wright; Tianyu Wang; Tatsuhiro Onodera; and Peter L. McMahon

arXiv:2604.27911·cs.LG·May 1, 2026

Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

Logan G Wright, Tianyu Wang, Tatsuhiro Onodera, and Peter L. McMahon

PDF

TL;DR

The paper proposes Physical Foundation Models (PFMs), hardware implementations of large neural networks realized directly through physical design, promising significant energy efficiency and scalability improvements.

Contribution

It introduces the concept of PFMs, advocating for hardware that embodies neural networks physically, enabling orders-of-magnitude gains in efficiency and scalability over traditional digital hardware.

Findings

01

PFMs could drastically reduce energy consumption of large models.

02

Optical and nanoelectronic platforms are promising for implementing PFMs.

03

Scaling PFMs to trillion-parameter sizes is discussed with potential physical realizations.

Abstract

Foundation models are deep neural networks (such as GPT-5, Gemini~3, and Opus~4) trained on large datasets that can perform diverse downstream tasks -- text and code generation, question answering, summarization, image classification, and so on. The philosophy of foundation models is to put effort into a single, large ( $\sim 1 0^{12}$ -parameter) general-purpose model that can be adapted to many downstream tasks with no or minimal additional training. We argue that the rise of foundation models presents an opportunity for hardware engineers: in contrast to when different models were used for different tasks, it now makes sense to build special-purpose, fixed hardware implementations of neural networks, manufactured and released at the roughly 1-year cadence of major new foundation-model versions. Beyond conventional digital-electronic inference hardware with read-only weight memory, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.