What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips
Renjie Li, Wenjie Wei, Qi Xin, Xiaoli Liu, Sixuan Mao, Erik Ma, Zijian Chen, Malu Zhang, Haizhou Li, Zhaoyu Zhang

TL;DR
This paper reviews emerging photonic hardware for next-generation AI, highlighting potential for vastly improved efficiency and speed in large language model computing, while discussing current challenges and future research directions.
Contribution
It provides a comprehensive survey of photonic neural networks, neuromorphic devices, and integration strategies for scaling LLMs beyond electronic hardware.
Findings
Photonic systems could surpass electronic processors in throughput and energy efficiency.
Memory and storage breakthroughs are crucial for scaling mega-sized LLMs.
Integrated photonic architectures enable ultrafast matrix operations for AI.
Abstract
Large language models (LLMs) are rapidly pushing the limits of contemporary computing hardware. For example, training GPT-3 has been estimated to consume around 1300 MWh of electricity, and projections suggest future models may require city-scale (gigawatt) power budgets. These demands motivate exploration of computing paradigms beyond conventional von Neumann architectures. This review surveys emerging photonic hardware optimized for next-generation generative AI computing. We discuss integrated photonic neural network architectures (e.g., Mach-Zehnder interferometer meshes, lasers, wavelength-multiplexed microring resonators) that perform ultrafast matrix operations. We also examine promising alternative neuromorphic devices, including spiking neural network circuits and hybrid spintronic-photonic synapses, which combine memory and processing. The integration of two-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Photonic and Optical Devices · Advanced Memory and Neural Computing
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Byte Pair Encoding · Attention Dropout · Softmax · Residual Connection · Linear Layer · Weight Decay · Adam · Multi-Head Attention
