Photonic Fabric Platform for AI Accelerators
Jing Ding, Trung Diep

TL;DR
This paper introduces the Photonic FabricTM and PFA, a photonic-enabled switch and memory system that significantly enhances AI accelerator performance and energy efficiency by enabling scalable, high-bandwidth shared memory for distributed training and inference.
Contribution
The paper presents a novel photonic fabric platform and appliance that overcome fixed memory-to-compute ratios, enabling scalable, high-bandwidth shared memory for AI accelerators.
Findings
Up to 3.66x throughput improvement in LLM inference
Up to 7.04x throughput improvement in large models
60-90% energy savings in data movement
Abstract
This paper presents the Photonic FabricTM and the Photonic Fabric ApplianceTM (PFA), a photonic-enabled switch and memory subsystem that delivers low latency, high bandwidth, and low per-bit energy. By integrating high-bandwidth HBM3E memory, an on-module photonic switch, and external DDR5 in a 2.5D electro-optical system-in-package, the PFA offers up to 32 TB of shared memory alongside 115 Tbps of all-to-all digital switching. The Photonic FabricTM enables distributed AI training and inference to execute parallelism strategies more efficiently. The Photonic Fabric removes the silicon beachfront constraint that limits the fixed memory-to-compute ratio observed in virtually all current XPU accelerator designs. Replacing a local HBM stack on an XPU with a chiplet that connects to the Photonic Fabric increases its memory capacity and correspondingly its memory bandwidth by offering a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotonic and Optical Devices · Neural Networks and Reservoir Computing · Advanced Optical Network Technologies
