NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
A. Lonardo, F. Ameli, R. Ammendola, A. Biagioni, O. Frezza, G. Lamanna, F. Lo Cicero, M. Martinelli, P. S. Paolucci, E. Pastorelli, L. Pontisso, D. Rossetti, F. Simeone, F. Simula, M. Sozzi, L. Tosoratto, P. Vicini

TL;DR
NaNet is an FPGA-based PCIe network interface card designed to provide low-latency, real-time data transfer with direct GPU memory access, supporting multiple network standards and custom links for high-performance computing applications.
Contribution
This paper introduces NaNet, a novel FPGA-based NIC with GPUDirect support, configurable for various network standards, enabling deterministic low-latency data transfer in real-time GPU systems.
Findings
NaNet achieves cycle-accurate, upper-bound latency for supported links.
NaNet supports both standard and custom high-speed links.
Real-world use cases demonstrate its effectiveness in physics experiments.
Abstract
While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic latency when executing computational kernels once data is available in their internal memories, assessing the real-time features of a standard GPGPU system requires careful characterization of all subsystems along the data stream path. The networking subsystem turns out to be the most critical one in terms of both the absolute value and the fluctuations of its response latency. Our envisioned solution to this issue is NaNet, an FPGA-based PCIe Network Interface Card (NIC) design featuring a configurable and extensible set of network channels with direct access through GPUDirect to NVIDIA Fermi/Kepler GPU memories. The NaNet design currently supports both…
Taxonomy
Topics: Embedded Systems Design Techniques · Interconnection Networks and Systems · Distributed and Parallel Computing Systems
