Loading paper
A Switch-Centric In-Network Architecture for Accelerating LLM Inference in Shared-Memory Network | Tomesphere