ORCA: A Network and Architecture Co-design for Offloading μs-scale Datacenter Applications
Yifan Yuan, Jinghan Huang, Yan Sun, Tianchen Wang, Jacob Nelson, Dan R. K. Ports, Yipeng Wang, Ren Wang, Charlie Tai, and Nam Sung Kim

TL;DR
ORCA is a comprehensive network and architecture co-design that leverages RDMA and cache-coherent technologies to significantly reduce latency, increase throughput, and improve power efficiency for datacenter applications.
Contribution
This paper introduces ORCA, a novel co-design integrating hardware and software components to optimize datacenter application performance using current RDMA and cache-coherent interconnects.
Findings
30.1-69.1% lower latency
up to 2.5x higher throughput
3x higher power efficiency
Abstract
Responding to the "datacenter tax" and "killer microseconds" problems for datacenter applications, diverse solutions including Smart NIC-based ones have been proposed. Nonetheless, they often suffer from high overhead of communications over network and/or PCIe links. To tackle the limitations of the current solutions, this paper proposes ORCA, a holistic network and architecture co-design solution that leverages current RDMA and emerging cache-coherent off-chip interconnect technologies. Specifically, ORCA consists of four hardware and software components: (1) unified abstraction of inter- and intra-machine communications managed by one-sided RDMA write and cache-coherent memory write; (2) efficient notification of requests to accelerators assisted by cache coherence; (3) cache-coherent accelerator architecture directly processing requests received by NIC; and (4) adaptive…
Taxonomy
Topics: Cloud Computing and Resource Management · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
