XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads
Tejas Chaudhari, Akarsh J., Tanushree Dewangan, Mukul Lokhande, and Santosh Kumar Vishvakarma

TL;DR
XR-NPE is a novel high-throughput, mixed-precision SIMD neural processing engine optimized for XR perception tasks, supporting ultra-low bit formats and adaptive algorithms to enhance energy efficiency and reduce area and power consumption.
Contribution
It introduces XR-NPE, the first engine supporting multiple low-precision formats with layer adaptive hybrid algorithms, and demonstrates significant improvements over state-of-the-art accelerators in XR workloads.
Findings
Achieves 1.72 GHz frequency and 14 pJ arithmetic intensity at 28nm.
Reduces area by 42% and power by 38% compared to state-of-the-art MACs.
Provides 23% better energy efficiency and 4% higher compute density for VIO workloads.
Abstract
This work proposes XR-NPE, a high-throughput Mixed-precision SIMD Neural Processing Engine, designed for extended reality (XR) perception workloads like visual inertial odometry (VIO), object classification, and eye gaze extraction. XR-NPE is first to support FP4, Posit (4,1), Posit (8,0), and Posit (16,1) formats, with layer adaptive hybrid-algorithmic implementation supporting ultra-low bit precision to significantly reduce memory bandwidth requirements, and accompanied by quantization-aware training for minimal accuracy loss. The proposed Reconfigurable Mantissa Multiplication and Exponent processing Circuitry (RMMEC) reduces dark silicon in the SIMD MAC compute engine, assisted by selective power gating to reduce energy consumption, providing 2.85x improved arithmetic intensity. XR-NPE achieves a maximum operating frequency of 1.72 GHz, area 0.016 mm2 , and arithmetic intensity 14…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Optical Sensing Technologies · Advanced Neural Network Applications
