Overmind NSA: A Unified Neuro-Symbolic Computing Architecture with Approximate Nonlinear Activations and Preemptive Memory Bypass
Weilun Wang, Zirui Wang, Wantong Li

TL;DR
Overmind is a unified neuro-symbolic computing architecture that uses approximate nonlinear functions and memory bypass techniques to improve efficiency and performance in AI applications.
Contribution
It introduces a novel hardware architecture with Padé approximations and preemptive memory bypass, optimizing neuro-symbolic AI deployment with reduced resource usage.
Findings
Achieves 8.1 TOPS/W energy efficiency.
Reaches 410 GOPS throughput for mixed workloads.
Maintains minimal accuracy loss with adaptive nonlinear approximation.
Abstract
Neuro-symbolic AI is gaining traction in domains such as large language models, scientific discovery, and autonomous systems due to its ability to combine perception with structured reasoning. However, its deployment is often constrained by high memory demands, diverse computation patterns, and complex hardware requirements. Existing hardware platforms struggle with large on-chip memory overheads, frequent pipeline stalls, limited I/O bandwidth, and inefficient handling of nonlinear operations. To address these key computational bottlenecks, we propose Overmind, a unified neuro-symbolic architecture with cross-layer optimizations. Overmind tackles these core bottlenecks through Pad\'e approximations for universal nonlinear functions, preemptive memory bypass that eliminates costly on-chip caches, and a complete software stack that optimizes model deployment. By reconfiguring the Pad\'e…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
