ARCHES: Adaptive Real-Time Switching of AI Models for the RAN
Neagin Neasamoni Santhi, Davide Villa, Michele Polese, Salvatore D'Oro, Yunseong Lee, Koichiro Furueda, Tommaso Melodia

TL;DR
ARCHES is a framework that enables real-time switching between AI and conventional models in the RAN to optimize performance and power efficiency based on current network conditions.
Contribution
It introduces a GPU-accelerated, real-time expert switching system with a novel control plane and switching kernel for adaptive RAN signal processing.
Findings
Achieves median throughput gains of 5.32% and 7.23% under different conditions.
Reduces GPU power consumption by 15.8 W (9.6%) when defaulting to traditional methods.
Maintains low control-loop latency of approximately 140 microseconds.
Abstract
Artificial Intelligence (AI) has become a powerful tool for model-free Radio Access Network (RAN) signal processing and optimization. However, designing a single model that generalizes across all radio environments is challenging. Specialized AI models outperform conventional algorithms only under specific conditions, while their higher compute and energy cost makes unconditional execution impractical at the base station. This creates a need for real-time expert switching: dynamically activating the most appropriate AI or conventional expert based on current network conditions. To address this, we propose ARCHES (Adaptive Real-time CUDA Hot-swapping of Experts in the RAN Stack), a framework hosting multiple AI-based and conventional signal processing experts within a GPU-accelerated PHY pipeline, dynamically selecting the most appropriate expert at slot-boundary granularity without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
