ARCHES: Adaptive Real-Time Switching of AI Models for the RAN

Neagin Neasamoni Santhi; Davide Villa; Michele Polese; Salvatore D'Oro; Yunseong Lee; Koichiro Furueda; Tommaso Melodia

arXiv:2604.23397·cs.NI·April 28, 2026

ARCHES: Adaptive Real-Time Switching of AI Models for the RAN

Neagin Neasamoni Santhi, Davide Villa, Michele Polese, Salvatore D'Oro, Yunseong Lee, Koichiro Furueda, Tommaso Melodia

PDF

TL;DR

ARCHES is a framework that enables real-time switching between AI and conventional models in the RAN to optimize performance and power efficiency based on current network conditions.

Contribution

It introduces a GPU-accelerated, real-time expert switching system with a novel control plane and switching kernel for adaptive RAN signal processing.

Findings

01

Achieves median throughput gains of 5.32% and 7.23% under different conditions.

02

Reduces GPU power consumption by 15.8 W (9.6%) when defaulting to traditional methods.

03

Maintains low control-loop latency of approximately 140 microseconds.

Abstract

Artificial Intelligence (AI) has become a powerful tool for model-free Radio Access Network (RAN) signal processing and optimization. However, designing a single model that generalizes across all radio environments is challenging. Specialized AI models outperform conventional algorithms only under specific conditions, while their higher compute and energy cost makes unconditional execution impractical at the base station. This creates a need for real-time expert switching: dynamically activating the most appropriate AI or conventional expert based on current network conditions. To address this, we propose ARCHES (Adaptive Real-time CUDA Hot-swapping of Experts in the RAN Stack), a framework hosting multiple AI-based and conventional signal processing experts within a GPU-accelerated PHY pipeline, dynamically selecting the most appropriate expert at slot-boundary granularity without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.