Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks

Oluwaseyi Giwa; Tobi Awodunmila; Muhammad Ahmed Mohsin; Ahsan Bilal; Muhammad Ali Jamshed

arXiv:2507.10619·cs.LG·March 2, 2026

Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks

Oluwaseyi Giwa, Tobi Awodunmila, Muhammad Ahmed Mohsin, Ahsan Bilal, Muhammad Ali Jamshed

PDF

TL;DR

This paper introduces a meta-learning approach for spectrum allocation in dynamic wireless networks, enabling rapid adaptation and improved performance over traditional DRL methods, with significant gains in throughput and safety.

Contribution

It develops and evaluates three meta-learning architectures for fast, data-efficient spectrum management, demonstrating superior performance and safety compared to standard DRL algorithms.

Findings

01

Meta-learning architectures outperform PPO baseline in throughput.

02

Attention-based meta-learning reduces interference and latency violations.

03

Rapid adaptation improves fairness and resource allocation.

Abstract

The dynamic allocation of spectrum in 5G / 6G networks is critical to efficient resource utilization. However, applying traditional deep reinforcement learning (DRL) is often infeasible due to its immense sample complexity and the safety risks associated with unguided exploration, which can cause severe network interference. To address these challenges, we propose a meta-learning framework that enables agents to learn a robust initial policy and rapidly adapt to new wireless scenarios with minimal data. We implement three meta-learning architectures, model-agnostic meta-learning (MAML), recurrent neural network (RNN), and an attention-enhanced RNN, and evaluate them against a non-meta-learning DRL algorithm, proximal policy optimization (PPO) baseline, in a simulated dynamic integrated access/backhaul (IAB) environment. Our results show a clear performance gap. The attention-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsProximal Policy Optimization