Cost-Aware Learning and Optimization for Opportunistic Spectrum Access
Chao Gan, Ruida Zhou, Jing Yang, Cong Shen

TL;DR
This paper develops cost-aware strategies for opportunistic spectrum access in cognitive radio systems, optimizing the balance between sensing costs, transmission rewards, and uncertainty in channel states.
Contribution
It introduces a recursive double threshold policy for offline optimization and an order-optimal online learning algorithm with regret bounds.
Findings
Optimal offline policy has a recursive double threshold structure.
Online algorithm achieves O(log T) regret, matching lower bounds.
Simulation confirms theoretical results.
Abstract
In this paper, we investigate cost-aware joint learning and optimization for multi-channel opportunistic spectrum access in a cognitive radio system. We investigate a discrete time model where the time axis is partitioned into frames. Each frame consists of a sensing phase, followed by a transmission phase. During the sensing phase, the user is able to sense a subset of channels sequentially before it decides to use one of them in the following transmission phase. We assume the channel states alternate between busy and idle according to independent Bernoulli random processes from frame to frame. To capture the inherent uncertainty in channel sensing, we assume the reward of each transmission when the channel is idle is a random variable. We also associate random costs with sensing and transmission actions. Our objective is to understand how the costs and reward of the actions would…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Radio Networks and Spectrum Sensing · Advanced Bandit Algorithms Research · Advanced MIMO Systems Optimization
