Throughput Maximization for Ambient Backscatter Communication: A Reinforcement Learning Approach
Xiaokang Wen, Suzhi Bi, Xiaohui Lin, Lina Yuan, and Juan Wang

TL;DR
This paper introduces a reinforcement learning approach to optimize mode switching in ambient backscatter communication systems, significantly improving throughput performance under fading channels.
Contribution
It proposes a novel RL-based method for adaptive mode selection in AB communication, handling unknown channel distributions and achieving near-optimal throughput.
Findings
QL method approaches optimal throughput
Outperforms benchmark strategies
Effective in fading channel environments
Abstract
Ambient backscatter (AB) communication is an emerging wireless communication technology that enables wireless devices (WDs) to communicate without requiring active radio transmission. In an AB communication system, a WD switches between communication and energy harvesting modes. The harvested energy is used to power the devices operations, e.g., circuit power consumption and sensing operation. In this paper, we focus on maximizing the throughput performance of AB communication system by adaptively selecting the operating mode under fading channel environment. We model the problem as an infinite-horizon Markov Decision Process (MDP) and accordingly obtain the optimal mode switching policy by the value iteration algorithm given the channel distributions. Meanwhile, when the knowledge of channel distribution is absent, a Q-learning (QL) method is applied to explore a suboptimal strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization · Full-Duplex Wireless Communications
