Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications

Peizheng Li; Xinyi Lin; Adnan Aijaz

arXiv:2602.15640·eess.SP·February 18, 2026

Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications

Peizheng Li, Xinyi Lin, Adnan Aijaz

PDF

Open Access

TL;DR

This paper presents a latency-aware reinforcement learning framework that incorporates human feedback for semantic communication, ensuring timely task execution while maintaining semantic fidelity in RAN systems.

Contribution

It introduces a novel constrained RL approach with human-in-the-loop feedback and latency control for semantic communication in wireless networks.

Findings

01

Successfully meets per-user timing constraints in simulations

02

Outperforms baseline schedulers in reward metrics

03

Stabilizes resource consumption in heterogeneous deadline scenarios

Abstract

Semantic communication promises task-aligned transmission but must reconcile semantic fidelity with stringent latency guarantees in immersive and safety-critical services. This paper introduces a time-constrained human-in-the-loop reinforcement learning (TC-HITL-RL) framework that embeds human feedback, semantic utility, and latency control within a semantic-aware Open radio access network (RAN) architecture. We formulate semantic adaptation driven by human feedback as a constrained Markov decision process (CMDP) whose state captures semantic quality, human preferences, queue slack, and channel dynamics, and solve it via a primal--dual proximal policy optimization algorithm with action shielding and latency-aware reward shaping. The resulting policy preserves PPO-level semantic rewards while tightening the variability of both air-interface and near-real-time RAN intelligent controller…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Cognitive Radio Networks and Spectrum Sensing · Wireless Signal Modulation Classification