Towards Native Intelligence: 6G-LLM Trained with Reinforcement Learning from NDT Feedback
Zhuoran Xiao, Tao Tao, Chenhui Ye, Yunbo Hu, Yijia Feng, Tianyu Jiao, and Liyu Cai

TL;DR
This paper introduces a novel reinforcement learning framework using digital twin feedback to train 6G domain large language models, enhancing network orchestration and adaptability beyond traditional offline methods.
Contribution
The paper proposes RLDTF, a new training paradigm for 6G-LLMs utilizing network digital twins and reinforcement learning for continual self-improvement.
Findings
Significant improvement in orchestration accuracy.
Enhanced solution optimality over state-of-the-art baselines.
Effective use of digital twin feedback for training.
Abstract
Owing to its comprehensive understanding of upper-layer application requirements and the capabilities of practical communication systems, the 6G-LLM (6G domain large language model) offers a promising pathway toward realizing network native intelligence. Serving as the system orchestrator, the 6G-LLM drives a paradigm shift that fundamentally departs from existing rule-based approaches, which primarily rely on modular, experience-driven optimization. By contrast, the 6G-LLM substantially enhances network flexibility and adaptability. Nevertheless, current efforts to construct 6G-LLMs are constrained by their reliance on large-scale, meticulously curated, human-authored corpora, which are impractical to obtain in real-world scenarios. Moreover, purely offline-trained models lack the capacity for continual self-improvement, limiting their ability to adapt to the highly dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced Wireless Communication Technologies · Ferroelectric and Negative Capacitance Devices
