TL;DR
This paper introduces a novel centralized cache management system in wireless networks using a Large Language Model to make cooperative, real-time content replacement decisions based on detailed network state prompts.
Contribution
It presents a new LLM-based approach for cooperative edge caching, trained with supervised fine-tuning and policy optimization, outperforming classical heuristics and approaches.
Findings
Approaches a single-step exhaustive search performance in cache hit rate.
Outperforms classical heuristics by 4.1% in cache hit rate.
Demonstrates robust zero-shot transfer across various network conditions.
Abstract
Cooperative edge caching in overlapping zones couples Base Station (BS) decisions, making content replacement sensitive to spatial topology and temporal reuse. Conventional heuristics suffer from myopia, while Deep Reinforcement Learning relies on brittle numerical representations and needs prohibitive retraining under topological or traffic dynamics. This paper studies a centralized, cooperative multi-BS cache-replacement controller driven by a Large Language Model (LLM) within a deterministic text-to-action loop. At each time slot, the global cache state is rendered into a prompt encapsulating each BS's inventory, deduplicated requests, and multi-scale frequency summaries. The LLM generates one decision line per BS. A strict parser and feasibility checker then either accept the joint action or fall back to an all-BS NoOp action. We align the LLM via two-stage training: Supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
