Advancing Compositional LLM Reasoning with Structured Task Relations in Interactive Multimodal Communications
Xinye Cao, Hongcan Guo, Guoshun Nan, Jiaoyang Cui, Haoting Qian, Yihan Lin, Yilin Peng, Diyang Zhang, Yanzhao Hou, Huici Wu, Xiaofeng Tao, Tony Q.S. Quek

TL;DR
This paper introduces a novel single LLM framework, guided by structured task relations and optimized scheduling, to efficiently perform multiple interactive multimodal applications over wireless networks, reducing resource use and improving adaptability.
Contribution
It proposes ContextLoRA and ContextGear, enabling a single LLM to adapt to diverse IMAs with structured reasoning and resource-efficient training, unlike prior multi-LLM approaches.
Findings
Outperforms existing methods on three benchmarks.
Demonstrates practical applicability on real-world wireless testbed.
Reduces computational and communication costs in resource-constrained environments.
Abstract
Interactive multimodal applications (IMAs), such as route planning in the Internet of Vehicles, enrich users' personalized experiences by integrating various forms of data over wireless networks. Recent advances in large language models (LLMs) utilize mixture-of-experts (MoE) mechanisms to empower multiple IMAs, with each LLM trained individually for a specific task that presents different business workflows. In contrast to existing approaches that rely on multiple LLMs for IMAs, this paper presents a novel paradigm that accomplishes various IMAs using a single compositional LLM over wireless networks. The two primary challenges include 1) guiding a single LLM to adapt to diverse IMA objectives and 2) ensuring the flexibility and efficiency of the LLM in resource-constrained mobile environments. To tackle the first challenge, we propose ContextLoRA, a novel method that guides an LLM to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
