Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Lewei He, Tianyu Shi, Pengran Huang, Bingzhi Chen, Qianglong Chen,, Jiahui Pan

TL;DR
This paper introduces Online Long-context Processing (OLP) for handling unlimited-length documents efficiently and proposes Role Reinforcement Learning (Role-RL) to dynamically assign LLMs to roles, improving performance and reducing costs.
Contribution
It presents a novel OLP paradigm for long-context processing and a Role-RL method for optimal LLM role deployment, addressing complexity and efficiency challenges.
Findings
Achieved 93.2% recall rate on OLP benchmark.
Saved 79.4% of LLM costs.
Demonstrated effectiveness on the OLP-MINI dataset.
Abstract
Large language models (LLMs) with long-context processing are still challenging because of their implementation complexity, training efficiency and data sparsity. To address this issue, a new paradigm named Online Long-context Processing (OLP) is proposed when we process a document of unlimited length, which typically occurs in the information reception and organization of diverse streaming media such as automated news reporting, live e-commerce, and viral short videos. Moreover, a dilemma was often encountered when we tried to select the most suitable LLM from a large number of LLMs amidst explosive growth aiming for outstanding performance, affordable prices, and short response delays. In view of this, we also develop Role Reinforcement Learning (Role-RL) to automatically deploy different LLMs in their respective roles within the OLP pipeline according to their actual performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsService-Oriented Architecture and Web Services · Data Mining Algorithms and Applications · Semantic Web and Ontologies
