Role-RL: Online Long-Context Processing with Role Reinforcement Learning   for Distinct LLMs in Their Optimal Roles

Lewei He; Tianyu Shi; Pengran Huang; Bingzhi Chen; Qianglong Chen,; Jiahui Pan

arXiv:2409.18014·cs.AI·September 27, 2024

Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles

Lewei He, Tianyu Shi, Pengran Huang, Bingzhi Chen, Qianglong Chen,, Jiahui Pan

PDF

Open Access

TL;DR

This paper introduces Online Long-context Processing (OLP) for handling unlimited-length documents efficiently and proposes Role Reinforcement Learning (Role-RL) to dynamically assign LLMs to roles, improving performance and reducing costs.

Contribution

It presents a novel OLP paradigm for long-context processing and a Role-RL method for optimal LLM role deployment, addressing complexity and efficiency challenges.

Findings

01

Achieved 93.2% recall rate on OLP benchmark.

02

Saved 79.4% of LLM costs.

03

Demonstrated effectiveness on the OLP-MINI dataset.

Abstract

Large language models (LLMs) with long-context processing are still challenging because of their implementation complexity, training efficiency and data sparsity. To address this issue, a new paradigm named Online Long-context Processing (OLP) is proposed when we process a document of unlimited length, which typically occurs in the information reception and organization of diverse streaming media such as automated news reporting, live e-commerce, and viral short videos. Moreover, a dilemma was often encountered when we tried to select the most suitable LLM from a large number of LLMs amidst explosive growth aiming for outstanding performance, affordable prices, and short response delays. In view of this, we also develop Role Reinforcement Learning (Role-RL) to automatically deploy different LLMs in their respective roles within the OLP pipeline according to their actual performance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services · Data Mining Algorithms and Applications · Semantic Web and Ontologies