Towards an On-device Agent for Text Rewriting
Yun Zhu, Yinxiao Liu, Felix Stahlberg, Shankar Kumar, Yu-hui Chen, Liangchen Luo, Lei Shu, Renjie Liu, Jindong Chen, Lei Meng

TL;DR
This paper presents a novel instruction tuning and reinforcement learning approach to develop a compact, high-quality on-device text rewriting model that surpasses larger LLMs in performance, while maintaining privacy and efficiency.
Contribution
It introduces a new instruction tuning method, a heuristic reinforcement learning framework, and a cascade approach to enhance on-device text rewriting models for mobile scenarios.
Findings
The on-device model outperforms state-of-the-art LLMs in text rewriting.
The cascade approach improves model performance.
The methods enable high-quality text rewriting without human-labeled data.
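One way to picture the cascade mentioned above is as routing: the on-device model answers first, and a larger server model is consulted only when some signal suggests the local rewrite is not good enough. The sketch below is an illustration under assumptions, not the paper's method. This summary does not say how the escalation decision is made, so the confidence score, the `threshold` parameter, and both model interfaces are hypothetical.

```python
from typing import Callable, Tuple

# Hypothetical interfaces; neither signature comes from the paper.
# The on-device model returns a rewrite plus a confidence score in [0, 1].
OnDeviceModel = Callable[[str], Tuple[str, float]]
# The server model returns only a rewrite.
ServerModel = Callable[[str], str]


def cascade_rewrite(
    text: str,
    on_device: OnDeviceModel,
    server: ServerModel,
    threshold: float = 0.5,  # assumed cutoff, chosen only for illustration
) -> Tuple[str, str]:
    """Return (rewrite, source), where source names the model that answered."""
    rewrite, confidence = on_device(text)
    if confidence >= threshold:
        # The local result is trusted, so the text never leaves the device.
        return rewrite, "on-device"
    # Otherwise escalate to the larger model.
    return server(text), "server"


# Toy stand-ins so the sketch runs without any real model.
def toy_on_device(text: str) -> Tuple[str, float]:
    # Pretend short inputs are easy and long inputs are hard.
    return text.strip().capitalize(), 0.9 if len(text) < 40 else 0.2


def toy_server(text: str) -> str:
    return text.strip().capitalize() + "."
```

In this arrangement only low-confidence inputs are sent to the server, which is the trade-off between privacy, cost, and quality that a cascade is meant to manage.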
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities for text rewriting. Nonetheless, the large sizes of these models make them impractical for on-device inference, which would otherwise allow for enhanced privacy and economical inference. Creating a smaller yet potent language model for text rewriting presents a formidable challenge because it requires balancing the need for a small size with the need to retain the emergent capabilities of the LLM, which requires costly data collection. To address this challenge, we introduce a new instruction tuning approach for building a mobile-centric text rewriting model. Our strategies enable the generation of high-quality training data without any human labeling. In addition, we propose a heuristic reinforcement learning framework that substantially enhances performance without requiring preference data. To further bridge…
Taxonomy
Topics: Topic Modeling · Natural Language Processing Techniques · Ferroelectric and Negative Capacitance Devices
