TL;DR
This paper introduces UNO, a framework that leverages user logs to enhance large language models by distilling feedback, managing data heterogeneity, and filtering noise, leading to improved performance.
Contribution
The paper presents UNO, a novel unified approach for improving LLM systems using user logs, addressing noise and heterogeneity challenges in real-world data.
Findings
UNO outperforms RAG and memory-based baselines in effectiveness.
The framework achieves state-of-the-art efficiency in LLM improvement.
Extensive experiments validate UNO's ability to filter noise and adapt to user feedback.
Abstract
Scaling training data and model parameters has long driven progress in large language models (LLMs), but this paradigm is increasingly constrained by the scarcity of high-quality data and diminishing returns from rising computational costs. As a result, recent work is increasing the focus on continual learning from real-world deployment, where user interaction logs provide a rich source of authentic human feedback and procedural knowledge. However, learning from user logs is challenging due to their unstructured and noisy nature. Vanilla LLM systems often struggle to distinguish useful feedback signals from noisy user behavior, and the disparity between user log collection and model optimization (e.g., the off-policy optimization problem) further strengthens the problem. To this end, we propose UNO (User log-driveN Optimization), a unified framework for improving LLM systems (LLMsys)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
