Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback
Yutao Yang, Jie Zhou, Junsong Li, Qianjun Pan, Bihao Zhan, Qin Chen, Xipeng Qiu, Liang He

TL;DR
This paper presents RiCL, a novel framework for interactive continual learning that effectively learns from real-time, noisy human feedback using LLMs, addressing limitations of traditional static and clean-label assumptions.
Contribution
Introduces RiCL, a reinforcement learning-based framework with noise filtering and preference optimization for dynamic, noisy feedback in continual learning.
Findings
RiCL outperforms existing methods on benchmark datasets with noisy labels.
The framework effectively filters noise and aligns model behavior with human intent.
Experimental results demonstrate robustness to real-world noisy feedback.
Abstract
This paper introduces an interactive continual learning paradigm where AI models dynamically learn new skills from real-time human feedback while retaining prior knowledge. This paradigm distinctively addresses two major limitations of traditional continual learning: (1) dynamic model updates using streaming, real-time human-annotated data, rather than static datasets with fixed labels, and (2) the assumption of clean labels, by explicitly handling the noisy feedback common in real-world interactions. To tackle these problems, we propose RiCL, a Reinforced interactive Continual Learning framework leveraging Large Language Models (LLMs) to learn new skills effectively from dynamic feedback. RiCL incorporates three key components: a temporal consistency-aware purifier to automatically discern clean from noisy samples in data streams; an interaction-aware direct preference optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Adaptive Filtering Techniques · Speech and Audio Processing
MethodsContrastive Learning · ALIGN
