Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback

Yutao Yang; Jie Zhou; Junsong Li; Qianjun Pan; Bihao Zhan; Qin Chen; Xipeng Qiu; Liang He

arXiv:2505.09925·cs.LG·May 16, 2025

Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback

Yutao Yang, Jie Zhou, Junsong Li, Qianjun Pan, Bihao Zhan, Qin Chen, Xipeng Qiu, Liang He

PDF

Open Access

TL;DR

This paper presents RiCL, a novel framework for interactive continual learning that effectively learns from real-time, noisy human feedback using LLMs, addressing limitations of traditional static and clean-label assumptions.

Contribution

Introduces RiCL, a reinforcement learning-based framework with noise filtering and preference optimization for dynamic, noisy feedback in continual learning.

Findings

01

RiCL outperforms existing methods on benchmark datasets with noisy labels.

02

The framework effectively filters noise and aligns model behavior with human intent.

03

Experimental results demonstrate robustness to real-world noisy feedback.

Abstract

This paper introduces an interactive continual learning paradigm where AI models dynamically learn new skills from real-time human feedback while retaining prior knowledge. This paradigm distinctively addresses two major limitations of traditional continual learning: (1) dynamic model updates using streaming, real-time human-annotated data, rather than static datasets with fixed labels, and (2) the assumption of clean labels, by explicitly handling the noisy feedback common in real-world interactions. To tackle these problems, we propose RiCL, a Reinforced interactive Continual Learning framework leveraging Large Language Models (LLMs) to learn new skills effectively from dynamic feedback. RiCL incorporates three key components: a temporal consistency-aware purifier to automatically discern clean from noisy samples in data streams; an interaction-aware direct preference optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Adaptive Filtering Techniques · Speech and Audio Processing

MethodsContrastive Learning · ALIGN