Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

Jianzhe Lin; Zeyu Pan; Yun Zhu; Ruiqi Song; Jining Yang

arXiv:2511.23436·cs.AI·December 1, 2025

Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

Jianzhe Lin, Zeyu Pan, Yun Zhu, Ruiqi Song, Jining Yang

PDF

Open Access

TL;DR

SuperIntelliAgent is a novel framework combining a trainable diffusion model with a frozen language model to enable autonomous, continual learning through self-supervised interaction, dual-scale memory, and preference optimization.

Contribution

It introduces a self-supervised, continual learning framework that integrates dual-scale memory and a verifier-learner setup for scalable intelligence growth.

Findings

01

Learns effectively without manual annotations.

02

Improves performance across multiple benchmarks.

03

Supports lifelong learning with memory replay.

Abstract

We introduce SuperIntelliAgent, an agentic learning framework that couples a trainable small diffusion model (the learner) with a frozen large language model (the verifier) to enable continual intelligence growth through self-supervised interaction. Unlike conventional supervised fine-tuning, SuperIntelliAgent learns autonomously without annotation: the learner generates candidate outputs, the verifier evaluates them through step-by-step reasoning, and their interaction produces chosen/rejected pairs for Direct Preference Optimization (DPO). This converts each input into a pseudo-training signal for continual improvement. The framework integrates dual-scale memory: short-term in-context memory that preserves reasoning traces across refinement cycles, and long-term memory that consolidates acquired knowledge through lightweight on-the-fly fine-tuning. A replay buffer retains samples that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications