TL;DR
SOLAR is an autonomous agent that uses parameter-level meta-learning and reinforcement learning to self-improve and adapt to new tasks in dynamic environments, reducing reliance on manual data curation.
Contribution
It introduces SOLAR, a novel open-ended agent that autonomously discovers adaptation strategies for lifelong learning using a multi-level reinforcement learning framework.
Findings
Outperforms strong baselines on reasoning tasks across domains.
Effectively balances plasticity and stability in continual learning.
Demonstrates autonomous discovery of adaptation strategies.
Abstract
Despite the remarkable success of large language models (LLMs), they still face bottlenecks while deploying in dynamic, real-world settings with primary challenges being concept drift and the high cost of gradient-based adaptation. Traditional fine-tuning (FT) struggles to adapt to non-stationary data streams without resulting in catastrophic for getting or requiring extensive manual data curation. To address these limitations within the streaming and continual learning paradigm, we propose the Self-Optimizing Lifelong Autonomous Reasoner (SOLAR) which is an open-ended autonomous agent that leverages parameter-level meta-learning to self-improve, treating model weights as an environment for exploration. It initiates the process by consolidating a strong prior over common-sense knowledge making it effective for transfer-learning. By utilizing a multi-level reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
