Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning

Hampus {\AA}str\"om; Elin Anna Topp; Jacek Malec

arXiv:2511.04598·cs.LG·November 7, 2025

Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning

Hampus {\AA}str\"om, Elin Anna Topp, Jacek Malec

PDF

Open Access

TL;DR

This paper introduces a goal-conditioning approach that enables reward-free, environment-agnostic autonomous learning, allowing agents to learn to solve tasks by self-selecting goals without environment-specific tuning.

Contribution

The study presents a novel environment-agnostic goal-conditioning method that improves average success rates and stabilizes performance in reward-free autonomous learning.

Findings

01

Average goal success rate improves and stabilizes.

02

Method is independent of underlying off-policy algorithms.

03

Agents can be instructed to seek any environment observations.

Abstract

In this paper we study how transforming regular reinforcement learning environments into goal-conditioned environments can let agents learn to solve tasks autonomously and reward-free. We show that an agent can learn to solve tasks by selecting its own goals in an environment-agnostic way, at training times comparable to externally guided reinforcement learning. Our method is independent of the underlying off-policy learning algorithm. Since our method is environment-agnostic, the agent does not value any goals higher than others, leading to instability in performance for individual goals. However, in our experiments, we show that the average goal success rate improves and stabilizes. An agent trained with this method can be instructed to seek any observations made in the environment, enabling generic training of agents prior to specific use cases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Domain Adaptation and Few-Shot Learning