Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jayden Teoh, Wenjun Li, Pradeep Varakantham

TL;DR
This paper introduces CENIE, a novel framework for quantifying environment novelty in Unsupervised Environment Design, enhancing curriculum effectiveness by combining novelty measurement with existing regret-based methods.
Contribution
It proposes a scalable, domain-agnostic method to measure environment novelty using state-action coverage and Gaussian Mixture Models, improving curriculum design for better generalization.
Findings
Augmenting regret-based UED with CENIE improves performance
CENIE effectively measures environment novelty across benchmarks
Combining regret and novelty enhances exploration and generalization
Abstract
Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student's ability to handle unseen scenarios. Existing UED methods mainly rely on regret, a metric that measures the difference between the agent's optimal and actual performance, to guide curriculum design. Regret-driven methods generate curricula that progressively increase environment complexity for the student but overlook environment novelty -- a critical element for enhancing an agent's generalizability. Measuring environment novelty is especially challenging due to the underspecified nature of environment parameters in UED, and existing approaches face significant limitations. To address this, this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBuilding Energy and Comfort Optimization
