E2CL: Exploration-based Error Correction Learning for Embodied Agents
Hanlin Wang, Chak Tou Leong, Jian Wang, Wenjie Li

TL;DR
E2CL introduces an exploration-driven learning framework for embodied agents, enabling better environmental understanding and self-correction, which outperforms traditional methods in virtual environment tasks.
Contribution
The paper presents a novel exploration-based error correction framework that improves environment alignment and adaptability of embodied agents through exploration and feedback mechanisms.
Findings
E2CL-trained agents outperform baseline methods.
Agents demonstrate superior self-correction capabilities.
E2CL enhances environmental understanding in virtual environments.
Abstract
Language models are exhibiting increasing capability in knowledge utilization and reasoning. However, when applied as agents in embodied environments, they often suffer from misalignment between their intrinsic knowledge and environmental knowledge, leading to infeasible actions. Traditional environment alignment methods, such as supervised learning on expert trajectories and reinforcement learning, encounter limitations in covering environmental knowledge and achieving efficient convergence, respectively. Inspired by human learning, we propose Exploration-based Error Correction Learning (E2CL), a novel framework that leverages exploration-induced errors and environmental feedback to enhance environment alignment for embodied agents. E2CL incorporates teacher-guided and teacher-free explorations to gather environmental feedback and correct erroneous actions. The agent learns to provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
