SIME: Enhancing Policy Self-Improvement with Modal-level Exploration

Yang Jin; Jun Lv; Wenye Yu; Hongjie Fang; Yong-Lu Li; Cewu Lu

arXiv:2505.01396·cs.RO·May 5, 2025

SIME: Enhancing Policy Self-Improvement with Modal-level Exploration

Yang Jin, Jun Lv, Wenye Yu, Hongjie Fang, Yong-Lu Li, Cewu Lu

PDF

Open Access

TL;DR

This paper introduces SIME, a method that improves robotic self-learning by promoting diverse interactions through modal-level exploration and selecting valuable data for training, leading to more robust control strategies.

Contribution

The paper proposes a novel modal-level exploration mechanism combined with data selection to enhance robot self-improvement during interaction-based learning.

Findings

01

Effective self-improvement demonstrated on simulation benchmarks.

02

Successful real-world robot experiments show increased robustness.

03

Method reduces training costs while improving control strategy quality.

Abstract

Self-improvement requires robotic systems to initially learn from human-provided data and then gradually enhance their capabilities through interaction with the environment. This is similar to how humans improve their skills through continuous practice. However, achieving effective self-improvement is challenging, primarily because robots tend to repeat their existing abilities during interactions, often failing to generate new, valuable data for learning. In this paper, we identify the key to successful self-improvement: modal-level exploration and data selection. By incorporating a modal-level exploration mechanism during policy execution, the robot can produce more diverse and multi-modal interactions. At the same time, we select the most valuable trials and high-quality segments from these interactions for learning. We successfully demonstrate effective robot self-improvement on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Business Process Modeling and Analysis