MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou, Yanchao Yang

TL;DR
MaxMI introduces an information-theoretic approach to discover manipulation concepts in unannotated demonstrations, enabling more accurate key state identification and improved robotic manipulation policies without relying on human labels.
Contribution
The paper proposes a novel Maximal Mutual Information criterion and a framework for autonomous manipulation concept discovery, reducing dependence on human annotations and enhancing policy generalization.
Findings
MaxMI accurately identifies key physical states in robotic demonstrations.
The framework improves manipulation policy success rates and generalization.
Key states discovered align well with human perception.
Abstract
We aim to discover manipulation concepts embedded in the unannotated demonstrations, which are recognized as key physical states. The discovered concepts can facilitate training manipulation policies and promote generalization. Current methods relying on multimodal foundation models for deriving key states usually lack accuracy and semantic consistency due to limited multimodal robot data. In contrast, we introduce an information-theoretic criterion to characterize the regularities that signify a set of physical states. We also develop a framework that trains a concept discovery network using this criterion, thus bypassing the dependence on human semantics and alleviating costly human labeling. The proposed criterion is based on the observation that key states, which deserve to be conceptualized, often admit more physical constraints than non-key states. This phenomenon can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
