MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept   Discovery

Pei Zhou; Yanchao Yang

arXiv:2407.15086·cs.RO·July 23, 2024

MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery

Pei Zhou, Yanchao Yang

PDF

Open Access 1 Repo

TL;DR

MaxMI introduces an information-theoretic approach to discover manipulation concepts in unannotated demonstrations, enabling more accurate key state identification and improved robotic manipulation policies without relying on human labels.

Contribution

The paper proposes a novel Maximal Mutual Information criterion and a framework for autonomous manipulation concept discovery, reducing dependence on human annotations and enhancing policy generalization.

Findings

01

MaxMI accurately identifies key physical states in robotic demonstrations.

02

The framework improves manipulation policy success rates and generalization.

03

Key states discovered align well with human perception.

Abstract

We aim to discover manipulation concepts embedded in the unannotated demonstrations, which are recognized as key physical states. The discovered concepts can facilitate training manipulation policies and promote generalization. Current methods relying on multimodal foundation models for deriving key states usually lack accuracy and semantic consistency due to limited multimodal robot data. In contrast, we introduce an information-theoretic criterion to characterize the regularities that signify a set of physical states. We also develop a framework that trains a concept discovery network using this criterion, thus bypassing the dependence on human semantics and alleviating costly human labeling. The proposed criterion is based on the observation that key states, which deserve to be conceptualized, often admit more physical constraints than non-key states. This phenomenon can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PeiZhou26/MaxMI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Natural Language Processing Techniques

MethodsSparse Evolutionary Training