Discovering Robotic Interaction Modes with Discrete Representation Learning
Liquan Wang, Ankit Goyal, Haoping Xu, Animesh Garg

TL;DR
This paper introduces ActAIM2, an unsupervised method for learning discrete interaction modes in robotic manipulation, improving generalization and sampling efficiency without requiring labeled data.
Contribution
It presents a novel unsupervised approach to learn discrete interaction modes for robots, using simulator data and a mode selector with a low-level action predictor.
Findings
High success rate in manipulating articulated objects
Robust sampling of meaningful actions from discrete modes
Enhanced manipulability and generalization over baselines
Abstract
Human actions manipulating articulated objects, such as opening and closing a drawer, can be categorized into multiple modalities we define as interaction modes. Traditional robot learning approaches lack discrete representations of these modes, which are crucial for empirical sampling and grounding. In this paper, we present ActAIM2, which learns a discrete representation of robot manipulation interaction modes in a purely unsupervised fashion, without the use of expert labels or simulator-based privileged information. Utilizing novel data collection methods involving simulator rollouts, ActAIM2 consists of an interaction mode selector and a low-level action predictor. The selector generates discrete representations of potential interaction modes with self-supervision, while the predictor outputs corresponding action trajectories. Our method is validated through its success rate in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Robot Manipulation and Learning · Time Series Analysis and Forecasting
