CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan, Jin Cheng, N\'uria Armengol Urp\'i, Stelian Coros

TL;DR
CAIMAN introduces a reinforcement learning framework that enhances legged robots' ability to perform object pushing by using causal action influence as an intrinsic motivation, leading to sample-efficient learning and successful real-world transfer.
Contribution
The paper proposes CAIMAN, a novel RL approach that leverages causal influence for intrinsic motivation, improving sample efficiency and transferability in loco-manipulation tasks.
Findings
CAIMAN achieves higher sample efficiency in simulation.
The method successfully transfers to real robots without fine-tuning.
It enables effective object pushing in unstructured environments.
Abstract
Enabling legged robots to perform non-prehensile loco-manipulation is crucial for enhancing their versatility. Learning behaviors such as whole-body object pushing often requires sophisticated planning strategies or extensive task-specific reward shaping, especially in unstructured environments. In this work, we present CAIMAN, a practical reinforcement learning framework that encourages the agent to gain control over other entities in the environment. CAIMAN leverages causal action influence as an intrinsic motivation objective, allowing legged robots to efficiently acquire object pushing skills even under sparse task rewards. We employ a hierarchical control strategy, combining a low-level locomotion module with a high-level policy that generates task-relevant velocity commands and is trained to maximize the intrinsic reward. To estimate causal action influence, we learn the dynamics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Time Series Analysis and Forecasting
