CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation

Yuanchen Yuan; Jin Cheng; N\'uria Armengol Urp\'i; Stelian Coros

arXiv:2502.00835·cs.RO·March 3, 2026

CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation

Yuanchen Yuan, Jin Cheng, N\'uria Armengol Urp\'i, Stelian Coros

PDF

Open Access

TL;DR

CAIMAN introduces a reinforcement learning framework that enhances legged robots' ability to perform object pushing by using causal action influence as an intrinsic motivation, leading to sample-efficient learning and successful real-world transfer.

Contribution

The paper proposes CAIMAN, a novel RL approach that leverages causal influence for intrinsic motivation, improving sample efficiency and transferability in loco-manipulation tasks.

Findings

01

CAIMAN achieves higher sample efficiency in simulation.

02

The method successfully transfers to real robots without fine-tuning.

03

It enables effective object pushing in unstructured environments.

Abstract

Enabling legged robots to perform non-prehensile loco-manipulation is crucial for enhancing their versatility. Learning behaviors such as whole-body object pushing often requires sophisticated planning strategies or extensive task-specific reward shaping, especially in unstructured environments. In this work, we present CAIMAN, a practical reinforcement learning framework that encourages the agent to gain control over other entities in the environment. CAIMAN leverages causal action influence as an intrinsic motivation objective, allowing legged robots to efficiently acquire object pushing skills even under sparse task rewards. We employ a hierarchical control strategy, combining a low-level locomotion module with a high-level policy that generates task-relevant velocity commands and is trained to maximize the intrinsic reward. To estimate causal action influence, we learn the dynamics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Time Series Analysis and Forecasting