Disentangling perception and reasoning for improving data efficiency in learning cloth manipulation without demonstrations

Donatien Delehelle; Fei Chen; Darwin Caldwell

arXiv:2601.21713·cs.RO·January 30, 2026

Disentangling perception and reasoning for improving data efficiency in learning cloth manipulation without demonstrations

Donatien Delehelle, Fei Chen, Darwin Caldwell

PDF

Open Access

TL;DR

This paper introduces a modular, efficient reinforcement learning approach for cloth manipulation that reduces model size and training time, enabling effective sim-to-real transfer and outperforming existing methods on benchmark tasks.

Contribution

It presents a novel, scalable RL framework that disentangles perception and reasoning, significantly improving data efficiency and transferability in cloth manipulation tasks.

Findings

01

Reduced model size and training time in simulation.

02

Successful transfer of policies to real-world cloth manipulation.

03

Outperformed baseline methods on SoftGym benchmark.

Abstract

Cloth manipulation is a ubiquitous task in everyday life, but it remains an open challenge for robotics. The difficulties in developing cloth manipulation policies are attributed to the high-dimensional state space, complex dynamics, and high propensity to self-occlusion exhibited by fabrics. As analytical methods have not been able to provide robust and general manipulation policies, reinforcement learning (RL) is considered a promising approach to these problems. However, to address the large state space and complex dynamics, data-based methods usually rely on large models and long training times. The resulting computational cost significantly hampers the development and adoption of these methods. Additionally, due to the challenge of robust state estimation, garment manipulation policies often adopt an end-to-end learning approach with workspace images as input. While this approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Robot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis