Coarse-to-Fine for Sim-to-Real: Sub-Millimetre Precision Across Wide Task Spaces
Eugene Valassakis, Norman Di Palo, Edward Johns

TL;DR
This paper introduces a coarse-to-fine control framework combining classical motion planning and learned vision-based control to achieve sub-millimetre precision and wide task generalization in sim-to-real transfer.
Contribution
The paper presents a novel hybrid control approach that integrates classical motion planning with learned controllers for precise and generalizable sim-to-real transfer.
Findings
Significantly outperforms purely motion planning methods.
Outperforms purely learning-based methods.
Provides insights on sensor modalities and feature representations.
Abstract
In this paper, we study the problem of zero-shot sim-to-real when the task requires both highly precise control with sub-millimetre error tolerance, and wide task space generalisation. Our framework involves a coarse-to-fine controller, where trajectories begin with classical motion planning using ICP-based pose estimation, and transition to a learned end-to-end controller which maps images to actions and is trained in simulation with domain randomisation. In this way, we achieve precise control whilst also generalising the controller across wide task spaces, and keeping the robustness of vision-based, end-to-end control. Real-world experiments on a range of different tasks show that, by exploiting the best of both worlds, our framework significantly outperforms purely motion planning methods, and purely learning-based methods. Furthermore, we answer a range of questions on best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Human Pose and Action Recognition
