End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting

Jamie Hathaway; Alireza Rastegarpanah; Rustam Stolkin

arXiv:2601.20846·cs.RO·January 29, 2026

End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting

Jamie Hathaway, Alireza Rastegarpanah, Rustam Stolkin

PDF

Open Access

TL;DR

This paper introduces a novel neural stylisation-based sim-to-real transfer method for reinforcement learning policies, enabling robots to adapt to real-world tasks like cutting with minimal real data and improved stability.

Contribution

It presents a new approach combining neural style transfer and variational autoencoders for effective sim-to-real policy transfer in contact-rich robotic tasks.

Findings

01

Improved task completion time over baseline methods

02

Enhanced behavioural stability in real-world deployment

03

Robustness to geometric and material variations

Abstract

Whereas reinforcement learning has been applied with success to a range of robotic control problems in complex, uncertain environments, reliance on extensive data - typically sourced from simulation environments - limits real-world deployment due to the domain gap between simulated and physical systems, coupled with limited real-world sample availability. We propose a novel method for sim-to-real transfer of reinforcement learning policies, based on a reinterpretation of neural style transfer from image processing to synthesise novel training data from unpaired unlabelled real world datasets. We employ a variational autoencoder to jointly learn self-supervised feature representations for style transfer and generate weakly paired source-target trajectories to improve physical realism of synthesised trajectories. We demonstrate the application of our approach based on the case study of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis