Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting

Emlyn Williams; Athanasios Polydoros

arXiv:2505.08458·cs.RO·May 14, 2025

Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting

Emlyn Williams, Athanasios Polydoros

PDF

TL;DR

This paper develops a sim-to-real reinforcement learning pipeline for autonomous strawberry harvesting, combining simulation with domain randomization and deep RL to enable effective transfer to real robots.

Contribution

It introduces a comprehensive sim-to-real pipeline using a custom Mujoco environment and a novel RL algorithm for fruit harvesting tasks.

Findings

01

Successful transfer from simulation to real robot

02

Effective domain randomization techniques used

03

Promising performance in real laboratory environment

Abstract

This paper presents a comprehensive sim-to-real pipeline for autonomous strawberry picking from dense clusters using a Franka Panda robot. Our approach leverages a custom Mujoco simulation environment that integrates domain randomization techniques. In this environment, a deep reinforcement learning agent is trained using the dormant ratio minimization algorithm. The proposed pipeline bridges low-level control with high-level perception and decision making, demonstrating promising performance in both simulation and in a real laboratory environment, laying the groundwork for successful transfer to real-world autonomous fruit harvesting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.