Multi-log grasping using reinforcement learning and virtual visual servoing
Erik Wallin, Viktor Wiberg, Martin Servin

TL;DR
This paper presents a reinforcement learning and virtual visual servoing approach for multi-log grasping in simulated environments, achieving high success rates in complex outdoor-like scenarios.
Contribution
It introduces a novel method combining virtual visual servoing with reinforcement learning for multi-log grasping, simplifying domain transfer and reducing computational demands.
Findings
95% success rate in multi-log grasping tasks
Effective separation of image segmentation from crane control
Use of virtual camera and 3D reconstruction for robust control
Abstract
We explore multi-log grasping using reinforcement learning and virtual visual servoing for automated forwarding in a simulated environment. Automation of forest processes is a major challenge, and many techniques regarding robot control pose different challenges due to the unstructured and harsh outdoor environment. Grasping multiple logs involves various problems of dynamics and path planning, where understanding the interaction between the grapple, logs, terrain, and obstacles requires visual information. To address these challenges, we separate image segmentation from crane control and utilise a virtual camera to provide an image stream from reconstructed 3D data. We use Cartesian control to simplify domain transfer to real-world applications. Since log piles are static, visual servoing using a 3D reconstruction of the pile and its surroundings is equivalent to using real camera data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging
