3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints
Yoonkyu Yoo, Donghwi Jung, and Seong-Woo Kim

TL;DR
This paper introduces a reinforcement learning control algorithm for autonomous excavators that independently trains each joint with separate rewards, enabling precise 3D operation without prior physical knowledge.
Contribution
It presents a novel RL-based control method that expands excavator workspace to 3D and trains joints independently, improving adaptability and precision in construction environments.
Findings
Successfully expanded excavator workspace into 3D space.
Achieved independent joint training with separate rewards.
Enhanced control precision without prior physical knowledge.
Abstract
In this paper, we propose a control algorithm based on reinforcement learning, employing independent rewards for each joint to control excavators in a 3D space. The aim of this research is to address the challenges associated with achieving precise control of excavators, which are extensively utilized in construction sites but prove challenging to control with precision due to their hydraulic structures. Traditional methods relied on operator expertise for precise excavator operation, occasionally resulting in safety accidents. Therefore, there have been endeavors to attain precise excavator control through equation-based control algorithms. However, these methods had the limitation of necessitating prior information related to physical values of the excavator, rendering them unsuitable for the diverse range of excavators used in the field. To overcome these limitations, we have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Technology and Control Systems · Manufacturing Process and Optimization
