AGPNet -- Autonomous Grading Policy Network

Chana Ross; Yakov Miron; Yuval Goldfracht; Dotan Di Castro

arXiv:2112.10877·cs.RO·December 22, 2021·1 cites

AGPNet -- Autonomous Grading Policy Network

Chana Ross, Yakov Miron, Yuval Goldfracht, Dotan Di Castro

PDF

Open Access

TL;DR

This paper introduces AGPNet, a hybrid reinforcement learning agent for autonomous dozer grading, which achieves human-level performance and generalizes well to real-world scenarios.

Contribution

It formalizes autonomous grading as a Markov Decision Process and develops a hybrid learning approach outperforming existing methods.

Findings

01

AGPNet reaches human-level performance.

02

The agent outperforms current state-of-the-art methods.

03

AGPNet generalizes to unseen real-world scenarios.

Abstract

In this work, we establish heuristics and learning strategies for the autonomous control of a dozer grading an uneven area studded with sand piles. We formalize the problem as a Markov Decision Process, design a simulation which demonstrates agent-environment interactions and finally compare our simulator to a real dozer prototype. We use methods from reinforcement learning, behavior cloning and contrastive learning to train a hybrid policy. Our trained agent, AGPNet, reaches human-level performance and outperforms current state-of-the-art machine learning methods for the autonomous grading task. In addition, our agent is capable of generalizing from random scenarios to unseen real world problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFlood Risk Assessment and Management · Water Quality Monitoring Technologies

MethodsContrastive Learning