Censored Deep Reinforcement Patrolling with Information Criterion for   Monitoring Large Water Resources using Autonomous Surface Vehicles

Samuel Yanes Luis; Daniel Guti\'errez Reina; Sergio Toral Mar\'in

arXiv:2210.08115·cs.RO·October 18, 2022

Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles

Samuel Yanes Luis, Daniel Guti\'errez Reina, Sergio Toral Mar\'in

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning framework with a novel Q-Censoring mechanism for autonomous surface vehicles to efficiently monitor large water resources, reducing redundancy and improving contamination detection accuracy.

Contribution

It presents a new deep Q-learning algorithm with Q-Censoring for obstacle avoidance and information gathering, outperforming previous coverage strategies in water resource monitoring.

Findings

01

Reduced path redundancy by 3 times using noise-networks.

02

Achieved 13% higher accuracy in contamination modeling.

03

Improved dangerous contamination peak detection by 37%.

Abstract

Monitoring and patrolling large water resources is a major challenge for conservation. The problem of acquiring data of an underlying environment that usually changes within time involves a proper formulation of the information. The use of Autonomous Surface Vehicles equipped with water quality sensor modules can serve as an early-warning system agents for contamination peak-detection, algae blooms monitoring, or oil-spill scenarios. In addition to information gathering, the vehicle must plan routes that are free of obstacles on non-convex maps. This work proposes a framework to obtain a collision-free policy that addresses the patrolling task for static and dynamic scenarios. Using information gain as a measure of the uncertainty reduction over data, it is proposed a Deep Q-Learning algorithm improved by a Q-Censoring mechanism for model-based obstacle avoidance. The obtained results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Optimization and Search Problems · Oil Spill Detection and Mitigation

MethodsQ-Learning