Combining Deep Architectures for Information Gain estimation and Reinforcement Learning for multiagent field exploration
Emanuele Masiero, Vito Trianni, Giuseppe Vizzari, Dimitri Ognibene

TL;DR
This paper presents a deep learning framework combining information gain estimation and reinforcement learning to improve autonomous multi-agent exploration in agricultural environments, emphasizing efficient resource use and scalable strategies.
Contribution
It introduces a novel two-stage deep learning approach with a belief model and POV visibility mask, enhancing exploration efficiency and scalability in multi-agent systems.
Findings
Double-CNN DQN agent outperforms others in larger maps.
POV visibility mask improves policy performance.
Untrained IG-based agent performs surprisingly well.
Abstract
Precision agriculture requires efficient autonomous systems for crop monitoring, where agents must explore large-scale environments while minimizing resource consumption. This work addresses the problem as an active exploration task in a grid environment representing an agricultural field. Each cell may contain targets (e.g., damaged crops) observable from nine predefined points of view (POVs). Agents must infer the number of targets per cell using partial, sequential observations. We propose a two-stage deep learning framework. A pre-trained LSTM serves as a belief model, updating a probabilistic map of the environment and its associated entropy, which defines the expected information gain (IG). This allows agents to prioritize informative regions. A key contribution is the inclusion of a POV visibility mask in the input, preserving the Markov property under partial observability and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Industrial Vision Systems and Defect Detection
MethodsQ-Learning · Dense Connections · Convolution · Tanh Activation · Deep Q-Network · Sigmoid Activation · Long Short-Term Memory
