MATT-Diff: Multimodal Active Target Tracking by Diffusion Policy

Saida Liu; Nikolay Atanasov; Shumon Koga

arXiv:2511.11931·cs.RO·April 23, 2026

MATT-Diff: Multimodal Active Target Tracking by Diffusion Policy

Saida Liu, Nikolay Atanasov, Shumon Koga

PDF

1 Repo

TL;DR

MATT-Diff is a diffusion-based control policy enabling a mobile agent to perform active multi-target tracking with multimodal behaviors, balancing exploration and exploitation without prior target knowledge.

Contribution

The paper introduces a novel diffusion model-based policy for multimodal active target tracking that integrates multiple expert behaviors and handles variable target scenarios.

Findings

01

MATT-Diff outperforms other learning-based methods in tracking accuracy.

02

The policy demonstrates effective multimodal behavior sourcing from multiple expert planners.

03

Evaluation in novel environments confirms superior tracking performance.

Abstract

This paper proposes MATT-Diff: Multimodal Active Target Tracking by Diffusion Policy, a control policy for active multi-target tracking using a mobile agent. The policy enables multiple behavior modes for the agent, including exploration, tracking, and target reacquisition, without prior knowledge of the target numbers, states, or dynamics. Effective target tracking demands balancing exploration for undetected or lost targets with exploitation, i.e., uncertainty reduction, of detected but uncertain ones. We generate a demonstration dataset from three expert planners including frontier-based exploration, an uncertainty-based hybrid planner switching between frontier-based exploration and RRT* tracking, and a time-based hybrid planner switching between exploration and target reacquisition based on target detection time. Our control policy utilizes a vision transformer for egocentric map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CINAPSLab/MATT-Diff
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.