ADM-DP: Adaptive Dynamic Modality Diffusion Policy through Vision-Tactile-Graph Fusion for Multi-Agent Manipulation

Enyi Wang; Wen Fan; Dandan Zhang

arXiv:2602.21622·cs.RO·February 26, 2026

ADM-DP: Adaptive Dynamic Modality Diffusion Policy through Vision-Tactile-Graph Fusion for Multi-Agent Manipulation

Enyi Wang, Wen Fan, Dandan Zhang

PDF

Open Access

TL;DR

This paper introduces ADM-DP, a multi-modal control framework for multi-agent robotic manipulation that adaptively fuses vision, tactile, and graph data to improve coordination, grasp stability, and collision avoidance.

Contribution

The paper presents a novel adaptive fusion mechanism and a decoupled training paradigm for multi-agent manipulation, enhancing robustness and performance across diverse tasks.

Findings

01

Achieves 12-25% performance improvements over baselines.

02

Adaptive modality attention improves task-specific perception.

03

Robust across seven multi-agent manipulation scenarios.

Abstract

Multi-agent robotic manipulation remains challenging due to the combined demands of coordination, grasp stability, and collision avoidance in shared workspaces. To address these challenges, we propose the Adaptive Dynamic Modality Diffusion Policy (ADM-DP), a framework that integrates vision, tactile, and graph-based (multi-agent pose) modalities for coordinated control. ADM-DP introduces four key innovations. First, an enhanced visual encoder merges RGB and point-cloud features via Feature-wise Linear Modulation (FiLM) modulation to enrich perception. Second, a tactile-guided grasping strategy uses Force-Sensitive Resistor (FSR) feedback to detect insufficient contact and trigger corrective grasp refinement, improving grasp stability. Third, a graph-based collision encoder leverages shared tool center point (TCP) positions of multiple agents as structured kinematic context to maintain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI