MARS: Multimodal Active Robotic Sensing for Articulated Characterization

Hongliang Zeng; Ping Zhang; Chengjiong Wu; Jiahua Wang; Tingyu Ye and; Fang Li

arXiv:2407.01191·cs.RO·July 2, 2024

MARS: Multimodal Active Robotic Sensing for Articulated Characterization

Hongliang Zeng, Ping Zhang, Chengjiong Wu, Jiahua Wang, Tingyu Ye and, Fang Li

PDF

Open Access 1 Repo

TL;DR

MARS is a multi-modal active sensing framework that improves articulated object perception for robots by combining RGB and point cloud data with reinforcement learning to optimize viewpoints, outperforming existing methods.

Contribution

The paper introduces MARS, a novel multi-modal fusion and active sensing framework that enhances articulated object characterization in robotic perception.

Findings

01

Outperforms state-of-the-art in joint parameter estimation

02

Reduces errors through active viewpoint optimization

03

Generalizes effectively to real-world objects

Abstract

Precise perception of articulated objects is vital for empowering service robots. Recent studies mainly focus on point cloud, a single-modal approach, often neglecting vital texture and lighting details and assuming ideal conditions like optimal viewpoints, unrepresentative of real-world scenarios. To address these limitations, we introduce MARS, a novel framework for articulated object characterization. It features a multi-modal fusion module utilizing multi-scale RGB features to enhance point cloud features, coupled with reinforcement learning-based active sensing for autonomous optimization of observation viewpoints. In experiments conducted with various articulated object instances from the PartNet-Mobility dataset, our method outperformed current state-of-the-art methods in joint parameter estimation accuracy. Additionally, through active sensing, MARS further reduces errors,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robhlzeng/mars
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning

Methodstravel james · Focus