MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation

Yang Luo; Yan Gong; Yongsheng Gao; Xiaoying Sun; and Jie Zhao

arXiv:2604.20650·cs.CV·April 23, 2026

MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation

Yang Luo, Yan Gong, Yongsheng Gao, Xiaoying Sun, and Jie Zhao

PDF

TL;DR

MAPRPose is a novel two-stage framework for multi-object 6D pose estimation that effectively handles occlusion and noise, achieving state-of-the-art accuracy and significant speed improvements.

Contribution

It introduces mask-aware correspondences for pose proposal and an amodal-driven refinement pipeline with GPU acceleration, advancing robustness and efficiency in pose estimation.

Findings

01

Achieves 76.5% AR on BOP benchmark, outperforming FoundationPose by 3.1%.

02

Provides 43x faster multi-object inference speed.

03

Utilizes a tensorized render-and-compare pipeline for robust refinement.

Abstract

6D object pose estimation in cluttered scenes remains challenging due to severe occlusion and sensor noise. We propose MAPRPose, a two-stage framework that leverages mask-aware correspondences for pose proposal and amodal-driven Region-of-Interest (ROI) prediction for robust refinement. In the Mask-Aware Pose Proposal (MAPP) stage, we lift 2D correspondences into 3D space to establish reliable keypoint matches and generate geometrically consistent pose hypotheses based on correspondence-level scoring, from which the top- $K$ candidates are selected. In the refinement stage, we introduce a tensorized render-and-compare pipeline integrated with an Amodal Mask Prediction and ROI Re-Alignment (AMPR) module. By reconstructing complete object geometry and dynamically adjusting the ROI, AMPR mitigates localization errors and spatial misalignment under heavy occlusion. Furthermore, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.