CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation

Xiao Lin; Yun Peng; Liuyi Wang; Xianyou Zhong; Minghao Zhu; Jingwei Yang; Yi Feng; Chengju Liu; Qijun Chen

arXiv:2502.01312·cs.CV·July 21, 2025

CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation

Xiao Lin, Yun Peng, Liuyi Wang, Xianyou Zhong, Minghao Zhu, Jingwei Yang, Yi Feng, Chengju Liu, Qijun Chen

PDF

Open Access

TL;DR

CleanPose introduces a causal learning and knowledge distillation framework for category-level object pose estimation, effectively reducing bias from confounders and improving generalization to unseen instances.

Contribution

The paper proposes a novel causal inference module with front-door adjustment and a residual-based knowledge distillation method to enhance pose estimation accuracy and robustness.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Effectively reduces spurious correlations caused by confounders.

03

Improves generalization to unseen object instances.

Abstract

Category-level object pose estimation aims to recover the rotation, translation and size of unseen instances within predefined categories. In this task, deep neural network-based methods have demonstrated remarkable performance. However, previous studies show they suffer from spurious correlations raised by "unclean" confounders in models, hindering their performance on novel instances with significant variations. To address this issue, we propose CleanPose, a novel approach integrating causal learning and knowledge distillation to enhance category-level pose estimation. To mitigate the negative effect of unobserved confounders, we develop a causal inference module based on front-door adjustment, which promotes unbiased estimation by reducing potential spurious correlations. Additionally, to further improve generalization ability, we devise a residual-based knowledge distillation method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition