MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer

Zenghao Chai; Chen Tang; Yongkang Wong; Xulei Yang; Mohan Kankanhalli

arXiv:2511.18370·cs.CV·March 26, 2026

MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer

Zenghao Chai, Chen Tang, Yongkang Wong, Xulei Yang, Mohan Kankanhalli

PDF

Open Access

TL;DR

MimiCAT introduces a category-free 3D pose transfer method using a cascade-transformer that learns soft correspondences via semantic keypoints, enabling pose transfer across diverse character types with improved quality.

Contribution

The paper presents MimiCAT, a novel cascade-transformer model that enables flexible many-to-many pose transfer across different character categories using soft correspondence matching.

Findings

01

MimiCAT outperforms prior methods in diverse character pose transfer.

02

The model generalizes well to unseen character types.

03

Extensive experiments validate the effectiveness of the soft correspondence approach.

Abstract

3D pose transfer aims to transfer the pose-style of a source mesh to a target character while preserving both the target's geometry and the source's pose characteristic. Existing methods are largely restricted to characters with similar structures and fail to generalize to category-free settings (e.g., transferring a humanoid's pose to a quadruped). The key challenge lies in the structural and transformation diversity inherent in distinct character types, which often leads to mismatched regions and poor transfer quality. To address these issues, we first construct a million-scale pose dataset across hundreds of distinct characters. We further propose MimiCAT, a cascade-transformer model designed for category-free 3D pose transfer. Instead of relying on strict one-to-one correspondence mappings, MimiCAT leverages semantic keypoint labels to learn a novel soft correspondence that enables…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · 3D Shape Modeling and Analysis · Human Motion and Animation