Alleviating Human-level Shift : A Robust Domain Adaptation Method for Multi-person Pose Estimation
Xixia Xu, Qi Zou, Xue Lin

TL;DR
This paper introduces a novel domain adaptation method for multi-person pose estimation that aligns human topological structures and fine-grained features, improving performance in unlabeled or sparsely labeled target domains.
Contribution
The paper proposes a new domain adaptation framework with modules for human-level topological structure alignment and fine-grained feature adaptation, addressing limitations of existing methods.
Findings
Outperforms existing supervised methods on benchmark datasets.
Effectively handles occluded and extreme poses.
Reduces inter-domain discrepancy through human topology alignment.
Abstract
Human pose estimation has been widely studied with much focus on supervised learning requiring sufficient annotations. However, in real applications, a pretrained pose estimation model usually need be adapted to a novel domain with no labels or sparse labels. Such domain adaptation for 2D pose estimation hasn't been explored. The main reason is that a pose, by nature, has typical topological structure and needs fine-grained features in local keypoints. While existing adaptation methods do not consider topological structure of object-of-interest and they align the whole images coarsely. Therefore, we propose a novel domain adaptation method for multi-person pose estimation to conduct the human-level topological structure alignment and fine-grained feature alignment. Our method consists of three modules: Cross-Attentive Feature Alignment (CAFA), Intra-domain Structure Adaptation (ISA) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsMax Pooling · Sigmoid Activation · Average Pooling · Convolution · Communication--Guide||How Do I Communicate to Expedia? · Graph Convolutional Network
