Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment
Indrajeet Ghosh, Garvit Chugh, Abu Zaher Md Faridee, Nirmalya Roy

TL;DR
This paper introduces $AR, a novel unsupervised domain adaptation framework for wearable human action recognition that combines consistency regularization, temporal ensembling, and conditional distribution alignment to improve cross-domain accuracy.
Contribution
The paper proposes a new joint optimization architecture, $AR, that effectively aligns source and target domains in wearable action recognition using three integrated modules.
Findings
Achieves 4-12% higher macro-F1 scores than state-of-the-art methods.
Improves generalization with limited labeled source data.
Enhances pseudo-label robustness through temporal ensembling.
Abstract
Recent advancements in deep learning-based wearable human action recognition (wHAR) have improved the capture and classification of complex motions, but adoption remains limited due to the lack of expert annotations and domain discrepancies from user variations. Limited annotations hinder the model's ability to generalize to out-of-distribution samples. While data augmentation can improve generalizability, unsupervised augmentation techniques must be applied carefully to avoid introducing noise. Unsupervised domain adaptation (UDA) addresses domain discrepancies by aligning conditional distributions with labeled target samples, but vanilla pseudo-labeling can lead to error propagation. To address these challenges, we propose DAR, a novel joint optimization architecture comprised of three functions: (i) consistency regularizer between augmented samples to improve model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
