Unsupervised Domain Adaptation for Action Recognition via   Self-Ensembling and Conditional Embedding Alignment

Indrajeet Ghosh; Garvit Chugh; Abu Zaher Md Faridee; Nirmalya Roy

arXiv:2410.17489·cs.CV·October 24, 2024

Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment

Indrajeet Ghosh, Garvit Chugh, Abu Zaher Md Faridee, Nirmalya Roy

PDF

Open Access

TL;DR

This paper introduces $AR, a novel unsupervised domain adaptation framework for wearable human action recognition that combines consistency regularization, temporal ensembling, and conditional distribution alignment to improve cross-domain accuracy.

Contribution

The paper proposes a new joint optimization architecture, $AR, that effectively aligns source and target domains in wearable action recognition using three integrated modules.

Findings

01

Achieves 4-12% higher macro-F1 scores than state-of-the-art methods.

02

Improves generalization with limited labeled source data.

03

Enhances pseudo-label robustness through temporal ensembling.

Abstract

Recent advancements in deep learning-based wearable human action recognition (wHAR) have improved the capture and classification of complex motions, but adoption remains limited due to the lack of expert annotations and domain discrepancies from user variations. Limited annotations hinder the model's ability to generalize to out-of-distribution samples. While data augmentation can improve generalizability, unsupervised augmentation techniques must be applied carefully to avoid introducing noise. Unsupervised domain adaptation (UDA) addresses domain discrepancies by aligning conditional distributions with labeled target samples, but vanilla pseudo-labeling can lead to error propagation. To address these challenges, we propose $μ$ DAR, a novel joint optimization architecture comprised of three functions: (i) consistency regularizer between augmented samples to improve model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning