TL;DR
This paper introduces an adversarial bipartite graph learning framework for video domain adaptation, directly modeling source-target interactions to improve generalization across different video domains.
Contribution
It proposes a novel bipartite graph approach that models source-target frame interactions, moving beyond traditional domain-invariant feature learning for better video domain adaptation.
Findings
Outperforms state-of-the-art methods on four video recognition benchmarks.
Effective in both supervised and semi-supervised transfer settings.
Enhances model robustness on challenging domain shifts.
Abstract
Domain adaptation techniques, which focus on adapting models between distributionally different domains, are rarely explored in the video recognition area due to the significant spatial and temporal shifts across the source (i.e. training) and target (i.e. test) domains. As such, recent works on visual domain adaptation which leverage adversarial learning to unify the source and target video representations and strengthen the feature transferability are not highly effective on the videos. To overcome this limitation, in this paper, we learn a domain-agnostic video classifier instead of learning domain-invariant representations, and propose an Adversarial Bipartite Graph (ABG) learning framework which directly models the source-target interactions with a network topology of the bipartite graph. Specifically, the source and target frames are sampled as heterogeneous vertexes while the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
