EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge: Mixed Sequences Prediction
Amirshayan Nasirimajd, Simone Alberto Peirone, Chiara Plizzari,, Barbara Caputo

TL;DR
This paper introduces a novel approach for unsupervised domain adaptation in action recognition, leveraging sequence mixing, pseudo-labeling, language models, and co-occurrence filtering to improve transfer performance on the EPIC-Kitchens-100 dataset.
Contribution
It proposes a new sequence mixing and filtering method for unsupervised domain adaptation in action recognition tasks.
Findings
Achieved 2nd place in verb recognition on the leaderboard.
Achieved 4th place in noun and action recognition.
Demonstrated improved transfer results using sequence and language filtering techniques.
Abstract
This report presents the technical details of our approach for the EPIC-Kitchens-100 Unsupervised Domain Adaptation (UDA) Challenge in Action Recognition. Our approach is based on the idea that the order in which actions are performed is similar between the source and target domains. Based on this, we generate a modified sequence by randomly combining actions from the source and target domains. As only unlabelled target data are available under the UDA setting, we use a standard pseudo-labeling strategy for extracting action labels for the target. We then ask the network to predict the resulting action sequence. This allows to integrate information from both domains during training and to achieve better transfer results on target. Additionally, to better incorporate sequence information, we use a language model to filter unlikely sequences. Lastly, we employed a co-occurrence matrix to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
