Bipartite Mode Matching for Vision Training Set Search from a Hierarchical Data Server
Yue Yao, Ruining Yang, Tom Gedeon

TL;DR
This paper introduces a hierarchical data server and bipartite mode matching algorithm to select training data that better aligns with target domain modes, improving model performance in vision tasks without real-time annotation.
Contribution
The paper proposes a novel hierarchical data server structure combined with bipartite mode matching for optimized training set selection, enhancing domain adaptation in vision models.
Findings
Matched server modes have smaller domain gaps with target data.
Models trained on matched sets outperform baseline methods.
Combining BMM with existing UDA methods yields further accuracy improvements.
Abstract
We explore a situation in which the target domain is accessible, but real-time data annotation is not feasible. Instead, we would like to construct an alternative training set from a large-scale data server so that a competitive model can be obtained. For this problem, because the target domain usually exhibits distinct modes (i.e., semantic clusters representing data distribution), if the training set does not contain these target modes, the model performance would be compromised. While prior existing works improve algorithms iteratively, our research explores the often-overlooked potential of optimizing the structure of the data server. Inspired by the hierarchical nature of web search engines, we introduce a hierarchical data server, together with a bipartite mode matching algorithm (BMM) to align source and target modes. For each target mode, we look in the server data tree for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
