DATA: Domain-Aware and Task-Aware Self-supervised Learning
Qing Chang, Junran Peng, Lingxie Xie, Jiajun Sun, Haoran Yin, Qi Tian,, Zhaoxiang Zhang

TL;DR
This paper introduces DATA, a NAS approach for self-supervised learning that creates domain- and task-aware models capable of adapting to diverse downstream vision tasks without labeled data.
Contribution
The paper proposes a novel NAS method for SSL that trains a supernet and searches for models tailored to specific domains and tasks without explicit metrics.
Findings
Achieves strong performance across various downstream tasks
Enables customization for different data domains and tasks
Demonstrates generalizability across SSL methods
Abstract
The paradigm of training models on massive data without label through self-supervised learning (SSL) and finetuning on many downstream tasks has become a trend recently. However, due to the high training costs and the unconsciousness of downstream usages, most self-supervised learning methods lack the capability to correspond to the diversities of downstream scenarios, as there are various data domains, different vision tasks and latency constraints on models. Neural architecture search (NAS) is one universally acknowledged fashion to conquer the issues above, but applying NAS on SSL seems impossible as there is no label or metric provided for judging model selection. In this paper, we present DATA, a simple yet effective NAS approach specialized for SSL that provides Domain-Aware and Task-Aware pre-training. Specifically, we (i) train a supernet which could be deemed as a set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsDense Connections · Feedforward Network · Random Gaussian Blur · InfoNCE · MoCo v2 · Batch Normalization · Momentum Contrast
