Hierarchically Robust Representation Learning
Qi Qian, Juhua Hu, Hao Li

TL;DR
This paper introduces a hierarchically robust optimization approach to learn deep features that maintain performance across different data distributions, addressing the limitations of empirical risk minimization.
Contribution
It proposes a novel distributionally robust optimization framework with Wasserstein constraints to enhance the generality of deep features for diverse tasks.
Findings
Robust features outperform standard features on various benchmarks
The method improves transferability across different data distributions
Experimental results validate the effectiveness of the proposed approach
Abstract
With the tremendous success of deep learning in visual tasks, the representations extracted from intermediate layers of learned models, that is, deep features, attract much attention of researchers. Previous empirical analysis shows that those features can contain appropriate semantic information. Therefore, with a model trained on a large-scale benchmark data set (e.g., ImageNet), the extracted features can work well on other tasks. In this work, we investigate this phenomenon and demonstrate that deep features can be suboptimal due to the fact that they are learned by minimizing the empirical risk. When the data distribution of the target task is different from that of the benchmark data set, the performance of deep features can degrade. Hence, we propose a hierarchically robust optimization method to learn more generic features. Considering the example-level and concept-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Hierarchically Robust Representation Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
