Hierarchically Robust Representation Learning

Qi Qian; Juhua Hu; Hao Li

arXiv:1911.04047·cs.CV·March 31, 2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

PDF

Open Access 1 Video

TL;DR

This paper introduces a hierarchically robust optimization approach to learn deep features that maintain performance across different data distributions, addressing the limitations of empirical risk minimization.

Contribution

It proposes a novel distributionally robust optimization framework with Wasserstein constraints to enhance the generality of deep features for diverse tasks.

Findings

01

Robust features outperform standard features on various benchmarks

02

The method improves transferability across different data distributions

03

Experimental results validate the effectiveness of the proposed approach

Abstract

With the tremendous success of deep learning in visual tasks, the representations extracted from intermediate layers of learned models, that is, deep features, attract much attention of researchers. Previous empirical analysis shows that those features can contain appropriate semantic information. Therefore, with a model trained on a large-scale benchmark data set (e.g., ImageNet), the extracted features can work well on other tasks. In this work, we investigate this phenomenon and demonstrate that deep features can be suboptimal due to the fact that they are learned by minimizing the empirical risk. When the data distribution of the target task is different from that of the benchmark data set, the performance of deep features can degrade. Hence, we propose a hierarchically robust optimization method to learn more generic features. Considering the example-level and concept-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Hierarchically Robust Representation Learning· youtube

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques