Conditional entropy minimization principle for learning domain invariant   representation features

Thuan Nguyen; Boyang Lyu; Prakash Ishwar; Matthias Scheutz; Shuchin; Aeron

arXiv:2201.10460·cs.LG·July 12, 2022

Conditional entropy minimization principle for learning domain invariant representation features

Thuan Nguyen, Boyang Lyu, Prakash Ishwar, Matthias Scheutz, Shuchin, Aeron

PDF

Open Access 2 Repos

TL;DR

This paper introduces a conditional entropy minimization framework to improve domain invariant feature learning, effectively filtering out spurious features and enhancing generalization in domain generalization tasks.

Contribution

It proposes a novel CEM-based method that better isolates true invariant features, closely relates to the Information Bottleneck framework, and demonstrates improved generalization.

Findings

01

Achieves competitive accuracy on several DG datasets

02

Effectively filters out spurious invariant features

03

Theoretically recovers true invariant features under certain conditions

Abstract

Invariance-principle-based methods such as Invariant Risk Minimization (IRM), have recently emerged as promising approaches for Domain Generalization (DG). Despite promising theory, such approaches fail in common classification tasks due to the mixing of true invariant features and spurious invariant features. To address this, we propose a framework based on the conditional entropy minimization (CEM) principle to filter-out the spurious invariant features leading to a new algorithm with a better generalization capability. We show that our proposed approach is closely related to the well-known Information Bottleneck (IB) framework and prove that under certain assumptions, entropy minimization can exactly recover the true invariant features. Our approach provides competitive classification accuracy compared to recent theoretically-principled state-of-the-art alternatives across several DG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and ELM