Domain Generalization with Small Data

Kecheng Chen; Elena Gal; Hong Yan; and Haoliang Li

arXiv:2402.06150·cs.LG·February 12, 2024·1 cites

Domain Generalization with Small Data

Kecheng Chen, Elena Gal, Hong Yan, and Haoliang Li

PDF

Open Access

TL;DR

This paper introduces a probabilistic domain generalization method that learns domain-invariant representations using a novel probabilistic MMD and contrastive loss, improving performance on medical datasets with limited samples.

Contribution

It proposes a new probabilistic framework for domain generalization that extends MMD and contrastive loss to probabilistic embeddings, addressing small data challenges.

Findings

01

Outperforms state-of-the-art methods on three medical datasets.

02

Effectively captures distributional information with probabilistic embeddings.

03

Enhances domain invariance in low-data scenarios.

Abstract

In this work, we propose to tackle the problem of domain generalization in the context of \textit{insufficient samples}. Instead of extracting latent feature embeddings based on deterministic models, we propose to learn a domain-invariant representation based on the probabilistic framework by mapping each data point into probabilistic embeddings. Specifically, we first extend empirical maximum mean discrepancy (MMD) to a novel probabilistic MMD that can measure the discrepancy between mixture distributions (i.e., source domains) consisting of a series of latent distributions rather than latent points. Moreover, instead of imposing the contrastive semantic alignment (CSA) loss based on pairs of latent points, a novel probabilistic CSA loss encourages positive probabilistic embedding pairs to be closer while pulling other negative ones apart. Benefiting from the learned representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification