Measuring Generalization with Optimal Transport

Ching-Yao Chuang; Youssef Mroueh; Kristjan Greenewald; Antonio Torralba; Stefanie Jegelka

arXiv:2106.03314 · cs.LG · November 9, 2021 · 5 cites

TL;DR

This paper introduces a new theoretical framework for understanding neural network generalization using optimal transport costs, providing bounds that align well with empirical observations on large datasets.

Contribution

It develops margin-based generalization bounds normalized with optimal transport costs, linking feature space structure to generalization performance.

Findings

01. Optimal transport costs generalize variance and capture feature space structure.

02. The bounds accurately predict generalization error on large datasets.

03. Feature concentration and separation are key factors in generalization.

Abstract

Understanding the generalization of deep neural networks is one of the most important tasks in deep learning. Although much progress has been made, theoretical error bounds still often behave disparately from empirical observations. In this work, we develop margin-based generalization bounds, where the margins are normalized with optimal transport costs between independent random subsets sampled from the training distribution. In particular, the optimal transport cost can be interpreted as a generalization of variance which captures the structural properties of the learned feature space. Our bounds robustly predict the generalization error, given training data and network parameters, on large scale datasets. Theoretically, we demonstrate that the concentration and separation of features play crucial roles in generalization, supporting empirical results in the literature. The code is…
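To make the central quantity concrete, the sketch below estimates the optimal transport cost between two random subsets of a feature matrix. It is an illustration under stated assumptions, not the authors' implementation: the function names `ot_cost` and `expected_subset_ot_cost` are hypothetical, disjoint subsets of one sample stand in for independent draws from the training distribution, and the normalization used in the paper's bounds is not reproduced here. For two equal-size, uniformly weighted point sets, an optimal plan can be taken to be a one-to-one matching, so SciPy's `linear_sum_assignment` solves the problem exactly.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def ot_cost(x, y, p=1):
    """OT cost between two equal-size, uniformly weighted point sets.

    With equal sizes and uniform weights the problem reduces to linear
    assignment, which is solved exactly below.
    """
    # Pairwise Euclidean distances raised to the power p, shape (k, k).
    cost = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=-1) ** p
    rows, cols = linear_sum_assignment(cost)
    return cost[rows, cols].mean()


def expected_subset_ot_cost(features, k, n_trials=20, p=1, seed=0):
    """Monte Carlo estimate of the OT cost between two random size-k subsets.

    Assumption: two disjoint subsets of the same sample approximate two
    independent draws from the underlying distribution.
    """
    rng = np.random.default_rng(seed)
    costs = []
    for _ in range(n_trials):
        idx = rng.permutation(len(features))[: 2 * k]
        costs.append(ot_cost(features[idx[:k]], features[idx[k:]], p=p))
    return float(np.mean(costs))


# Synthetic stand-ins for learned features: one concentrated, one spread out.
rng = np.random.default_rng(0)
tight = rng.normal(scale=0.1, size=(400, 8))
spread = rng.normal(scale=1.0, size=(400, 8))
print(expected_subset_ot_cost(tight, k=50))
print(expected_subset_ot_cost(spread, k=50))
```

The concentrated features yield the smaller cost, which matches the abstract's point that feature concentration matters. One way to read the "generalization of variance" claim: with `k=1` and `p=2` the cost is the squared distance between two independent points, whose expectation is twice the total variance of the features.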

Code & Models

Repositories

chingyaoc/kV-Margin · tf · Official

Videos

Measuring Generalization with Optimal Transport · SlidesLive

Taxonomy

Topics

Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Advanced Neural Network Applications