Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased   Full Set Gradient Approximation

Jeffrey Willette; Seanie Lee; Bruno Andreis; Kenji Kawaguchi; Juho; Lee; Sung Ju Hwang

arXiv:2208.12401·cs.LG·June 9, 2023

Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation

Jeffrey Willette, Seanie Lee, Bruno Andreis, Kenji Kawaguchi, Juho, Lee, Sung Ju Hwang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a universal mini-batch consistency framework for set functions that allows scalable, unbiased full set gradient approximation, enabling efficient training on large sets across various applications.

Contribution

The authors propose a new class of set functions called UMBC that maintains mini-batch consistency while supporting arbitrary components, along with an unbiased gradient approximation algorithm with constant memory overhead.

Findings

01

UMBC enables wider function class usage in MBC settings.

02

The proposed algorithm provides unbiased full set gradient estimates.

03

Experiments demonstrate efficiency and effectiveness across diverse tasks.

Abstract

Recent work on mini-batch consistency (MBC) for set functions has brought attention to the need for sequentially processing and aggregating chunks of a partitioned set while guaranteeing the same output for all partitions. However, existing constraints on MBC architectures lead to models with limited expressive power. Additionally, prior work has not addressed how to deal with large sets during training when the full set gradient is required. To address these issues, we propose a Universally MBC (UMBC) class of set functions which can be used in conjunction with arbitrary non-MBC components while still satisfying MBC, enabling a wider range of function classes to be used in MBC settings. Furthermore, we propose an efficient MBC training algorithm which gives an unbiased approximation of the full set gradient and has a constant memory overhead for any set size for both train- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeffwillette/umbc
pytorchOfficial

Videos

Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification

MethodsMonte Carlo Dropout · Dropout