Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation
Jeffrey Willette, Seanie Lee, Bruno Andreis, Kenji Kawaguchi, Juho, Lee, Sung Ju Hwang

TL;DR
This paper introduces a universal mini-batch consistency framework for set functions that allows scalable, unbiased full set gradient approximation, enabling efficient training on large sets across various applications.
Contribution
The authors propose a new class of set functions called UMBC that maintains mini-batch consistency while supporting arbitrary components, along with an unbiased gradient approximation algorithm with constant memory overhead.
Findings
UMBC enables wider function class usage in MBC settings.
The proposed algorithm provides unbiased full set gradient estimates.
Experiments demonstrate efficiency and effectiveness across diverse tasks.
Abstract
Recent work on mini-batch consistency (MBC) for set functions has brought attention to the need for sequentially processing and aggregating chunks of a partitioned set while guaranteeing the same output for all partitions. However, existing constraints on MBC architectures lead to models with limited expressive power. Additionally, prior work has not addressed how to deal with large sets during training when the full set gradient is required. To address these issues, we propose a Universally MBC (UMBC) class of set functions which can be used in conjunction with arbitrary non-MBC components while still satisfying MBC, enabling a wider range of function classes to be used in MBC settings. Furthermore, we propose an efficient MBC training algorithm which gives an unbiased approximation of the full set gradient and has a constant memory overhead for any set size for both train- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification
MethodsMonte Carlo Dropout · Dropout
