Set Based Stochastic Subsampling

Bruno Andreis; Seanie Lee; A. Tuan Nguyen; Juho Lee; Eunho Yang; Sung; Ju Hwang

arXiv:2006.14222·cs.LG·May 31, 2022

Set Based Stochastic Subsampling

Bruno Andreis, Seanie Lee, A. Tuan Nguyen, Juho Lee, Eunho Yang, Sung, Ju Hwang

PDF

Open Access

TL;DR

This paper introduces a set-based neural subsampling method that efficiently reduces data volume for high-dimensional tasks, improving performance and scalability across various applications.

Contribution

A novel two-stage neural subsampling framework that jointly optimizes data reduction with downstream tasks using set encoding and attention mechanisms.

Findings

01

Outperforms baselines at low subsampling rates

02

Improves scalability of Neural Processes

03

Effective in image and function reconstruction

Abstract

Deep models are designed to operate on huge volumes of high dimensional data such as images. In order to reduce the volume of data these models must process, we propose a set-based two-stage end-to-end neural subsampling model that is jointly optimized with an \textit{arbitrary} downstream task network (e.g. classifier). In the first stage, we efficiently subsample \textit{candidate elements} using conditionally independent Bernoulli random variables by capturing coarse grained global information using set encoding functions, followed by conditionally dependent autoregressive subsampling of the candidate elements using Categorical random variables by modeling pair-wise interactions using set attention networks in the second stage. We apply our method to feature and instance selection and show that it outperforms the relevant baselines under low subsampling rates on a variety of tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and Algorithms