# Gaussian Switch Sampling: A Second Order Approach to Active Learning

**Authors:** Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib, Armin Pacharmi,, and Enrique Corona

arXiv: 2302.12018 · 2023-02-24

## TL;DR

This paper introduces Gaussian Switch Sampling, a second-order active learning method based on model 'forgetting' to improve sample selection robustness across various models and distributions.

## Contribution

It proposes a novel importance measure based on second-order representation shifts and develops GauSS, a setup-agnostic sampling strategy for active learning.

## Key findings

- GauSS outperforms four popular query strategies by up to 5%.
- The importance measure remains accurate even with limited training data.
- GauSS is robust across different architectures and distribution types.

## Abstract

In active learning, acquisition functions define informativeness directly on the representation position within the model manifold. However, for most machine learning models (in particular neural networks) this representation is not fixed due to the training pool fluctuations in between active learning rounds. Therefore, several popular strategies are sensitive to experiment parameters (e.g. architecture) and do not consider model robustness to out-of-distribution settings. To alleviate this issue, we propose a grounded second-order definition of information content and sample importance within the context of active learning. Specifically, we define importance by how often a neural network "forgets" a sample during training - artifacts of second order representation shifts. We show that our definition produces highly accurate importance scores even when the model representations are constrained by the lack of training data. Motivated by our analysis, we develop Gaussian Switch Sampling (GauSS). We show that GauSS is setup agnostic and robust to anomalous distributions with exhaustive experiments on three in-distribution benchmarks, three out-of-distribution benchmarks, and three different architectures. We report an improvement of up to 5% when compared against four popular query strategies.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2302.12018/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/2302.12018/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/2302.12018/full.md

---
Source: https://tomesphere.com/paper/2302.12018