Bayesian Active Learning By Distribution Disagreement

Thorben Werner; Lars Schmidt-Thieme

arXiv:2501.01248·cs.LG·January 3, 2025

Bayesian Active Learning By Distribution Disagreement

Thorben Werner, Lars Schmidt-Thieme

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces BALSA, a novel active learning algorithm for regression using normalizing flows, which effectively distinguishes between types of uncertainty and achieves state-of-the-art results across multiple datasets.

Contribution

The paper develops BALSA, an adaptation of BALD for regression with normalizing flows, enhancing uncertainty quantification and active learning performance.

Findings

01

BALSA outperforms existing methods on four datasets.

02

Normalizing flows enable direct uncertainty estimation in regression.

03

Sophisticated algorithms are needed for effective pool-based active learning.

Abstract

Active Learning (AL) for regression has been systematically under-researched due to the increased difficulty of measuring uncertainty in regression models. Since normalizing flows offer a full predictive distribution instead of a point forecast, they facilitate direct usage of known heuristics for AL like Entropy or Least-Confident sampling. However, we show that most of these heuristics do not work well for normalizing flows in pool-based AL and we need more sophisticated algorithms to distinguish between aleatoric and epistemic uncertainty. In this work we propose BALSA, an adaptation of the BALD algorithm, tailored for regression with normalizing flows. With this work we extend current research on uncertainty quantification with normalizing flows \cite{berry2023normalizing, berry2023escaping} to real world data and pool-based AL with multiple acquisition functions and query sizes. We…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 5Confidence 2

Strengths

The paper is original in its focus on developing active learning strategies specifically for regression with normalizing flows, while much of the AL research traditionally focuses on classification tasks. The paper offers a comprehensive benchmark for AL in regression with predictive distributions. The experiments are robust, testing BALSA across four diverse regression datasets and two model architectures. This extensive comparison proves BALSA's effectiveness and generalizability.

Weaknesses

The presentation of this paper could be improved. For example, Figure 1 is presented without reference or explanation in the text, which reduces clarity for the reader. In Pair Comparison, the paper introduces a pairwise approach to approximate Eq. 2, which is one of the core components of BALSA. However, the paper does not adequately explain why this approach is effective, nor does it discuss any potential trade-offs or advantages that led to this specific choice. The paper claims that BA

Reviewer 02Rating 3Confidence 4

Strengths

The authors report strong results on the chosen datasets.

Weaknesses

The writing is very poor and hard to follow. The notation exhibits a lack of rigor and mathematical expressions are not properly introduced. For example, it is unclear what each \theta_i in eq. (1) refers to. It's unclear how these quantities are measured. 'BALD' is also mentioned many times and never properly described. While the authors claim that the datasets were selected to 'provide maximal intersection with other literature for AL with regression', they are quite small by modern standar

Reviewer 03Rating 3Confidence 5

Strengths

1. The paper addresses an under-researched area of the literature. 2. The broader approach to the BALD acquisition criterion is interesting and appears to have strong potential.

Weaknesses

1. Formatting issues: - Ensure paragraphs are indented throughout the document. - Correct the initial quotation marks by using `` for opening quotes in LaTeX. - In Figure 3, the abundance of methods makes the lines difficult to differentiate. Consider moving some of these methods to the appendix to enhance readability. 2. Results presentation: The results lack standard deviation and confidence intervals, making it challenging to fully trust the conclusions. Including these would provid

Code & Models

Repositories

wernerth94/bayesian-active-learning-by-distribution-disagreement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms

MethodsNormalizing Flows