Human-in-the-Loop Mixup

Katherine M. Collins; Umang Bhatt; Weiyang Liu; Vihari Piratla; Ilia; Sucholutsky; Bradley Love; Adrian Weller

arXiv:2211.01202·cs.LG·August 1, 2023·1 cites

Human-in-the-Loop Mixup

Katherine M. Collins, Umang Bhatt, Weiyang Liu, Vihari Piratla, Ilia, Sucholutsky, Bradley Love, Adrian Weller

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether synthetic labels used in mixup data augmentation align with human perception, revealing misalignments and proposing a human-in-the-loop approach to improve model robustness and reliability.

Contribution

It introduces the HILL MixE Suite, a set of elicitation interfaces for collecting human judgments on mixup examples, and provides insights into aligning synthetic data with human perception.

Findings

01

Human perceptions often do not match traditional synthetic labels.

02

Incorporating human uncertainty can enhance model reliability.

03

The H-Mix data hub facilitates further research on human-aligned synthetic data.

Abstract

Aligning model representations to humans has been found to improve robustness and generalization. However, such methods often focus on standard observational data. Synthetic data is proliferating and powering many advances in machine learning; yet, it is not always clear whether synthetic labels are perceptually aligned to humans -- rendering it likely model representations are not human aligned. We focus on the synthetic data used in mixup: a powerful regularizer shown to improve model robustness, generalization, and calibration. We design a comprehensive series of elicitation interfaces, which we release as HILL MixE Suite, and recruit 159 participants to provide perceptual judgments along with their uncertainties, over mixup examples. We find that human perceptions do not consistently align with the labels traditionally used for synthetic points, and begin to demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cambridge-mlg/hill-mixup
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Explainable Artificial Intelligence (XAI) · Time Series Analysis and Forecasting

MethodsMixup · ALIGN