User Modelling for Avoiding Overfitting in Interactive Knowledge   Elicitation for Prediction

Pedram Daee; Tomi Peltola; Aki Vehtari; Samuel Kaski

arXiv:1710.04881·cs.HC·March 12, 2018

User Modelling for Avoiding Overfitting in Interactive Knowledge Elicitation for Prediction

Pedram Daee, Tomi Peltola, Aki Vehtari, Samuel Kaski

PDF

1 Repo

TL;DR

This paper introduces a user modeling approach to prevent overfitting in interactive machine learning by inferring user knowledge, demonstrated through a sentiment analysis task with improved predictive performance.

Contribution

The paper proposes a probabilistic user modeling methodology to mitigate overfitting caused by user interaction in human-in-the-loop machine learning systems.

Findings

01

User modeling improves predictive accuracy in sentiment analysis.

02

The method effectively guards against overfitting caused by noisy user input.

03

Empirical validation with 48 participants supports the approach's effectiveness.

Abstract

In human-in-the-loop machine learning, the user provides information beyond that in the training data. Many algorithms and user interfaces have been designed to optimize and facilitate this human--machine interaction; however, fewer studies have addressed the potential defects the designs can cause. Effective interaction often requires exposing the user to the training data or its statistics. The design of the system is then critical, as this can lead to double use of data and overfitting, if the user reinforces noisy patterns in the data. We propose a user modelling methodology, by assuming simple rational behaviour, to correct the problem. We show, in a user study with 48 participants, that the method improves predictive performance in a sparse linear regression sentiment analysis task, where graded user knowledge on feature relevance is elicited. We believe that the key idea of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HIIT/human-overfitting-in-IML
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.