$\beta^3$-IRT: A New Item Response Model and its Applications

Yu Chen; Telmo Silva Filho; Ricardo B. C. Prud\^encio; Tom; Diethe; Peter Flach

arXiv:1903.04016·stat.ML·June 4, 2019·5 cites

$\beta^3$-IRT: A New Item Response Model and its Applications

Yu Chen, Telmo Silva Filho, Ricardo B. C. Prud\^encio, Tom, Diethe, Peter Flach

PDF

Open Access 1 Repo

TL;DR

The paper introduces the $eta^3$-IRT model, a novel item response theory approach for continuous responses, outperforming standard models and enabling new classifier evaluation metrics.

Contribution

It proposes the $eta^3$-IRT model, extending IRT to continuous responses and applying it to assess machine learning classifiers.

Findings

01

$eta^3$-IRT outperforms 2PL-ND on all datasets.

02

The model generates enriched item characteristic curves.

03

New metric for classifier probability estimate quality.

Abstract

Item Response Theory (IRT) aims to assess latent abilities of respondents based on the correctness of their answers in aptitude test items with different difficulty levels. In this paper, we propose the $β^{3}$ -IRT model, which models continuous responses and can generate a much enriched family of Item Characteristic Curve (ICC). In experiments we applied the proposed model to data from an online exam platform, and show our model outperforms a more standard 2PL-ND model on all datasets. Furthermore, we show how to apply $β^{3}$ -IRT to assess the ability of machine learning classifiers. This novel application results in a new metric for evaluating the quality of the classifier's probability estimates, based on the inferred difficulty and discrimination of data instances.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yc14600/beta3_IRT
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Machine Learning and ELM