Learning Latent Parameters without Human Response Patterns: Item   Response Theory with Artificial Crowds

John P. Lalor; Hao Wu; Hong Yu

arXiv:1908.11421·cs.CL·September 2, 2019·1 cites

Learning Latent Parameters without Human Response Patterns: Item Response Theory with Artificial Crowds

John P. Lalor, Hao Wu, Hong Yu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to learn Item Response Theory models using artificial crowds of neural networks, enabling analysis of NLP model performance without relying on human response data.

Contribution

It proposes a novel approach to train IRT models with DNN-generated response patterns, bypassing the need for human data and facilitating large-scale NLP analysis.

Findings

01

IRT parameters from machine and human RPs are positively correlated

02

Difficulty-based sampling improves training set filtering

03

Machine RPs reveal discrepancies with human expectations

Abstract

Incorporating Item Response Theory (IRT) into NLP tasks can provide valuable information about model performance and behavior. Traditionally, IRT models are learned using human response pattern (RP) data, presenting a significant bottleneck for large data sets like those required for training deep neural networks (DNNs). In this work we propose learning IRT models using RPs generated from artificial crowds of DNN models. We demonstrate the effectiveness of learning IRT models using DNN-generated data through quantitative and qualitative analyses for two NLP tasks. Parameters learned from human and machine RPs for natural language inference and sentiment analysis exhibit medium to large positive correlations. We demonstrate a use-case for latent difficulty item parameters, namely training set filtering, and show that using difficulty to sample training data outperforms baseline methods.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jplalor/py-irt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning