Learning Latent Parameters without Human Response Patterns: Item Response Theory with Artificial Crowds
John P. Lalor, Hao Wu, Hong Yu

TL;DR
This paper introduces a method to learn Item Response Theory models using artificial crowds of neural networks, enabling analysis of NLP model performance without relying on human response data.
Contribution
It proposes a novel approach to train IRT models with DNN-generated response patterns, bypassing the need for human data and facilitating large-scale NLP analysis.
Findings
IRT parameters from machine and human RPs are positively correlated
Difficulty-based sampling improves training set filtering
Machine RPs reveal discrepancies with human expectations
Abstract
Incorporating Item Response Theory (IRT) into NLP tasks can provide valuable information about model performance and behavior. Traditionally, IRT models are learned using human response pattern (RP) data, presenting a significant bottleneck for large data sets like those required for training deep neural networks (DNNs). In this work we propose learning IRT models using RPs generated from artificial crowds of DNN models. We demonstrate the effectiveness of learning IRT models using DNN-generated data through quantitative and qualitative analyses for two NLP tasks. Parameters learned from human and machine RPs for natural language inference and sentiment analysis exhibit medium to large positive correlations. We demonstrate a use-case for latent difficulty item parameters, namely training set filtering, and show that using difficulty to sample training data outperforms baseline methods.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
