Machine Learning Capability: A standardized metric using case difficulty with applications to individualized deployment of supervised machine learning
Adrienne Kline, Joon Lee

TL;DR
This paper introduces Machine Learning Capability (MLC), a novel, standardized metric based on case difficulty using IRT and CAT, enabling efficient benchmarking and comparison of supervised learning models across datasets.
Contribution
The paper presents MLC, a new metric that incorporates case difficulty into model evaluation, improving benchmarking efficiency and outcome independence.
Findings
MLC is less than 1% of dataset size, 22-60x more efficient than traditional metrics.
MLC is unbiased to outcome classification, enabling fair model comparisons.
The method effectively benchmarks datasets using IRT and CAT approaches.
Abstract
Model evaluation is a critical component in supervised machine learning classification analyses. Traditional metrics do not currently incorporate case difficulty. This renders the classification results unbenchmarked for generalization. Item Response Theory (IRT) and Computer Adaptive Testing (CAT) with machine learning can benchmark datasets independent of the end-classification results. This provides high levels of case-level information regarding evaluation utility. To showcase, two datasets were used: 1) health-related and 2) physical science. For the health dataset a two-parameter IRT model, and for the physical science dataset a polytonomous IRT model, was used to analyze predictive features and place each case on a difficulty continuum. A CAT approach was used to ascertain the algorithms' performance and applicability to new data. This method provides an efficient way to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Artificial Intelligence in Healthcare
