Multi-Dimensional Ability Diagnosis for Machine Learning Algorithms
Qi Liu, Zheng Gong, Zhenya Huang, Chuanren Liu, Hengshu Zhu, Zhi Li,, Enhong Chen, Hui Xiong

TL;DR
This paper introduces Camilla, a multi-dimensional evaluation framework inspired by psychometric theories, to better measure and compare machine learning algorithms' multifaceted strengths beyond traditional accuracy metrics.
Contribution
The paper proposes a novel, task-agnostic diagnostic framework that uses neural networks and psychometric principles to assess multiple skills of algorithms simultaneously.
Findings
Camilla outperforms state-of-the-art baselines in reliability and stability.
It captures detailed strengths and weaknesses of algorithms.
The framework effectively quantifies sample difficulty and algorithm skills.
Abstract
Machine learning algorithms have become ubiquitous in a number of applications (e.g. image classification). However, due to the insufficient measurement of traditional metrics (e.g. the coarse-grained Accuracy of each classifier), substantial gaps are usually observed between the real-world performance of these algorithms and their scores in standardized evaluations. In this paper, inspired by the psychometric theories from human measurement, we propose a task-agnostic evaluation framework Camilla, where a multi-dimensional diagnostic metric Ability is defined for collaboratively measuring the multifaceted strength of each machine learning algorithm. Specifically, given the response logs from different algorithms to data samples, we leverage cognitive diagnosis assumptions and neural networks to learn the complex interactions among algorithms, samples and the skills (explicitly or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks · Machine Learning and Data Classification
