Meta Pattern Concern Score: A Novel Evaluation Measure with Human Values for Multi-classifiers
Yanyun Wang, Dehui Du, Yuanhao Liu

TL;DR
This paper introduces the Meta Pattern Concern Score, a new evaluation measure for multi-class classifiers that incorporates human values and can improve model training by balancing safety and accuracy.
Contribution
The paper proposes the Meta Pattern Concern Score, a novel evaluation metric that integrates human safety concerns into classifier evaluation and training.
Findings
Effectively evaluates models considering human safety values.
Reduces dangerous misclassifications with minimal accuracy loss.
Enhances training by dynamically adjusting learning rates.
Abstract
While advanced classifiers have been increasingly used in real-world safety-critical applications, how to properly evaluate the black-box models given specific human values remains a concern in the community. Such human values include punishing error cases of different severity in varying degrees and making compromises in general performance to reduce specific dangerous cases. In this paper, we propose a novel evaluation measure named Meta Pattern Concern Score based on the abstract representation of probabilistic prediction and the adjustable threshold for the concession in prediction confidence, to introduce the human values into multi-classifiers. Technically, we learn from the advantages and disadvantages of two kinds of common metrics, namely the confusion matrix-based evaluation measures and the loss values, so that our measure is effective as them even under general tasks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Risk and Safety Analysis
