Aligning Multiclass Neural Network Classifier Criterion with Task Performance Metrics
Deyuan Li, Taesoo Daniel Lee, Marynel V\'azquez, Nathan Tsoi

TL;DR
This paper introduces EAST, a novel training method for multiclass neural networks that directly optimizes surrogate metrics aligned with evaluation criteria like accuracy or F1-score, improving performance over standard cross-entropy.
Contribution
EAST is the first approach to incorporate dynamic thresholding, soft-set confusion matrices, and an annealing process to align training with specific evaluation metrics.
Findings
EAST improves alignment between training loss and evaluation metrics.
EAST outperforms existing methods on multiple datasets.
Theoretical guarantees show convergence to metric-optimal solutions.
Abstract
Multiclass neural network classifiers are typically trained using cross-entropy loss but evaluated using metrics derived from the confusion matrix, such as Accuracy, -Score, and Matthews Correlation Coefficient. This mismatch between the training objective and evaluation metric can lead to suboptimal performance, particularly when the user's priorities differ from what cross-entropy implicitly optimizes. For example, in the presence of class imbalance, -Score may be preferred over Accuracy. Similarly, given a preference towards precision, the -Score will better reflect this preference than -Score. However, standard cross-entropy loss does not accommodate such a preference. Building on prior work leveraging soft-set confusion matrices and a continuous piecewise-linear Heaviside approximation, we propose Evaluation Aligned Surrogate Training (EAST), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
