Aligning Multiclass Neural Network Classifier Criterion with Task Performance Metrics

Deyuan Li; Taesoo Daniel Lee; Marynel V\'azquez; Nathan Tsoi

arXiv:2405.20954·cs.LG·May 27, 2025

Aligning Multiclass Neural Network Classifier Criterion with Task Performance Metrics

Deyuan Li, Taesoo Daniel Lee, Marynel V\'azquez, Nathan Tsoi

PDF

Open Access

TL;DR

This paper introduces EAST, a novel training method for multiclass neural networks that directly optimizes surrogate metrics aligned with evaluation criteria like accuracy or F1-score, improving performance over standard cross-entropy.

Contribution

EAST is the first approach to incorporate dynamic thresholding, soft-set confusion matrices, and an annealing process to align training with specific evaluation metrics.

Findings

01

EAST improves alignment between training loss and evaluation metrics.

02

EAST outperforms existing methods on multiple datasets.

03

Theoretical guarantees show convergence to metric-optimal solutions.

Abstract

Multiclass neural network classifiers are typically trained using cross-entropy loss but evaluated using metrics derived from the confusion matrix, such as Accuracy, $F_{β}$ -Score, and Matthews Correlation Coefficient. This mismatch between the training objective and evaluation metric can lead to suboptimal performance, particularly when the user's priorities differ from what cross-entropy implicitly optimizes. For example, in the presence of class imbalance, $F_{1}$ -Score may be preferred over Accuracy. Similarly, given a preference towards precision, the $F_{β = 0.25}$ -Score will better reflect this preference than $F_{1}$ -Score. However, standard cross-entropy loss does not accommodate such a preference. Building on prior work leveraging soft-set confusion matrices and a continuous piecewise-linear Heaviside approximation, we propose Evaluation Aligned Surrogate Training (EAST), a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications