ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space

Shim Soon Yong

arXiv:2507.10638·cs.LG·August 12, 2025

ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space

Shim Soon Yong

PDF

Open Access 1 Repo

TL;DR

ZClassifier introduces a probabilistic approach to classification by modeling logits as Gaussian distributions, improving robustness and calibration through KL divergence minimization, and unifying uncertainty and latent control.

Contribution

It presents a novel probabilistic classification framework that replaces deterministic logits with Gaussian distributions, addressing temperature scaling and manifold approximation simultaneously.

Findings

01

Improves robustness over softmax classifiers.

02

Enhances calibration and latent separation.

03

Shows consistent benefits across datasets.

Abstract

We introduce a novel classification framework, ZClassifier, that replaces conventional deterministic logits with diagonal Gaussian-distributed logits. Our method simultaneously addresses temperature scaling and manifold approximation by minimizing the KL divergence between the predicted Gaussian distributions and a unit isotropic Gaussian. This unifies uncertainty calibration and latent control in a principled probabilistic manner, enabling a natural interpretation of class confidence and geometric consistency. Experiments on CIFAR-10 and CIFAR-100 demonstrate that ZClassifier improves over softmax classifiers in robustness, calibration, and latent separation, with consistent benefits across small-scale and large-scale classification settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ShimSoonYong/ZClassifier
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSoftmax