Harmonic Loss Trains Interpretable AI Models

David D. Baek; Ziming Liu; Riya Tyagi; Max Tegmark

arXiv:2502.01628·cs.LG·July 11, 2025

Harmonic Loss Trains Interpretable AI Models

David D. Baek, Ziming Liu, Riya Tyagi, Max Tegmark

PDF

Open Access 1 Repo

TL;DR

This paper introduces harmonic loss, a novel training method for neural networks that improves interpretability, convergence speed, and data efficiency by replacing softmax and dot product with a scale-invariant approach.

Contribution

The paper proposes harmonic loss, a new loss function that enhances interpretability and convergence in neural models, validated across multiple domains and compared to standard training methods.

Findings

01

Harmonic loss improves model interpretability.

02

Models trained with harmonic loss require less data.

03

Harmonic loss reduces grokking phenomena.

Abstract

In this paper, we introduce harmonic loss as an alternative supervisory signal for training neural networks and large language models (LLMs). Harmonic loss differs from standard cross-entropy loss by (a) replacing the usual SoftMax normalization with a scale-invariant HarMax function and (b) computing logits via Euclidean distance rather than a dot product. Harmonic loss enables improved interpretability and faster convergence, owing to its scale invariance and finite convergence point by design, which can be interpreted as a class center. We first validate the performance of harmonic models across algorithmic, vision, and language datasets. Through extensive experiments, we demonstrate that models trained with harmonic loss perform better than standard models by: (a) enhancing interpretability, (b) requiring less data for generalization, and (c) reducing grokking. Moreover, we compare…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ches-001/audio-segmenter
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling · Energy Load and Power Forecasting

MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Residual Connection · Linear Layer · Dense Connections · Multi-Head Attention · Adam · Softmax · Dropout