Chebyshev Moment Regularization (CMR): Condition-Number Control with Moment Shaping

Jinwoo Baek

arXiv:2510.21772 · cs.LG · October 28, 2025

TL;DR

Chebyshev Moment Regularization (CMR) is a loss term that improves neural network training stability and accuracy by directly controlling layer spectra: a log-condition proxy constrains the spectral edges, and Chebyshev moments shape the interior of the spectrum.

Contribution

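The paper introduces CMR, an architecture-agnostic regularization method that directly optimizes layer spectra and condition numbers. It proves strictly monotone descent for the condition proxy, bounded moment gradients, and orthogonal invariance, and it reports empirical gains in an adversarial κ-stress setting.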

Findings

01

Reduces mean layer condition numbers by a factor of ~1000 (from ≈3.9 × 10³ to ≈3.4) within 5 epochs, compared to vanilla training

02

Restores test accuracy from ~10% to ~86% on MNIST (15-layer MLP) in the adversarial κ-stress setting

03

Increases average gradient magnitude during training, compared to vanilla training

Abstract

We introduce Chebyshev Moment Regularization (CMR), a simple, architecture-agnostic loss that directly optimizes layer spectra. CMR jointly controls spectral edges via a log-condition proxy and shapes the interior via Chebyshev moments, with a decoupled, capped mixing rule that preserves task gradients. We prove strictly monotone descent for the condition proxy, bounded moment gradients, and orthogonal invariance. In an adversarial "$\kappa$-stress" setting (MNIST, 15-layer MLP), compared to vanilla training, CMR reduces mean layer condition numbers by $\sim 10^{3}$ (from $\approx 3.9 \times 10^{3}$ to $\approx 3.4$ in 5 epochs), increases average gradient magnitude, and restores test accuracy ($\approx 10\% \to \approx 86\%$). These results support optimization-driven spectral preconditioning: directly steering models toward well-conditioned regimes for…
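
As a rough illustration of the two ingredients named in the abstract, the following Python sketch computes a log-condition proxy and Chebyshev moments from a layer's singular values. It is a sketch under stated assumptions, not the paper's implementation: the function names, the `eps` guard, the affine mapping of the spectrum onto $[-1, 1]$, and the moment order are all hypothetical, and the abstract does not give the exact form of the loss or of the mixing rule. In practice the singular values might come from a call such as `numpy.linalg.svd(W, compute_uv=False)`.

```python
import math


def log_condition_proxy(singular_values, eps=1e-12):
    """Log of the condition number: log(sigma_max) - log(sigma_min).

    `eps` is a hypothetical guard against log(0); it is not from the paper.
    """
    if not singular_values:
        raise ValueError("need at least one singular value")
    s_max = max(singular_values)
    s_min = min(singular_values)
    return math.log(s_max + eps) - math.log(s_min + eps)


def chebyshev_moments(singular_values, order=4):
    """Empirical Chebyshev moments m_k = mean_i T_k(x_i) for k = 1..order.

    Assumption: the spectrum is mapped affinely onto [-1, 1] using its own
    minimum and maximum. The paper may use a different normalization.
    """
    if not singular_values:
        raise ValueError("need at least one singular value")
    n = len(singular_values)
    s_min, s_max = min(singular_values), max(singular_values)
    width = s_max - s_min
    if width == 0.0:
        xs = [0.0] * n  # degenerate spectrum: place every point at the center
    else:
        xs = [(2.0 * s - s_max - s_min) / width for s in singular_values]

    # Three-term recurrence: T_0 = 1, T_1 = x, T_{k+1} = 2 x T_k - T_{k-1}.
    t_prev = [1.0] * n
    t_curr = list(xs)
    moments = []
    for _ in range(order):
        moments.append(sum(t_curr) / n)
        t_prev, t_curr = t_curr, [
            2.0 * x * tc - tp for x, tc, tp in zip(xs, t_curr, t_prev)
        ]
    return moments


# Toy spectra whose condition numbers match the two values quoted in the
# abstract (about 3.9e3 and about 3.4); the spectra themselves are made up.
ill_conditioned = [3900.0, 50.0, 1.0]
well_conditioned = [3.4, 2.0, 1.0]
print(log_condition_proxy(ill_conditioned), chebyshev_moments(ill_conditioned))
print(log_condition_proxy(well_conditioned), chebyshev_moments(well_conditioned))
```

Both quantities depend only on the singular values, so they are unchanged when the weight matrix is multiplied on either side by an orthogonal matrix, which is consistent with the orthogonal invariance stated in the abstract.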
