A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning

Ming Lei; Shufan Wu; Christophe Baehr

arXiv:2603.12304·cs.LG·March 16, 2026

A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning

Ming Lei, Shufan Wu, Christophe Baehr

PDF

Open Access

TL;DR

This paper proposes a new optimization framework for deep learning that integrates the MDL principle into training via a geometrically-grounded manifold and Ricci flow, leading to better generalization and model simplification.

Contribution

It introduces a novel MDL-driven optimization method based on a coupled Ricci flow on a cognitive manifold, with theoretical guarantees and practical algorithms.

Findings

01

Monotonically decreasing description length during training

02

Finite topological phase transitions in the model evolution

03

Empirical results show improved generalization and model compression

Abstract

This paper introduces a novel optimization framework that fundamentally integrates the Minimum Description Length (MDL) principle into the training dynamics of deep neural networks. Moving beyond its conventional role as a model selection criterion, we reformulate MDL as an active, adaptive driving force within the optimization process itself. The core of our method is a geometrically-grounded cognitive manifold whose evolution is governed by a \textit{coupled Ricci flow}, enriched with a novel \textit{MDL Drive} term derived from first principles. This drive, modulated by the task-loss gradient, creates a seamless harmony between data fidelity and model simplification, actively compressing the internal representation during training. We establish a comprehensive theoretical foundation, proving key properties including the monotonic decrease of description length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning