A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning
Ming Lei, Shufan Wu, Christophe Baehr

TL;DR
This paper proposes a new optimization framework for deep learning that integrates the MDL principle into training via a geometrically-grounded manifold and Ricci flow, leading to better generalization and model simplification.
Contribution
It introduces a novel MDL-driven optimization method based on a coupled Ricci flow on a cognitive manifold, with theoretical guarantees and practical algorithms.
Findings
Monotonically decreasing description length during training
Finite topological phase transitions in the model evolution
Empirical results show improved generalization and model compression
Abstract
This paper introduces a novel optimization framework that fundamentally integrates the Minimum Description Length (MDL) principle into the training dynamics of deep neural networks. Moving beyond its conventional role as a model selection criterion, we reformulate MDL as an active, adaptive driving force within the optimization process itself. The core of our method is a geometrically-grounded cognitive manifold whose evolution is governed by a \textit{coupled Ricci flow}, enriched with a novel \textit{MDL Drive} term derived from first principles. This drive, modulated by the task-loss gradient, creates a seamless harmony between data fidelity and model simplification, actively compressing the internal representation during training. We establish a comprehensive theoretical foundation, proving key properties including the monotonic decrease of description length…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
