A Deep Generative Approach to Stratified Learning

Randy Martinez; Rong Tang; Lizhen Lin

arXiv:2604.10650·stat.ML·April 14, 2026

A Deep Generative Approach to Stratified Learning

Randy Martinez, Rong Tang, Lizhen Lin

PDF

TL;DR

This paper introduces two deep generative frameworks for modeling and learning distributions on stratified spaces, addressing challenges posed by varying dimensions and singularities.

Contribution

It develops a dimension-aware mixture of VAEs and a diffusion-based method, with theoretical convergence and consistency guarantees for stratified learning.

Findings

01

Convergence rates depend on intrinsic dimensions and smoothness.

02

Algorithms accurately estimate the number and dimensions of strata.

03

Methods outperform existing models in molecular dynamics applications.

Abstract

While the manifold hypothesis is widely adopted in modern machine learning, complex data is often better modeled as stratified spaces -- unions of manifolds (strata) of varying dimensions. Stratified learning is challenging due to varying dimensionality, intersection singularities, and lack of efficient models in learning the underlying distributions. We provide a deep generative approach to stratified learning by developing two generative frameworks for learning distributions on stratified spaces. The first is a sieve maximum likelihood approach realized via a dimension-aware mixture of variational autoencoders. The second is a diffusion-based framework that explores the score field structure of a mixture. We establish the convergence rates for learning both the ambient and intrinsic distributions, which are shown to be dependent on the intrinsic dimensions and smoothness of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.