DiffATS: Diffusion in Aligned Tensor Space
Jinhua Lyu, Tianmin Yu, Brian Kim, Lizhuo Zhou, Chanwook Park, Naichen Shi

TL;DR
DiffATS introduces a novel tensor primitive-based diffusion framework that enables efficient, high-quality generative modeling of high-dimensional data without autoencoders.
Contribution
The paper develops data-dependent tensor primitives aligned via Procrustes analysis and proves their homeomorphism properties, enabling diffusion modeling directly in primitive space.
Findings
Achieves 3.9x to 210x data compression across tasks.
Demonstrates strong unconditional and conditional generation performance.
Operates without pretrained autoencoders, simplifying the pipeline.
Abstract
Direct diffusion modeling of high-resolution spatiotemporal fields is computationally challenging. Parameter-efficient primitives address this by representing high-dimensional data with a compact set of parameters. In this paper, we construct data-dependent tensor primitives without pretrained compression autoencoders. Our construction starts from Tucker decomposition, which captures low-rank multilinear structure through a core tensor and mode-wise factors. However, Tucker factors are non-unique: the same tensor can be represented by different rotated factors, which complicates generative modeling. We address this issue with orthogonal Procrustes (OP) alignment. Specifically, we select medoid anchor matrices from the data and align the factor matrices to resolve the gauge ambiguity. This yields matrix Grassmannian primitives and tensor Grassmannian primitives that are compact,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
