When Latent Geometry Is Not Enough: Draft-Conditioned Latent Refinement for Non-Autoregressive Text Generation
De Shuai Zhang

TL;DR
This paper investigates the limitations of latent geometry in non-autoregressive text generation, proposing a draft-conditioned latent refinement model that emphasizes decoder recoverability and structured refinement over pure latent similarity.
Contribution
It introduces a novel draft-conditioned latent refinement approach that highlights the importance of decoder recoverability and structured refinement in latent space for text generation.
Findings
Good latent-space metrics do not guarantee decoding quality.
Full BERT latents outperform compressed latents in token recovery.
Latent geometry alone is insufficient for effective text generation.
Abstract
Continuous diffusion and flow models are attractive for non-autoregressive text generation because they can update all positions in parallel. A major difficulty is the interface between continuous latent states and discrete tokens. This report studies a draft-conditioned latent refinement model built from a frozen BERT encoder, a parallel decoder, a denoising DraftPrior, a local FlowNet, and a learned diagonal MetricNet. Early Gaussian-start experiments showed that good latent-space metrics, such as scale matching or cosine similarity, do not guarantee good decoding. Generated latents can be close to real encoder latents but still produce high-entropy, biased, or repetitive token distributions. We therefore frame the task as controlled local refinement rather than full generation from noise. On ROCStories, using the first two sentences as prompt and the last three as target, full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
