Hidden Biases in Conditioning Autoregressive Models
Francois Pachet, Pierre Roy

TL;DR
This paper investigates the computational complexity of exact inference in autoregressive models, revealing that certain constrained decoding tasks are NP-hard or -hard, highlighting fundamental limitations in neural generation.
Contribution
It formalizes the hardness of exact inference tasks in autoregressive models, providing proofs of NP-hardness and -hardness for constrained decoding and sampling.
Findings
Exact sentence-level MAP decoding is NP-hard.
Exact conditioned normalization is -hard for regular constraints.
Global form constraints make exact decoding computationally intractable.
Abstract
Large language and music models are increasingly used for constrained generation: rhyming lines, fixed meter, inpainting or infilling, positional endings, and other global form requirements. These systems often perform strikingly well, but the induced procedures are usually not exact conditioning of the underlying autoregressive model. This creates a hidden inferential bias, distinct from the better-known notion of bias inherited from the training set: samples are distorted relative to the true constrained distribution, with no generic guarantee of complete coverage of the admissible solution space or of correct conditional probabilities over valid completions. We formalize several exact inference tasks for autoregressive models and prove corresponding hardness results. For succinctly represented autoregressive models whose next-token probabilities are computable in polynomial time,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
