Intent Factored Generation: Unleashing the Diversity in Your Language Model
Eltayeb Ahmed, Uljad Berdica, Martha Elliott, Danijela Horak, Jakob N. Foerster

TL;DR
This paper introduces Intent Factored Generation (IFG), a novel method that enhances diversity in language model outputs by separating intent sampling from response generation, improving exploration and diversity across tasks.
Contribution
The paper proposes a two-stage sampling approach that separates intent and response, enabling higher diversity without sacrificing quality, and demonstrates its effectiveness across multiple tasks.
Findings
Improves diversity in language model outputs.
Enhances reasoning and conversational engagement.
Maintains or improves task performance.
Abstract
Obtaining multiple meaningfully diverse, high quality samples from Large Language Models for a fixed prompt remains an open challenge. Current methods for increasing diversity often only operate at the token-level, paraphrasing the same response. This is problematic because it leads to poor exploration on reasoning problems and to unengaging, repetitive conversational agents. To address this we propose Intent Factored Generation (IFG), factorising the sampling process into two stages. First, we sample a semantically dense intent, e.g., a summary or keywords. Second, we sample the final response conditioning on both the original prompt and the intent from the first stage. This allows us to use a higher temperature during the intent step to promote conceptual diversity, and a lower temperature during the final generation to ensure the outputs are coherent and self-consistent.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSparse Evolutionary Training
