TL;DR
This paper introduces a Bayesian-based method for topical language generation that combines pre-trained transformers with topic modeling to improve control, coherence, and diversity in generated text.
Contribution
It presents a novel Bayesian framework integrating topic probabilities with language models, enabling explicit control over topical properties in generated text.
Findings
Outperforms state-of-the-art in coherence, diversity, and fluency.
Faster decoding compared to existing models.
Provides adjustable control over topical features.
Abstract
Large-scale transformer-based language models (LMs) demonstrate impressive capabilities in open text generation. However, controlling the generated text's properties such as the topic, style, and sentiment is challenging and often requires significant changes to the model architecture or retraining and fine-tuning the model on new supervised data. This paper presents a novel approach for Topical Language Generation (TLG) by combining a pre-trained LM with topic modeling information. We cast the problem using Bayesian probability formulation with topic probabilities as a prior, LM probabilities as the likelihood, and topical language generation probability as the posterior. In learning the model, we derive the topic probability distribution from the user-provided document's natural structure. Furthermore, we extend our model by introducing new parameters and functions to influence the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
