TL;DR
This paper introduces a deep latent variable model for story generation that learns to produce coherent stories by inducing latent anchor word sequences, improving story quality and diversity without relying on supervised plan annotations.
Contribution
The paper presents an unsupervised latent plan induction model for story generation, utilizing variational inference and anchor words to guide story coherence without external plan supervision.
Findings
Human evaluations favor the proposed model over baselines without plans.
Model achieves better perplexity and diversity scores.
Effective control of story content via learned discrete plans.
Abstract
Past work on story generation has demonstrated the usefulness of conditioning on a generation plan to generate coherent stories. However, these approaches have used heuristics or off-the-shelf models to first tag training stories with the desired type of plan, and then train generation models in a supervised fashion. In this paper, we propose a deep latent variable model that first samples a sequence of anchor words, one per sentence in the story, as part of its generative process. During training, our model treats the sequence of anchor words as a latent variable and attempts to induce anchoring sequences that help guide generation in an unsupervised fashion. We conduct experiments with several types of sentence decoder distributions: left-to-right and non-monotonic, with different degrees of restriction. Further, since we use amortized variational inference to train our model, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
