A Temporal Variational Model for Story Generation
David Wilmot, Frank Keller

TL;DR
This paper introduces a temporal variational model using TD-VAE for story generation, improving long-term coherence and plot development through latent vector planning and reranking, outperforming some baselines in automatic and human evaluations.
Contribution
It presents a novel latent planning approach with TD-VAE for story generation, demonstrating improved coherence and evaluation performance over baseline models.
Findings
TD-VAE reranking enhances story quality in automatic and human assessments.
Conditioning on latent vectors reduces diversity and narrative progression.
The model outperforms GPT-2 baseline and matches hierarchical LSTM in evaluations.
Abstract
Recent language models can generate interesting and grammatically correct text in story generation but often lack plot development and long-term coherence. This paper experiments with a latent vector planning approach based on a TD-VAE (Temporal Difference Variational Autoencoder), using the model for conditioning and reranking for text generation. The results demonstrate strong performance in automatic cloze and swapping evaluations. The human judgments show stories generated with TD-VAE reranking improve on a GPT-2 medium baseline and show comparable performance to a hierarchical LSTM reranking model. Conditioning on the latent vectors proves disappointing and deteriorates performance in human evaluation because it reduces the diversity of generation, and the models don't learn to progress the narrative. This highlights an important difference between technical task performance (e.g.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Byte Pair Encoding · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections
