WakaVT: A Sequential Variational Transformer for Waka Generation
Yuka Takeishi, Mingxuan Niu, Jing Luo, Zhong Jin, Xinyu Yang

TL;DR
WakaVT is a novel model combining Transformer and variational autoencoder techniques to generate high-quality Japanese Waka poetry from user keywords, addressing form, diversity, and linguistic quality.
Contribution
This paper introduces the first Transformer-variational autoencoder model for Waka generation, improving form adherence, diversity, and linguistic quality in Japanese poetry AI.
Findings
WakaVT outperforms baseline models in objective metrics.
Subjective evaluations favor WakaVT's poetic quality.
The model effectively balances form, diversity, and fluency.
Abstract
Poetry generation has long been a challenge for artificial intelligence. In the scope of Japanese poetry generation, many researchers have paid attention to Haiku generation, but few have focused on Waka generation. To further explore the creative potential of natural language generation systems in Japanese poetry creation, we propose a novel Waka generation model, WakaVT, which automatically produces Waka poems given user-specified keywords. Firstly, an additive mask-based approach is presented to satisfy the form constraint. Secondly, the structures of Transformer and variational autoencoder are integrated to enhance the quality of generated content. Specifically, to obtain novelty and diversity, WakaVT employs a sequence of latent variables, which effectively captures word-level variability in Waka data. To improve linguistic quality in terms of fluency, coherence, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Games · Multimodal Machine Learning Applications
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Attention Is All You Need · Dropout · Residual Connection · Byte Pair Encoding · Layer Normalization
