WakaVT: A Sequential Variational Transformer for Waka Generation

Yuka Takeishi; Mingxuan Niu; Jing Luo; Zhong Jin; Xinyu Yang

arXiv:2104.00426·cs.CL·April 2, 2021

WakaVT: A Sequential Variational Transformer for Waka Generation

Yuka Takeishi, Mingxuan Niu, Jing Luo, Zhong Jin, Xinyu Yang

PDF

Open Access

TL;DR

WakaVT is a novel model combining Transformer and variational autoencoder techniques to generate high-quality Japanese Waka poetry from user keywords, addressing form, diversity, and linguistic quality.

Contribution

This paper introduces the first Transformer-variational autoencoder model for Waka generation, improving form adherence, diversity, and linguistic quality in Japanese poetry AI.

Findings

01

WakaVT outperforms baseline models in objective metrics.

02

Subjective evaluations favor WakaVT's poetic quality.

03

The model effectively balances form, diversity, and fluency.

Abstract

Poetry generation has long been a challenge for artificial intelligence. In the scope of Japanese poetry generation, many researchers have paid attention to Haiku generation, but few have focused on Waka generation. To further explore the creative potential of natural language generation systems in Japanese poetry creation, we propose a novel Waka generation model, WakaVT, which automatically produces Waka poems given user-specified keywords. Firstly, an additive mask-based approach is presented to satisfy the form constraint. Secondly, the structures of Transformer and variational autoencoder are integrated to enhance the quality of generated content. Specifically, to obtain novelty and diversity, WakaVT employs a sequence of latent variables, which effectively captures word-level variability in Waka data. To improve linguistic quality in terms of fluency, coherence, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Games · Multimodal Machine Learning Applications

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Attention Is All You Need · Dropout · Residual Connection · Byte Pair Encoding · Layer Normalization