Deep learning and sub-word-unit approach in written art generation
Krzysztof Wo{\l}k, Emilia Zawadzka-Gosk, Wojciech Czarnowski

TL;DR
This paper introduces a novel sub-word-unit approach for neural network-based poetry generation, significantly improving the quality of generated poems by better capturing linguistic features, especially in morphologically rich languages like Polish.
Contribution
The work presents a new sub-word-unit method for training RNNs on poetry, including a specialized stemming pre-processing step, which enhances the naturalness and metrical accuracy of generated poems.
Findings
Sub-word units outperform character-based models in poem quality.
The approach effectively captures Polish language morphology.
Generated poems closely follow source text metre and vocabulary.
Abstract
Automatic poetry generation is novel and interesting application of natural language processing research. It became more popular during the last few years due to the rapid development of technology and neural computing power. This line of research can be applied to the study of linguistics and literature, for social science experiments, or simply for entertainment. The most effective known method of artificial poem generation uses recurrent neural networks (RNN). We also used RNNs to generate poems in the style of Adam Mickiewicz. Our network was trained on the Sir Thaddeus poem. For data pre-processing, we used a specialized stemming tool, which is one of the major innovations and contributions of this work. Our experiment was conducted on the source text, divided into sub-word units (at a level of resolution close to syllables). This approach is novel and is not often employed in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdam
