Diverse Melody Generation from Chinese Lyrics via Mutual Information   Maximization

Ruibin Yuan; Ge Zhang; Anqiao Yang; Xinyue Zhang

arXiv:2012.03805·cs.SD·December 8, 2020

Diverse Melody Generation from Chinese Lyrics via Mutual Information Maximization

Ruibin Yuan, Ge Zhang, Anqiao Yang, Xinyue Zhang

PDF

Open Access

TL;DR

This paper introduces DMG, a novel method using mutual information maximization for Chinese lyrics-conditioned melody generation, enhancing diversity, alignment, and overall quality of generated melodies.

Contribution

It adapts mutual information maximization to improve diversity and alignment in Chinese lyrics melody generation, incorporating scheduled sampling and force decoding techniques.

Findings

01

DMG generates more pleasing melodies than baselines

02

Improved alignment between lyrics and melodies

03

Enhanced diversity in generated tunes

Abstract

In this paper, we propose to adapt the method of mutual information maximization into the task of Chinese lyrics conditioned melody generation to improve the generation quality and diversity. We employ scheduled sampling and force decoding techniques to improve the alignment between lyrics and melodies. With our method, which we called Diverse Melody Generation (DMG), a sequence-to-sequence model learns to generate diverse melodies heavily depending on the input style ids, while keeping the tonality and improving the alignment. The experimental results of subjective tests show that DMG can generate more pleasing and coherent tunes than baseline methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Generative Adversarial Networks and Image Synthesis · Topic Modeling