A General Framework for Learning Prosodic-Enhanced Representation of Rap Lyrics
Hongru Liang, Haozheng Wang, Qian Li, Jun Wang, Guandong Xu, Jiawei, Chen, Jin-Mao Wei, Zhenglu Yang

TL;DR
This paper introduces HAVAE, a hierarchical autoencoder that effectively combines semantic and prosodic features, including rhyme information, to improve rap lyrics representation for various music-related applications.
Contribution
It presents a novel framework that integrates prosodic features using rhyme2vec and feature aggregation, advancing rap lyrics analysis beyond existing methods.
Findings
Outperforms state-of-the-art approaches in multiple rap lyrics tasks
Effectively encodes prosodic features with rhyme2vec
Enhances lyrics representation for music applications
Abstract
Learning and analyzing rap lyrics is a significant basis for many web applications, such as music recommendation, automatic music categorization, and music information retrieval, due to the abundant source of digital music in the World Wide Web. Although numerous studies have explored the topic, knowledge in this field is far from satisfactory, because critical issues, such as prosodic information and its effective representation, as well as appropriate integration of various features, are usually ignored. In this paper, we propose a hierarchical attention variational autoencoder framework (HAVAE), which simultaneously consider semantic and prosodic features for rap lyrics representation learning. Specifically, the representation of the prosodic features is encoded by phonetic transcriptions with a novel and effective strategy~(i.e., rhyme2vec). Moreover, a feature aggregation strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSolana Customer Service Number +1-833-534-1729
