Improving Emotional Expression and Cohesion in Image-Based Playlist   Description and Music Topics: A Continuous Parameterization Approach

Yuelyu Ji; Yuheng Song; Wei Wang; Ruoyi Xu; Zhongqian Xie; Huiyun Liu

arXiv:2310.01248·cs.CL·October 16, 2023

Improving Emotional Expression and Cohesion in Image-Based Playlist Description and Music Topics: A Continuous Parameterization Approach

Yuelyu Ji, Yuheng Song, Wei Wang, Ruoyi Xu, Zhongqian Xie, Huiyun Liu

PDF

Open Access

TL;DR

This paper introduces a novel continuous parameterization method for controlled text generation in image-based music content platforms, enabling precise manipulation of style, emotion, and cohesion to improve relevance and coherence.

Contribution

It proposes CPCTG, a continuous control framework that integrates style learning, semantic cohesion, and emotional expression, advancing beyond discrete input limitations in text generation.

Findings

01

Significant ROUGE score improvements in experiments

02

Enhanced relevance and coherence in generated playlist descriptions

03

Effective manipulation of emotional expression and style

Abstract

Text generation in image-based platforms, particularly for music-related content, requires precise control over text styles and the incorporation of emotional expression. However, existing approaches often need help to control the proportion of external factors in generated text and rely on discrete inputs, lacking continuous control conditions for desired text generation. This study proposes Continuous Parameterization for Controlled Text Generation (CPCTG) to overcome these limitations. Our approach leverages a Language Model (LM) as a style learner, integrating Semantic Cohesion (SC) and Emotional Expression Proportion (EEP) considerations. By enhancing the reward method and manipulating the CPCTG level, our experiments on playlist description and music topic generation tasks demonstrate significant improvements in ROUGE scores, indicating enhanced relevance and coherence in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Topic Modeling · Computational and Text Analysis Methods