ScripTONES: Sentiment-Conditioned Music Generation for Movie Scripts

Vishruth Veerendranath; Vibha Masti; Utkarsh Gupta; Hrishit Chaudhuri,; Gowri Srinivasa

arXiv:2401.07084·cs.MM·January 17, 2024·1 cites

ScripTONES: Sentiment-Conditioned Music Generation for Movie Scripts

Vishruth Veerendranath, Vibha Masti, Utkarsh Gupta, Hrishit Chaudhuri,, Gowri Srinivasa

PDF

Open Access

TL;DR

This paper introduces ScripTONES, a two-stage system that analyzes movie script sentiment and generates matching piano music, aiming to automate film scoring for small-scale creators.

Contribution

It presents a novel pipeline combining sentiment analysis with conditional music generation, and explores improvements in sentiment-conditioning for VAE architectures.

Findings

01

Sentiment analysis effectively encodes scene emotion in valence-arousal space.

02

Conditional music generation can produce piano MIDI matching scene sentiment.

03

Proposed methods improve sentiment-conditioning in VAE-based music models.

Abstract

Film scores are considered an essential part of the film cinematic experience, but the process of film score generation is often expensive and infeasible for small-scale creators. Automating the process of film score composition would provide useful starting points for music in small projects. In this paper, we propose a two-stage pipeline for generating music from a movie script. The first phase is the Sentiment Analysis phase where the sentiment of a scene from the film script is encoded into the valence-arousal continuous space. The second phase is the Conditional Music Generation phase which takes as input the valence-arousal vector and conditionally generates piano MIDI music to match the sentiment. We study the efficacy of various music generation architectures by performing a qualitative user survey and propose methods to improve sentiment-conditioning in VAE architectures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Video Analysis and Summarization · Generative Adversarial Networks and Image Synthesis