Secure & Personalized Music-to-Video Generation via CHARCHA

Mehul Agarwal; Gauri Agarwal; Santiago Benoit; Andrew Lippman; Jean Oh

arXiv:2502.02610·cs.AI·February 6, 2025

Secure & Personalized Music-to-Video Generation via CHARCHA

Mehul Agarwal, Gauri Agarwal, Santiago Benoit, Andrew Lippman, Jean Oh

PDF

Open Access

TL;DR

This paper presents a secure, personalized music-to-video generation pipeline that combines multimodal techniques and a novel facial verification protocol to create immersive, user-specific music videos while ensuring privacy and ethical use.

Contribution

It introduces CHARCHA, a facial identity verification protocol, and a fully-automated pipeline for personalized, context-aware music video generation based on music and user images.

Findings

01

Successfully generates personalized music videos reflecting music and user identity.

02

Ensures privacy and security with the CHARCHA facial verification protocol.

03

Demonstrates the feasibility of immersive, user-specific music video creation.

Abstract

Music is a deeply personal experience and our aim is to enhance this with a fully-automated pipeline for personalized music video generation. Our work allows listeners to not just be consumers but co-creators in the music video generation process by creating personalized, consistent and context-driven visuals based on lyrics, rhythm and emotion in the music. The pipeline combines multimodal translation and generation techniques and utilizes low-rank adaptation on listeners' images to create immersive music videos that reflect both the music and the individual. To ensure the ethical use of users' identity, we also introduce CHARCHA (patent pending), a facial identity verification protocol that protects people against unauthorized use of their face while at the same time collecting authorized images from users for personalizing their videos. This paper thus provides a secure and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Music and Audio Processing