Automated Time-frequency Domain Audio Crossfades using Graph Cuts

Kyle Robinson; Dan Brown

arXiv:2301.13380·cs.SD·February 1, 2023

Automated Time-frequency Domain Audio Crossfades using Graph Cuts

Kyle Robinson, Dan Brown

PDF

Open Access

TL;DR

This paper introduces a novel automated method for seamless audio crossfades in the time-frequency domain, utilizing graph cut optimization to improve transition quality in personal music playback.

Contribution

It proposes a new approach that discretizes the frequency spectrum and applies graph flow optimization for automatic audio transitions, a first in this context.

Findings

01

First automatic time-frequency domain crossfade method

02

Uses graph cut optimization for smooth transitions

03

Potentially improves user experience in music playback

Abstract

The problem of transitioning smoothly from one audio clip to another arises in many music consumption scenarios, especially as music consumption has moved from professionally curated and live-streamed radios to personal playback devices and services. we present the first steps toward a new method of automatically transitioning from one audio clip to another by discretizing the frequency spectrum into bins and then finding transition times for each bin. We phrase the problem as one of graph flow optimization; specifically min-cut/max-flow.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies

MethodsContrastive Language-Image Pre-training