TL;DR
The paper introduces ITO-Master, a novel inference-time optimization framework for audio mastering style transfer that enhances user control and improves the fidelity of the transferred mastering style.
Contribution
It presents a new inference-time optimization method for mastering style transfer, enabling dynamic user adjustments and improved style similarity in audio processing.
Findings
ITO improves style transfer accuracy
User control enhances mastering results
Both black-box and white-box models benefit from ITO
Abstract
Music mastering style transfer aims to model and apply the mastering characteristics of a reference track to a target track, simulating the professional mastering process. However, existing methods apply fixed processing based on a reference track, limiting users' ability to fine-tune the results to match their artistic intent. In this paper, we introduce the ITO-Master framework, a reference-based mastering style transfer system that integrates Inference-Time Optimization (ITO) to enable finer user control over the mastering process. By optimizing the reference embedding during inference, our approach allows users to refine the output dynamically, making micro-level adjustments to achieve more precise mastering results. We explore both black-box and white-box methods for modeling mastering processors and demonstrate that ITO improves mastering performance across different styles.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
