Improving Real-Time Music Accompaniment Separation with MMDenseNet
Chun-Hsiang Wang, Chung-Che Wang, Jun-You Wang, Jyh-Shing Roger Jang,, Yen-Hsun Chu

TL;DR
This paper improves a lightweight music source separation model, MMDenseNet, to achieve better real-time performance and acceptable separation quality, making it suitable for edge devices and short input durations.
Contribution
The paper introduces enhancements to MMDenseNet, including complex ideal ratio mask, self-attention, band-merge-split, and feature look back, balancing separation quality and latency for real-time use.
Findings
Achieves low real-time factor and optimal latency
Maintains acceptable separation quality
Suitable for real-time accompaniment separation on edge devices
Abstract
Music source separation aims to separate polyphonic music into different types of sources. Most existing methods focus on enhancing the quality of separated results by using a larger model structure, rendering them unsuitable for deployment on edge devices. Moreover, these methods may produce low-quality output when the input duration is short, making them impractical for real-time applications. Therefore, the goal of this paper is to enhance a lightweight model, MMDenstNet, to strike a balance between separation quality and latency for real-time applications. Different directions of improvement are explored or proposed in this paper, including complex ideal ratio mask, self-attention, band-merge-split method, and feature look back. Source-to-distortion ratio, real-time factor, and optimal latency are employed to evaluate the performance. To align with our application requirements, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis
MethodsFocus · ALIGN
