Addressing the Recitative Problem in Real-time Opera Tracking
Charles Brazier, Gerhard Widmer

TL;DR
This paper improves real-time opera tracking by using dual trackers focused on music and speech features, addressing challenges during recitative passages for more accurate alignment in live settings.
Contribution
It introduces a novel dual-tracker approach combining music and speech classifiers to enhance score following accuracy during recitative sections.
Findings
Dual trackers improve alignment accuracy during recitatives.
Speech features help distinguish recitative passages from music.
Combining classifiers yields more robust real-time opera tracking.
Abstract
Robust real-time opera tracking (score following) would be extremely useful for many processes surrounding live opera staging and streaming, including automatic lyrics displays, camera control, or live video cutting. Recent work has shown that, with some appropriate measures to account for common problems such as breaks and interruptions, spontaneous applause, various noises and interludes, current audio-to-audio alignment algorithms can be made to follow an entire opera from beginning to end, in a relatively robust way. However, they remain inaccurate when the textual content becomes prominent against the melody or music -- notably, during recitativo passages. In this paper, we address this specific problem by proposing to use two specialized trackers in parallel, one focusing on music-, the other on speech-sensitive features. We first carry out a systematic study on speech-related…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
