Threshold Independent Evaluation of Sound Event Detection Scores
Janek Ebbers, Romain Serizel, Reinhold Haeb-Umbach

TL;DR
This paper introduces a method for evaluating sound event detection systems across all thresholds simultaneously, providing a more comprehensive and threshold-independent performance assessment.
Contribution
It proposes a novel approach to compute system performance for all thresholds jointly, improving accuracy over existing approximation methods.
Findings
Enables exact computation of performance metrics across all thresholds.
Improves robustness of evaluation by removing threshold bias.
Supports application-specific threshold selection.
Abstract
Performing an adequate evaluation of sound event detection (SED) systems is far from trivial and is still subject to ongoing research. The recently proposed polyphonic sound detection (PSD)-receiver operating characteristic (ROC) and PSD score (PSDS) make an important step into the direction of an evaluation of SED systems which is independent from a certain decision threshold. This allows to obtain a more complete picture of the overall system behavior which is less biased by threshold tuning. Yet, the PSD-ROC is currently only approximated using a finite set of thresholds. The choice of the thresholds used in approximation, however, can have a severe impact on the resulting PSDS. In this paper we propose a method which allows for computing system performance on an evaluation set for all possible thresholds jointly, enabling accurate computation not only of the PSD-ROC and PSDS but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Advanced Adaptive Filtering Techniques
