TL;DR
This paper introduces a comprehensive dataset with subjective quality labels for Time Scale Modification (TSM), enabling the development and evaluation of objective quality measures, and provides initial results demonstrating the dataset's utility.
Contribution
The paper presents a new, large-scale dataset with subjective quality ratings for TSM, including diverse audio types and extensive listener evaluations, to improve objective quality assessment methods.
Findings
Objective measures poorly correlate with subjective quality.
Listeners' ratings are consistent across demographics.
Initial retrained objective measure shows promising results.
Abstract
Time Scale Modification (TSM) is a well-researched field; however, no effective objective measure of quality exists. This paper details the creation, subjective evaluation, and analysis of a dataset for use in the development of an objective measure of quality for TSM. Comprised of two parts, the training component contains 88 source files processed using six TSM methods at 10 time scales, while the testing component contains 20 source files processed using three additional methods at four time scales. The source material contains speech, solo harmonic and percussive instruments, sound effects, and a range of music genres. Ratings (42 529) were collected from 633 sessions using laboratory and remote collection methods. Analysis of results shows no correlation between age and quality of rating; expert and non-expert listeners to be equivalent; minor differences between participants with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
