6KSFx Synth Dataset
Nelly Garcia, Joshua Reiss

TL;DR
This paper introduces the 6KSFx Synth Dataset, a comprehensive collection of 6000 synthetic audio samples across 30 sound categories, aimed at advancing research and development in procedural audio and sound synthesis.
Contribution
It provides the first large-scale, publicly available dataset of synthetic sounds with detailed synthesis descriptions to facilitate evaluation and innovation in procedural audio.
Findings
Enables robust evaluation frameworks for procedural audio
Highlights the diversity of synthesis methods across sound categories
Supports accelerated development of sound synthesis models
Abstract
Procedural audio, often referred to as "digital Foley", generates sound from scratch using computational processes. It represents an innovative approach to sound-effects creation. However, the development and adoption of procedural audio has been constrained by a lack of publicly available datasets and models, which hinders evaluation and optimization. To address this important gap, this paper presents a dataset of 6000 synthetic audio samples specifically designed to advance research and development in sound synthesis within 30 sound categories. By offering a description of the diverse synthesis methods used in each sound category and supporting the creation of robust evaluation frameworks, this dataset not only highlights the potential of procedural audio, but also provides a resource for researchers, audio developers, and sound designers. This contribution can accelerate the progress…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications
