6KSFx Synth Dataset

Nelly Garcia; Joshua Reiss

arXiv:2501.17198·cs.SD·January 30, 2025

6KSFx Synth Dataset

Nelly Garcia, Joshua Reiss

PDF

Open Access 1 Repo

TL;DR

This paper introduces the 6KSFx Synth Dataset, a comprehensive collection of 6000 synthetic audio samples across 30 sound categories, aimed at advancing research and development in procedural audio and sound synthesis.

Contribution

It provides the first large-scale, publicly available dataset of synthetic sounds with detailed synthesis descriptions to facilitate evaluation and innovation in procedural audio.

Findings

01

Enables robust evaluation frameworks for procedural audio

02

Highlights the diversity of synthesis methods across sound categories

03

Supports accelerated development of sound synthesis models

Abstract

Procedural audio, often referred to as "digital Foley", generates sound from scratch using computational processes. It represents an innovative approach to sound-effects creation. However, the development and adoption of procedural audio has been constrained by a lack of publicly available datasets and models, which hinders evaluation and optimization. To address this important gap, this paper presents a dataset of 6000 synthetic audio samples specifically designed to advance research and development in sound synthesis within 30 sound categories. By offering a description of the diverse synthesis methods used in each sound category and supporting the creation of robust evaluation frameworks, this dataset not only highlights the potential of procedural audio, but also provides a resource for researchers, audio developers, and sound designers. This contribution can accelerate the progress…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nellyngz95/6ksfx
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Imaging Techniques and Applications