Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection
Ksheeraja Raghavan, Samiran Gode, Ankit Shah, Surabhi Raghavan,, Wolfram Burgard, Bhiksha Raj, Rita Singh

TL;DR
This paper presents AADG, a flexible framework that uses large language models to generate diverse, realistic audio datasets for anomaly detection, addressing the scarcity of varied benchmark data in the field.
Contribution
Introduces a novel, modular audio data generation framework leveraging LLMs to create realistic, diverse datasets for anomaly detection in various environments.
Findings
Generated datasets improve anomaly detection model performance.
Framework successfully simulates real-world audio scenarios.
Ensures high reliability through rigorous output verification.
Abstract
We introduce a novel, general-purpose audio generation framework specifically designed for anomaly detection and localization. Unlike existing datasets that predominantly focus on industrial and machine-related sounds, our framework focuses a broader range of environments, particularly useful in real-world scenarios where only audio data are available, such as in video-derived or telephonic audio. To generate such data, we propose a new method inspired by the LLM-Modulo framework, which leverages large language models(LLMs) as world models to simulate such real-world scenarios. This tool is modular allowing a plug-and-play approach. It operates by first using LLMs to predict plausible real-world scenarios. An LLM further extracts the constituent sounds, the order and the way in which these should be merged to create coherent wholes. Much like the LLM-Modulo framework, we include…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Anomaly Detection Techniques and Applications
MethodsFocus
