SonifyAR: Context-Aware Sound Generation in Augmented Reality
Xia Su, Jon E. Froehlich, Eunyee Koh, Chang Xiao

TL;DR
SonifyAR is an innovative system that leverages large language models to generate context-aware sound effects in augmented reality, enhancing user experience and immersion through automatic sound authoring.
Contribution
The paper introduces SonifyAR, a novel LLM-based AR sound authoring system that automatically collects context and generates sound effects, expanding AR sound design capabilities.
Findings
User study shows high usability of SonifyAR
System effectively generates contextually appropriate sounds
Applications demonstrate practical benefits in AR experiences
Abstract
Sound plays a crucial role in enhancing user experience and immersiveness in Augmented Reality (AR). However, current platforms lack support for AR sound authoring due to limited interaction types, challenges in collecting and specifying context information, and difficulty in acquiring matching sound assets. We present SonifyAR, an LLM-based AR sound authoring system that generates context-aware sound effects for AR experiences. SonifyAR expands the current design space of AR sound and implements a Programming by Demonstration (PbD) pipeline to automatically collect contextual information of AR events, including virtual content semantics and real world context. This context information is then processed by a large language model to acquire sound effects with Recommendation, Retrieval, Generation, and Transfer methods. To evaluate the usability and performance of our system, we conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Tactile and Sensory Interactions
