Leveraging AI to Generate Audio for User-generated Content in Video Games
Thomas Marrinan, Pakeeza Akram, Oli Gurmessa, Anthony Shishkin

TL;DR
This paper explores using generative AI to produce real-time audio for user-created content in video games, addressing the challenge of creating diverse sounds for limitless user-generated environments and objects.
Contribution
It introduces two novel methods for AI-driven audio generation based on text descriptions and images of user content in video games.
Findings
Prototype games demonstrate real-time audio generation for user-created content.
AI methods successfully produce contextually relevant environmental sounds.
Discussion includes ethical considerations of AI-generated game audio.
Abstract
In video game design, audio (both environmental background music and object sound effects) play a critical role. Sounds are typically pre-created assets designed for specific locations or objects in a game. However, user-generated content is becoming increasingly popular in modern games (e.g. building custom environments or crafting unique objects). Since the possibilities are virtually limitless, it is impossible for game creators to pre-create audio for user-generated content. We explore the use of generative artificial intelligence to create music and sound effects on-the-fly based on user-generated content. We investigate two avenues for audio generation: 1) text-to-audio: using a text description of user-generated content as input to the audio generator, and 2) image-to-audio: using a rendering of the created environment or object as input to an image-to-text generator, then piping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Artificial Intelligence in Games · Video Analysis and Summarization
