Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction
Hanne Stenzel, Davide Berghi, Marco Volino, Philip J.B. Jackson

TL;DR
This paper introduces a new volumetric dataset of sounding actions combining high-quality 3D video and audio, designed to support immersive audio-visual system testing and development.
Contribution
It provides a comprehensive naturalistic audio-visual dataset with diverse sound types for six degree-of-freedom interaction research.
Findings
Dataset includes 40 short action sequences.
Captures diverse sound types and features.
Supports integrated bimodal experience testing.
Abstract
As audio-visual systems increasingly bring immersive and interactive capabilities into our work and leisure activities, so the need for naturalistic test material grows. New volumetric datasets have captured high-quality 3D video, but accompanying audio is often neglected, making it hard to test an integrated bimodal experience. Designed to cover diverse sound types and features, the presented volumetric dataset was constructed from audio and video studio recordings of scenes to yield forty short action sequences. Potential uses in technical and scientific tests are discussed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
