Naturalistic audio-visual volumetric sequences dataset of sounding   actions for six degree-of-freedom interaction

Hanne Stenzel; Davide Berghi; Marco Volino; Philip J.B. Jackson

arXiv:2105.00641·cs.MM·May 4, 2021

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction

Hanne Stenzel, Davide Berghi, Marco Volino, Philip J.B. Jackson

PDF

TL;DR

This paper introduces a new volumetric dataset of sounding actions combining high-quality 3D video and audio, designed to support immersive audio-visual system testing and development.

Contribution

It provides a comprehensive naturalistic audio-visual dataset with diverse sound types for six degree-of-freedom interaction research.

Findings

01

Dataset includes 40 short action sequences.

02

Captures diverse sound types and features.

03

Supports integrated bimodal experience testing.

Abstract

As audio-visual systems increasingly bring immersive and interactive capabilities into our work and leisure activities, so the need for naturalistic test material grows. New volumetric datasets have captured high-quality 3D video, but accompanying audio is often neglected, making it hard to test an integrated bimodal experience. Designed to cover diverse sound types and features, the presented volumetric dataset was constructed from audio and video studio recordings of scenes to yield forty short action sequences. Potential uses in technical and scientific tests are discussed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.