# A model of audio–visual motion integration during active self-movement

**Authors:** Maria Gallagher, Joshua D. Haynes, John F. Culling, Tom C. A. Freeman

PMC · DOI: 10.1167/jov.25.2.8 · Journal of Vision · 2025-02-19

## TL;DR

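This study examines how people integrate visual and auditory motion cues during self-generated yaw head rotations. Audio–visual performance was explained well by models in which both cues are transformed into common body-centered coordinates before integration, and not by a model without such a transformation.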

## Contribution

The paper proposes and tests a model in which audio and visual motion cues are separately transformed into a common body-centered frame using self-movement signals, and only then integrated.

## Key findings

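- Audio–visual performance was predicted well by models that transform cues into common coordinates, and could not be explained by a model that integrates without coordinate transformation.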
- Audio–visual precision was better predicted by a model that accounted for shared noise arising from signals encoding head movement.
- The findings suggest that motion perception in active observers is based on the integration of partially correlated body-centered signals.
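
To illustrate why a shared head-movement signal could leave the two body-centered cues partially correlated, here is a minimal sketch. It is not the authors' model. It assumes that each body-centered estimate is the sum of a head-centered estimate (with independent Gaussian noise) and one common head-velocity estimate (with its own Gaussian noise). The function name `body_centered_stats` and the parameters `sigma_a`, `sigma_v`, and `sigma_h` are hypothetical.

```python
import math

def body_centered_stats(sigma_a, sigma_v, sigma_h):
    """Noise of body-centered audio and visual cues under an additive assumption.

    sigma_a, sigma_v: SDs of the independent head-centered audio/visual noise.
    sigma_h: SD of the noise in the head-velocity signal shared by both cues.
    Returns (sd_audio_body, sd_visual_body, correlation).
    """
    # Adding the same noisy head signal to both cues inflates each variance...
    var_a = sigma_a ** 2 + sigma_h ** 2
    var_v = sigma_v ** 2 + sigma_h ** 2
    # ...and makes their covariance equal to the head-signal variance.
    rho = sigma_h ** 2 / math.sqrt(var_a * var_v)
    return math.sqrt(var_a), math.sqrt(var_v), rho

sd_a, sd_v, rho = body_centered_stats(sigma_a=3.0, sigma_v=3.0, sigma_h=3.0)
print(f"rho = {rho:.2f}")  # equal noise in all three signals gives rho = 0.50
```

Under these assumptions the correlation is zero only when the head signal is noiseless, and it grows as head-signal noise comes to dominate the cue-specific noise.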

## Abstract

Despite good evidence for optimal audio–visual integration in stationary observers, few studies have considered the impact of self-movement on this process. When the head and/or eyes move, the integration of vision and hearing is complicated, as the sensory measurements begin in different coordinate frames. To successfully integrate these signals, they must first be transformed into the same coordinate frame. We propose that audio and visual motion cues are separately transformed using self-movement signals, before being integrated as body-centered cues to audio–visual motion. We tested this hypothesis using a psychophysical audio–visual integration task in which participants made left/right judgments of audio, visual, or audio–visual targets during self-generated yaw head rotations. Estimates of precision and bias from the audio and visual conditions were used to predict performance in the audio–visual conditions. We found that audio–visual performance was predicted well by models that suggested the transformation of cues into common coordinates but could not be explained by a model that did not rely on coordinate transformation before integration. We also found that precision specifically was better predicted by a model that accounted for shared noise arising from signals encoding head movement. Taken together, our findings suggest that motion perception in active observers is based on the integration of partially correlated body-centered signals.
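
The prediction step described in the abstract, going from unimodal bias and precision to expected audio–visual performance, can be sketched with the standard minimum-variance rule for combining two correlated Gaussian cues. This is an illustration under stated assumptions, not the paper's fitting procedure: the function name `predict_av`, its parameters, and the use of a single correlation value `rho` are hypothetical. Setting `rho=0` recovers the familiar independent-cue prediction.

```python
import math

def predict_av(mu_a, sigma_a, mu_v, sigma_v, rho=0.0):
    """Predict audio-visual bias and SD from unimodal estimates.

    mu_*: unimodal biases (e.g. points of subjective equality).
    sigma_*: unimodal SDs (inverse of precision).
    rho: assumed correlation between the two cues' noise.
    """
    cov = rho * sigma_a * sigma_v
    denom = sigma_a ** 2 + sigma_v ** 2 - 2.0 * cov
    if denom <= 0.0:
        raise ValueError("cues are perfectly correlated with equal noise")
    # Minimum-variance weights for two correlated cues (weights sum to 1).
    w_a = (sigma_v ** 2 - cov) / denom
    w_v = 1.0 - w_a
    mu_av = w_a * mu_a + w_v * mu_v
    var_av = (sigma_a ** 2) * (sigma_v ** 2) * (1.0 - rho ** 2) / denom
    return mu_av, math.sqrt(var_av)

# Equal-reliability cues: independence predicts a larger precision gain
# than partially correlated noise does.
_, sd_indep = predict_av(0.0, 2.0, 0.0, 2.0, rho=0.0)
_, sd_corr = predict_av(0.0, 2.0, 0.0, 2.0, rho=0.5)
print(f"independent: {sd_indep:.3f}, correlated: {sd_corr:.3f}")
```

With equally reliable cues, correlated noise shrinks the predicted benefit of combining them. That is why a model ignoring shared head-movement noise would be expected to overestimate audio–visual precision.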

## Full-text entities

- **Diseases:** neurological or psychiatric conditions (MESH:D001523)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11841688/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11841688/full.md

## References

71 references — full list in the complete paper: https://tomesphere.com/paper/PMC11841688/full.md

---
Source: https://tomesphere.com/paper/PMC11841688