Binding and Perspective Taking as Inference in a Generative Neural Network Model
Mahdi Sadeghi, Fabian Schrodt, Sebastian Otte, Martin V. Butz

TL;DR
This paper presents a generative neural network model that uses retrospective inference to solve the binding and perspective taking problem, enabling flexible feature integration and viewpoint adaptation in biological motion perception.
Contribution
It introduces a novel gradient-based inference method for binding and perspective taking within a generative encoder-decoder neural architecture, applicable to diverse perceptual domains.
Findings
The model successfully solves binding and perspective taking for biological motion.
Redundant features and population encodings enhance inference accuracy.
The approach mimics Gestalt perception mechanisms.
Abstract
The ability to flexibly bind features into coherent wholes from different perspectives is a hallmark of cognition and intelligence. Importantly, the binding problem is not only relevant for vision but also for general intelligence, sensorimotor integration, event processing, and language. Various artificial neural network models have tackled this problem with dynamic neural fields and related approaches. Here we focus on a generative encoder-decoder architecture that adapts its perspective and binds features by means of retrospective inference. We first train a model to learn sufficiently accurate generative models of dynamic biological motion or other harmonic motion patterns, such as a pendulum. We then scramble the input to a certain extent, possibly vary the perspective onto it, and propagate the prediction error back onto a binding matrix, that is, hidden neural states that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
