Binding Dancers Into Attractors
Franziska Kaltenberger, Sebastian Otte, Martin V. Butz

TL;DR
This paper introduces a recurrent neural network model that integrates feature binding and perspective taking to perceive and interpret 3D motion, demonstrating its ability to resolve visual illusions and infer depth and rotation direction.
Contribution
The model uniquely combines feature binding and perspective taking in a recurrent neural network, enabling it to interpret complex 3D motion and resolve perceptual ambiguities.
Findings
Successfully predicts 3D motion dynamics from a canonical perspective.
Reorders and binds features into Gestalts using retrospective inference.
Resolves the silhouette illusion by interpreting ambiguous stimuli as different depth perceptions.
Abstract
To effectively perceive and process observations in our environment, feature binding and perspective taking are crucial cognitive abilities. Feature binding combines observed features into one entity, called a Gestalt. Perspective taking transfers the percept into a canonical, observer-centered frame of reference. Here we propose a recurrent neural network model that solves both challenges. We first train an LSTM to predict 3D motion dynamics from a canonical perspective. We then present similar motion dynamics with novel viewpoints and feature arrangements. Retrospective inference enables the deduction of the canonical perspective. Combined with a robust mutual-exclusive softmax selection scheme, random feature arrangements are reordered and precisely bound into known Gestalt percepts. To corroborate evidence for the architecture's cognitive validity, we examine its behavior on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Human Motion and Animation
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Softmax
