Binding Dancers Into Attractors

Franziska Kaltenberger; Sebastian Otte; Martin V. Butz

arXiv:2206.02558·q-bio.NC·June 7, 2022

Binding Dancers Into Attractors

Franziska Kaltenberger, Sebastian Otte, Martin V. Butz

PDF

Open Access

TL;DR

This paper introduces a recurrent neural network model that integrates feature binding and perspective taking to perceive and interpret 3D motion, demonstrating its ability to resolve visual illusions and infer depth and rotation direction.

Contribution

The model uniquely combines feature binding and perspective taking in a recurrent neural network, enabling it to interpret complex 3D motion and resolve perceptual ambiguities.

Findings

01

Successfully predicts 3D motion dynamics from a canonical perspective.

02

Reorders and binds features into Gestalts using retrospective inference.

03

Resolves the silhouette illusion by interpreting ambiguous stimuli as different depth perceptions.

Abstract

To effectively perceive and process observations in our environment, feature binding and perspective taking are crucial cognitive abilities. Feature binding combines observed features into one entity, called a Gestalt. Perspective taking transfers the percept into a canonical, observer-centered frame of reference. Here we propose a recurrent neural network model that solves both challenges. We first train an LSTM to predict 3D motion dynamics from a canonical perspective. We then present similar motion dynamics with novel viewpoints and feature arrangements. Retrospective inference enables the deduction of the canonical perspective. Combined with a robust mutual-exclusive softmax selection scheme, random feature arrangements are reordered and precisely bound into known Gestalt percepts. To corroborate evidence for the architecture's cognitive validity, we examine its behavior on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Human Motion and Animation

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Softmax