Embodied vision for learning object representations
Arthur Aubret, C\'eline Teuli\`ere, Jochen Triesch

TL;DR
This paper investigates how embodied, toddler-like visual inputs affect the development of object recognition in neural networks, highlighting the importance of realistic visual experiences in learning invariant object representations.
Contribution
It introduces a visually embodied agent that learns object representations through time-contrastive learning with toddler-like visual inputs, demonstrating improved recognition accuracy.
Findings
Toddler-like visual statistics enhance object recognition.
Background feature reduction improves model performance.
Embodied visual experience influences neural network biases.
Abstract
Recent time-contrastive learning approaches manage to learn invariant object representations without supervision. This is achieved by mapping successive views of an object onto close-by internal representations. When considering this learning approach as a model of the development of human object recognition, it is important to consider what visual input a toddler would typically observe while interacting with objects. First, human vision is highly foveated, with high resolution only available in the central region of the field of view. Second, objects may be seen against a blurry background due to infants' limited depth of field. Third, during object manipulation a toddler mostly observes close objects filling a large part of the field of view due to their rather short arms. Here, we study how these effects impact the quality of visual representations learnt through time-contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning · Infrared Target Detection Methodologies
