Disentangling What and Where for 3D Object-Centric Representations Through Active Inference
Toon Van de Maele, Tim Verbelen, Ozan Catal, Bart Dhoedt

TL;DR
This paper introduces an active inference agent that learns and classifies 3D objects by integrating what and where information streams, enabling unsupervised learning, ambiguity resolution, and novel category identification in a flexible, human-inspired manner.
Contribution
It presents a novel active inference framework with dual information streams for 3D object representation, learning, and classification, including the ability to identify new categories.
Findings
Achieves state-of-the-art classification accuracy.
Learns object representations in an unsupervised manner.
Successfully identifies novel object categories.
Abstract
Although modern object detection and classification models achieve high accuracy, these are typically constrained in advance on a fixed train set and are therefore not flexible to deal with novel, unseen object categories. Moreover, these models most often operate on a single frame, which may yield incorrect classifications in case of ambiguous viewpoints. In this paper, we propose an active inference agent that actively gathers evidence for object classifications, and can learn novel object categories over time. Drawing inspiration from the human brain, we build object-centric generative models composed of two information streams, a what- and a where-stream. The what-stream predicts whether the observed object belongs to a specific category, while the where-stream is responsible for representing the object in its internal 3D reference frame. We show that our agent (i) is able to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
