Object Pursuit: Building a Space of Objects via Discriminative Weight Generation
Chuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, and Leonidas Guibas

TL;DR
This paper introduces a novel framework for learning object-centric representations from visual data by leveraging interactions and discriminative weight generation, reducing annotation needs and improving robustness.
Contribution
It presents a new method that continuously learns object representations without explicit supervision, using a hypernetwork to generate discriminative weights for objects in complex scenes.
Findings
Effective sampling of diverse object variations
Robust re-identification and forgetting prevention
Improved label efficiency in downstream tasks
Abstract
We propose a framework to continuously learn object-centric representations for visual learning and understanding. Existing object-centric representations either rely on supervisions that individualize objects in the scene, or perform unsupervised disentanglement that can hardly deal with complex scenes in the real world. To mitigate the annotation burden and relax the constraints on the statistical complexity of the data, our method leverages interactions to effectively sample diverse variations of an object and the corresponding training signals while learning the object-centric representations. Throughout learning, objects are streamed one by one in random order with unknown identities, and are associated with latent codes that can synthesize discriminative weights for each object through a convolutional hypernetwork. Moreover, re-identification of learned objects and forgetting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Human Pose and Action Recognition
