Separating Knowledge and Perception with Procedural Data
Adri\'an Rodr\'iguez-Mu\~noz, Manel Baradad, Phillip Isola, Antonio Torralba

TL;DR
This paper introduces a procedural data-based representation model that achieves competitive performance on visual tasks without additional training, using an explicit visual memory for reference, and analyzes its limitations compared to real data models.
Contribution
The paper presents a novel procedural data training approach that fully separates knowledge representation from perception, enabling strong zero-shot performance across multiple visual tasks.
Findings
Achieves within 1% of Places-trained models on NIGHTS visual similarity
Outperforms fine-grained classification models on CUB200 and Flowers102
Demonstrates strong zero-shot segmentation close to real-data models
Abstract
We train representation models with procedural data only, and apply them on visual similarity, classification, and semantic segmentation tasks without further training by using visual memory -- an explicit database of reference image embeddings. Unlike prior work on visual memory, our approach achieves full compartmentalization with respect to all real-world images while retaining strong performance. Compared to a model trained on Places, our procedural model performs within on NIGHTS visual similarity, outperforms by and on CUB200 and Flowers102 fine-grained classification, and is within on ImageNet-1K classification. It also demonstrates strong zero-shot segmentation, achieving an on COCO within of the models trained on real data. Finally, we analyze procedural versus real data models, showing that parts of the same object have dissimilar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSemantic Web and Ontologies
