Separating Knowledge and Perception with Procedural Data

Adri\'an Rodr\'iguez-Mu\~noz; Manel Baradad; Phillip Isola; Antonio Torralba

arXiv:2508.11697·cs.CV·August 19, 2025

Separating Knowledge and Perception with Procedural Data

Adri\'an Rodr\'iguez-Mu\~noz, Manel Baradad, Phillip Isola, Antonio Torralba

PDF

Open Access 1 Video

TL;DR

This paper introduces a procedural data-based representation model that achieves competitive performance on visual tasks without additional training, using an explicit visual memory for reference, and analyzes its limitations compared to real data models.

Contribution

The paper presents a novel procedural data training approach that fully separates knowledge representation from perception, enabling strong zero-shot performance across multiple visual tasks.

Findings

01

Achieves within 1% of Places-trained models on NIGHTS visual similarity

02

Outperforms fine-grained classification models on CUB200 and Flowers102

03

Demonstrates strong zero-shot segmentation close to real-data models

Abstract

We train representation models with procedural data only, and apply them on visual similarity, classification, and semantic segmentation tasks without further training by using visual memory -- an explicit database of reference image embeddings. Unlike prior work on visual memory, our approach achieves full compartmentalization with respect to all real-world images while retaining strong performance. Compared to a model trained on Places, our procedural model performs within $1%$ on NIGHTS visual similarity, outperforms by $8%$ and $15%$ on CUB200 and Flowers102 fine-grained classification, and is within $10%$ on ImageNet-1K classification. It also demonstrates strong zero-shot segmentation, achieving an $R^{2}$ on COCO within $10%$ of the models trained on real data. Finally, we analyze procedural versus real data models, showing that parts of the same object have dissimilar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Separating Knowledge and Perception with Procedural Data· slideslive

Taxonomy

TopicsSemantic Web and Ontologies