Bridging Neural and Symbolic Representations with Transitional Dictionary Learning
Junyan Cheng, Peter Chin

TL;DR
This paper presents a Transitional Dictionary Learning framework that implicitly learns symbolic knowledge and compositional patterns in visual data, outperforming existing unsupervised methods and aligning well with human evaluations.
Contribution
The introduction of a game-theoretic diffusion model and novel metrics for evaluating symbolic and compositional learning in visual datasets.
Findings
Outperforms state-of-the-art unsupervised part segmentation methods
Discoveries of meaningful compositional patterns in visual data
Metrics align well with human judgment
Abstract
This paper introduces a novel Transitional Dictionary Learning (TDL) framework that can implicitly learn symbolic knowledge, such as visual parts and relations, by reconstructing the input as a combination of parts with implicit relations. We propose a game-theoretic diffusion model to decompose the input into visual parts using the dictionaries learned by the Expectation Maximization (EM) algorithm, implemented as the online prototype clustering, based on the decomposition results. Additionally, two metrics, clustering information gain, and heuristic shape score are proposed to evaluate the model. Experiments are conducted on three abstract compositional visual object datasets, which require the model to utilize the compositionality of data instead of simply exploiting visual features. Then, three tasks on symbol grounding to predefined classes of parts and relations, as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
