Towards Generalization in Subitizing with Neuro-Symbolic Loss using Holographic Reduced Representations
Mohammad Mahmudul Alam, Edward Raff, Tim Oates

TL;DR
This paper introduces a neuro-symbolic loss function using Holographic Reduced Representations to enhance the generalization ability of CNNs and ViTs in the task of subitizing, addressing limitations of standard deep learning approaches from a cognitive science perspective.
Contribution
The paper proposes a novel HRR-based loss function that improves subitizing generalization in CNNs and ViTs, integrating cognitive science tools into deep learning training.
Findings
HRR loss improves subitizing generalization but does not fully solve it
ViTs perform worse than CNNs in subitizing tasks, except with HRR loss
Saliency maps and out-of-distribution tests support the effectiveness of HRR loss
Abstract
While deep learning has enjoyed significant success in computer vision tasks over the past decade, many shortcomings still exist from a Cognitive Science (CogSci) perspective. In particular, the ability to subitize, i.e., quickly and accurately identify the small (less than 6) count of items, is not well learned by current Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs) when using a standard cross-entropy (CE) loss. In this paper, we demonstrate that adapting tools used in CogSci research can improve the subitizing generalization of CNNs and ViTs by developing an alternative loss function using Holographic Reduced Representations (HRRs). We investigate how this neuro-symbolic approach to learning affects the subitizing capability of CNNs and ViTs, and so we focus on specially crafted problems that isolate generalization to specific aspects of subitizing. Via saliency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Domain Adaptation and Few-Shot Learning
MethodsFocus
