Subitizing with Variational Autoencoders
Rijnder Wever, Tom F.H. Runia

TL;DR
This paper demonstrates that variational autoencoders can spontaneously learn to subitize in complex natural images without supervision, encoding numerosity as a basic visual property and showing invariance to object size.
Contribution
It shows that unsupervised hierarchical neural networks, specifically variational autoencoders, can perform subitizing on natural images without supervision, aligning with biological neural properties.
Findings
VAEs can perform subitizing after unsupervised training
Networks encode numerosity as a visual property
Representations are invariant to object area
Abstract
Numerosity, the number of objects in a set, is a basic property of a given visual scene. Many animals develop the perceptual ability to subitize: the near-instantaneous identification of the numerosity in small sets of visual items. In computer vision, it has been shown that numerosity emerges as a statistical property in neural networks during unsupervised learning from simple synthetic images. In this work, we focus on more complex natural images using unsupervised hierarchical neural networks. Specifically, we show that variational autoencoders are able to spontaneously perform subitizing after training without supervision on a large amount images from the Salient Object Subitizing dataset. While our method is unable to outperform supervised convolutional networks for subitizing, we observe that the networks learn to encode numerosity as basic visual property. Moreover, we find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Cell Image Analysis Techniques · Digital Imaging for Blood Diseases
