SizeNet: Object Recognition via Object Real Size-based Convolutional Networks
Xiaofei Li, Zhong Dong

TL;DR
SizeNet introduces a novel object recognition framework that incorporates real object sizes alongside features, significantly improving accuracy and interpretability over existing deep learning models.
Contribution
The paper presents SizeNet, a new object recognition model that leverages real object sizes to enhance recognition accuracy and distinguish objects with similar features.
Findings
SizeNet outperforms AlexNet, VGG-16, Inception V3, Resnet-18, and DenseNet-121 in accuracy.
SizeNet effectively differentiates objects with similar features but different sizes.
Incorporating real size reduces the category label set for recognition tasks.
Abstract
Inspired by the conclusion that humans choose the visual cortex regions corresponding to the real size of an object to analyze its features when identifying objects in the real world, this paper presents a framework, SizeNet, which is based on both the real sizes and features of objects to solve object recognition problems. SizeNet was used for object recognition experiments on the homemade Rsize dataset, and was compared with the state-of-the-art methods AlexNet, VGG-16, Inception V3, Resnet-18, and DenseNet-121. The results showed that SizeNet provides much higher accuracy rates for object recognition than the other algorithms. SizeNet can solve the two problems of correctly recognizing objects with highly similar features but real sizes that are obviously different from each other, and correctly distinguishing a target object from interference objects whose real sizes are obviously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Cell Image Analysis Techniques
