An Improved Deep Learning Approach For Product Recognition on Racks in Retail Stores
Ankit Sinha, Soham Banerjee, Pratik Chattopadhyay

TL;DR
This paper presents a lightweight, two-stage deep learning pipeline combining Faster R-CNN and ResNet-18 for accurate product recognition on retail racks, improving effectiveness and memory efficiency.
Contribution
It introduces a novel end-to-end approach with fine-tuned models and data augmentation, enhancing product recognition accuracy in retail environments.
Findings
Effective detection and classification on Grozi-32k and GP-180 datasets.
Lightweight models suitable for real-time deployment.
Improved accuracy over existing methods.
Abstract
Automated product recognition in retail stores is an important real-world application in the domain of Computer Vision and Pattern Recognition. In this paper, we consider the problem of automatically identifying the classes of the products placed on racks in retail stores from an image of the rack and information about the query/product images. We improve upon the existing approaches in terms of effectiveness and memory requirement by developing a two-stage object detection and recognition pipeline comprising of a Faster-RCNN-based object localizer that detects the object regions in the rack image and a ResNet-18-based image encoder that classifies the detected regions into the appropriate classes. Each of the models is fine-tuned using appropriate data sets for better prediction and data augmentation is performed on each query image to prepare an extensive gallery set for fine-tuning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection · Industrial Vision Systems and Defect Detection
MethodsTriplet Loss
