Fine-Grained Product Class Recognition for Assisted Shopping
Marian George, Dejan Mircic, G\'abor S\"or\"os, Christian, Floerkemeier, Friedemann Mattern

TL;DR
This paper introduces a comprehensive system for visually recognizing fine-grained product classes in grocery stores to assist shoppers, especially the visually impaired, by combining text recognition, visual patch analysis, and active learning.
Contribution
It presents a novel multi-component system that integrates text recognition, discriminative visual patches, and active learning for scalable, robust product class recognition in real-world shopping scenarios.
Findings
Robust recognition across different store environments.
Scalable to large product databases with minimal re-training.
Improved accuracy through active learning.
Abstract
Assistive solutions for a better shopping experience can improve the quality of life of people, in particular also of visually impaired shoppers. We present a system that visually recognizes the fine-grained product classes of items on a shopping list, in shelves images taken with a smartphone in a grocery store. Our system consists of three components: (a) We automatically recognize useful text on product packaging, e.g., product name and brand, and build a mapping of words to product classes based on the large-scale GroceryProducts dataset. When the user populates the shopping list, we automatically infer the product class of each entered word. (b) We perform fine-grained product class recognition when the user is facing a shelf. We discover discriminative patches on product packaging to differentiate between visually similar product classes and to increase the robustness against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · QR Code Applications and Technologies
