RPC: A Large-Scale Retail Product Checkout Dataset
Xiu-Shen Wei, Quan Cui, Lei Yang, Peng Wang, Lingqiao Liu

TL;DR
This paper introduces a large-scale retail product checkout dataset designed to advance computer vision research in automatic checkout systems, featuring diverse, realistic images and annotations, and provides benchmark results for various methods.
Contribution
The paper presents the largest retail checkout dataset with multi-level annotations, enabling realistic and diverse research in automatic checkout systems.
Findings
The dataset contains the largest number of product categories and images to date.
Benchmarking shows varying performance of different approaches on the dataset.
The dataset improves the realism and applicability of research in retail checkout scenarios.
Abstract
Over recent years, emerging interest has occurred in integrating computer vision technology into the retail industry. Automatic checkout (ACO) is one of the critical problems in this area which aims to automatically generate the shopping list from the images of the products to purchase. The main challenge of this problem comes from the large scale and the fine-grained nature of the product categories as well as the difficulty for collecting training images that reflect the realistic checkout scenarios due to continuous update of the products. Despite its significant practical and research value, this problem is not extensively studied in the computer vision community, largely due to the lack of a high-quality dataset. To fill this gap, in this work we propose a new dataset to facilitate relevant research. Our dataset enjoys the following characteristics: (1) It is by far the largest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
