Fast, Accurate Barcode Detection in Ultra High-Resolution Images
Jerome Quenum, Kehan Wang, Avideh Zakhor

TL;DR
This paper introduces a semantic segmentation approach for fast and accurate barcode detection in ultra high-resolution images, significantly outperforming existing methods in speed and accuracy.
Contribution
The paper presents a novel pipeline combining a modified RPN and a new Y-Net segmentation network for efficient barcode detection in UHR images.
Findings
End-to-end system latency of 16 ms, 2.5x faster than YOLOv4.
Outperforms YOLOv4 and Mask R-CNN in accuracy on synthetic dataset.
Provides synthetic barcode dataset and code for research use.
Abstract
Object detection in Ultra High-Resolution (UHR) images has long been a challenging problem in computer vision due to the varying scales of the targeted objects. When it comes to barcode detection, resizing UHR input images to smaller sizes often leads to the loss of pertinent information, while processing them directly is highly inefficient and computationally expensive. In this paper, we propose using semantic segmentation to achieve a fast and accurate detection of barcodes of various scales in UHR images. Our pipeline involves a modified Region Proposal Network (RPN) on images of size greater than 10k10k and a newly proposed Y-Net segmentation network, followed by a post-processing workflow for fitting a bounding box around each segmented barcode mask. The end-to-end system has a latency of 16 milliseconds, which is faster than YOLOv4 and faster than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network · Grid Sensitive · Batch Normalization · Global Average Pooling · Sigmoid Activation · Average Pooling · Max Pooling · Residual Connection · BNB Customer Service Number +1-833-534-1729 · k-Means Clustering
