SCD: A Stacked Carton Dataset for Detection and Segmentation

Jinrong Yang; Shengkai Wu; Lijun Gou; Hangcheng Yu; Chenxi Lin,; Jiazhuo Wang; Minxuan Li; Xiaoping Li

arXiv:2102.12808·cs.CV·February 26, 2021·1 cites

SCD: A Stacked Carton Dataset for Detection and Segmentation

Jinrong Yang, Shengkai Wu, Lijun Gou, Hangcheng Yu, Chenxi Lin,, Jiazhuo Wang, Minxuan Li, Xiaoping Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a large-scale carton dataset called SCD for detection and segmentation, along with a novel detector that improves accuracy and generalizes well across datasets.

Contribution

The paper provides the first large-scale carton dataset and proposes a new detection method with modules that enhance localization and boundary attention.

Findings

01

SCD contains 250,000 instance masks from 16,136 images.

02

The proposed detector improves AP by 3.1%-4.7% on SCD.

03

OPCL module enhances generalization, improving AP on MS COCO and PASCAL VOC.

Abstract

Carton detection is an important technique in the automatic logistics system and can be applied to many applications such as the stacking and unstacking of cartons, the unloading of cartons in the containers. However, there is no public large-scale carton dataset for the research community to train and evaluate the carton detection models up to now, which hinders the development of carton detection. In this paper, we present a large-scale carton dataset named Stacked Carton Dataset(SCD) with the goal of advancing the state-of-the-art in carton detection. Images are collected from the internet and several warehourses, and objects are labeled using per-instance segmentation for precise localization. There are totally 250,000 instance masks from 16,136 images. In addition, we design a carton detector based on RetinaNet by embedding Offset Prediction between Classification and Localization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yancie-yjr/scd.github.io/blob/main/index.md
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Wood and Agarwood Research

Methods1x1 Convolution · Focal Loss · Convolution · Feature Pyramid Network · RetinaNet