CRAFT Objects from Images

Bin Yang; Junjie Yan; Zhen Lei; Stan Z. Li

arXiv:1604.03239·cs.CV·April 13, 2016

CRAFT Objects from Images

Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

PDF

1 Repo

TL;DR

CRAFT introduces a cascade approach to object detection that improves proposal quality and classification accuracy by further dividing the tasks into sub-tasks, leading to state-of-the-art results.

Contribution

It proposes a novel cascade network architecture that enhances object proposal generation and classification, advancing the state-of-the-art in object detection.

Findings

01

Improved object proposal localization and compactness.

02

Reduced false positives in classification.

03

Achieved superior performance on PASCAL VOC and ILSVRC benchmarks.

Abstract

Object detection is a fundamental problem in image understanding. One popular solution is the R-CNN framework and its fast versions. They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories. Despite that we are handling with two relatively easier tasks, they are not solved perfectly and there's still room for improvement. In this paper, we push the "divide and conquer" solution even further by dividing each task into two sub-tasks. We call the proposed method "CRAFT" (Cascade Region-proposal-network And FasT-rcnn), which tackles each task with a carefully designed network cascade. We show that the cascade structure helps in both tasks: in proposal generation, it provides more compact and better localized object proposals; in object classification, it reduces false…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

byangderek/CRAFT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSupport Vector Machine · Max Pooling · Convolution · R-CNN