TL;DR
This paper introduces a multi-task deep learning approach for extracting object skeletons and their scales from natural images, improving accuracy by capturing multi-scale context and providing useful features for object detection tasks.
Contribution
It proposes a novel multi-scale, multi-task fully convolutional network with scale-associated side outputs for robust skeleton and scale extraction in natural images.
Findings
Achieves superior skeleton extraction accuracy on benchmark datasets.
Effectively predicts skeleton scales, aiding in object detection.
Outperforms existing methods significantly.
Abstract
Object skeletons are useful for object representation and object detection. They are complementary to the object contour, and provide extra information, such as how object scale (thickness) varies among object parts. But object skeleton extraction from natural images is very challenging, because it requires the extractor to be able to capture both local and non-local image context in order to determine the scale of each skeleton pixel. In this paper, we present a novel fully convolutional network with multiple scale-associated side outputs to address this problem. By observing the relationship between the receptive field sizes of the different layers in the network and the skeleton scales they can capture, we introduce two scale-associated side outputs to each stage of the network. The network is trained by multi-task learning, where one task is skeleton localization to classify whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
