Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs
Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai

TL;DR
This paper introduces a fully convolutional network with scale-associated side outputs for efficient and accurate object skeleton extraction in natural images, capturing both local and global context.
Contribution
It proposes a novel scale-associated side output mechanism that guides multi-scale feature learning and fusion, improving skeleton extraction performance.
Findings
Achieves promising results on two skeleton datasets.
Outperforms existing methods significantly.
Effectively captures multi-scale skeleton features.
Abstract
Object skeleton is a useful cue for object detection, complementary to the object contour, as it provides a structural representation to describe the relationship among object parts. While object skeleton extraction in natural images is a very challenging problem, as it requires the extractor to be able to capture both local and global image context to determine the intrinsic scale of each skeleton pixel. Existing methods rely on per-pixel based multi-scale feature computation, which results in difficult modeling and high time consumption. In this paper, we present a fully convolutional network with multiple scale-associated side outputs to address this problem. By observing the relationship between the receptive field sizes of the sequential stages in the network and the skeleton scales they can capture, we introduce a scale-associated side output to each stage. We impose supervision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
