Learning to Parse Wireframes in Images of Man-Made Environments
Kun Huang, Yifan Wang, Zihan Zhou, Tianjiao Ding, Shenghua Gao, Yi Ma

TL;DR
This paper introduces a learning-based method for automatically extracting wireframes from images of man-made environments, using a large labeled dataset and CNNs to improve line and junction detection for various visual tasks.
Contribution
The authors created a large new dataset of over 5,000 images with human-labeled wireframes and developed two CNNs that outperform existing methods in detecting lines and junctions.
Findings
Significantly improved detection accuracy over state-of-the-art methods.
Large-scale dataset enables robust learning for wireframe extraction.
Effective wireframe parsing benefits tasks like 3D reconstruction and navigation.
Abstract
In this paper, we propose a learning-based approach to the task of automatically extracting a "wireframe" representation for images of cluttered man-made environments. The wireframe (see Fig. 1) contains all salient straight lines and their junctions of the scene that encode efficiently and accurately large-scale geometry and object shapes. To this end, we have built a very large new dataset of over 5,000 images with wireframes thoroughly labelled by humans. We have proposed two convolutional neural networks that are suitable for extracting junctions and lines with large spatial support, respectively. The networks trained on our dataset have achieved significantly better performance than state-of-the-art methods for junction detection and line segment detection, respectively. We have conducted extensive experiments to evaluate quantitatively and qualitatively the wireframes obtained by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Image and Object Detection Techniques · Advanced Neural Network Applications
