Neural Geometric Parser for Single Image Camera Calibration
Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim

TL;DR
This paper introduces a neural geometric parser that combines semantic and geometric cues to improve single image camera calibration accuracy in man-made scenes, outperforming existing methods.
Contribution
The proposed framework combines geometric line cues with neural networks for camera calibration, using prior knowledge of man-made scenes and weak supervision to improve accuracy.
Findings
Significantly higher calibration accuracy than state-of-the-art methods.
Effective in both indoor and outdoor man-made scenes.
Utilizes geometric cues and scene priors for weakly supervised learning.
Abstract
We propose a neural geometric parser learning single image camera calibration for man-made scenes. Unlike previous neural approaches that rely only on semantic cues obtained from neural networks, our approach considers both semantic and geometric cues, resulting in significant accuracy improvement. The proposed framework consists of two networks. Using line segments of an image as geometric cues, the first network estimates the zenith vanishing point and generates several candidates consisting of the camera rotation and focal length. The second network evaluates each candidate based on the given image and the geometric cues, where prior knowledge of man-made scenes is used for the evaluation. With the supervision of datasets consisting of the horizontal line and focal length of the images, our networks can be trained to estimate the same camera parameters. Based on the Manhattan world…
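The relationship between the zenith vanishing point, the focal length, and the horizon line mentioned in the abstract can be illustrated with standard pinhole geometry. The following is a minimal sketch, not the authors' implementation. It assumes a principal point at the image origin, square pixels, and no skew. The function names (`up_direction`, `horizon_from_zenith`, `make_candidates`) are hypothetical, and pairing one zenith estimate with a list of focal length guesses is an illustrative assumption rather than the paper's candidate generation scheme.

```python
import math

def up_direction(zenith_xy, focal_length):
    """Unit 3D direction of the zenith in camera coordinates.

    Assumes K = diag(f, f, 1), so the back-projected ray is (zx, zy, f).
    The sign (zenith vs. nadir) is ambiguous from a vanishing point alone.
    """
    zx, zy = zenith_xy
    n = math.sqrt(zx * zx + zy * zy + focal_length * focal_length)
    return (zx / n, zy / n, focal_length / n)

def horizon_from_zenith(zenith_xy, focal_length):
    """Horizon line (a, b, c) with a*x + b*y + c = 0 and a^2 + b^2 = 1.

    Horizon pixels back-project to rays orthogonal to the zenith
    direction, which gives zx*x + zy*y + f^2 = 0.
    """
    zx, zy = zenith_xy
    norm = math.hypot(zx, zy)
    if norm == 0.0:
        raise ValueError("zenith at the principal point: horizon is at infinity")
    return (zx / norm, zy / norm, focal_length ** 2 / norm)

def make_candidates(zenith_xy, focal_lengths):
    """Pair one zenith estimate with several focal length hypotheses.

    Illustrative only: each candidate holds a focal length, the implied
    up direction, and the implied horizon line.
    """
    return [
        {
            "focal_length": f,
            "up": up_direction(zenith_xy, f),
            "horizon": horizon_from_zenith(zenith_xy, f),
        }
        for f in focal_lengths
    ]

if __name__ == "__main__":
    for cand in make_candidates((0.0, -1000.0), [400.0, 500.0]):
        print(cand)
```

With a zenith at (0, -1000) pixels and a focal length of 500 pixels, the sketch places the horizon on the line y = 250. A scoring step such as the second network described above would then rank candidates of this kind against the image and its line segments.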
Taxonomy
Topics: Advanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
