Semantic Aware Feature Extraction for Enhanced 3D Reconstruction
Ronald Nap, Andy Xiao

TL;DR
This paper introduces a semantic-aware feature extraction framework that combines multi-task learning and deep matching to improve 3D reconstruction quality by incorporating semantic and elevation information from monocular fisheye camera data.
Contribution
It presents a novel joint training approach for keypoint detection, description, and semantic segmentation to enhance feature matching and 3D reconstruction accuracy.
Findings
Improved structural detail in 3D point clouds.
Enhanced elevation estimation and multi-level mapping.
Better feature correspondence through semantic cues.
Abstract
Feature matching is a fundamental problem in computer vision with wide-ranging applications, including simultaneous localization and mapping (SLAM), image stitching, and 3D reconstruction. While recent advances in deep learning have improved keypoint detection and description, most approaches focus primarily on geometric attributes and often neglect higher-level semantic information. This work proposes a semantic-aware feature extraction framework that employs multi-task learning to jointly train keypoint detection, keypoint description, and semantic segmentation. The method is benchmarked against standard feature matching techniques and evaluated in the context of 3D reconstruction. To enhance feature correspondence, a deep matching module is integrated. The system is tested using input from a single monocular fisheye camera mounted on a vehicle and evaluated within a multi-floor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
