Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization
Sungho Yoon, Ayoung Kim

TL;DR
This paper introduces a novel line descriptor based on transformer architecture inspired by NLP, effectively handling variable line lengths and improving visual localization accuracy in robotics and computer vision tasks.
Contribution
The paper proposes a line-transformer architecture and line signature networks that better abstract variable line lengths and geometric attributes for enhanced localization.
Findings
Improved homography estimation accuracy
Enhanced visual localization performance
Effective handling of variable line lengths
Abstract
Along with feature points for image matching, line features provide additional constraints to solve visual geometric problems in robotics and computer vision (CV). Although recent convolutional neural network (CNN)-based line descriptors are promising for viewpoint changes or dynamic environments, we claim that the CNN architecture has innate disadvantages to abstract variable line length into the fixed-dimensional descriptor. In this paper, we effectively introduce Line-Transformers dealing with variable lines. Inspired by natural language processing (NLP) tasks where sentences can be understood and abstracted well in neural nets, we view a line segment as a sentence that contains points (words). By attending to well-describable points on aline dynamically, our descriptor performs excellently on variable line length. We also propose line signature networks sharing the line's geometric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Image and Object Detection Techniques
