TopoBDA: Towards Bezier Deformable Attention for Road Topology Understanding
Muhammet Esat Kalfaoglu, Halil Ibrahim Ozturk, Ozsel Kilinc, Alptekin Temizel

TL;DR
TopoBDA introduces a Bezier deformable attention mechanism within a transformer framework to improve road topology understanding from multi-camera imagery, achieving state-of-the-art results in lane detection and topology reasoning.
Contribution
The paper proposes a novel Bezier Deformable Attention module integrated into a transformer for enhanced road topology comprehension from multi-camera data.
Findings
Outperforms existing methods on OpenLane-V2 for centerline detection.
Achieves state-of-the-art results on OpenLane-V1 in 3D lane detection.
Multimodal data integration further improves topology understanding.
Abstract
Understanding road topology is crucial for autonomous driving. This paper introduces TopoBDA (Topology with Bezier Deformable Attention), a novel approach that enhances road topology comprehension by leveraging Bezier Deformable Attention (BDA). TopoBDA processes multi-camera 360-degree imagery to generate Bird's Eye View (BEV) features, which are refined through a transformer decoder employing BDA. BDA utilizes Bezier control points to drive the deformable attention mechanism, improving the detection and representation of elongated and thin polyline structures, such as lane centerlines. Additionally, TopoBDA integrates two auxiliary components: an instance mask formulation loss and a one-to-many set prediction loss strategy, to further refine centerline detection and enhance road topology understanding. Experimental evaluations on the OpenLane-V2 dataset demonstrate that TopoBDA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Human Pose and Action Recognition · Hand Gesture Recognition Systems
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training
