TopoBDA: Towards Bezier Deformable Attention for Road Topology Understanding

Muhammet Esat Kalfaoglu; Halil Ibrahim Ozturk; Ozsel Kilinc; Alptekin Temizel

arXiv:2412.18951 · cs.CV · January 9, 2026

TL;DR

TopoBDA introduces a Bezier deformable attention mechanism within a transformer framework to improve road topology understanding from multi-camera imagery, achieving state-of-the-art results in lane detection and topology reasoning.

Contribution

The paper proposes a novel Bezier Deformable Attention module integrated into a transformer for enhanced road topology comprehension from multi-camera data.

Findings

01. Outperforms existing methods on OpenLane-V2 for centerline detection.

02. Achieves state-of-the-art results on OpenLane-V1 in 3D lane detection.

03. Multimodal data integration further improves topology understanding.

Abstract

Understanding road topology is crucial for autonomous driving. This paper introduces TopoBDA (Topology with Bezier Deformable Attention), a novel approach that enhances road topology comprehension by leveraging Bezier Deformable Attention (BDA). TopoBDA processes multi-camera 360-degree imagery to generate Bird's Eye View (BEV) features, which are refined through a transformer decoder employing BDA. BDA utilizes Bezier control points to drive the deformable attention mechanism, improving the detection and representation of elongated and thin polyline structures, such as lane centerlines. Additionally, TopoBDA integrates two auxiliary components: an instance mask formulation loss and a one-to-many set prediction loss strategy, to further refine centerline detection and enhance road topology understanding. Experimental evaluations on the OpenLane-V2 dataset demonstrate that TopoBDA…
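
As a rough illustration of how Bezier control points could supply sampling locations for a deformable attention layer, the sketch below evaluates a Bezier curve at evenly spaced parameters using the Bernstein basis. This is only one plausible reading: the abstract does not say whether TopoBDA attends around the control points themselves or around points sampled along the curve, and the function name `bezier_points` and the example coordinates are hypothetical.

```python
from math import comb


def bezier_points(control_points, num_samples):
    """Evaluate a Bezier curve at evenly spaced parameters t in [0, 1].

    control_points: list of equal-length coordinate tuples (e.g. BEV x, y).
    Returns a list of num_samples points lying on the curve.
    """
    n = len(control_points) - 1  # curve degree
    dim = len(control_points[0])
    points = []
    for k in range(num_samples):
        t = k / (num_samples - 1) if num_samples > 1 else 0.0
        # Bernstein basis weights: non-negative and summing to 1.
        weights = [comb(n, i) * t**i * (1 - t) ** (n - i) for i in range(n + 1)]
        point = tuple(
            sum(w * cp[d] for w, cp in zip(weights, control_points))
            for d in range(dim)
        )
        points.append(point)
    return points


# Hypothetical cubic centerline in BEV coordinates (values are illustrative only).
ctrl = [(0.0, 0.0), (10.0, 0.0), (20.0, 5.0), (30.0, 5.0)]
refs = bezier_points(ctrl, 5)
```

In a deformable attention layer, points like `refs` would act as reference locations around which learned offsets select BEV features. Because the curve follows the lane shape, the sampled locations stay close to a thin, elongated structure instead of clustering around a single center point.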

Taxonomy

Topics: Infrastructure Maintenance and Monitoring · Human Pose and Action Recognition · Hand Gesture Recognition Systems

Methods: Softmax · Attention Is All You Need · Sparse Evolutionary Training