TL;DR
LINEA introduces a fast, scalable transformer-based line detection method that eliminates the need for large dataset pretraining, achieving superior accuracy and speed for real-time video analysis.
Contribution
The paper proposes Deformable Line Attention (DLA), enabling fast, accurate line detection without large dataset pretraining, improving over existing transformer-based methods.
Findings
LINEA is significantly faster than previous models.
LINEA outperforms existing models in out-of-distribution tests.
LINEA achieves higher sAP scores in experiments.
Abstract
Line detection is a basic digital image processing operation used by higher-level processing methods. Recently, transformer-based methods for line detection have proven to be more accurate than methods based on CNNs, at the expense of significantly lower inference speeds. As a result, video analysis methods that require low latencies cannot benefit from current transformer-based methods for line detection. In addition, current transformer-based models require pretraining attention mechanisms on large datasets (e.g., COCO or Object360). This paper develops a new transformer-based method that is significantly faster without requiring pretraining the attention mechanism on large datasets. We eliminate the need to pre-train the attention mechanism using a new mechanism, Deformable Line Attention (DLA). We use the term LINEA to refer to our new transformer-based method based on DLA.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Deep Layer Aggregation
