TL;DR
This paper introduces a simple yet effective encoder-decoder based edge detector that achieves state-of-the-art performance with reduced complexity and computational cost, emphasizing high-quality feature extraction.
Contribution
The paper proposes a vanilla encoder-decoder architecture with a bilateral encoder and cascaded feature fusion decoder for improved edge detection performance.
Findings
Achieves SOTA ODS of 0.838 on BSDS500
Outperforms complex models with less computational cost
Highlights importance of high-quality features in edge detection
Abstract
The performance of deep learning based edge detector has far exceeded that of humans, but the huge computational cost and complex training strategy hinder its further development and application. In this paper, we eliminate these complexities with a vanilla encoder-decoder based detector. Firstly, we design a bilateral encoder to decouple the extraction process of location features and semantic features. Since the location branch no longer provides cues for the semantic branch, the richness of features can be further compressed, which is the key to make our model more compact. We propose a cascaded feature fusion decoder, where the location features are progressively refined by semantic features. The refined location features are the only basis for generating the edge map. The coarse original location features and semantic features are avoided from direct contact with the final result.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
