CV 3315 Is All You Need : Semantic Segmentation Competition
Akide Liu, Zihan Wang

TL;DR
This paper reviews transformer-based methods, especially SegFormer, for urban semantic segmentation, achieving a balance of high accuracy and efficiency in a challenging unbalanced dataset competition.
Contribution
It evaluates transformer-driven segmentation models, particularly SegFormer variants, and identifies SegFormer-B2 as the optimal model balancing performance and computational cost.
Findings
SegFormer-B0 achieved 74.6% mIoU with minimal FLOPS.
SegFormer-B5 achieved 80.2% mIoU.
SegFormer-B2 was selected as the final model with 78.5% mIoU and 50.6 GFLOPS.
Abstract
This competition focus on Urban-Sense Segmentation based on the vehicle camera view. Class highly unbalanced Urban-Sense images dataset challenge the existing solutions and further studies. Deep Conventional neural network-based semantic segmentation methods such as encoder-decoder architecture and multi-scale and pyramid-based approaches become flexible solutions applicable to real-world applications. In this competition, we mainly review the literature and conduct experiments on transformer-driven methods especially SegFormer, to achieve an optimal trade-off between performance and efficiency. For example, SegFormer-B0 achieved 74.6% mIoU with the smallest FLOPS, 15.6G, and the largest model, SegFormer- B5 archived 80.2% mIoU. According to multiple factors, including individual case failure analysis, individual class performance, training pressure and efficiency estimation, the final…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Industrial Vision Systems and Defect Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Convolution · Residual Connection · Mix-FFN · Linear Layer · SegFormer
