A Network Structure to Explicitly Reduce Confusion Errors in Semantic Segmentation
Qichuan Geng, Xinyu Huang, Zhong Zhou, Ruigang Yang

TL;DR
This paper introduces a novel network structure for semantic segmentation that explicitly reduces confusion errors by ensembling heterogeneous subnets and employing an improved loss function, leading to significant performance improvements.
Contribution
The proposed network explicitly targets confusion errors in semantic segmentation through subnet ensembling and a specialized loss, a novel approach not previously explored.
Findings
Achieves 3.05% improvement over ResNet-101 on Cityscapes.
Achieves 1.30% improvement over ResNet-38 on PASCAL VOC.
Consistent performance gains across multiple baseline models.
Abstract
Confusing classes that are ubiquitous in real world often degrade performance for many vision related applications like object detection, classification, and segmentation. The confusion errors are not only caused by similar visual patterns but also amplified by various factors during the training of our designed models, such as reduced feature resolution in the encoding process or imbalanced data distributions. A large amount of deep learning based network structures has been proposed in recent years to deal with these individual factors and improve network performance. However, to our knowledge, no existing work in semantic image segmentation is designed to tackle confusion errors explicitly. In this paper, we present a novel and general network structure that reduces confusion errors in more direct manner and apply the network for semantic segmentation. There are two major…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
