Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers
Hisashi Shimodaira

TL;DR
This paper introduces a convolutional autoencoder-based pre-processing layer to enhance the prediction accuracy of semantic segmentation methods like FCN, demonstrating significant improvements in generalization and accuracy with minimal parameter increase.
Contribution
The paper presents a novel, simple pre-processing approach using autoencoder layers that significantly improves semantic segmentation accuracy and generalization, applicable to various methods.
Findings
Mean IoU increased by 18.7% with the proposed method
Significant improvement in generalization ability observed
Method is simple, with minimal parameter increase
Abstract
In this paper, we propose a method to improve prediction accuracy of semantic segmentation methods as follows: (1) construct a neural network that has pre-processing layers based on a convolutional autoencoder ahead of a semantic segmentation network, and (2) train the entire network initialized by the weights of the pre-trained autoencoder. We applied this method to the fully convolutional network (FCN) and experimentally compared its prediction accuracy on the cityscapes dataset. The Mean IoU of the proposed target model with the He normal initialization is 18.7% higher than that of FCN with the He normal initialization. In addition, those of the modified models of the target model are significantly higher than that of FCN with the He normal initialization. The accuracy and loss curves during the training showed that these are resulting from the improvement of the generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection
MethodsMax Pooling · Convolution · Fully Convolutional Network
