Performance Evaluation of Deep Learning Networks for Semantic Segmentation of Traffic Stereo-Pair Images
Vlad Taran, Nikita Gordienko, Yuriy Kochura, Yuri Gordienko, Alexandr, Rokovyi, Oleg Alienin, Sergii Stirenko

TL;DR
This study evaluates deep learning models PSPNet and ICNet for semantic segmentation of traffic stereo images, analyzing accuracy, inference time, and statistical properties across different cities and channels, revealing their sensitivities and potential for enhanced scene understanding.
Contribution
It provides a comparative analysis of PSPNet and ICNet on traffic stereo images, highlighting their statistical distribution properties and implications for model fine-tuning and multi-channel data utilization.
Findings
Different accuracy distributions for various cities and stereo channels.
PSPNet shows more asymmetric and outlier-prone accuracy distributions than ICNet.
Stereo pairs can be used as additional data channels for scene analysis.
Abstract
Semantic image segmentation is one the most demanding task, especially for analysis of traffic conditions for self-driving cars. Here the results of application of several deep learning architectures (PSPNet and ICNet) for semantic image segmentation of traffic stereo-pair images are presented. The images from Cityscapes dataset and custom urban images were analyzed as to the segmentation accuracy and image inference time. For the models pre-trained on Cityscapes dataset, the inference time was equal in the limits of standard deviation, but the segmentation accuracy was different for various cities and stereo channels even. The distributions of accuracy (mean intersection over union - mIoU) values for each city and channel are asymmetric, long-tailed, and have many extreme outliers, especially for PSPNet network in comparison to ICNet network. Some statistical properties of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Convolution · Average Pooling · Pyramid Pooling Module · Auxiliary Classifier · Dilated Convolution · PSPNet
