TL;DR
This paper enhances video-based semantic segmentation for adverse weather by optimizing recurrent units, achieving real-time performance with maintained accuracy and improved robustness over single-image methods.
Contribution
It introduces modifications to recurrent units in a video segmentation network to enable real-time processing without sacrificing accuracy in bad weather conditions.
Findings
Inference time reduced by 23% with modifications.
Achieves similar accuracy to LSTM-ICNet, faster than original.
Outperforms single-image segmentation in adverse weather.
Abstract
Computer vision tasks such as semantic segmentation perform very well in good weather conditions, but if the weather turns bad, they have problems to achieve this performance in these conditions. One possibility to obtain more robust and reliable results in adverse weather conditions is to use video-segmentation approaches instead of commonly used single-image segmentation methods. Video-segmentation approaches capture temporal information of the previous video-frames in addition to current image information, and hence, they are more robust against disturbances, especially if they occur in only a few frames of the video-sequence. However, video-segmentation approaches, which are often based on recurrent neural networks, cannot be applied in real-time applications anymore, since their recurrent structures in the network are computational expensive. For instance, the inference time of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
