Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline
Eslam Mohamed, Mahmoud Ewaisha, Mennatullah Siam, Hazem Rashed,, Senthil Yogamani, Waleed Hamdy, Muhammad Helmi, Ahmad El-Sallab

TL;DR
This paper introduces a new large dataset and a multi-task baseline model for instance-level motion segmentation in autonomous driving, enabling better detection of moving objects regardless of appearance.
Contribution
The creation of the InstanceMotSeg dataset with 12.9K samples and multi-class annotations, along with an adapted multi-task model for real-time, class-agnostic, and semantic instance segmentation.
Findings
Achieved 39 fps with MobileNetV2 encoder.
Improved mAP by 10% over baseline.
Enhanced state-of-the-art motion segmentation by 3.3%.
Abstract
Moving object segmentation is a crucial task for autonomous vehicles as it can be used to segment objects in a class agnostic manner based on their motion cues. It enables the detection of unseen objects during training (e.g., moose or a construction truck) based on their motion and independent of their appearance. Although pixel-wise motion segmentation has been studied in autonomous driving literature, it has been rarely addressed at the instance level, which would help separate connected segments of moving objects leading to better trajectory planning. As the main issue is the lack of large public datasets, we create a new InstanceMotSeg dataset comprising of 12.9K samples improving upon our KITTIMoSeg dataset. In addition to providing instance level annotations, we have added 4 additional classes which is crucial for studying class agnostic motion segmentation. We adapt YOLACT and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · 1x1 Convolution · Batch Normalization · Average Pooling · Convolution · Inverted Residual Block · Tether Customer Service Number +1-833-534-1729
