Monocular Instance Motion Segmentation for Autonomous Driving: KITTI   InstanceMotSeg Dataset and Multi-task Baseline

Eslam Mohamed; Mahmoud Ewaisha; Mennatullah Siam; Hazem Rashed,; Senthil Yogamani; Waleed Hamdy; Muhammad Helmi; Ahmad El-Sallab

arXiv:2008.07008·cs.CV·May 27, 2021

Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline

Eslam Mohamed, Mahmoud Ewaisha, Mennatullah Siam, Hazem Rashed,, Senthil Yogamani, Waleed Hamdy, Muhammad Helmi, Ahmad El-Sallab

PDF

TL;DR

This paper introduces a new large dataset and a multi-task baseline model for instance-level motion segmentation in autonomous driving, enabling better detection of moving objects regardless of appearance.

Contribution

The creation of the InstanceMotSeg dataset with 12.9K samples and multi-class annotations, along with an adapted multi-task model for real-time, class-agnostic, and semantic instance segmentation.

Findings

01

Achieved 39 fps with MobileNetV2 encoder.

02

Improved mAP by 10% over baseline.

03

Enhanced state-of-the-art motion segmentation by 3.3%.

Abstract

Moving object segmentation is a crucial task for autonomous vehicles as it can be used to segment objects in a class agnostic manner based on their motion cues. It enables the detection of unseen objects during training (e.g., moose or a construction truck) based on their motion and independent of their appearance. Although pixel-wise motion segmentation has been studied in autonomous driving literature, it has been rarely addressed at the instance level, which would help separate connected segments of moving objects leading to better trajectory planning. As the main issue is the lack of large public datasets, we create a new InstanceMotSeg dataset comprising of 12.9K samples improving upon our KITTIMoSeg dataset. In addition to providing instance level annotations, we have added 4 additional classes which is crucial for studying class agnostic motion segmentation. We adapt YOLACT and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · 1x1 Convolution · Batch Normalization · Average Pooling · Convolution · Inverted Residual Block · Tether Customer Service Number +1-833-534-1729