VM-MODNet: Vehicle Motion aware Moving Object Detection for Autonomous Driving
Hazem Rashed, Ahmad El Sallab, Senthil Yogamani

TL;DR
This paper introduces VM-MODNet, a vehicle motion-aware CNN model for moving object detection in autonomous driving, which leverages ego-motion information to improve accuracy and runs efficiently in real-time.
Contribution
It proposes a novel Vehicle Motion Tensor (VMT) input that enables ego-motion compensation, leading to improved detection performance without additional sensors.
Findings
Achieves 5.6% higher mIoU over baseline
State-of-the-art results on KITTI_MoSeg_Extended dataset
Runs at 85 fps on a TitanX GPU
Abstract
Moving object Detection (MOD) is a critical task in autonomous driving as moving agents around the ego-vehicle need to be accurately detected for safe trajectory planning. It also enables appearance agnostic detection of objects based on motion cues. There are geometric challenges like motion-parallax ambiguity which makes it a difficult problem. In this work, we aim to leverage the vehicle motion information and feed it into the model to have an adaptation mechanism based on ego-motion. The motivation is to enable the model to implicitly perform ego-motion compensation to improve performance. We convert the six degrees of freedom vehicle motion into a pixel-wise tensor which can be fed as input to the CNN model. The proposed model using Vehicle Motion Tensor (VMT) achieves an absolute improvement of 5.6% in mIoU over the baseline architecture. We also achieve state-of-the-art results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
