MultiXNet: Multiclass Multistage Multimodal Motion Prediction

Nemanja Djuric; Henggang Cui; Zhaoen Su; Shangxuan Wu; Huahua Wang,; Fang-Chieh Chou; Luisa San Martin; Song Feng; Rui Hu; Yang Xu; Alyssa Dayan,; Sidney Zhang; Brian C. Becker; Gregory P. Meyer; Carlos Vallespi-Gonzalez,; Carl K. Wellington

arXiv:2006.02000·cs.CV·May 25, 2021

MultiXNet: Multiclass Multistage Multimodal Motion Prediction

Nemanja Djuric, Henggang Cui, Zhaoen Su, Shangxuan Wu, Huahua Wang,, Fang-Chieh Chou, Luisa San Martin, Song Feng, Rui Hu, Yang Xu, Alyssa Dayan,, Sidney Zhang, Brian C. Becker, Gregory P. Meyer, Carlos Vallespi-Gonzalez,, Carl K. Wellington

PDF

TL;DR

MultiXNet is an end-to-end multimodal motion prediction system for self-driving vehicles that detects and predicts multiple traffic actor behaviors directly from lidar data, outperforming existing methods.

Contribution

It introduces a novel multimodal, multiclass, multistage approach with trajectory refinement and uncertainty calibration for lidar-based motion prediction.

Findings

01

Outperforms state-of-the-art methods on real-world datasets

02

Handles multiple traffic actor classes simultaneously

03

Provides calibrated probabilistic motion predictions

Abstract

One of the critical pieces of the self-driving puzzle is understanding the surroundings of a self-driving vehicle (SDV) and predicting how these surroundings will change in the near future. To address this task we propose MultiXNet, an end-to-end approach for detection and motion prediction based directly on lidar sensor data. This approach builds on prior work by handling multiple classes of traffic actors, adding a jointly trained second-stage trajectory refinement step, and producing a multimodal probability distribution over future actor motion that includes both multiple discrete traffic behaviors and calibrated continuous position uncertainties. The method was evaluated on large-scale, real-world data collected by a fleet of SDVs in several cities, with the results indicating that it outperforms existing state-of-the-art approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.