MultiXNet: Multiclass Multistage Multimodal Motion Prediction
Nemanja Djuric, Henggang Cui, Zhaoen Su, Shangxuan Wu, Huahua Wang,, Fang-Chieh Chou, Luisa San Martin, Song Feng, Rui Hu, Yang Xu, Alyssa Dayan,, Sidney Zhang, Brian C. Becker, Gregory P. Meyer, Carlos Vallespi-Gonzalez,, Carl K. Wellington

TL;DR
MultiXNet is an end-to-end multimodal motion prediction system for self-driving vehicles that detects and predicts multiple traffic actor behaviors directly from lidar data, outperforming existing methods.
Contribution
It introduces a novel multimodal, multiclass, multistage approach with trajectory refinement and uncertainty calibration for lidar-based motion prediction.
Findings
Outperforms state-of-the-art methods on real-world datasets
Handles multiple traffic actor classes simultaneously
Provides calibrated probabilistic motion predictions
Abstract
One of the critical pieces of the self-driving puzzle is understanding the surroundings of a self-driving vehicle (SDV) and predicting how these surroundings will change in the near future. To address this task we propose MultiXNet, an end-to-end approach for detection and motion prediction based directly on lidar sensor data. This approach builds on prior work by handling multiple classes of traffic actors, adding a jointly trained second-stage trajectory refinement step, and producing a multimodal probability distribution over future actor motion that includes both multiple discrete traffic behaviors and calibrated continuous position uncertainties. The method was evaluated on large-scale, real-world data collected by a fleet of SDVs in several cities, with the results indicating that it outperforms existing state-of-the-art approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
