RMP: A Random Mask Pretrain Framework for Motion Prediction
Yi Yang, Qingwen Zhang, Thomas Gilles, Nazre Batool, John Folkesson

TL;DR
This paper introduces RMP, a pretraining framework for motion prediction in autonomous driving that masks object positions at random times and learns to fill them in, improving accuracy especially in occlusion scenarios.
Contribution
The paper proposes a novel pretraining framework inspired by NLP and CV masked models, tailored for trajectory prediction in autonomous driving.
Findings
Improves motion prediction accuracy on Argoverse and NuScenes datasets.
Enhances robustness to noisy inputs and occlusions.
Flexible framework that can adapt to various motion-related tasks.
Abstract
As the pretraining technique is growing in popularity, little work has been done on pretrained learning-based motion prediction methods in autonomous driving. In this paper, we propose a framework to formalize the pretraining task for trajectory prediction of traffic participants. Within our framework, inspired by the random masked model in natural language processing (NLP) and computer vision (CV), objects' positions at random timesteps are masked and then filled in by the learned neural network (NN). By changing the mask profile, our framework can easily switch among a range of motion-related tasks. We show that our proposed pretraining framework is able to deal with noisy inputs and improves the motion prediction accuracy and miss rate, especially for objects occluded over time by evaluating it on Argoverse and NuScenes datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Human Pose and Action Recognition · Multimodal Machine Learning Applications
