RMP: A Random Mask Pretrain Framework for Motion Prediction

Yi Yang; Qingwen Zhang; Thomas Gilles; Nazre Batool; John Folkesson

arXiv:2309.08989·cs.RO·September 19, 2023

RMP: A Random Mask Pretrain Framework for Motion Prediction

Yi Yang, Qingwen Zhang, Thomas Gilles, Nazre Batool, John Folkesson

PDF

Open Access 1 Repo

TL;DR

This paper introduces RMP, a pretraining framework for motion prediction in autonomous driving that masks object positions at random times and learns to fill them in, improving accuracy especially in occlusion scenarios.

Contribution

The paper proposes a novel pretraining framework inspired by NLP and CV masked models, tailored for trajectory prediction in autonomous driving.

Findings

01

Improves motion prediction accuracy on Argoverse and NuScenes datasets.

02

Enhances robustness to noisy inputs and occlusions.

03

Flexible framework that can adapt to various motion-related tasks.

Abstract

As the pretraining technique is growing in popularity, little work has been done on pretrained learning-based motion prediction methods in autonomous driving. In this paper, we propose a framework to formalize the pretraining task for trajectory prediction of traffic participants. Within our framework, inspired by the random masked model in natural language processing (NLP) and computer vision (CV), objects' positions at random timesteps are masked and then filled in by the learned neural network (NN). By changing the mask profile, our framework can easily switch among a range of motion-related tasks. We show that our proposed pretraining framework is able to deal with noisy inputs and improves the motion prediction accuracy and miss rate, especially for objects occluded over time by evaluating it on Argoverse and NuScenes datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kth-rpl/rmp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Human Pose and Action Recognition · Multimodal Machine Learning Applications