MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction
Maitrayee Keskar, Mohan Trivedi, Ross Greer

TL;DR
This paper introduces MTR-VP, an end-to-end trajectory planning method for autonomous driving that uses vision-based context embeddings and multiple trajectory predictions, improving scene understanding and planning accuracy.
Contribution
The paper proposes a novel approach combining vision-based context encoding with multimodal trajectory prediction, replacing map features with learned visual representations in an end-to-end framework.
Findings
Predicting multiple future trajectories improves planning performance.
Transformer-based visual and kinetic feature integration has limitations.
Using learned visual scene context enhances trajectory prediction accuracy.
Abstract
We present a method for trajectory planning for autonomous driving, learning image-based context embeddings that align with motion prediction frameworks and planning-based intention input. Within our method, a ViT encoder takes raw images and past kinematic state as input and is trained to produce context embeddings, drawing inspiration from those generated by the recent MTR (Motion Transformer) encoder, effectively substituting map-based features with learned visual representations. MTR provides a strong foundation for multimodal trajectory prediction by localizing agent intent and refining motion iteratively via motion query pairs; we name our approach MTR-VP (Motion Transformer for Vision-based Planning), and instead of the learnable intention queries used in the MTR decoder, we use cross attention on the intent and the context embeddings, which reflect a combination of information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Multimodal Machine Learning Applications · Robotic Path Planning Algorithms
