ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Sandro Papais, Letian Wang, Brian Cheong, Steven L. Waslander

TL;DR
ForeSight is a novel multi-task framework for autonomous vehicle perception that jointly detects objects and forecasts their trajectories using a streaming, bidirectional learning approach, improving accuracy and temporal consistency without explicit object tracking.
Contribution
It introduces a joint detection and forecasting model with streaming, bidirectional learning, and forecast-aware transformers, eliminating the need for explicit object tracking and enhancing multi-view perception.
Findings
Achieves state-of-the-art EPA of 54.9% on nuScenes
Surpasses previous methods by 9.3% in EPA
Attains best mAP and minADE among multi-view models
Abstract
We introduce ForeSight, a novel joint detection and forecasting framework for vision-based 3D perception in autonomous vehicles. Traditional approaches treat detection and forecasting as separate sequential tasks, limiting their ability to leverage temporal cues. ForeSight addresses this limitation with a multi-task streaming and bidirectional learning approach, allowing detection and forecasting to share query memory and propagate information seamlessly. The forecast-aware detection transformer enhances spatial reasoning by integrating trajectory predictions from a multiple hypothesis forecast memory queue, while the streaming forecast transformer improves temporal consistency using past forecasts and refined detections. Unlike tracking-based methods, ForeSight eliminates the need for explicit object association, reducing error propagation with a tracking-free model that efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
