TL;DR
This paper presents a method for predicting the future locations and emergence of objects in egocentric views using a reachability prior and multi-hypotheses learning, without relying on map structures, improving multimodal prediction accuracy.
Contribution
It introduces a novel reachability prior estimation from semantic maps and propagates it with egomotion, enhancing future object localization and emergence prediction in egocentric scenes.
Findings
Reachability prior improves multimodal future localization.
Method predicts emergence of new objects.
Zero-shot transfer to unseen datasets is effective.
Abstract
In this paper, we investigate the problem of anticipating future dynamics, particularly the future location of other vehicles and pedestrians, in the view of a moving vehicle. We approach two fundamental challenges: (1) the partial visibility due to the egocentric view with a single RGB camera and considerable field-of-view change due to the egomotion of the vehicle; (2) the multimodality of the distribution of future states. In contrast to many previous works, we do not assume structural knowledge from maps. We rather estimate a reachability prior for certain classes of objects from the semantic map of the present image and propagate it into the future using the planned egomotion. Experiments show that the reachability prior combined with multi-hypotheses learning improves multimodal prediction of the future location of tracked objects and, for the first time, the emergence of new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
