Predicting Physical World Destinations for Commands Given to Self-Driving Cars
Dusan Grujicic, Thierry Deruyttere, Marie-Francine Moens, Matthew, Blaschko

TL;DR
This paper advances understanding of natural language commands for self-driving cars by focusing on predicting the physical destination in 3D space, enhancing human-vehicle interaction and safety.
Contribution
It introduces a novel annotation scheme for 3D destinations and proposes a model that outperforms existing baselines in predicting these destinations.
Findings
Proposed a new 3D destination annotation method
Developed a model surpassing prior baselines
Improved accuracy in destination prediction
Abstract
In recent years, we have seen significant steps taken in the development of self-driving cars. Multiple companies are starting to roll out impressive systems that work in a variety of settings. These systems can sometimes give the impression that full self-driving is just around the corner and that we would soon build cars without even a steering wheel. The increase in the level of autonomy and control given to an AI provides an opportunity for new modes of human-vehicle interaction. However, surveys have shown that giving more control to an AI in self-driving cars is accompanied by a degree of uneasiness by passengers. In an attempt to alleviate this issue, recent works have taken a natural language-oriented approach by allowing the passenger to give commands that refer to specific objects in the visual scene. Nevertheless, this is only half the task as the car should also understand…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Human Mobility and Location-Based Analysis
