Exploring the Potential of Multi-Modal AI for Driving Hazard Prediction

Korawat Charoenpitaks; Van-Quang Nguyen; Masanori Suganuma; Masahiro; Takahashi; Ryoma Niihara; Takayuki Okatani

arXiv:2310.04671·cs.CV·July 2, 2024·1 cites

Exploring the Potential of Multi-Modal AI for Driving Hazard Prediction

Korawat Charoenpitaks, Van-Quang Nguyen, Masanori Suganuma, Masahiro, Takahashi, Ryoma Niihara, Takayuki Okatani

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset and problem formulation for predicting driving hazards from static dashcam images, emphasizing high-level reasoning about future accidents using multi-modal AI.

Contribution

It presents the DHPR dataset and a novel approach to hazard prediction based on visual abductive reasoning from single images.

Findings

01

Baseline methods show promising results but highlight challenges in hazard prediction.

02

The dataset enables research on reasoning about future events from static images.

03

Future work can improve accuracy and incorporate additional modalities.

Abstract

This paper addresses the problem of predicting hazards that drivers may encounter while driving a car. We formulate it as a task of anticipating impending accidents using a single input image captured by car dashcams. Unlike existing approaches to driving hazard prediction that rely on computational simulations or anomaly detection from videos, this study focuses on high-level inference from static images. The problem needs predicting and reasoning about future events based on uncertain observations, which falls under visual abductive reasoning. To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created. The dataset consists of 15K dashcam images of street scenes, and each image is associated with a tuple containing car speed, a hypothesized hazard description, and visual entities present in the scene. These…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dhpr-dataset/dhpr-dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Autonomous Vehicle Technology and Safety · Multimodal Machine Learning Applications