# Imminent Collision Mitigation with Reinforcement Learning and Vision

**Authors:** Horia Porav, Paul Newman

arXiv: 1901.00898 · 2019-01-07

## TL;DR

This paper explores reinforcement learning-based collision mitigation using vision, demonstrating improved control policies over traditional braking by predicting obstacle dynamics and optimizing for injury severity.

## Contribution

Introduces a vision-based reinforcement learning approach for collision mitigation that predicts obstacle dynamics and optimizes for injury severity, outperforming baseline braking methods.

## Key findings

- RL policies outperform baseline braking in collision scenarios
- Injury-based reward policy yields best performance
- Model predicts obstacle dynamics from camera images

## Abstract

This work examines the role of reinforcement learning in reducing the severity of on-road collisions by controlling velocity and steering in situations in which contact is imminent. We construct a model, given camera images as input, that is capable of learning and predicting the dynamics of obstacles, cars and pedestrians, and train our policy using this model. Two policies that control both braking and steering are compared against a baseline where the only action taken is (conventional) braking in a straight line. The two policies are trained using two distinct reward structures, one where any and all collisions incur a fixed penalty, and a second one where the penalty is calculated based on already established delta-v models of injury severity. The results show that both policies exceed the performance of the baseline, with the policy trained using injury models having the highest performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.00898/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1901.00898/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1901.00898/full.md

---
Source: https://tomesphere.com/paper/1901.00898