Conditional Affordance Learning for Driving in Urban Environments
Axel Sauer, Nikolay Savinov, Andreas Geiger

TL;DR
This paper introduces a direct perception method for autonomous urban driving that effectively navigates complex environments, handles traffic signals, and improves safety and navigation success rates over existing approaches.
Contribution
It presents a novel direct perception approach that maps video to intermediate representations for urban driving, handling traffic lights and signs with image-level labels, outperforming prior methods.
Findings
Achieves up to 68% improvement in goal-directed navigation in CARLA
First to handle traffic lights and speed signs with image-level labels
Reduces traffic accidents in simulation
Abstract
Most existing approaches to autonomous driving fall into one of two categories: modular pipelines, that build an extensive model of the environment, and imitation learning approaches, that map images directly to control outputs. A recently proposed third paradigm, direct perception, aims to combine the advantages of both by using a neural network to learn appropriate low-dimensional intermediate representations. However, existing direct perception approaches are restricted to simple highway situations, lacking the ability to navigate intersections, stop at traffic lights or respect speed limits. In this work, we propose a direct perception approach which maps video input to intermediate representations suitable for autonomous navigation in complex urban environments given high-level directional inputs. Compared to state-of-the-art reinforcement and conditional imitation learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Human Pose and Action Recognition
MethodsEntropy Regularization · Proximal Policy Optimization · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · CARLA: An Open Urban Driving Simulator
