Cognitive TransFuser: Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint Prediction
Hwan-Soo Choi, Jongoh Jeong, Young Hoo Cho, Kuk-Jin Yoon, and, Jong-Hwan Kim

TL;DR
Cognitive TransFuser is a multi-task sensor fusion network that leverages semantics-guided features from RGB and LIDAR data to improve waypoint prediction for autonomous driving, achieving real-time performance in simulation.
Contribution
The paper introduces a novel semantics-guided transformer-based sensor fusion approach that incorporates auxiliary tasks for enhanced waypoint prediction in autonomous driving.
Findings
Significant performance improvement over baseline in waypoint prediction.
Achieves up to 44.2 FPS real-time inference in CARLA simulator.
Effective multi-task fusion of semantic and auxiliary features enhances safety.
Abstract
Sensor fusion approaches for intelligent self-driving agents remain key to driving scene understanding given visual global contexts acquired from input sensors. Specifically, for the local waypoint prediction task, single-modality networks are still limited by strong dependency on the sensitivity of the input sensor, and thus recent works therefore promote the use of multiple sensors in fusion in feature level in practice. While it is well known that multiple data modalities encourage mutual contextual exchange, it requires global 3D scene understanding in real-time with minimal computation upon deployment to practical driving scenarios, thereby placing greater significance on the training strategy given a limited number of practically usable sensors. In this light, we exploit carefully selected auxiliary tasks that are highly correlated with the target task of interest (e.g., traffic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator · Balanced Selection
