# Uncertainty weighted multi task learning for robust traffic scene semantic understanding

**Authors:** Zhiping Wan, Shitong Ye, Feng Wang, Shaojiang Liu, Ling Peng

PMC · DOI: 10.1038/s41598-025-24838-3 · Scientific Reports · 2025-11-20

## TL;DR

This paper introduces a new multi-task learning framework that improves traffic scene understanding in challenging conditions like bad weather and occlusion.

## Contribution

The novel framework uses uncertainty-weighted learning and a hybrid model to enhance robustness in traffic perception tasks.

## Key findings

- UW-MTL outperforms existing methods on 3D object detection and BEV semantic segmentation.
- The model shows significant improvements in low-visibility and heavily occluded scenarios.
- Performance gains are especially notable at long ranges and in adverse conditions.

## Abstract

This paper addresses perception degradation caused by adverse weather, occlusion, and asynchronous sampling by proposing an uncertainty-weighted multi-task learning framework for robust semantic understanding of traffic scenes (UW-MTL). The method performs differentiable multi-source spatiotemporal alignment to unify camera, LiDAR, radar, and IMU into a BEV sequence, and adopts a hybrid backbone that combines a Mixture of Experts Transformer with a spatiotemporal graph neural network to balance global semantics and local topology. Each task employs evidential prediction heads that explicitly output confidence and uncertainty. During training, soft-temperature weighting and a sigma aware gradient conflict resolver enable stable joint optimization. On the nuScenes benchmark, UW-MTL consistently surpasses BEVFusion and UniAD on 3D object detection, BEV semantic segmentation, and short-horizon trajectory prediction, with especially pronounced gains at long range, under heavy occlusion, and in low-visibility conditions.

## Full-text entities

- **Chemicals:** GNN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12635334/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12635334/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12635334/full.md

---
Source: https://tomesphere.com/paper/PMC12635334