Multitask Network for Joint Object Detection, Semantic Segmentation and Human Pose Estimation in Vehicle Occupancy Monitoring
Nikolas Ebert, Patrick Mangat, Oliver Wasenm\"uller

TL;DR
This paper introduces a multitask neural network that jointly performs object detection, semantic segmentation, and human pose estimation for vehicle occupancy monitoring, improving efficiency and accuracy in autonomous driving contexts.
Contribution
The paper presents the first multitask network combining detection, segmentation, and pose estimation specifically for vehicle occupancy monitoring, enabling flexible, end-to-end training.
Findings
Higher accuracy achieved compared to single-task models
Reduced memory and computing costs due to shared architecture
Demonstrated superior performance on public datasets
Abstract
In order to ensure safe autonomous driving, precise information about the conditions in and around the vehicle must be available. Accordingly, the monitoring of occupants and objects inside the vehicle is crucial. In the state-of-the-art, single or multiple deep neural networks are used for either object recognition, semantic segmentation, or human pose estimation. In contrast, we propose our Multitask Detection, Segmentation and Pose Estimation Network (MDSP) -- the first multitask network solving all these three tasks jointly in the area of occupancy monitoring. Due to the shared architecture, memory and computing costs can be saved while achieving higher accuracy. Furthermore, our architecture allows a flexible combination of the three mentioned tasks during a simple end-to-end training. We perform comprehensive evaluations on the public datasets SVIRO and TiCaM in order to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Autonomous Vehicle Technology and Safety
