Multitask Network for Joint Object Detection, Semantic Segmentation and   Human Pose Estimation in Vehicle Occupancy Monitoring

Nikolas Ebert; Patrick Mangat; Oliver Wasenm\"uller

arXiv:2205.01515·cs.CV·May 4, 2022·1 cites

Multitask Network for Joint Object Detection, Semantic Segmentation and Human Pose Estimation in Vehicle Occupancy Monitoring

Nikolas Ebert, Patrick Mangat, Oliver Wasenm\"uller

PDF

Open Access

TL;DR

This paper introduces a multitask neural network that jointly performs object detection, semantic segmentation, and human pose estimation for vehicle occupancy monitoring, improving efficiency and accuracy in autonomous driving contexts.

Contribution

The paper presents the first multitask network combining detection, segmentation, and pose estimation specifically for vehicle occupancy monitoring, enabling flexible, end-to-end training.

Findings

01

Higher accuracy achieved compared to single-task models

02

Reduced memory and computing costs due to shared architecture

03

Demonstrated superior performance on public datasets

Abstract

In order to ensure safe autonomous driving, precise information about the conditions in and around the vehicle must be available. Accordingly, the monitoring of occupants and objects inside the vehicle is crucial. In the state-of-the-art, single or multiple deep neural networks are used for either object recognition, semantic segmentation, or human pose estimation. In contrast, we propose our Multitask Detection, Segmentation and Pose Estimation Network (MDSP) -- the first multitask network solving all these three tasks jointly in the area of occupancy monitoring. Due to the shared architecture, memory and computing costs can be saved while achieving higher accuracy. Furthermore, our architecture allows a flexible combination of the three mentioned tasks during a simple end-to-end training. We perform comprehensive evaluations on the public datasets SVIRO and TiCaM in order to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Autonomous Vehicle Technology and Safety