Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation

Arindam Das; Sudip Das; Ganesh Sistu; Jonathan Horgan; Ujjwal; Bhattacharya; Edward Jones; Martin Glavin; and Ciar\'an Eising

arXiv:2206.07510·cs.CV·August 9, 2022

Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation

Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal, Bhattacharya, Edward Jones, Martin Glavin, and Ciar\'an Eising

PDF

TL;DR

This paper introduces a multi-task deep learning framework that enhances pedestrian pose estimation, especially for occluded pedestrians, by leveraging domain adaptation across different datasets and improving multiple related tasks.

Contribution

It proposes a novel multi-task framework with unsupervised domain adaptation to improve occluded pedestrian pose estimation across diverse datasets.

Findings

01

Achieved state-of-the-art performance in pose estimation, detection, and segmentation.

02

Effectively handles occlusions in pedestrian pose estimation.

03

Demonstrated improved generalization across datasets.

Abstract

Most of the existing works on pedestrian pose estimation do not consider estimating the pose of an occluded pedestrian, as the annotations of the occluded parts are not available in relevant automotive datasets. For example, CityPersons, a well-known dataset for pedestrian detection in automotive scenes does not provide pose annotations, whereas MS-COCO, a non-automotive dataset, contains human pose estimation. In this work, we propose a multi-task framework to extract pedestrian features through detection and instance segmentation tasks performed separately on these two distributions. Thereafter, an encoder learns pose specific features using an unsupervised instance-level domain adaptation method for the pedestrian instances from both distributions. The proposed framework has improved state-of-the-art performances of pose estimation, pedestrian detection, and instance segmentation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.