TL;DR
This paper introduces KTN, a novel framework that improves multi-person densepose estimation by refining feature representations and leveraging external knowledge graphs to handle incomplete and low-quality annotations.
Contribution
KTN enhances densepose estimation accuracy by refining pyramidal features and integrating external knowledge, addressing challenges of partial annotations and false estimations.
Findings
KTN outperforms state-of-the-art methods on DensePose-COCO dataset.
Refined pyramidal features improve densepose accuracy.
External knowledge integration reduces errors from low-quality labels.
Abstract
Human densepose estimation, aiming at establishing dense correspondences between 2D pixels of human body and 3D human body template, is a key technique in enabling machines to have an understanding of people in images. It still poses several challenges due to practical scenarios where real-world scenes are complex and only partial annotations are available, leading to incompelete or false estimations. In this work, we present a novel framework to detect the densepose of multiple people in an image. The proposed method, which we refer to Knowledge Transfer Network (KTN), tackles two main problems: 1) how to refine image representation for alleviating incomplete estimations, and 2) how to reduce false estimation caused by the low-quality training labels (i.e., limited annotations and class-imbalance labels). Unlike existing works directly propagating the pyramidal features of regions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
