Exploring the Robustness of Human Parsers Towards Common Corruptions

Sanyi Zhang; Xiaochun Cao; Rui Wang; Guo-Jun Qi; Jie Zhou

arXiv:2309.00938·cs.CV·September 8, 2023

Exploring the Robustness of Human Parsers Towards Common Corruptions

Sanyi Zhang, Xiaochun Cao, Rui Wang, Guo-Jun Qi, Jie Zhou

PDF

Open Access

TL;DR

This paper introduces new benchmarks and a novel augmentation-based method to significantly improve the robustness of human parsers against common image corruptions like blur and noise, without sacrificing performance on clean data.

Contribution

The paper proposes a heterogeneous augmentation-enhanced mechanism combining image-aware and model-aware augmentations to improve robustness of human parsers under corruptions, applicable to various models.

Findings

01

Improved robustness of human parsers on corruption benchmarks

02

Method enhances model resilience without losing clean data accuracy

03

Universal applicability across different human parsing frameworks

Abstract

Human parsing aims to segment each pixel of the human image with fine-grained semantic categories. However, current human parsers trained with clean data are easily confused by numerous image corruptions such as blur and noise. To improve the robustness of human parsers, in this paper, we construct three corruption robustness benchmarks, termed LIP-C, ATR-C, and Pascal-Person-Part-C, to assist us in evaluating the risk tolerance of human parsing models. Inspired by the data augmentation strategy, we propose a novel heterogeneous augmentation-enhanced mechanism to bolster robustness under commonly corrupted conditions. Specifically, two types of data augmentations from different views, i.e., image-aware augmentation and model-aware image-to-image transformation, are integrated in a sequential manner for adapting to unforeseen image corruptions. The image-aware augmentation can enrich the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning