On the Robustness of Human-Object Interaction Detection against Distribution Shift

Chi Xie; Shuang Liang; Jie Li; Feng Zhu; Rui Zhao; Yichen Wei; Shengjie Zhao

arXiv:2506.18021·cs.CV·June 24, 2025

On the Robustness of Human-Object Interaction Detection against Distribution Shift

Chi Xie, Shuang Liang, Jie Li, Feng Zhu, Rui Zhao, Yichen Wei, Shengjie Zhao

PDF

TL;DR

This paper benchmarks the robustness of Human-Object Interaction detection models under distribution shifts, analyzes their weaknesses, and proposes simple, effective methods to enhance their robustness, improving practical applicability.

Contribution

It introduces the first robustness evaluation benchmark for HOI detection, analyzes existing models' weaknesses, and proposes plug-and-play strategies to improve robustness against distribution shifts.

Findings

01

Existing HOI models are insufficiently robust under distribution shifts.

02

The proposed data augmentation and feature fusion strategies significantly improve robustness.

03

The methods also enhance performance on standard benchmarks.

Abstract

Human-Object Interaction (HOI) detection has seen substantial advances in recent years. However, existing works focus on the standard setting with ideal images and natural distribution, far from practical scenarios with inevitable distribution shifts. This hampers the practical applicability of HOI detection. In this work, we investigate this issue by benchmarking, analyzing, and enhancing the robustness of HOI detection models under various distribution shifts. We start by proposing a novel automated approach to create the first robustness evaluation benchmark for HOI detection. Subsequently, we evaluate more than 40 existing HOI detection models on this benchmark, showing their insufficiency, analyzing the features of different frameworks, and discussing how the robustness in HOI is different from other tasks. With the insights from such analyses, we propose to improve the robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus