A Plug-and-Play Method for Rare Human-Object Interactions Detection by   Bridging Domain Gap

Lijun Zhang; Wei Suo; Peng Wang; Yanning Zhang

arXiv:2407.21438·cs.CV·August 1, 2024

A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap

Lijun Zhang, Wei Suo, Peng Wang, Yanning Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces CEFA, a plug-and-play framework that bridges the domain gap between generated and original data at the feature level, significantly improving rare human-object interaction detection.

Contribution

The paper proposes a novel domain gap bridging method, CEFA, combining feature alignment and context enhancement to improve rare HOI detection performance.

Findings

01

Improves detection accuracy on rare HOI categories

02

Effective domain gap reduction between generated and real data

03

Enhances contextual understanding in HOI detection

Abstract

Human-object interactions (HOI) detection aims at capturing human-object pairs in images and corresponding actions. It is an important step toward high-level visual reasoning and scene understanding. However, due to the natural bias from the real world, existing methods mostly struggle with rare human-object pairs and lead to sub-optimal results. Recently, with the development of the generative model, a straightforward approach is to construct a more balanced dataset based on a group of supplementary samples. Unfortunately, there is a significant domain gap between the generated data and the original data, and simply merging the generated images into the original dataset cannot significantly boost the performance. To alleviate the above problem, we present a novel model-agnostic framework called \textbf{C}ontext-\textbf{E}nhanced \textbf{F}eature \textbf{A}lignment (CEFA) module, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lijunzhang01/cefa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition

MethodsALIGN