A Problem Reduction Approach for Visual Relationships Detection

Toshiyuki Fukuzawa

arXiv:1809.09828·cs.CV·September 27, 2018·1 cites

A Problem Reduction Approach for Visual Relationships Detection

Toshiyuki Fukuzawa

PDF

Open Access

TL;DR

This paper introduces a reduction-based method for visual relationship detection, transforming the problem into object detection tasks, demonstrated successfully in a competitive challenge and achieving prize-winning results.

Contribution

It proposes a novel approach that reduces visual relationship detection to object detection, simplifying the problem and improving performance.

Findings

01

Achieved prize-winning performance in the ECCV 2018 challenge

02

Effectively detected object pairs with specific relationships

03

Validated the reduction approach on a large-scale dataset

Abstract

Identifying different objects (man and cup) is an important problem on its own, but identifying the relationship between them (holding) is critical for many real world use cases. This paper describes an approach to reduce a visual relationship detection problem to object detection problems. The method was applied to Google AI Open Images V4 Visual Relationship Track Challenge, which was held in conjunction with 2018 European Conference on Computer Vision (ECCV 2018) and it finished as a prize winner. The challenge was to build an algorithm that detects pairs of objects in particular relations: things like "woman playing guitar," "beer on table," or "dog inside car.".

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Neural Network Applications