RAID: A Relation-Augmented Image Descriptor

Paul Guerrero; Niloy J. Mitra; Peter Wonka

arXiv:1510.01113·cs.GR·October 7, 2015·1 cites

RAID: A Relation-Augmented Image Descriptor

Paul Guerrero, Niloy J. Mitra, Peter Wonka

PDF

Open Access

TL;DR

RAID is a novel image descriptor that captures complex inter-region relations to improve image search based on spatial relationships, outperforming existing methods in identifying images with intricate region interactions.

Contribution

Introduces RAID, a relation-augmented descriptor that models complex inter-region relations for enhanced image retrieval based on spatial relationships.

Findings

01

Successfully retrieves images with complex relations from COCO dataset

02

Outperforms existing methods in relation-based image search

03

Effectively captures spatial distribution of point-to-region relationships

Abstract

As humans, we regularly interpret images based on the relations between image regions. For example, a person riding object X, or a plank bridging two objects. Current methods provide limited support to search for images based on such relations. We present RAID, a relation-augmented image descriptor that supports queries based on inter-region relations. The key idea of our descriptor is to capture the spatial distribution of simple point-to-region relationships to describe more complex relationships between two image regions. We evaluate the proposed descriptor by querying into a large subset of the Microsoft COCO database and successfully extract nontrivial images demonstrating complex inter-region relations, which are easily missed or erroneously classified by existing methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications