Multi-modal image retrieval with random walk on multi-layer graphs

Renata Khasanova; Xiaowen Dong; Pascal Frossard

arXiv:1607.03406·cs.IR·March 7, 2017

Multi-modal image retrieval with random walk on multi-layer graphs

Renata Khasanova, Xiaowen Dong, Pascal Frossard

PDF

TL;DR

This paper introduces a novel multi-layer graph-based image retrieval method that integrates multimodal features through random walks, significantly improving image similarity ranking in large datasets.

Contribution

It proposes a new multi-layer graph model with optimized edge weights for effective multimodal image retrieval, outperforming existing methods.

Findings

01

High retrieval accuracy in real-world datasets

02

Superior performance over state-of-the-art solutions

03

Effective multimodal feature integration

Abstract

The analysis of large collections of image data is still a challenging problem due to the difficulty of capturing the true concepts in visual data. The similarity between images could be computed using different and possibly multimodal features such as color or edge information or even text labels. This motivates the design of image analysis solutions that are able to effectively integrate the multi-view information provided by different feature sets. We therefore propose a new image retrieval solution that is able to sort images through a random walk on a multi-layer graph, where each layer corresponds to a different type of information about the image data. We study in depth the design of the image graph and propose in particular an effective method to select the edge weights for the multi-layer graph, such that the image ranking scores are optimised. We then provide extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.