SVS-JOIN: Efficient Spatial Visual Similarity Join over Multimedia Data

Chengyuan Zhang; Ruipeng Chen; Lei Zhu; Zuping Zhang; Fang Huang,; Yunwu Lin

arXiv:1810.00549·cs.MM·October 2, 2018

SVS-JOIN: Efficient Spatial Visual Similarity Join over Multimedia Data

Chengyuan Zhang, Ruipeng Chen, Lei Zhu, Zuping Zhang, Fang Huang,, Yunwu Lin

PDF

Open Access

TL;DR

This paper introduces SVS-JOIN, a novel method for efficiently performing spatial visual similarity joins on geo-multimedia data, combining geo-location and visual content analysis to improve large-scale geo-image retrieval.

Contribution

It defines the spatial visual similarity join problem and proposes three methods, including a novel quadtree and inverted index-based approach, to enhance efficiency.

Findings

01

SVS-JOIN$_G$ improves efficiency with spatial grid strategy.

02

SVS-JOIN$_Q$ achieves high performance using quadtree and inverted index.

03

Experimental results show significant performance gains.

Abstract

In the big data era, massive amount of multimedia data with geo-tags has been generated and collected by mobile smart devices equipped with mobile communications module and position sensor module. This trend has put forward higher request on large-scale of geo-multimedia data retrieval. Spatial similarity join is one of the important problem in the area of spatial database. Previous works focused on textual document with geo-tags, rather than geo-multimedia data such as geo-images. In this paper, we study a novel search problem named spatial visual similarity join (SVS-JOIN for short), which aims to find similar geo-image pairs in both the aspects of geo-location and visual content. We propose the definition of SVS-JOIN at the first time and present how to measure geographical similarity and visual similarity. Then we introduce a baseline inspired by the method for textual similarity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization