SVS-JOIN: Efficient Spatial Visual Similarity Join over Multimedia Data
Chengyuan Zhang, Ruipeng Chen, Lei Zhu, Zuping Zhang, Fang Huang,, Yunwu Lin

TL;DR
This paper introduces SVS-JOIN, a novel method for efficiently performing spatial visual similarity joins on geo-multimedia data, combining geo-location and visual content analysis to improve large-scale geo-image retrieval.
Contribution
It defines the spatial visual similarity join problem and proposes three methods, including a novel quadtree and inverted index-based approach, to enhance efficiency.
Findings
SVS-JOIN$_G$ improves efficiency with spatial grid strategy.
SVS-JOIN$_Q$ achieves high performance using quadtree and inverted index.
Experimental results show significant performance gains.
Abstract
In the big data era, massive amount of multimedia data with geo-tags has been generated and collected by mobile smart devices equipped with mobile communications module and position sensor module. This trend has put forward higher request on large-scale of geo-multimedia data retrieval. Spatial similarity join is one of the important problem in the area of spatial database. Previous works focused on textual document with geo-tags, rather than geo-multimedia data such as geo-images. In this paper, we study a novel search problem named spatial visual similarity join (SVS-JOIN for short), which aims to find similar geo-image pairs in both the aspects of geo-location and visual content. We propose the definition of SVS-JOIN at the first time and present how to measure geographical similarity and visual similarity. Then we introduce a baseline inspired by the method for textual similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization
