An Efficient Approach for Geo-Multimedia Cross-Modal Retrieval
Lei Zhu, Jun Long, Chengyuan Zhang, Ruipeng Chen, Xinpan Yuan, Zhan, Yang

TL;DR
This paper introduces a novel geo-multimedia cross-modal retrieval framework that combines deep learning, semantic matching, and a hybrid indexing structure to efficiently search geographically tagged multimedia data.
Contribution
It proposes the first definition of $k$NN geo-multimedia cross-modal queries and develops a comprehensive framework including semantic matching and a hybrid index for efficient retrieval.
Findings
Outperforms existing methods in accuracy and efficiency
Effectively bridges semantic gaps between modalities
Demonstrates scalability on large datasets
Abstract
Due to the rapid development of mobile Internet techniques, cloud computation and popularity of online social networking and location-based services, massive amount of multimedia data with geographical information is generated and uploaded to the Internet. In this paper, we propose a novel type of cross-modal multimedia retrieval called geo-multimedia cross-modal retrieval which aims to search out a set of geo-multimedia objects based on geographical distance proximity and semantic similarity between different modalities. Previous studies for cross-modal retrieval and spatial keyword search cannot address this problem effectively because they do not consider multimedia data with geo-tags and do not focus on this type of query. In order to address this problem efficiently, we present the definition of NN geo-multimedia cross-modal query at the first time and introduce relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Multimodal Machine Learning Applications
