Modality-dependent Cross-media Retrieval
Yunchao Wei, Yao Zhao, Zhenfeng Zhu, Shikui Wei, Yanhui Xiao, Jiashi, Feng, Shuicheng Yan

TL;DR
This paper introduces a modality-dependent cross-media retrieval model that learns separate projections for image-to-text and text-to-image tasks, improving retrieval performance by optimizing task-specific mappings.
Contribution
The proposed MDCR model learns distinct projection functions for each cross-media retrieval direction, outperforming existing methods by optimizing modality-specific mappings.
Findings
Achieves 41.5% mAP on Wikipedia dataset
Outperforms previous cross-media retrieval methods
Demonstrates the effectiveness of modality-dependent projections
Abstract
In this paper, we investigate the cross-media retrieval between images and text, i.e., using image to search text (I2T) and using text to search images (T2I). Existing cross-media retrieval methods usually learn one couple of projections, by which the original features of images and text can be projected into a common latent space to measure the content similarity. However, using the same projections for the two different retrieval tasks (I2T and T2I) may lead to a tradeoff between their respective performances, rather than their best performances. Different from previous works, we propose a modality-dependent cross-media retrieval (MDCR) model, where two couples of projections are learned for different cross-media retrieval tasks instead of one couple of projections. Specifically, by jointly optimizing the correlation between images and text and the linear regression from one modal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsLinear Discriminant Analysis · Linear Regression
