Discriminative Cross-View Binary Representation Learning
Liu Liu, Hairong Qi

TL;DR
This paper introduces DCVH, an end-to-end discriminative cross-view hashing method that leverages CNNs and multitask learning to produce semantic-preserving binary representations for multimedia retrieval and annotation.
Contribution
It proposes a novel end-to-end framework combining CNN-based nonlinear hashing, multilabel classification, and view alignment via Hamming distance minimization for improved cross-view multimedia retrieval.
Findings
DCVH outperforms state-of-the-art cross-view hashing methods on benchmark datasets.
The method effectively integrates image and text modalities for retrieval tasks.
DCVH achieves competitive results in image annotation and tagging.
Abstract
Learning compact representation is vital and challenging for large scale multimedia data. Cross-view/cross-modal hashing for effective binary representation learning has received significant attention with exponentially growing availability of multimedia content. Most existing cross-view hashing algorithms emphasize the similarities in individual views, which are then connected via cross-view similarities. In this work, we focus on the exploitation of the discriminative information from different views, and propose an end-to-end method to learn semantic-preserving and discriminative binary representation, dubbed Discriminative Cross-View Hashing (DCVH), in light of learning multitasking binary representation for various tasks including cross-view retrieval, image-to-image retrieval, and image annotation/tagging. The proposed DCVH has the following key components. First, it uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
