Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching
Haiwen Diao, Ying Zhang, Shang Gao, Xiang Ruan, Huchuan Lu

TL;DR
This paper introduces Deep Boosting Learning (DBL), a novel cooperative training approach for image-text matching that leverages peer branch knowledge transfer to improve model accuracy and robustness.
Contribution
The paper proposes a new DBL algorithm that uses an anchor and target branch to enhance multi-modal matching through knowledge transfer, outperforming existing cooperative strategies.
Findings
DBL achieves consistent improvements over state-of-the-art models.
DBL outperforms traditional distillation, mutual learning, and contrastive learning methods.
The method is flexible and integrates seamlessly into existing training scenarios.
Abstract
Image-text matching remains a challenging task due to heterogeneous semantic diversity across modalities and insufficient distance separability within triplets. Different from previous approaches focusing on enhancing multi-modal representations or exploiting cross-modal correspondence for more accurate retrieval, in this paper we aim to leverage the knowledge transfer between peer branches in a boosting manner to seek a more powerful matching model. Specifically, we propose a brand-new Deep Boosting Learning (DBL) algorithm, where an anchor branch is first trained to provide insights into the data properties, with a target branch gaining more advanced knowledge to develop optimal features and distance metrics. Concretely, an anchor branch initially learns the absolute or relative distance between positive and negative pairs, providing a foundational understanding of the particular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsContrastive Learning
