CoT-MAE v2: Contextual Masked Auto-Encoder with Multi-view Modeling for   Passage Retrieval

Xing Wu; Guangyuan Ma; Peng Wang; Meng Lin; Zijia Lin; Fuzheng Zhang; and Songlin Hu

arXiv:2304.03158·cs.CL·April 7, 2023·1 cites

CoT-MAE v2: Contextual Masked Auto-Encoder with Multi-view Modeling for Passage Retrieval

Xing Wu, Guangyuan Ma, Peng Wang, Meng Lin, Zijia Lin, Fuzheng Zhang, and Songlin Hu

PDF

Open Access

TL;DR

CoT-MAE v2 introduces multi-view modeling with dense and sparse vectors, along with dual decoding strategies, to enhance passage retrieval performance through improved contextual representations.

Contribution

It presents a novel multi-view pretraining framework combining dense/sparse representations and autoencoding/auto-regressive decoding for passage retrieval.

Findings

01

Outperforms existing methods on large-scale benchmarks

02

Demonstrates robustness on out-of-domain zero-shot tasks

03

Enhances contextual representations for passage retrieval

Abstract

Growing techniques have been emerging to improve the performance of passage retrieval. As an effective representation bottleneck pretraining technique, the contextual masked auto-encoder utilizes contextual embedding to assist in the reconstruction of passages. However, it only uses a single auto-encoding pre-task for dense representation pre-training. This study brings multi-view modeling to the contextual masked auto-encoder. Firstly, multi-view representation utilizes both dense and sparse vectors as multi-view representations, aiming to capture sentence semantics from different aspects. Moreover, multiview decoding paradigm utilizes both autoencoding and auto-regressive decoders in representation bottleneck pre-training, aiming to provide both reconstructive and generative signals for better contextual representation pretraining. We refer to this multi-view pretraining method as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications