MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are   Better Dense Retrievers

Kun Zhou; Xiao Liu; Yeyun Gong; Wayne Xin Zhao; Daxin Jiang; Nan Duan,; Ji-Rong Wen

arXiv:2212.07841·cs.CL·June 21, 2023

MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers

Kun Zhou, Xiao Liu, Yeyun Gong, Wayne Xin Zhao, Daxin Jiang, Nan Duan,, Ji-Rong Wen

PDF

Open Access 1 Repo 2 Models

TL;DR

MASTER unifies various pre-training tasks into a multi-task masked autoencoder framework, significantly enhancing dense retrieval performance by effectively capturing passage and inter-passage information.

Contribution

It introduces a novel multi-task pre-training approach with a shared-encoder and multi-decoder architecture to improve dense retrieval models.

Findings

01

Outperforms existing dense retrieval methods.

02

Effectively integrates multiple pre-training tasks.

03

Demonstrates strong generalization across benchmarks.

Abstract

Pre-trained Transformers (\eg BERT) have been commonly used in existing dense retrieval methods for parameter initialization, and recent studies are exploring more effective pre-training tasks for further improving the quality of dense vectors. Although various novel and effective tasks have been proposed, their different input formats and learning objectives make them hard to be integrated for jointly improving the model performance. In this work, we aim to unify a variety of pre-training tasks into the bottlenecked masked autoencoder manner, and integrate them into a multi-task pre-trained model, namely MASTER. Concretely, MASTER utilizes a shared-encoder multi-decoder architecture that can construct a representation bottleneck to compress the abundant semantic information across tasks into dense vectors. Based on it, we integrate three types of representative pre-training tasks:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/simxns
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Image Retrieval and Classification Techniques