Auto-MLM: Improved Contrastive Learning for Self-supervised   Multi-lingual Knowledge Retrieval

Wenshen Xu; Mieradilijiang Maimaiti; Yuanhang Zheng; Xin Tang; Ji; Zhang

arXiv:2203.16187·cs.CL·March 31, 2022·1 cites

Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval

Wenshen Xu, Mieradilijiang Maimaiti, Yuanhang Zheng, Xin Tang, Ji, Zhang

PDF

Open Access

TL;DR

Auto-MLM introduces a joint training approach combining contrastive learning and masked language modeling to enhance self-supervised multi-lingual knowledge retrieval, outperforming previous state-of-the-art methods across multiple languages and datasets.

Contribution

It proposes a novel joint training method that integrates CL and Auto-MLM to better extract internal information for multi-lingual knowledge retrieval.

Findings

01

Outperforms all previous SOTA methods on AliExpress and LAZADA datasets.

02

Effective across 8 languages with consistent improvements.

03

Enhances internal information extraction from queries.

Abstract

Contrastive learning (CL) has become a ubiquitous approach for several natural language processing (NLP) downstream tasks, especially for question answering (QA). However, the major challenge, how to efficiently train the knowledge retrieval model in an unsupervised manner, is still unresolved. Recently the commonly used methods are composed of CL and masked language model (MLM). Unexpectedly, MLM ignores the sentence-level training, and CL also neglects extraction of the internal info from the query. To optimize the CL hardly obtain internal information from the original query, we introduce a joint training method by combining CL and Auto-MLM for self-supervised multi-lingual knowledge retrieval. First, we acquire the fixed dimensional sentence vector. Then, mask some words among the original sentences with random strategy. Finally, we generate a new token representation for predicting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Text and Document Classification Technologies

Methodstravel james · INFO: An Efficient Optimization Algorithm based on Weighted Mean of Vectors