SimCLAD: A Simple Framework for Contrastive Learning of Acronym   Disambiguation

Bin Li; Fei Xia; Yixuan Weng; Xiusheng Huang; Bin Sun

arXiv:2111.14306·cs.CL·December 10, 2021·1 cites

SimCLAD: A Simple Framework for Contrastive Learning of Acronym Disambiguation

Bin Li, Fei Xia, Yixuan Weng, Xiusheng Huang, Bin Sun

PDF

Open Access

TL;DR

SimCLAD introduces a contrastive learning framework that improves acronym disambiguation by enhancing phrase representations, outperforming existing methods in scientific domain tasks.

Contribution

The paper proposes a novel contrastive pre-training approach that refines pre-trained language models for better acronym disambiguation in scientific texts.

Findings

01

Outperforms state-of-the-art methods on scientific acronym disambiguation

02

Enhances model generalization through contrastive pre-training

03

Improves phrase-level discrimination in vector space

Abstract

Acronym disambiguation means finding the correct meaning of an ambiguous acronym from the dictionary in a given sentence, which is one of the key points for scientific document understanding (SDU@AAAI-22). Recently, many attempts have tried to solve this problem via fine-tuning the pre-trained masked language models (MLMs) in order to obtain a better acronym representation. However, the acronym meaning is varied under different contexts, whose corresponding phrase representation mapped in different directions lacks discrimination in the entire vector space. Thus, the original representations of the pre-trained MLMs are not ideal for the acronym disambiguation task. In this paper, we propose a Simple framework for Contrastive Learning of Acronym Disambiguation (SimCLAD) method to better understand the acronym meanings. Specifically, we design a continual contrastive pre-training method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques

MethodsContrastive Learning