HIT at SemEval-2022 Task 2: Pre-trained Language Model for Idioms   Detection

Zheng Chu; Ziqing Yang; Yiming Cui; Zhigang Chen; Ming Liu

arXiv:2204.06145·cs.CL·April 14, 2022

HIT at SemEval-2022 Task 2: Pre-trained Language Model for Idioms Detection

Zheng Chu, Ziqing Yang, Yiming Cui, Zhigang Chen, Ming Liu

PDF

Open Access

TL;DR

This paper presents a method using pre-trained language models to accurately detect idiomatic expressions in sentences by leveraging context-aware embeddings, addressing limitations of non-contextual approaches.

Contribution

The paper introduces a novel approach employing pre-trained language models for idiom detection, improving accuracy over traditional non-contextual methods.

Findings

01

Pre-trained language models enhance idiom detection accuracy.

02

Context-aware embeddings outperform non-contextual methods.

03

Effective in distinguishing literal and idiomatic meanings.

Abstract

The same multi-word expressions may have different meanings in different sentences. They can be mainly divided into two categories, which are literal meaning and idiomatic meaning. Non-contextual-based methods perform poorly on this problem, and we need contextual embedding to understand the idiomatic meaning of multi-word expressions correctly. We use a pre-trained language model, which can provide a context-aware sentence embedding, to detect whether multi-word expression in the sentence is idiomatic usage.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification