MICE: Mining Idioms with Contextual Embeddings

Tadej \v{S}kvorc; Polona Gantar; Marko Robnik-\v{S}ikonja

arXiv:2008.05759·cs.CL·November 11, 2021

MICE: Mining Idioms with Contextual Embeddings

Tadej \v{S}kvorc, Polona Gantar, Marko Robnik-\v{S}ikonja

PDF

1 Repo

TL;DR

This paper introduces MICE, a method leveraging contextual embeddings like ELMo and BERT to detect idiomatic expressions, including unseen ones, and provides a new dataset for training and evaluation.

Contribution

The paper presents a novel approach using contextual embeddings for idiom detection and introduces a new dataset, improving detection accuracy and enabling cross-lingual transfer.

Findings

01

Deep neural networks outperform existing methods.

02

Models detect unseen idioms effectively.

03

Cross-lingual transfer is feasible with the approach.

Abstract

Idiomatic expressions can be problematic for natural language processing applications as their meaning cannot be inferred from their constituting words. A lack of successful methodological approaches and sufficiently large datasets prevents the development of machine learning approaches for detecting idioms, especially for expressions that do not occur in the training set. We present an approach, called MICE, that uses contextual embeddings for that purpose. We present a new dataset of multi-word expressions with literal and idiomatic meanings and use it to train a classifier based on two state-of-the-art contextual word embeddings: ELMo and BERT. We show that deep neural networks using both embeddings perform much better than existing approaches, and are capable of detecting idiomatic word use, even for expressions that were not present in the training set. We demonstrate cross-lingual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TadejSkvorc/MICE
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Layer Normalization · Adam · Long Short-Term Memory · Dense Connections · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Bidirectional LSTM