VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word   Representations for Improved Definition Modeling

Machel Reid; Edison Marrese-Taylor; Yutaka Matsuo

arXiv:2010.03124·cs.CL·October 8, 2020

VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word Representations for Improved Definition Modeling

Machel Reid, Edison Marrese-Taylor, Yutaka Matsuo

PDF

Open Access 1 Repo

TL;DR

This paper introduces VCDM, a generative model using variational inference and contextualized embeddings to improve word and phrase definition modeling, achieving state-of-the-art results on multiple benchmarks.

Contribution

The paper presents a novel generative approach with explicit latent variables for definition modeling, leveraging contextualized embeddings and introducing new datasets.

Findings

01

VCDM outperforms existing models on four benchmarks.

02

The model achieves higher accuracy in automatic and human evaluations.

03

New datasets 'Cambridge' and 'Robert' enhance empirical analysis.

Abstract

In this paper, we tackle the task of definition modeling, where the goal is to learn to generate definitions of words and phrases. Existing approaches for this task are discriminative, combining distributional and lexical semantics in an implicit rather than direct way. To tackle this issue we propose a generative model for the task, introducing a continuous latent variable to explicitly model the underlying relationship between a phrase used within a context and its definition. We rely on variational inference for estimation and leverage contextualized word embeddings for improved performance. Our approach is evaluated on four existing challenging benchmarks with the addition of two new datasets, "Cambridge" and the first non-English corpus "Robert", which we release to complement our empirical study. Our Variational Contextual Definition Modeler (VCDM) achieves state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

machelreid/vcdm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications