Grounded Compositional Outputs for Adaptive Language Modeling

Nikolaos Pappas; Phoebe Mulcaire; Noah A. Smith

arXiv:2009.11523·cs.CL·October 7, 2020

Grounded Compositional Outputs for Adaptive Language Modeling

Nikolaos Pappas, Phoebe Mulcaire, Noah A. Smith

PDF

1 Repo

TL;DR

This paper introduces a fully compositional output embedding layer for language models grounded in WordNet, enabling size-independent vocabularies and improved adaptation, especially for low-frequency words.

Contribution

It presents the first word-level language model with size independent of the training vocabulary, grounded in structured lexical information, enhancing adaptation and efficiency.

Findings

01

Outperforms previous output embedding methods in language modeling tasks.

02

Achieves better adaptation in cross-domain settings with open vocabularies.

03

More accurate for low-frequency words, demonstrating improved sample efficiency.

Abstract

Language models have emerged as a central component across NLP, and a great deal of progress depends on the ability to cheaply adapt them (e.g., through finetuning) to new domains and tasks. A language model's vocabulary $-$ typically selected before training and permanently fixed later $-$ affects its size and is part of what makes it resistant to such adaptation. Prior work has used compositional input embeddings based on surface forms to ameliorate this issue. In this work, we go one step beyond and propose a fully compositional output embedding layer for language models, which is further grounded in information from a structured lexicon (WordNet), namely semantically related words and free-text definitions. To our knowledge, the result is the first word-level language model with a size that does not depend on the training vocabulary. We evaluate the model on conventional language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Noahs-ARK/groc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.