Reconsidering Degeneration of Token Embeddings with Definitions for Encoder-based Pre-trained Language Models
Ying Zhang, Dongyuan Li, and Manabu Okumura

TL;DR
This paper introduces DefinitionEMB, a method that uses dictionary definitions to improve token embeddings in encoder-based PLMs, enhancing their semantic quality and performance on NLP tasks.
Contribution
It proposes a novel approach leveraging definitions to reconstruct isotropic, semantics-rich token embeddings, addressing degeneration issues in pre-trained language models.
Findings
Reconstructed embeddings improve model performance on GLUE tasks.
Definitions from Wiktionary effectively enhance low-frequency token representations.
The method maintains robustness during fine-tuning.
Abstract
Learning token embeddings based on token co-occurrence statistics has proven effective for both pre-training and fine-tuning in natural language processing. However, recent studies have pointed out that the distribution of learned embeddings degenerates into anisotropy (i.e., non-uniform distribution), and even pre-trained language models (PLMs) suffer from a loss of semantics-related information in embeddings for low-frequency tokens. This study first analyzes the fine-tuning dynamics of encoder-based PLMs and demonstrates their robustness against degeneration. On the basis of this analysis, we propose DefinitionEMB, a method that utilizes definitions to re-construct isotropically distributed and semantics-related token embeddings for encoder-based PLMs while maintaining original robustness during fine-tuning. Our experiments demonstrate the effectiveness of leveraging definitions from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Natural Language Processing Techniques
