TL;DR
This paper introduces a novel, layer-informed method for generating sense embeddings from Transformer-based language models, improving performance on sense disambiguation and related tasks by leveraging all model layers.
Contribution
It presents a principled approach to utilize all layers of NLMs for sense embedding creation, outperforming prior methods and demonstrating versatility across multiple sense-related tasks.
Findings
Layer-informed sense embeddings outperform previous models.
Using all layers of NLMs enhances sense disambiguation accuracy.
Sense embeddings show versatility beyond WSD tasks.
Abstract
Distributional semantics based on neural approaches is a cornerstone of Natural Language Processing, with surprising connections to human meaning representation as well. Recent Transformer-based Language Models have proven capable of producing contextual word representations that reliably convey sense-specific information, simply as a product of self-supervision. Prior work has shown that these contextual representations can be used to accurately represent large sense inventories as sense embeddings, to the extent that a distance-based solution to Word Sense Disambiguation (WSD) tasks outperforms models trained specifically for the task. Still, there remains much to understand on how to use these Neural Language Models (NLMs) to produce sense embeddings that can better harness each NLM's meaning representation abilities. In this work we introduce a more principled approach to leverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
