Evaluating Distributed Representations for Multi-Level Lexical   Semantics: A Research Proposal

Zhu Liu

arXiv:2406.00751·cs.CL·December 4, 2024

Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal

Zhu Liu

PDF

Open Access

TL;DR

This research proposal aims to evaluate how well neural network-based distributed representations encode different levels of lexical semantics, bridging computational models and linguistic theory.

Contribution

It formalizes three levels of lexical semantics and proposes a framework for evaluating language models across these levels using multilingual datasets.

Findings

01

Formalization of local, global, and mixed lexical semantic levels

02

Development of multilingual datasets for semantic evaluation

03

Framework for assessing neural network representations of lexical meaning

Abstract

Modern neural networks (NNs), trained on extensive raw sentence data, construct distributed representations by compressing individual words into dense, continuous, high-dimensional vectors. These representations are expected to capture multi-level lexical meaning. In this thesis, our objective is to examine the efficacy of distributed representations from NNs in encoding lexical meaning. Initially, we identify and formalize three levels of lexical semantics: \textit{local}, \textit{global}, and \textit{mixed} levels. Then, for each level, we evaluate language models by collecting or constructing multilingual datasets, leveraging various language models, and employing linguistic analysis theories. This thesis builds a bridge between computational models and lexical semantics, aiming to complement each other.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Speech and dialogue systems