Using Distributional Principles for the Semantic Study of Contextual   Language Models

Olivier Ferret

arXiv:2111.12174·cs.CL·November 25, 2021

Using Distributional Principles for the Semantic Study of Contextual Language Models

Olivier Ferret

PDF

Open Access

TL;DR

This paper investigates the semantic properties of contextual language models using distributional principles, focusing on substitution-based probing in controlled and open settings to compare static and contextual models.

Contribution

It introduces a novel method for analyzing semantic similarity in language models and adapts it to compare static and contextual models in both controlled and open environments.

Findings

01

Distributional substitution reveals semantic properties of models

02

Differences identified between static and contextual models

03

Method applicable to various language model analyses

Abstract

Many studies were recently done for investigating the properties of contextual language models but surprisingly, only a few of them consider the properties of these models in terms of semantic similarity. In this article, we first focus on these properties for English by exploiting the distributional principle of substitution as a probing mechanism in the controlled context of SemCor and WordNet paradigmatic relations. Then, we propose to adapt the same method to a more open setting for characterizing the differences between static and contextual language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies