Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga, Marcelo Mendoza, Alvaro Soto

TL;DR
This paper investigates the concept knowledge graphs encoded by modern language models using a probing classifier grounded on WordNet, revealing that all models encode knowledge but with inaccuracies and biases influenced by architecture and training.
Contribution
It systematically extracts and compares the concept knowledge graphs of nine influential language models, highlighting their inaccuracies and biases based on architecture and training strategies.
Findings
All models encode concept knowledge but with inaccuracies.
Different architectures and training strategies lead to distinct biases.
Certain concepts are more challenging to encode accurately.
Abstract
The field of natural language understanding has experienced exponential progress in the last few years, with impressive results in several tasks. This success has motivated researchers to study the underlying knowledge encoded by these models. Despite this, attempts to understand their semantic capabilities have not been successful, often leading to non-conclusive, or contradictory conclusions among different works. Via a probing classifier, we extract the underlying knowledge graph of nine of the most influential language models of the last years, including word embeddings, text generators, and context encoders. This probe is based on concept relatedness, grounded on WordNet. Our results reveal that all the models encode this knowledge, but suffer from several inaccuracies. Furthermore, we show that the different architectures and training strategies lead to different model biases. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
