Conditional probing: measuring usable information beyond a baseline
John Hewitt, Kawin Ethayarajh, Percy Liang, Christopher D. Manning

TL;DR
This paper introduces conditional probing, a method to measure the information in neural representations beyond what is captured by baseline embeddings, revealing deeper insights into neural network layers.
Contribution
It extends $ ext{V}$-information theory to develop conditional probing, enabling detection of information not explained by baselines in neural representations.
Findings
Part-of-speech information is accessible at deeper layers after conditioning on word embeddings.
Conditional probing reveals more nuanced information in neural networks than traditional baseline comparisons.
The method uncovers hidden layers of information beyond baseline explanations.
Abstract
Probing experiments investigate the extent to which neural representations make properties -- like part-of-speech -- predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy than probing a baseline representation like non-contextual word embeddings. Instead of using baselines as a point of comparison, we're interested in measuring information that is contained in the representation but not in the baseline. For example, current methods can detect when a representation is more useful than the word identity (a baseline) for predicting part-of-speech; however, they cannot detect when the representation is predictive of just the aspects of part-of-speech not explainable by the word identity. In this work, we extend a theory of usable information called -information and propose conditional probing, which explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
