BERTops: Studying BERT Representations under a Topological Lens
Jatin Chauhan, Manohar Kaul

TL;DR
This paper introduces a topological analysis method for BERT's hidden representations using persistent homology, providing a new scoring function that correlates with model accuracy and robustness against adversarial attacks.
Contribution
It proposes a novel persistence scoring function (PSF) that captures topological features of BERT representations, outperforming existing metrics in accuracy correlation and robustness prediction.
Findings
PSF accurately correlates with test set accuracy.
PSF is more stable to perturbations than baseline metrics.
PSF predicts attack success rates effectively.
Abstract
Proposing scoring functions to effectively understand, analyze and learn various properties of high dimensional hidden representations of large-scale transformer models like BERT can be a challenging task. In this work, we explore a new direction by studying the topological features of BERT hidden representations using persistent homology (PH). We propose a novel scoring function named "persistence scoring function (PSF)" which: (i) accurately captures the homology of the high-dimensional hidden representations and correlates well with the test set accuracy of a wide range of datasets and outperforms existing scoring metrics, (ii) captures interesting post fine-tuning "per-class" level properties from both qualitative and quantitative viewpoints, (iii) is more stable to perturbations as compared to the baseline functions, which makes it a very robust proxy, and (iv) finally, also serves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Leprosy Research and Treatment · Tryptophan and brain disorders
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Layer Normalization · Attention Dropout · Softmax · Dense Connections · Weight Decay
