TopEx: Topic-based Explanations for Model Comparison

Shreya Havaldar; Adam Stein; Eric Wong; Lyle Ungar

arXiv:2306.00976 · cs.CL · June 5, 2023 · 1 citation

Open Access

TL;DR

TopEx is a model-agnostic explanation method that uses topics to make language models meaningfully comparable. It targets two limitations of current explanation techniques: they overwhelm human readers because of large vocabularies, or they cannot be compared across models.

Contribution

The paper presents TopEx, a novel explanation approach that enables fair comparison of language models through the use of model-agnostic topics.

Findings

1. TopEx identifies similarities between DistilRoBERTa and GPT-2.

2. TopEx reveals differences in model behavior across NLP tasks.

3. Topic-level explanations are simpler for humans to interpret than vocabulary-level ones.

Abstract

Meaningfully comparing language models is challenging with current explanation methods. Current explanations are either overwhelming for humans due to large vocabularies or incomparable across models. We present TopEx, an explanation method that enables a level playing field for comparing language models via model-agnostic topics. We demonstrate how TopEx can identify similarities and differences between DistilRoBERTa and GPT-2 on a variety of NLP tasks.
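
The abstract does not spell out how topics are built or scored, so the following is only a minimal sketch of the general idea of comparing models at the topic level, not the paper's procedure. It assumes each model already yields a word-level importance score, and that a shared word-to-topic mapping exists; the function names `topic_importance` and `cosine`, the mapping, and all numbers are hypothetical toy values chosen for illustration.

```python
import math
from collections import defaultdict


def topic_importance(word_scores, word_to_topic):
    """Average word-level importance scores within each shared topic.

    Words that have no topic assignment are skipped.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for word, score in word_scores.items():
        topic = word_to_topic.get(word)
        if topic is None:
            continue
        totals[topic] += score
        counts[topic] += 1
    return {topic: totals[topic] / counts[topic] for topic in totals}


def cosine(a, b):
    """Cosine similarity between two topic-importance dictionaries."""
    topics = sorted(set(a) | set(b))
    va = [a.get(t, 0.0) for t in topics]
    vb = [b.get(t, 0.0) for t in topics]
    dot = sum(x * y for x, y in zip(va, vb))
    norm_a = math.sqrt(sum(x * x for x in va))
    norm_b = math.sqrt(sum(y * y for y in vb))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical shared topic assignment (an assumption, not from the paper).
word_to_topic = {
    "great": "sentiment",
    "awful": "sentiment",
    "plot": "film",
    "actor": "film",
}

# Hypothetical word-level importance scores for two unnamed models.
model_a_scores = {"great": 0.8, "awful": 0.6, "plot": 0.2, "actor": 0.0}
model_b_scores = {"great": 0.5, "awful": 0.9, "plot": 0.1, "actor": 0.3}

topics_a = topic_importance(model_a_scores, word_to_topic)
topics_b = topic_importance(model_b_scores, word_to_topic)
similarity = cosine(topics_a, topics_b)

print(topics_a)
print(topics_b)
print(similarity)
```

Because both models are scored over the same small set of topics rather than their own vocabularies, the two explanations have the same shape and can be compared directly. The averaging and cosine steps are design choices of this sketch; other aggregation or comparison rules would fit the same outline.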

Taxonomy

Topics: Topic Modeling · Explainable Artificial Intelligence (XAI) · Scientific Computing and Data Management

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Layer Normalization · Byte Pair Encoding · Softmax · Linear Warmup With Cosine Annealing · Adam