Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification
Jitkapat Sawatphol, Can Udomcharoenchaikit, Sarana Nutanong

TL;DR
This paper introduces HITS, a new evaluation method for authorship verification that reduces topic leakage effects, leading to more stable model rankings and revealing models' reliance on topic-specific cues.
Contribution
It proposes HITS for better evaluation of AV models under topic shift and introduces RAVEN benchmark to test topic shortcut reliance.
Findings
HITS creates more stable model rankings across different splits.
RAVEN uncovers models' dependence on topic-specific features.
Topic leakage significantly impacts AV model evaluation.
Abstract
Authorship verification (AV) aims to identify whether a pair of texts has the same author. We address the challenge of evaluating AV models' robustness against topic shifts. The conventional evaluation assumes minimal topic overlap between training and test data. However, we argue that there can still be topic leakage in test data, causing misleading model performance and unstable rankings. To address this, we propose an evaluation method called Heterogeneity-Informed Topic Sampling (HITS), which creates a smaller dataset with a heterogeneously distributed topic set. Our experimental results demonstrate that HITS-sampled datasets yield a more stable ranking of models across random seeds and evaluation splits. Our contributions include: 1. An analysis of causes and effects of topic leakage. 2. A demonstration of the HITS in reducing the effects of topic leakage, and 3. The Robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Misinformation and Its Impacts · Topic Modeling
