Addressing Topic Leakage in Cross-Topic Evaluation for Authorship   Verification

Jitkapat Sawatphol; Can Udomcharoenchaikit; Sarana Nutanong

arXiv:2407.19164·cs.CL·July 30, 2024

Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification

Jitkapat Sawatphol, Can Udomcharoenchaikit, Sarana Nutanong

PDF

Open Access 1 Repo

TL;DR

This paper introduces HITS, a new evaluation method for authorship verification that reduces topic leakage effects, leading to more stable model rankings and revealing models' reliance on topic-specific cues.

Contribution

It proposes HITS for better evaluation of AV models under topic shift and introduces RAVEN benchmark to test topic shortcut reliance.

Findings

01

HITS creates more stable model rankings across different splits.

02

RAVEN uncovers models' dependence on topic-specific features.

03

Topic leakage significantly impacts AV model evaluation.

Abstract

Authorship verification (AV) aims to identify whether a pair of texts has the same author. We address the challenge of evaluating AV models' robustness against topic shifts. The conventional evaluation assumes minimal topic overlap between training and test data. However, we argue that there can still be topic leakage in test data, causing misleading model performance and unstable rankings. To address this, we propose an evaluation method called Heterogeneity-Informed Topic Sampling (HITS), which creates a smaller dataset with a heterogeneously distributed topic set. Our experimental results demonstrate that HITS-sampled datasets yield a more stable ranking of models across random seeds and evaluation splits. Our contributions include: 1. An analysis of causes and effects of topic leakage. 2. A demonstration of the HITS in reducing the effects of topic leakage, and 3. The Robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jitkapat/hits_authorship
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Misinformation and Its Impacts · Topic Modeling