Towards Effective Authorship Attribution: Integrating Class-Incremental   Learning

Mostafa Rahgouy; Hamed Babaei Giglou; Mehnaz Tabassum; Dongji Feng,; Amit Das; Taher Rahgooy; Gerry Dozier; Cheryl D. Seals

arXiv:2408.08900·cs.IR·August 20, 2024

Towards Effective Authorship Attribution: Integrating Class-Incremental Learning

Mostafa Rahgouy, Hamed Babaei Giglou, Mehnaz Tabassum, Dongji Feng,, Amit Das, Taher Rahgooy, Gerry Dozier, Cheryl D. Seals

PDF

Open Access 1 Repo

TL;DR

This paper redefines authorship attribution as a class-incremental learning problem, enabling systems to adapt to new authors over time and address limitations of traditional closed-world models.

Contribution

It introduces a novel perspective of applying class-incremental learning to authorship attribution, highlighting its potential to handle emerging authors and prevent catastrophic forgetting.

Findings

01

Examines CIL approaches in the context of AA

02

Identifies strengths and weaknesses of CIL methods for AA

03

Outlines future directions for CIL-based AA systems

Abstract

AA is the process of attributing an unidentified document to its true author from a predefined group of known candidates, each possessing multiple samples. The nature of AA necessitates accommodating emerging new authors, as each individual must be considered unique. This uniqueness can be attributed to various factors, including their stylistic preferences, areas of expertise, gender, cultural background, and other personal characteristics that influence their writing. These diverse attributes contribute to the distinctiveness of each author, making it essential for AA systems to recognize and account for these variations. However, current AA benchmarks commonly overlook this uniqueness and frame the problem as a closed-world classification, assuming a fixed number of authors throughout the system's lifespan and neglecting the inclusion of emerging new authors. This oversight renders…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MostafaRahgouy/AA-CIL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Hate Speech and Cyberbullying Detection · Topic Modeling