Towards Effective Authorship Attribution: Integrating Class-Incremental Learning
Mostafa Rahgouy, Hamed Babaei Giglou, Mehnaz Tabassum, Dongji Feng,, Amit Das, Taher Rahgooy, Gerry Dozier, Cheryl D. Seals

TL;DR
This paper redefines authorship attribution as a class-incremental learning problem, enabling systems to adapt to new authors over time and address limitations of traditional closed-world models.
Contribution
It introduces a novel perspective of applying class-incremental learning to authorship attribution, highlighting its potential to handle emerging authors and prevent catastrophic forgetting.
Findings
Examines CIL approaches in the context of AA
Identifies strengths and weaknesses of CIL methods for AA
Outlines future directions for CIL-based AA systems
Abstract
AA is the process of attributing an unidentified document to its true author from a predefined group of known candidates, each possessing multiple samples. The nature of AA necessitates accommodating emerging new authors, as each individual must be considered unique. This uniqueness can be attributed to various factors, including their stylistic preferences, areas of expertise, gender, cultural background, and other personal characteristics that influence their writing. These diverse attributes contribute to the distinctiveness of each author, making it essential for AA systems to recognize and account for these variations. However, current AA benchmarks commonly overlook this uniqueness and frame the problem as a closed-world classification, assuming a fixed number of authors throughout the system's lifespan and neglecting the inclusion of emerging new authors. This oversight renders…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Hate Speech and Cyberbullying Detection · Topic Modeling
