Author Identification using Multi-headed Recurrent Neural Networks

Douglas Bagnall

arXiv:1506.04891·cs.CL·August 17, 2016·44 cites

Author Identification using Multi-headed Recurrent Neural Networks

Douglas Bagnall

PDF

Open Access 4 Repos

TL;DR

This paper introduces a multi-headed RNN approach for author identification, sharing a recurrent layer among authors while using separate output heads, achieving competitive results across multiple languages.

Contribution

It presents a novel multi-headed RNN architecture that effectively models individual author styles with limited data, outperforming existing methods in some languages.

Findings

01

Ranked first in two of four languages

02

Effective modeling of author style with shared recurrent layer

03

Competitive performance in author identification

Abstract

Recurrent neural networks (RNNs) are very good at modelling the flow of text, but typically need to be trained on a far larger corpus than is available for the PAN 2015 Author Identification task. This paper describes a novel approach where the output layer of a character-level RNN language model is split into several independent predictive sub-models, each representing an author, while the recurrent layer is shared by all. This allows the recurrent layer to model the language as a whole without over-fitting, while the outputs select aspects of the underlying model that reflect their author's style. The method proves competitive, ranking first in two of the four languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Natural Language Processing Techniques · Topic Modeling