Isolating authorship from content with semantic embeddings and contrastive learning
Javier Huertas-Tato, Adri\'an Gir\'on-Jim\'enez, Alejandro Mart\'in,, David Camacho

TL;DR
This paper introduces a contrastive learning method with synthetic hard negatives to better disentangle authorship style from content in embeddings, improving authorship attribution accuracy especially in challenging out-of-domain scenarios.
Contribution
The paper proposes a novel contrastive learning approach using synthetic hard negatives to reduce content leakage and better isolate stylistic features in authorship embeddings.
Findings
Up to 10% accuracy improvement in difficult out-of-domain tasks.
Effective disentanglement of style and content embeddings.
Preserves zero-shot capabilities after fine-tuning.
Abstract
Authorship has entangled style and content inside. Authors frequently write about the same topics in the same style, so when different authors write about the exact same topic the easiest way out to distinguish them is by understanding the nuances of their style. Modern neural models for authorship can pick up these features using contrastive learning, however, some amount of content leakage is always present. Our aim is to reduce the inevitable impact and correlation between content and authorship. We present a technique to use contrastive learning (InfoNCE) with additional hard negatives synthetically created using a semantic similarity model. This disentanglement technique aims to distance the content embedding space from the style embedding space, leading to embeddings more informed by style. We demonstrate the performance with ablations on two different datasets and compare them on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Topic Modeling · Misinformation and Its Impacts
MethodsContrastive Learning
