Isolating authorship from content with semantic embeddings and   contrastive learning

Javier Huertas-Tato; Adri\'an Gir\'on-Jim\'enez; Alejandro Mart\'in,; David Camacho

arXiv:2411.18472·cs.CL·November 28, 2024

Isolating authorship from content with semantic embeddings and contrastive learning

Javier Huertas-Tato, Adri\'an Gir\'on-Jim\'enez, Alejandro Mart\'in,, David Camacho

PDF

Open Access

TL;DR

This paper introduces a contrastive learning method with synthetic hard negatives to better disentangle authorship style from content in embeddings, improving authorship attribution accuracy especially in challenging out-of-domain scenarios.

Contribution

The paper proposes a novel contrastive learning approach using synthetic hard negatives to reduce content leakage and better isolate stylistic features in authorship embeddings.

Findings

01

Up to 10% accuracy improvement in difficult out-of-domain tasks.

02

Effective disentanglement of style and content embeddings.

03

Preserves zero-shot capabilities after fine-tuning.

Abstract

Authorship has entangled style and content inside. Authors frequently write about the same topics in the same style, so when different authors write about the exact same topic the easiest way out to distinguish them is by understanding the nuances of their style. Modern neural models for authorship can pick up these features using contrastive learning, however, some amount of content leakage is always present. Our aim is to reduce the inevitable impact and correlation between content and authorship. We present a technique to use contrastive learning (InfoNCE) with additional hard negatives synthetically created using a semantic similarity model. This disentanglement technique aims to distance the content embedding space from the style embedding space, leading to embeddings more informed by style. We demonstrate the performance with ablations on two different datasets and compare them on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Topic Modeling · Misinformation and Its Impacts

MethodsContrastive Learning