Real or Fake? Learning to Discriminate Machine from Human Generated Text
Anton Bakhtin, Sam Gross, Myle Ott, Yuntian Deng, Marc'Aurelio, Ranzato, Arthur Szlam

TL;DR
This paper explores the application of energy-based models (EBMs) to distinguish between machine-generated and human-written text, demonstrating their strong generalization capabilities and sensitivity to training data variations.
Contribution
It introduces a novel residual EBM framework that leverages pre-trained language models for negative sample generation in text classification tasks.
Findings
EBMs can effectively discriminate machine from human text.
They generalize well across different generator architectures.
Sensitivity to training set data impacts EBM performance.
Abstract
Energy-based models (EBMs), a.k.a. un-normalized models, have had recent successes in continuous spaces. However, they have not been successfully applied to model text sequences. While decreasing the energy at training samples is straightforward, mining (negative) samples where the energy should be increased is difficult. In part, this is because standard gradient-based methods are not readily applicable when the input is high-dimensional and discrete. Here, we side-step this issue by generating negatives using pre-trained auto-regressive language models. The EBM then works in the residual of the language model; and is trained to discriminate real text from text generated by the auto-regressive models. We investigate the generalization ability of residual EBMs, a pre-requisite for using them in other applications. We extensively analyze generalization for the task of classifying whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
Methodsenergy-based model
