Tortured phrases: A dubious writing style emerging in science. Evidence   of critical issues affecting established journals

Guillaume Cabanac; Cyril Labb\'e; Alexander Magazinov

arXiv:2107.06751·cs.DL·July 15, 2021·106 cites

Tortured phrases: A dubious writing style emerging in science. Evidence of critical issues affecting established journals

Guillaume Cabanac, Cyril Labb\'e, Alexander Magazinov

PDF

Open Access 1 Repo 4 Models

TL;DR

This paper investigates the emergence of 'tortured phrases' in scientific literature, revealing potential AI-generated or rewritten texts that threaten research integrity, especially in reputable journals.

Contribution

It introduces the concept of tortured phrases, analyzes their prevalence in a reputable journal, and highlights irregularities suggesting AI involvement and questionable publication practices.

Findings

01

Concentration of tortured phrases in the journal's abstracts.

02

Detection of synthetic texts using AI-based classifiers.

03

Irregular editorial timelines and questionable article features.

Abstract

Probabilistic text generators have been used to produce fake scientific papers for more than a decade. Such nonsensical papers are easily detected by both human and machine. Now more complex AI-powered generation techniques produce texts indistinguishable from that of humans and the generation of scientific texts from a few keywords has been documented. Our study introduces the concept of tortured phrases: unexpected weird phrases in lieu of established ones, such as 'counterfeit consciousness' instead of 'artificial intelligence.' We combed the literature for tortured phrases and study one reputable journal where these concentrated en masse. Hypothesising the use of advanced language models we ran a detector on the abstracts of recent articles of this journal and on several control sets. The pairwise comparisons reveal a concentration of abstracts flagged as 'synthetic' in the journal.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gcabanac/editorial-assessment
none

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling