Tortured phrases: A dubious writing style emerging in science. Evidence of critical issues affecting established journals
Guillaume Cabanac, Cyril Labb\'e, Alexander Magazinov

TL;DR
This paper investigates the emergence of 'tortured phrases' in scientific literature, revealing potential AI-generated or rewritten texts that threaten research integrity, especially in reputable journals.
Contribution
It introduces the concept of tortured phrases, analyzes their prevalence in a reputable journal, and highlights irregularities suggesting AI involvement and questionable publication practices.
Findings
Concentration of tortured phrases in the journal's abstracts.
Detection of synthetic texts using AI-based classifiers.
Irregular editorial timelines and questionable article features.
Abstract
Probabilistic text generators have been used to produce fake scientific papers for more than a decade. Such nonsensical papers are easily detected by both human and machine. Now more complex AI-powered generation techniques produce texts indistinguishable from that of humans and the generation of scientific texts from a few keywords has been documented. Our study introduces the concept of tortured phrases: unexpected weird phrases in lieu of established ones, such as 'counterfeit consciousness' instead of 'artificial intelligence.' We combed the literature for tortured phrases and study one reputable journal where these concentrated en masse. Hypothesising the use of advanced language models we ran a detector on the abstracts of recent articles of this journal and on several control sets. The pairwise comparisons reveal a concentration of abstracts flagged as 'synthetic' in the journal.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
