Semantic Reconstruction of Adversarial Plagiarism: A Context-Aware Framework for Detecting and Restoring "Tortured Phrases" in Scientific Literature

Agniva Maiti; Prajwal Panth; and Suresh Chandra Satapathy

arXiv:2512.10435·cs.CL·December 12, 2025

Semantic Reconstruction of Adversarial Plagiarism: A Context-Aware Framework for Detecting and Restoring "Tortured Phrases" in Scientific Literature

Agniva Maiti, Prajwal Panth, and Suresh Chandra Satapathy

PDF

Open Access

TL;DR

This paper introduces SRAP, a novel framework combining anomaly detection and semantic reconstruction to identify and restore paraphrased scientific text, significantly improving detection accuracy over existing methods.

Contribution

It presents a two-stage approach using domain-specific language models and dense retrieval for detecting and restoring adversarial plagiarism in scientific literature.

Findings

01

Achieves 23.67% restoration accuracy, outperforming baseline methods.

02

Static decision boundaries are more robust than dynamic thresholds in scientific jargon.

03

Zero-shot baselines fail completely in restoring obfuscated phrases.

Abstract

The integrity and reliability of scientific literature is facing a serious threat by adversarial text generation techniques, specifically from the use of automated paraphrasing tools to mask plagiarism. These tools generate "tortured phrases", statistically improbable synonyms (e.g. "counterfeit consciousness" for "artificial intelligence"), that preserve the local grammar while obscuring the original source. Most existing detection methods depend heavily on static blocklists or general-domain language models, which suffer from high false-negative rates for novel obfuscations and cannot determine the source of the plagiarized content. In this paper, we propose Semantic Reconstruction of Adversarial Plagiarism (SRAP), a framework designed not only to detect these anomalies but to mathematically recover the original terminology. We use a two-stage architecture: (1) statistical anomaly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAcademic integrity and plagiarism · Authorship Attribution and Profiling · Topic Modeling