DoPE: Decoy Oriented Perturbation Encapsulation Human-Readable, AI-Hostile Documents for Academic Integrity

Ashish Raj Shekhar; Shiven Agarwal; Priyanuj Bordoloi; Yash Shah; Tejas Anvekar; Vivek Gupta

arXiv:2601.12505·cs.CL·January 21, 2026

DoPE: Decoy Oriented Perturbation Encapsulation Human-Readable, AI-Hostile Documents for Academic Integrity

Ashish Raj Shekhar, Shiven Agarwal, Priyanuj Bordoloi, Yash Shah, Tejas Anvekar, Vivek Gupta

PDF

Open Access

TL;DR

DoPE introduces a novel document-layer defense that embeds semantic decoys into exam documents to prevent and detect AI-based cheating in assessments, leveraging render-parse discrepancies in multimodal LLMs.

Contribution

The paper presents DoPE, a new framework for embedding semantic decoys into exam documents, along with FewSoRT-Q and FewSoRT-D pipelines, to enhance academic integrity against multimodal LLMs.

Findings

01

Achieves 91.4% detection rate with 8.7% false positives.

02

Prevents successful AI completion in 96.3% of attempts.

03

Provides a new benchmark, Integrity-Bench, for evaluating document-layer defenses.

Abstract

Multimodal Large Language Models (MLLMs) can directly consume exam documents, threatening conventional assessments and academic integrity. We present DoPE (Decoy-Oriented Perturbation Encapsulation), a document-layer defense framework that embeds semantic decoys into PDF/HTML assessments to exploit render-parse discrepancies in MLLM pipelines. By instrumenting exams at authoring time, DoPE provides model-agnostic prevention (stop or confound automated solving) and detection (flag blind AI reliance) without relying on conventional one-shot classifiers. We formalize prevention and detection tasks, and introduce FewSoRT-Q, an LLM-guided pipeline that generates question-level semantic decoys and FewSoRT-D to encapsulate them into watermarked documents. We evaluate on Integrity-Bench, a novel benchmark of 1826 exams (PDF+HTML) derived from public QA datasets and OpenCourseWare. Against…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAcademic integrity and plagiarism · Topic Modeling · Adversarial Robustness in Machine Learning