Integrity Shield A System for Ethical AI Use & Authorship Transparency in Assessments
Ashish Raj Shekhar, Shiven Agarwal, Priyanuj Bordoloi, Yash Shah, Tejas Anvekar, Vivek Gupta

TL;DR
Integrity Shield is a novel document-layer watermarking system that embeds invisible, schema-aware signatures into assessment PDFs, effectively preventing large language models from answering protected exams and enabling reliable authorship verification.
Contribution
The paper introduces a new watermarking approach that operates at the document level, ensuring exam integrity against proprietary black-box AI systems and enabling robust signature recovery.
Findings
Achieves 91-94% exam-level blocking across 30 diverse exams.
Enables 89-93% reliable signature retrieval from AI responses.
Effective across multiple commercial language models.
Abstract
Large Language Models (LLMs) can now solve entire exams directly from uploaded PDF assessments, raising urgent concerns about academic integrity and the reliability of grades and credentials. Existing watermarking techniques either operate at the token level or assume control over the model's decoding process, making them ineffective when students query proprietary black-box systems with instructor-provided documents. We present Integrity Shield, a document-layer watermarking system that embeds schema-aware, item-level watermarks into assessment PDFs while keeping their human-visible appearance unchanged. These watermarks consistently prevent MLLMs from answering shielded exam PDFs and encode stable, item-level signatures that can be reliably recovered from model or student responses. Across 30 exams spanning STEM, humanities, and medical reasoning, Integrity Shield achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Academic integrity and plagiarism · Topic Modeling
