TL;DR
This paper introduces the Universal Defensive Underpainting Patch (UDUP), a novel method that effectively prevents OCR from extracting text by modifying the underpainting rather than the characters, ensuring robustness across various scenarios.
Contribution
The paper presents UDUP, a small, fixed-size patch that creates non-overlapping underpainting to defend against OCR piracy regardless of text content, size, or background.
Findings
UDUP effectively prevents OCR extraction across diverse backgrounds.
UDUP is robust to image scaling and compression.
UDUP successfully evades multiple off-the-shelf OCR systems.
Abstract
Optical Character Recognition (OCR) enables automatic text extraction from scanned or digitized text images, but it also makes it easy to pirate valuable or sensitive text from these images. Previous methods to prevent OCR piracy by distorting characters in text images are impractical in real-world scenarios, as pirates can capture arbitrary portions of the text images, rendering the defenses ineffective. In this work, we propose a novel and effective defense mechanism termed the Universal Defensive Underpainting Patch (UDUP) that modifies the underpainting of text images instead of the characters. UDUP is created through an iterative optimization process to craft a small, fixed-size defensive patch that can generate non-overlapping underpainting for text images of any size. Experimental results show that UDUP effectively defends against unauthorized OCR under the setting of any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
