Fooling OCR Systems with Adversarial Text Images

Congzheng Song; Vitaly Shmatikov

arXiv:1802.05385·cs.LG·February 16, 2018·36 cites

Fooling OCR Systems with Adversarial Text Images

Congzheng Song, Vitaly Shmatikov

PDF

Open Access

TL;DR

This paper shows that advanced OCR systems can be fooled by slight, imperceptible modifications to printed text images, causing them to produce incorrect, semantically opposite transcriptions that can mislead downstream NLP applications.

Contribution

It introduces adversarial image techniques that can reliably alter OCR outputs without changing the visual appearance of the text.

Findings

01

Adversarial images can change OCR recognition results.

02

Minor modifications do not affect human reading.

03

OCR errors can be semantically opposite.

Abstract

We demonstrate that state-of-the-art optical character recognition (OCR) based on deep learning is vulnerable to adversarial images. Minor modifications to images of printed text, which do not change the meaning of the text to a human reader, cause the OCR system to "recognize" a different text where certain words chosen by the adversary are replaced by their semantic opposites. This completely changes the meaning of the output produced by the OCR system and by the NLP applications that use OCR for preprocessing their inputs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Malware Detection Techniques