TaCo: Textual Attribute Recognition via Contrastive Learning
Chang Nie, Yiqing Hu, Yanqiu Qu, Hao Liu, Deqiang Jiang, Bo Ren

TL;DR
TaCo introduces a contrastive learning framework for textual attribute recognition in documents, effectively distinguishing subtle attribute differences and handling distortions, surpassing previous supervised methods in accuracy.
Contribution
The paper presents a novel contrastive learning approach tailored for textual attribute recognition, addressing ambiguity and distortion issues in document images.
Findings
TaCo outperforms supervised methods on multiple attribute recognition tasks.
It effectively distinguishes subtle attribute differences.
The framework is robust to real-world distortions.
Abstract
As textual attributes like font are core design elements of document format and page style, automatic attributes recognition favor comprehensive practical applications. Existing approaches already yield satisfactory performance in differentiating disparate attributes, but they still suffer in distinguishing similar attributes with only subtle difference. Moreover, their performance drop severely in real-world scenarios where unexpected and obvious imaging distortions appear. In this paper, we aim to tackle these problems by proposing TaCo, a contrastive framework for textual attribute recognition tailored toward the most common document scenes. Specifically, TaCo leverages contrastive learning to dispel the ambiguity trap arising from vague and open-ended attributes. To realize this goal, we design the learning paradigm from three perspectives: 1) generating attribute views, 2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Text and Document Classification Technologies
MethodsContrastive Learning
