Exploiting Multi-Scale Fusion, Spatial Attention and Patch Interaction Techniques for Text-Independent Writer Identification
Abhishek Srivastava, Sukalpa Chanda, Umapada Pal

TL;DR
This paper introduces a novel deep learning framework combining spatial attention, multi-scale fusion, and patch interaction techniques to improve text-independent writer identification accuracy across multiple datasets.
Contribution
It proposes a new multi-technique deep learning approach that effectively captures writer-specific handwriting features, outperforming existing methods on several benchmarks.
Findings
Outperforms state-of-the-art methods on CVL, Firemaker, CERUG-EN datasets.
Achieves comparable performance on IAM dataset.
Demonstrates effectiveness of multi-scale and patch-based features in writer identification.
Abstract
Text independent writer identification is a challenging problem that differentiates between different handwriting styles to decide the author of the handwritten text. Earlier writer identification relied on handcrafted features to reveal pieces of differences between writers. Recent work with the advent of convolutional neural network, deep learning-based methods have evolved. In this paper, three different deep learning techniques - spatial attention mechanism, multi-scale feature fusion and patch-based CNN were proposed to effectively capture the difference between each writer's handwriting. Our methods are based on the hypothesis that handwritten text images have specific spatial regions which are more unique to a writer's style, multi-scale features propagate characteristic features with respect to individual writers and patch-based features give more general and robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
