TL;DR
This paper introduces a Transformer-based method for online handwritten character segmentation that leverages known transcriptions to improve segmentation accuracy, outperforming existing methods on standard datasets.
Contribution
It presents a novel Transformer architecture with learned character queries for precise segmentation, decoupling segmentation from recognition and addressing the assignment problem.
Findings
Achieves state-of-the-art segmentation results on IAM-OnDB and HANDS-VNOnDB datasets.
Demonstrates the effectiveness of Transformer-based cluster assignment in handwriting segmentation.
Provides new ground truth data for evaluation of online handwritten character segmentation.
Abstract
On-line handwritten character segmentation is often associated with handwriting recognition and even though recognition models include mechanisms to locate relevant positions during the recognition process, it is typically insufficient to produce a precise segmentation. Decoupling the segmentation from the recognition unlocks the potential to further utilize the result of the recognition. We specifically focus on the scenario where the transcription is known beforehand, in which case the character segmentation becomes an assignment problem between sampling points of the stylus trajectory and characters in the text. Inspired by the -means clustering algorithm, we view it from the perspective of cluster assignment and present a Transformer-based architecture where each cluster is formed based on a learned character query in the Transformer decoder block. In order to assess the quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Byte Pair Encoding · Label Smoothing · Dropout · Absolute Position Encodings · Layer Normalization · Adam
