StrokeNet: Unveiling How to Learn Fine-Grained Interactions in Online Handwritten Stroke Classification
Yiheng Huang, Shuang She, Zewei Wei, Jianmin Lin, Ming Yang, and Wenyin Liu

TL;DR
StrokeNet introduces a novel architecture that models fine-grained stroke interactions in online handwritten Chinese character recognition by using reference points and specialized attention mechanisms, significantly improving accuracy.
Contribution
The paper proposes StrokeNet, a new network that encodes strokes with reference points and features, employing dynamic selection and attention modules to better capture local interactions.
Findings
Achieves state-of-the-art accuracy on multiple datasets.
Improves CASIA-onDo from 93.81% to 95.54%.
Demonstrates robustness and effectiveness of the approach.
Abstract
Stroke classification remains challenging due to variations in writing style, ambiguous content, and dynamic writing positions. The core challenge in stroke classification is modeling the semantic relationships between strokes. Our observations indicate that stroke interactions are typically localized, making it difficult for existing deep learning methods to capture such fine-grained relationships. Although viewing strokes from a point-level perspective can address this issue, it introduces redundancy. However, by selecting reference points and using their sequential order to represent strokes in a fine-grained manner, this problem can be effectively solved. This insight inspired StrokeNet, a novel network architecture encoding strokes as reference pair representations (points + feature vectors), where reference points enable spatial queries and features mediate interaction modeling.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Advanced Neural Network Applications
