A Multi-head-based architecture for effective morphological tagging in Russian with open dictionary
K. Skibin, M. Pozhidaev, S. Suschenko

TL;DR
This paper introduces a multi-head attention-based architecture for Russian morphological tagging that supports open dictionaries, achieves high accuracy, and is efficient to train without large pretraining datasets.
Contribution
The novel architecture effectively integrates subtoken aggregation and open dictionary support, outperforming previous methods in accuracy and speed.
Findings
Achieves 98-99% accuracy on some grammatical categories
Correctly predicts all categories for 90% of words
Trains efficiently on consumer-grade hardware
Abstract
The article proposes a new architecture based on Multi-head attention to solve the problem of morphological tagging for the Russian language. The preprocessing of the word vectors includes splitting the words into subtokens, followed by a trained procedure for aggregating the vectors of the subtokens into vectors for tokens. This allows to support an open dictionary and analyze morphological features taking into account parts of words (prefixes, endings, etc.). The open dictionary allows in future to analyze words that are absent in the training dataset. The performed computational experiment on the SinTagRus and Taiga datasets shows that for some grammatical categories the proposed architecture gives accuracy 98-99% and above, which outperforms previously known results. For nine out of ten words, the architecture precisely predicts all grammatical categories and indicates when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
