BKTreebank: Building a Vietnamese Dependency Treebank
Kiem-Hieu Nguyen

TL;DR
This paper introduces BKTreebank, a dependency treebank for Vietnamese. It discusses the design of the POS tagset, dependency relations, and annotation guidelines, and reports POS tagging and dependency parsing experiments that indicate the treebank is a useful resource for Vietnamese language processing.
Contribution
The paper presents the creation of BKTreebank, a Vietnamese dependency treebank, including its POS tagset, dependency relations, and annotation guidelines, together with experimental validation on POS tagging and dependency parsing.
Findings
POS tagging experiments on the treebank support its usefulness
Dependency parsing results demonstrate the treebank's utility
Provides a foundational resource for Vietnamese NLP
Abstract
A dependency treebank is an important resource for any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. Important points in designing the POS tagset, dependency relations, and annotation guidelines are discussed. We describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.
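The abstract mentions POS tagging and dependency parsing experiments without naming the metrics. As a rough illustration only, the sketch below shows how such experiments are commonly scored: tagging accuracy, unlabeled attachment score (UAS), and labeled attachment score (LAS). The Token class, the evaluate function, the toy sentence, and the tag and relation names are all assumptions made for this example; they are not BKTreebank's tagset, relation inventory, or reported evaluation setup.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    form: str    # surface word
    pos: str     # part-of-speech tag (hypothetical tag names)
    head: int    # 1-based index of the head token; 0 marks the root
    deprel: str  # dependency relation to the head (hypothetical labels)


def evaluate(gold, pred):
    """Return (POS accuracy, UAS, LAS) for two aligned token lists."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and the same length")
    n = len(gold)
    pairs = list(zip(gold, pred))
    pos_ok = sum(g.pos == p.pos for g, p in pairs)
    # UAS counts tokens whose predicted head matches the gold head.
    uas_ok = sum(g.head == p.head for g, p in pairs)
    # LAS additionally requires the relation label to match.
    las_ok = sum(g.head == p.head and g.deprel == p.deprel for g, p in pairs)
    return pos_ok / n, uas_ok / n, las_ok / n


# Toy sentence "Tôi đọc sách" ("I read books"); annotations are illustrative only.
gold = [
    Token("Tôi", "PRON", 2, "nsubj"),
    Token("đọc", "VERB", 0, "root"),
    Token("sách", "NOUN", 2, "obj"),
]
pred = [
    Token("Tôi", "PRON", 2, "nsubj"),
    Token("đọc", "VERB", 0, "root"),
    Token("sách", "VERB", 2, "nmod"),  # wrong POS and wrong label, correct head
]

pos_acc, uas, las = evaluate(gold, pred)
print(f"POS accuracy={pos_acc:.2f} UAS={uas:.2f} LAS={las:.2f}")
```

In this toy case the last token has the correct head but the wrong tag and label, so UAS stays at 1.0 while POS accuracy and LAS both drop to 2/3.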
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
