TL;DR
VnCoreNLP is a comprehensive, open-source Vietnamese NLP toolkit that provides fast, accurate annotations for key tasks like segmentation, POS tagging, NER, and dependency parsing, supporting Vietnamese language research.
Contribution
It introduces a unified, efficient toolkit for Vietnamese NLP that achieves state-of-the-art performance across multiple core tasks.
Findings
Achieves state-of-the-art results in Vietnamese NLP tasks
Provides a fast and easy-to-use annotation pipeline
Supports multiple NLP tasks within a single toolkit
Abstract
We present an easy-to-use and fast toolkit, namely VnCoreNLP---a Java NLP annotation pipeline for Vietnamese. Our VnCoreNLP supports key natural language processing (NLP) tasks including word segmentation, part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing, and obtains state-of-the-art (SOTA) results for these tasks. We release VnCoreNLP to provide rich linguistic annotations to facilitate research work on Vietnamese NLP. Our VnCoreNLP is open-source and available at: https://github.com/vncorenlp/VnCoreNLP
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
