Improving Neural Sequence Labelling using Additional Linguistic Information
Mahtab Ahmed, Muhammad Rifayat Samee, Robert E. Mercer

TL;DR
This paper introduces a neural sequence labelling approach that incorporates linguistic features like sense embeddings and character-level information, leading to improved accuracy and faster convergence on POS, NER, and chunking tasks.
Contribution
The study presents a novel method of integrating linguistic features into neural models, resulting in more efficient training and state-of-the-art performance.
Findings
Achieved state-of-the-art results on POS, NER, and chunking datasets.
Model converges faster than previous methods.
Incorporating linguistic features enhances model efficiency and accuracy.
Abstract
Sequence labelling is the task of assigning categorical labels to a data sequence. In Natural Language Processing, sequence labelling can be applied to various fundamental problems, such as Part of Speech (POS) tagging, Named Entity Recognition (NER), and Chunking. In this study, we propose a method to add various linguistic features to the neural sequence framework to improve sequence labelling. Besides word level knowledge, sense embeddings are added to provide semantic information. Additionally, selective readings of character embeddings are added to capture contextual as well as morphological features for each word in a sentence. Compared to previous methods, these added linguistic features allow us to design a more concise model and perform more efficient training. Our proposed architecture achieves state of the art results on the benchmark datasets of POS, NER, and chunking.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
