Enhancing deep neural networks with morphological information

Matej Klemen; Luka Krsnik; Marko Robnik-\v{S}ikonja

arXiv:2011.12432·cs.CL·March 3, 2022

Enhancing deep neural networks with morphological information

Matej Klemen, Luka Krsnik, Marko Robnik-\v{S}ikonja

PDF

2 Repos

TL;DR

This paper investigates how adding morphological features to LSTM and BERT models affects NLP tasks in less-resourced, morphologically rich languages, revealing mixed results depending on feature quality and task.

Contribution

It provides a comparative analysis of the impact of morphological features on neural models across multiple languages and tasks, highlighting conditions for effectiveness.

Findings

01

Morphological features improve LSTM performance on NER and DP.

02

High-quality features enhance BERT's DP performance, but predicted features do not.

03

Effects vary by task, feature quality, and model type.

Abstract

Deep learning approaches are superior in NLP due to their ability to extract informative features and patterns from languages. The two most successful neural architectures are LSTM and transformers, used in large pretrained language models such as BERT. While cross-lingual approaches are on the rise, most current NLP techniques are designed and applied to English, and less-resourced languages are lagging behind. In morphologically rich languages, information is conveyed through morphology, e.g., through affixes modifying stems of words. Existing neural approaches do not explicitly use the information on word morphology. We analyse the effect of adding morphological features to LSTM and BERT models. As a testbed, we use three tasks available in many less-resourced languages: named entity recognition (NER), dependency parsing (DP), and comment filtering (CF). We construct baselines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Linear Warmup With Linear Decay · Residual Connection · Layer Normalization · Softmax · Adam · Weight Decay · Attention Is All You Need · Dropout · Tanh Activation