Syntactic Nuclei in Dependency Parsing -- A Multilingual Exploration
Ali Basirat, Joakim Nivre

TL;DR
This paper explores enriching dependency parsing models with the concept of syntactic nuclei across 12 languages, demonstrating small but significant accuracy improvements by incorporating nucleus composition into the parser.
Contribution
It introduces a method to define and integrate the notion of syntactic nuclei into dependency parsing within the Universal Dependencies framework, enhancing parser performance.
Findings
Nucleus composition improves parsing accuracy across multiple languages.
Improvements are most notable for nominal modifiers, coordination, main predicates, and direct objects.
The approach is effective in a multilingual setting.
Abstract
Standard models for syntactic dependency parsing take words to be the elementary units that enter into dependency relations. In this paper, we investigate whether there are any benefits from enriching these models with the more abstract notion of nucleus proposed by Tesni\`{e}re. We do this by showing how the concept of nucleus can be defined in the framework of Universal Dependencies and how we can use composition functions to make a transition-based dependency parser aware of this concept. Experiments on 12 languages show that nucleus composition gives small but significant improvements in parsing accuracy. Further analysis reveals that the improvement mainly concerns a small number of dependency relations, including nominal modifiers, relations of coordination, main predicates, and direct objects.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttentive Walk-Aggregating Graph Neural Network
