Spanish Resource Grammar version 2023
Olga Zamaraeva, Lorena S. Allegue, Carlos G\'omez-Rodr\'iguez

TL;DR
This paper introduces the 2023 version of the Spanish Resource Grammar, combining linguistic theory testing and NLP applications, supported by a new treebank and empirical evaluation methods.
Contribution
It presents an updated HPSG-based Spanish grammar integrated with a new treebank and a novel approach for empirical syntactic theory testing and language learning evaluation.
Findings
High coverage and low overgeneration on learner sentences
Treebanking process enhances empirical syntactic research
Grammar supports NLP applications in language learning
Abstract
We present the latest version of the Spanish Resource Grammar (SRG), a grammar of Spanish implemented in the HPSG formalism. Such grammars encode a complex set of hypotheses about syntax making them a resource for empirical testing of linguistic theory. They also encode a strict notion of grammaticality which makes them a resource for natural language processing applications in computer-assisted language learning. This version of the SRG uses the recent version of the Freeling morphological analyzer and is released along with an automatically created, manually verified treebank of 2,291 sentences. We explain the treebanking process, emphasizing how it is different from treebanking with manual annotation and how it contributes to empirically-driven development of syntactic theory. The treebanks' high level of consistency and detail makes them a resource for training high-quality semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
