PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English
Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, Bradford Salen, Nathan Schneider

TL;DR
PASTRIE is a new manually annotated corpus of English prepositions with supersense tags, drawn from presumed speakers of four L1s (English, French, German, and Spanish), enabling analysis of L1 influence on preposition use.
Contribution
The paper introduces PASTRIE, a corpus of Reddit English in which every preposition type and token in the sample is manually annotated with a supersense, supporting analysis of preposition use across L1 groups.
Findings
Distributional patterns of prepositions vary across L1 groups
A speaker's L1 influences their L2 preposition choice
Corpus enables cross-linguistic preposition studies
Abstract
We present the Prepositions Annotated with Supersense Tags in Reddit International English ("PASTRIE") corpus, a new dataset containing manually annotated preposition supersenses of English data from presumed speakers of four L1s: English, French, German, and Spanish. The annotations are comprehensive, covering all preposition types and tokens in the sample. Along with the corpus, we provide analysis of distributional patterns across the included L1s and a discussion of the influence of L1s on L2 preposition choice.
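As a rough illustration of what "distributional patterns across the included L1s" could mean in practice, the sketch below computes the relative frequency of each supersense label within each L1 group. It is a minimal sketch under stated assumptions, not the authors' analysis. The record layout, the function name `supersense_distribution`, and the toy tokens are all hypothetical and are not taken from the corpus or its release format.

```python
from collections import Counter, defaultdict

# Hypothetical toy records: (L1 group, preposition, supersense label).
# These are invented placeholders for illustration, NOT data from PASTRIE.
TOY_TOKENS = [
    ("English", "in", "Locus"),
    ("English", "to", "Goal"),
    ("English", "about", "Topic"),
    ("English", "in", "Locus"),
    ("French", "in", "Locus"),
    ("French", "to", "Goal"),
    ("French", "to", "Goal"),
    ("German", "about", "Topic"),
    ("German", "in", "Locus"),
    ("Spanish", "to", "Goal"),
    ("Spanish", "in", "Locus"),
]


def supersense_distribution(tokens):
    """Return {l1: {supersense: relative frequency}} for annotated tokens."""
    counts = defaultdict(Counter)
    for l1, _prep, supersense in tokens:
        counts[l1][supersense] += 1
    # Normalize each L1 group's counts so its frequencies sum to 1.
    return {
        l1: {tag: n / sum(tag_counts.values()) for tag, n in tag_counts.items()}
        for l1, tag_counts in counts.items()
    }


if __name__ == "__main__":
    for l1, dist in supersense_distribution(TOY_TOKENS).items():
        print(l1, dist)
```

Normalizing within each L1 group, rather than comparing raw counts, keeps groups of different sizes comparable.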
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
