Quirk or Palmer: A Comparative Study of Modal Verb Frameworks with Annotated Datasets
Risako Owan, Maria Gini, Dongyeop Kang

TL;DR
This paper introduces the Moverb dataset with annotated modal verb senses based on two frameworks, demonstrating the challenge of disambiguation and providing a resource for future NLP research.
Contribution
It presents a new dataset with annotations for modal verb senses using Quirk and Palmer frameworks, and evaluates RoBERTa classifiers on this data.
Findings
Similar inter-annotator agreement for both frameworks
Achieved F1 scores of 82.2 and 78.3 with RoBERTa classifiers
Modal sense disambiguation remains a challenging task
Abstract
Modal verbs, such as "can", "may", and "must", are commonly used in daily communication to convey the speaker's perspective related to the likelihood and/or mode of the proposition. They can differ greatly in meaning depending on how they're used and the context of a sentence (e.g. "They 'must' help each other out." vs. "They 'must' have helped each other out.") Despite their practical importance in natural language understanding, linguists have yet to agree on a single, prominent framework for the categorization of modal verb senses. This lack of agreement stems from high degrees of flexibility and polysemy from the modal verbs, making it more difficult for researchers to incorporate insights from this family of words into their work. This work presents Moverb dataset, which consists of 27,240 annotations of modal verb senses over 4,540 utterances containing one or more sentences from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
