Quirk or Palmer: A Comparative Study of Modal Verb Frameworks with   Annotated Datasets

Risako Owan; Maria Gini; Dongyeop Kang

arXiv:2212.10152·cs.CL·December 21, 2022

Quirk or Palmer: A Comparative Study of Modal Verb Frameworks with Annotated Datasets

Risako Owan, Maria Gini, Dongyeop Kang

PDF

Open Access

TL;DR

This paper introduces the Moverb dataset with annotated modal verb senses based on two frameworks, demonstrating the challenge of disambiguation and providing a resource for future NLP research.

Contribution

It presents a new dataset with annotations for modal verb senses using Quirk and Palmer frameworks, and evaluates RoBERTa classifiers on this data.

Findings

01

Similar inter-annotator agreement for both frameworks

02

Achieved F1 scores of 82.2 and 78.3 with RoBERTa classifiers

03

Modal sense disambiguation remains a challenging task

Abstract

Modal verbs, such as "can", "may", and "must", are commonly used in daily communication to convey the speaker's perspective related to the likelihood and/or mode of the proposition. They can differ greatly in meaning depending on how they're used and the context of a sentence (e.g. "They 'must' help each other out." vs. "They 'must' have helped each other out.") Despite their practical importance in natural language understanding, linguists have yet to agree on a single, prominent framework for the categorization of modal verb senses. This lack of agreement stems from high degrees of flexibility and polysemy from the modal verbs, making it more difficult for researchers to incorporate insights from this family of words into their work. This work presents Moverb dataset, which consists of 27,240 annotations of modal verb senses over 4,540 utterances containing one or more sentences from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems