A Modality Lexicon and its use in Automatic Tagging

Kathryn Baker; Michael Bloodgood; Bonnie J. Dorr; Nathaniel W.; Filardo; Lori Levin; Christine Piatko

arXiv:1410.4868·cs.CL·October 21, 2014·49 cites

A Modality Lexicon and its use in Automatic Tagging

Kathryn Baker, Michael Bloodgood, Bonnie J. Dorr, Nathaniel W., Filardo, Lori Levin, Christine Piatko

PDF

Open Access

TL;DR

This paper presents a new modality lexicon and two automated taggers that improve semantic annotation and machine translation quality by identifying modality components in language data.

Contribution

We developed a semi-automatically expanded modality lexicon and two taggers, demonstrating improved annotation precision and translation quality in machine translation tasks.

Findings

01

Structure-based tagger achieves 86% precision

02

Modality annotation improves translation quality by 0.3 Bleu points

03

Lexicon is publicly available for research use

Abstract

This paper describes our resource-building results for an eight-week JHU Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically-Informed Machine Translation. Specifically, we describe the construction of a modality annotation scheme, a modality lexicon, and two automated modality taggers that were built using the lexicon and annotation scheme. Our annotation scheme is based on identifying three components of modality: a trigger, a target and a holder. We describe how our modality lexicon was produced semi-automatically, expanding from an initial hand-selected list of modality trigger words and phrases. The resulting expanded modality lexicon is being made publicly available. We demonstrate that one tagger---a structure-based tagger---results in precision around 86% (depending on genre) for tagging of a standard LDC data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Lexicography and Language Studies · Topic Modeling