Lexical Adaptation of Link Grammar to the Biomedical Sublanguage: a   Comparative Evaluation of Three Approaches

Sampo Pyysalo; Tapio Salakoski; Sophie Aubin (LIPN); Adeline Nazarenko; (LIPN)

arXiv:cs/0606119·cs.CL·May 23, 2007

Lexical Adaptation of Link Grammar to the Biomedical Sublanguage: a Comparative Evaluation of Three Approaches

Sampo Pyysalo, Tapio Salakoski, Sophie Aubin (LIPN), Adeline Nazarenko, (LIPN)

PDF

Open Access

TL;DR

This paper evaluates three methods for adapting the Link Grammar Parser to biomedical language, demonstrating that combining a domain-specific POS tagger with other techniques significantly improves parsing accuracy and efficiency.

Contribution

It introduces and compares three approaches for lexical adaptation of Link Grammar to biomedical text, highlighting the effectiveness of using a domain-specific POS tagger.

Findings

01

45% increase in parsing efficiency

02

10% relative decrease in error with POS tagging

03

Best approach combines multiple adaptation techniques

Abstract

We study the adaptation of Link Grammar Parser to the biomedical sublanguage with a focus on domain terms not found in a general parser lexicon. Using two biomedical corpora, we implement and evaluate three approaches to addressing unknown words: automatic lexicon expansion, the use of morphological clues, and disambiguation using a part-of-speech tagger. We evaluate each approach separately for its effect on parsing performance and consider combinations of these approaches. In addition to a 45% increase in parsing efficiency, we find that the best approach, incorporating information from a domain part-of-speech tagger, offers a statistically signicant 10% relative decrease in error. The adapted parser is available under an open-source license at http://www.it.utu.fi/biolg.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Biomedical Text Mining and Ontologies · Text Readability and Simplification