Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for   Inferring Online Health Texts

Joseph Gatto; Sarah M. Preum

arXiv:2309.09877·cs.CL·September 19, 2023

Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for Inferring Online Health Texts

Joseph Gatto, Sarah M. Preum

PDF

Open Access

TL;DR

This paper introduces a semantic graph-based approach using Abstract Meaning Representation (AMR) to enhance low-resource health NLP tasks, improving performance on complex texts by augmenting embeddings with semantic information.

Contribution

The study demonstrates that incorporating AMR graphs into NLP models improves performance on low-resource health tasks and handles complex texts better than traditional methods.

Findings

01

AMR augmentation improves task performance

02

Semantic graphs help model complex health texts

03

Models show less variance with complex inputs

Abstract

User-generated texts available on the web and social platforms are often long and semantically challenging, making them difficult to annotate. Obtaining human annotation becomes increasingly difficult as problem domains become more specialized. For example, many health NLP problems require domain experts to be a part of the annotation pipeline. Thus, it is crucial that we develop low-resource NLP solutions able to work with this set of limited-data problems. In this study, we employ Abstract Meaning Representation (AMR) graphs as a means to model low-resource Health NLP tasks sourced from various online health resources and communities. AMRs are well suited to model online health texts as they can represent multi-sentence inputs, abstract away from complex terminology, and model long-distance relationships between co-referring tokens. AMRs thus improve the ability of pre-trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification