Inference Gap in Domain Expertise and Machine Intelligence in Named Entity Recognition: Creation of and Insights from a Substance Use-related Dataset

Sumon Kanti Dey; Jeanne M. Powell; Azra Ismail; Jeanmarie Perrone; Abeed Sarker

arXiv:2508.19467·cs.CL·August 28, 2025

Inference Gap in Domain Expertise and Machine Intelligence in Named Entity Recognition: Creation of and Insights from a Substance Use-related Dataset

Sumon Kanti Dey, Jeanne M. Powell, Azra Ismail, Jeanmarie Perrone, Abeed Sarker

PDF

TL;DR

This paper introduces a new dataset and NER framework for extracting opioid-related impacts from social media, highlighting the importance of domain-specific fine-tuning and revealing a significant gap between AI and expert performance.

Contribution

The study presents RedditImpacts 2.0, a high-quality dataset with refined annotations, and demonstrates the effectiveness of fine-tuned models over LLMs in domain-specific NER tasks.

Findings

01

Fine-tuned DeBERTa-large achieves F1 of 0.61

02

Strong performance with less labeled data

03

Significant gap between AI and expert agreement

Abstract

Nonmedical opioid use is an urgent public health challenge, with far-reaching clinical and social consequences that are often underreported in traditional healthcare settings. Social media platforms, where individuals candidly share first-person experiences, offer a valuable yet underutilized source of insight into these impacts. In this study, we present a named entity recognition (NER) framework to extract two categories of self-reported consequences from social media narratives related to opioid use: ClinicalImpacts (e.g., withdrawal, depression) and SocialImpacts (e.g., job loss). To support this task, we introduce RedditImpacts 2.0, a high-quality dataset with refined annotation guidelines and a focus on first-person disclosures, addressing key limitations of prior work. We evaluate both fine-tuned encoder-based models and state-of-the-art large language models (LLMs) under zero-…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.