Reddit-Impacts: A Named Entity Recognition Dataset for Analyzing Clinical and Social Effects of Substance Use Derived from Social Media
Yao Ge, Sudeshna Das, Karen O'Connor, Mohammed Ali Al-Garadi, Graciela Gonzalez-Hernandez, and Abeed Sarker

TL;DR
Reddit-Impacts is a new annotated dataset from social media aimed at improving automatic detection of clinical and social impacts of substance use, facilitating better understanding and public health responses.
Contribution
The paper introduces Reddit-Impacts, a novel NER dataset focused on clinical and social effects of substance use from social media, and evaluates baseline machine learning models on it.
Findings
Transformer models like BERT and RoBERTa achieved strong baseline performance.
Few-shot and one-shot learning models demonstrated potential for NER tasks.
The dataset is publicly available for further research and development.
Abstract
Substance use disorders (SUDs) are a growing concern globally, necessitating enhanced understanding of the problem and its trends through data-driven research. Social media are unique and important sources of information about SUDs, particularly since the data in such sources are often generated by people with lived experiences. In this paper, we introduce Reddit-Impacts, a challenging Named Entity Recognition (NER) dataset curated from subreddits dedicated to discussions on prescription and illicit opioids, as well as medications for opioid use disorder. The dataset specifically concentrates on the lesser-studied, yet critically important, aspects of substance use--its clinical and social impacts. We collected data from chosen subreddits using the publicly available Application Programming Interface for Reddit. We manually annotated text spans representing clinical and social impacts…
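To make the idea of annotated text spans concrete, here is a minimal sketch of how span annotations are commonly turned into token-level BIO tags for NER training. It is an illustration only and not the authors' pipeline: the label names `CLINICAL` and `SOCIAL`, the function `spans_to_bio`, and the example sentence are all hypothetical and are not taken from the dataset.

```python
from typing import List, Tuple

# (start_token, end_token_exclusive, label); label names are hypothetical.
Span = Tuple[int, int, str]


def spans_to_bio(tokens: List[str], spans: List[Span]) -> List[str]:
    """Convert token-index spans into BIO tags, one tag per token."""
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        if not (0 <= start < end <= len(tokens)):
            raise ValueError(f"span ({start}, {end}) is out of range")
        tags[start] = f"B-{label}"          # first token of the span
        for i in range(start + 1, end):     # remaining tokens of the span
            tags[i] = f"I-{label}"
    return tags


# Made-up sentence for illustration; not a post from the dataset.
tokens = "I lost my job after the withdrawal got worse".split()
spans = [(1, 4, "SOCIAL"), (6, 7, "CLINICAL")]
tags = spans_to_bio(tokens, spans)

for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

A tagged sequence like this is the usual input format for fine-tuning token-classification models such as BERT or RoBERTa; the dataset's actual label scheme and tokenization may differ.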
Taxonomy
Topics: Mental Health via Writing
Methods: Attention Is All You Need · Cosine Annealing · Dropout · Linear Warmup With Cosine Annealing · Residual Connection · Softmax · WordPiece · RoBERTa
