Validation of Treatment Discontinuation in Claims Data Using NLP and Electronic Health Records
Chun-Ting Yang, Kerry Ngan, Dae Hyun Kim, Jie Yang, Jun Liu, Joshua Lin

TL;DR
This study uses NLP to validate how well claims data can detect when patients stop taking medications, showing that accuracy varies by drug and gap length.
Contribution
A scalable NLP-based framework is introduced to validate medication discontinuation algorithms in claims data using EHRs as a reference standard.
Findings
90-day-gap algorithms had low sensitivity (0.40-0.54) but moderate to high specificity (0.75-0.89) across medications.
15-day-gap algorithms showed higher sensitivity (0.61-0.75) but lower specificity (0.45-0.66) compared to 90-day-gap algorithms.
Positive predictive values were more influenced by medication discontinuation rates than by gap lengths.
Abstract
Measuring medication discontinuation in claims data primarily relies on the gaps between prescription fills, but such definitions are rarely validated. This study aimed to establish a natural language processing (NLP)-based framework to validate claims-based discontinuation algorithms for commonly used medications against NLP-based reference standards from electronic health records (EHRs). We identified 35,010 patients receiving antipsychotic medications (APMs), benzodiazepines, warfarin, or direct oral anticoagulants (DOACs) from EHRs at the Mass General Brigham in 2007-2020. These EHR data were linked with 82,455 Medicare Part D claims. An NLP-aided chart review was applied to determine medication discontinuation from EHRs (reference standard). In claims data, we defined discontinuation based on a prescription gap larger than 15-90 days (claims-based algorithms). Sensitivity,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPharmacovigilance and Adverse Drug Reactions · Advanced Causal Inference Techniques · Machine Learning in Healthcare
