A Method for Modeling Co-Occurrence Propensity of Clinical Codes with Application to ICD-10-PCS Auto-Coding
Michael Subotin, Anthony R. Davis

TL;DR
This paper introduces a method to improve medical auto-coding by modeling code co-occurrence propensities, leading to enhanced accuracy in ICD-10-PCS code assignment.
Contribution
It presents a novel approach that incorporates co-occurrence probabilities into auto-coding, improving code assignment accuracy beyond traditional independent methods.
Findings
Achieved a 12% relative improvement in F-score for ICD-10 procedure codes.
Demonstrated the method's compatibility with existing auto-coders.
Showed potential for wider application in medical auto-coding.
Abstract
Objective. Natural language processing methods for medical auto-coding, or automatic generation of medical billing codes from electronic health records, generally assign each code independently of the others. They may thus assign codes for closely related procedures or diagnoses to the same document, even when they do not tend to occur together in practice, simply because the right choice can be difficult to infer from the clinical narrative. Materials and Methods. We propose a method that injects awareness of the propensities for code co-occurrence into this process. First, a model is trained to estimate the conditional probability that one code is assigned by a human coder, given than another code is known to have been assigned to the same document. Then, at runtime, an iterative algorithm is used to apply this model to the output of an existing statistical auto-coder to modify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Coding and Health Information · Biomedical Text Mining and Ontologies
