Scalable Community Extraction of Text Networks for Automated Grouping in   Medical Databases

Tomilayo Komolafe; Allan Fong; Srijan Sengupta

arXiv:2111.15633·cs.SI·December 1, 2021

Scalable Community Extraction of Text Networks for Automated Grouping in Medical Databases

Tomilayo Komolafe, Allan Fong, Srijan Sengupta

PDF

Open Access

TL;DR

This paper introduces a scalable community extraction algorithm tailored for large text networks in medical databases, improving group detection accuracy over manual tagging and aiding in patient safety analysis.

Contribution

It adapts a known community extraction method to large-scale text networks, enabling more accurate and scalable grouping in medical databases.

Findings

01

Generated groups are more accurate than manual tags.

02

Method is scalable to large text datasets.

03

Improves patient safety data analysis.

Abstract

Networks are ubiquitous in today's world. Community structure is a well-known feature of many empirical networks, and a lot of statistical methods have been developed for community detection. In this paper, we consider the problem of community structure in text networks,which is greatly relevant in medical errors and patient safety databases. We adapt a well-known community extraction method to develop a scalable algorithm for community extraction in large text databases. The application of our method on a real-world patient safety report database demonstrates that the groups generated from community extraction are much more accurate than manual tagging by frontline workers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services