Disease Identification From Unstructured User Input

Fahim Faisal (1); Shafkat Ahmed Bhuiyan (1); Abu Raihan Mostofa Kamal; (1) ((1) Islamic University of Technology)

arXiv:1905.01987·cs.IR·September 5, 2024

Disease Identification From Unstructured User Input

Fahim Faisal (1), Shafkat Ahmed Bhuiyan (1), Abu Raihan Mostofa Kamal, (1) ((1) Islamic University of Technology)

PDF

Open Access

TL;DR

This paper presents a novel two-phase text classification approach that combines lexicographic, semantic, and symptom-disease correlation features to accurately identify diseases from unstructured user input like health forum posts.

Contribution

It introduces a new algorithm that extracts comprehensive features from unstructured text to improve disease identification accuracy.

Findings

01

Effective in extracting features from unstructured data

02

Improves disease classification accuracy

03

Utilizes symptom-disease correlation for better results

Abstract

A method to identify probable diseases from the unstructured textual input (eg, health forum posts) by incorporating a lexicographic and semantic feature based two-phase text classification module and a symptom-disease correlation-based similarity measurement module. One notable aspect of my approach was to develop a competent algorithm to extract all inherent features from the data source to make a better decision.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Text and Document Classification Technologies