Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming
J Liu, R Bai, Z Lu, P Ge, D Liu, Uwe Aickelin

TL;DR
This paper introduces a genetic programming approach to evolve interpretable regular expressions for medical text classification, enhancing transparency and allowing medical professionals to fine-tune classifiers for better accuracy.
Contribution
It presents a novel method combining genetic programming with a new regular expression syntax to produce understandable classifiers for medical text, improving interpretability and customization.
Findings
Effective classification of medical inquiries with satisfactory precision and recall
Generated classifiers are fully understandable and editable by medical professionals
Promising performance demonstrated on real healthcare data
Abstract
In medical fields, text classification is one of the most important tasks that can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for human to understand or manually fine-tune the classification results for better precision and recall, due to the black box nature of learning. This study proposes a novel regular expression-based text classification method making use of genetic programming (GP) approaches to evolve regular expressions that can classify a given medical text inquiry with satisfactory precision and recall while allow human to read the classifier and fine-tune accordingly if necessary. Given a seed population of regular expressions (can be randomly initialized or manually constructed by experts), our method evolves a population…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Text and Document Classification Technologies · Machine Learning and Data Classification
