A Brief Survey of Text Mining: Classification, Clustering and Extraction   Techniques

Mehdi Allahyari; Seyedamin Pouriyeh; Mehdi Assefi; Saied Safaei,; Elizabeth D. Trippe; Juan B. Gutierrez; Krys Kochut

arXiv:1707.02919·cs.CL·July 31, 2017·514 cites

A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques

Mehdi Allahyari, Seyedamin Pouriyeh, Mehdi Assefi, Saied Safaei,, Elizabeth D. Trippe, Juan B. Gutierrez, Krys Kochut

PDF

Open Access 1 Repo

TL;DR

This survey reviews fundamental text mining techniques such as classification, clustering, and extraction, highlighting their applications in biomedical and healthcare domains amidst increasing unstructured text data.

Contribution

It provides a comprehensive overview of core text mining tasks and techniques, including recent applications in biomedical and health care fields.

Findings

01

Summarizes key text mining methods and algorithms.

02

Highlights applications in biomedical and healthcare domains.

03

Emphasizes importance of efficient processing of unstructured text.

Abstract

The amount of text that is generated every day is increasing dramatically. This tremendous volume of mostly unstructured text cannot be simply processed and perceived by computers. Therefore, efficient and effective techniques and algorithms are required to discover useful patterns. Text mining is the task of extracting meaningful information from text, which has gained significant attentions in recent years. In this paper, we describe several of the most fundamental text mining tasks and techniques including text pre-processing, classification and clustering. Additionally, we briefly explain text mining in biomedical and health care domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RAJAT--PALIWAL/research_AI
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Biomedical Text Mining and Ontologies · Text and Document Classification Technologies