Quantifying the informativeness for biomedical literature summarization:   An itemset mining method

Milad Moradi; Nasser Ghadiri

arXiv:1609.03067·cs.IR·November 26, 2018

Quantifying the informativeness for biomedical literature summarization: An itemset mining method

Milad Moradi, Nasser Ghadiri

PDF

TL;DR

This paper introduces a novel biomedical literature summarization method that combines itemset mining and domain knowledge to identify key subtopics and select the most informative sentences, improving summary quality.

Contribution

It presents a new concept-based summarization approach using itemset mining and UMLS concepts, outperforming existing statistical and frequency-based methods.

Findings

01

Achieves higher ROUGE scores than baseline methods

02

Effectively identifies core subtopics in biomedical texts

03

Enhances informativeness measurement of sentences

Abstract

Objective: Automatic text summarization tools can help users in the biomedical domain to access information efficiently from a large volume of scientific literature and other sources of text documents. In this paper, we propose a summarization method that combines itemset mining and domain knowledge to construct a concept-based model and to extract the main subtopics from an input document. Our summarizer quantifies the informativeness of each sentence using the support values of itemsets appearing in the sentence. Methods: To address the concept-level analysis of text, our method initially maps the original document to biomedical concepts using the UMLS. Then, it discovers the essential subtopics of the text using a data mining technique, namely itemset mining, and constructs the summarization model. The employed itemset mining algorithm extracts a set of frequent itemsets containing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.