Supervised Machine Learning for Extractive Query Based Summarisation of Biomedical Data

Mandeep Kaur; Diego Mollá

arXiv:1809.05268·cs.CL·December 7, 2018

TL;DR

This paper evaluates supervised machine learning methods for extractive query-based summarisation of biomedical literature, finding that classification approaches outperform regression methods in this context.

Contribution

It introduces a simple annotation approach for training classifiers and demonstrates its effectiveness over regression-based methods for biomedical summarisation.

Findings

01

Classification approaches outperform regression approaches for query-based extractive summarisation on the BioASQ Challenge data.

02

A simple approach to annotating sentences is enough to train classifiers that outperform regression-based summarisation.

03

The study uses BioASQ Challenge data for evaluation.

Abstract

The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available on-line. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackle the problem of annotating sentences for training classification systems and show that a simple annotation approach outperforms regression-based summarisation.
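The difference between the regression and classification setups described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the annotation scheme, so everything here is an assumption made for illustration: the unigram-overlap F1 stands in for a ROUGE-style score, the "top k sentences are positive" rule is one possible simple labelling, and the names `unigram_f1`, `make_targets` and `top_k` are hypothetical.

```python
from collections import Counter


def unigram_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1: a crude stand-in for a ROUGE-style score (assumption)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped count of shared tokens
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def make_targets(sentences, reference, top_k=1):
    """Return (regression targets, classification labels) for candidate sentences.

    Regression target: the overlap score itself.
    Classification label: 1 for the top_k highest-scoring sentences, else 0
    (a hypothetical labelling rule, not necessarily the one used in the paper).
    """
    scores = [unigram_f1(s, reference) for s in sentences]
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    positives = set(ranked[:top_k])
    labels = [1 if i in positives else 0 for i in range(len(sentences))]
    return scores, labels


# Toy data, invented for this sketch (not taken from BioASQ).
sentences = [
    "aspirin reduces the risk of stroke",
    "the study enrolled patients in 2010",
    "patients received aspirin daily",
]
reference = "aspirin lowers stroke risk"

scores, labels = make_targets(sentences, reference, top_k=2)
```

A regressor would be trained to predict `scores` from sentence features, whereas a classifier would be trained on `labels`; in both cases the highest-ranked sentences at prediction time would form the extractive summary.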
