# Extracting Factual Min/Max Age Information from Clinical Trial Studies

**Authors:** Yufang Hou, Debasis Ganguly, Lea A. Deleris, Francesca Bonin

arXiv: 1904.03262 · 2019-04-09

## TL;DR

This paper presents a neural network-based question answering approach to extract and validate minimum and maximum age information from clinical trial articles, outperforming previous methods.

## Contribution

It introduces a novel QA model trained on clinical trial data to accurately extract and verify age range information from research articles.

## Key findings

- Significant improvement over passage retrieval and CRF-based systems
- Effective filtering of non-factual age expressions
- High accuracy on a dataset of smoking cessation studies

## Abstract

Population age information is an essential characteristic of clinical trials. In this paper, we focus on extracting minimum and maximum (min/max) age values for the study samples from clinical research articles. Specifically, we investigate the use of a neural network model for question answering to address this information extraction task. The min/max age QA model is trained on the massive structured clinical study records from ClinicalTrials.gov. For each article, based on multiple min and max age values extracted from the QA model, we predict both actual min/max age values for the study samples and filter out non-factual age expressions. Our system improves the results over (i) a passage retrieval based IE system and (ii) a CRF-based system by a large margin when evaluated on an annotated dataset consisting of 50 research papers on smoking cessation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.03262/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1904.03262/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1904.03262/full.md

---
Source: https://tomesphere.com/paper/1904.03262