# Prediction of relevant biomedical documents: a human microbiome case study

**Authors:** Paul Thompson, Juliette C. Madan, Jason H. Moore

PMC · DOI: 10.1186/s13040-015-0061-5 · BioData Mining · 2015-09-10

## TL;DR

This paper shows how using a researcher's feedback on document relevance can improve the ranking of biomedical documents, making it easier to find relevant studies.

## Contribution

The study demonstrates that relevance feedback can accurately predict relevant documents beyond just using recency.

## Key findings

- Relevance judgments from researchers can predict document relevance with high accuracy.
- The method works across different time periods, as shown by training on 2008–2010 and testing on 2011 documents.
- Relevance feedback improves document retrieval beyond simple recency-based ranking.

## Abstract

Retrieving relevant biomedical literature has become increasingly difficult due to the large volume and rapid growth of biomedical publication. A query to a biomedical retrieval system often retrieves hundreds of results. Since the searcher will not likely consider all of these documents, ranking the documents is important. Ranking by recency, as PubMed does, takes into account only one factor indicating potential relevance. This study explores the use of the searcher’s relevance feedback judgments to support relevance ranking based on features more general than recency.

It was found that the researcher’s relevance judgments could be used to accurately predict the relevance of additional documents: both using tenfold cross-validation and by training on publications from 2008–2010 and testing on documents from 2011.

This case study has shown the promise for relevance feedback to improve biomedical document retrieval. A researcher’s judgments as to which initially retrieved documents are relevant, or not, can be leveraged to predict additional relevant documents.

## Full-text entities

- **Diseases:** NASA (MESH:D008158), infection (MESH:D007239), UMLS (MESH:D007806), inflammation (MESH:D007249), sepsis (MESH:D018805), ARRF (MESH:D020969), MEDLARS (MESH:D000069279)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC4564977/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC4564977/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC4564977/full.md

---
Source: https://tomesphere.com/paper/PMC4564977