Improving Active Learning in Systematic Reviews

Gaurav Singh; James Thomas; John Shawe-Taylor

arXiv:1801.09496·cs.IR·January 30, 2018·6 cites

Improving Active Learning in Systematic Reviews

Gaurav Singh, James Thomas, John Shawe-Taylor

PDF

Open Access

TL;DR

This paper enhances active learning techniques for systematic reviews, reducing manual effort by proposing a novel algorithm and feature selection method, validated on extensive datasets across disciplines.

Contribution

It introduces a new active learning algorithm and a feature selection mechanism tailored for systematic review screening tasks.

Findings

01

Improved screening performance with the new active learning algorithm

02

Identified bias towards selecting similar documents in naive methods

03

Proposed feature extraction method selection enhances accuracy

Abstract

Systematic reviews are essential to summarizing the results of different clinical and social science studies. The first step in a systematic review task is to identify all the studies relevant to the review. The task of identifying relevant studies for a given systematic review is usually performed manually, and as a result, involves substantial amounts of expensive human resource. Lately, there have been some attempts to reduce this manual effort using active learning. In this work, we build upon some such existing techniques, and validate by experimenting on a larger and comprehensive dataset than has been attempted until now. Our experiments provide insights on the use of different feature extraction models for different disciplines. More importantly, we identify that a naive active learning based screening process is biased in favour of selecting similar documents. We aimed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Topic Modeling