Autodiscover: A reinforcement learning recommendation system for the cold-start imbalance challenge in active learning, powered by graph-aware thompson sampling
Parsa Vares

TL;DR
AutoDiscover is an adaptive reinforcement learning framework that improves systematic literature review screening by dynamically selecting query strategies using graph-aware representations and Thompson sampling, especially effective in cold-start scenarios.
Contribution
It introduces a novel RL-based approach with graph attention networks and Thompson sampling to adaptively manage query strategies in active learning for literature screening.
Findings
Outperforms static active learning baselines on 26 datasets.
Effectively mitigates cold-start issues with minimal initial labels.
Provides an open-source dashboard for decision interpretation.
Abstract
Systematic literature reviews (SLRs) are fundamental to evidence-based research, but manual screening is an increasing bottleneck as scientific output grows. Screening features low prevalence of relevant studies and scarce, costly expert decisions. Traditional active learning (AL) systems help, yet typically rely on fixed query strategies for selecting the next unlabeled documents. These static strategies do not adapt over time and ignore the relational structure of scientific literature networks. This thesis introduces AutoDiscover, a framework that reframes AL as an online decision-making problem driven by an adaptive agent. Literature is modeled as a heterogeneous graph capturing relationships among documents, authors, and metadata. A Heterogeneous Graph Attention Network (HAN) learns node representations, which a Discounted Thompson Sampling (DTS) agent uses to dynamically manage a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning and Algorithms · Data Visualization and Analytics
