IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR

Karun Sharma; Vidushee Vats; Shengzhi Li; Yuxiang Wang; Zhongtian Sun; Prayag Tiwari

arXiv:2602.15849·cs.CL·March 9, 2026

IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR

Karun Sharma, Vidushee Vats, Shengzhi Li, Yuxiang Wang, Zhongtian Sun, Prayag Tiwari

PDF

Open Access 1 Models

TL;DR

IntelliAsk leverages reinforcement learning with a reward model trained on expert-annotated reviewer questions to generate high-quality, evidence-based research questions that outperform baselines in grounding and effort.

Contribution

The paper introduces IntelliAsk, a novel RL-based question generation model trained with a new reward model, improving the quality of research questions generated by LLMs.

Findings

01

IntelliAsk generates more grounded, substantive questions than baselines.

02

It reduces reliance on first-page content in questions.

03

Improves reasoning and writing benchmark scores.

Abstract

Peer review relies on substantive, evidence-based questions, yet current LLMs generate surface-level queries that perform worse than human reviewer questions in expert evaluation. To address this gap, we curate a high-quality dataset of reviewer questions from OpenReview and conduct a human preference study where expert annotators evaluate question-paper pairs across three dimensions: effort, evidence, and grounding. From these annotations, we train IntelliReward, a reward model built from a frozen autoregressive LLM with trainable multi-head transformers. Validated against expert judgments, IntelliReward predicts reviewer-question quality better than API-based SFT baselines and provides scalable evaluation. We apply Decoupled Clip and Dynamic Sampling Policy Optimization (DAPO) with IntelliReward to train IntelliAsk, a question-generation model aligned with human standards of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
anonymousatom/IntelliAsk-Qwen3-32B-450-Merged
model· 55 dl
55 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExpert finding and Q&A systems · Topic Modeling · Machine Learning in Materials Science