ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Shuyue Stella Li; Jimin Mun; Faeze Brahman; Pedram Hosseini; Bryceton G. Thomas; Jessica M. Sin; Bing Ren; Jonathan S. Ilgen; Yulia Tsvetkov; Maarten Sap

arXiv:2502.14860·cs.CL·August 12, 2025

ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Shuyue Stella Li, Jimin Mun, Faeze Brahman, Pedram Hosseini, Bryceton G. Thomas, Jessica M. Sin, Bing Ren, Jonathan S. Ilgen, Yulia Tsvetkov, Maarten Sap

PDF

Open Access 1 Repo 2 Datasets

TL;DR

This paper introduces ALFA, a framework that enhances large language models' ability to ask effective questions by decomposing question quality into attributes, synthesizing variations, and aligning models through preference optimization, demonstrated in clinical reasoning.

Contribution

ALFA provides a novel, attribute-based approach to improve LLM question-asking, with a case study in healthcare that shows significant reduction in diagnostic errors.

Findings

01

Models aligned with ALFA reduce diagnostic errors by 56.6%.

02

Achieved a question-level win-rate of 64.4%.

03

Demonstrated strong generalizability across tasks.

Abstract

Large language models (LLMs) often fail to ask effective questions under uncertainty, making them unreliable in domains where proactive information-gathering is essential for decision-making. We present ALignment via Fine-grained Attributes, (ALFA) a framework that improves LLM question-asking by (i) decomposing the notion of a "good" question into a set of theory-grounded attributes (e.g., clarity, relevance), (ii) controllably synthesizing attribute-specific question variations, and (iii) aligning models via preference-based optimization to explicitly learn to ask better questions along these fine-grained attributes. Focusing on clinical reasoning as a case study, we introduce the MediQ-AskDocs dataset, composed of 17k real-world clinical interactions augmented with 80k attribute-specific preference pairs of follow-up questions, as well as a novel expert-annotated interactive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stellalisy/alfa
pytorchOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law

MethodsSparse Evolutionary Training