T-FIX: Text-Based Explanations with Features Interpretable to eXperts

Shreya Havaldar; Weiqiu You; Chaehyeon Kim; Anton Xue; Helen Jin; Marco Gatti; Bhuvnesh Jain; Helen Qu; Amin Madani; Daniel A. Hashimoto; Gary E. Weissman; Rajat Deo; Sameed Khatana; Lyle Ungar; Eric Wong

arXiv:2511.04070·cs.CL·May 19, 2026

T-FIX: Text-Based Explanations with Features Interpretable to eXperts

Shreya Havaldar, Weiqiu You, Chaehyeon Kim, Anton Xue, Helen Jin, Marco Gatti, Bhuvnesh Jain, Helen Qu, Amin Madani, Daniel A. Hashimoto, Gary E. Weissman, Rajat Deo, Sameed Khatana, Lyle Ungar, Eric Wong

PDF

1 Repo 1 Video

TL;DR

T-FIX is a unified, automatic evaluation framework for assessing how well LLM explanations align with domain experts' reasoning across multiple scientific tasks, reducing reliance on costly annotations.

Contribution

It introduces a scalable, domain-grounded evaluation method for expert-aligned explanations applicable to unseen data, with a unified framework across diverse scientific fields.

Findings

01

Evaluated across seven scientific tasks in three domains.

02

Enables automatic and personalized assessment of expert alignment.

03

Generalizes to unseen explanations without ongoing expert input.

Abstract

As LLMs are deployed in knowledge-intensive settings (e.g., surgery, astronomy, therapy), users are often domain experts who expect not just answers, but explanations that mirror professional reasoning. Yet evaluating whether an LLM "thinks like an expert" remains difficult: existing approaches rely on per-example expert annotation, making them costly, hard to scale, and tied to a single notion of correct reasoning within each domain. To address this gap, we introduce T-FIX, a unified evaluation framework that operationalizes expert alignment as a desired attribute of LLM-generated explanations. T-FIX spans seven scientific tasks across three domains, with each task evaluated against expert-defined criteria that capture domain-grounded reasoning rather than generic explanation quality. Our framework enables automatic, personalizable evaluation of expert alignment that generalizes to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BrachioLab/FIX-2
github

Videos

T-FIX: Text-Based Explanations with Features Interpretable to eXperts· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Topic Modeling