Key Considerations for Domain Expert Involvement in LLM Design and Evaluation: An Ethnographic Study

Annalisa Szymanski; Oghenemaro Anuyah; Toby Jia-Jun Li; Ronald A. Metoyer

arXiv:2602.14357·cs.HC·February 17, 2026

Key Considerations for Domain Expert Involvement in LLM Design and Evaluation: An Ethnographic Study

Annalisa Szymanski, Oghenemaro Anuyah, Toby Jia-Jun Li, Ronald A. Metoyer

PDF

Open Access

TL;DR

This ethnographic study explores how development teams involving domain experts in LLM design and evaluation navigate practical challenges, revealing key practices and proposing design opportunities for more effective workflows.

Contribution

It provides an in-depth ethnographic analysis of real-world practices and challenges in involving domain experts in LLM development, highlighting new strategies and design considerations.

Findings

01

Teams create workarounds for data collection

02

Use augmentation when expert input is limited

03

Co-develop evaluation criteria with experts

Abstract

Large Language Models (LLMs) are increasingly developed for use in complex professional domains, yet little is known about how teams design and evaluate these systems in practice. This paper examines the challenges and trade-offs in LLM development through a 12-week ethnographic study of a team building a pedagogical chatbot. The researcher observed design and evaluation activities and conducted interviews with both developers and domain experts. Analysis revealed four key practices: creating workarounds for data collection, turning to augmentation when expert input was limited, co-developing evaluation criteria with experts, and adopting hybrid expert-developer-LLM evaluation strategies. These practices show how teams made strategic decisions under constraints and demonstrate the central role of domain expertise in shaping the system. Challenges included expert motivation and trust,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI