Sycamore: Characterizing Synthetic Personas for Evaluating Genomics Visualization Retrieval
Huyen N. Nguyen, Astrid van den Brandt, Nils Gehlenborg

TL;DR
Sycamore investigates how grounded versus ungrounded synthetic personas influence the evaluation of genomics visualization tools, revealing their alignment and divergence from real expert feedback.
Contribution
This study introduces a three-condition probe design to assess the impact of grounding synthetic personas in genomics visualization evaluation.
Findings
Grounded synthetic personas align more with documented user concerns.
Ungrounded personas focus on operational details not raised by real users.
Both synthetic conditions miss certain image-modality preferences of real experts.
Abstract
Evaluating visualization systems in niche domains such as genomics is challenging due to scarcity of domain experts and difficulty recruiting a representative user base. While LLM-based synthetic personas are increasingly used to ease evaluation bottlenecks, they face well-founded skepticism. Rather than weighing synthetic personas as substitutes for real users, we ask a fundamental open question: when synthetic personas evaluate a real visualization system, what do they actually produce, and how does that output change when grounded in documented human contexts? We present Sycamore, an exploratory three-condition probe design using Geranium, a search engine for multimodal genomics visualization, as a case study. Sycamore evaluates Geranium using: (1) ungrounded synthetic personas from generic LLM priors; (2) grounded synthetic personas constrained by voice-of-customer artifacts from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
