Subjective data models in bioinformatics: Do wet-lab and computational biologists comprehend data differently?
Yo Yehudi, Lukas Hughes-Noehrer, Carole Goble, Caroline Jay

TL;DR
This study explores how biological and computational researchers perceive and interpret biological data differently, revealing fluid subjective data models that vary with context and highlighting implications for data interface design and metadata provision.
Contribution
It introduces the concept of subjective data models in bioinformatics and empirically investigates how different users conceptualize the same biological data.
Findings
People have fluid subjective data models that change with context.
Data identifier formats influence ease of understanding.
Perceptions do not cluster by computational experience.
Abstract
Biological science produces large amounts of data in a variety of formats, which necessitates the use of computational tools to process, integrate, analyse, and glean insights from the data. Researchers who use computational biology tools range from those who use computers primarily for communication and data lookup, to those who write complex software programs in order to analyse data or make it easier for others to do so. This research examines how people differ in how they conceptualise the same data, for which we coin the term "subjective data models". We interviewed 22 people with biological experience and varied levels of computational experience to elicit their perceptions of the same subset of biological data entities. The results suggest that many people had fluid subjective data models that would change depending on the circumstance or tool they were using. Surprisingly,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices
