10 Simple Rules for Improving Your Standardized Fields and Terms
Rhiannon Cameron (1), Emma Griffiths (1), Damion Dooley (1), William Hsiao (1) ((1) Centre for Infectious Disease Genomics, One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada)

TL;DR
This paper offers practical guidelines and strategies for improving the standardization of research metadata fields and terms, enhancing data findability, sharing, and reuse.
Contribution
It provides a set of ten actionable rules for effective vocabulary standardization, grounded in real-world experience and aligned with FAIR principles.
Findings
Identifies common challenges like semantic noise and concept bombs.
Provides practical strategies for vocabulary harmonization.
Emphasizes alignment with FAIR principles.
Abstract
Contextual metadata is the unsung hero of research data. When done right, standardized and structured vocabularies make your data findable, shareable, and reusable. When done wrong, they turn a well intended effort into data cleanup and curation nightmares. In this paper we tackle the surprisingly tricky process of vocabulary standardization with a mix of practical advice and grounded examples. Drawing from real-world experience in contextual data harmonization, we highlight common challenges (e.g., semantic noise and concept bombs) and provide actionable strategies to address them. Our rules emphasize alignment with Findability, Accessibility, Interoperability, and Reusability (FAIR) principles while remaining adaptable to evolving user and research needs. Whether you are curating datasets, designing a schema, or contributing to a standards body, these rules aim to help you create…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
