DOME Registry: Implementing community-wide recommendations for reporting supervised machine learning in biology
Omar Abdelghani Attafi (1), Damiano Clementel (1), Konstantinos, Kyritsis (2), Emidio Capriotti (3), Gavin Farrell (4), Styliani-Christina, Fragkouli (2, 5), Leyla Jael Castro (6), Andr\'as Hatos (7, 8, 9, 10),, Tom Lenaerts (11, 12, 13), Stanislav Mazurenko (14, 15), Soroush

TL;DR
The paper introduces the DOME Registry, a tool to standardize reporting and improve reproducibility of supervised machine learning research in biology by providing structured documentation and evaluation standards.
Contribution
It presents the DOME Registry, a platform for managing and assessing ML studies in biology using community-driven standards and unique identifiers.
Findings
The registry enables comprehensive documentation of ML studies.
It assigns DOME scores to evaluate adherence to standards.
Future plans include expanding community curation and publisher adoption.
Abstract
Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The DOME recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME Registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Genetics, Bioinformatics, and Biomedical Research
MethodsSparse Evolutionary Training · Wizard: Unsupervised goats tracking algorithm
