MIFA: Metadata, Incentives, Formats, and Accessibility guidelines to improve the reuse of AI datasets for bioimage analysis
Teresa Zulueta-Coarasa, Florian Jug, Aastha Mathur, Josh Moore, Arrate Mu\~noz-Barrutia, Liviu Anita, Kola Babalola, Pete Bankhead, Perrine Gilloteaux, Nodar Gogoberidze, Martin Jones, Gerard J. Kleywegt, Paul Korir, Anna Kreshuk, Ayb\"uke K\"upc\"u Yolda\c{s}, Luca Marconato

TL;DR
The paper introduces MIFA guidelines to standardize bioimage data sharing, aiming to enhance AI development in bioimage analysis by improving dataset accessibility and quality.
Contribution
It presents a comprehensive set of standards and incentives to improve bioimage dataset sharing and reuse for AI applications.
Findings
Development of community-driven guidelines for data sharing.
Expected acceleration of AI tool development in bioimaging.
Promotion of standardized formats and metadata for datasets.
Abstract
Artificial Intelligence methods are powerful tools for biological image analysis and processing. High-quality annotated images are key to training and developing new methods, but access to such data is often hindered by the lack of standards for sharing datasets. We brought together community experts in a workshop to develop guidelines to improve the reuse of bioimages and annotations for AI applications. These include standards on data formats, metadata, data presentation and sharing, and incentives to generate new datasets. We are positive that the MIFA (Metadata, Incentives, Formats, and Accessibility) recommendations will accelerate the development of AI tools for bioimage analysis by facilitating access to high quality training data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Genetics, Bioinformatics, and Biomedical Research · Scientific Computing and Data Management
