BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets
Yifan Gao, Zakariyya Mughal, Jose A. Jaramillo-Villegas, Marie, Corradi, Alexandre Borrel, Ben Lieberman, Suliman Sharif, John Shaffer,, Karamarie Fecho, Ajay Chatrath, Alexandra Maertens, Marc A.T. Teunis, Nicole, Kleinstreuer, Thomas Hartung, Thomas Luechtefeld

TL;DR
BioBricks.ai is a centralized, versioned data registry that simplifies access, management, and integration of diverse life sciences datasets, accelerating research workflows and data sharing.
Contribution
It introduces a versioned data registry platform with tools for managing and integrating biomedical datasets, reducing redundancy and improving data accessibility.
Findings
Over ninety datasets available on BioBricks.ai
Provides a package manager-like system for data dependencies
Supports updateable data pipelines for integration
Abstract
Researchers in biomedical research, public health, and the life sciences often spend weeks or months discovering, accessing, curating, and integrating data from disparate sources, significantly delaying the onset of actual analysis and innovation. Instead of countless developers creating redundant and inconsistent data pipelines, BioBricks.ai offers a centralized data repository and a suite of developer-friendly tools to simplify access to scientific data. Currently, BioBricks.ai delivers over ninety biological and chemical datasets. It provides a package manager-like system for installing and managing dependencies on data sources. Each 'brick' is a Data Version Control git repository that supports an updateable pipeline for extraction, transformation, and loading data into the BioBricks.ai backend at https://biobricks.ai. Use cases include accelerating data science workflows and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
