KG-Hub -- Building and Exchanging Biological Knowledge Graphs
J Harry Caufield, Tim Putman, Kevin Schaper, Deepak R Unni, Harshad, Hegde, Tiffany J Callahan, Luca Cappelletti, Sierra AT Moxon, Vida Ravanmehr,, Seth Carbon, Lauren E Chan, Katherina Cortes, Kent A Shefchek, Glass, Elsarboukh, James P Balhoff, Tommaso Fontana

TL;DR
KG-Hub is a platform that standardizes the construction, exchange, and reuse of biological knowledge graphs, integrating diverse data sources and supporting advanced graph analysis and machine learning applications.
Contribution
Introduces KG-Hub, a comprehensive platform for building, sharing, and analyzing biological knowledge graphs with standardized data models and integrated ML tools.
Findings
Supports diverse biological research use cases
Enables automated graph machine learning workflows
Facilitates data sharing and reproducibility in biology
Abstract
Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of knowledge graphs is lacking. Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of knowledge graphs. Features include a simple, modular extract-transform-load (ETL) pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Biomedical Text Mining and Ontologies · Gene expression and cancer classification
