Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases
Marcos Mart\'inez-Romero, Martin J. O'Connor, Attila L. Egyedi, Debra, Willrett, Josef Hardi, John Graybeal, Mark A. Musen

TL;DR
This paper presents a novel system that uses association rule mining and ontologies to provide real-time, accurate metadata recommendations for biomedical datasets, improving metadata quality and authoring efficiency.
Contribution
It introduces a method that combines association rule mining across multiple repositories with ontology alignment to enhance metadata recommendation accuracy.
Findings
Effective in generating accurate metadata recommendations
Able to analyze data from multiple repositories simultaneously
Enhances metadata quality through ontology integration
Abstract
Metadata-the machine-readable descriptions of the data-are increasingly seen as crucial for describing the vast array of biomedical datasets that are currently being deposited in public repositories. While most public repositories have firm requirements that metadata must accompany submitted datasets, the quality of those metadata is generally very poor. A key problem is that the typical metadata acquisition process is onerous and time consuming, with little interactive guidance or assistance provided to users. Secondary problems include the lack of validation and sparse use of standardized terms or ontologies when authoring metadata. There is a pressing need for improvements to the metadata acquisition process that will help users to enter metadata quickly and accurately. In this paper we outline a recommendation system for metadata that aims to address this challenge. Our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
