DatAasee -- A Metadata-Lake as Metadata Catalog for a Virtual Data-Lake
Christian Himpe

TL;DR
This paper introduces DatAasee, a metadata-lake architecture designed to improve metadata management for distributed data sources, demonstrated through a proof-of-concept implementation and initial evaluation.
Contribution
It proposes a novel metadata-lake architecture specifically tailored for research data and library settings, with a working implementation and preliminary assessment.
Findings
A proof-of-concept implementation of the metadata-lake is presented
A brief initial evaluation indicates that the approach is feasible
The work targets the long-standing problem of metadata management for distributed data sources
Abstract
Metadata management for distributed data sources is a long-standing but ever-growing problem. To counter this challenge in a research-data and library-oriented setting, this work constructs a data architecture, derived from the data-lake: the metadata-lake. A proof-of-concept implementation of this proposed metadata aggregator is presented and briefly evaluated.
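The aggregation idea in the abstract can be illustrated with a minimal sketch: records from several sources, each with its own field names, are mapped onto one unified schema and collected in a single catalog. Everything here is an assumption made for illustration. The `CatalogRecord` schema, the `normalize` and `aggregate` helpers, the field maps, and the sample records are hypothetical and do not reflect DatAasee's actual data model or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogRecord:
    """Unified metadata record (hypothetical schema, not DatAasee's)."""
    source: str
    identifier: str
    title: str
    creators: tuple


def normalize(source, raw, field_map):
    """Map one source-specific record onto the unified schema."""
    creators = raw.get(field_map["creators"], [])
    if isinstance(creators, str):
        # Some sources store a single creator as a plain string.
        creators = [creators]
    return CatalogRecord(
        source=source,
        identifier=str(raw[field_map["identifier"]]),
        title=str(raw.get(field_map["title"], "")).strip(),
        creators=tuple(creators),
    )


def aggregate(sources):
    """Collect records from all sources, keyed by (source, identifier)."""
    catalog = {}
    for name, (records, field_map) in sources.items():
        for raw in records:
            record = normalize(name, raw, field_map)
            catalog[(record.source, record.identifier)] = record
    return catalog


# Two hypothetical sources that describe their records with different field names.
SOURCES = {
    "repository": (
        [{"doi": "10.0000/example-1", "name": " Survey data ", "authors": ["A. Author"]}],
        {"identifier": "doi", "title": "name", "creators": "authors"},
    ),
    "library": (
        [{"id": 42, "title": "Catalog entry", "creator": "B. Writer"}],
        {"identifier": "id", "title": "title", "creators": "creator"},
    ),
}

catalog = aggregate(SOURCES)
```

Only metadata is copied into the catalog; the data itself stays at the source, which is what would make the resulting data-lake "virtual" in the sense of the title.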
