A Capability Maturity Model for Urban Dataset Meta-data
Mark S. Fox, Bart Gajderowicz, Dishu Lyu

TL;DR
This paper introduces a structured capability maturity model for urban dataset metadata, aiming to improve dataset discoverability and integration by guiding dataset creators through progressive metadata documentation levels.
Contribution
It proposes a dataset metadata maturity model inspired by capability maturity models from software engineering, and integrates it into CKAN through a custom plugin to improve metadata quality and dataset usability.
Findings
The maturity model delineates seven key dimensions with five levels each.
The CKAN plugin enables metadata enhancement and knowledge graph integration.
The approach improves dataset discoverability and relevance assessment.
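To make the "seven dimensions with five levels each" structure concrete, here is a minimal Python sketch of how a per-dataset assessment might be recorded. It is not the authors' implementation or the CKAN plugin's API. The dimension names are placeholders because this summary does not list them. The aggregation rule (overall maturity is the lowest level reached in any dimension, with 0 meaning not yet assessed) is an assumption made for illustration only.

```python
from __future__ import annotations

from dataclasses import dataclass, field

NUM_DIMENSIONS = 7  # stated in the summary: seven dimensions
NUM_LEVELS = 5      # stated in the summary: five levels per dimension

# Placeholder names (assumption): the summary does not name the dimensions.
DIMENSIONS = tuple(f"dimension_{i}" for i in range(1, NUM_DIMENSIONS + 1))


@dataclass
class MaturityAssessment:
    """Hypothetical record of the level one dataset's metadata reaches per dimension."""

    levels: dict[str, int] = field(default_factory=dict)

    def set_level(self, dimension: str, level: int) -> None:
        """Record a level for a dimension, rejecting unknown names and out-of-range levels."""
        if dimension not in DIMENSIONS:
            raise ValueError(f"unknown dimension: {dimension}")
        if not 1 <= level <= NUM_LEVELS:
            raise ValueError(f"level must be in 1..{NUM_LEVELS}")
        self.levels[dimension] = level

    def overall(self) -> int:
        """Assumed aggregation: the lowest level across all dimensions.

        Dimensions that have not been assessed count as level 0.
        """
        return min(self.levels.get(d, 0) for d in DIMENSIONS)
```

Under this assumed rule, a dataset documented to level 3 in six dimensions but only level 1 in the seventh would be reported at level 1 overall, which mirrors the idea of guiding creators through progressive documentation levels one gap at a time.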
Abstract
In the current environment of data generation and publication, there is an ever-growing number of datasets available for download. This growth precipitates an existing challenge: sourcing and integrating relevant datasets for analysis is becoming more complex. Despite efforts by open data platforms, obstacles remain, predominantly rooted in inadequate metadata, unsuitable data presentation, complications in pinpointing desired data, and data integration. This paper delves into the intricacies of dataset retrieval, emphasizing the pivotal role of metadata in aligning datasets with user queries. Through an exploration of existing literature, it underscores prevailing issues such as the identification of valuable metadata and the development of tools to maintain and annotate them effectively. The central contribution of this research is the proposition of a dataset metadata maturity model.…
Taxonomy
Topics: Human Mobility and Location-Based Analysis · Traffic Prediction and Management Techniques · Technology and Data Analysis
