Evaluation of Semantic Metadata Pair Modelling Using Data Clustering
Hiba Khalid, Esteban Zimanyi, Robert Wrembel

TL;DR
This paper proposes a data clustering-based approach to model semantic metadata pairs, enabling better connection and understanding between disintegrated datasets through a generic metadata representation and relevance mapping.
Contribution
It introduces a mapper function called cognate for finding mathematical relevance between attribute pairs, and uses clustering to group similar metadata, enhancing dataset linkage.
Findings
Effective clustering of metadata pairs based on similarity indices
Generation of meta-pointers for domain representation and synonym detection
Improved dataset connection through semantic metadata modeling
Abstract
Metadata presents a medium for connection, elaboration, examination, and comprehension of relativity between two datasets. Metadata can be enriched to calculate the existence of a connection between different disintegrated datasets. In order to do so, the very first task is to attain a generic metadata representation for domains. This representation narrows down the metadata search space. The metadata search space consists of attributes, tags, semantic content, annotations etc. to perform classification. The existing technologies limit the metadata bandwidth i.e. the operation set for matching purposes is restricted or limited. This research focuses on generating a mapper function called cognate that can find mathematical relevance based on pairs of attributes between disintegrated datasets. Each pair is designed from one of the datasets under consideration using the existing metadata…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Scientific Computing and Data Management
