TL;DR
This paper evaluates the impact of different background knowledge sources and strategies on schema matching, finding explicit strategies outperform latent ones and BabelNet as a consistently strong resource.
Contribution
It systematically compares six knowledge graphs and three exploitation strategies, highlighting the importance of strategy choice over dataset selection in schema matching.
Findings
Explicit strategies outperform latent ones.
Strategy choice impacts results more than background dataset.
BabelNet provides consistently good matching performance.
Abstract
The use of external background knowledge can be beneficial for the task of matching schemas or ontologies automatically. In this paper, we exploit six general-purpose knowledge graphs as sources of background knowledge for the matching task. The background sources are evaluated by applying three different exploitation strategies. We find that explicit strategies still outperform latent ones and that the choice of the strategy has a greater impact on the final alignment than the actual background dataset on which the strategy is applied. While we could not identify a universally superior resource, BabelNet achieved consistently good results. Our best matcher configuration with BabelNet performs very competitively when compared to other matching systems even though no dataset-specific optimizations were made.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
