It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Benjamin H\"attasch; Michael Truong-Ngoc; Andreas Schmidt; Carsten; Binnig

arXiv:2203.04366·cs.DB·March 10, 2022·5 cites

It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Benjamin H\"attasch, Michael Truong-Ngoc, Andreas Schmidt, Carsten, Binnig

PDF

Open Access

TL;DR

This paper introduces a two-step neural embedding-based method for schema matching, significantly improving the accuracy and robustness of identifying semantic correspondences between data schemas, reducing manual effort in data integration.

Contribution

It presents a novel end-to-end schema matching approach using embeddings at both table and attribute levels, outperforming traditional methods in finding complex correspondences.

Findings

01

Robust and reliable schema matching results.

02

Ability to identify non-trivial correspondences.

03

Outperforms traditional schema matching approaches.

Abstract

Since data is often stored in different sources, it needs to be integrated to gather a global view that is required in order to create value and derive knowledge from it. A critical step in data integration is schema matching which aims to find semantic correspondences between elements of two schemata. In order to reduce the manual effort involved in schema matching, many solutions for the automatic determination of schema correspondences have already been developed. In this paper, we propose a novel end-to-end approach for schema matching based on neural embeddings. The main idea is to use a two-step approach consisting of a table matching step followed by an attribute matching step. In both steps we use embeddings on different levels either representing the whole table or single attributes. Our results show that our approach is able to determine correspondences in a robust and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling