Schema Matching using Machine Learning

Tanvi Sahay; Ankita Mehta; Shruti Jadon

arXiv:1911.11543·cs.DB·April 22, 2020

Schema Matching using Machine Learning

Tanvi Sahay, Ankita Mehta, Shruti Jadon

PDF

TL;DR

This paper presents a hybrid machine learning approach for schema matching that combines data and schema names, introduces a global dictionary for one-to-many matching, and compares different methods based on performance metrics.

Contribution

It introduces a novel hybrid approach utilizing data and schema names, along with a global dictionary, for improved schema matching.

Findings

01

The hybrid approach achieves competitive F-scores, precision, and recall.

02

Comparison shows advantages over previous methods.

03

Global dictionary enhances one-to-many schema matching.

Abstract

Schema Matching is a method of finding attributes that are either similar to each other linguistically or represent the same information. In this project, we take a hybrid approach at solving this problem by making use of both the provided data and the schema name to perform one to one schema matching and introduce the creation of a global dictionary to achieve one to many schema matching. We experiment with two methods of one to one matching and compare both based on their F-scores, precision, and recall. We also compare our method with the ones previously suggested and highlight differences between them.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.