An Efficient Metric of Automatic Weight Generation for Properties in   Instance Matching Technique

Md. Hanif Seddiqui; Rudra Pratap Deb Nath; Masaki Aono

arXiv:1502.03556·cs.AI·February 13, 2015

An Efficient Metric of Automatic Weight Generation for Properties in Instance Matching Technique

Md. Hanif Seddiqui, Rudra Pratap Deb Nath, Masaki Aono

PDF

TL;DR

This paper proposes a new mathematical metric for automatically generating property weights in instance matching, improving efficiency by leveraging information content principles.

Contribution

It introduces a novel metric based on property value ratios and information theory to enhance automatic weight generation in instance matching.

Findings

01

The proposed metric improves instance matching efficiency.

02

Experimental results validate the effectiveness of the metric.

03

The approach outperforms existing weight generation methods.

Abstract

The proliferation of heterogeneous data sources of semantic knowledge base intensifies the need of an automatic instance matching technique. However, the efficiency of instance matching is often influenced by the weight of a property associated to instances. Automatic weight generation is a non-trivial, however an important task in instance matching technique. Therefore, identifying an appropriate metric for generating weight for a property automatically is nevertheless a formidable task. In this paper, we investigate an approach of generating weights automatically by considering hypotheses: (1) the weight of a property is directly proportional to the ratio of the number of its distinct values to the number of instances contain the property, and (2) the weight is also proportional to the ratio of the number of distinct values of a property to the number of instances in a training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.