NEMA: Automatic Integration of Large Network Management Databases

Fubao Wu; Han Hee Song; Jiangtao Yin; Lixin Gao; Mario Baldi; Narendra; Anand

arXiv:2006.01294·cs.DB·June 3, 2020

NEMA: Automatic Integration of Large Network Management Databases

Fubao Wu, Han Hee Song, Jiangtao Yin, Lixin Gao, Mario Baldi, Narendra, Anand

PDF

Open Access

TL;DR

NEMA is an automated technique that uses instance-level matching to effectively integrate large, heterogeneous network management databases with high accuracy and efficiency.

Contribution

This paper introduces NEMA, a novel instance-level matching method tailored for large network management databases, outperforming existing schema and string similarity approaches.

Findings

01

Achieves up to 95% accuracy in large database matching

02

Outperforms baseline methods by 2-10% in accuracy

03

Provides 5x-14x faster matching speed

Abstract

Network management, whether for malfunction analysis, failure prediction, performance monitoring and improvement, generally involves large amounts of data from different sources. To effectively integrate and manage these sources, automatically finding semantic matches among their schemas or ontologies is crucial. Existing approaches on database matching mainly fall into two categories. One focuses on the schema-level matching based on schema properties such as field names, data types, constraints and schema structures. Network management databases contain massive tables (e.g., network products, incidents, security alert and logs) from different departments and groups with nonuniform field names and schema characteristics. It is not reliable to match them by those schema properties. The other category is based on the instance-level matching using general string similarity techniques,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Semantic Web and Ontologies · Service-Oriented Architecture and Web Services