NEMA: Automatic Integration of Large Network Management Databases
Fubao Wu, Han Hee Song, Jiangtao Yin, Lixin Gao, Mario Baldi, Narendra, Anand

TL;DR
NEMA is an automated technique that uses instance-level matching to effectively integrate large, heterogeneous network management databases with high accuracy and efficiency.
Contribution
This paper introduces NEMA, a novel instance-level matching method tailored for large network management databases, outperforming existing schema and string similarity approaches.
Findings
Achieves up to 95% accuracy in large database matching
Outperforms baseline methods by 2-10% in accuracy
Provides 5x-14x faster matching speed
Abstract
Network management, whether for malfunction analysis, failure prediction, performance monitoring and improvement, generally involves large amounts of data from different sources. To effectively integrate and manage these sources, automatically finding semantic matches among their schemas or ontologies is crucial. Existing approaches on database matching mainly fall into two categories. One focuses on the schema-level matching based on schema properties such as field names, data types, constraints and schema structures. Network management databases contain massive tables (e.g., network products, incidents, security alert and logs) from different departments and groups with nonuniform field names and schema characteristics. It is not reliable to match them by those schema properties. The other category is based on the instance-level matching using general string similarity techniques,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies · Service-Oriented Architecture and Web Services
