BDIViz in Action: Interactive Curation and Benchmarking for Schema Matching Methods

Eden Wu; Christos Koutras; Cl\'audio T. Silva; and Juliana Freire

arXiv:2604.10763·cs.IR·April 14, 2026

BDIViz in Action: Interactive Curation and Benchmarking for Schema Matching Methods

Eden Wu, Christos Koutras, Cl\'audio T. Silva, and Juliana Freire

PDF

TL;DR

BDIViz is an interactive visualization system that facilitates schema matching, validation, and benchmarking with human-in-the-loop capabilities, leveraging LLM assistance for data integration tasks.

Contribution

The paper introduces an extension to BDIViz enabling human-in-the-loop benchmarking, custom matcher integration, and real-time performance evaluation for schema matching methods.

Findings

01

Supports human validation of schema matches in an interactive heatmap.

02

Enables real-time benchmarking and comparison of different matching algorithms.

03

Facilitates high-quality ground-truth dataset creation through expert validation.

Abstract

Schema matching remains fundamental to data integration, yet evaluating and comparing matching methods is hindered by limited benchmark diversity and lack of interactive validation frameworks. BDIViz, recently published at IEEE VIS 2025, is an interactive visualization system for schema matching with LLM-assisted validation. Given source and target datasets, BDIViz applies automatic matching methods and visualizes candidates in an interactive heatmap with hierarchical navigation, zoom, and filtering. Users validate matches directly in the heatmap and inspect ambiguous cases using coordinated views that show attribute descriptions, example values, and distributions. An LLM assistant generates structured explanations for selected candidates to support decision-making. This demonstration showcases a new extension to BDIViz that addresses a critical need in data integration research:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.