Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph
Yiming Lin, Yeye He, Surajit Chaudhuri

TL;DR
Auto-BI is an innovative system that automatically constructs business intelligence models by predicting table joins using a graph-based optimization approach, significantly reducing manual effort and improving accuracy.
Contribution
We introduce a novel graph-theoretical optimization framework called k-MCA for automatic BI-model construction, with algorithms that are both optimal and scalable.
Findings
Achieves over 0.9 F1-score on real and synthetic benchmarks.
Operates with sub-second latency on large BI-models.
Effectively scales to nearly 100 tables in practice.
Abstract
Business Intelligence (BI) is crucial in modern enterprises and billion-dollar business. Traditionally, technical experts like database administrators would manually prepare BI-models (e.g., in star or snowflake schemas) that join tables in data warehouses, before less-technical business users can run analytics using end-user dashboarding tools. However, the popularity of self-service BI (e.g., Tableau and Power-BI) in recent years creates a strong demand for less technical end-users to build BI-models themselves. We develop an Auto-BI system that can accurately predict BI models given a set of input tables, using a principled graph-based optimization problem we propose called \textit{k-Min-Cost-Arborescence} (k-MCA), which holistically considers both local join prediction and global schema-graph structures, leveraging a graph-theoretical structure called \textit{arborescence}. While…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Big Data and Business Intelligence
