Automatic Integration Issues of Tabular Data for On-Line Analysis Processing
Yuzhao Yang (IRIT-SIG), J\'er\^ome Darmont (ERIC), Franck Ravat, (IRIT-SIG), Olivier Teste (IRIT-SIG)

TL;DR
This paper discusses the challenges of automatically integrating tabular data for online analysis and proposes an initial automatic solution focusing on schema generation and data feature analysis.
Contribution
It introduces a typology of tabular data and presents a novel approach for automatic multidimensional schema generation for online data analysis.
Findings
Proposed a typology of tabular data types
Developed an initial automatic schema generation method
Facilitated cross-analysis of tabular data online
Abstract
Companies and individuals produce numerous tabular data. The objective of this position paper is to draw up the challenges posed by the automatic integration of data in the form of tables so that they can be cross-analyzed. We provide a first automatic solution for the integration of such tabular data to allow On-Line Analysis Processing. To fulfil this task, features of tabular data should be analyzed and the challenge of automatic multidimensional schema generation should be addressed. Hence, we propose a typology of tabular data and discuss our idea of an automatic solution.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies · Advanced Database Systems and Queries
