Blended Integrated Open Data: dados abertos p\'ublicos integrados
Fabiola Santore, Lucas F. Oliveira, Rafael de Paulo Dias, Henrique V., Ehrenfried, Alessandro Elias, Diego Pasqualin, Luis C. E. de Bona, Marcos, Didonet Del Fabro, Marcos Sunye

TL;DR
The paper introduces BIOD, a system that integrates over 300GB of open public data, enabling easier access, querying, and data production to facilitate research and analysis across disconnected datasets.
Contribution
It presents a comprehensive platform that consolidates diverse open data sources, simplifying access and enabling new data generation for improved usability.
Findings
Integrated over 300GB of open data from multiple sources
Enabled complex queries across previously disconnected datasets
Provided methods for producing compatible new data
Abstract
While several public institutions provide its data openly, the effort required to access, integrate and query this data is too high, reducing the amount of possible dataset users. The Blended Integrated Open Data (BIOD) project has as objective to ease the access to public Open Data. It integrates and makes available more than 300Gb of data, containing billions of records from different Open Data Sets, allowing to query over them, and thus to retrieve related information from originally disconnected data sets. This paper presents the set of open data available, how to access it and how produce new compatible data to improve the existing data set.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Big Data and Business Intelligence
