Data streaming platform for crowd-sourced vehicle dataset generation
Felipe Mogollon, Zaloa Fernandez, Angel Martin, Juan Diego Ortega and, Gorka Velez

TL;DR
This paper presents an edge-cloud data platform for vehicle data collection and processing, addressing key challenges like privacy and interoperability, and evaluates its performance with different data types and connectivity technologies.
Contribution
It introduces a novel edge-cloud platform for heterogeneous vehicle data, analyzing performance limits and connectivity impacts to optimize data processing in automotive IoT.
Findings
Latency drops to 33ms with 5G connectivity using pipelining.
Performance varies with data type and connectivity technology.
Guidelines provided for processing assets to prevent bottlenecks.
Abstract
Vehicles are sophisticated machines equipped with sensors that provide real-time data for onboard driving assistance systems. Due to the wide variety of traffic, road, and weather conditions, continuous system enhancements are essential. Connectivity allows vehicles to transmit previously unknown data, expanding datasets and accelerating the development of new data models. This enables faster identification and integration of novel data, improving system reliability and reducing time to market. Data Spaces aim to create a data-driven, interconnected, and innovative data economy, where edge and cloud infrastructures support a virtualised IoT platform that connects data sources and development servers. This paper proposes an edge-cloud data platform to connect car data producers with multiple and heterogeneous services, addressing key challenges in Data Spaces, such as data sovereignty,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
