A systematic data characteristic understanding framework towards physical-sensor big data challenges
Zhipeng Ma, Bo N{\o}rregaard J{\o}rgensen, Zheng Grace Ma

TL;DR
This paper introduces a comprehensive framework based on the 6Vs model to analyze physical-sensor big data characteristics, addressing challenges and improving data quality through quantitative, data-driven indicators and case studies.
Contribution
It proposes a systematic, data-driven framework for understanding physical-sensor big data characteristics, including time-related aspects, to better address analytics challenges.
Findings
Framework effectively analyzes all physical-sensor data characteristics.
Links data challenges to each of the 6Vs dimensions.
Case studies demonstrate practical application and insights.
Abstract
Big data present new opportunities for modern society while posing challenges for data scientists. Recent advancements in sensor networks and the widespread adoption of IoT have led to the collection of physical-sensor data on an enormous scale. However, significant challenges arise in high-quality big data analytics. To uncover big data challenges and enhance data quality, it is essential to quantitatively unveil data characteristics. Furthermore, the existing studies lack analysis of the specific time-related characteristics. Enhancing the efficiency and precision of data analytics through the big data lifecycle requires a comprehensive understanding of data characteristics to address the hidden big data challenges. To fill in the research gap, this paper proposes a systematic data characteristic framework based on a 6Vs model. The framework aims to unveil the data characteristics in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
