Greening Big Data Networks: The Impact of Veracity
Ali M. Al-Salim, Taisir E. H. El-Gorashi, Ahmed Q. Lawey, and Jaafar, M. H. Elmirghani

TL;DR
This paper explores how ensuring data veracity in big data networks can significantly improve energy efficiency by implementing a strategic data cleansing and processing approach, leading to up to 52% power savings.
Contribution
It introduces a MILP model to incorporate data veracity into green networking strategies, optimizing data cleansing and processing locations for energy efficiency.
Findings
Achieved up to 52% network power savings with the green approach.
Demonstrated the importance of data veracity in energy-efficient big data networks.
Proposed a model for strategic placement of processing and backup nodes.
Abstract
The continuous increase in big data applications, in number and types, creates new challenges that should be tackled by the green ICT community. Big data is mainly characterized by 4 Vs volume, variety, velocity, and veracity. Each V poses a number of challenges that have implications on the energy efficiency of the underlying networks carrying the big data. Addressing the veracity of the data is a more serious challenge to data scientists, since they need to distinguish between the meaningful data and the dirty data. In this article, we investigate the impact of big data veracity on greening IP by developing a Mixed Integer Linear Programming, MILP, model that encapsulates the distinctive features of veracity. In our analyses, the big data network was greened by cleansing the raw big data before processing and then progressively processing the cleansed big data at strategic locations,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
