A Survey of Big Data Machine Learning Applications Optimization in Cloud Data Centers and Networks
Sanaa Hamid Mohamed, Taisir E.H. El-Gorashi, Jaafar M.H. Elmirghani

TL;DR
This survey reviews the challenges and solutions for optimizing big data machine learning applications in cloud data centers, focusing on infrastructure, traffic management, and energy efficiency.
Contribution
It provides a comprehensive overview of big data programming models, cloud infrastructure, and networking technologies, highlighting optimization strategies and future research directions.
Findings
MapReduce and Hadoop enable efficient big data analytics
Traffic congestion and power consumption are key bottlenecks
Optimization efforts improve performance and energy efficiency
Abstract
This survey article reviews the challenges associated with deploying and optimizing big data applications and machine learning algorithms in cloud data centers and networks. The MapReduce programming model and its widely-used open-source platform; Hadoop, are enabling the development of a large number of cloud-based services and big data applications. MapReduce and Hadoop thus introduce innovative, efficient, and accelerated intensive computations and analytics. These services usually utilize commodity clusters within geographically-distributed data centers and provide cost-effective and elastic solutions. However, the increasing traffic between and within the data centers that migrate, store, and process big data, is becoming a bottleneck that calls for enhanced infrastructures capable of reducing the congestion and power consumption. Moreover, enterprises with multiple tenants…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
