Cloud Platforms for Developing Generative AI Solutions: A Scoping Review of Tools and Services
Dhavalkumar Patel, Ganesh Raut, Satya Narayan Cheetirala, Girish N, Nadkarni, Robert Freeman, Benjamin S. Glicksberg, Eyal Klang, and Prem, Timsina

TL;DR
This paper provides a comprehensive review of cloud platforms and services that support the development and deployment of generative AI, comparing major providers and discussing technical, security, and future challenges.
Contribution
It offers a detailed comparison of cloud services for generative AI, highlighting key tools, architectures, and challenges to guide practitioners and researchers.
Findings
AWS, Azure, Google Cloud, IBM Cloud, Oracle, Alibaba Cloud support generative AI development.
Cloud services vary in performance, cost, and security features for AI workloads.
Case studies demonstrate practical applications in healthcare, finance, and entertainment.
Abstract
Generative AI is transforming enterprise application development by enabling machines to create content, code, and designs. These models, however, demand substantial computational power and data management. Cloud computing addresses these needs by offering infrastructure to train, deploy, and scale generative AI models. This review examines cloud services for generative AI, focusing on key providers like Amazon Web Services (AWS), Microsoft Azure, Google Cloud, IBM Cloud, Oracle Cloud, and Alibaba Cloud. It compares their strengths, weaknesses, and impact on enterprise growth. We explore the role of high-performance computing (HPC), serverless architectures, edge computing, and storage in supporting generative AI. We also highlight the significance of data management, networking, and AI-specific tools in building and deploying these models. Additionally, the review addresses security…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
MethodsADaptive gradient method with the OPTimal convergence rate
