IBM Deep Learning Service
Bishwaranjan Bhattacharjee, Scott Boag, Chandani Doshi, Parijat Dube,, Ben Herta, Vatche Ishakian, K. R. Jayaram, Rania Khalaf, Avesh Krishna, Yu Bo, Li, Vinod Muthusamy, Ruchir Puri, Yufei Ren, Florian Rosenberg, Seetharami R., Seelam, Yandong Wang, Jian Ming Zhang, Li Zhang

TL;DR
IBM's Deep Learning Service (DLaaS) offers a scalable, flexible cloud platform enabling developers to run popular deep learning frameworks efficiently across heterogeneous resources, transforming AI deployment.
Contribution
This paper details the architecture of IBM's DLaaS, integrating deep learning frameworks with cloud infrastructure for scalable, resilient AI development and deployment.
Findings
Supports multiple deep learning frameworks like TensorFlow, Caffe, Torch
Enables scalable training across GPU and CPU resources
Provides flexible resource management in cloud environments
Abstract
Deep learning driven by large neural network models is overtaking traditional machine learning methods for understanding unstructured and perceptual data domains such as speech, text, and vision. At the same time, the "as-a-Service"-based business model on the cloud is fundamentally transforming the information technology industry. These two trends: deep learning, and "as-a-service" are colliding to give rise to a new business model for cognitive application delivery: deep learning as a service in the cloud. In this paper, we will discuss the details of the software architecture behind IBM's deep learning as a service (DLaaS). DLaaS provides developers the flexibility to use popular deep learning libraries such as Caffe, Torch and TensorFlow, in the cloud in a scalable and resilient manner with minimal effort. The platform uses a distribution and orchestration layer that facilitates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Graph Theory and Algorithms · Data Quality and Management
