Deploying Jupyter Notebooks at scale on XSEDE resources for Science Gateways and workshops
Andrea Zonca, Robert S. Sinkovits

TL;DR
This paper presents three scalable deployment strategies for JupyterHub on XSEDE resources, enabling large-scale interactive computing for science gateways and workshops.
Contribution
It introduces three novel deployment approaches for JupyterHub at scale on XSEDE resources, including supercomputer integration, Docker Swarm, and Kubernetes.
Findings
Scalable deployment methods demonstrated on XSEDE resources
Provision of persistent storage and user quotas in deployments
Fault-tolerant JupyterHub setup capable of supporting thousands of users
Abstract
Jupyter Notebooks have become a mainstream tool for interactive computing in every field of science. Jupyter Notebooks are suitable as companion applications for Science Gateways, providing more flexibility and post-processing capability to the users. Moreover they are often used in training events and workshops to provide immediate access to a pre-configured interactive computing environment. The Jupyter team released the JupyterHub web application to provide a platform where multiple users can login and access a Jupyter Notebook environment. When the number of users and memory requirements are low, it is easy to setup JupyterHub on a single server. However, setup becomes more complicated when we need to serve Jupyter Notebooks at scale to tens or hundreds of users. In this paper we will present three strategies for deploying JupyterHub at scale on XSEDE resources. All options share…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
