Securing Your Collaborative Jupyter Notebooks in the Cloud using Container and Load Balancing Services
Haw-minn Lu, Adrian Kwong, Jose Unpingco

TL;DR
This paper presents a secure, cloud-based Jupyter Notebook platform that integrates AWS services, ensuring HIPAA compliance, secure collaboration, and privacy for sensitive data applications, without exposing vulnerabilities of Kubernetes or JupyterHub.
Contribution
The paper introduces a novel cloud architecture for Jupyter that enhances security and privacy using containerization and JWT-based authentication, addressing HIPAA requirements.
Findings
Secure platform supports collaboration and sharing of notebooks.
Architecture ensures HIPAA compliance for sensitive data.
Supports multiple applications beyond Jupyter, like R-Studio.
Abstract
Jupyter has become the go-to platform for developing data applications but data and security concerns, especially when dealing with healthcare, have become paramount for many institutions and applications dealing with sensitive information. How then can we continue to enjoy the data analysis and machine learning opportunities provided by Jupyter and the Python ecosystem while guaranteeing auditable compliance with security and privacy concerns? We will describe the architecture and implementation of a cloud based platform based on Jupyter that integrates with Amazon Web Services (AWS) and uses containerized services without exposing the platform to the vulnerabilities present in Kubernetes and JupyterHub. This architecture addresses the HIPAA requirements to ensure both security and privacy of data. The architecture uses an AWS service to provide JSON Web Tokens (JWT) for authentication…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Data Quality and Management
