# Enabling Interactive Analytics of Secure Data using Cloud Kotta

**Authors:** Yadu N. Babuji, Kyle Chard, Eamon Duede

arXiv: 1705.00070 · 2017-05-02

## TL;DR

This paper presents an extension to Cloud Kotta that enables researchers to perform interactive, collaborative data analysis on sensitive data using Jupyter notebooks, overcoming traditional batch processing limitations.

## Contribution

The paper introduces a novel approach to support interactive analytics within secure data enclaves by integrating Jupyter notebooks and Python for flexible, real-time analysis.

## Key findings

- Supports collaborative, interactive analysis of private data
- Reduces latency compared to batch processing systems
- Enables exploratory data science in secure environments

## Abstract

Research, especially in the social sciences and humanities, is increasingly reliant on the application of data science methods to analyze large amounts of (often private) data. Secure data enclaves provide a solution for managing and analyzing private data. However, such enclaves do not readily support discovery science---a form of exploratory or interactive analysis by which researchers execute a range of (sometimes large) analyses in an iterative and collaborative manner. The batch computing model offered by many data enclaves is well suited to executing large compute tasks; however it is far from ideal for day-to-day discovery science. As researchers must submit jobs to queues and wait for results, the high latencies inherent in queue-based, batch computing systems hinder interactive analysis. In this paper we describe how we have augmented the Cloud Kotta secure data enclave to support collaborative and interactive analysis of sensitive data. Our model uses Jupyter notebooks as a flexible analysis environment and Python language constructs to support the execution of arbitrary functions on private data within this secure framework.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.00070/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1705.00070/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/1705.00070/full.md

---
Source: https://tomesphere.com/paper/1705.00070