HySec-Flow: Privacy-Preserving Genomic Computing with SGX-based Big-Data Analytics Framework
Chathura Widanage, Weijie Liu, Jiayu Li, Hongbo Chen, XiaoFeng Wang,, Haixu Tang, Judy Fox

TL;DR
HySec-Flow is a novel framework that enhances privacy-preserving large-scale genomic analysis using SGX by partitioning tasks into parallel, secure and non-secure containers, improving performance despite enclave memory limitations.
Contribution
The paper introduces HySec-Flow, a hybrid framework that enables scalable, privacy-preserving genomic data analysis on SGX platforms by partitioning tasks and supporting legacy code execution.
Findings
Performance improved by task partitioning in SGX enclaves
Framework supports large-scale genomic workflows
Open-source implementation available
Abstract
Trusted execution environments (TEE) such as Intel's Software Guard Extension (SGX) have been widely studied to boost security and privacy protection for the computation of sensitive data such as human genomics. However, a performance hurdle is often generated by SGX, especially from the small enclave memory. In this paper, we propose a new Hybrid Secured Flow framework (called "HySec-Flow") for large-scale genomic data analysis using SGX platforms. Here, the data-intensive computing tasks can be partitioned into independent subtasks to be deployed into distinct secured and non-secured containers, therefore allowing for parallel execution while alleviating the limited size of Page Cache (EPC) memory in each enclave. We illustrate our contributions using a workflow supporting indexing, alignment, dispatching, and merging the execution of SGX- enabled containers. We provide details…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
