Split learning for health: Distributed deep learning without sharing raw   patient data

Praneeth Vepakomma; Otkrist Gupta; Tristan Swedish; Ramesh Raskar

arXiv:1812.00564·cs.LG·December 4, 2018·179 cites

Split learning for health: Distributed deep learning without sharing raw patient data

Praneeth Vepakomma, Otkrist Gupta, Tristan Swedish, Ramesh Raskar

PDF

Open Access 1 Repo

TL;DR

This paper introduces SplitNN, a distributed deep learning method enabling health institutions to collaboratively train models without sharing raw patient data, addressing privacy concerns while maintaining performance.

Contribution

It proposes multiple configurations of SplitNN tailored for various healthcare data sharing scenarios, enhancing privacy-preserving collaborative learning.

Findings

01

SplitNN outperforms federated learning in certain settings.

02

SplitNN reduces data sharing risks while maintaining model accuracy.

03

Efficient resource utilization demonstrated in experiments.

Abstract

Can health entities collaboratively train deep learning models without sharing sensitive raw data? This paper proposes several configurations of a distributed deep learning method called SplitNN to facilitate such collaborations. SplitNN does not share raw data or model details with collaborating institutions. The proposed configurations of splitNN cater to practical settings of i) entities holding different modalities of patient data, ii) centralized and local health entities collaborating on multiple tasks and iii) learning without sharing labels. We compare performance and resource efficiency trade-offs of splitNN and other distributed deep learning methods like federated learning, large batch synchronous stochastic gradient descent and show highly encouraging results for splitNN.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bt-s/Split-Learning-and-Federated-Learning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare