FeSViBS: Federated Split Learning of Vision Transformer with Block   Sampling

Faris Almalik; Naif Alkhunaizi; Ibrahim Almakky; and Karthik; Nandakumar

arXiv:2306.14638·cs.CV·June 27, 2023·1 cites

FeSViBS: Federated Split Learning of Vision Transformer with Block Sampling

Faris Almalik, Naif Alkhunaizi, Ibrahim Almakky, and Karthik, Nandakumar

PDF

Open Access 1 Repo

TL;DR

FeSViBS introduces a federated split learning framework with block sampling for vision transformers, improving medical image classification by leveraging intermediate features and pseudo tokens, especially under data-scarce and privacy-sensitive conditions.

Contribution

The paper proposes a novel federated split learning approach with block sampling for vision transformers, enhancing model generalization in medical imaging tasks.

Findings

01

Outperforms existing SL and FL methods on three datasets.

02

Effective in both IID and non-IID data settings.

03

Improves model generalizability with pseudo class tokens.

Abstract

Data scarcity is a significant obstacle hindering the learning of powerful machine learning models in critical healthcare applications. Data-sharing mechanisms among multiple entities (e.g., hospitals) can accelerate model training and yield more accurate predictions. Recently, approaches such as Federated Learning (FL) and Split Learning (SL) have facilitated collaboration without the need to exchange private data. In this work, we propose a framework for medical imaging classification tasks called Federated Split learning of Vision transformer with Block Sampling (FeSViBS). The FeSViBS framework builds upon the existing federated split vision transformer and introduces a block sampling module, which leverages intermediate features extracted by the Vision Transformer (ViT) at the server. This is achieved by sampling features (patch tokens) from an intermediate transformer block and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

faresmalik/fesvibs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Radiomics and Machine Learning in Medical Imaging · Digital Radiography and Breast Imaging

MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Adam · Byte Pair Encoding