FeSViBS: Federated Split Learning of Vision Transformer with Block Sampling
Faris Almalik, Naif Alkhunaizi, Ibrahim Almakky, and Karthik, Nandakumar

TL;DR
FeSViBS introduces a federated split learning framework with block sampling for vision transformers, improving medical image classification by leveraging intermediate features and pseudo tokens, especially under data-scarce and privacy-sensitive conditions.
Contribution
The paper proposes a novel federated split learning approach with block sampling for vision transformers, enhancing model generalization in medical imaging tasks.
Findings
Outperforms existing SL and FL methods on three datasets.
Effective in both IID and non-IID data settings.
Improves model generalizability with pseudo class tokens.
Abstract
Data scarcity is a significant obstacle hindering the learning of powerful machine learning models in critical healthcare applications. Data-sharing mechanisms among multiple entities (e.g., hospitals) can accelerate model training and yield more accurate predictions. Recently, approaches such as Federated Learning (FL) and Split Learning (SL) have facilitated collaboration without the need to exchange private data. In this work, we propose a framework for medical imaging classification tasks called Federated Split learning of Vision transformer with Block Sampling (FeSViBS). The FeSViBS framework builds upon the existing federated split vision transformer and introduces a block sampling module, which leverages intermediate features extracted by the Vision Transformer (ViT) at the server. This is achieved by sampling features (patch tokens) from an intermediate transformer block and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Radiomics and Machine Learning in Medical Imaging · Digital Radiography and Breast Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Adam · Byte Pair Encoding
