Mixture of Predefined Experts: Maximizing Data Usage on Vertical Federated Learning

Jon Irureta; Gorka Azkune; Jon Imaz; Aizea Lojo; Javier Fernandez-Marques

arXiv:2602.12708·cs.LG·April 22, 2026

Mixture of Predefined Experts: Maximizing Data Usage on Vertical Federated Learning

Jon Irureta, Gorka Azkune, Jon Imaz, Aizea Lojo, Javier Fernandez-Marques

PDF

TL;DR

Split-MoPE is a novel VFL framework combining Split Learning with a predefined experts architecture, maximizing data utilization and robustness in real-world, misaligned sample scenarios.

Contribution

It introduces a new architecture that uses predefined experts for data alignment, enabling efficient, robust, and interpretable federated learning without full sample overlap.

Findings

01

Achieves state-of-the-art performance with a single communication round.

02

Reduces communication overhead compared to multi-round training.

03

Demonstrates robustness against malicious or noisy participants.

Abstract

Vertical Federated Learning (VFL) has emerged as a critical paradigm for collaborative model training in privacy-sensitive domains such as finance and healthcare. However, most existing VFL frameworks rely on the idealized assumption of full sample alignment across participants, a premise that rarely holds in real-world scenarios. To bridge this gap, this work introduces Split-MoPE, a novel framework that integrates Split Learning with a specialized Mixture of Predefined Experts (MoPE) architecture. Unlike standard Mixture of Experts (MoE), where routing is learned dynamically, MoPE uses predefined experts to process specific data alignments, effectively maximizing data usage during both training and inference without requiring full sample overlap. By leveraging pretrained encoders for target data domains, Split-MoPE achieves state-of-the-art performance in a single communication round,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.