Federated Rule Ensemble Method in Medical Data
Ke Wan, Kensuke Tanioka, Toshio Shimokawa

TL;DR
This paper introduces a federated RuleFit framework that enables interpretable, accurate, and privacy-preserving machine learning across distributed medical datasets, addressing data heterogeneity and privacy concerns.
Contribution
It proposes a novel federated RuleFit method combining differentially private preprocessing, shared rule generation, and sparse coefficient estimation for interpretable models.
Findings
Achieved performance comparable to centralized RuleFit in simulations.
Outperformed existing federated approaches in predictive accuracy.
Provided interpretable insights in real-world medical data analysis.
Abstract
Machine learning has become integral to medical research and is increasingly applied in clinical settings to support diagnosis and decision-making; however, its effectiveness depends on access to large, diverse datasets, which are limited within single institutions. Although integrating data across institutions can address this limitation, privacy regulations and data ownership constraints hinder these efforts. Federated learning enables collaborative model training without sharing raw data; however, most methods rely on complex architectures that lack interpretability, limiting clinical applicability. Therefore, we proposed a federated RuleFit framework to construct a unified and interpretable global model for distributed environments. It integrates three components: preprocessing based on differentially private histograms to estimate shared cutoff values, enabling consistent rule…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
