The OARF Benchmark Suite: Characterization and Implications for   Federated Learning Systems

Sixu Hu; Yuan Li; Xu Liu; Qinbin Li; Zhaomin Wu; Bingsheng He

arXiv:2006.07856·cs.LG·March 3, 2022

The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

Sixu Hu, Yuan Li, Xu Liu, Qinbin Li, Zhaomin Wu, Bingsheng He

PDF

1 Repo

TL;DR

The paper introduces OARF, a comprehensive benchmark suite for federated learning that uses realistic datasets across various data types, enabling better evaluation of system performance and research opportunities.

Contribution

It presents a new benchmark suite, OARF, with diverse, realistic datasets and reference implementations for evaluating federated learning systems.

Findings

01

Federated learning can significantly increase end-to-end throughput.

02

OARF covers diverse data sizes, distributions, and tasks.

03

The benchmark facilitates future research in federated learning.

Abstract

This paper presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available data sets as different data silos in image, text and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution and learning task complexity. The extensive evaluations with reference implementations show the future research opportunities for important aspects of federated learning systems. We have developed reference implementations, and evaluated the important aspects of federated learning, including model accuracy, communication cost, throughput and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Xtra-Computing/PrivML
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.