Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph Generation

SeungBum Ha; Taehwan Lee; Jiyoun Lim; Sung Whan Yoon

arXiv:2412.10436·cs.CV·August 20, 2025

Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph Generation

SeungBum Ha, Taehwan Lee, Jiyoun Lim, Sung Whan Yoon

PDF

Open Access 1 Repo 4 Reviews

TL;DR

This paper introduces a novel federated learning benchmark for complex semantic datasets, specifically scene graphs, allowing controlled semantic heterogeneity across clients to evaluate federated methods on multi-semantic vision tasks.

Contribution

It proposes a new benchmark framework for federated learning with controllable semantic heterogeneity, addressing a gap in existing simple classification task benchmarks.

Findings

01

Demonstrates the effectiveness of existing PSG methods in federated settings.

02

Shows that robust federated algorithms improve performance under data heterogeneity.

03

Provides a new benchmark for evaluating federated learning on multi-semantic vision tasks.

Abstract

Federated learning (FL) enables decentralized training while preserving data privacy, yet existing FL benchmarks address relatively simple classification tasks, where each sample is annotated with a one-hot label. However, little attention has been paid to demonstrating an FL benchmark that handles complicated semantics, where each sample encompasses diverse semantic information, such as relations between objects. Because the existing benchmarks are designed to distribute data in a narrow view of a single semantic, managing the complicated semantic heterogeneity across clients when formalizing FL benchmarks is non-trivial. In this paper, we propose a benchmark process to establish an FL benchmark with controllable semantic heterogeneity across clients: two key steps are (i) data clustering with semantics and (ii) data distributing via controllable semantic heterogeneity across clients.…

Peer Reviews

Decision·ICLR 2025 Conference Withdrawn Submission

Reviewer 01Rating 3Confidence 5

Strengths

1. This paper applies the federated learning framework to the SGG/PSG problem, which is more challenging than the ordinary classification problem. 2. This paper focuses on the data heterogeneity problem in federated learning and constructs a client-side data partitioning approach in federated learning from the perspective of multi-semantic and multi-labeling. These works contribute to the later application of federated learning to complex semantic scenarios.

Weaknesses

1. There are problems with the presentation of certain words in the writing of this paper. For example, the first sentence of the abstract, federated learning is a distributed machine learning paradigm, but it can not be directly said that it is decentralized, but also includes centralized, and the method of this paper is based on centralized federated learning. 2. The abstract part, less description of their own methods and innovations, and a large description of background knowledge. Through

Reviewer 02Rating 3Confidence 4

Strengths

1. The method for constructing semantic heterogeneity among clients is novel. 2. This paper has clear logic and easy to understand.

Weaknesses

1. The complex tasks proposed in the paper, which require deep semantic information from samples, demand large-scale data. However, the paper also requires the dataset to provide scene graph information, which incurs substantial annotation costs. For reference, most multimodal large-scale models that handle these complex tasks are trained on unlabeled data. 2. The analysis presented in the paper is based solely on the PSG dataset, and it remains unclear whether similar clustering performance app

Reviewer 03Rating 5Confidence 3

Strengths

* The paper addresses an important gap in existing FL benchmarks, which primarily focus on simpler single-label tasks, by proposing a benchmark suited to more complex tasks that involve multiple semantics and inter-object relationships. * The authors propose a novel FL benchmark for the task of Panoptic Scene Graph Generation. * The paper is well structured and easy to read.

Weaknesses

* Only one task is presented as a proof of concept, which is understandable; however, it raises questions about the methodology’s broader applicability, especially for tasks that integrate both vision and language in a VQA style. It would improve the paper if the authors could discuss the feasibility of extending this approach to other complex tasks with similar or higher levels of complexity. Comments in this direction would be greatly appreciated. * The paper does not investigate the effects

Reviewer 04Rating 3Confidence 4

Strengths

- This work proposes to apply FL into a new application, i.e. scene graph generation and multi-semantic settings. It is important to develop a benchmark for such a new application. - Extensive experiments are conducted across multiple settings and algorithms. - The organization of the paper is clear and easy to follow.

Weaknesses

- It is good to apply FL to a new application, but the motivation is unclear. Why this application is important in FL? - The writing of the paper could be further improved and it is generally not easy to follow. The structure of the paper could be improved. It would be better to introduce the main methodology / benchmark earlier. - Various claims in the paper could be improved. For example, - 1. The paper emphaszes that existing works mostly focus on image classification, but there are many

Code & Models

Repositories

seung-b/fl-psg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Advanced Graph Neural Networks · Data Quality and Management

MethodsSoftmax · Attention Is All You Need · + ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881||How do I resolve a dispute on Expedia?