Federated Action Recognition for Smart Worker Assistance Using FastPose

Vinit Hegiste; Vidit Goyal; Tatjana Legler; Martin Ruskowski

arXiv:2508.14113·cs.CV·August 21, 2025

Federated Action Recognition for Smart Worker Assistance Using FastPose

Vinit Hegiste, Vidit Goyal, Tatjana Legler, Martin Ruskowski

PDF

Open Access

TL;DR

This paper introduces a federated learning framework for skeleton-based human activity recognition in industrial environments, improving accuracy and privacy without centralized data collection.

Contribution

It proposes a novel federated learning approach using FastPose and evaluates its effectiveness with new industrial gesture datasets and models.

Findings

01

Federated learning significantly improves cross-user generalization.

02

FedEnsemble outperforms centralized training by +16.3 percentage points.

03

FL and FedEnsemble achieve higher accuracy on unseen clients.

Abstract

In smart manufacturing environments, accurate and real-time recognition of worker actions is essential for productivity, safety, and human-machine collaboration. While skeleton-based human activity recognition (HAR) offers robustness to lighting, viewpoint, and background variations, most existing approaches rely on centralized datasets, which are impractical in privacy-sensitive industrial scenarios. This paper presents a federated learning (FL) framework for pose-based HAR using a custom skeletal dataset of eight industrially relevant upper-body gestures, captured from five participants and processed using a modified FastPose model. Two temporal backbones, an LSTM and a Transformer encoder, are trained and evaluated under four paradigms: centralized, local (per-client), FL with weighted federated averaging (FedAvg), and federated ensemble learning (FedEnsemble). On the global test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · IoT and Edge/Fog Computing