Federated Action Recognition for Smart Worker Assistance Using FastPose
Vinit Hegiste, Vidit Goyal, Tatjana Legler, Martin Ruskowski

TL;DR
This paper introduces a federated learning framework for skeleton-based human activity recognition in industrial environments, improving accuracy and privacy without centralized data collection.
Contribution
It proposes a novel federated learning approach using FastPose and evaluates its effectiveness with new industrial gesture datasets and models.
Findings
Federated learning significantly improves cross-user generalization.
FedEnsemble outperforms centralized training by +16.3 percentage points.
FL and FedEnsemble achieve higher accuracy on unseen clients.
Abstract
In smart manufacturing environments, accurate and real-time recognition of worker actions is essential for productivity, safety, and human-machine collaboration. While skeleton-based human activity recognition (HAR) offers robustness to lighting, viewpoint, and background variations, most existing approaches rely on centralized datasets, which are impractical in privacy-sensitive industrial scenarios. This paper presents a federated learning (FL) framework for pose-based HAR using a custom skeletal dataset of eight industrially relevant upper-body gestures, captured from five participants and processed using a modified FastPose model. Two temporal backbones, an LSTM and a Transformer encoder, are trained and evaluated under four paradigms: centralized, local (per-client), FL with weighted federated averaging (FedAvg), and federated ensemble learning (FedEnsemble). On the global test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · IoT and Edge/Fog Computing
