FCA-RAC: First Cycle Annotated Repetitive Action Counting

Jiada Lu; WeiWei Zhou; Xiang Qian; Dongze Lian; Yanyu Xu; Weifeng; Wang; Lina Cao; Shenghua Gao

arXiv:2406.12178·cs.CV·June 19, 2024·2 cites

FCA-RAC: First Cycle Annotated Repetitive Action Counting

Jiada Lu, WeiWei Zhou, Xiang Qian, Dongze Lian, Yanyu Xu, Weifeng, Wang, Lina Cao, Shenghua Gao

PDF

Open Access

TL;DR

This paper introduces FCA-RAC, a comprehensive framework for repetitive action counting that enhances model generalization to unseen actions through novel annotation, sampling, multi-scale convolution, and knowledge augmentation techniques.

Contribution

The paper proposes a new framework with innovative annotation and learning strategies to improve action counting accuracy and generalization on diverse datasets.

Findings

01

Achieves superior performance on RepCount-A dataset.

02

Effectively captures multi-scale action variations.

03

Enhances model generalization to unseen actions.

Abstract

Repetitive action counting quantifies the frequency of specific actions performed by individuals. However, existing action-counting datasets have limited action diversity, potentially hampering model performance on unseen actions. To address this issue, we propose a framework called First Cycle Annotated Repetitive Action Counting (FCA-RAC). This framework contains 4 parts: 1) a labeling technique that annotates each training video with the start and end of the first action cycle, along with the total action count. This technique enables the model to capture the correlation between the initial action cycle and subsequent actions; 2) an adaptive sampling strategy that maximizes action information retention by adjusting to the speed of the first annotated action cycle in videos; 3) a Multi-Temporal Granularity Convolution (MTGC) module, that leverages the muli-scale first action as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Machine Learning in Healthcare · Anomaly Detection Techniques and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution