Frame-Level Real-Time Assessment of Stroke Rehabilitation Exercises from Video-Level Labeled Data: Task-Specific vs. Foundation Models
Gon\c{c}alo Mesquita, Ana Rita C\'oias, Artur Dubrawski, Alexandre Bernardino

TL;DR
This paper introduces a framework that uses video-level annotations and pre-trained models to classify individual frames for stroke rehabilitation exercises, enabling real-time assessment without extensive frame-level labeling.
Contribution
It presents a novel method combining pseudo-labels and pre-trained models to improve real-time, frame-level assessment of rehabilitation exercises from video-level data.
Findings
MOMENT achieves 73% AUC in video-level assessment.
Action Transformer with Integrated Gradient reaches 72% AUC for frame-level assessment.
The approach improves generalization and reduces data labeling demands.
Abstract
The growing demands of stroke rehabilitation have increased the need for solutions to support autonomous exercising. Virtual coaches can provide real-time exercise feedback from video data, helping patients improve motor function and keep engagement. However, training real-time motion analysis systems demands frame-level annotations, which are time-consuming and costly to obtain. In this work, we present a framework that learns to classify individual frames from video-level annotations for real-time assessment of compensatory motions in rehabilitation exercises. We use a gradient-based technique and a pseudo-label selection method to create frame-level pseudo-labels for training a frame-level classifier. We leverage pre-trained task-specific models - Action Transformer, SkateFormer - and a foundation model - MOMENT - for pseudo-label generation, aiming to improve generalization to new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStroke Rehabilitation and Recovery
MethodsAbsolute Position Encodings · Layer Normalization · Byte Pair Encoding · Label Smoothing · Softmax · Dropout · Dense Connections · Transformer · Sigmoid Activation · Long Short-Term Memory
