Prompt-enhanced Hierarchical Transformer Elevating Cardiopulmonary Resuscitation Instruction via Temporal Action Segmentation
Yang Liu, Xiaoyun Zhong, Shiyao Zhai, Zhicheng Du, Zhenyuan Gao,, Qiming Huang, Canyang Zhang, Bin Jiang, Vijay Kumar Pandey, Sanyang Han,, Runming Wang, Yuxing Han, Peiwu Qin

TL;DR
This paper introduces PhiTrans, a hierarchical Transformer model enhanced with prompts, to improve the segmentation of CPR instructional videos, aiming to elevate training effectiveness through advanced deep learning techniques.
Contribution
The paper presents a novel prompt-enhanced hierarchical Transformer architecture specifically designed for temporal action segmentation in CPR videos, integrating multiple modules for improved accuracy.
Findings
Achieved over 91% in segmentation metrics on CPR datasets
Demonstrated superior performance over existing TAS methods
Validated the pipeline's effectiveness in real CPR training scenarios
Abstract
The vast majority of people who suffer unexpected cardiac arrest are performed cardiopulmonary resuscitation (CPR) by passersby in a desperate attempt to restore life, but endeavors turn out to be fruitless on account of disqualification. Fortunately, many pieces of research manifest that disciplined training will help to elevate the success rate of resuscitation, which constantly desires a seamless combination of novel techniques to yield further advancement. To this end, we collect a custom CPR video dataset in which trainees make efforts to behave resuscitation on mannequins independently in adherence to approved guidelines, thereby devising an auxiliary toolbox to assist supervision and rectification of intermediate potential issues via modern deep learning methodologies. Our research empirically views this problem as a temporal action segmentation (TAS) task in computer vision,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac Arrest and Resuscitation
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Dropout · Adam · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Label Smoothing
