EvaluationNet: Can Human Skill be Evaluated by Deep Networks?
Seong Tae Kim, Yong Man Ro

TL;DR
This paper introduces EvaluationNet, a deep learning framework that models and evaluates human activities in instructional videos, aiming to automate skill assessment and improve learning outcomes.
Contribution
It proposes a novel structured activity modeling approach using dense trajectories, LSTM, and Siamese networks for automated human skill evaluation.
Findings
Effective activity evaluation demonstrated on public datasets.
Structured modeling improves assessment accuracy.
Deep networks can approximate expert evaluations.
Abstract
With the recent substantial growth of media such as YouTube, a considerable number of instructional videos covering a wide variety of tasks are available online. Therefore, online instructional videos have become a rich resource for humans to learn everyday skills. In order to improve the effectiveness of the learning with instructional video, observation and evaluation of the activity are required. However, it is difficult to observe and evaluate every activity steps by expert. In this study, a novel deep learning framework which targets human activity evaluation for learning from instructional video has been proposed. In order to deal with the inherent variability of activities, we propose to model activity as a structured process. First, action units are encoded from dense trajectories with LSTM network. The variable-length action unit features are then evaluated by a Siamese LSTM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Online Learning and Analytics
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
