Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination

Hazel Doughty; Dima Damen; Walterio Mayol-Cuevas

arXiv:1703.09913·cs.CV·March 30, 2018·1 cites

Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination

Hazel Doughty, Dima Damen, Walterio Mayol-Cuevas

PDF

Open Access

TL;DR

This paper introduces a deep learning-based pairwise ranking method to assess skill levels from videos across various tasks, aiming to automate the organization and evaluation of skill in video collections.

Contribution

The paper proposes a novel supervised deep ranking loss function that learns discriminative features for skill assessment from videos, applicable to multiple tasks.

Findings

01

Achieved 70-83% accuracy in correctly ordering skill videos

02

Demonstrated robustness through sensitivity analysis

03

Applicable across diverse tasks like surgery, drawing, and cooking

Abstract

We present a method for assessing skill from video, applicable to a variety of tasks, ranging from surgery to drawing and rolling pizza dough. We formulate the problem as pairwise (who's better?) and overall (who's best?) ranking of video collections, using supervised deep ranking. We propose a novel loss function that learns discriminative features when a pair of videos exhibit variance in skill, and learns shared features when a pair of videos exhibit comparable skill levels. Results demonstrate our method is applicable across tasks, with the percentage of correctly ordered pairs of videos ranging from 70% to 83% for four datasets. We demonstrate the robustness of our approach via sensitivity analysis of its parameters. We see this work as effort toward the automated organization of how-to video collections and overall, generic skill determination in video.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning