# Automated Discovery and Classification of Training Videos for Career   Progression

**Authors:** Alan Chern, Phuong Hoang, Madhav Sigdel, Janani Balaji, and Mohammed, Korayem

arXiv: 1907.11086 · 2019-07-26

## TL;DR

This paper presents an automated system that collects and classifies training videos to assist job seekers in acquiring skills for career transitions, leveraging machine learning and embedding techniques for large-scale discovery.

## Contribution

It introduces a novel automated approach for collecting and classifying educational videos to support career progression, improving model performance with embedding features.

## Key findings

- Significant performance improvements with embedding vectors.
- Optimal probability threshold reduces false positives.
- System enables large-scale discovery of relevant training videos.

## Abstract

Job transitions and upskilling are common actions taken by many industry working professionals throughout their career. With the current rapidly changing job landscape where requirements are constantly changing and industry sectors are emerging, it is especially difficult to plan and navigate a predetermined career path. In this work, we implemented a system to automate the collection and classification of training videos to help job seekers identify and acquire the skills necessary to transition to the next step in their career. We extracted educational videos and built a machine learning classifier to predict video relevancy. This system allows us to discover relevant videos at a large scale for job title-skill pairs. Our experiments show significant improvements in the model performance by incorporating embedding vectors associated with the video attributes. Additionally, we evaluated the optimal probability threshold to extract as many videos as possible with minimal false positive rate.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.11086/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1907.11086/full.md

## References

11 references — full list in the complete paper: https://tomesphere.com/paper/1907.11086/full.md

---
Source: https://tomesphere.com/paper/1907.11086