VEMOCLAP: A video emotion classification web application

Serkan Sulun; Paula Viana; Matthew E. P. Davies

arXiv:2410.21303 · cs.CV · October 30, 2024

TL;DR

VEMOCLAP is an open-source web app that classifies emotions in videos by fusing features from pretrained video-frame and audio models with multi-head cross-attention. It reports state-of-the-art accuracy on the Ekman-6 dataset and lets users analyze their own videos or YouTube videos.

Contribution

The paper introduces VEMOCLAP, described as the first readily available, open-source web application for video emotion classification. It builds on the authors' previous work, fusing pretrained features with multi-head cross-attention to improve accuracy.

Findings

01 Raises the state-of-the-art classification accuracy on the Ekman-6 video emotion dataset by 4.3%.

02 First readily available, open-source web application for video emotion analysis.

03 Lets users run the model on their own videos or on YouTube videos.

Abstract

We introduce VEMOCLAP: Video EMOtion Classifier using Pretrained features, the first readily available and open-source web application that analyzes the emotional content of any user-provided video. We improve our previous work, which exploits open-source pretrained models that work on video frames and audio and then efficiently fuses the resulting pretrained features using multi-head cross-attention. Our approach increases the state-of-the-art classification accuracy on the Ekman-6 video emotion dataset by 4.3% and offers an online application for users to run our model on their own videos or YouTube videos. We invite readers to try our application at serkansulun.com/app.
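
To make the fusion step in the abstract concrete, here is a minimal NumPy sketch of multi-head cross-attention between two feature streams. It is an illustration under stated assumptions, not the authors' implementation: the function name `multi_head_cross_attention`, the choice of visual features as queries and audio features as keys and values, the dimensions, the random weights, and the mean-pooled six-way classifier are all hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_cross_attention(query, context, params, num_heads):
    """Attend from `query` tokens (Lq, d) to `context` tokens (Lk, d).

    Returns the fused features (Lq, d) and the attention weights
    (num_heads, Lq, Lk).
    """
    Lq, d = query.shape
    Lk = context.shape[0]
    assert d % num_heads == 0, "model width must be divisible by num_heads"
    dh = d // num_heads

    # Linear projections: queries from one stream, keys/values from the other.
    Q = query @ params["Wq"]
    K = context @ params["Wk"]
    V = context @ params["Wv"]

    # Split the feature dimension into heads: (num_heads, L, dh).
    Qh = Q.reshape(Lq, num_heads, dh).transpose(1, 0, 2)
    Kh = K.reshape(Lk, num_heads, dh).transpose(1, 0, 2)
    Vh = V.reshape(Lk, num_heads, dh).transpose(1, 0, 2)

    # Scaled dot-product attention, computed independently per head.
    scores = Qh @ Kh.transpose(0, 2, 1) / np.sqrt(dh)
    weights = softmax(scores, axis=-1)
    heads = weights @ Vh

    # Concatenate the heads and apply the output projection.
    merged = heads.transpose(1, 0, 2).reshape(Lq, d)
    return merged @ params["Wo"], weights

# Hypothetical setup: random stand-ins for pretrained features.
rng = np.random.default_rng(0)
d_model, num_heads = 16, 4
params = {
    name: rng.normal(scale=d_model ** -0.5, size=(d_model, d_model))
    for name in ("Wq", "Wk", "Wv", "Wo")
}
visual = rng.normal(size=(8, d_model))  # assumed: 8 video-frame feature tokens
audio = rng.normal(size=(5, d_model))   # assumed: 5 audio feature tokens

fused, weights = multi_head_cross_attention(visual, audio, params, num_heads)

# Assumed classifier head: mean-pool the fused tokens, then map to six classes.
W_cls = rng.normal(size=(d_model, 6))
probs = softmax(fused.mean(axis=0) @ W_cls)

print(fused.shape, weights.shape, probs.shape)  # (8, 16) (4, 8, 5) (6,)
```

Each row of `weights` sums to one, so every visual token receives a convex combination of audio values per head. In a trained model the projection matrices would be learned, and the inputs would come from the pretrained extractors rather than a random generator.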

Taxonomy

Topics: Emotion and Mood Recognition