Bi-modal First Impressions Recognition using Temporally Ordered Deep   Audio and Stochastic Visual Features

Arulkumar Subramaniam; Vismay Patel; Ashish Mishra; Prashanth; Balasubramanian; Anurag Mittal

arXiv:1610.10048·cs.CV·November 1, 2016·1 cites

Bi-modal First Impressions Recognition using Temporally Ordered Deep Audio and Stochastic Visual Features

Arulkumar Subramaniam, Vismay Patel, Ashish Mishra, Prashanth, Balasubramanian, Anurag Mittal

PDF

Open Access 1 Repo

TL;DR

This paper introduces a bi-modal deep learning approach for first impressions recognition from short videos, leveraging temporally ordered audio and novel stochastic visual features to predict Big Five personality traits with high accuracy.

Contribution

It presents a novel bi-modal neural network architecture that effectively combines audio and visual features for personality trait prediction from limited video data.

Findings

01

Models perform exceptionally well on the ChaLearn LAP 2016 dataset.

02

Effective even with small input segments.

03

Achieved top performance in the APA2016 competition.

Abstract

We propose a novel approach for First Impressions Recognition in terms of the Big Five personality-traits from short videos. The Big Five personality traits is a model to describe human personality using five broad categories: Extraversion, Agreeableness, Conscientiousness, Neuroticism and Openness. We train two bi-modal end-to-end deep neural network architectures using temporally ordered audio and novel stochastic visual features from few frames, without over-fitting. We empirically show that the trained models perform exceptionally well, even after training from a small sub-portions of inputs. Our method is evaluated in ChaLearn LAP 2016 Apparent Personality Analysis (APA) competition using ChaLearn LAP APA2016 dataset and achieved excellent performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

InnovArul/first-impressions
torchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHumor Studies and Applications · Video Analysis and Summarization · Emotion and Mood Recognition