Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake   Expression Prediction

Savas Ozkan; Gozde Bozdagi Akar

arXiv:1708.07335·cs.CV·August 25, 2017

Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

Savas Ozkan, Gozde Bozdagi Akar

PDF

3 Repos

TL;DR

This paper introduces a learnable spatio-temporal feature aggregation method that improves real-fake expression prediction by capturing short-term temporal and spatial dependencies, outperforming existing techniques.

Contribution

The proposed method uniquely retains short-time temporal structure and spatial interdependencies in video features, and is adaptable to scarce training data scenarios.

Findings

01

Achieved 65% MAP score on real-fake expression dataset

02

Outperformed previous methods with only one misclassification

03

Set a new state-of-the-art result in Chalearn Challenge

Abstract

Frame-level visual features are generally aggregated in time with the techniques such as LSTM, Fisher Vectors, NetVLAD etc. to produce a robust video-level representation. We here introduce a learnable aggregation technique whose primary objective is to retain short-time temporal structure between frame-level features and their spatial interdependencies in the representation. Also, it can be easily adapted to the cases where there have very scarce training samples. We evaluate the method on a real-fake expression prediction dataset to demonstrate its superiority. Our method obtains 65% score on the test dataset in the official MAP evaluation and there is only one misclassified decision with the best reported result in the Chalearn Challenge (i.e. 66:7%) . Lastly, we believe that this method can be extended to different problems such as action/event recognition in future.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory