Automatic Identification of Non-Meaningful Body-Movements and What It Reveals About Humans
Md Iftekhar Tanveer, RuJie Zhao, Mohammed Hoque

TL;DR
This paper introduces a framework to automatically identify non-meaningful body movements in public speaking videos, revealing differences in focus between speakers and audiences through feature analysis.
Contribution
It presents a novel dataset and a classification approach to distinguish meaningful from non-meaningful body movements in speeches, with insights into speaker and audience attention.
Findings
Classifier achieves up to 0.82 AUC in predicting annotations.
Lexical features are more important for self-annotations.
Prosody features are more important for audience annotations.
Abstract
We present a framework to identify whether a public speaker's body movements are meaningful or non-meaningful ("Mannerisms") in the context of their speeches. In a dataset of 84 public speaking videos from 28 individuals, we extract 314 unique body movement patterns (e.g. pacing, gesturing, shifting body weights, etc.). Online workers and the speakers themselves annotated the meaningfulness of the patterns. We extracted five types of features from the audio-video recordings: disfluency, prosody, body movements, facial, and lexical. We use linear classifiers to predict the annotations with AUC up to 0.82. Analysis of the classifier weights reveals that it puts larger weights on the lexical features while predicting self-annotations. Contrastingly, it puts a larger weight on prosody features while predicting audience annotations. This analysis might provide subtle hint that public…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Speech and Audio Processing · Humor Studies and Applications
