An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings
Sucheta Ghosh, Milos Cernak, Sarbani Palit, B. B. Chaudhuri

TL;DR
This paper presents a novel method for detecting rhythmic, positive human laughter in multiparty conversations by applying a music rhythm detection algorithm to the high-energy frames of speech, and reports that it outperforms standard baselines.
Contribution
It introduces a new approach that leverages frequency demodulation and rhythm analysis from music to improve laughter detection in conversational speech.
Findings
Outperforms standard laughter classification baselines
Effective separation of high-energy frames for analysis
Utilizes music rhythm detection algorithms for speech analysis
Abstract
Human laughter can convey various kinds of meaning in human communication. There are various kinds of human laugh signals, for example vocalized and non-vocalized laughs. According to theories from psychology, among all vocalized laugh types, rhythmic staccato-vocalization significantly evokes positive responses in interactions. In this paper we attempt to exploit this observation to detect human laugh occurrences, i.e., laughter, in multiparty conversations from the AMI meeting corpus. First, we separate the high-energy frames from speech, leaving out the low-energy frames, through power spectral density estimation. We then borrow a rhythm detection algorithm from music analysis and apply it to the high-energy frames. Finally, we detect rhythmic laugh frames by analyzing the candidate rhythmic frames using statistics. This novel approach for…
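The three steps named in the abstract (energy-based frame selection, rhythm detection, a statistical decision) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the rhythm algorithm, so the sketch substitutes plain autocorrelation of the frame-energy envelope, a common periodicity estimate in music tempo analysis. The function names, the -20 dB energy threshold, and the 2 to 8 Hz search band are all assumptions made for illustration.

```python
import numpy as np

def frame_power(signal, frame_len, hop):
    """Per-frame power from a Hann-windowed periodogram (a simple PSD estimate)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    power = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        psd = np.abs(np.fft.rfft(frame)) ** 2 / np.sum(window ** 2)
        power[i] = psd.mean()
    return power

def high_energy_mask(power, rel_db=-20.0):
    """Mark frames within rel_db of the loudest frame (threshold is an assumption)."""
    power_db = 10.0 * np.log10(power + 1e-12)
    return power_db >= power_db.max() + rel_db

def dominant_rhythm(envelope, frame_rate, min_hz=2.0, max_hz=8.0):
    """Return (rate_hz, strength) of the strongest periodicity in the envelope.

    Uses the normalized autocorrelation of the mean-removed envelope and
    searches only lags that correspond to rates in [min_hz, max_hz]
    (the band is an assumption, not a value taken from the paper).
    """
    x = envelope - envelope.mean()
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]
    if ac[0] <= 0:
        return 0.0, 0.0
    ac = ac / ac[0]
    lo = max(1, int(round(frame_rate / max_hz)))
    hi = min(len(ac) - 1, int(round(frame_rate / min_hz)))
    if hi < lo:
        return 0.0, 0.0
    lag = lo + int(np.argmax(ac[lo:hi + 1]))
    return frame_rate / lag, float(ac[lag])
```

In use, `frame_power` would be run over a speech segment, `high_energy_mask` would pick the candidate frames, and a segment would be flagged as rhythmic when the strength returned by `dominant_rhythm` exceeds a cutoff. That cutoff is hypothetical and would have to be tuned on labelled data such as the AMI annotations.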
Taxonomy
Topics: Music and Audio Processing · Speech and Audio Processing · Humor Studies and Applications
