Detecting speaking persons in video

Hannes Fassold

arXiv:2110.13806·cs.CV·October 27, 2021

Detecting speaking persons in video

Hannes Fassold

PDF

Open Access

TL;DR

This paper introduces a new method for identifying speaking individuals in videos by analyzing facial landmarks extracted via neural networks and applying statistical analysis over time.

Contribution

The paper proposes a novel approach combining neural network facial landmark detection with temporal statistical analysis for speaking person detection.

Findings

01

Accurate detection of speaking persons in videos.

02

Effective use of facial landmarks and temporal analysis.

03

Improved performance over existing methods.

Abstract

We present a novel method for detecting speaking persons in video, by extracting facial landmarks with a neural network and analysing these landmarks statistically over time

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Gait Recognition and Analysis · Video Surveillance and Tracking Methods