Detecting speaking persons in video
Hannes Fassold

TL;DR
This paper introduces a new method for identifying speaking individuals in videos by analyzing facial landmarks extracted via neural networks and applying statistical analysis over time.
Contribution
The paper proposes a novel approach combining neural network facial landmark detection with temporal statistical analysis for speaking person detection.
Findings
Accurate detection of speaking persons in videos.
Effective use of facial landmarks and temporal analysis.
Improved performance over existing methods.
Abstract
We present a novel method for detecting speaking persons in video, by extracting facial landmarks with a neural network and analysing these landmarks statistically over time
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Gait Recognition and Analysis · Video Surveillance and Tracking Methods
