Enhancing Infant Crying Detection with Gradient Boosting for Improved Emotional and Mental Health Diagnostics

Kyunghun Lee; Lauren M. Henry; Eleanor Hansen; Elizabeth Tandilashvili; Lauren S. Wakschlag; Elizabeth Norton; Daniel S. Pine; Melissa A. Brotman; and Francisco Pereira

arXiv:2410.09236·eess.AS·April 2, 2026

Enhancing Infant Crying Detection with Gradient Boosting for Improved Emotional and Mental Health Diagnostics

Kyunghun Lee, Lauren M. Henry, Eleanor Hansen, Elizabeth Tandilashvili, Lauren S. Wakschlag, Elizabeth Norton, Daniel S. Pine, Melissa A. Brotman, and Francisco Pereira

PDF

TL;DR

This paper presents a novel audio analysis method combining Wav2Vec features and traditional audio features with Gradient Boosting Machines to improve infant cry detection accuracy for health diagnostics.

Contribution

It introduces an integrated approach using deep learning and traditional features with gradient boosting for enhanced infant cry classification.

Findings

01

Significant performance improvements over existing methods

02

Effective integration of Wav2Vec and traditional audio features

03

Validated on real-world dataset

Abstract

Infant crying can serve as a crucial indicator of various physiological and emotional states. This paper introduces a comprehensive approach detecting infant cries within audio data. We integrate Wav2Vec with traditional audio features and employ Gradient Boosting Machines for cry classification. We validate our approach on a real world dataset, demonstrating significant performance improvements over existing methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.