Vocal Breath Sound Based Gender Classification
Mohammad Shaique Solanki, Ashutosh M Bharadwaj, Jeevan K, Prasanta, Kumar Ghosh

TL;DR
This study investigates whether vocal breath sounds contain enough gender-specific information for automatic classification, finding that simple features and segment durations can achieve comparable accuracy to more complex methods.
Contribution
It is the first to analyze gender classification using vocal breath sounds with a focus on feature types, segment duration, and classifier complexity.
Findings
Knowledge-based features perform well with low-complexity classifiers.
3-second breath segments are optimal for classification.
Segment location is less critical than duration.
Abstract
Voiced speech signals such as continuous speech are known to have acoustic features such as pitch(F0), and formant frequencies(F1, F2, F3) which can be used for gender classification. However, gender classification studies using non-speech signals such as vocal breath sounds have not been explored as they lack typical gender-specific acoustic features. In this work, we explore whether vocal breath sounds encode gender information and if so, to what extent it can be used for automatic gender classification. In this study, we explore the use of data-driven and knowledge-based features from vocal breath sounds as well as the classifier complexity for gender classification. We also explore the importance of the location and duration of breath signal segments to be used for automatic classification. Experiments with 54.23 minutes of male and 51.83 minutes of female breath sounds reveal that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
