VANPY: Voice Analysis Framework

Gregory Koushnir; Michael Fire; Galit Fuhrmann Alpert; Dima Kagan

arXiv:2502.17579·cs.SD·May 6, 2025

TL;DR

VANPY is an open-source Python framework that automates voice data processing, feature extraction, and classification, enabling comprehensive speaker characterization for various applications including emotion and demographic analysis.

Contribution

The paper introduces VANPY, a modular, extensible framework with new in-house components for detailed voice-based speaker attribute classification.

Findings

01. Robust performance of VANPY components across datasets
02. Successful extraction of multiple speaker characteristics from movie voices
03. Framework demonstrates versatility in voice analysis tasks

Abstract

Voice data is increasingly used in modern digital communications, yet comprehensive tools for automated voice analysis and characterization are still lacking. To this end, we developed the VANPY (Voice Analysis in Python) framework for automated pre-processing, feature extraction, and classification of voice data. VANPY is an open-source, end-to-end framework developed for speaker characterization from voice data. The framework is designed with extensibility in mind, allowing for easy integration of new components and adaptation to various voice analysis applications. It currently incorporates over fifteen voice analysis components, including music/speech separation, voice activity detection, speaker embedding, vocal feature extraction, and various classification models. Four of VANPY's components were developed in-house and…
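
The abstract describes a chain of pre-processing, feature extraction, and classification stages that is meant to be easy to extend. The sketch below shows one generic way such a modular pipeline could be organized. It is not VANPY's actual API: the Pipeline class, the stage functions, the record fields, and the 0.1 threshold are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical types for illustration only; not taken from VANPY.
Record = Dict[str, Any]
Stage = Callable[[List[Record]], List[Record]]


@dataclass
class Pipeline:
    """Runs named stages in order; each stage maps a list of records to a new list."""
    stages: List[Tuple[str, Stage]] = field(default_factory=list)

    def add(self, name: str, stage: Stage) -> "Pipeline":
        # Returning self lets callers chain .add(...) calls.
        self.stages.append((name, stage))
        return self

    def run(self, records: List[Record]) -> List[Record]:
        for _name, stage in self.stages:
            records = stage(records)
        return records


def drop_silence(records: List[Record]) -> List[Record]:
    # Toy stand-in for a pre-processing stage: keep clips with any nonzero sample.
    return [r for r in records if any(s != 0.0 for s in r["samples"])]


def mean_energy(records: List[Record]) -> List[Record]:
    # Toy feature extractor: mean squared amplitude of each clip.
    out = []
    for r in records:
        samples = r["samples"]
        out.append({**r, "energy": sum(s * s for s in samples) / len(samples)})
    return out


def loudness_label(records: List[Record], threshold: float = 0.1) -> List[Record]:
    # Toy classifier: a fixed threshold on the extracted feature (assumed value).
    return [
        {**r, "label": "loud" if r["energy"] >= threshold else "quiet"}
        for r in records
    ]


pipeline = (
    Pipeline()
    .add("preprocess", drop_silence)
    .add("features", mean_energy)
    .add("classify", loudness_label)
)

clips = [
    {"id": "a", "samples": [0.0, 0.0]},
    {"id": "b", "samples": [0.5, -0.5]},
    {"id": "c", "samples": [0.1, -0.1]},
]
results = pipeline.run(clips)
```

Because each stage only consumes and returns a list of records, a new component can be added with one more .add(...) call without touching the existing stages, which is the kind of extensibility the abstract refers to.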

Code & Models

Repositories

griko/vanpy (PyTorch, official)

Taxonomy

Topics: Speech and dialogue systems