Surfboard: Audio Feature Extraction for Modern Machine Learning

Raphael Lenain; Jack Weston; Abhishek Shivkumar; Emil Fristed

arXiv:2005.08848·cs.SD·May 19, 2020

TL;DR

Surfboard is an open-source Python library for audio feature extraction aimed at medical applications. It is demonstrated on Parkinson's disease classification and is designed for use alongside modern machine learning frameworks.

Contribution

The paper introduces an audio feature extraction library aimed at clinical research. It addresses pain points of existing libraries, can be used from Python or from the command line, and supports multiprocessing for large workloads.

Findings

01 Applied to a Parkinson's disease classification task using the mPower dataset

02 Highlights common pitfalls in existing audio research methods

03 Supports future clinical audio research by releasing the source code

Abstract

We introduce Surfboard, an open-source Python library for extracting audio features with application to the medical domain. Surfboard is written with the aim of addressing pain points of existing libraries and facilitating joint use with modern machine learning frameworks. The package can be accessed both programmatically in Python and via its command line interface, allowing it to be easily integrated within machine learning workflows. It builds on state-of-the-art audio analysis packages and offers multiprocessing support for processing large workloads. We review similar frameworks and describe Surfboard's architecture, including the clinical motivation for its features. Using the mPower dataset, we illustrate Surfboard's application to a Parkinson's disease classification task, highlighting common pitfalls in existing research. The source code is opened up to the research community…
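
The workflow the abstract describes, extracting features per recording and spreading a large batch across processes, can be sketched with the Python standard library alone. This is an illustration of the pattern, not Surfboard's API: the names `extract_features`, `extract_batch`, `n_jobs`, and `sine` are hypothetical, and the two features (RMS energy and zero-crossing rate) are picked only because they are simple to compute, not because the paper names them.

```python
import math
from multiprocessing import Pool


def rms(signal):
    """Root-mean-square energy of a list of samples."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))


def zero_crossing_rate(signal):
    """Fraction of adjacent sample pairs whose sign differs."""
    crossings = sum(
        1 for a, b in zip(signal, signal[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(signal) - 1)


def extract_features(signal):
    """Hypothetical per-recording extractor returning a feature dict."""
    return {"rms": rms(signal), "zcr": zero_crossing_rate(signal)}


def extract_batch(signals, n_jobs=1):
    """Extract features for many recordings.

    n_jobs is an assumed parameter name: 1 runs serially,
    larger values distribute recordings over a process pool.
    """
    if n_jobs == 1:
        return [extract_features(s) for s in signals]
    with Pool(processes=n_jobs) as pool:
        return pool.map(extract_features, signals)


def sine(freq_hz, sample_rate=8000, duration_s=0.5):
    """Synthetic test tone standing in for a loaded audio file."""
    n = int(sample_rate * duration_s)
    return [
        math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)
    ]
```

Calling `extract_batch([sine(100), sine(400)], n_jobs=1)` returns one feature dictionary per signal. To use several processes, pass a larger `n_jobs` from inside an `if __name__ == "__main__":` guard so that worker processes can import the module safely.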
