Detecting and analysing spontaneous oral cancer speech in the wild

Bence Mark Halpern; Rob van Son; Michiel van den Brekel; Odette; Scharenborg

arXiv:2007.14205·eess.AS·July 29, 2020

Detecting and analysing spontaneous oral cancer speech in the wild

Bence Mark Halpern, Rob van Son, Michiel van den Brekel, Odette, Scharenborg

PDF

1 Repo

TL;DR

This paper introduces a new dataset of spontaneous oral cancer speech from YouTube, analyzes key speech features for detection, and establishes baseline machine learning methods for identifying oral cancer speech in real-world conditions.

Contribution

It presents the first analysis of spontaneous oral cancer speech and sets baselines for detection using explainable machine learning techniques.

Findings

01

Sibilants and stop consonants are key indicators for detection.

02

Baseline models achieve measurable accuracy on the new dataset.

03

Spontaneous speech analysis differs from read speech in oral cancer detection.

Abstract

Oral cancer speech is a disease which impacts more than half a million people worldwide every year. Analysis of oral cancer speech has so far focused on read speech. In this paper, we 1) present and 2) analyse a three-hour long spontaneous oral cancer speech dataset collected from YouTube. 3) We set baselines for an oral cancer speech detection task on this dataset. The analysis of these explainable machine learning baselines shows that sibilants and stop consonants are the most important indicators for spontaneous oral cancer speech detection.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

karkirowle/oral_cancer_analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.