End-2-End COVID-19 Detection from Breath & Cough Audio
Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Alice, Baird, Lyn Jones, Bj\"orn W. Schuller

TL;DR
This paper presents the first end-to-end deep learning approach for COVID-19 detection from breath and cough audio, demonstrating promising results and scalability potential for population-wide testing.
Contribution
It introduces a novel deep neural network model, CIdeR, for diagnosing COVID-19 from audio, and provides a publicly available dataset and evaluation framework.
Findings
Achieved ROC-AUC of 0.846 in COVID-19 detection
Proposed a scalable, cost-effective audio-based testing method
Released data splits and model details for reproducibility
Abstract
Our main contributions are as follows: (I) We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples, achieving ROC-AUC of 0.846; (II) Our model, the COVID-19 Identification ResNet, (CIdeR), has potential for rapid scalability, minimal cost and improving performance as more data becomes available. This could enable regular COVID-19 testing at apopulation scale; (III) We introduce a novel modelling strategy using a custom deep neural network to diagnose COVID-19 from a joint breath and cough representation; (IV) We release our four stratified folds for cross parameter optimisation and validation on a standard public corpus and details on the models for reproducibility and future reference.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Speech Recognition and Synthesis
MethodsResidual Connection · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Residual Block · Average Pooling · Max Pooling · Kaiming Initialization · Convolution
