TL;DR
This paper re-annotates cough events in the AMI corpus to create a reliable, publicly available database for developing and evaluating machine learning algorithms for audio cough detection, addressing privacy concerns.
Contribution
It introduces a new re-annotation methodology and provides a publicly accessible, annotated cough sound database based on the AMI corpus.
Findings
Re-annotated 1369 cough events in the AMI corpus.
Developed a MATLAB GUI for efficient annotation.
Made the cough annotations and tool publicly available.
Abstract
Cough sounds act as an important indicator of an individual's physical health, often used by medical professionals in diagnosing a patient's ailments. In recent years progress has been made in the area of automatically detecting cough events and, in certain cases, automatically identifying the ailment associated with a particular cough sound. Ethical and sensitivity issues associated with audio recordings of coughs makes it more difficult for this data to be made publicly available. However, without the public availability of a reliable database of cough sounds, developments in the area of audio event detection are likely to be hampered. The purpose of this paper is to spread awareness of a database containing a large amount of naturally occurring cough sounds that can be used for the implementation, evaluation, and comparison of new machine learning algorithms that allow for audio…
| Number of cough annotations | |
| Original annotations | Re-annotated list |
| 1116 | 1369 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Re-annotation of cough events
in the AMI corpus
Paul Leamy, Ted Burke, Damon Berry, David Dorran
Biomedical Research Group
Technological University Dublin
(17-18 June 2019)
1 Introduction
Cough sounds act as an important indicator of an individual’s physical health [1], often used by medical professionals in diagnosing a patient’s ailments [2]. In recent years progress has been made in the area of automatically detecting cough events and, in certain cases, automatically identifying the ailment associated with a particular cough sound. Ethical and sensitivity issues associated with audio recordings of coughs make it more difficult for this data to be made publicly available. Without the public availability of a reliable database of cough sounds, developments in the area of audio event detection are likely to be hampered [3]. The purpose of this paper is to spread awareness of a database [4] containing a large amount of naturally occurring cough sounds that can be used for the implementation, evaluation, and comparison of new machine learning algorithms that allow for audio event detection associated with cough sounds. Using a purpose built GUI designed in MATLAB, a re-annotated version of the Augmented Multi-party Interaction (AMI) corpus’ cough location annotations was produced with 1369 individual cough events. All cough annotations and the re-annotation tool are available for download and public use [5].
2 The AMI corpus
The AMI corpus data is publicly available under the Creative Commons Attribution 4.0 license agreement, and contains audio recordings of indoor meetings taken in an office environment.
The audio recordings are accompanied by annotations of a large number of audio events throughout the entire corpus. Audio recordings were captured from multiple microphone locations, with an example of one meeting room shown in a Figure 1.
2.1 Re-annotation procedure
Following an initial analysis of the annotations, it was found that a number of issues existed relating to the original cough event annotations in the AMI corpus including. These include incorrect start and end times, and successive coughs annotated as single coughs, examples of which are illustrated in Figure 2.
The re-annotation criteria for the cough events is illustrated in Figure 3.
The task of importing the audio, and the recording of the new time-stamps was carried out using a purpose built GUI, as shown in Figure 4.
3 Results
Following the re-annotation procedure, a new set of time-stamps corresponding to the start and end times of single cough events was produced. The total number of single cough events before and after the re-annotation procedure is displayed in Table 1.
4 Conclusions
Justification for the re-annotation of the cough locations in the AMI corpus came following the discovery of a number of issues relating to the original annotations. In the area of cough sound analysis and detection the presence of a reliably annotated database will be a useful addition particularly in the area of machine learning. This database will be useful for progressing the development and testing of new audio event detection algorithms concerned with cough sounds. The re-annotated version of the cough location annotations have been made publicly available along with the GUI used in the re-annotation procedure [5]. A consistent approach to labelling of start and end times of the individual cough events was used, and as a result the potential for the AMI corpus to become a more reliable and reusable source of annotated cough recordings has increased.
4.1 References
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. El Ayadi, M. S. Kamel, and F. Karray, “Survey on speech emotion recognition: Features, classification schemes, and databases,” Pattern Recognition , vol. 44, no. 3, pp. 572–587, 2011.
- 2[2] S. M. Schappert and C. W. Burt, “Ambulatory care visits to physician offices, hospital outpatient departments, and emergency departments: United states, 2001-02.” Vital and Health Statistics. Series 13, Data from the National Health Survey , no. 159, pp. 1–66, 2006.
- 3[3] J. Schröder, J. Anemiiller, and S. Goetze, “Classification of human cough signals using spectro-temporal gabor filterbank features,” in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , March 2016, pp. 6455–6459.
- 4[4] J. Carletta, “Unleashing the killer corpus: experiences in creating the multi-everything ami meeting corpus,” Language Resources and Evaluation , vol. 41, no. 2, pp. 181–190, 2007.
- 5[5] P. Leamy, “paulleamy/ami-cough-annotations,” Mar 2019. [Online]. Available: https://github.com/paulleamy/AMI-Cough-Annotations.git
