The DISCOURSE in psychosis (London Ontario): A speech dataset to examine communication disturbances in early-stage psychosis
Brian Cho, Estée Balles, Michael Mackinley, Paulina Dzialoszynski, Sabrina Ford, Rohit Lodhi, Lena Palaniyappan

TL;DR
This paper introduces a new speech dataset to study communication issues in early-stage psychosis patients using naturalistic conversations.
Contribution
The DISCOURSE dataset provides naturalistic speech samples and clinical data from early-stage psychosis patients and controls for NLP research.
Findings
The dataset includes raw audio and transcribed speech from psychosis patients and healthy controls.
Clinical and demographic data are provided alongside speech samples for comprehensive analysis.
The dataset supports multi-site, multi-language research on psychosis-related communication disturbances.
Abstract
Advances in speech technology and Natural Language Processing (NLP) have demonstrated promise in using speech as a valid source of data to detect features of psychosis. These technologies can potentially detect subtle speech aberrations that often go unnoticed by clinicians and family members. However, research in this area is hindered by a significant limitation: a lack of sufficient and appropriate speech corpora from psychosis patients, especially datasets containing naturalistic speech that reflects typical clinical interactions. This scarcity limits the development, testing, and generalization of new computational methods for psychosis prediction. To address this gap, our new dataset offers naturalistic speech samples collected using the semi-structured DISCOURSE protocol. This resource includes both raw audio recordings and transcribed speech from individuals participating in an…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSchizophrenia research and treatment · Emotion and Mood Recognition · Digital Mental Health Interventions
