# Using voice and speech data in healthcare: a scoping review of the ethical, legal and social implications

**Authors:** Marie-Françoise Malo, Sarah Bouhouita-Guermech, Hortense Gallois, Vardit Ravitsky, Jean-Christophe Bélisle-Pipon

PMC · DOI: 10.3389/fdgth.2026.1750111 · Frontiers in Digital Health · 2026-02-10

## TL;DR

This review explores ethical, legal, and social issues of using voice and speech data in healthcare, highlighting the need for better frameworks and inclusive practices.

## Contribution

The paper provides a comprehensive analysis of ELSIs specific to voice and speech data, advocating for contextualist approaches and interdisciplinary collaboration.

## Key findings

- Ethical concerns include privacy breaches and challenges in informed consent.
- Legal issues involve unclear regulatory frameworks and data ownership challenges.
- Social issues highlight biases and risks of discrimination in voice data usage.

## Abstract

Human voice and speech, integral to personal identity and social communication, are increasingly used as biometric and digital biomarkers in healthcare. Their collection and analysis, enabled by artificial intelligence, machine learning, and natural language processing, offer promising applications in disease detection and health monitoring. This scoping review examines the ethical, legal, and social implications (ELSIs) associated with using voice and speech data in healthcare. Following a structured search of four databases and a snowball method, 65 articles published between 2009 and 2024 were analyzed. The findings are organized into three main ELSI categories: ethical concerns include privacy breaches, challenges of informed consent, and the need for data validation and respect for vulnerable populations; social issues highlight biases, representational disparities, and risks of discrimination and data misuse; legal issues include unclear regulatory frameworks, conflicting jurisdictional mandates, and challenges in defining data ownership. The review reveals that while many ELSIs mirror those of other biomarker data, the unique properties of voice and speech require adapted frameworks for consent, data governance, and privacy protection. Technological limitations, dataset scarcity, and industry-academic divides exacerbate risks and hinder equitable development. Few studies deeply explore ELSIs in underrepresented populations, and there is a lack of robust empirical research. The review argues for a contextualist, not exceptionalist, approach to voice biomarkers, acknowledging both overlapping and unique challenges. It concludes by stressing the need for harmonized regulations, inclusive datasets, and interdisciplinary collaboration to ensure responsible, equitable integration of voice and speech technologies in healthcare.

## Full-text entities

- **Diseases:** mood disorders (MESH:D019964), AI (MESH:C538142), sleepiness (MESH:D000077260), ELSI (MESH:D001766), Voice (MESH:D014832), Parkinson disease (MESH:D010300), Huntington's disease (MESH:D006816), diabetes (MESH:D003920), neuropsychiatric disorders (MESH:D001523), Alzheimer's disease (MESH:D000544), autism (MESH:D001321), depression (MESH:D003866), heart failure (MESH:D006333), speech disorders (MESH:D013064), mental health conditions (MESH:D000071069), illness (MESH:D002908), communication disorders (MESH:D003147), cognitive decline (MESH:D003072), COVID-19 (MESH:D000086382), mental health (OMIM:603663), aphasia (MESH:D001037), cough (MESH:D003371)
- **Species:** Fenestella gardiennetii (species) [taxon 2499855], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12930345/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12930345/full.md

## References

156 references — full list in the complete paper: https://tomesphere.com/paper/PMC12930345/full.md

---
Source: https://tomesphere.com/paper/PMC12930345