Artificial Intelligence in the Diagnosis and Treatment of Speech Disorders: Bridging Neurology and Otorhinolaryngology
Wyllians Vendramini Borelli, Tatiana Luft, Geraldo Pereira Jotz

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Phonetics and Phonology Research · Dysphagia Assessment and Management
General Aspects
In recent years, speech disorders have emerged as a significant interdisciplinary challenge at the crossroads of neurology and otorhinolaryngology. Disorders such as dysarthria, apraxia of speech, and aphasia not only impair communication but also serve as early indicators of neurodegenerative conditions. Advances in artificial intelligence (AI) have begun to transform our approach to both diagnosis and rehabilitation in this field, offering automated, objective tools that can augment traditional clinical assessments.
The use of machine learning algorithms and deep neural networks in speech analysis has shown promising results, particularly in neurodegenerative pathologies like Parkinson's disease. For instance, nonlinear acoustic measures derived from spontaneous speech can be mapped reliably to clinical severity ratings, paving the way for telemonitoring applications in neurodegenerative disorders. 1 Similarly, another study highlighted the potential of dysphonia measurements, when analyzed using robust signal processing techniques, to serve as biomarkers for Parkinsonian speech impairments. 2 Such studies underscore the feasibility of using AI to capture subtle changes that might be overlooked in routine clinical evaluations.
Recent advances extend these methodologies beyond Parkinson's disease. In motor speech disorders, acoustic methods have been developed to identify speech subsystems (phonation, nasal resonance), further classified into global speech functions (prosody). 3 In parallel, researchers have focused on language production deficits seen in primary progressive aphasia. Berisha et al. employed computational approaches to classify nonfluent speech patterns, confirming that machine learning can assist in disentangling the complex linguistic impairments associated with this condition. 4
Aphasia, in particular, exemplifies the required synergy between neurology and otorhinolaryngology. While traditionally viewed as a disorder of higher cortical function, its manifestation in speech production and comprehension entails a careful auditory and articulatory evaluation—a domain where otorhinolaryngologists are highly experienced. Emerging AI tools are expected to bridge these disciplinary boundaries by providing integrated diagnostic platforms. Such systems not only analyze acoustic features but also incorporate linguistic and cognitive markers to improve the accuracy of aphasia subtyping and severity grading.
The potential of AI extends to treatment and rehabilitation as well. Current AI applications in otorhinolaryngology are scarce, though automated monitoring systems and adaptive therapy tools are revolutionizing patient care. 5 6 In the realm of poststroke aphasia, recent studies employing deep learning techniques have begun to differentiate impaired from unimpaired speech with encouraging accuracy. 7 These advances suggest a future in which rehabilitation programs could be dynamically tailored based on continuous, AI-guided assessments.
Other lines of investigation have emphasized the broader application of AI for assessing speech impairments arising from diverse pathological processes. AI tools are also being developed to predict and diagnose voice disorders, such as vocal cord pathologies, in primary care settings. These tools use machine learning techniques to analyze voice recordings and differentiate between various pathologies, potentially outperforming traditional diagnostic methods. 8 Automatic speech analysis systems are being used to facilitate the treatment of speech sound disorders. 9 These systems can provide feedback on speech sound production, which is crucial for home practice and therapy. The accuracy of these systems in classifying speech sounds is high, although it varies depending on the complexity of the sounds.
In summary, AI presents an exciting frontier in the field of speech disorder diagnosis and management. By integrating objective acoustic analysis with advanced pattern recognition, clinicians may soon benefit from tools that not only support early diagnosis but also refine prognostic evaluations and individualize therapeutic interventions. The intersection of neurology and otorhinolaryngology inherent in conditions like aphasia further underscores the value of collaborative, technology-driven approaches. Continued research across these disciplines will be essential to harness the full potential of AI, ensuring that technological innovations translate into meaningful improvements in patient outcomes.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Tsanas A Little M A Mc Sharry P E Ramig L O Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson's disease symptom severity J R Soc Interface 201185984285510.1098/rsif.2010.045621084338 PMC 3104343 · doi ↗ · pubmed ↗
- 2Little M A Mc Sharry P E Hunter E J Spielman J Ramig L O Suitability of dysphonia measurements for telemonitoring of Parkinson's disease IEEE Trans Biomed Eng 20095604101510.1109/TBME.2008.200595421399744 PMC 3051371 · doi ↗ · pubmed ↗
- 3Kent R D Kim Y J Toward an acoustic typology of motor speech disorders Clin Linguist Phon 2003170642744510.1080/026992003100008624814564830 · doi ↗ · pubmed ↗
- 4Fraser K C Meltzer J A Graham N L Automated classification of primary progressive aphasia subtypes from narrative speech transcripts Cortex 201455436010.1016/j.cortex.2012.12.00623332818 · doi ↗ · pubmed ↗
- 5Frosolini A Franz L Caragli V Genovese Ede Filippis C Marioni G Artificial Intelligence in Audiology: A Scoping Review of Current Applications and Future Directions Sensors (Basel)20242422712610.3390/s 2422712639598904 PMC 11598364 · doi ↗ · pubmed ↗
- 6Jotz G P Jotz A V Arnold D Borelli W V Artificial Intelligence for Diagnosis and Treatment of Dysphagia Int Arch Otorhinolaryngol 202529011210.1055/s-0044-1801781 PMC 1175695239850498 · doi ↗ · pubmed ↗
- 7Wang Y Weibin C Fahim S Qiang F Seedahmed S MA Systematic Review of Using Deep Learning in Aphasia: Challenges and Future Directions Computers (Basel)2024130511710.3390/computers 13050117 · doi ↗
- 8Compton E C Cruz T Andreassen M Developing an Artificial Intelligence Tool to Predict Vocal Cord Pathology in Primary Care Settings Laryngoscope 2023133081952196010.1002/lary.3043236226791 · doi ↗ · pubmed ↗
