Generating Completions for Broca's Aphasic Sentences Using Large Language Models
Sijbren van Vaals, Yevgen Matusevych, Frank Tsiwah

TL;DR
This study investigates the use of large language models to complete sentences characteristic of Broca's aphasia, demonstrating their potential to improve communication aids for affected individuals.
Contribution
The paper introduces a method to fine-tune LLMs using synthetic aphasic data for sentence completion in Broca's aphasia, without relying on authentic samples.
Findings
LLMs can reconstruct agrammatic sentences effectively.
Performance improves with longer input utterances.
Synthetic data can be used to train models for clinical language tasks.
Abstract
Broca's aphasia is a type of aphasia characterized by non-fluent, effortful and agrammatic speech production with relatively good comprehension. Since traditional aphasia treatment methods are often time-consuming, labour-intensive, and do not reflect real-world conversations, applying natural language processing based approaches such as Large Language Models (LLMs) could potentially contribute to improving existing treatment approaches. To address this issue, we explore the use of sequence-to-sequence LLMs for completing Broca's aphasic sentences. We first generate synthetic Broca's aphasic data using a rule-based system designed to mirror the linguistic characteristics of Broca's aphasic speech. Using this synthetic data (without authentic aphasic samples), we then fine-tune four pre-trained LLMs on the task of completing agrammatic sentences. We evaluate our fine-tuned models on both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
