# Using Natural Language Prompts With AI Models for Low-Cost Assistive Software Design: Exploratory Comparative Evaluation

**Authors:** Francesc Antoni Bañuls-Lapuerta, Vicent Marti-Miralles, Rómulo Jacobo Gónzalez-García, Gabriel Martínez-Rico

PMC · DOI: 10.2196/86786 · JMIR Rehabilitation and Assistive Technologies · 2026-03-24

## TL;DR

This study compares free and paid AI models in generating assistive software from natural language prompts, finding paid models like Gemini Pro and ChatGPT Plus perform significantly better.

## Contribution

The study introduces an exploratory evaluation of AI models for low-cost assistive software design using natural language prompts.

## Key findings

- Paid AI models outperformed free models in generating functional assistive software code.
- Gemini Pro successfully implemented 14 of 16 requested functions with minimal prompts.
- Human oversight and prompt literacy are essential for reliable and ethical AI-generated assistive technology.

## Abstract

This study investigates the capacity of 7 artificial intelligence (AI) models, 5 free and 2 paid, to generate functional software for designing low-cost, personalized assistive products.

The objective was to determine which models are most effective, accessible, and consistent in supporting nontechnical professionals in developing inclusive digital solutions and to assess the capabilities of commercially available and easy-to-access AI models to generate code from natural language interactions in the shape of a nontechnical assistive technology design process.

Each AI model was prompted using natural language, without any technical input, to create a Python program that converts an arcade gamepad into an adapted mouse-like controller. Sixteen progressively complex functions were requested through standardized prompts, delivered without additional feedback or correction. Model performance was evaluated based on the number of successfully implemented functions and the average number of prompts required.

Paid models demonstrated markedly superior performance. Gemini Pro (Google) successfully implemented 14 of 16 requested functions with an average of 1.25 (SD 0.45) prompts, while ChatGPT Plus (GPT-5) achieved 11 functions with an average of 1.31 (SD 0.48) prompts. In contrast, free models produced between 0 and 4 functional outcomes, with DeepSeek and Gemini Free ranking the highest within their category. The enhanced outcomes of paid models were linked to improved contextual understanding, greater tolerance for natural language, and reduced conversational drift.

Paid AI models, particularly Gemini Pro and ChatGPT Plus, exhibit strong potential as tools for bridging the gap between health or education professionals and software development. They enable the creation of affordable, user-centered assistive technology without requiring advanced programming skills. Nevertheless, human oversight and foundational literacy in prompt design remain crucial to guarantee functionality, reliability, and ethical use.

## Full-text entities

- **Diseases:** disabilities (MESH:D009069), AI (MESH:C538142), hallucinations (MESH:D006212), COVID-19 (MESH:D000086382)
- **Chemicals:** GPT-5 (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13012223/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13012223/full.md

## References

74 references — full list in the complete paper: https://tomesphere.com/paper/PMC13012223/full.md

---
Source: https://tomesphere.com/paper/PMC13012223