# Evaluation of Few-Shot AI-Generated Feedback on Case Reports in Physical Therapy Education: Mixed Methods Study

**Authors:** Hisaya Sudo, Yoko Noborimoto, Jun Takahashi

PMC · DOI: 10.2196/85614 · 2025-12-30

## TL;DR

Japanese physical therapy students preferred AI feedback with examples over no examples, as it felt fairer and more useful for learning.

## Contribution

Demonstrates that few-shot AI feedback can reduce algorithm aversion and improve perceived fairness and usefulness in health profession education.

## Key findings

- 74% of students preferred few-shot feedback over zero-shot feedback for overall usefulness.
- Few-shot feedback scored significantly higher on fairness and willingness to invest effort.
- Algorithm aversion increased for zero-shot feedback after identity disclosure, but not for few-shot feedback.

## Abstract

While artificial intelligence (AI)–generated feedback offers significant potential to overcome constraints on faculty time and resources associated with providing personalized feedback, its perceived usefulness can be undermined by algorithm aversion. In-context learning, particularly the few-shot approach, has emerged as a promising paradigm for enhancing AI performance. However, there is limited research investigating its usefulness, especially in health profession education.

This study aimed to compare the quality of AI-generated formative feedback from 2 settings, feedback generated in a zero-shot setting (hereafter, “zero-shot feedback”) and feedback generated in a few-shot setting (hereafter, “few-shot feedback”), using a mixed methods approach in Japanese physical therapy education. Additionally, we examined the effect of algorithm aversion on these 2 feedback types.

A mixed methods study was conducted with 35 fourth-year physical therapy students (mean age 21.4, SD 0.7 years). Zero-shot feedback was created using Gemini 2.5 Pro with default settings, whereas few-shot feedback was generated by providing the same model with 9 teacher-created examples. The participants compared the quality of both feedback types using 3 methods: a direct preference question, the Feedback Perceptions Questionnaire (FPQ), and focus group interviews. Quantitative comparisons of FPQ scores were performed using the Wilcoxon signed rank test. To investigate algorithm aversion, the study examined how student perceptions changed before and after disclosure of the feedback’s identity.

Most students (26/35, 74%) preferred few-shot feedback over zero-shot feedback in terms of overall usefulness, although no significant difference was found between the 2 feedback types for the total FPQ score (P=.22). On the specific FPQ scales, few-shot feedback scored significantly higher than zero-shot feedback on fairness across all 3 items: “satisfied” (P=.02; r=0.407), “fair” (P=.04; r=0.341), and “justified” (P=.02; r=0.392). It also scored significantly higher on 1 item of the usefulness scale (“useful”; P=.02; r=0.401) and 1 item of the willingness scale (“invest a lot of effort”; P=.02; r=0.394). In contrast, zero-shot feedback scored significantly higher on the affect scale across 2 items: “successful” (P=.03; r=0.365) and “angry” (P=.008; r=0.443). Regarding algorithm aversion, evaluations for zero-shot feedback became more negative for 83% (15/18) of the items after identity disclosure, whereas positive perceptions of few-shot feedback were maintained or increased. Qualitative analysis revealed that students valued zero-shot feedback for its encouraging tone, whereas few-shot feedback was appreciated for its contextual understanding and concrete guidance for improvement.

Japanese physical therapy students perceived few-shot feedback more favorably than zero-shot feedback on case reports. This few-shot AI model shows potential to resist algorithm aversion and serves as an effective educational tool to support autonomous writing, facilitate reflection on clinical reasoning, and cultivate advanced thinking skills.

## Full-text entities

- **Diseases:** hallucinations (MESH:D006212), LLMs (MESH:D007806), AI (MESH:C538142)
- **Chemicals:** FPQ (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12811036/full.md

---
Source: https://tomesphere.com/paper/PMC12811036