A Comparative Study of Decoding Strategies in Medical Text Generation

Oriana Presacan; Alireza Nik; Vajira Thambawita; Bogdan Ionescu; Michael Riegler

arXiv:2508.13580 · cs.CL · August 20, 2025

TL;DR

This study compares decoding strategies for medical text generation. Deterministic methods such as beam search generally outperform stochastic ones, and larger models score higher overall but are no more robust to the choice of decoding strategy.

Contribution

The paper provides a comprehensive evaluation of 11 decoding strategies across five medical tasks, showing how decoding choices affect output quality and model robustness.

Findings

01. Deterministic decoding strategies generally outperform stochastic ones.

02. Larger models are not more robust to decoding choices.

03. The choice of decoding strategy can influence results more than model size.

Abstract

Large Language Models (LLMs) rely on various decoding strategies to generate text, and these choices can significantly affect output quality. In healthcare, where accuracy is critical, the impact of decoding strategies remains underexplored. We investigate this effect in five open-ended medical tasks, including translation, summarization, question answering, dialogue, and image captioning, evaluating 11 decoding strategies with medically specialized and general-purpose LLMs of different sizes. Our results show that deterministic strategies generally outperform stochastic ones: beam search achieves the highest scores, while η and top-k sampling perform worst. Slower decoding methods tend to yield better quality. Larger models achieve higher scores overall but have longer inference times and are no more robust to decoding. Surprisingly, while medical LLMs outperform general ones in…
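To make the deterministic versus stochastic distinction concrete, the following is a minimal Python sketch of a single decoding step. It contrasts greedy selection (a simple deterministic strategy) with top-k sampling. The vocabulary, the scores, and the function names are illustrative assumptions and are not taken from the paper; beam search and η-sampling are not implemented here.

```python
import math
import random

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def greedy_pick(logits):
    """Deterministic: always return the index of the highest-scoring token."""
    return max(range(len(logits)), key=lambda i: logits[i])

def top_k_sample(logits, k, rng):
    """Stochastic: keep the k highest-scoring tokens, renormalize, sample one."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    probs = softmax([logits[i] for i in top])
    return rng.choices(top, weights=probs, k=1)[0]

# Hypothetical vocabulary and scores for one decoding step (illustrative only).
vocab = ["aspirin", "ibuprofen", "insulin", "banana"]
logits = [2.0, 1.5, 0.3, -1.0]

rng = random.Random(0)  # fixed seed so repeated runs draw the same samples
greedy_token = vocab[greedy_pick(logits)]
sampled_tokens = [vocab[top_k_sample(logits, k=3, rng=rng)] for _ in range(5)]
```

Greedy selection returns the same token on every call, whereas top-k sampling can return any of the k retained tokens, which is one way to see why stochastic strategies may produce more variable outputs.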
