# Year 2023 in Biomedical Natural Language Processing: a Tribute to Large Language Models and Generative AI

**Authors:** Cyril Grouin, Natalia Grabar

PMC · DOI: 10.1055/s-0044-1800751 · 2025-04-08

## TL;DR

This paper summarizes 2023's top biomedical NLP research, highlighting trends like large language models and generative AI in health-related tasks.

## Contribution

The paper identifies and analyzes the best NLP papers from 2023, emphasizing advancements in language models and domain adaptation.

## Key findings

- Two best papers focused on data augmentation and domain-specific model adaptation using large language models.
- 2023 trends included classical NLP tasks, medical education, and generative AI applications for health issues like cancer and mental health.
- Research on non-English languages and post-COVID-19 conditions was also highlighted.

## Abstract

Objectives
: This synopsis gives insights into scientific publications from 2023 in Natural Language Processing for the biomedical domain. We present the process we followed to identify candidates for NLP's best papers and the two best papers of this year. We also analyze the current trends in the 2023 publications.

Methods
: We queried two bibliographic databases (Medline and the ACL anthology) and refined the outputs through automatic scoring. We then manually shortlisted publications to review and selected candidate papers through an adjudication process. External reviewers assessed the interest of the 13 selected candidates. At last, the section editors chose the best NLP papers.

Results
: We collected 2,148 papers published in 2023, of which two were the best and selected as part of this NLP synopsis. Both address language models and propose solutions for data augmenta-tion, domain-specific model adaptation, and model distillation. Work is done on social media con-tent and electronic health records, using deep learning approaches such as ChatGPT and large lan-guage models.

Conclusion
: Trends from 2023 cover classical NLP tasks (information extraction, text categoriza-tion, sentiment analysis), existing topics from several years (medical education), mainstream applications (Chatbots, generative approaches), and specific issues (cancer, COVID-19, mental health). Specifically for COVID-19, current researches deal with post-COVID-19 conditions, and they explore the understanding of how this pandemic has been managed and welcomed by populations. In addition, due to language models, a few works have been done to process languages other than English, especially using language portability approaches.

## Linked entities

- **Diseases:** cancer (MONDO:0004992), COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** -COVID-19 (MESH:D000086382), cancer (MESH:D009369)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12020626/full.md

---
Source: https://tomesphere.com/paper/PMC12020626