# Text Mining of Symptom Descriptions in the Vaccine Adverse Event Reporting System for Human Papillomavirus Vaccination

**Authors:** Kohei Shiota, Megumi Horibe, Yoko Ino, Mari Iwata, Hideyuki Tanaka, Mayumi Kitamura, Kazuhiro Iguchi, Mitsuhiro Nakamura

PMC · DOI: 10.7759/cureus.92575 · Cureus · 2025-09-17

## TL;DR

This study analyzed VAERS reports to understand the language used in describing HPV vaccine adverse events, finding more negative sentiment among female reports.

## Contribution

The study introduces a novel use of sentiment analysis to examine vocabulary patterns in HPV vaccine adverse event reports.

## Key findings

- Sentiment analysis revealed more negative sentiment toward HPV vaccination in female reports compared to male reports.
- Approximately 6% of VAERS reports involved product handling issues rather than direct adverse events.
- The study used the AFINN lexicon in R to quantify sentiment without qualitative interpretation.

## Abstract

Introduction: The World Health Organizationhas reaffirmed the efficacy and safety of human papillomavirus (HPV) vaccines in a statement. While vaccine safety has been extensively studied, little is known about the descriptive language used in reports of adverse events. The Vaccine Adverse Event Reporting System (VAERS), managed by the US Food and Drug Administration and the Centers for Disease Control and Prevention, collects free-text narratives on adverse events following vaccination. This study aimed to examine these narratives to describe vocabulary patterns associated with HPV vaccination.

Methods: We conducted a retrospective, cross-sectional observational study using quantitative text mining techniques. Symptom descriptions related to HPV vaccination were extracted from the Vaccine Adverse Event Reporting System (VAERS, 2009-2023). Sentiment analysis was performed with the AFINN lexicon in R (“tidytext” package), which assigns numerical sentiment scores to words. This quantitative scoring approach enabled us to describe vocabulary patterns without qualitative inference of context or emotions.

Results: Sentiment analysis was performed using the R “tidytext” package with the AFINN lexicon. Reports on suspected adverse events were obtained from the VAERS reports spanning 2009 to 2023, comprising 55,919 suspected adverse events. Approximately 6% of these reports involved product handling issues. The analysis showed that sentiments toward vaccination were more negative among females than males.

Conclusion: Healthcare providers supporting HPV vaccination should provide patients with accurate and comprehensive information on vaccine safety and potential adverse reactions.

## Full-text entities

- **Diseases:** Symptom (MESH:D012816)
- **Species:** Human papillomavirus (species) [taxon 10566], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12534138/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12534138/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12534138/full.md

---
Source: https://tomesphere.com/paper/PMC12534138