# Data, dialogue, and design: patient and public involvement and engagement for natural language processing with real-world cancer data

**Authors:** Wuraola Oyewusi, Eliana M. Vasquez Osorio, Goran Nenadic, Issy MacGregor, Gareth Price

PMC · DOI: 10.3389/fdgth.2025.1560757 · Frontiers in Digital Health · 2025-05-15

## TL;DR

This study explores how involving patients and the public can improve the ethical use of AI in analyzing cancer data, focusing on data use, consent, and communication.

## Contribution

The study introduces a structured PPIE event to guide NLP research with cancer data, emphasizing patient-centered approaches.

## Key findings

- Two-thirds of participants preferred a national opt-out consent model for data use.
- Participants emphasized the need for clear, accessible information to build trust in AI research.
- Contributors highlighted the importance of involving underrepresented patient groups in NLP studies.

## Abstract

This study describes the process and outcomes of a Patient and Public Involvement and Engagement (PPIE) event designed to incorporate patient perspectives into the application of Natural Language Processing (NLP) for analyzing unstructured free-text cancer medical notes. The analysis of routinely collected data aims to provide evidence to support clinical decision making in patient groups that are often under-represented in conventional clinical trials, highlighting the critical role of PPIE in responsibly implementing AI within healthcare. The study focuses on ensuring that NLP research reflects patient-centered and clinically relevant considerations.

The event involved 13 participants: nine cancer survivors and caregivers, acting as contributors, and four researchers. These participants engaged in focus group discussions on three key topics: data use, consent preferences, and communication strategies for this type of research.

Some key findings included that two-thirds (6/9) of contributors preferred a national opt-out consent model for data use, while one-third (3/9) favored project-specific consent. They offered perspectives on data use, including how it is processed and stored. They also highlighted the importance of clear, accessible information about the research process to build trust and facilitate informed decision-making.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12119481/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12119481/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/PMC12119481/full.md

---
Source: https://tomesphere.com/paper/PMC12119481