# The development of a data dictionary with clinical variables for artificial intelligence-driven tools in research on abdominal aortic aneurysms and peripheral arterial disease

**Authors:** Lotte Rijken, Sabrina L M Zwetsloot, Catelijne Muller, Marlies P Schijven, Vincent Jongkind, Kak Khee Yeung, Igor Koncar, Igor Koncar, Ivan Tomic, Marina Dias-Neto, Katarzyna D Bera, Riikka Tulamo, Maarit Venermo, Mirjami Laivuori, Christian-Alexander Behrendt, Stefan P M Smorenburg, Corrette Ploem, Roosmarie Jessen, Bert-Jan H van den Born, Ronak Delewi, Jelmer Wolterink, Noeska Smit, Ivana Išgum, Henk A Marquering, Fabio Catarinella, Fabien Lareyre, Juliette Raffort, Maja Živković, Tamara Djuric, Aleksandra Stankovic, Rupert Bauersachs, Rupert Bauersachs, Manar Khashram

PMC · DOI: 10.1093/ehjdh/ztaf091 · 2025-08-20

## TL;DR

This study created structured data dictionaries for AI research on abdominal aortic aneurysms and peripheral arterial disease to improve risk prediction and ensure ethical data use.

## Contribution

The paper introduces two expert-validated data dictionaries for AI-driven research on arterial vascular diseases, ensuring ethical and high-quality data input.

## Key findings

- The aneurysm data dictionary includes 312 variables, while the peripheral arterial disease dictionary includes 325 variables.
- A modified Delphi approach with 16 clinical experts was used to achieve consensus on the data dictionaries.
- Ethical and legal experts were involved throughout the process to ensure compliance with AI guidelines.

## Abstract

Patients with abdominal aortic aneurysms and peripheral arterial disease (arterial vascular diseases) carry a high disease burden and are likely to experience cardiovascular events. Novel strategies using artificial intelligence could identify which patients with arterial vascular diseases are at high risk of cardiovascular disease progression. Structured data dictionaries are needed to ensure high-quality, unbiased, and ethically sound data input for artificial intelligence models. The aim of this study was to obtain expert consensus-based data dictionaries that adhere to applicable ethical guidelines to support research on arterial vascular diseases.

The data dictionaries were created through a modified Delphi approach to achieve consensus among key opinion leaders in the cardiovascular field. First, data requirements were defined and variable longlists were created per disease through a literature review. Secondly, written feedback rounds were held. Lastly, face-to-face meetings were held to establish consensus on the final data dictionaries. During the whole process, ethical and legal experts on trustworthy artificial intelligence were involved to ensure adherence to corresponding guidelines and laws. The aneurysm data dictionary contains 312 variables, and the peripheral arterial disease data dictionary contains 325 variables. A total of 16 clinical experts were involved in the creation, including 12 vascular surgeons, two vascular medicine specialists, one cardiologist, and one gastroenterology surgeon and digital health expert.

Two expert consensus-based data dictionaries for use in clinical and artificial intelligence research on arterial vascular diseases were created, developed for application in research on predicting disease progression and cardiovascular risk.

Graphical Abstract

## Linked entities

- **Diseases:** peripheral arterial disease (MONDO:0005386), cardiovascular disease (MONDO:0004995)

## Full-text entities

- **Diseases:** peripheral arterial disease (MESH:D058729), arterial vascular diseases (MESH:D014652), aneurysm (MESH:D000783), abdominal aortic aneurysms (MESH:D017544), cardiovascular disease (MESH:D002318)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12629649/full.md

---
Source: https://tomesphere.com/paper/PMC12629649