Validation Requirements for AI-based Intervention-Evaluation in Aging   and Longevity Research and Practice

Georg Fuellen; Anton Kulaga; Sebastian Lobentanzer; Maximilian; Unfried; Roberto Avelar; Daniel Palmer; Brian K. Kennedy

arXiv:2408.15264·cs.HC·January 14, 2025

Validation Requirements for AI-based Intervention-Evaluation in Aging and Longevity Research and Practice

Georg Fuellen, Anton Kulaga, Sebastian Lobentanzer, Maximilian, Unfried, Roberto Avelar, Daniel Palmer, Brian K. Kennedy

PDF

Open Access

TL;DR

This paper discusses the necessary validation standards for AI tools, especially Large Language Models, used in aging research to ensure accurate, comprehensive, and explainable evaluation of interventions, emphasizing the importance of benchmarks and informed use.

Contribution

It proposes specific validation requirements and workflows for AI-based evaluation in aging research, highlighting the integration of Knowledge Graphs and Retrieval-Augmented Generation.

Findings

01

AI validation standards improve response quality.

02

Benchmarking is essential for trustworthy AI in aging.

03

Informed AI use can prevent harm in longevity research.

Abstract

The field of aging and longevity research is overwhelmed by vast amounts of data, calling for the use of Artificial Intelligence (AI), including Large Language Models (LLMs), for the evaluation of geroprotective interventions. Such evaluations should be correct, useful, comprehensive, explainable, and they should consider causality, interdisciplinarity, adherence to standards, longitudinal data and known aging biology. In particular, comprehensive analyses should go beyond comparing data based on canonical biomedical databases, suggesting the use of AI to interpret changes in biomarkers and outcomes. Our requirements motivate the use of LLMs with Knowledge Graphs and dedicated workflows employing, e.g., Retrieval-Augmented Generation. While naive trust in the responses of AI tools can cause harm, adding our requirements to LLM queries can improve response quality, calling for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFrailty in Older Adults · Artificial Intelligence in Healthcare and Education