Validation Requirements for AI-based Intervention-Evaluation in Aging and Longevity Research and Practice
Georg Fuellen, Anton Kulaga, Sebastian Lobentanzer, Maximilian, Unfried, Roberto Avelar, Daniel Palmer, Brian K. Kennedy

TL;DR
This paper discusses the necessary validation standards for AI tools, especially Large Language Models, used in aging research to ensure accurate, comprehensive, and explainable evaluation of interventions, emphasizing the importance of benchmarks and informed use.
Contribution
It proposes specific validation requirements and workflows for AI-based evaluation in aging research, highlighting the integration of Knowledge Graphs and Retrieval-Augmented Generation.
Findings
AI validation standards improve response quality.
Benchmarking is essential for trustworthy AI in aging.
Informed AI use can prevent harm in longevity research.
Abstract
The field of aging and longevity research is overwhelmed by vast amounts of data, calling for the use of Artificial Intelligence (AI), including Large Language Models (LLMs), for the evaluation of geroprotective interventions. Such evaluations should be correct, useful, comprehensive, explainable, and they should consider causality, interdisciplinarity, adherence to standards, longitudinal data and known aging biology. In particular, comprehensive analyses should go beyond comparing data based on canonical biomedical databases, suggesting the use of AI to interpret changes in biomarkers and outcomes. Our requirements motivate the use of LLMs with Knowledge Graphs and dedicated workflows employing, e.g., Retrieval-Augmented Generation. While naive trust in the responses of AI tools can cause harm, adding our requirements to LLM queries can improve response quality, calling for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFrailty in Older Adults · Artificial Intelligence in Healthcare and Education
