# Proteomic signature of aging in bloodstain samples: a preliminary study

**Authors:** Niu Gao, Daijing Yu, Jingjing Xu, Jiaxuan Hao, Jinding Liu, Jiangwei Yan

PMC · DOI: 10.1186/s12864-025-12164-x · 2025-10-29

## TL;DR

This study explores using proteins in bloodstains to estimate age, offering a promising alternative to DNA-based methods.

## Contribution

The study is the first to use proteomic signatures from bloodstains for forensic age prediction.

## Key findings

- Age-related proteins showed four nonlinear change patterns during aging.
- The Boruta-based model achieved an R² of 0.70 and a MAE of 9.14 years.
- Age-associated proteins were enriched in endocytosis, metabolism, and neurodegenerative disease pathways.

## Abstract

Age estimation from biological samples remains a critical challenge in forensic investigations, particularly when analyzing trace or degraded biological stains recovered from crime scenes. While DNA methylation has gained attention as a potential epigenetic clock for age prediction, practical limitations that include the requirements for bisulfite conversion and environmental interference hinder its forensic utility. Previous studies have shown that proteins are more stable than DNA and that certain proteins are highly age-related and gender-differentiated, indicating that proteomic signatures present a promising alternative for biological age determination. However, proteomic data from bloodstains have not yet been used for forensic age prediction. In this pilot study, we used the high-resolution Thermo Scientific Orbitrap Astral Mass Spectrometer to investigate the proteomic signatures of aging in bloodstain samples from 40 healthy males (aged 10–79 years) stored for 4 years at room temperature (20–25 °C). Age-related proteins were subsequently selected for age prediction. We further simplified the characteristic variables using the Least Absolute Shrinkage and Selection Operator (Lasso) regression and the Boruta algorithm, and established age prediction models for bloodstains based on Random Forest (RF) machine learning.

In total, 1,655 proteins were identified, which showed four different nonlinear change patterns during the aging process. Pearson’s correlation coefficient (R) was calculated, and 71 proteins were found to correlate significantly with age (Pearson’s |R| > 0.3, P < 0.05), including 26 positively and 45 negatively correlated proteins. Functional enrichment analysis revealed that age-associated proteins were markedly enriched in pathways related to endocytosis, metabolism, and neurodegenerative disease. Feature selection using the Lasso regression and the Boruta algorithm resulted in the identification of 18 and 10 age-associated proteins, respectively, with six overlapping proteins (including ITIH3, HSPA9, SNAP91, FTL, XPO4, and NCF2). RF regression models were constructed using different feature sets: Lasso-selected (18 proteins), Boruta-selected (10 proteins), their intersection (6 proteins), and R-value-based filtering (e.g., top 7 proteins with |R| > 0.4, P < 0.05). The Boruta-based model demonstrated the highest predictive accuracy, achieving an R² of 0.70 and a Mean Absolute Error (MAE) of 9.14 years for the testing set.

These findings demonstrate the potential of bloodstain proteomics for age estimation. This study provides a foundation for further validation in larger cohorts and diverse forensic scenarios.

The online version contains supplementary material available at 10.1186/s12864-025-12164-x.

## Linked entities

- **Proteins:** ITIH3 (inter-alpha-trypsin inhibitor heavy chain 3), HSPA9 (heat shock protein family A (Hsp70) member 9), SNAP91 (synaptosome associated protein 91), FTL (ferritin light chain), XPO4 (exportin 4), NCF2 (neutrophil cytosolic factor 2)
- **Diseases:** neurodegenerative disease (MONDO:0005559)

## Full-text entities

- **Genes:** Sod1 (superoxide dismutase 1, soluble) [NCBI Gene 20655] {aka B430204E11Rik, Cu/Zn-SOD, CuZnSOD, Ipo-1, Ipo1, SODC}, Pafah1b3 (platelet-activating factor acetylhydrolase, isoform 1b, subunit 3) [NCBI Gene 18476] {aka Pafahg, mus[g]}, Hspa9 (heat shock protein family A (Hsp70) member 9) [NCBI Gene 15526] {aka 74kDa, Csa, Grp75, Hsc74, Hsp74, Hsp74a}, Ftl1 (ferritin light polypeptide 1) [NCBI Gene 14325] {aka Ftl, Ftl-1, L-ferritin}, Kpnb1 (karyopherin subunit beta 1) [NCBI Gene 16211] {aka IPOB, Impnb}, Psma1 (proteasome subunit alpha 1) [NCBI Gene 26440] {aka C2, HC2, Pros-30, alpha-type}, Snap91 (synaptosomal-associated protein 91) [NCBI Gene 20616] {aka 91kDa, AP180, F1-20, mKIAA0656}, SRI (sorcin) [NCBI Gene 6717] {aka CP-22, CP22, SCN, V19}, Eno2 (enolase 2, gamma neuronal) [NCBI Gene 13807] {aka D6Ertd375e, Eno-2, NSE}, Tgfb1 (transforming growth factor, beta 1) [NCBI Gene 21803] {aka TGF-beta1, TGFbeta1, Tgfb, Tgfb-1}, PTMA (prothymosin alpha) [NCBI Gene 5757] {aka TMSA}, Smad3 (SMAD family member 3) [NCBI Gene 17127] {aka Madh3}, Ncf2 (neutrophil cytosolic factor 2) [NCBI Gene 17970] {aka NOXA2, Ncf-2, p67phox}, MTAP (methylthioadenosine phosphorylase) [NCBI Gene 4507] {aka BDMF, DMSFH, DMSMFH, HEL-249, LGMBF, MSAP}, ELOB (elongin B) [NCBI Gene 6923] {aka SIII, TCEB2}, Cd47 (CD47 antigen (Rh-related antigen, integrin-associated signal transducer)) [NCBI Gene 16423] {aka 9130415E20Rik, B430305P08Rik, IAP, Itgp}, GDF15 (growth differentiation factor 15) [NCBI Gene 9518] {aka GDF-15, HG, MIC-1, MIC1, NAG-1, PDF}, Hspa1b (heat shock protein family A (Hsp70) member 1B) [NCBI Gene 15511] {aka HSP70B1, Hsp70, Hsp70-1, Hsp70.1, hsp68}, Taldo1 (transaldolase 1) [NCBI Gene 21351], CCS (copper chaperone for superoxide dismutase) [NCBI Gene 9973], PZP (PZP alpha-2-macroglobulin like) [NCBI Gene 5858] {aka CPAMD6}, Fgg (fibrinogen gamma chain) [NCBI Gene 99571] {aka 3010002H13Rik}, Dnajb2 (DnaJ heat shock protein family (Hsp40) member B2) [NCBI Gene 56812] {aka 2700059H22Rik, Dnajb10, Hsj1, mDj8}, Nsf (N-ethylmaleimide sensitive fusion protein) [NCBI Gene 18195] {aka SKD2}, DDT (D-dopachrome tautomerase) [NCBI Gene 1652] {aka D-DT, DDCT, MIF-2, MIF2}, Hnrnpk (heterogeneous nuclear ribonucleoprotein K) [NCBI Gene 15387] {aka Hnrpk, KBBP, NOVA}, Itih3 (inter-alpha trypsin inhibitor, heavy chain 3) [NCBI Gene 16426] {aka ITI-HC3, Intin3, Itih-3}, IGHV4-30-2 (immunoglobulin heavy variable 4-30-2) [NCBI Gene 28398] {aka IGHV4-3, IGHV4302}, DTYMK (deoxythymidylate kinase) [NCBI Gene 1841] {aka CDC8, CONPM, PP3731, TMPK, TYMK}, Xpo4 (exportin 4) [NCBI Gene 57258] {aka B430309A01Rik, mKIAA1721}, Tubb2a (tubulin, beta 2A class IIA) [NCBI Gene 22151] {aka M(beta)2, Tubb2}, Glo1 (glyoxalase 1) [NCBI Gene 109801] {aka 0610009E22Rik, 1110008E19Rik, 2510049H23Rik, GLY1, Glo-1, Glo-1r}, Snca (synuclein, alpha) [NCBI Gene 20617] {aka NACP, alpha-Syn, alphaSYN}, Ufm1 (ubiquitin-fold modifier 1) [NCBI Gene 67890] {aka 1810045K17Rik, Gm10726}
- **Diseases:** inflammation (MESH:D007249), carcinogenesis (MESH:D063646), Parkinson's (MESH:D010300), Alzheimer's disease (MESH:D000544), Prion (MESH:D017096), ACN (MESH:D016518), cognitive decline (MESH:D003072), death (MESH:D003643), tumor (MESH:D009369), age- (MESH:D019588), degenerative diseases (MESH:D019636)
- **Chemicals:** bisulfite (MESH:C042345), dithiothreitol (MESH:D004229), Hydrogen peroxide (MESH:D006861), CaCl2 (MESH:D002122), urea (MESH:D014508), ROS (MESH:D017382), monosaccharide (MESH:D009005), methionine (MESH:D008715), DIA (-), ACN (MESH:C084683), cysteine (MESH:D003545), Acetonitrile (MESH:C032159), iron (MESH:D007501), iodoacetamide (MESH:D007460), Th (MESH:D013910), TEAB (MESH:C041737), SDS (MESH:D012967), FA (MESH:C030544), phosphatidylinositol (MESH:D010716), amino acid (MESH:D000596), cytochalasin (MESH:D003572)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12573856/full.md

---
Source: https://tomesphere.com/paper/PMC12573856