Predicting accumulation and age at onset of amyloid-β from genetic risk and resilience for Alzheimer’s disease
Eleanor K O’Brien, Timothy Cox, Shane Fernandez, Pierrick Bourgeat, Tenielle Porter, Ben Goudey, James D Doecke, Colin L Masters, Jurgen Fripp, Kwangsik Nho, Victor L Villemagne, Carlos Cruchaga, Christopher C Rowe, Andrew J Saykin, Vincent Dore, Simon M Laws

TL;DR
This study uses genetic data to predict when amyloid-beta accumulates in the brain, which could help identify Alzheimer's risk early and enable timely interventions.
Contribution
The study introduces polygenic scores for Alzheimer’s risk and resilience to predict amyloid-beta accumulation and age at onset.
Findings
Higher PGSrisk is linked to increased amyloid-beta accumulation and earlier age at onset.
PGSresilience is associated with later age at onset of amyloid-beta accumulation.
Trait-specific PGSs showed inconsistent associations across different genetic folds.
Abstract
Accumulation of brain amyloid beta (Aβ) is a key pathological hallmark of Alzheimer’s disease (AD) and begins many years before cognitive symptoms. Being able to predict the risk of Aβ accumulation, or the age at which this accumulation exceeds a critical threshold, may enable early intervention and treatment to slow or prevent the onset of AD. We utilised published genome-wide association studies (GWAS) to develop polygenic scores (PGS) based on AD risk (PGSrisk) and resilience (PGSresilience). We tested whether these could predict (i) whether an individual was an accumulator of Aβ (‘Accumulator Status’), and (ii) in accumulators, the age at which brain Aβ is estimated to exceed a threshold of 20 centiloids (CL)(‘Estimated Age at onset of Aβ’; AAO-Aβ) among 2175 participants (1158 with AAO Aβ) from the Alzheimer’s Dementia Onset and Progression in International Cohorts (ADOPIC) study.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —National Institute of Health (NIH)
- —Alzheimer’s Association (US), the Alzheimer’s Drug Discovery Foundation, an Anonymous Foundation, the Science and Industry Endowment Fund, the Dementia Collaborative Research Centres, the Victorian Go
- —NHMRC
- —Sir Charles Gairdner Hospital, Cogstate Ltd, Hollywood Private Hospital, The University of Melbourne, and St Vincent’s Hospital
- —National Institute on Aging (National Institutes of Health), DOD ADNI (Department of Defense)
- —National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.;
- —Northern California Institute for Research and Education, Alzheimer’s Therapeutic Research Institute at the University of Southern California
- —NIH
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlzheimer's disease research and treatments · Genetic Associations and Epidemiology · Dementia and Cognitive Impairment Research
Background
Alzheimer’s disease (AD) is a neurodegenerative disorder, characterised by progressive cognitive decline and memory loss [1]. It is the most common form of dementia, accounting for around 65% of cases [2]. The accumulation of amyloid beta (Aβ) in the brain is one of the key pathological hallmarks of AD, beginning years, if not decades, before the onset of clinical symptoms [3–5]. The ‘amyloid cascade hypothesis’ posits that this accumulation precipitates a cascade of events, including tau hyperphosphorylation and formation of neurofibrillary tangles, neuronal loss, cognitive decline and ultimately dementia [5, 6]. Predicting whether an individual will accumulate Aβ and the age at which they will reach a critical threshold for brain Aβ that leads to these downstream events could therefore enable timely therapeutic strategies aimed at delaying or preventing the onset and progression of cognitive decline [7].
Late onset AD (LOAD) occurs sporadically in people aged over 65 years, and makes up 90–95% of AD cases [8]. LOAD (henceforth referred to as AD) is a complex disease, with risk determined by a combination of genetic, environmental and lifestyle factors [9]. Among the genetic factors, the Apolipoprotein E (APOE) gene on chromosome 19 shows the strongest association with the incidence of AD, with the APOE ε4 allele conferring a dose-dependent increase in risk of AD, and the ε2 allele being protective [2, 10]. APOE genotype is also associated with the magnitude and timing of Aβ accumulation, such that ε4 carriage is associated with higher average age-adjusted Aβ burden and younger ages when critical Aβ thresholds are reached [11]. However, genome-wide association studies (GWAS) have identified more than 80 additional genetic variants that show a significant association with AD risk, albeit with smaller individual effects than APOE [12–18]. GWAS for Aβ accumulation and burden have also identified several significantly associated loci in addition to those within the APOE region [19–21]. These findings suggest that the prediction of AD and related traits can be improved by considering genetic factors beyond APOE.
Polygenic scores (PGS; also referred to as ‘genetic scores’ or ‘polygenic risk scores’) aggregate the effects of variants across the genome that are associated with the disease or trait of interest [22–25]. This approach recognises that complex traits are often influenced by many genetic variants, most of small effect, and many of which will not meet the stringent threshold for genome-wide significance [26–28]. The resulting score gives an estimate of genetic susceptibility to a trait or disease, which may then be used, alongside other clinical indicators, to inform decisions around preventative actions or interventions [24]. PGSs developed from genetic associations with disease incidence may also be useful for predicting endophenotypes linked to that disease (or vice versa), which can provide further insights into the underlying biological disease mechanisms and aid in earlier diagnosis. In the context of AD, genetic risk of disease (as estimated from GWASs for incidence of AD) may therefore be useful for predicting onset and progression of pathophysiological features such as Aβ that occur earlier in the disease trajectory, prior to diagnosis.
In the present study, we develop PGSs based on previously published GWASs for risk of and resilience to AD [13, 29]. Here, ‘resilience’ was defined as remaining cognitively unimpaired in the face of high genetic risk for AD [29]. We test the extent to which these PGSs predict whether an individual accumulates brain Aβ and the age at which Aβ is estimated to reach a critical threshold. We then assess whether we can improve on these predictions using a priori GWASs to develop phenotype-specific PGSs for each of these traits. We seek to develop predictive models that can identify individuals at high risk for early Aβ pathology, to inform personalised approaches to the prevention and treatment of AD.
Methods
Study population
The study included data for 2175 participants from the Alzheimer’s Dementia Onset and Progression in International Cohorts (ADOPIC) study, which combines three longitudinal cohort studies: the Alzheimer’s Disease Neuroimaging Initiative (ADNI)[30–32], the Australian Imaging, Biomarkers and Lifestyle Study of Ageing (AIBL)[33, 34], and the Knight ADRC Open Access Series of Imaging Studies (OASIS)[35]. Full details of these studies, including recruitment criteria and the schedule of assessments have been described previously [30–35]. The studies have all been granted approval by the ethics committees of their respective member institutions, and all participants provided informed written consent. The present study was limited to ADOPIC participants of European ancestry.
Genotyping and imputation
DNA was extracted from whole blood samples. For ADNI and AIBL participants, this was done using QIAamp DNA blood spin column kits (Qiagen, Valencia, CA, USA) [31, 33, 34], and for OASIS participants using the Autogen FlexSTAR + salt precipitation [36].
ADNI samples were genotyped at genome-wide single nucleotide polymorphisms (SNPs) on either the Illumina Human610-Quad BeadChip or the Illumina Human OmniExpress BeadChip[31]. AIBL samples were genotyped on the Axiom Precision Medicine Diversity Array (Applied Biosystems^™^)[33, 34]. OASIS samples were genotyped on one of nine arrays (Illumina Human660W-Quad, Infinium OmniExpressExome-8, Illumina Omni1-Quad, Illumina Human1M-Duo, Infinium Neuro Consortium Array, Infinium CoreExome-24, Infinium Global Screening Array-24, Illumina Human610-Quad, and UK Biobank Axiom array)[36].
All genotype data were imputed against the TOPMed panel on the TOPMed Imputation Server (University of Michigan, USA) [37, 38]. Within cohorts, this was done for each genotyping run, and then the imputed data were combined across the three cohorts to produce a single harmonised genetic data set. Only autosomal SNPs were included in the final genetic data set. We removed SNPs with an imputation quality r^2^ ≤ 0.3, samples and SNPs with > 2% missing genotypes after merging, as well as SNPs with minor allele frequency (MAF) < 5%, and those where the p-value from a test for Hardy-Weinberg equilibrium was < 1 × 10^− 6^. We used PLINK [39, 40] to estimate relatedness (pihat) between all pairs of participants and removed one member of each pair of individuals with pihat > 0.25. We also used PLINK to conduct a principal components analysis (PCA) on all remaining unrelated individuals to obtain principal components (PCs) to include as covariates to control for population structure in genetic analyses (see below). Genetic data were available for 6157 individuals, of whom 131 were removed due to relatedness, giving a final genetic data set of 6026 individuals (Fig. 1). APOE genotype was determined from TaqMan^®^ genotyping assays (Life Technologies, USA) at two SNPs: rs7412 (Assay ID: C__904973_10) and rs429358 (Assay ID: C__3084793_20).
Aβ PET Imaging
Measures of brain β-amyloid (Aβ) burden were obtained for participants in all three cohorts using positron emission tomography (PET) imaging with one of the following five tracers: ^11^C–Pittsburgh compound B (PiB), ^18^F-NAV4694 (NAV), ^18^F-florbetaben (FBB), ^18^F-florbetapir (FBP), or ^18^F-flutemetamol (FLUTE). PET images were analysed using CapAIBL [41] to generate tracer-specific standardised uptake value ratios (SUVRs), which were then transformed to centiloids (CL) as previously described [42, 43]. Harmonisation of CL quantification across the three cohorts has been described previously [44].
Traits
Accumulator Status was determined for all individuals in the study with Aβ PET imaging at two or more timepoints (N = 2175; Fig. 1). Individuals were classified as “accumulators” if they had Aβ PET ≥ 20 CL at any assessment or showed an increase in Aβ PET across timepoints of ≥ 0.05 CL/year, and as “non-accumulators” otherwise [45].
Estimated Age at Onset of Aβ (AAO-Aβ) was the age at which participants’ cortical Aβ burden was estimated to have reached 20 CL, obtained by placing them on a natural history curve of Aβ accumulation, as described previously [45]. AAO-Aβ was estimated for people classified as “accumulators” who also met the following criteria: (i) Their Aβ burden at the final scan was ≥ 10 CL (due to uncertainty around whether people with brain Aβ below this level will reach the 20 CL threshold), and (ii) if brain Aβ did not already exceed the 20 CL threshold, it was predicted to do so within 5 years. In total, AAO-Aβ was estimated for 1158 individuals (Fig. 1).
Data analysis
Development of cross-trait polygenic scores
We obtained summary statistics from genome-wide association studies (GWAS) for AD risk [13] and resilience [29]. We then used PRSice-2 [46, 47] to identify optimal PGSs for each trait (Accumulator Status and AAO-Aβ), based on risk and resilience GWASs. This process finds the p-value threshold for variant selection that results in a set of variants that explains the largest proportion of variance in the target phenotype, and uses these to create a PGS. To determine whether these risk and resilience PGSs could account for significant variation in each trait beyond that explained by APOE (the gene most strongly associated with AD, located on chromosome 19), we excluded chromosome 19 from the genetic data set, and included number of APOE ε4 alleles as a covariate, as recommended by Ware et al (2020) [48]. Additional covariates were sex and study (which of the three cohort studies the participant came from). To control for population structure, we also included the first three principal components from the PCA. We used an additive genetic model with default clumping and thinning parameters (250kb distance, r^2^ threshold 0.1). PGSs were standardised to a mean of 0 and standard deviation of 1.
Four PGSs were derived, defined by the GWAS summary stats used (risk or resilience) and the trait being predicted (Accumulator Status or AAO-Aβ): PGS_risk–Accumulator_, PGS_resilience–Accumulator_, PGS_risk–AAO_ and PGS_resilience–AAO_. We then used these optimal PGSs (calculated for the whole population) for the relevant trait in (generalised) linear models to test their association with that trait in three groups that were stratified by number of APOE ε4 alleles: (a) APOE ε4 non-carriers, (b) APOE ε4 heterozygotes, and (c) APOE ε4 homozygotes. These models included the same covariates as described above, excluding APOE ε4 allele count. For Accumulator Status, we fitted logistic regression models using the ‘glm’ function in R with a binomial distribution and logit link function, and for AAO-Aβ we fitted linear regression models using the ‘lm’ function (which assumes a gaussian distribution) in R (version 4.3.3)[49]. In each case, these were implemented in RStudio (version 2024.12.1 Build 563)[50]. Beta coefficients for the PGS term in logistic regressions were exponentiated to convert them to odds ratios (OR), which represent the change in odds of being an accumulator with each 1 SD increase in the PGS. For linear regressions on AAO-Aβ, beta coefficients for the PGS term represent the change in estimated AAO-Aβ (in years) with a 1 SD increase in the PGS.
To examine the change in odds of being an accumulator and estimated AAO-Aβ between PGS extremes, we also ran models comparing people with scores in the upper and lower quintiles (top and bottom 20%) of the population for each PGS and trait. Threshold PGS scores for the upper and lower quintiles were obtained for all genotyped individuals (N = 6026), and then phenotyped individuals in these ranges were extracted for analysis. In these models, PGS was a categorical predictor with levels “low” (lower quintile) and “high” (upper quintile). Covariates were the same as in models that included the full range of PGS values, and models were again run for all individuals and stratified by number of APOE ε4 alleles.
GWAS and development of trait-specific PGS using cross-validation
Due to the relatively small size of this study, we used a Monte Carlo cross validation approach [51, 52] to derive phenotype-specific PGSs. We created 10 cross validation (CV) runs, where within each run the data were split into discovery (two-thirds of individuals; Accumulator Status N = 1450, AAO-Aβ N = 772) and validation (one-third of individuals; Accumulator Status N = 725, AAO-Aβ N = 386) sets using the ‘sample’ function in R (v 4.3.3) [49], run in R Studio (v 2024.12.1 Build 563)[50]. We conducted a genome-wide association study (GWAS) for the trait of interest in the discovery set and used the summary statistics to find the optimal PGS in the corresponding validation set. This process was repeated in each of the 10 CV runs for each trait.
We ran GWASs for each trait in each discovery data set in PLINK [39, 40], using a logistic regression model for Accumulator Status and linear regression for estimated AAO-Aβ. We assumed an additive genetic model, and included sex, study, and the first three principal components as covariates. To identify the optimal PGS for the trait in the corresponding validation set, we PRSice-2 and followed the same protocol as for the risk and resilience PGSs described above, with the same covariates and clumping parameters, and again with chromosome 19 excluded. Here, the ‘effective allele’ was the allele associated with worse outcomes (higher odds of being an accumulator and earlier estimated AAO-Aβ). Therefore in each case, higher values for these scores are expected to be associated with these worse outcomes. To enable a direct comparison of the performance of the trait-specific PGSs with the risk and resilience PGSs, we tested the association of the optimal risk and resilience PGSs (determined from the whole cohort) with the traits in the validation data set of each CV run, using the same set of covariates as previously.
To compare the predictive performance of each PGS for each trait, we summarised performance across all CV runs. For Accumulator Status, we calculated the mean odds ratio and 95% confidence interval, and for estimated AAO-Aβ, we calculate the mean and standard error of the coefficient of the linear relationship of the standardised PGS against the phenotype across the 10 CV runs. For each PGS and trait, the overall R^2^ value was estimated as the mean of the R^2^ values across CV runs (Nagelkerke’s pseudo R^2^ for Accumulator Status), and the overall p-value was estimated from the p-values in each of the 10 CV runs using Stouffer’s method [53], with p-values weighted by the inverse of the standard error and adjusting for the direction of association in each run.
Results
Demographics
Demographic characteristics of all participants with at least two Aβ PET scans (N = 2175), and of the subset of participants with estimated AAO-Aβ (N = 1158), stratified by cohort study, are shown in Table 1. Participants from OASIS were on average younger, a higher proportion of them were female, and there were fewer APOE ε4 homozygotes than in ADNI or AIBL. A higher proportion of participants from ADNI were accumulators of Aβ compared to the other cohort studies. In the subset of participants with AAO-Aβ, all of whom were accumulators of Aβ, OASIS participants were still younger and with lower final Aβ burden than those from ADNI and AIBL (Table 1). Participant counts and percentages by Accumulator Status, for each APOE ε4 group, are shown in Table 2. The percentage of participants who were accumulators of Aβ increased with number of APOE ε4 alleles, from 63.1% among non-carriers, to 84% of ε4 heterozygotes and 95.6% of ε4 homozygotes (Table 2). Consistent with previous studies, APOE ε4 carriage was strongly associated with both Accumulator Status and estimated AAO-Aβ in a dose-dependent manner, with an increasing number of ε4 alleles associated with higher odds of being an accumulator of Aβ (Fig. 2, Table S1) and earlier estimated AAO-Aβ (Fig. 3, Table S2).
Polygenic scores derived from risk and resilience GWAS against Accumulator Status and Estimated Age at Onset of Aβ (AAO-Aβ)
The optimal risk PGS for Accumulator Status (PGS_risk–Accumulator_) was significantly associated with this trait overall, with an odds ratio (OR) of 1.16 (95% CI 1.05–1.29)(Table S1), indicating that for an increase of 1 standard deviation (SD) in PGS_risk–Accumulator_, the odds of being an accumulator of Aβ increase by 16%. When stratified by number of APOE ε4 alleles, the same direction of association was seen in all groups, but this was only significant in APOE ε4 heterozygotes (marginally non-significant in ε4 non-carriers, p = 0.058) (Fig. 2, Table S1). Individuals in the upper quintile of PGS_risk–Accumulator_ were 63% more likely than those in the bottom quintile to be accumulators of Aβ overall (OR = 1.63; 95% CI 1.18–2.26)(Table S1), and this difference was also significant within APOE ε4 non-carriers and heterozygotes (Table S1). The optimal resilience PGS (PGS_resilience–Accumulator_) was not significantly associated with Accumulator Status overall, or for any group when stratified by number of APOE ε4 alleles (Fig. 2, Table S1). We also did not detect a significant difference in the probability of being an accumulator between the upper and lower quintiles of PGS_resilience–Accumulator_ (Table S1). It is important to note that only 4.4% (6 out of 135) of APOE ε4 homozygotes in our study were non-accumulators (Table 2). This was a much lower percentage than in the other ε4 groups (Table 2), and would have limited our power to detect an association with Accumulator Status in this sub-group.
The optimal risk and resilience PGSs for estimated AAO-Aβ (PGS_risk–AAO_ and PGS_resilience–AAO_) were both significant predictors of this trait overall, with higher PGS_risk–AAO_ and lower PGS_resilience–AAO_ associated with an earlier estimated AAO-Aβ (Fig. 3, Table S2). A 1 SD increase in PGS_risk–AAO_ and a 1 SD decrease in PGS_resilience–AAO_ were associated with estimated AAO-Aβ 1.3 years and 0.91 years earlier respectively (Fig. 3, Table S2). When stratified by number of APOE ε4 alleles, higher PGS_risk–AAO_ was associated with an earlier estimated AAO-Aβ in APOE ε4 non-carriers and heterozygotes, but there was no significant association with AAO-Aβ in APOE ε4 homozygotes. Individuals in the upper quintile for PGS_risk–AAO_ had a mean estimated AAO-Aβ 2.9 years earlier than those in the lower quintile after accounting for other covariates, while among APOE ε4 heterozygotes this difference was 4.5 years (Table S2). Higher PGS_resilience–AAO_ was associated with a later estimated AAO-Aβ in APOE ε4 heterozygotes and a marginally non-significant (p = 0.051) increase in estimated AAO-Aβ in ε4 homozygotes, but not associated with AAO-Aβ in APOE ε4 non-carriers (Fig. 3, Table S2). Individuals in the upper quintile for PGS_resilience–AAO_ had a mean estimated AAO-Aβ 2.2 years later than those in the lower quintile after accounting for other covariates, but there was not a significant difference between these extremes for any of the groups after stratification by number of APOE ε4 alleles (Table S2).
Polygenic scores derived from trait-specific GWAS against Accumulator Status and Estimated Age at Onset of Aβ (AAO-Aβ), and comparison with risk and resilience PGSs
The mean percentage of people who were accumulators of Aβ across each of the CV runs was 71.4% in the discovery sets (SD 0.50) and 71.8% in the validation sets (SD 1.00)(Table S3), which are both very close to the value of 71.5% for the whole sample (Table 1). In the trait-specific GWASs run in each of the 10 CV runs (excluding chromosome 19), we found a total of two SNPs (rs12192157 and rs6900289) associated with Accumulator Status at genome-wide significance (P < 5 × 10^− 8^). These closely linked variants were on chromosome 6 and were significant in just one of the CV runs (Table S4). We found one genome-wide significant SNP (rs12022131) for estimated AAO-Aβ in one of the CV runs, located on chromosome 1 (Table S4).
The optimal trait-specific PGS for Accumulator Status (PGS_Accumulator_) was significantly associated with this trait in four of the 10 CV runs (two of these remained significant after FDR correction), although the direction of the association varied among runs and the overall association was not significant (Fig. 4, Table S5). The risk and resilience PGSs for Accumulator Status, run in the validation sets of each CV run for comparison, both showed overall significant positive associations with odds of being an accumulator. In contrast to PGS_Accumulator_, the direction of association was consistent across all 10 CV runs for PGS_risk–Accumulator_ and all but one CV run for PGS_resilience–Accumulator_, although it was only significant in six CV runs for PGS_risk–Accumulator_ and not in any individual CV run for PGS_resilience–Accumulator_ (Fig. 4; Table S5).
Similarly, for estimated AAO-Aβ, the optimal trait-specific PGS (PGS_AAO_) was not significantly associated with the trait overall, and although the association was significant in four of the CV runs (three after FDR correction), the direction of association was highly variable across runs (Fig. 4, Table S6). PGS_risk–AAO_ was negatively associated with AAO-Aβ, indicating that individuals with a higher genetic risk of AD had an earlier estimated age at onset of Aβ. PGS_resilience–AAO_ was positively associated with AAO-Aβ, indicating a later estimated age at onset of Aβ in more genetically resilient individuals (Fig. 4, Table S6). Of the three PGSs for each trait, PGS_risk_ had, on average, the largest effect size and explained the greatest proportion of the variation in both Accumulator Status and estimated AAO-Aβ (Fig. 4).
Discussion
We tested whether polygenic scores based on genetic risk of and resilience to AD could predict Aβ Accumulator Status and estimated AAO-Aβ. We also tested whether PGSs based on phenotype-specific GWASs improved prediction of these traits. Higher genetic risk of AD was associated with higher odds of Aβ accumulation, and earlier estimated AAO-Aβ. Higher genetic resilience was associated with a later estimated AAO-Aβ but was not a significant predictor of Accumulator Status. Phenotype-specific PGSs did not improve on the predictive performance of PGS_risk_ or PGS_resilience_ and were not significant predictors of either trait overall.
The associations of PGSs with each trait were seen after accounting for APOE ε4 status, the strongest genetic risk factor for AD [2, 10]. Our results contrast with several studies that have found that PGSs did not improve prediction over and above APOE ε4 status for AD and all-cause dementia (ACD) [54], or for measures of Aβ deposition [55, 56]. However, other studies have found that PGSs do result in small but significant improvements over APOE alone in prediction of AD-related traits, including incidence of AD [56, 57] and Aβ pathology [58]. Apparent inconsistencies between these findings may be explained by variation in the methods used to construct PGSs, as well as the specific phenotypes examined. It has been argued that Aβ deposition is largely driven by APOE, and that other genetic contributors to AD become more important at later disease stages [54, 56]. However, our results highlight that considering genetic factors beyond APOE can improve prediction of whether and how early individuals will accumulate Aβ.
Differences between PGS_risk_ and PGS_resilience_ in their associations with the traits examined, and their interactions with APOE, offer insights into the mechanisms by which the genetic variation captured by these scores confers risk or protection against AD. Increased genetic risk of AD was a stronger predictor of adverse outcomes (higher odds of being an accumulator and earlier estimated AAO-Aβ) in ε4 non-carriers and heterozygotes than in ε4 homozygotes, suggesting it contributes little additional risk in individuals who are already at highest risk due to APOE ε4 homozygosity. By contrast, higher genetic resilience to AD was associated with later estimated AAO-Aβ in ε4 heterozygotes (and a marginally non-significant association in homozygotes), but was not associated with AAO-Aβ in ε4 non-carriers. The original AD resilience GWAS was conducted by limiting the study population to individuals at high genetic risk for AD (defined by a similar risk PGS to that developed here) and contrasting ‘resilient’ (unaffected by AD) individuals with AD cases [29]. This means that it is effectively a measure of resilience to genetic risk of AD, so it is unsurprising that it is a stronger predictor of AAO-Aβ in APOE ε4 carriers, who are at highest genetic risk [2, 10]. PGS_resilience_ was not associated with Accumulator Status, although the overall trend (significant when aggregated across runs in the cross-validation study) was positive. This implies that more genetically resilient individuals are more likely to be accumulators of Aβ, which seems counterintuitive, but is likely another consequence of the fact that for this score, more genetically resilient individuals also have higher genetic risk scores [29]. A framework previously proposed when considering protective factors for AD, distinguishes between ‘resistance’ and ‘resilience’, where resistance refers to the avoidance of pathological brain changes, while resilience is the ability to cope with accumulating neuropathology and avoid brain atrophy or cognitive decline [59–61]. The lack of association of PGS_resilience_ with Accumulator Status in our study suggests that genetic variation captured by this score does not confer protection against AD by preventing the accumulation of Aβ (‘resistance’), although the association with AAO-Aβ suggests it may slow or delay this accumulation. Further analysis involving a broader range of traits is required to disentangle this.
The trait-specific PGSs for both Accumulator Status and estimated AAO-Aβ, derived using a cross-validation approach, were not associated with either trait overall, although several individual CV runs showed significant associations for each trait. However, there was variability in effect direction for these associations. It is likely that unstable signals were owing in part to the sample sizes available for the discovery GWASs in this study (N = 1450 for Accumulator Status and N = 772 for AAO-Aβ), which were much smaller than those in the risk (N = 94 437) and resilience (N = 13 572) GWASs [13, 29]. Despite the lack of compelling evidence in the current study, trait-specific PGSs may nevertheless capture unique genetic variation associated with these traits, particularly in larger studies. We identified two closely-linked SNPs on chromosome 6 that were associated with Accumulator Status (rs12192157 and rs6900289), and one SNP on chromosome 1 (rs12022131) that was associated with AAO-Aβ at a genome-wide significant level. While each was significant in only a single CV run, it will be of interest to determine whether a signal is seen in these regions in future studies. To the best of our knowledge, there are currently no known associations of these SNPs with specific traits or diseases. However, the SNPs on chromosome 6 are in close proximity to the LPA gene, which affects plasma concentrations of lipoprotein (a) (Lp(a)) and is strongly associated with cardiovascular disease [62], a key risk factor for AD [63, 64].
Genetic variants, unlike other biomarkers of disease, remain constant across the lifespan, meaning that polygenic scores for diseases and related traits offer the potential to identify high risk individuals at a very early stage, prior to symptom onset [25]. In AD, a diagnosis is typically made once cognition is impaired, by which time there has been widespread damage to the brain. However, brain Aβ begins accumulating years or decades prior to appearance of cognitive symptoms [3–5]. Therefore, identifying individuals at risk of accumulating Aβ provides an opportunity to administer interventions to prevent or delay onset and progression of disease. While effective treatments for AD have proved elusive, recent years have seen the development of anti-amyloid monoclonal antibodies that remove Aβ from the brain and have been shown to produce modest slowing of cognitive decline in people with mild symptomatic AD [65–67]. Trials are currently evaluating whether administering these treatments at an earlier stage, in asymptomatic individuals, may be more effective and there is hope that it may eventually be possible to prevent AD [68]. In this instance, PGSs could offer a relatively inexpensive and minimally invasive method to evaluate people’s risk and prioritise them for further screening. A limitation of PGSs is that they typically explain only a small proportion of the total variation in a disease or trait [69], which was the case in this study (0.2–2.3% for PGS_risk_; 0.1–1.6% for PGS_resilience_, depending on population sub-group). While this limits their utility for making a definitive diagnosis, our results show that PGSs can nevertheless improve the accuracy of prediction of AD-related traits, and may be a useful tool for risk stratification, particularly when considered alongside other risk predictors such as demographics and lifestyle.
This study does have several limitations. Within our study population, over 70% of participants were accumulators of Aβ. While data on Accumulator Status in the wider population are scarce, one study found that ~ 20% of healthy adults aged ≥ 60 years had elevated Aβ [70]. Accumulators of Aβ are therefore almost certainly over-represented in our study population, which is unsurprising given that the component cohorts are enriched for people with cognitive complaints [30–35]. As a result, the likelihood of being an accumulator as predicted by PGS score may be overestimated and should be recalibrated based on a more representative population. Similarly, AAO-Aβ could only be estimated for people who had begun accumulating Aβ and were close to, or had exceeded, the 20 CL threshold. The predictive performance of this score in the broader population, including people with low Aβ who may accumulate Aβ, needs to be verified. Furthermore, the lack of an external validation sample for the phenotype-specific PGSs meant that it was necessary to both run the discovery GWAS and develop PGSs within the study population by dividing it into test and validation data sets. Despite utilising the largest existing dataset for these traits, this resulted in small sample sizes relative to those used in the GWASs from which risk and resilience PGSs were derived [13, 29]. This was partially addressed by our cross-validation approach. However, the instability of score performance across CV runs is likely due to the small sample. Finally, the study population was limited to people of European ancestry. Differences in LD structure, allele frequencies and genetic architecture can affect the generalisability of genetic predictors across different ancestries [71], therefore testing in diverse populations would be beneficial.
Conclusions
Polygenic scores based on genetic risk of AD explained a small but significant proportion of the variation in Accumulator Status and estimated AAO-Aβ, over and above that explained by APOE ε4. The PGS for AD risk may be particularly useful, in combination with other predictors, for identifying individuals at risk of Aβ accumulation and earlier AAO-Aβ, who may benefit from targeted prevention and treatment.
Supplementary Material
Supplementary Files
This is a list of supplementary files associated with this preprint. Click to download.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Knopman DS, Alzheimer disease. Nat reviews Disease primers. 2021;7(1):33.
- 2Van Cauwenberghe C, Van Broeckhoven C, Sleegers K. The genetic landscape of Alzheimer disease: clinical implications and perspectives. Genet Sci. 2016;18(5):421–30.
- 3Villemagne VL, Amyloid β deposition, neurodegeneration, and cognitive decline in sporadic Alzheimer’s disease: a prospective cohort study. Lancet Neurol. 2013;12(4):357–67.23477989 10.1016/S 1474-4422(13)70044-9 · doi ↗ · pubmed ↗
- 4Jack CR, Tracking pathophysiological processes in Alzheimer’s disease: an updated hypothetical model of dynamic biomarkers. Lancet Neurol. 2013;12(2):207–16.23332364 10.1016/S 1474-4422(12)70291-0PMC 3622225 · doi ↗ · pubmed ↗
- 5Selkoe DJ, Hardy J. The amyloid hypothesis of Alzheimer’s disease at 25 years. EMBO Mol Med. 2016;8(6):595–608.27025652 10.15252/emmm.201606210 PMC 4888851 · doi ↗ · pubmed ↗
- 6Hardy JA, Higgins GA. Alzheimer’s disease: the amyloid cascade hypothesis. Science. 1992;256(5054):184–5.1566067 10.1126/science.1566067 · doi ↗ · pubmed ↗
- 7Sperling RA, Jack CR Jr, Aisen PS. Testing the right target and right drug at the right stage. Science translational medicine, 2011. 3(111): p. 111cm 33–111cm 33.
- 8Harman D. Alzheimer’s disease pathogenesis: role of aging. Ann N Y Acad Sci. 2006;1067(1):454–60.16804026 10.1196/annals.1354.065 · doi ↗ · pubmed ↗
