A Genome-Wide Association Study on Calcium Nephrolithiasis in Chinese Han Population Identifies Novel Susceptible Loci at 4q35.1, 5q31.2 and 18q21.2
Lujia Wang, Zijian Zhou, Xiaoling Lin, Kangcheng Luo, Peng Gao, Deke Jiang, Zhong Wu

TL;DR
This study identifies three new genetic loci linked to calcium kidney stones in the Chinese Han population, offering insights into the genetic basis of the condition.
Contribution
The study is the first genome-wide association study on calcium nephrolithiasis in the Chinese Han population, identifying three novel susceptibility loci.
Findings
Three novel loci (SORBS2, CXXC5, MBD2) were found to be significantly associated with calcium nephrolithiasis in the Chinese Han population.
Each identified SNP showed associations with specific metabolic or kidney-related traits, such as serum chloride and BMI.
The study provides a foundation for future research into the molecular mechanisms of nephrolithiasis in this population.
Abstract
Background: Nephrolithiasis is a significant global health and economic challenge, with an increasing prevalence and a high recurrence rate. However, there is limited knowledge regarding the potential associations between calcium nephrolithiasis risk and Chinese Han populations currently. Methods: To identify the genetic factors for calcium nephrolithiasis, we presented a genome-wide association study (GWAS) using a total of 1006 calcium nephrolithiasis cases and 1200 controls of Chinese Han ethnicity. Suggestive loci (p < 1.0 × 10−6) were replicated in 445 cases and 1008 controls. We also assessed the association of GWAS-level significant single-nucleotide polymorphisms (SNPs) with quantitative traits, including metabolic, kidney-related, and electrolyte traits. Results: Here we found three novel loci for calcium nephrolithiasis: SORBS2 on 4q35.1 (rs3736194; p = 2.84 × 10−13, OR =…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —National Natural Science Foundation of China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsKidney Stones and Urolithiasis Treatments · Parathyroid Disorders and Treatments · Chronic Kidney Disease and Diabetes
1. Background
Nephrolithiasis constitutes a significant global health and economic burden, impacting over 15% of men and more than 5% of women by the age of 70, with the global prevalence estimated to be approximately 10.85% recently [1]. The overall prevalence rate of nephrolithiasis was estimated to be 4.0% throughout mainland China [2]. Unfortunately, nephrolithiasis exhibits a high recurrence rate, with up to 50% of patients experiencing a subsequent episode within a decade following their initial onset [3]. The majority of kidney stones consist of either calcium oxalate or calcium phosphate crystals. The primary mechanism behind crystal formation and growth is the increased supersaturation of lithogenic salts in the urine, further exacerbated by the reduced urinary levels of natural stone inhibitors such as citrate, magnesium, pyrophosphate, and uromodulin [4]. The etiology of nephrolithiasis is complex, influenced by an interplay of environmental, lifestyle, and genetic factors. Studies indicate that approximately 65% of individuals with nephrolithiasis have a family history [5]. Furthermore, twin studies have demonstrated a significant genetic component, revealing a heritability estimate of over 45% [6]. Of particular note is the elevated risk observed in individuals with a substantial family history of urolithiasis; those with both a parent and a sibling affected exhibit a standard incidence ratio for developing stones that exceeds 50% [7]. This evidence underscores the substantial influence of genetic factors on the likelihood of nephrolithiasis, suggesting that both hereditary traits and environmental exposures play critical roles in the pathogenesis of kidney stones.
As far as we know, up to six genome-wide association studies (GWAS) of urolithiasis have been published, identifying 25 loci associated with nephrolithiasis [8,9,10,11,12,13]. The first GWAS in kidney stones, conducted in 2009 with Dutch and Icelandic cohorts, identified two synonymous variants in CLDN14. CLDN14 is expressed in the kidney and plays a crucial role in regulating paracellular permeability at epithelial tight junctions [8]. In 2011, a GWAS in nephrolithiasis was performed in a Japanese population, which was the first one for an East Asian population. They identified three novel susceptible loci at 5q35.3 (SLC34A1), 7p14.3 (AQP1), and 13q14.1 (DGKH) [10]. We replicated the association of seven single-nucleotide polymorphisms (SNPs) in these loci with urolithiasis in the Chinese Han population. Unfortunately, none of these SNPs previously identified were significantly associated with urolithiasis risk in the Chinese Han population [14]. In 2018, a large-scale GWAS of urolithiasis performed in a Japanese population revealed 14 significant loci, 9 of which were novel loci. They speculated that genetic factors influencing metabolic and crystallization pathways function in the development of urolithiasis [12]. The largest GWAS to date in nephrolithiasis was performed in 2019, comprising a trans-ethnic meta-analysis that included cohorts from the UK and Japan. They identified 20 loci of interest, 7 of which have not been identified previously. Of these, five loci, including DGKD, DGKH, WDR72, GPIC1, and BCR, are supposed to influence the CaSR signaling pathway [13].
In our previous studies, we replicated the association of the reported 28 SNPs in 24 loci with calcium nephrolithiasis in a Chinese Han population, and our findings elucidated the significance of genetic variation at only two of these loci (WDR72 and PDILT) in calcium nephrolithiasis [15,16]. This low replication rate highlights the significant genetic heterogeneity of nephrolithiasis across different ancestries, suggesting that the genetic architecture identified in other populations may not be directly transferable to the Chinese Han population. Therefore, a population-specific GWAS on nephrolithiasis in the Chinese Han population is urgently warranted to delineate its unique genetic landscape.
To further investigate the genetic factors that are related to the risk of calcium nephrolithiasis in Chinese Han population, we conducted a GWAS of calcium nephrolithiasis in the Chinese Han population.
2. Methods
2.1. Study Participants
Table 1 showed the characteristics of each cohort. The discovery stage of our GWAS included 1010 cases of nephrolithiasis and 1200 healthy controls. For the replication stage, an independent sample set was used, consisting of 445 cases and 1008 controls. The participants for the discovery and replication stages were recruited during distinct periods. The diagnosis of nephrolithiasis in patients from Huashan Hospital of Fudan University was confirmed by a clinician. Patients with uric acid, struvite, cystine, ammonium acid urate stones, or secondary nephrolithiasis caused by drugs, hyperparathyroidism, gout, or urinary deformity were excluded from the analysis. Control subjects were recruited from a routine health check-up cohort at the same institution. All controls had no self-reported history of urolithiasis and were confirmed stone-free by ultrasonography. All the participants provided informed consent, and this study was approved by the Huashan Institutional Review Board of Fudan University (HIRB).
2.2. Genotyping and Quality Control in the GWAS Discovery Stage
DNA samples were genotyped using the Illumina Asian Screening Array (ASA) Beadchip (Illumina Inc., San Diego, CA, USA), resulting in a total of 659,184 SNPs genotyped in 1010 patients with calcium nephrolithiasis. Control samples were genotyped using the Illumina OmniExpress BeadChips (Illumina Inc., San Diego, CA, USA) based on our previously published GWAS [17]. A standard quality control procedure was applied to select samples and SNPs for further analysis, with samples excluded for the following reasons: (i) a genotyping rate of less than 95%; (ii) duplicates or probable familial relationships (IBS > 0.99); and (iii) ambiguous gender. SNPs were removed if they met any of these criteria: (i) call rate below 95%; (ii) minor allele frequency (MAF) less than 0.01; or (iii) p value less than 1 × 10^−3^ in the Hardy–Weinberg Equilibrium (HWE) test among controls.
Imputation analysis was conducted using the IMPUTE software (version 2) [18], with the 1000 Genomes Project Han Chinese in Beijing (CHB) population as a reference. The threshold for the imputation information score was set at greater than 0.90. Following quality control procedures, the analysis proceeded with a dataset comprising 1006 cases and 1200 controls, encompassing 1,868,144.
2.3. SNP Selection and Genotyping in Replication Stage
A subset of SNPs independently associated with the risk of calcium nephrolithiasis was selected for replication based on the following criteria: (i) each locus with one SNP reaching p < 5 × 10^−8^ or (ii) each locus with minimum five SNPs reaching p < 1.0 × 10^−6^ for discovery samples, and (iii) linkage disequilibrium (LD) r^2^ < 0.2 between markers. Genotyping analyses of replication for 445 case samples were conducted using the ASA BeadChip (Illumina Inc., San Diego, CA, USA). Genotyping for 1008 control samples was performed using the Illumina Human OmniExpress BeadChip in our previously published GWAS [19], which was conducted in a male cohort; consequently, all controls were male. To account for this sex imbalance, sex was included as a covariate in the logistic regression models for the replication analysis. All SNPs satisfied a call rate > 98% and had no deviation from HWE (p ≥ 0.001) in the controls.
2.4. Quantitative Trait Locus Analysis
The GWAS-level significant SNPs were used to assess the association with blood parameters, including metabolic traits (body mass index (BMI), total cholesterol, triglycerides, serum high-density lipoprotein (HDL), low-density lipoprotein (LDL), albumin and blood glucose), kidney-related traits (serum urea, urea, uric acid, estimated glomerular filtration rate (eGFR) and urine pH), and electrolytes traits (serum sodium, potassium, chloride, CO_2_, calcium, magnesium, phosphorus, alkaline phosphatase (ALP), parathyroid hormone (PTH) and calcifediol) through quantitative trait locus (QTL) analysis. Laboratory measurements were conducted within two weeks prior to surgery.
2.5. Statistical Analysis
The logistic regression model was utilized to assess the relationship between SNPs and the risk of calcium nephrolithiasis, based on an additive genetic model, employing PLINK version 1.07. Odds ratios (ORs) and 95% confidence intervals (CIs) were estimated with adjustment for the age, gender, and the top two principal components to control for potential population stratification. Principal component analysis (PCA), conducted using the PLINK software package (version 1.07), accounted for population stratification and sample quality control, with PCA results visualized through the R statistical program [20]. The results obtained from both the initial GWAS and the subsequent replication phase were integrated through a comprehensive meta-analysis. To evaluate the consistency of these findings across studies, we employed Cochran’s Q statistic and the I^2^ statistic to quantify the extent of variation across studies that could be attributed to heterogeneity rather than random chance. For the meta-analysis, a fixed-effects model (Mantel–Haenszel method) was applied for SNPs with I^2^ ≤ 50%, while a random-effects model (DerSimonian and Laird method) was used for SNPs with I^2^ > 50%. We established a stringent genome-wide significance threshold of p < 5.0 × 10^−8^, while in the replication stage, significance was determined post-Bonferroni correction at p < 3.33 × 10^−3^ (0.05/15). Regional plots were generated using LocusZoom [21], and data analysis was conducted using SAS 9.2 (SAS Institute, Cary, NC, USA) and R 4.3.3.
3. Results
3.1. Discovery Phase
We performed a multistage GWAS of calcium nephrolithiasis in the Han Chinese population, with a total of 1451 calcium nephrolithiasis cases and 2208 controls. The characteristics of the subjects in each stage are summarized in Table 1.
In the first discovery stage, 1,868,144 SNPs were genotyped using the Illumina ASA Beadchips in a cohort of 1010 cases and 1200 controls. After quality control filtering, 1,837,032 SNPs remained for analysis across 1006 cases and 1200 controls. Association analyses were conducted using logistic regression adjusting for age and gender as covariates, with the results illustrated in the Manhattan plots shown in Figure 1. PCA results did not indicate substantial stratification within the study population, and the quantile-quantile (Q-Q) plot analysis revealed a genomic inflation factor (λ) value of 1.6006 (Supplementary Figure S1). To further evaluate the source of this inflation, we performed LD score regression analysis, which yielded an intercept of 1.07, indicating that the observed inflation is largely attributable to polygenic architecture rather than confounding biases such as population stratification. In the discovery stage, 827 SNPs had an association p value < 1.0 × 10^−6^, and 241 SNPs attained the genome-wide significance (p < 5.0 × 10^−8^) (Supplementary Table S1). Additionally, we evaluated the performance of previously identified SNPs related to nephrolithiasis risk: out of the 42 reported SNPs, 12 were included in our discovery phase, with 3 achieving statistical significance (Supplementary Table S2).
3.2. Replication Phase
To validate the suggestive associations identified during the discovery stage, we selected SNPs that were independently associated with the condition for a validation study. This subsequent phase involved an additional 445 cases and 1008 controls, all of whom were from the Chinese Han population. We selected 15 SNPs in 15 genomic regions with significant associations using the thresholds of p < 1.0 × 10^−6^ and r^2^ < 0.2, and adjusted for lead SNPs in each region. Five loci on chromosomes 4, 5, 10, 14 and 18 were successfully replicated: rs3736194 at 4q35.1 (p = 3.18 × 10^−5^), rs356450 at 5q31.2 (p = 3.87 × 10^−4^), rs17155703 at 10p13 (p = 5.85 × 10^−4^), rs10151594 at 14q22.1 (p = 5.60 × 10^−4^), and rs55826947 at (p = 3.11 × 10^−3^) (Figure 2a–e, Supplementary Table S3).
3.3. Combined Analysis
Employing a meta-analysis to integrate the discovery and replication stage, three novel SNPs, rs3736194 at 4q35.1 (OR = 0.6279, Pmeta = 2.84 × 10^−13^), rs356450 at 5q31.2 (OR = 2.0312, Pmeta = 6.09 × 10^−16^), and rs55826947 at (OR = 0.6017, Pmeta = 6.29 × 10^−10^), exceeded the genome-wide significant threshold (Table 2). However, rs17155703 in FAM107B at 10p13 (OR = 1.0503, Pmeta = 0.9385) and rs10151594 in FRMD6 at 14q22.1 (OR = 1.0545, Pmeta = 0.9214) failed to reach the genome-wide significance. No evidence for heterogeneity was found among the two stages at rs3736194 (Phet = 0.7457) and rs55826947 (Phet = 0.2246).
3.4. Quantitative Trait Locus Analysis
To further investigate the roles of SNPs in the pathogenesis of calcium nephrolithiasis, we explored the associations between five genetic variants and 22 quantitative traits across three categories: metabolic, kidney-related, and electrolyte (Table 3). Notably, the G allele of rs3736194 was significantly linked to increased serum carbon dioxide levels (p = 0.04666), while the G allele of rs356450 was associated with lower serum chloride levels (p = 0.02992). Additionally, the T allele of rs55826947 correlated with a higher BMI (p = 0.03174) and showed a suggestive association with decreased serum alkaline phosphatase levels (p = 0.05634). The C allele of rs17155703 was significantly associated with reduced serum albumin levels (p = 0.04223), and the A allele of rs10151594 exhibited a suggestive association with elevated serum albumin levels (p = 0.05809).
4. Discussion
To our knowledge, there has been no GWAS on the potential genetic association with the risk of calcium nephrolithiasis in the Chinese Han population. By conducting a two-stage GWAS analysis with 1451 calcium nephrolithiasis cases and 2208 controls, we identified three novel loci on 4q35.1, 5q31.2, and 18q21.2, which were associated with calcium nephrolithiasis.
rs3736194 on 4q35.1 is an intronic variant in SORBS2, encoding Sorbin and SH3 Domain Containing 2, which is a key number of the sorbin homology family of adapter and scaffold proteins. In the glomeruli of kidney, SORBS2 is involved in the formation of stress fibers in the glomeruli, which is suggested to be associated with the development of diabetic nephropathy [22]. Beyond its role in glomerular pathology, SORBS2 may contribute to nephrolithiasis through mechanisms involving crystal formation and handling. In patients with coronary atherosclerosis, SORBS2 is substantiated to regulate lipid-induced inflammation and foam cell formation [23]. This is particularly relevant because medullary macrophages in the kidney are known to monitor and remove crystal particles from the tubular fluid and interstitium, playing a crucial role in preventing stone retention [24]. Therefore, we speculate that SORBS2 variants could influence the inflammatory phenotype or phagocytic capacity of these renal macrophages, thereby affecting crystal clearance and promoting nephrolithiasis. In our study, the risk allele of rs3736194 was found to be significantly associated with elevated serum carbon dioxide levels. This finding underscores a potential genetic link between this specific allele and altered metabolic processes affecting carbon dioxide in the bloodstream. It has been reported that the risk of calcium phosphate stones increased in patients with higher level of serum CO_2_ [25]. Our findings indicate that the variation in SORBS2 might increase the risk of calcium nephrolithiasis by influencing abnormal serum CO_2_ metabolism.
rs356450 is an intronic variant in CXXC5 on 5q31.2, encoding a retinoid-inducible nuclear factor (RINF), and is a member of the CXXC-type zinc-finger protein family. CXXC5 is highly expressed in adipose tissue of obese patients with type 2 diabetes. The variation in CXXC5 expression has been reported to be resistant to obesity-related metabolic diseases [26]. Most notably, CXXC5 has been identified as a vitamin D receptor (VDR) interacting protein, which is a potent and dose-dependent activator of VDR/vitamin D responsive element-mediated transcription [27]. VDR is a master regulator of calcium homeostasis, directly controlling the expression of genes involved in intestinal calcium absorption and renal calcium transport. Polymorphisms in the VDR itself have been consistently associated with calcium nephrolithiasis and disorders of bone mineral density. It has been reported that CXXC5 knockout mice exhibit high bone mass phenotypes, with significantly increased bone thickness and bone volume density [28]. Since the variation in the VDR gene is associated with diseases of abnormal calcium metabolism, such as mineral density regulation and the development of calcium nephrolithiasis, we propose that the association of CXXC5 with nephrolithiasis is mediated through its regulatory effects on VDR signaling, leading to altered calcium homeostasis and increased stone risk.
rs55826947 on 18q21.2 is an upstream transcript variant of MBD2, a gene encoding Methyl-CpG-binding domain protein 2 (MBD2), a protein reader of methylation. Recent studies have demonstrated that the inhibition of MBD2 can effectively mitigate the progression of renal fibrosis induced by unilateral ureteral obstruction and ischemia/reperfusion [29]. Moreover, the MBD2 gene is believed to be involved in obesity and insulin resistance [30]. Insulin resistance, a core feature of obesity, leads to impaired renal ammonia production and low urine pH, which promotes the crystallization of calcium oxalate and uric acid. In our study, the variation in MBD2 was associated with higher BMI. Epidemiologic studies have demonstrated that stone risk incidence increases with BMI, a well-established risk factor for stone formation. Given the epidemiological link between increasing BMI and stone risk [31], we speculated that the variance in MBD2 might increase the risk of calcium nephrolithiasis by increasing by body weight.
We observed a suggestive association of rs17155703 on 10p13 with calcium nephrolithiasis. rs17155703 is an intronic variant in FAM107B, encoding Family with sequence similarity 107 member B (FAM107B) protein, an 18 kDa nuclear protein belonging to the Family with sequence similarity 107, also known as heat shock-inducible tumor small protein (HITS) [32]. The role of the gene FAM107B in the pathogenesis of nephrolithiasis remains unclear. Further research is required to determine how this gene may contribute to stone formation in the kidneys. Additionally, the chromosomal region 10p13 has been identified as a susceptibility locus for type 2 diabetes mellitus through GWASs conducted in both European and Asian populations [33,34].
In the present study, rs10151594 on 14q22.1 showed a suggestive association and is an upstream transcript variant of FRMD6, which encodes the FERM Domain Containing 6 (FRMD6). This protein serves as a key activator of the Hippo kinase pathway, playing crucial roles in regulating cell proliferation and apoptosis [35]. FRMD6 has been implicated in a range of complex diseases, including asthma, aortic aneurysms, Alzheimer’s disease, and some cancers [36,37]. Additionally, it has been reported that the variation in FRMD6 in immune cells could impact the binding properties of VDR to its receptor elements within FRMD6, potentially influencing the transcriptional response of FRMD6 to 1,25-dihydroxyvitamin D3 [38].
Several limitations of this study should be acknowledged. First, the sample size in the initial GWAS discovery stage was insufficient to identify all potential genetic susceptibility loci associated with calcium nephrolithiasis. Second, the control samples in the replication stages were genotyped using a different platform. This cross-platform design may have introduced technical variability, potentially affecting the consistency of the replication signals. Finally, the biological interpretation of the identified loci remains largely speculative and is based on indirect evidence. Further experimental studies, including functional assays and animal models, are warranted to elucidate the precise mechanisms by which these genes may contribute to the pathogenesis of calcium nephrolithiasis.
5. Conclusions
In conclusion, we conducted a GWAS of calcium nephrolithiasis in the Chinese Han population for the first time, identifying three novel susceptibility loci located on 4q35.1, 5q31.2, and 18q21.2. While further investigation is needed to elucidate the molecular mechanisms by which these variations may elevate the risk of nephrolithiasis, these findings indicate that factors such as calcium homeostasis, renal function, and metabolism could play significant roles in the development of calcium nephrolithiasis.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Scales C.D.Jr. Smith A.C. Hanley J.M. Saigal C.S. Urologic Diseases in America Project Prevalence of kidney stones in the United States Eur. Urol.20126216016510.1016/j.eururo.2012.03.05222498635 PMC 3362665 · doi ↗ · pubmed ↗
- 2Zeng Q. He Y. Age-specific prevalence of kidney stones in Chinese urban inhabitants Urolithiasis 201341919310.1007/s 00240-012-0520-023532431 · doi ↗ · pubmed ↗
- 3Pearle M.S. Goldfarb D.S. Assimos D.G. Curhan G. Denu-Ciocca C.J. Matlaga B.R. Monga M. Penniston K.L. Preminger G.M. Turk T.M. Medical management of kidney stones: AUA guideline J. Urol.201419231632410.1016/j.juro.2014.05.00624857648 · doi ↗ · pubmed ↗
- 4Coe F.L. Evan A. Worcester E. Kidney stone disease J. Clin. Investig.20051152598260810.1172/JCI 2666216200192 PMC 1236703 · doi ↗ · pubmed ↗
- 5Curhan G.C. Willett W.C. Rimm E.B. Stampfer M.J. Family history and risk of kidney stones J. Am. Soc. Nephrol.199781568157310.1681/ASN.V 81015689335385 · doi ↗ · pubmed ↗
- 6Goldfarb D.S. Avery A.R. Beara-Lasic L. Duncan G.E. Goldberg J. A Twin Study of Genetic Influences on Nephrolithiasis in Women and Men Kidney Int. Rep.2019453554010.1016/j.ekir.2018.11.01730993229 PMC 6451147 · doi ↗ · pubmed ↗
- 7Hemminki K. Hemminki O. Forsti A. Sundquist K. Sundquist J. Li X. Familial risks in urolithiasis in the population of Sweden BJU Int.201812147948510.1111/bju.1409629235239 · doi ↗ · pubmed ↗
- 8Thorleifsson G. Holm H. Edvardsson V. Walters G.B. Styrkarsdottir U. Gudbjartsson D.F. Sulem P. Halldorsson B.V. de Vegt F. d’Ancona F.C. Sequence variants in the CLDN 14 gene associate with kidney stones and bone mineral density Nat. Genet.20094192693010.1038/ng.40419561606 · doi ↗ · pubmed ↗
