Genetic Analysis Reveals a Protective Effect of Sphingomyelin on Cholelithiasis
Kun Mao, Ang Li, Haochen Liu, Yuntong Gao, Ziyan Wang, Xisu Wang, Shixuan Liu, Ziyuan Gao, Jiaqi Quan, Moyan Shao, Yunxi Liu, Liang Shi, Bo Zhang, Tianxiao Zhang

TL;DR
This study finds that sphingomyelin has a protective effect against gallstones, with LDL playing a key role in this relationship.
Contribution
The novel contribution is identifying a protective causal effect of sphingomyelin on cholelithiasis mediated by LDL, supported by genetic evidence.
Findings
Sphingomyelin has a significant protective causal effect on cholelithiasis (p-value = 0.0002).
LDL mediates 59.18% of the causal relationship between sphingomyelin and cholelithiasis.
Seven genes (e.g., GCKR, ABCG8) are colocalized with the causal signal between sphingomyelin and cholelithiasis.
Abstract
Background: Cholelithiasis is the most common disorder affecting the biliary system. Choline is an essential nutrient in the human diet and is crucial for the synthesis of neurotransmitters. Previous studies have suggested an association between choline metabolites and cholelithiasis. However, the underlying mechanisms remain unclear. This research aims to fill the knowledge gap regarding the role of choline metabolites in cholelithiasis. Methods: Genetic data related to choline metabolites and other covariates were retrieved from the U.K. Biobank and IEU OpenGWAS database. Two-sample (TSMR) and multivariate Mendelian randomization (MVMR) analyses, mediation analysis, linkage disequilibrium score regression (LDSC), colocalization analysis, and enrichment analysis were performed. Results: A significant causal relationship was identified between serum level of sphingomyelin and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —Hygiene and Health Care Scientific Research Program of Shaanxi Province
- —Scientific Research Program of Shaanxi Provincial Centre for Disease Control and Prevention
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDrug Transport and Resistance Mechanisms · Pediatric Hepatobiliary Diseases and Treatments · Liver Disease Diagnosis and Treatment
1. Introduction
Cholelithiasis is a common condition characterized by the accumulation of elevated levels of cholesterol or bilirubin—a breakdown product of hemoglobin—in the bile, which leads to the formation of stones within the biliary system, including the gallbladder and bile ducts. Approximately 20% of adults are affected by cholelithiasis, with more than 20% of these individuals developing symptoms and complications. Consequently, cholelithiasis imposes a substantial socioeconomic burden as a prevalent and costly digestive disorder [1]. Extensive research has identified several risk factors for cholelithiasis. Non-modifiable factors, including race, female sex, pregnancy, and age over 40 years, play a particularly significant role. Modifiable risk factors for cholelithiasis include obesity, a high-calorie diet, metabolic syndrome, and dyslipidemia. At present, surgical intervention constitutes the primary treatment for symptomatic cholelithiasis [2].
Choline, an essential nutrient in the human diet, is crucial for the synthesis of neurotransmitters such as acetylcholine, as well as for the production of methyl donors, betaine, and phospholipids. It plays vital roles in cell maintenance and growth throughout life, including neurotransmission, membrane synthesis, lipid transport, and single-carbon metabolism [3]. It is imperative to note that essential choline exists in several forms, including water-soluble free choline, fat-soluble phosphatidylcholine, and sphingomyelin. These forms are absorbed and metabolized through distinct pathways, which influence their bioavailability [4]. Previous studies have suggested an association between sphingolipids and cholelithiasis [5]. However, the underlying mechanisms between choline metabolites and cholelithiasis remain unclear.
Mendelian randomization (MR) is a highly efficacious tool for the identification of causal risk factors that underpin complex traits and diseases by means of genetic data. By employing single-nucleotide polymorphisms (SNPs) from genome-wide association studies (GWAS) as instrumental variables (IVs) for exposure, MR controls confounding factors and avoids reverse causation, strengthening causal inferences [6,7,8,9]. In the present study, the causal relationship between choline metabolite levels and cholelithiasis was examined using MR analysis. This current research aims to fill the knowledge gap regarding the role of choline metabolites in cholelithiasis and provide valuable insights for developing more effective preventive and treatment strategies. The graphical abstract is presented in Supplementary Figure S1.
2. Materials and Methods
2.1. Study Design
The study design and the assumptions of an MR study are presented in Figure 1. The risk factors under consideration include choline metabolites and other covariates, including LDL, HDL, triglycerides, and coronary artery disease (CAD), which were retrieved from specific data sources. The analyses include MR, LDSC, colocalization analysis, and enrichment analysis. The selection of genetic markers adheres to the three fundamental principles of instrumental variables. Firstly, the genetic markers must demonstrate a strong association with the exposure. Secondly, they must not demonstrate a direct association with the outcome. Thirdly, they must influence the outcome exclusively through their effect on the exposure. This study was conducted in strict accordance with the STROBE-MR checklist [10]. The code employed in this article is available in Supplementary Material S1.
2.2. Data Sources
The serum levels of three choline metabolites—total choline, phosphatidylcholine, and sphingomyelin—were selected as exposure factors from the U.K. Biobank (n = 114,999). The GWAS summary statistics for these metabolites are accessible via the IEU OpenGWAS database (https://gwas.mrcieu.ac.uk, accessed on 10 November 2024), with identifiers of met-d-cholines, met-d-phosphatidylcholines, and met-d-sphingomyelins (Borges CM) [11]. The genetic data of cholelithiasis from the Neale Lab within the U.K. Biobank were used for endpoints (http://www.nealelab.is/uk-biobank, accessed on 10 November 2024). The IEU OpenGWAS database (GWAS identifier: ukb-a-559, n = 337,199) was utilized for the purpose of data retrieval, and cholelithiasis was diagnosed according to ICD-10 (K80). For MVMR, four additional covariates were selected and adjusted in the analyses: LDL (ieu-b-5089, n = 201,678), HDL (ieu-b-109, n = 403,943), triglycerides (ieu-b-111, n = 441,016) [12], and CAD (ebi-a-GCST90013868, n = 352,063) [13]. The serum levels of three choline metabolites—total choline, phosphatidylcholine, and sphingomyelin—were selected as exposure factors from the U.K. Biobank (n = 114,999). The GWAS summary statistics for these metabolites are accessible via the IEU OpenGWAS database (https://gwas.mrcieu.ac.uk, accessed on 10 November 2024), with identifiers of met-d-cholines, met-d-phosphatidylcholines, and met-d-sphingomyelins (Borges CM) [11]. The genetic data of cholelithiasis from the Neale Lab within the U.K. Biobank were used for endpoints (http://www.nealelab.is/uk-biobank, accessed on 10 November 2024). The IEU OpenGWAS database (GWAS identifier: ukb-a-559, n = 337,199) was utilized for the purpose of data retrieval, and the diagnosis of cholelithiasis was made in accordance with the International Classification of Diseases, Tenth Revision (ICD-10, K80). In the context of multivariate Mendelian randomization (MVMR), the analysis incorporated four additional covariates, which were selected and adjusted as follows: LDL (ieu-b-5089, n = 201,678), HDL (ieu-b-109, n = 403,943), triglycerides (ieu-b-111, n = 441,016) [12], and CAD (ebi-a-GCST90013868, n = 352,063) [13]. The genetic data were derived exclusively from European populations and had undergone rigorous quality control measures and ethical approval, thus ensuring direct applicability.
2.3. Genetic Instrument Selection
SNPs with strong correlations (p-value < 5 × 10^−8^) were initially selected as candidate IVs. Linkage disequilibrium (LD) among them was estimated. One genetic marker was excluded on the basis that the r^2^ values of the corresponding SNP pair were greater than 0.001 (with a distance of less than 10,000 kb). To remove weak instrumental variables, an F-test was performed for each SNP, and SNPs with F-values less than 10 were excluded. Phenoscaner (https://github.com/phenoscanner/phenoscanner, accessed on 10 November 2024) was utilized to search and remove confounding SNPs [14,15]. Finally, a total of 49, 48, and 48 SNPs were identified as IVs for total choline, phosphatidylcholine, and sphingomyelin, respectively.
2.4. TSMR and MVMR Analyses
Mendelian randomization satisfies the relevance assumption, the independence assumption, and the exclusion restriction [16]. Five MR methods, including MR-Egger [17], weighted median [18], inverse variance weighted (IVW) [19], simple mode, and weighted mode [20], were utilized for MR analyses using the TwoSampleMR package (http://www.r-project.org, accessed on 10 November 2024). The IVW estimates were selected as the primary approach, while the other four methods were employed to enhance robustness across diverse scenarios. A significant causal relationship was considered when the p-value of IVW was less than 0.05, and the results of all five methods had the same direction. Heterogeneity was assessed using the MR-Egger Cochran Q test [21]. The MR-Egger intercept [22] and the MR-Presso global test [23] were utilized to test for pleiotropy. In instances where heterogeneity and pleiotropy were detected, the MR-Presso outlier method [23] was employed to remove outliers, and the analysis was repeated. Furthermore, leave-one-out tests for each SNP were conducted using the IVW results [24], and the SNP effect values were visualized in a funnel plot [25] to ensure that no outliers were interfering with the model results.
MVMR was carried out including LDL, HDL, triglycerides, and CAD as covariates. Results were evaluated using the IVW random effects model and the MR-Egger model.
2.5. Reverse and Mediated Mendelian Randomization Analysis
Reverse and mediated MR analysis was conducted for sphingomyelin, which was identified as a significant positive causal factor for cholelithiasis. The analysis incorporated four factors as mediators. In the analysis of the mediator Mendelian randomization, the effect of sphingomyelin on LDL was β1, the effects of the mediators on cholelithiasis were β2, and the indirect effect was calculated using β1 × β2. The significance of the indirect effect was then tested using a stepwise testing method.
2.6. LDSC and Colocalization Analysis
LDSC and colocalization analyses were performed to investigate genetic correlations between the three exposure factors and cholelithiasis [26,27,28]. The coloc package was used for colocalization analysis (p1 = 1 × 10^−4^, p2 = 1 × 10^−4^, p12 = 1 × 10^−5^) [28]. SNPs associated with exposure factors were selected to define the colocalization regions based on a significance threshold of p-value < 1 × 10^−6^. Linkage disequilibrium (LD) pruning was performed within a 100 kb window by removing SNPs with r^2^ < 0.001. For the remaining SNPs, regions of colocalization were defined by extending 50 kb on either side of the selected SNPs. Subsequently, a full Bayesian colocalization analysis was conducted, utilizing Bayes factors. The threshold for posterior probability for hypothesis 4 (PPH4) was set at > 0.9.
2.7. Gene Enrichment Analysis
Gene enrichment analysis was further conducted for loci obtained from colocalization analysis using the NCBI website (https://www.ncbi.nlm.nih.gov/, accessed on 10 November 2024) and the STRING database (https://string-db.org/, accessed on 10 November 2024) [29]. Enrichment analysis was then performed using GO and KEGG databases [30].
3. Results
3.1. Significant Signals Were Identified for Causal Relationship Between Sphingomyelin and Cholelithiasis
In TSMR analysis, heterogeneity was detected (p-value < 0.05). The MR-Presso method excluded specific SNPs for total choline, phospholipids, and sphingolipids, respectively. Subsequently, a significant causal relationship was identified between serum level of sphingomyelin and cholelithiasis (OR [95%CI] = 1.0049 [1.0028–1.0070], p-value = 0.0002) (Figure 2). No horizontal pleiotropy was detected by MR-Egger intercepts, while the MR-Presso global test indicated some presence (Supplementary Table S1). Leave-one-out tests and funnel plots did not identify SNPs with a major impact on the results (Supplementary Figure S2).
3.2. Low-Density Lipoprotein (LDL) and High-Density Lipoprotein (HDL) Might Mediate Part of the Causal Relationship Between Sphingomyelin and Cholelithiasis
A protective causal effect was identified in the MVMR analysis (Table 1). The following mediated MR analysis indicated that only LDL and HDL mediated part of the causal relationship between sphingomyelin and cholelithiasis (Figure 3, Supplementary Table S2). The indirect effects of LDL were found to be −0.0029, and the indirect effects of HDL were −0.0018. The LDL accounted for 59.18% of the total effect of sphingomyelin on cholelithiasis; however, the mediated proportion of HDL did not reach statistical significance. Reverse MR analysis revealed that the causal relationship between cholelithiasis and sphingolipids was not significant (p-value for IVW = 0.2520, p-value for MR-Egger intercept = 0.1790, p-value for Cochran Q test = 3.7161 × 10^−28^).
3.3. Bioinformatics Evidence for Unraveling the Significant Causal Signal Between Sphingomyelin and Cholelithiasis
LDSC analysis demonstrated genetic correlations between serum levels of three risk factors (total choline, phosphatidylcholine, sphingomyelin) and cholelithiasis at the whole-genome level (Supplementary Table S3). Colocalization analysis revealed abundant colocalization signals, thereby identifying specific SNPs mapped to genes, including GCKR (glucokinase regulatory protein), SNX17 (sorting nexin-17), ABCG8 (ATP-binding cassette sub-family G member 5), MARCH8 (membrane-associated ring-CH-type finger 8), FUT2 (fucosyltransferase 2), APOH (apolipoprotein H) and HNF1A (hepatocyte nuclear factor 1α) (Figure 4, Supplementary Tables S4 and S5).
These seven genes were found to be enriched in multiple biological processes, cellular components, and molecular functions in the GO database. In the KEGG database, these genes were identified to be enriched in glycosphingolipid biosynthesis—lacto and neolacto series, cholesterol metabolism, maturity-onset diabetes of the young, glycosaminoglycan biosynthesis—keratan sulfate, and glycosphingolipid biosynthesis—globo and isoglobo series (Supplementary Figure S3).
4. Discussion
Choline, a trace element in plasma that performs crucial physiological functions in the human body, is often overlooked in the development of disease. In the present study, a significant causal relationship was identified between sphingomyelin, important choline metabolites, and cholelithiasis. Further analysis suggested that this protective causal effect may be mediated by serum levels of HDL and LDL. In addition, seven genes enriched in glycosphingolipid biosynthesis were also identified as contributing to sphingomyelin and cholelithiasis through colocalization analysis.
TSMR analysis revealed a significant causal relationship between sphingomyelin and cholelithiasis after the elimination of outliers. MVMR analysis was used to adjust for suspected confounders and account for pleiotropy. A protective effect was identified through MVMR between serum levels of sphingomyelin and cholelithiasis. These findings were inconsistent with a previous study performed by Jiarui et al. [5] This finding was further illustrated in followed mediating analysis. LDL and HDL levels play an important mediating role between sphingolipids and cholelithiasis. TSMR showed that sphingolipids directly increased the risk of cholelithiasis but also increased LDL. The indirect effect of sphingolipids in reducing the risk of cholelithiasis by raising LDL was significantly greater than their direct effect, as shown by the results of MVMR. Thus, the overall effect of sphingolipids is protective against cholelithiasis. The effect of HDL was similar to that of LDL, but its effect was smaller. This partly explains why adjusting for HDL in MVMR did not change the causal relationship between sphingolipids and cholelithiasis.
LDSC analysis identified genome-wide genetic correlations between three choline metabolites and cholelithiasis. Subsequent colocalization analysis revealed seven potential colocalized loci between sphingomyelin and cholelithiasis, suggesting that multiple genes may contribute to this genetic correlation. It provides a brief discussion of the functions of the seven identified genes that would help elucidate their potential roles in the genetic correlation between sphingomyelin and cholelithiasis. Among the colocalized genes, GCKR functions in the liver through protein–protein interactions, facilitating its nuclear localization [31]. HNF1A is involved in glycolipid metabolism [32]. ABCG5 encodes a transporter protein involved in the excretion of cholesterol and promotes cholesterol excretion [33]. Common mutations in ABCG5 confer the majority of the genetic risk for cholelithiasis, accounting for approximately 25% of the total risk [1]. Previous studies have shown that patients with cholesterol stone disease and cholecystitis have increased levels of ABCG5 expression [34]. In conclusion, choline metabolites may combat cholelithiasis by promoting cholesterol excretion and transport. Multiple lines of evidence indicate that these three genes are associated with diabetes, as reported in previous studies [35,36,37]. However, the present study is constrained by its reliance on public databases; further experimental validation is required to enhance its credibility. Subsequent studies may contribute to the enhancement of the study’s credibility. Cholelithiasis is also a prevalent gastrointestinal manifestation of diabetes, suggesting a genetic correlation between the two conditions [38]. Sphingomyelin has been identified to be associated with diabetes [39] and has also been demonstrated to influence the progression of diabetic kidney disease [40]. Enrichment analysis also demonstrated that the localized genes are implicated in lipid and carbohydrate metabolism and are associated with maturity-onset diabetes in young people. These findings suggest that the underlying molecular mechanisms responsible for sphingomyelin levels may contribute to the development of both cholelithiasis and diabetes. This study suffers from several limitations. Firstly, the analysis was restricted to European ancestry samples, which limits the generalisability of the findings. Further validation studies are still needed to generalize the findings to other populations. Additionally, due to limitations in the data availability, not all choline metabolites were analyzed. In order to obtain more robust results, larger data sets from multiple sources are required. While the study provides insights into the role of choline metabolites in cholelithiasis etiology, replication in more diverse populations is needed for definitive conclusions to be drawn. Furthermore, the current research reveals causal reference, and would not have great significance for clinical practice. Related clinical translational research is needed in the future.
5. Conclusions
In conclusion, the present study has identified a protective effect between sphingomyelin and cholelithiasis. This effect is largely mediated by LDL. The findings of this study offer valuable information for further exploration of the molecular mechanisms of cholelithiasis.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lammert F. Gurusamy K. Ko C.W. Miquel J.F. Méndez-Sánchez N. Portincasa P. Van Erpecum K.J. Van Laarhoven C.J. Wang D.Q.H. Gallstones Nat. Rev. Dis. Primers 201621602410.1038/nrdp.2016.2427121416 · doi ↗ · pubmed ↗
- 2Littlefield A. Lenahan C. Cholelithiasis: Presentation and Management J. Midwifery Womens Health 20196428929710.1111/jmwh.1295930908805 · doi ↗ · pubmed ↗
- 3Wiedeman A.M. Barr S.I. Green T.J. Xu Z.M. Innis S.M. Kitts D.D. Dietary Choline Intake: Current State of Knowledge Across the Life Cycle Nutrients 201810151310.3390/nu 1010151330332744 PMC 6213596 · doi ↗ · pubmed ↗
- 4Zeisel S.H. Dietary choline—Biochemistry, physiology, and pharmacology Annu. Rev. Nutr.198119512110.1146/annurev.nu.01.070181.0005236764726 · doi ↗ · pubmed ↗
- 5Mi J.R. Jiang L.J. Liu Z.Y. Wu X. Zhao N. Wang Y.Z. Bai X.Y. Identification of blood metabolites linked to the risk of cholelithiasis: A comprehensive Mendelian randomization study Hepatol. Int.2022161484149310.1007/s 12072-022-10360-535704268 · doi ↗ · pubmed ↗
- 6Lin J.F. Zhou J.W. Xu Y. Potential drug targets for multiple sclerosis identified through Mendelian randomization analysis Brain 20231463364337210.1093/brain/awad 07036864689 PMC 10393411 · doi ↗ · pubmed ↗
- 7Li Y.J. Li Q.X. Cao Z.Q. Wu J.H. Multicenter proteome-wide Mendelian randomization study identifies causal plasma proteins in melanoma and non-melanoma skin cancers Commun. Biol.2024785710.1038/s 42003-024-06538-239003418 PMC 11246481 · doi ↗ · pubmed ↗
- 8Li H.R. Du S. Dai J.L. Jiang Y.K. Li Z.M. Fan Q.H. Zhang Y.X. You D.F. Zhang R.Y. Zhao Y. Proteome-wide Mendelian randomization identifies causal plasma proteins in lung cancer Iscience 20242710898510.1016/j.isci.2024.10898538333712 PMC 10850776 · doi ↗ · pubmed ↗
