Diet–Gene Interaction Between Fruit Intake and CMIP rs2925979 Polymorphism in Relation to Type 2 Diabetes: A Family-Based Study in Northern China
Liangchun Kuo, Yinxi Tan, Yiqun Wu, Xueying Qin, Haiying Gong, Yao Zhao, Tao Wu, Dafang Chen, Mengying Wang, Junbo Wang, Yonghua Hu

TL;DR
This study found that fruit intake and a specific genetic variant interact to influence the risk of type 2 diabetes in a Chinese population.
Contribution
The study identifies a gene–diet interaction between fruit consumption and the CMIP rs2925979 polymorphism in relation to T2DM risk.
Findings
Higher fruit intake was inversely associated with T2DM risk.
The CMIP rs2925979 polymorphism modified the protective effect of fruit intake on T2DM.
The protective effect was observed only in individuals with the T allele of rs2925979.
Abstract
Background/Objectives: This study aimed to investigate the association between dietary intake and the risk of type 2 diabetes mellitus (T2DM) in a rural northern Chinese population, and to explore potential gene–diet interactions that may influence T2DM susceptibility. Methods: A total of 1747 participants (1138 with T2DM and 609 without) were included, using baseline data from a family-based cohort study in rural northern China. Demographic characteristics, lifestyle factors, and medical history were collected via standardized questionnaires. Dietary intake was assessed using a semi-quantitative food frequency questionnaire, and anthropometric measurements were conducted according to standardized protocols. Based on findings from previous genome-wide association studies, several T2DM-related single-nucleotide polymorphisms were selected for genotyping. Generalized linear models…
Funding
- National Natural Science Foundation of China
- National Key Research and Development Program of China
- Natural Science Foundation of Beijing
- Shanxi Province Higher Education “Billion Project”
- School of Public Health, Shanxi Medical University “233”
Taxonomy
Topics: Diet, Metabolism, and Disease · Nutrition, Genetics, and Disease · Diet and metabolism studies
1. Introduction
Type 2 diabetes mellitus (T2DM) accounts for over 90% of all diabetes cases globally, and is the most prevalent and fastest-growing form of the disease. Diagnosis of T2DM is based on criteria established by the American Diabetes Association (ADA) [1] and the World Health Organization (WHO), which include fasting plasma glucose ≥ 7.0 mmol/L, 2-h plasma glucose ≥ 11.1 mmol/L after an oral glucose tolerance test (OGTT), or glycated hemoglobin (HbA1c) ≥ 6.5% [2].
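The thresholds above can be expressed as a simple rule. The following is a minimal sketch, not the study's case-definition code: the function name and parameter names are hypothetical, and it ignores clinical details such as confirmatory repeat testing.

```python
from typing import Optional

# Thresholds quoted in the text (ADA/WHO criteria).
FPG_THRESHOLD_MMOL_L = 7.0        # fasting plasma glucose
OGTT_2H_THRESHOLD_MMOL_L = 11.1   # 2-h plasma glucose after an OGTT
HBA1C_THRESHOLD_PERCENT = 6.5     # glycated hemoglobin


def meets_t2dm_criteria(
    fpg_mmol_l: Optional[float] = None,
    ogtt_2h_mmol_l: Optional[float] = None,
    hba1c_percent: Optional[float] = None,
) -> bool:
    """Return True if any available measurement reaches its threshold.

    Hypothetical helper: measurements passed as None are skipped.
    """
    checks = [
        (fpg_mmol_l, FPG_THRESHOLD_MMOL_L),
        (ogtt_2h_mmol_l, OGTT_2H_THRESHOLD_MMOL_L),
        (hba1c_percent, HBA1C_THRESHOLD_PERCENT),
    ]
    return any(value is not None and value >= cutoff for value, cutoff in checks)
```

For example, `meets_t2dm_criteria(fpg_mmol_l=6.2, hba1c_percent=6.8)` evaluates to `True` because the HbA1c value alone reaches its threshold.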
According to the International Diabetes Federation (IDF), an estimated 537 million people worldwide are currently living with diabetes. This number is projected to rise to 643 million by 2030 and 783 million by 2045 [1]. In China, the number of people with diabetes increased from 90 million in 2011 to 140 million in 2021, representing a 56% increase [3,4,5,6]. Over the past three decades, five nationally representative cross-sectional studies have shown a steady upward trend in diabetes prevalence. In 1980, the prevalence of diabetes in China was less than 1%; by 2010, it had risen to 11.6%, surpassing the 10% threshold and placing China among the countries with the highest diabetes burden [3,4,5,6]. This persistent rise underscores the urgency of identifying modifiable and non-modifiable risk factors contributing to the growing T2DM epidemic [7,8,9,10].
In recent years, genome-wide association studies (GWAS) have identified more than 400 genetic loci associated with T2DM risk. One gene of particular interest is the c-Maf inducing protein (CMIP) gene, which plays a role in T-cell signaling, lipid metabolism, and insulin resistance pathways [11,12]. The single-nucleotide polymorphism (SNP) rs2925979, located in an intronic region of the CMIP gene [11], has been identified as a susceptibility locus for T2DM in East Asian populations. Notably, studies have shown that although the T allele of rs2925979 is inversely associated with body mass index (BMI) and waist circumference, it is positively associated with T2DM risk in Chinese women, suggesting pleiotropic and sex-specific effects [13]. Moreover, rs2925979 has been linked to lipid metabolism and adiponectin levels, indicating a broader role in metabolic regulation [14,15,16].
Dietary factors, particularly fruit consumption, are well-established in the prevention of T2DM. Fruits are rich in dietary fiber, polyphenols, and antioxidants, which collectively improve insulin sensitivity and reduce systemic inflammation [17]. Large-scale cohort studies and meta-analyses have consistently demonstrated an inverse relationship between fruit intake and T2DM risk [18,19,20]. However, interindividual variability in metabolic responses to dietary factors suggests a possible gene–diet interaction. To date, few studies have examined the interaction between CMIP rs2925979 and fruit intake in relation to T2DM risk, particularly in nutritionally and genetically distinct populations such as rural Chinese residents.
This study examined the potential modifying effect of the CMIP rs2925979 polymorphism on the association between fruit intake and the risk of T2DM, using data from a large family-based cohort study conducted in rural northern China. Given the increasing burden of T2DM in rural populations with limited healthcare access, and the paucity of research on how genetic variation may influence dietary associations in these settings, this investigation addresses a critical knowledge gap by evaluating the interaction between genetic susceptibility and modifiable dietary factors. Exploring gene–diet interactions is essential for advancing precision nutrition and informing population-specific preventive strategies.
We hypothesize that higher fruit intake is associated with a lower risk of T2DM, and that this protective association is more pronounced among carriers of the T allele of the CMIP rs2925979 polymorphism, compared to individuals with the CC (homozygous major allele) genotype. This study investigates the independent and interactive associations of fruit intake and the CMIP rs2925979 polymorphism with T2DM risk, aiming to generate evidence to inform personalized dietary recommendations and targeted prevention strategies in rural Chinese populations.
2. Materials and Methods
2.1. Study Participants
This study used baseline data from the Family Cohort Study of Common Chronic Non-Communicable Diseases in Rural Northern China, as previously described. Nine administrative villages in Fangshan District, Beijing, were selected as study sites due to their demographic, economic, and geographical characteristics representative of northern China. From June 2005 to August 2017, a total of 8323 participants were recruited using a family-based recruitment approach. All participants completed standardized questionnaires, underwent physical measurements, and provided blood samples. The cohort was designed based on genetic relationships, facilitating a more effective assessment of environmental and lifestyle factors while controlling for genetic background differences. The inclusion criteria for this study were: (1) age ≥ 18 years at the time of enrollment; (2) complete dietary survey data; (3) genetic testing data; and (4) voluntary participation with the completion of questionnaires, physical examinations, and biochemical tests. Exclusion criteria included: severe diseases or chronic conditions, such as malignant tumors or severe liver or kidney diseases, that hinder study participation. After applying the inclusion and exclusion criteria, 1747 participants were enrolled, fulfilling the sample size requirement. All participants provided informed consent, and the study was approved by the Biomedical Ethics Committee of Peking University (Approval No: IRB00001052-13027).
2.2. Dietary Assessment
The study employed a semi-quantitative food frequency questionnaire (FFQ) adapted from a validated simplified Chinese version developed for Chinese populations, and further modified to reflect the dietary characteristics of the study population. The questionnaire covered intake of key food groups, including staple foods (e.g., rice, noodles), vegetables, fruits, animal-based foods (e.g., unprocessed meat, processed meat, seafood, eggs), plant-based protein sources (e.g., tofu, nuts), and dairy products. It assessed both the frequency (e.g., days per week) and portion size (e.g., grams per day) of food consumption over the past 12 months. Additional questions addressed beverage intake (alcohol, tea, sugar-sweetened beverages), cooking oil types, salt consumption, and household eating patterns, enabling a comprehensive characterization of dietary behaviors relevant to chronic disease risk.
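To illustrate how frequency and portion-size responses might be combined into an average daily intake, the sketch below multiplies the fraction of days on which a food is eaten by the amount eaten on those days. This conversion rule and the function name are assumptions made for illustration; the paper does not state the exact formula used.

```python
def daily_intake_g(days_per_week: float, grams_per_eating_day: float) -> float:
    """Average intake in grams per day from two FFQ responses.

    Assumed rule (not taken from the paper):
    (days eaten per week / 7) * grams eaten on a day the food is consumed.
    """
    if not 0 <= days_per_week <= 7:
        raise ValueError("days_per_week must be between 0 and 7")
    if grams_per_eating_day < 0:
        raise ValueError("grams_per_eating_day must be non-negative")
    return days_per_week / 7.0 * grams_per_eating_day
```

Under this assumption, a participant who reports eating 200 g of fruit on 3.5 days per week would be assigned an average of 100 g per day.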
2.3. Genotyping and Basic Information on Polymorphic Loci
Based on previous GWAS conducted in European and East Asian populations, this study screened genetic susceptibility loci that reached genome-wide significance (p < 5 × 10⁻⁸) for associations with T2DM and its major metabolic risk factors, including blood pressure, blood lipid levels [11,21,22], insulin resistance [23,24,25], and obesity [26,27,28,29]. Inclusion criteria for SNP selection were as follows: (1) SNPs identified with genome-wide significance (p < 5 × 10⁻⁸) in GWAS for T2DM or its associated metabolic traits; (2) SNPs discovered in East Asian populations were prioritized to enhance population relevance, although variants from European GWAS were included due to the limited availability of East Asian-specific findings; and (3) SNPs previously associated with increased risk of T2DM (i.e., odds ratio [OR] or relative risk [RR] > 1), to facilitate risk-oriented analysis. Exclusion criteria included: (1) SNPs lacking replication in independent GWAS or validation cohorts; (2) SNPs with genotyping call rates below 95% or with >5% missing data; and (3) SNPs exhibiting high linkage disequilibrium (r² > 0.8) with other selected loci, to reduce redundancy. After applying these criteria, SNP rs2925979 in the CMIP gene was selected for genotyping, based on its previously reported association with T2DM risk in East Asian populations. The distribution of CMIP rs2925979 genotypes was assessed for Hardy–Weinberg Equilibrium (HWE) using a chi-square test, and no significant deviation was detected (p = 0.514), which is consistent with the absence of major genotyping error.
2.4. Definition and Standards of Relevant Indicators
The covariate information for this study was obtained from questionnaires and physical measurements and included age, sex, marital status, education level, occupation, waist circumference, hip circumference, blood pressure, stroke history, hypertension history, BMI, smoking status, drinking status, and regular exercise. Waist and hip circumferences were measured with a precision of 0.1 cm. Blood pressure was measured three times with a 1-min interval between measurements, with the first measurement discarded and the average of the last two measurements used as the final value. All measurements were performed by the same researcher using the same blood pressure monitor. Stroke and hypertension histories were recorded as binary variables (“yes” or “no”). BMI was calculated from height and weight, with height measured to the nearest 0.1 cm and weight to the nearest 0.1 kg. Smoking status was categorized as “never smoked” or “smoked”, drinking status as “never drank” or “drank”, and regular exercise as “yes” or “no”.
2.5. Statistical Analysis
Descriptive analyses were conducted on the study participants’ general demographic characteristics, physical measurements, lifestyle behaviors, and personal medical history. For continuous variables following a normal distribution, data were presented as means ± standard deviations; for non-normally distributed variables, medians and interquartile ranges (P25, P75) were used. Categorical variables were described as frequencies (percentages). Since the study participants were not statistically independent, intergroup comparisons were performed using generalized linear models that accounted for familial clustering. To control for potential confounding factors, a generalized linear regression model was constructed with T2DM status as the dependent variable (coded as 0 = no, 1 = yes) and various food categories as independent variables. The model was adjusted for age, sex, family structure, marital status, educational level, occupation, smoking status, alcohol consumption, regular physical activity, hypertension, BMI, waist circumference, total cholesterol, and stroke history. Interaction terms between food categories and genetic loci were included in the model to evaluate the presence of interaction effects, as indicated by the statistical significance of the interaction terms (p-values). Stratified analyses were subsequently conducted according to genotype to assess the associations between individual food categories and T2DM, and to further evaluate the potential effect modification by gene polymorphisms on these associations. All statistical analyses were conducted using R software (version 4.2.2).
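As an illustration of how interaction terms between a food category and a genetic locus could be coded before model fitting, the sketch below builds indicator variables for fruit-intake quartiles (Q1 as reference) and their products with a T-allele carrier indicator. The function name, the column names, and the dominant (carrier versus CC) coding are assumptions for illustration only; the paper does not specify its coding scheme, and the actual analyses were run in R.

```python
from typing import Dict


def code_interaction_terms(fruit_quartile: int, genotype: str) -> Dict[str, int]:
    """Return main-effect and product-term columns for one participant.

    Hypothetical coding: Q1 and CC are the reference levels, and CT/TT
    are collapsed into a single T-carrier indicator.
    """
    if fruit_quartile not in (1, 2, 3, 4):
        raise ValueError("fruit_quartile must be 1, 2, 3, or 4")
    if genotype not in ("CC", "CT", "TT"):
        raise ValueError("genotype must be 'CC', 'CT', or 'TT'")

    t_carrier = int(genotype in ("CT", "TT"))
    row = {"t_carrier": t_carrier}
    for q in (2, 3, 4):
        dummy = int(fruit_quartile == q)
        row[f"fruit_q{q}"] = dummy
        # Product term: equals 1 only for T carriers in quartile q.
        row[f"fruit_q{q}_x_t_carrier"] = dummy * t_carrier
    return row
```

In a model containing these columns alongside the covariates, the coefficients on the product terms would capture how the quartile-specific log odds ratios differ between T carriers and CC homozygotes.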
3. Results
3.1. Basic Characteristics
Table 1 shows that 1747 participants were included in the study, comprising 1138 individuals diagnosed with T2DM and 609 without T2DM. Compared to those without T2DM, individuals in the T2DM group were significantly older and exhibited higher mean values for waist circumference, fasting blood glucose, and HbA1c levels. Furthermore, the prevalence of hypertension was significantly greater in the T2DM group (p < 0.05). In contrast, the proportions of participants who reported alcohol consumption and regular physical activity were significantly lower among individuals with T2DM (p < 0.05). No significant differences were observed between the two groups in terms of sex, occupational status, educational attainment, marital status, hip circumference, BMI, systolic blood pressure, total cholesterol, history of stroke, or smoking status (p > 0.05). In terms of genetic distribution, the genotype frequencies of CMIP rs2925979 among all participants were as follows: CC, 32.81%; CT, 48.21%; and TT, 18.98%. The genotype distribution conformed to Hardy–Weinberg Equilibrium (p = 0.514), supporting the validity of the genotyping results.
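The Hardy–Weinberg check can be sketched with a one-degree-of-freedom chi-square goodness-of-fit test. The genotype counts below (573, 842, 332) are not reported in the paper; they are back-calculated from the rounded percentages and the total of 1747, so the resulting p-value is only approximate and need not match the reported 0.514 exactly. The function name is hypothetical.

```python
import math


def hwe_chi_square(n_cc: int, n_ct: int, n_tt: int):
    """Chi-square test of Hardy-Weinberg proportions for a biallelic SNP."""
    n = n_cc + n_ct + n_tt
    p_c = (2 * n_cc + n_ct) / (2 * n)   # estimated C allele frequency
    p_t = 1.0 - p_c                     # estimated T allele frequency
    expected = (n * p_c ** 2, 2 * n * p_c * p_t, n * p_t ** 2)
    observed = (n_cc, n_ct, n_tt)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Upper-tail probability of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Counts reconstructed from 32.81%, 48.21%, and 18.98% of 1747 (assumption).
chi2, p_value = hwe_chi_square(573, 842, 332)
print(f"chi2 = {chi2:.3f}, p = {p_value:.3f}")
```

With these reconstructed counts the test gives a p-value well above 0.05, in line with the paper's conclusion of no significant deviation from equilibrium.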
3.2. Association Between Dietary Intake and T2DM
The intake of each food group was categorized into quartiles (Q1–Q4) based on the amount consumed (in grams per day), with the lowest quartile (Q1) serving as the reference group. To evaluate the relationship between dietary intake and the risk of T2DM, three generalized linear models were applied. Model 1 was adjusted for age and sex; Model 2 included additional adjustments for marital status, educational level, and occupation; and Model 3 further accounted for lifestyle and clinical variables, including smoking status, alcohol consumption, regular physical activity, hypertension, BMI, waist circumference, total cholesterol, and history of stroke. As presented in Table 2, fruit intake was inversely associated with the risk of T2DM across all adjusted models. In the fully adjusted Model 3, compared to the reference group (Q1), the odds ratios (ORs) for T2DM were 0.44 (95% CI: 0.33–0.60) for Q2, 0.46 (95% CI: 0.33–0.62) for Q3, and 0.58 (95% CI: 0.43–0.80) for Q4, all of which were statistically significant (p < 0.001).
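A reader can check that the reported intervals are compatible with the stated significance level by recovering an approximate Wald statistic from each odds ratio and its 95% CI. The sketch below assumes the intervals are symmetric on the log scale (as Wald intervals are); the function name is hypothetical, and the inputs are the rounded Model 3 values quoted above.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def wald_from_ci(odds_ratio: float, lower: float, upper: float):
    """Approximate Wald z and two-sided p from an OR and its 95% CI."""
    log_or = math.log(odds_ratio)
    # CI width on the log scale equals 2 * Z_95 * standard error.
    se = (math.log(upper) - math.log(lower)) / (2.0 * Z_95)
    z = log_or / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return z, p


# Model 3 estimates for fruit intake (Q1 as reference), as reported.
reported = {
    "Q2": (0.44, 0.33, 0.60),
    "Q3": (0.46, 0.33, 0.62),
    "Q4": (0.58, 0.43, 0.80),
}
for label, (or_, lo, hi) in reported.items():
    z, p = wald_from_ci(or_, lo, hi)
    print(f"{label}: z = {z:.2f}, p = {p:.2g}")
```

For these rounded inputs, each implied p-value falls below 0.001, which agrees with the significance level stated in the text.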
3.3. Interaction Between Dietary Factors and Genetic Polymorphisms
As shown in Figure 1, the association between fruit intake and T2DM risk differed according to CMIP rs2925979 genotype (TT, CT, and CC). Among individuals with the TT genotype, higher fruit intake was significantly associated with a reduced risk of T2DM. Compared to the lowest quartile (Q1), the adjusted odds ratios (ORs, 95% confidence intervals [CIs]) for Q2, Q3, and Q4 were 0.198 (0.079–0.491), 0.101 (0.040–0.250), and 0.263 (0.100–0.688), respectively (all p < 0.01), indicating a strong inverse association, although the odds ratios did not decrease monotonically across quartiles. A similar protective pattern was observed among individuals with the CT genotype, with ORs (95% CIs) of 0.318 (0.203–0.498), 0.446 (0.281–0.709), and 0.477 (0.302–0.752) for Q2–Q4 versus Q1, respectively (all p < 0.001). In contrast, no significant associations were found among individuals with the CC genotype, with ORs of 0.749, 0.726, and 0.763 for Q2–Q4 (all p > 0.05; 95% CIs included 1.0).
Importantly, we also evaluated the independent association between CMIP rs2925979 and T2DM risk, irrespective of fruit intake. Compared to the CC genotype, carriers of the T allele (CT or TT) did not show a statistically significant association with T2DM in fully adjusted models (results available upon request), suggesting that the T allele alone may not confer increased risk. These findings indicate a potential gene–diet interaction, in which the T allele’s influence on T2DM risk appears to depend on fruit intake levels.
4. Discussion
This study identified a significant inverse association between fruit intake and the risk of T2DM in a rural northern Chinese population. Notably, a significant interaction was observed between fruit intake and the CMIP rs2925979 polymorphism. Specifically, the protective effect of higher fruit consumption was evident among individuals carrying the T allele (TT and CT genotypes). In contrast, no significant association was found among those with the CC genotype. These findings suggest a potential gene–diet interaction contributing to interindividual variability in T2DM susceptibility. The observed protective role of fruit intake is consistent with results from large-scale prospective cohort studies and meta-analyses, which have consistently demonstrated an inverse relationship between fruit consumption and T2DM risk [30,31]. Mechanistically, fruits are rich in dietary fiber, polyphenols, and essential micronutrients, all of which have been shown to enhance insulin sensitivity, reduce oxidative stress, and modulate inflammatory pathways [32,33,34]. Furthermore, the inverse association in our study remained robust even after adjusting for a comprehensive set of sociodemographic, lifestyle, and metabolic confounders, supporting an independent protective effect of fruit intake [13,35,36].
This study makes a novel contribution by identifying a significant interaction between fruit intake and the CMIP rs2925979 genotype in relation to T2DM risk. CMIP (c-Maf-inducing protein) is known to participate in insulin signaling and T-cell activation pathways, and its polymorphisms have been previously associated with insulin resistance and lipid metabolism abnormalities [31,37,38]. Our findings suggest that the stronger inverse association between fruit consumption and T2DM observed among individuals carrying the T allele may reflect differences in gene expression or metabolic responsiveness to dietary bioactive compounds such as flavonoids and dietary fiber. These results are consistent with a growing body of evidence indicating that genetic background can influence metabolic responses to diet, thereby supporting the rationale for precision nutrition strategies [39,40]. Notably, no significant association was found between fruit intake and T2DM risk among individuals with the CC genotype, suggesting that the protective effect of fruit consumption may be diminished by specific genetic factors [16,41,42,43]. These findings underscore the importance of integrating gene–environment interactions into public health nutrition strategies and tailoring dietary recommendations to accommodate genetic variability within populations.
The strengths of this study lie in its use of a family-based cohort design, which effectively reduces population stratification bias, and the comprehensive adjustment for a wide range of potential confounders. Moreover, conducting genotype-stratified analyses enabled the identification of significant gene–diet interactions that might be obscured in analyses of the general population. Despite these strengths, several limitations warrant consideration. First, the cross-sectional nature of the study precludes causal inference. Second, dietary intake was assessed using a food frequency questionnaire (FFQ), which is prone to recall bias and measurement error. Third, the small sample size within the CC genotype subgroup may have limited the statistical power to detect modest associations. Finally, our analysis focused on a single SNP and one food group; thus, future studies should explore genome-wide interactions and incorporate a broader range of dietary exposures to enhance generalizability.
Overall, this study demonstrates a significant gene–diet interaction between fruit intake and the CMIP rs2925979 polymorphism in relation to T2DM risk. These findings underscore the need to consider genetic background in formulating dietary recommendations and contribute valuable evidence toward the development of precision nutrition strategies for diabetes prevention.
5. Conclusions
In summary, this study presents novel evidence of a gene–diet interaction between fruit intake and the CMIP rs2925979 polymorphism in relation to the risk of T2DM in a rural northern Chinese population. The findings highlight the protective effect of fruit consumption and suggest that genetic variations may modify individual susceptibility to T2DM. By emphasizing differential metabolic responses to dietary components based on genetic background, our results strengthen the case for advancing precision nutrition strategies tailored to genetic profiles. These findings provide a scientific basis for more personalized dietary recommendations and contribute to the growing body of research supporting the integration of nutrigenetics into public health strategies aimed at diabetes prevention.
References
- 1. International Diabetes Federation. The IDF Diabetes Atlas, 10th ed.; 2021. Available online: https://diabetesatlas.org/resources/previous-editions/ (accessed on 13 December 2024).
- 2. Hu C., Jia W. Diabetes in China: Epidemiology and Genetic Risk Factors and Their Clinical Utility in Personalized Medication. Diabetes 2017, 67, 3–11. doi:10.2337/dbi17-0013. PMID: 29263166.
- 3. Wang L., Peng W., Zhao Z., Zhang M., Shi Z., Song Z., Zhang X., Li C., Huang Z., Sun X. Prevalence and Treatment of Diabetes in China, 2013–2018. JAMA 2021, 326, 2498–2506. doi:10.1001/jama.2021.22208. PMID: 34962526. PMCID: PMC8715349.
- 4. International Diabetes Federation. The IDF Diabetes Atlas, Diabetes Country Report 2000–2050; 2021. Available online: https://diabetesatlas.org/data-by-location/country/china/?utm_source=chatgpt.com (accessed on 21 May 2025).
- 5. Guo Y., Huang Z., Sang D., Gao Q., Li Q. The Role of Nutrition in the Prevention and Intervention of Type 2 Diabetes. Front. Bioeng. Biotechnol. 2020, 8, 575442. doi:10.3389/fbioe.2020.575442. PMID: 33042976. PMCID: PMC7523408.
- 6. Deng W., Zhao L., Chen C., Ren Z., Jing Y., Qiu J., Liu D. National burden and risk factors of diabetes mellitus in China from 1990 to 2021: Results from the Global Burden of Disease study 2021. J. Diabetes 2024, 16, e70012. doi:10.1111/1753-0407.70012. PMID: 39373380. PMCID: PMC11457207.
- 7. Zhu D., Zhou D., Li N., Han B. Predicting Diabetes and Estimating Its Economic Burden in China Using Autoregressive Integrated Moving Average Model. Int. J. Public Health 2021, 66, 1604449. doi:10.3389/ijph.2021.1604449. PMID: 35126031. PMCID: PMC8810486.
- 8. Li Y., Teng D.I., Shi X., Qin G., Qin Y., Quan H., Shi B., Sun H., Ba J., Chen B. Prevalence of diabetes recorded in mainland China using 2018 diagnostic criteria from the American Diabetes Association: National cross sectional study. BMJ 2020, 369, m997. doi:10.1136/bmj.m997. PMID: 32345662. PMCID: PMC7186854.
