Gut microbiome mediates the associations between lifestyle factors and risk of colorectal high-risk adenoma: results from a population-based cohort study
Kai Song, Jiahui Luo, Yuhan Zhang, Dong Wu, Hongda Chen, Min Dai

TL;DR
This study finds that gut bacteria like Fusobacterium and Tyzzerella 4 help explain how lifestyle choices like smoking and alcohol increase colorectal cancer risk.
Contribution
The study identifies specific gut microbes that mediate lifestyle-related colorectal cancer risk, offering new prevention strategies.
Findings
High BMI, heavy smoking, and alcohol use increase colorectal high-risk adenoma risk.
Fusobacterium and Tyzzerella 4 partially mediate alcohol and smoking-related HRA risk.
Gut microbiota changes offer actionable targets for early cancer prevention.
Abstract
Lifestyle factors exert influence on the risk of colorectal cancer (CRC) and its precursors. However, the interaction between gut microbiota and lifestyle factors concerning colorectal high-risk adenomas (HRAs), as well as the specific microbial taxa implicated, remains underexplored. Here, we aimed to investigate the impact of common lifestyle factors on HRAs and to explore the potential mediating roles of gut microbiota in these effects. A total of 3,827 participants were enrolled from our multicenter CRC screening cohort. Lifestyle factors over the 12 months preceding enrollment were assessed via questionnaires. Fecal samples were collected upon enrollment and analyzed using 16S rRNA sequencing. Multivariate analyses were used to identify lifestyle-related risk factors for HRA, followed by the application of the multivariate association with linear models (MaAsLin2) to explore…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1
Fig 2
Fig 3
Fig 4| Variable | Control ( | HRA ( | |
|---|---|---|---|
| Age, mean (SD) | 60.5 (6.3) | 61.9 (6.2) |
|
| Female/male | 577/676 | 65/207 |
|
| BMI (kg/m2), mean (SD) | 24.1 (2.8) | 24.7 (3.0) |
|
| Education level, | 0.608 | ||
| <High school | 936 (74.7%) | 210 (77.2%) | |
| High school and equivalent | 225 (18.0%) | 42 (15.4%) | |
| Postsecondary graduate | 92 (7.3%) | 20 (7.4%) | |
| Family history of CRC in first-degree relatives, | 142 (11.3%) | 38 (14.0%) | 0.263 |
| History of colorectal polyps, | 36 (2.9%) | 5 (1.8%) | 0.477 |
| Use of NSAIDs, | 0.734 | ||
| No | 1,194 (95.3%) | 257 (94.5%) | |
| Sometimes | 27 (2.2%) | 8 (2.9%) | |
| Regular | 32 (2.6%) | 7 (2.6%) | |
| Number of pack-years smoked, |
| ||
| None | 870 (69.4%) | 148 (60.3%) | |
| <30 | 185 (14.8%) | 47 (17.3%) | |
| >30 | 198 (15.8%) | 77 (28.3%) | |
| Alcohol consumption, |
| ||
| None | 791 (63.1%) | 138 (50.7%) | |
| <2 unit/w | 156 (12.5%) | 26 (9.6%) | |
| 2–4 unit/w | 195 (15.6%) | 60 (22.1%) | |
| >4 unit/w | 111 (8.9%) | 48 (17.6%) | |
| Lifestyle factors | Model 1 | Model 2 | Model 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Coefficient | OR (95% CI) | Coefficient | OR (95% CI) | Coefficient | OR (95% CI) | ||||
| BMI (kg/m2) (continuous variable) | 0.07 | 1.07 |
| 0.06 | 1.07 |
| 0.06 | 1.06 |
|
| BMI category (kg/m2) (categorical variable) | |||||||||
| BMI < 24 | REF | REF | REF | ||||||
| BMI 24–28 | 0.21 | 1.23 | 0.156 | 0.20 | 1.22 | 0.164 | 0.20 | 1.22 | 0.174 |
| BMI ≥ 28 | 0.54 | 1.71 |
| 0.54 | 1.71 |
| 0.46 | 1.59 |
|
| Number of pack-years smoked | |||||||||
| None | REF | REF | REF | ||||||
| Pack-years < 30 | 0.05 | 1.05 | 0.814 | 0.02 | 1.02 | 0.911 | 0.01 | 1.01 | 0.952 |
| Pack-years > 30 | 0.40 | 1.50 |
| 0.38 | 1.47 |
| 0.36 | 1.44 |
|
| Alcohol consumption | |||||||||
| None | REF | REF | REF | ||||||
| Alcohol <2 unit/week | −0.25 | 0.78 | 0.286 | −0.29 | 0.75 | 0.220 | −0.28 | 0.76 | 0.247 |
| Alcohol 2–4 unit/week | 0.27 | 1.31 | 0.150 | 0.24 | 1.28 | 0.188 | 0.28 | 1.32 | 0.139 |
| Alcohol > 4 unit/week | 0.52 | 1.69 |
| 0.52 | 1.68 |
| 0.50 | 1.65 |
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGut microbiota and health · Ginseng Biological Effects and Applications · Diet and metabolism studies
INTRODUCTION
Colorectal cancer (CRC) was the third most frequently diagnosed cancer worldwide in 2022, accounting for 9.6% of all new cancer cases and causing a significant health and socioeconomic burden (1). In China, CRC is the second most common cancer and the fourth leading cause of cancer mortality (2). Prevention and early intervention targeting precancerous lesions is a more cost-effective strategy for reducing the incidence and mortality of CRC (3, 4). Colorectal high-risk adenomas (HRAs) are a category of precancerous lesions defined based on clinical research evidence over the past decade, including advanced adenoma and cases with three or more adenomas (5). Despite resection, these lesions remain at high risk for both recurrence and malignant transformation to CRC, indicating inherent aggressive potential (5, 6). Therefore, understanding the risk factors and underlying mechanisms contributing to HRA development is essential for improving early prevention strategies.
While multiple studies in CRC have shown that unhealthy lifestyle factors, including excessive intake of red meat (7, 8), smoking (9), and alcohol consumption (9), increase disease risk, evidence clarifying their contribution to HRA remains scarce. Moreover, the underlying mechanism through which these external exposures promote neoplastic transformation at the precancerous stage is not fully understood.
The gut microbiota, as a key component of the intestinal microecosystem interacting with the gut epithelial mucosa, is closely linked to colorectal adenoma and CRC development through compositional alterations (10–13). These alterations, including the gut microbiota-derived metabolites that induce chronic inflammation and immune dysfunction, as well as elevated serum lipopolysaccharide levels (14), suggest that the gut microbiota may mediate the impact of adverse lifestyle factors on colorectal neoplastic risk, warranting further investigation. However, while this overall mediating role is plausible, evidence at the taxonomic level remains scarce. Specifically, the gut microbial taxa that interact with particular lifestyle factors to modulate HRA risk have yet to be clearly identified.
To address this gap, we conducted a large, multicenter cohort study recruiting participants undergoing CRC screening across China. Our aim was to identify lifestyle-related risk factors for HRA, evaluate their associations with gut microbiota, and identify certain bacterial taxa mediating these relationships. This study provides a novel perspective on the microbial mechanisms linking lifestyle and early colorectal neoplasia, contributing to establishing microbiota-based or lifestyle-targeted strategies in preventing CRC from its precancerous stage.
MATERIALS AND METHODS
Study participants
This study was based on the multicenter cohort study TARGET-C, initiated in May 2018 (registry number ChiCTR1800015506) (15, 16). In short, the main objective of TARGET-C was to compare the effectiveness of colonoscopy-based fecal immunochemical tests (FIT) and risk-adapted screening strategies for CRC screening in China. Participants were required to undergo colonoscopy and collect stool samples within 24 h prior to bowel preparation. Epidemiological data (e.g., smoking and alcohol consumption) and clinical data (e.g., polyp pathology, size, number) were also collected. The study received approval from the Ethics Committee of the National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences, and Peking Union Medical College (18-013/1615).
For the present study, screening patients aged 50–74 with informed consent were enrolled. The inclusion criteria comprised individuals with no abnormalities detected at baseline colonoscopy (healthy controls, HCs) and patients diagnosed with low-risk adenomas (LRAs), HRA, or CRC. Additionally, stool samples were required for microbiome sequencing. Exclusion criteria, in addition to the ineligible subjects as per the study protocol, included missing data, lack of corresponding stool samples, or poor-quality 16S rRNA sequencing. For more details on patient selection, please refer to Fig. S1. HRA were defined as adenomas ≥ 10 mm, those with tubulovillous or villous histology, adenomas with high-grade dysplasia, or ≥3 adenomas (5), to focus on the relationship between lifestyle factors and high-risk precancerous lesions. LRAs were defined as colorectal adenomas without the above characteristics.
Epidemiological data collection
After participants were enrolled in the cohort, trained researchers were responsible for collecting epidemiological data, including age, gender, body mass index (BMI), family history of CRC in first-degree relatives, history of colorectal polyps, number of pack-years smoked, average alcohol consumption, and use of non-steroidal anti-inflammatory drugs (NSAIDs) over the past 12 months. Average alcohol consumption was expressed in terms of how many units of alcohol were consumed per week. We defined one unit of alcohol (one drink) as 350 mL of beer, 150 mL of wine, or 50 mL of baijiu (a distilled spirit typically consumed in 50 mL servings in China) (17, 18). When consuming two or more types of alcohol, the alcohol consumption is cumulative based on these definitions.
Stool sample collection and 16S rRNA gene amplicon sequencing
Stool samples collected from the patients were frozen at −20°C until DNA extraction. DNA was extracted using the QIAamp Fast DNA Stool Mini Kit. 16S rRNA gene amplicon sequencing was performed. The V4 region of the microbial 16S rRNA gene was amplified and sequenced on the Illumina MiSeq sequencing platform (Illumina, San Diego, CA). To minimize end-reads sequencing errors, all reads were truncated at the 150th base with a median Q score of >20. Noise sequences, chimeras, and singletons were removed, and amplicon sequence variants (ASVs) were inferred from the clean sequencing reads using the DADA2 pipeline integrated into Qiime2 (19). Taxonomy was assigned to each ASV using the classify-sklearn method with the q2-feature-classifier plugin built on the Greengenes database (version 13.8). To quantify taxonomic composition, all sequences were rarefied to a uniform sampling depth of 10,000. Additionally, only taxa present in at least 1% of the samples with an average relative abundance greater than 0.01% were included in the downstream analyses.
Statistical analysis
Continuous variables were presented as mean and standard deviation, while categorical variables were shown as frequency and proportion. Univariate analysis followed by multiple covariate adjustments was conducted to assess independent risk factors for HRA. Statistical analyses were conducted using R software (v.4.3.1). The differences in categorical variables were evaluated using the Chi-squared test, while the Kruskal-Wallis H test and Wilcoxon test were applied to examine differences in continuous variables. The cut-off value of 30 pack-years for smoking history was based on previous studies (20, 21), while the cut-off value for BMI was determined according to the established thresholds for overweight (BMI 24–28 kg/m^2^) and obesity (BMI ≥28 kg/m^2^) in the Chinese population (22).
For the 16S rRNA data, microbial diversity was assessed through α-diversity using the Shannon index, and β-diversity was calculated using the Bray-Curtis distance. These analyses were conducted using the “vegan” package in R. The associations between lifestyle factors and β-diversity dissimilarities were analyzed through permutational multivariate analysis of variance (PERMANOVA, 999 permutations), adjusted for covariates including age, gender, BMI, family history of CRC, the recruitment region of participants and the diagnosis (to adjust for the effects of disease on gut microbiota) (23, 24). To analyze the association between gut microbiota and lifestyle factors, we used the multivariate association with linear models (MaAsLin2) method, implemented via the R package “MaAsLin” (25). We log-transformed the relative abundances of microbial features and standardized lifestyle factors into Z-scores before including them in the MaAsLin2 models. Multiple hypothesis testing was addressed by controlling the false discovery rate (FDR < 0.25) via the Benjamini-Hochberg method, a statistically significant threshold standard in microbiome research (25–28). The MaAsLin2 model was adjusted for age, gender, BMI, family history of CRC, recruitment region, use of NSAIDs, history of colorectal polyp, and disease status. In addition to conducting correlation analysis in our own data set, an external cohort of the Japanese population was used to validate the association between lifestyle factors and gut microbial genera (29) (for details about the external cohort, see supplementary material).
In the differential analysis of gut microbiota, microbial taxa with a P value < 0.05 and a log2 fold-change ≥1.0 were considered significant. Additionally, taxa identified by the linear discriminant analysis effect size (LEfSe) method were included in downstream analyses.
Causal mediation analysis was conducted using the “mediation” R package, to test the hypothesis that the observed effect of lifestyle factors on the risk of HRA is mediated by its effect on lifestyle factors-associated microbial genus. The direct and indirect effects were calculated using a bootstrap with 1,000 simulations. Coefficient estimates, 95% confidence intervals, and the P values were reported.
Sensitivity analyses were performed on different clinical endpoint (advanced adenomas). Subgroup analyses were performed among participants of different genders.
RESULTS
Participant characteristics
A total of 1,575 participants with qualified sequencing data were included (Fig. S1, recruitment regions are shown in Fig. 1A). Following baseline colonoscopy and exclusion of participants with missing lifestyle data, the cohort comprised: CRC (24 patients), HRA (272 patients), LRA (465 patients), and healthy controls (HC, 788 patients). In univariate analysis, comparing the HRA group with the control group (which includes LRA and HC), the HRA group had a higher mean age (61.9 vs 60.5), a larger proportion of males (76.1% vs 54.0%), and a higher mean BMI (24.7 vs 24.1) (Table 1). In terms of lifestyle factors, the HRA group exhibited a higher number of pack-years smoked (P < 0.001) and greater alcohol consumption (P < 0.001).
The impact of lifestyle risk factors on gut microbiome. (A) Schematic diagram showing the geographic locations of each recruitment region. (B) Effect of BMI categories on Shannon index of gut microbiota. (C) Effect of BMI categories on richness of gut microbiota. (D) Effect of alcohol consumption on shannon index of gut microbiota. (E) Effect of alcohol consumption on richness of gut microbiota. (F) Effect of pack-years of smoking on Shannon index of gut microbiota. (G) Effect of pack-years of smoking on richness of gut microbiota. (H) Effect of BMI on β-diversity of gut microbiota. The principal component analysis (PCoA) plots were constructed based on the Bray-Curtis distance matrix. The P value on the figure represents the result of the covariate-adjusted permutational multivariate analysis of variance (PERMANOVA). (I) Effect of pack-years of smoking on β-diversity of gut microbiota. (J) Effect of average alcohol consumption on β-diversity of gut microbiota.
Association between lifestyle factors and the risk of high-risk adenoma
After adjusting for multiple covariates (age, gender, BMI, education level, recruitment region of participants, family history of CRC in first-degree relatives, history of colorectal polyps), the independent risk factors for HRA were limited to number of pack-years smoked >30 (adjusted OR: 1.44, 95% CI: 1.00–2.06, P = 0.048), alcohol consumption >4 units/week (adjusted OR: 1.65, 95% CI: 1.09–2.48, P = 0.017), and BMI (adjusted OR: 1.06, 95% CI: 1.01–1.11, P = 0.016) (Table 2). Given that lifestyle factors can influence the gut microbiota, we further sought to explore how these lifestyle risk factors for HRA affect the gut microbiome.
Association between lifestyle factors and gut microbiota
We first explored the effects of the aforementioned lifestyle risk factors on gut microbiota diversity. Significant differences in the Shannon index were observed among patients with different BMI categories, while no significant differences were found with respect to cigarette consumption, alcohol consumption, or richness (Fig. 1B through G). However, after adjusted for covariates (age, gender, recruitment region of participants, BMI, family history of CRC in first-degree relatives, history of colorectal polyp, use of NSAIDs, disease status), the β diversity based on the Bray-Curtis distance matrix among different BMI categories (PERMANOVA adjusted P = 0.001), cigarette consumption (PERMANOVA adjusted P = 0.001), alcohol consumption (PERMANOVA adjusted P = 0.001) showed significant differences (Fig. 1H through J). Therefore, lifestyle factors can influence the composition of the gut microbiota.
Further analysis using the MaAsLin2 method revealed that several gut genera were associated with lifestyle risk factors (Fig. 2A). After adjusted for covariates (age, gender, recruitment region of participants, BMI, family history of CRC in first-degree relatives, history of colorectal polyp, use of NSAIDs, disease status) and using a cutoff of FDR < 0.25, the genera associated with lifestyle risk factors are listed in Table S1. Specifically, genera positively correlated with alcohol consumption included Fusobacterium (coefficient = 0.285, FDR = 0.080), Negativibacillus (coefficient = 0.204, FDR = 0.017), and Prevotella 9 (coefficient = 0.533, FDR = 0.060), while Bifidobacterium (coefficient = −0.297, FDR = 0.080) negatively correlated with alcohol consumption. Genera positively correlated with number of pack-years smoked included Actinomyces (coefficient = 0.266, FDR < 0.001).
*Association between gut microbiota and lifestyle risk factors. (A) Heatmap of the association between gut genera and lifestyle risk factors based on the coefficients from MaAsLin2 analysis in our cohort. Genera were classified according to the phyla on the right side of the figure (false discovery rate [FDR] <0.25). (B) Heatmap of the association between gut genera and lifestyle risk factors based on the coefficients from MaAsLin2 analysis in external Japanese cohort (false discovery rate <0.25). Genera were classified according to the phyla on the right side of the figure. The asterisks in the plot indicate statistically significant differences. ***, FDR < 0.05; **, FDR < 0.01; , FDR < 0.001. BMI, body mass index; NS, not significant; PCoA, principal coordinates analysis; PERMANOVA, permutational multivariate analysis of variance.
We further conducted MaAsLin2 analysis, adjusted for age, gender, BMI, alcohol consumption, Brinkman index of smoking, and disease status (CRC or control), in an external cohort consisting of 299 CRC cases and 299 controls after 1:1 nearest matching by age, gender, and BMI (characteristics of the study population are shown in Table S2) (29). The associations between gut microbiota and lifestyle factors were validated, and we observed a consistency in the relationships between lifestyle factors and gut genera (e.g., Bifidobacterium and alcohol consumption, Hungatella and BMI, Allisonella and BMI) (Fig. 2B; Table S3). Notably, the genus Fusobacterium was also positively correlated with alcohol consumption in this external cohort (coefficient = 0.638, P value = 0.018, FDR = 0.200), similar with the findings in our cohort.
Interaction between gut microbiota and lifestyle factors promotes high-risk adenoma
Next, we aimed to explore whether lifestyle-related microbiota play a role in increasing the risk of HRA. We began by analyzing the differential microbiota between HRA and control groups. Differential analysis identified three microbial genera that were significantly upregulated and six that were significantly downregulated in HRA patients compared to healthy controls, with a P value < 0.05 and a log_2_ fold-change ≥1.0 (Fig. 3A). Notably, the relative abundance of genus Fusobacterium was upregulated in the HRA group.
Differential microbiota in the high-risk adenoma group and the correlation with adenoma characteristics. (A) Volcano plot of differences in abundance of genera between the high-risk adenoma group and the control group. Red points represent genera with increased abundance in the high-risk adenoma group, while blue points represent genera with decreased abundance. (B) LefSe analysis of the differences in genera between the high-risk adenoma group and the control group. The red bars represent genera with significantly increased abundance in the high-risk adenoma group, and the green bars represent genera with significantly increased abundance in the control group. (C) Heatmap of Spearman correlation between significantly different genera related to lifestyle factors and adenoma features.
LEfSe analysis identified 15 genera, which were differentially abundant in the two groups (Fig. 3B). In the HRA group, there was an enrichment of the phyla Fusobacteria, Firmicutes, and Actinobacteria, including the genera Fusobacterium, Tyzzerella 4, Phascolarctobacterium, Gemella, Ruminococcaceae UCG-004, Actinomyces, and Terrisporobacter. In contrast, the genera Oscillibacter, Lachnospiraceae UCG-004, Lachnospira, Parasutterella, Dialister, Faecalibacterium, Prevotellaceae NK3B31 group, and Ruminococcaceae UCG-003 were more abundant in the control group.
Integrating findings from both MaAsLin 2 and LEfSe analyses revealed significant overlap between genera differentially abundant in HRA vs controls and those associated with lifestyle factors. These overlapping genera include Tyzzerella 4, Fusobacterium, Actinomyces, and Oscillibacter. In Spearman correlation analysis, we found significant correlations between lifestyle-associated differential genera and adenoma characteristics, with the relative abundance of Fusobacterium positively correlated with the number of adenomas (Fig. 3C).
Therefore, lifestyle-associated differential genera appear to influence the development of HRA, potentially by increasing the number and size of adenomas, thereby contributing to the malignancy of colorectal polyps.
Mediating role of microbial genera in the increased risk of high-risk adenoma due to alcohol and cigarette consumption
Since our analysis suggested significant associations between lifestyle factors and both the risk of HRA and the relative abundance of microbial genera (Fusobacterium, Tyzzerella 4, Actinomyces, Oscillibacter), we postulated that the lifestyle factors could influence the risk of HRA via impact on the relative abundance of lifestyle factors-associated genus. We, therefore, conducted causal mediation analyses to test this, adjusting for covariates including BMI, age, gender, family history of CRC in first-degree relatives, polyp history, recruitment region of participants, and NSAID use.
Causal mediation analyses provided evidence that the effect of alcohol consumption at the risk of HRA was partially mediated by genus Fusobacterium (coefficient estimates in average causal mediation effect [ACME] = 0.0072, 95% CI: 0.0005–0.0200, P = 0.018; coefficient estimates in average direct effect [ADE] = 0.0689, 95% CI: 0.0051–0.1400, P = 0.026; proportion = 9.4%) (Fig. 4A). And the effect of cigarette consumption at the risk of HRA was completely mediated by genus Tyzzerella 4 (coefficient estimates in ACME = 0.0080, 95% CI: 0.0004–0.0200, P = 0.024; coefficient estimates in ADE = 0.0481, 95% CI: −0.0064 to 0.1100, P = 0.082; proportion = 14.2%) (Fig. 4B).
The mediating role of genera in the association between lifestyle risk factors and colorectal neoplasia risk. (A) The mediating role of genus Fusobacterium in the association between alcohol consumption and the risk of high-risk adenoma. The middle percentage represents the mediation proportion. The β on the line segment represents the coefficient of the effect between the two variables. (B) The mediating role of genus Tyzzerella 4 in the association between cigarette consumption and the risk of. (C) The mediating role of genus Fusobacterium in the association between alcohol consumption and the risk of advanced adenoma. (D) The mediating role of genus Tyzzerella 4 in the association between cigarette consumption and the risk of advanced adenoma. (E) The mediating role of genus Fusobacterium in the association between alcohol consumption and the risk of high-risk adenoma in male population. (F) The mediating role of genus Tyzzerella 4 in the association between cigarette consumption and the risk of high-risk adenoma in male population. (G) The mediating role of genus Fusobacterium in the association between alcohol consumption and the risk of high-risk adenoma in female population. (H) The mediating role of genus Tyzzerella 4 in the association between cigarette consumption and the risk of high-risk adenoma in female population.
Sensitivity analyses distinguished between different outcomes, where the mediating effect of genus Fusobacterium and Tyzzerella 4 remained significant when considering advanced adenoma as outcome variable, underscoring the role of genus Fusobacterium and Tyzzerella 4 in mediating the impact of alcohol consumption and cigarette consumption on advanced adenomas, respectively (Fig. 4C and D).
In the subgroup analysis, the mediating effects remained in the male participants; however, when only female participants were included, we found that the mediating effect of genera Fusobacterium and Tyzzerella 4 was no longer present (Fig. 4E through H). This suggests that the mediating role of these two genera in the effect of lifestyle factors on the increased risk of HRA may occur only in the male population.
DISCUSSION
In our large-scale, multicenter CRC screening cohort, we identified key lifestyle factors—including elevated BMI, heavy smoking (>30 pack-years), and alcohol consumption (>4 units/week)—serving as independent risk factors for HRA. These modifiable exposures significantly perturbed gut microbiota profiles. Specifically, Fusobacterium mediated alcohol-associated HRA risk, and Tyzzerella 4 mediated smoking-related risk, especially in male subjects. These genera were further associated with malignant features of adenomas (e.g., number and size), suggesting that the interaction between gut microbiota and lifestyle factors may influence the development of colorectal precancerous lesions.
Previous studies have established that adverse lifestyle behaviors contribute to CRC risk (30–32), but the mechanisms through which they promote carcinogenesis—especially at the precancerous stage—remain incompletely understood. Our findings help address this gap by revealing the potential role of microbiota as mediator through which lifestyle factors contribute to early colorectal neoplasia, providing mechanistic context for epidemiological observations and highlighting new opportunities for microbiota-targeted or lifestyle-based prevention strategies.
Lifestyle factors were evaluated via patient recall of the preceding year, capturing habitual patterns. These prolonged lifestyle habits had a significant impact on gut microbiota, most notably through the alcohol-associated enrichment of Fusobacterium—a pattern validated in our cohort and replicated in a Japanese population (29). Among this genus, the species Fusobacterium nucleatum and Fusobacterium mortiferum have been reported to possess tumor-promoting effects in the colorectum, especially the former (33, 34). Previous studies have shown that Fusobacterium mortiferum is significantly elevated in the stool of patients with colorectal adenomas compared to control group (35) and is enriched in adenoma patients with APC mutations compared to those without (36). Given our demonstration of Fusobacterium as a mediator between lifestyle factors and the HRAs, future microbiota-based prevention strategies should prioritize targeting this genus. However, due to the limitations of 16S rRNA sequencing, we were unable to delineate microbial mediation at the species level. Thus, the specific species mediating alcohol-associated HRA risk remains uncharacterized and warrants future investigation.
In addition, the microbial mediation effects observed in our study appeared to be sex-specific, as significant mediation by these genera persisted exclusively in male subgroup. Accumulated evidence reveals that gut microbiota composition and function vary significantly by sex. For instance, males were shown to have lower relative abundances of Firmicutes and Actinobacteria, as well as a decreased Firmicutes/Bacteroidetes ratio compared to females (37). Functional differences have also been noted in an animal study by Wang et al., which demonstrated that fecal microbiota transplantation from female donors reduced systemic pro-inflammatory cytokine levels following ischemic stroke (38). Critically, sex-dependent abundance variations were observed in Fusobacterium, the genus of particular interest in our study, revealing potential genus-level sexual dimorphism. Similar to our findings, a fecal microbiome analysis of healthy Japanese individuals reported higher levels of Fusobacterium and Prevotella in males, while females had higher abundances of Bifidobacterium and Ruminococcus (39). Another study conducted in Chinese population found Fusobacterium to be more abundant in obese males, whereas obese females showed much notable enrichment in Bifidobacterium, Coprococcus, and Dialister (40). These consistent sexual dysmorphism suggest distinct regulatory mechanism including sexual maturation and sex hormones (41, 42), while differences in immune function may also play a role (43). Consequently, elucidating the precise biological mechanisms underlying these sex-dependent microbial patterns demands further investigation.
The strength of this study lies in our large, multicenter cohort, providing strong representation across China. Besides, we focused on the risk of HRA, a critical window for intercepting malignant transformation to CRC, revealing how lifestyle-microbiota interactions drive precancerous lesions, thereby offering valuable insights for blocking cancer progression on early stage. Moreover, the external validation and sensitivity analysis provide additional support for our conclusions. However, our study also has limitations. Our cohort was restricted to screening-age populations, compromising the generalizability across all age groups. The questionnaire-derived lifestyle data collected here, though retrospective over 12 months, remain vulnerable to recall bias. Moreover, reliance on a single assessment fails to capture exposure fluctuations, introducing potential bias. Furthermore, gut microbiota profiling was performed using 16S rRNA gene amplicon sequencing, with limited depth to fully explore species-level gut microbiota. Further exploration of the mechanistic underpinnings of the observed mediation effects is needed in future research. Additionally, although validated using a large-scale external data set, population differences between cohorts may introduce selection bias. And, both cohorts’ East Asian predominance and limited lifestyle variables in public data necessitate future validation in ethnically diverse populations.
In conclusion, we integrated lifestyle factors and gut microbiome in evaluating the risk of HRAs in Chinese populations, identified that BMI, cigarette consumption, and alcohol consumption significantly increase the risk of HRA. Lifestyle factors also have a profound impact on gut microbiota, and the genus Fusobacterium and Tyzzerella 4 plays a significant mediating role in the association between lifestyle risk factors and the increased risk for HRA. This study offers novel insight into the tripartite relationship among lifestyle, microbiota, and colorectal neoplasm risk, establishing a mechanistic basis for epidemiological associations and identifying potential preventive strategies through microbiota modulation or lifestyle interventions.
Supplementary Material
Reviewer comments
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, Jemal A. 2024. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 74:229–263. doi:10.3322/caac.2183438572751 · doi ↗ · pubmed ↗
- 2Han B, Zheng R, Zeng H, Wang S, Sun K, Chen R, Li L, Wei W, He J. 2024. Cancer incidence and mortality in China, 2022. JNCC 4:47–53. doi:10.1016/j.jncc.2024.01.00639036382 PMC 11256708 · doi ↗ · pubmed ↗
- 3Force U, Davidson KW, Barry MJ, Mangione CM, Cabana M, Caughey AB, Davis EM, Donahue KE, Doubeni CA, Krist AH, Kubik M, Li L, Ogedegbe G, Owens DK, Pbert L, Silverstein M, Stevermer J, Tseng CW, Wong JB. 2021. Screening for colorectal cancer: US preventive services task force recommendation statement. JAMA 325:1965–1977.34003218 10.1001/jama.2021.6238 · doi ↗ · pubmed ↗
- 4Kanth P, Inadomi JM. 2021. Screening and prevention of colorectal cancer. BMJ 374:n 1855. doi:10.1136/bmj.n 185534526356 · doi ↗ · pubmed ↗
- 5Gupta S, Lieberman D, Anderson JC, Burke CA, Dominitz JA, Kaltenbach T, Robertson DJ, Shaukat A, Syngal S, Rex DK. 2020. Recommendations for follow-up after colonoscopy and polypectomy: a consensus update by the US multi-society task force on colorectal cancer. Gastroenterology 158:1131–1153. doi:10.1053/j.gastro.2019.10.02632044092 PMC 7672705 · doi ↗ · pubmed ↗
- 6Song K, Zhang S, Luo J, Lu B, Li J, Zhou Y, Chen Y, Zhang Y, Yang A, Chen H, Wu D. 2025. Risk of metachronous advanced neoplasia after colonoscopy in patients with different index findings. J of Gastro and Hepatol 40:1942–1953. doi:10.1111/jgh.1702640437752 · doi ↗ · pubmed ↗
- 7Song M, Garrett WS, Chan AT. 2015. Nutrients, foods, and colorectal cancer prevention. Gastroenterology 148:1244–1260. doi:10.1053/j.gastro.2014.12.03525575572 PMC 4409470 · doi ↗ · pubmed ↗
- 8Mosley D, Su T, Murff HJ, Smalley WE, Ness RM, Zheng W, Shrubsole MJ. 2020. Meat intake, meat cooking methods, and meat-derived mutagen exposure and risk of sessile serrated lesions. Am J Clin Nutr 111:1244–1251. doi:10.1093/ajcn/nqaa 03032077920 PMC 7266682 · doi ↗ · pubmed ↗
