Investigating microbiota differences across chronic pancreatitis, influenced by lifestyle and genetic determinants

Abubaker Y.M Ahmed; Azita Rajai; Catherine Fullwood; Damian W Rivett; John McLaughlin; Christopher van der Gast; Ryan Marsh

PMC · DOI:10.1186/s12876-025-04327-7·October 21, 2025

Investigating microbiota differences across chronic pancreatitis, influenced by lifestyle and genetic determinants

Abubaker Y.M Ahmed, Azita Rajai, Catherine Fullwood, Damian W Rivett, John McLaughlin, Christopher van der Gast, Ryan Marsh

PDF

Open Access

TL;DR

This study explores how gut microbiota differs in people with chronic pancreatitis, particularly between alcohol-induced and CFTR-related subtypes, and how these differences may relate to lifestyle and genetics.

Contribution

The study identifies distinct microbiota profiles in chronic pancreatitis subtypes and links these differences to potential functional consequences.

Findings

01

Microbiota composition differs significantly between alcohol-induced CP and CFTR-related CP subtypes.

02

Control samples clustered more closely in microbiota profiles compared to CP samples.

03

Key genera like Bifidobacterium and Faecalibacterium were reduced in CP, suggesting physiological implications.

Abstract

Chronic pancreatitis (CP) is a multifaceted and irreversible inflammatory condition that impacts both endocrine and exocrine pancreatic functions. Whilst there is evidence of a general intestinal microbial dysbiosis, differences across CP resulting from diverse triggers remains poorly elucidated. In this study, we investigated how different CP triggers, namely alcohol-induced CP (AIP) and CFTR-related CP (CFRP), might undermine differences across microbiota structure and predicted function. Faecal samples from 25 CP participants were collected, alongside 9 healthy controls for microbiota analyses. Both full and short (V4) 16S rRNA sequencing was performed, including predicted functionality using PICRUSt2. Clinical metadata was integrated to investigate relationships with microbiota structure and predicted functionality with ordination-based approaches. Microbiota diversity patterns…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases1

chronic pancreatitis

Figures5

Click any figure to enlarge with its caption.

Distribution and abundance of bacterial taxa across different sample groups. A Healthy control (FL-16S). B Alcohol-induced pancreatitis (FL-16S). C CFTR-related pancreatitis (FL-16S). D Healthy control (V4-16S). E Alcohol-induced pancreatitis (V4-16S). F CFTR-related pancreatitis (V4-16S). Given is the percentage number of patient stool samples each bacterial taxon was observed to be distributed across, plotted against the mean percentage abundance across those samples. Core taxa are defined as those that fall within the upper quartile of distribution (orange circles), and satellite taxa (grey

Microbiome diversity and similarity compared across healthy controls, alcohol-induced pancreatitis, and CFTR-related pancreatitis sample groups. Shown is the data for both shorter amplicon (V4) and full-length 16S rRNA sequencing data. Whole microbiota (black plots) and partitioned data into core (orange plots) and satellite taxa (grey plots) are given. A Differences in Fisher’s alpha index of diversity across groups. Black circles indicate individual patient data. Error bars represent samples that fall within the range, i.e. 1.5 times the inter-quartile range (IQR). Labelled *P* values descri

Radial bar plot depicting mean relative abundance across taxa highlighted as top drivers of dissimilarity across groups from SIMPER analysis (Table S7). Each concentric ring represents an increment of 1% mean relative abundance, with the outermost ring corresponding to 10%. Average contribution to microbial dissimilarity for a given taxa across the three groups is given in parenthesis below each respective taxon. All taxa shown above were designated as core based on significant distribution-abundance relationships. Taxa are organised in clockwise fashion based on mean relative abundance across

Principal Coordinates Analysis (PCoA) plot depicting compositional differences in predicted functional pathways across sample groups. Pathway abundances were derived from full-length 16S rRNA amplicon data analysed with PiCRUSt2. KEGG orthologues were aggregated prior to annotation to Level 3 hierarchical pathways. Relative abundances of pathways per sample were calculated, and Bray-Curtis dissimilarity was used as the distance metric for PCoA. ANOSIM summary statistics can be found in Table S9.

Differences in predicted metagenome function between study groups. CLR transformed compositional data was plotted to visualise differences among groups across KEGG level 3 hierarchical pathways accounting for > 1% total predicted functionality. A Core taxa. B Satellite taxa. Asterisks (*) denote significant differences across group means following ANOVA testing (*P <* 0.05). Associated summary statistics and full-length pathway names can be found in Tables S12-13.

Keywords

MicrobiomeChronic pancreatitis16S rRNA sequencingDysbiosisAlcoholGeneticCFTR

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPancreatitis Pathology and Treatment · Liver Disease Diagnosis and Treatment · Gut microbiota and health

Full text

Background

Chronic pancreatitis (CP) is a multifaceted and irreversible inflammatory condition that impacts both the endocrine and exocrine functions of the pancreas [1]. A common challenge in gastroenterology, its protracted course is fraught with a gamut of long-term complications [2]. Diverse triggers for pancreatitis have been documented in the scientific literature, including alcohol consumption, gallstone disease, genetic mutations, metabolic disorders, iatrogenic factors, structural changes within the pancreas, infections, and environmental influences [3–6].

Small percentages of patients at risk of developing the disease will do so, alluding to other factors contributing to the development of pancreatitis and the clinical course of disease [7]. Changes within the intestinal microbiota have been observed in patients with pancreatitis [8–11], however the relationship between this dysbiosis and pancreatitis is yet to be fully established at the level of causation or correlation. Furthermore, studies of gut microbiota outcomes comparing the various triggers of CP pancreatitis are scarce. In tandem with structural changes to the site of the pancreas itself, it is plausible that intestinal physiological changes associated with different CP subtypes could infer differential impacts upon intestinal microbiota composition and function. This includes the consumption of alcohol itself [12, 13], alongside changes to local intestinal milieu physiology related to particular genetic mutations relevant to CP, such as within the CFTR gene [14]. These differences are likely compounded by additional pancreatitis-related treatment interventions, such as antibiotic administration [11].

The majority of research within the CP intestinal microbiome has thus far utilised short-read amplicon sequencing across hypervariable regions of the 16S rRNA gene, due to its appeal as a cheaper high-throughput option to profile gut microbial communities [8–11, 15–18]. More recently, full-length 16S rRNA sequencing approaches, which offer advantageous taxonomic resolution capabilities [19], have become more economically viable for such studies as sample multiplexing capabilities continue to increase. This is particularly appealing when trying to detect potential relationships between specific bacterial species and host clinical outcomes across larger study cohorts.

In this pilot study, we employed such full-length (FL) 16S rRNA sequencing for the first time, alongside shorter amplicon (V4) sequencing, to profile microbial communities both comprehensively and with respect to current literature. Specifically, we investigated how various CP triggers, dietary (alcohol consumption) or genetic (CFTR mutation) in nature, might impact microbiota structure and predicted function. We further compared this data to that obtained from healthy controls for added insights. Clinical metadata was integrated to further investigate any relationships with microbiota structure and predicted function.

Methods

Study participants and design

CP patients from Manchester NHS Foundation Trust and healthy control volunteers were recruited between 2020 and 2022. Clinical diagnosis of CP was confirmed from hospital records and eligible participants were subsequently contacted for potential inclusion into the study. This included participants with alcohol-induced chronic pancreatitis (AIP), alongside participants genetically identified as heterozygous carriers of mutant CFTR, namely CFTR*-*related pancreatitis (CFRP). Healthy controls from the surrounding Manchester area were recruited. Overall, the study comprised 25 participants with CP (AIP; n = 11, CFRP; n = 14) and healthy controls (HC; n = 9). To account for any intra-participant temporal differences, participants were asked to donate three stool samples over a 7-day period using a home postal kit. Upon receipt, samples were stored at −80 °C at the Manchester University NHS Foundation Trust (MFT) dedicated Biobank. Basic participant clinical data was available for all groups and is summarised in Table 1. Exclusion criteria, alongside detailed clinical data for the CP groups (Table S1, is available in the Supplementary Methods and Results which are found within Additional File 1).Table 1. Study group characteristicsClinical variableHC (n = 9)AIP (n = 11)CFRP (n = 14)Age, median [IQR]40 [26–45]62 [56–66]59 [48-66.8]Body mass index, median [IQR]NC26.6 [22.5–30.4]26.8 [24.3–27.7]Cholecystectomy surgery (% Yes)0.018.221.4Diabetes mellitus (% Yes)0.045.535.7Pancreatic complications (% Yes)0.055.514.3Pancreatic insufficiency (PI) (% Yes)0.081.864.3Sex (% Male)55.681.885.7Vitamin D (nmol^−1^), median [IQR]NC73.0 [43–76]63.8 [77–105]NC not collected for respective subjects

PMA treatment prior to DNA extraction

1 mg propidium monoazide (PMA) (Biotium, CA, USA) was hydrated in 98 µL 20% dimethyl sulfoxide (DMSO) to give a working stock concentration of 20 mM. 300 mg of thawed stool was homogenised in 1.5 mL PBS, and then centrifuged at 3200 x g, for 5 min. The pellet was then resuspended in 0.5 mL PBS prior to subsequent PMA treatment. 1.25 µL of PMA (20 mM) was added to give a final concentration of 50 µM. Following the addition of PMA to samples in opaque Eppendorf tubes, PMA was mixed by vortexing for 10 s, followed by incubation for 15 min at room temperature (~ 20 °C). This step was repeated before the transfer of samples to clear 1.5 mL Eppendorf tubes and placement within a LED lightbox. Treatment occurred for 15 min to allow PMA intercalation into DNA from compromised bacterial cells. Samples were then centrifuged at 10,000 x g for 5 min. The supernatant was discarded, and the cellular pellet was resuspended in 200 µL PBS. Treatment was performed to remove unviable bacteria prior to DNA extraction and subsequent sequencing, as previously described [20].

DNA extraction process

Cellular pellets resuspended in PBS were loaded into the ZYMO Quick-DNA Fecal/Soil Microbe Miniprep Kit (Cambridge Bioscience, Cambridge, UK) as per the manufacturer’s instructions. Dual mechanical-chemical sample disruption was performed using the FastPrep-24 5G instrument (MP Biomedicals, California, USA) at a speed of 6.0 m/s for 40 s. Following extraction, DNA was quantity was measured using the Qubit dsDNA HS Assay Kit (high sensitivity, 0.2 to 100 ng) on the Qubit 3.0 fluorometer (Life Technologies), and purity assessed using the NanoDrop™ One Spectrophotometer (Thermo Fisher) as per the manufacturer’s protocols.

16S rRNA sequencing

A dual approach was undertaken to characterise the microbiota across samples. Initially, sequencing libraries targeting the V4 region of the 16S rRNA gene were prepared as per the Schloss protocol [21]. Libraries were sequenced on the MiSeq using V2 2 × 250 bp chemistry. Additionally, full-length 16S rRNA gene libraries were constructed utilising the Kinnex 16S rRNA kit. These libraries were sequenced on the PacBio Revio system. For both processes, PCR & DNA extraction negative controls were implemented, alongside the use of mock community positive controls, which included a Gut Microbiome Standard (ZYMO RESEARCH™). Sequencing was performed at the NUOmics DNA sequencing facility (Northumbria University).

Sequence processing and bioinformatic analyses

Sequences were analysed using DADA2, as to demultiplex and remove primer sequences, validate the quality profiles of forward and reverse reads and subsequently trim, infer sequence variants, merge denoised paired-reads, remove chimeras, and finally assign taxonomy via Naive Bayesian Classifier implementation [22]. Flexible assignment was implemented using a priority assignment approach: GTDB r207 [23], followed by Silva v138 [24], then lastly RefSeq + RDP [25]. Remaining unidentifiable ASVs were run through BLAST [26] (https://blast.ncbi.nlm.nih.gov/Blast.cgi) and matched appropriately where possible, based on adequate query coverage. Taxa with single reads were removed and excluded from subsequent statistical analysis. Amplicon sequence variants (ASVs) from the same bacterial taxon were collapsed to form a single operational taxonomic unit (OTU) for a given taxon when applicable to the species level. The R package decontam was used to remove any potential sources of contamination across samples [27], utilising the prevalence-based contamination identification approach with a threshold classification of P = 0.1. To compare the predicted functional and metabolic profiles between groups, full-length 16S rRNA ASVs were analysed using PiCRUSt2 to determine Kyoto Encyclopaedia of Genes and Genomes (KEGG) orthologs (KO) and Enzyme Commission (EC) numbers [28]. To provide a more comprehensive understanding of microbial functional potential, identified KOs were mapped to KEGG level 3 hierarchical pathways. Additionally, identified EC numbers were linked to predicted functions. Results from both approaches were aggregated across samples for group-wise comparisons. More detailed information on these processes is available in the Supplementary Methods and Results (Additional File 1). The dataset supporting the conclusions of this article are available within the European Nucleotide Archive repository, identifiable using accession number PRJEB78535.

Statistical analysis

Linear regression analysis, including calculated coefficients of determination (R^2^), degrees of freedom (df), F-statistic (F) and significance values (P), were utilised for microbial partitioning into common core and rarer satellite groups, and were calculated using XLSTAT v2021.1.1 (Addinsoft, Paris, France). Common core taxa were designated at the ≥ 75% prevalence threshold, based on goodness-of-fit to the log-normal distribution utilising chi-squared testing, in accordance with wider ecological modelling approaches [29, 30]. Fisher’s alpha index of diversity and the Bray-Curtis index of similarity were calculated using PAST v4.03 [31]. Rarefaction (rrarefy) and Shannon diversity (diversity) calculations were performed using the vegan package (2.6.4) in R (v4.3.1). Tests for significant differences in microbiota diversity and taxonomy abundances were performed using Kruskal-Wallis in XLSTAT. Analysis of similarities (ANOSIM) with Bonferroni correction was used to test for significant differences in microbiota composition and functional profile and was performed in PAST. Similarity of percentages (SIMPER) analysis, to determine which constituents drove compositional differences between groups, was also performed in PAST. Ordination approaches, including stepwise selection for both canonical correspondence (CCA) and redundancy analysis (RDA), was performed in CANOCO v5 [32]. Compositional data for predicted metagenome function across samples was centred log-ratio (CLR) transformed prior to heatmap generation for visualisation of sample differences across groups with Kruskal-Wallis testing implemented for such comparisons, further supplemented with FDR adjustment (Benjamini–Hochberg). Pairwise group comparisons were performed with Mann-Whitney U testing with Bonferroni correction applied. Statistical significance for all tests was deemed at the P ≤ 0.05 level. Subsequent figures and plots were generated using PAST (v4.03) and ggplot2 in R.

Results

Following sequence processing, FL and V4 amplicon reads of the 16S yielded similar raw counts (see Supplementary Results). Initially, we sought to partition both datasets from the whole microbiota, into the more common (≥ 75% prevalence) core taxa and the rarer, infrequent (< 75% prevalence) satellite taxa based on distribution-abundance relationships across the three study groups (Fig. 1). As such, these partitions were utilised within subsequent analyses to understand differences between microbial subcommunities and relationships with clinical outcomes across participants. Similar trends were observed across both datasets, in which the core taxa constituted 74.59% and 73.58% of the total abundance within the healthy control samples across FL and V4 datasets respectively. The satellite taxa across both AIP and CFRP were more prominent, featuring a combined relative abundance of 55.4% and 64.5% across the FL-16S, and 43.32% and 59.52% across the V4 16S rRNA datasets respectively.

Fig. 1. Distribution and abundance of bacterial taxa across different sample groups. A Healthy control (FL-16S). B Alcohol-induced pancreatitis (FL-16S). C CFTR-related pancreatitis (FL-16S). D Healthy control (V4-16S). E Alcohol-induced pancreatitis (V4-16S). F CFTR-related pancreatitis (V4-16S). Given is the percentage number of patient stool samples each bacterial taxon was observed to be distributed across, plotted against the mean percentage abundance across those samples. Core taxa are defined as those that fall within the upper quartile of distribution (orange circles), and satellite taxa (grey circles) defined as those that do not, separated by the vertical line at 75% distribution and labelled respectively. Distribution-abundance relationship regression statistics: (A) R^2^ = 0.20, F_1,1175_ = 295.1, (B) R^2^ = 0.32, F_1,1570_ = 724.3, (C) R^2^ = 0.34, F_1,1191_ = 626.2, (D) R^2^ = 0.22, F_1,611_ = 163.30, P < 0.0001; (E) R^2^ = 0.25, F_1,750_ = 242.69, P < 0.0001; (F) R^2^ = 0.30, F_1,729_ = 301.51, P < 0.0001. Goodness-of-fit statistics for the core taxa against the log-normal distribution can be found in Table S2

Comparing microbial diversity across the three study groups highlighted some significant differences (Fig. 2, Table S3), which varied slightly based on the sequencing approach investigated. For the FL 16S rRNA analysis, there were no overall differences between groups in general (Fig. 2A, P = 0.081), but pairwise differences were apparent between the AIP and CFRP groups (Fig. 2A, P = 0.041). Core taxa diversity was more similar between groups (Fig. 2A, P = 0.55) with no specific pairwise differences. As for the satellite taxa, again no significant overall differences between the three groups emerged (Fig. 2A, P = 0.080). There were, however, similar differences between the AIP and CFRP groups as demonstrated prior in the whole microbiota analysis (Fig. 2A, P = 0.041).Fig. 2. Microbiome diversity and similarity compared across healthy controls, alcohol-induced pancreatitis, and CFTR-related pancreatitis sample groups. Shown is the data for both shorter amplicon (V4) and full-length 16S rRNA sequencing data. Whole microbiota (black plots) and partitioned data into core (orange plots) and satellite taxa (grey plots) are given. A Differences in Fisher’s alpha index of diversity across groups. Black circles indicate individual patient data. Error bars represent samples that fall within the range, i.e. 1.5 times the inter-quartile range (IQR). Labelled P values describe the overall Kruskal-Wallis test values. Asterisks between groups denote a significant difference in diversity following Mann-Whitney U testing (***; P ≤ 0.0001, **; P ≤ 0.001, *; P ≤ 0.05). Summary statistics are provided in Table S3. B Microbiome variation measured within and between sampling groups, utilising the Bray-Curtis index of similarity. Error bars represent standard deviation of the mean.

For V4 16S rRNA analysis, whole microbiota diversity was similar across the three study groups (Fig. 2A, P = 0.32). However, when comparing microbiota partitions, differences were apparent. In particular, the core taxa diversity was significantly different across the three groups (Fig. 2A, P < 0.0001). This included significant differences between HC and both CP groups (P < 0.0001) but also between both CP groups themselves (P < 0.0001). Satellite taxa diversity was slightly different between groups (P = 0.095), primarily driven by the AIP group, for which diversity was larger than both the healthy control (P = 0.080) and CFRP (P = 0.058) groups. Additional incorporation of the Shannon index was implemented upon rarified reads (Fig. S1) to further account for community evenness across microbial diversity. These analyses yielded similar results between groups across all partitions of the microbiota, with no significant differences observed (P > 0.05). This extended across both the FL 16S and V4 16S rRNA analyses (Table S4).

Despite the differences across diversity, both sequencing approaches yielded similar compositional profiles across the various microbiota partitions. In terms of whole microbiota composition, the healthy control group had the highest intra-group similarity (Fig. 2B), irrespective of the sequencing approach analysed, and clustered more tightly than the CP groups. Whilst this trend was observed across the core taxa also, the satellite taxa similarity was highly variable across all study groups (Fig. 2B). When comparing between-group microbiota composition, differences were evident across all microbiota partitions across the FL 16S rRNA analyses (Table S5). The AIP microbiota composition was consistently different to healthy controls, including at the whole microbiota (P = 0.048), core taxa (P = 0.014), and satellite taxa (P < 0.0001) levels. As for the CFPR group, differences compared to AIP group were evident across both the core (P = 0.018) and satellite (P = 0.0018) taxa. Significant differences between healthy controls and CFRP were only apparent across the satellite taxa (P < 0.0001). Across the V4 16S rRNA analysis (Table S3), differences were more modest at the whole microbiota level. However, across both the core and satellite taxa, the healthy control group were significantly different to the AIP (P < 0.0001) and CFRP (P < 0.001 core, P < 0.0001 satellite) groups, respectively. Interestingly, these differences also extended to between both AIP and CFRP groups across these partitions (both P < 0.0001).

We next sought to investigate which taxa drove the differences observed across groups, by implementation of an exploratory SIMPER analysis. Across the FL 16S data (Table S6), common trends were observed between comparison of both CP groups with the healthy control group. Across the five taxa responsible for driving most dissimilarity between all groups (Fig. 3, Table S7), a reduction in the mean relative abundance of Bifidobacterium adolescentis, Ruminococcus bromii and Faecalibacterium prausnitizii across the CP groups was a common feature that drove such dissimilarity. Subsequent inter-group comparisons revealed significant differences in the relative abundance of F. prausnitizii between the healthy control group and both the AIP (P = 0.045) and CFRP (P = 0.018) groups. In addition, B. adolescentis relative abundance was significantly increased in healthy controls as compared to the AIP group (P = 0.045) (Table S6). Other taxa that typically increased across both CP groups relative to healthy controls, included members of the genus Streptococcus. When directly comparing both CP groups to one another, similar taxa were implicated alongside a reduction in other taxa, including Agathobacter rectalis and Akkermansia muciniphila in the CFRP group (Table S7). When utilising the V4 16S rRNA data for the SIMPER analysis (Table S8), similar taxa were again implicated across the differences observed amongst groups, including changes to B. adolescentis, R. bromii, and F. prausnitizii. The disparity amongst Proteobacterial relative abundance, namely Escherichia, was increased across the V4 16S rRNA analyses.Fig. 3. Radial bar plot depicting mean relative abundance across taxa highlighted as top drivers of dissimilarity across groups from SIMPER analysis (Table S7). Each concentric ring represents an increment of 1% mean relative abundance, with the outermost ring corresponding to 10%. Average contribution to microbial dissimilarity for a given taxa across the three groups is given in parenthesis below each respective taxon. All taxa shown above were designated as core based on significant distribution-abundance relationships. Taxa are organised in clockwise fashion based on mean relative abundance across healthy controls. Asterisks denote significant differences between taxa relative abundances (P < 0.05, FDR adjusted) across groups following Kruskal-Wallis testing. Detailed statistics and SIMPER data can be found in Tables S6 & S8.

In addition to investigating microbiota structural differences across the study groups, we also performed analysis with PiCRUSt2 to compare predicted metagenome function, by use of associated KEGG orthologs (KOs) and enzyme classifications (ECs) across samples based on their microbiota composition. Using the FL-16S data, associated KOs across samples were mapped to KEGG hierarchical pathways to determine if differences arose between the predicted metagenome function of sampling groups. Predicted functional profile similarity was indeed highest within the healthy control group, which clustered tighter than the CP groups based on the relative predicted pathway abundance (Fig. 4). There was no defined separation overall across groups based on the 95% ellipses, though some divergence was observed among various samples from both CP groups. In terms of pathway compositional differences (Table S9), the CFRP group was more similar to healthy controls as observed across microbiota structure previously, with the AIP group being generally dissimilar to both the healthy control and CFRP groups. When comparing function inferred instead from EC counts mapped to specific KEGG modules, again similar relationships across both intra- and inter-group analyses ensued (Fig. S2).

Fig. 4. Principal Coordinates Analysis (PCoA) plot depicting compositional differences in predicted functional pathways across sample groups. Pathway abundances were derived from full-length 16S rRNA amplicon data analysed with PiCRUSt2. KEGG orthologues were aggregated prior to annotation to Level 3 hierarchical pathways. Relative abundances of pathways per sample were calculated, and Bray-Curtis dissimilarity was used as the distance metric for PCoA. ANOSIM summary statistics can be found in Table S9.

Next, participant clinical characteristics (Table 1) were incorporated into multivariate analyses to investigate potential relationships with both microbiota structure and predicted function. Across microbiota structure, CP type was a significant explanator of both the core (pseudo-F = 1.7, P < 0.008) and satellite (pseudo-F = 1.3, P < 0.004) taxa. As for predicted function, only variation within the core taxa was significantly explained by CP type, although to a greater extent (F = 4.1, P < 0.009) (Table S10). Additional clinical variables did not significantly explain the observed variance across partitioned microbiota structure and function (Table S11).

Given the consistent impact unto both core taxa composition and predicted functional variation elicited by the type of CP present, we next compared how the most abundant predicted functions, those accounting for > 1% total predicted functionality, may have differed across the study groups (Fig. 5). Functions relating to key bacterial metabolic pathways were highly similar across all sampling groups. Across all groups, predicted functions of the core taxa were highly conserved, however there were some pairwise differences between HC and AIP groups specifically (Fig. 5A, Table S12). This included pathways relating to amino sugar and nucleotide sugar metabolism (P = 0.035) and ribosome (P = 0.013), respectively. As for the satellite taxa, many of these predicted associated functional pathways were highly conserved with the core taxa (Fig. 5B). There were, however, observed differences across all groups for multiple pathways (Table S12). This included ABC transporters (P = 0.049), glycolysis/gluconeogenesis-related functions (P = 0.049), and quorum sensing (P = 0.048). Given that most fundamental bacterial pathways were shared across both the core and satellite taxa, we also decided to investigate how predicted pathways exclusive to the satellite taxa, which were of increased relative abundance across CP participants, may have differed between all three groups (Fig. S3, Table S12). Of the predicted level three hierarchical pathways observed, many were ubiquitously present across all study groups and mapped to the site of the GI tract. In particular, this included biosynthesis of various secondary metabolites, polycyclic aromatic hydrocarbon degradation, non-alcoholic fatty liver disease, alcoholism, and colorectal cancer. Among these pathways, colorectal cancer trended toward significance across the total groupwise comparison (P = 0.150), with significant pairwise differences highlighted between CFRP and HC groups (P = 0.028).

Fig. 5. Differences in predicted metagenome function between study groups. CLR transformed compositional data was plotted to visualise differences among groups across KEGG level 3 hierarchical pathways accounting for > 1% total predicted functionality. A Core taxa. B Satellite taxa. Asterisks (*) denote significant differences across group means following ANOVA testing (P < 0.05). Associated summary statistics and full-length pathway names can be found in Tables S12-13.

Discussion

The principal aim of this study was to investigate how intestinal microbial communities may differ among CP subtypes, including additional comparisons back to healthy controls for further interpretation. For this purpose, we piloted both full-length (FL) and shorter amplicon (V4) sequencing of the 16S rRNA gene to profile faecal microbial communities, utilising collapsed ASVs for this purpose across both approaches [33]. Across similar sampling depths, as demonstrated by the raw read counts per sample post-sequencing, both approaches were comparable in partitioning total microbial communities (whole microbiota) based on significant distribution-abundance relationships across all three study groups. This is a necessary step as to isolate the common core constituents, that typically contribute a greater amount to principal functionality (in this case relating to the host), and the rarer satellite taxa which are often more opportunistic and responsive to various perturbation events across a given community [29, 30]. Indeed, this previously has been used across varying microbiome sites in health and disease to successfully partition taxa [34–36]. Following partitioning of the microbiota into these components, it was immediately apparent that both CP groups had an expansion to the relative abundance of the satellite taxa, indicative of perturbation to the local site or changes in local physiology [37].

Irrespective of microbiota partition studied, the FL 16S rRNA data yielded larger values of diversity as compared to V4 16S approach. There were subtle similarities across both the whole microbiota and satellite taxa analyses when groups were compared, however the core taxa diversity exhibited notable differences. Whilst core taxa diversity was increased in the healthy control groups as compared to both CP groups, this was significantly pronounced in the V4 16S rRNA approach. It is plausible that enhanced species-level classification across the FL 16S rRNA data is responsible for driving such discrepancies. Various constituents of the core intestinal microbiota community, such as members of the Actinobacteria (Bifidobacterium) are poorly resolved solely across the V4 region [38] and may otherwise have been labelled as the same taxon. The increased diversity across FL-16S data could have also been further inflated by increased identification of single nucleotide polymorphisms across chromosomes from individual bacteria [39]. This is likely to impede the satellite taxa specifically, whereby taxonomic resolution is likely hindered more than the core community, and for which clustering may not be possible at the species or genus level for a larger variety of taxa. Indeed, the disparity between diversity was largest across the satellite taxa between both sequencing approaches. Across the wider literature, there is evidence for both a reduction to gut microbiota diversity in CP [9, 11, 17], alongside no significant differences at all [16, 18, 40, 41] as compared with controls. In those latter studies, CP subtypes were often grouped together for such analyses or not explicitly depicted for sub-type comparisons. Interestingly, Zhou and colleagues showed that despite differences to controls, CP subtypes (alcohol and idiopathic) were not significantly different in terms of alpha-diversity metrics across a Chinese population [9]. It remains to be seen whether partitioning of microbial communities across such participants of different geographical origin and lifestyle renders significant differences in diversity.

Despite the preliminary differences across microbial structure from the diversity observed within the current study, analyses of microbial compositions exhibited high concordance between both sequencing approaches tested, importantly validating the reliability and accuracy of such principal observations across the microbial communities. Here, the healthy control subjects had increased similarity across the whole microbiota and core taxa composition as compared to both CP groups. The increased similarity observed amongst ‘healthy’ microbial ecosystems relative to their diseased counterparts has previously been termed the ‘Anna Karenina principle’, in which dysbiotic environments can stem from a variety of different perturbation events or factors associated with a given disease [42, 43]. Currently, this finding is mirrored across some studies from the CP microbiota [10, 17], with others indicating no increased similarity across a given group [9, 11]. Partitioning of microbiota into sub-communities is a pragmatic approach to decipher such relationships moving forward, given the high intragroup variance of satellite taxa composition, which seemingly drove the majority of biomass across CP participants here as observed across other dysbiotic intestinal environments [34, 35, 37, 44]. When partitioned, the composition of both core and satellite taxa across CP groups was significantly different to that of healthy controls, mirroring more general findings often observed across CP [9, 11, 15–18]. We also demonstrated significant microbiota compositional differences between CP groups themselves across both sequencing approaches, driven by the disparity among AIP participants. Differences across disease aetiology are not established, with the current limited evidence currently available [9] conflicting across relatively low group sample sizes. From our multivariate analyses, we also did not detect any changes resultant of pancreatic complications unto the microbial communities, namely resultant of exocrine pancreatic insufficiency nor endocrine pancreatic insufficiency. Whilst both independently associate with particular microbiota structure and function [45, 46], their compounding effects upon the CP microbiota are not ubiquitously reported [9–11, 40].

When comparing which taxa drove the differences between sampling groups from our SIMPER analysis, various taxa from both CP groups were consistently reduced in mean relative abundance as compared to the healthy controls. This decrease was most pronounced for B. adolescentis. Members of the genus Bifidobacterium can exhibit anti-inflammatory effects at the site of the intestinal tract through direct modulation [47], but also through lactate production which further extends to the site of the pancreas itself [48, 49]. Additionally, prominent short chain fatty acid (SCFA) producers such as F. prausnitzii, alongside key intermediate fermenters, namely R. bromii were decreased across CP. Collectively, these findings are consistent with the wider literature, in which key members of the Firmicutes and Actinobacteria relating to maintenance of intestinal immune homeostasis are reduced in CP [9–11, 15, 17, 18, 40]. It is well characterised that SCFAs, particularly butyrate, are important mediators of intestinal health relating to the modulation of local inflammatory responses and the maintenance of gut barrier integrity [50–52]. On the contrary, we also observed increases in the relative abundance of Streptococci across CP groups. Typically designated as satellite taxa across our CP groups, Streptococci have previously been associated with a proinflammatory intestinal environments [53], and their relative abundance has even been suggested as a potential early biomarker for pancreatic cancers [54]. These implications remain to be fully elucidated given the shorter 16S rRNA regions utilised, resulting in reduced species resolution across such studies.

An expansion of Proteobacteria across CP is well defined in the literature [8, 11, 15, 17, 55], however genera such as Escherichia and Klebsiella contributed relatively little to microbial dissimilarity between groups in our FL 16S rRNA analysis. They did, however, contribute significantly more when the V4 data was used for this purpose. This could reflect DNA library preparation biases for their detection with this primer set, or bias against them when V1-V9 (FL 16S) primers are utilised given differences observed in the GC% content across such regions [56]. Other bacterial relationships were generally well maintained at the genus level between the two sequencing approaches. When comparing which taxa drove differences between the two CP groups, both sequencing approaches highlighted members of genera previously discussed, with no clear trend with respect to the mean relative abundances across these seemingly beneficial taxa. Additionally, A. muciniphila, which reportedly exerts beneficial protective effects upon intestinal inflammation and other gastrointestinal abnormalities [57, 58], was shown to be decreased across the CFRP group as compared to the AIP participants. Decreases in A. muciniphila are not widely reported across human CP microbiota studies yet feature commonly within cystic fibrosis intestinal microbiota dysbiosis [59–61]. Given its utilisation of mucin as a primary source of nutrients, disruption to mucosal viscosity from heterozygous carriage of mutant CFTR and subsequent decrease in extracellular chloride and bicarbonate secretion could in theory exacerbate any diminishing effects upon A. muciniphila relative abundance in CP. Murine studies have further demonstrated bidirectionality between A. muciniphila abundance and optimal pancreatic outcomes [62, 63].

Following the high overall concordance of relationships between groups across both the FL and V4 16S rRNA compositional and SIMPER tests, we confidently opted to prioritise the FL data for subsequent analyses, including predictive metagenome functionality. Modest differences were highlighted across the study groups. The healthy control samples clustered tightly to one another, whilst the two CP groups demonstrated higher intra-sample variation. Between groups, the differences between the healthy control and AIP groups were most pronounced. For comparative purposes and subsequent analyses, we utilised annotated KEGG pathways at the level 3 hierarchy, given the increased detail of biochemical pathways, versus the more general functions and broader categories exhibited at lower levels. This was likely to ascertain any meaningful differences between groups, given that high similarity and functional redundancy is readily observed across individuals with different microbiota structures [64, 65]. Indeed, when comparing KEGG level 3 hierarchical pathways between groups, many key components of bacterial function and metabolism were highly conserved across not only the sampling groups, but across both partitions of the microbiota itself.

Of the most abundant pathways (%) identified from the core taxa overall, only ribosome and amino sugar and nucleotide sugar metabolism pathways exhibited pairwise group differences, with the latter increased in the AIP group specifically. Similarly increased in AIP amongst the satellite taxa, these implications remain to be further explored but may indicate adaptation of the community to utilise available substrates more effectively in the presence of environmental changes or stressors to local intestinal physiology [66]. Other differences among the satellite taxa were pathways such as those related to ABC transporters (increased in CFRP). ABC transporters are a diverse class of membrane proteins, with key roles in virulence across prokaryotes, which involves the uptake of essential nutrients, alongside efflux of toxins and other substances to their local environment to enhance their survival and overall fitness [67, 68]. They have previously been shown as elements of faecal metagenomic signatures relating to pancreatic cancer [69]. The CFRP group also demonstrated increases to pathways related to quorum sensing, which might further underpin effective expansion and virulence factor coordination across the satellite taxa in the presence of dysmotility and general bacterial overgrowth in CP [70, 71]. The potential importance of satellite was finally highlighted through our analysis of associated functional pathways exclusively found across this subcommunity. Of the GI-related pathways exclusively identified, pairwise significant differences of pathways relating to colorectal cancer was observed between CFRP and HC groups. Further work understanding which elements of the microbiome are attributable to this KEGG analysis is warranted but likely represents a combination of both direct and indirect microbiota-driven actions that affect host physiology and modulate inflammatory pathways.

These results of course remain to be validated, with wider implications explored using more sophisticated multi-omics approaches, as only a single study across the CP microbiome thus far has utilising metabolomics alongside metagenomics or 16S rRNA gene sequencing for example [18]. Additionally, larger cohorts of participants with alcohol and genetic-related pancreatitis holds considerable appeal, given the observed differences observed within our pilot analyses. This will inherently allow for more robust investigations into relationships between specific pancreatic manifestations and the gut microbiome. Whilst we had extensive data surrounding pancreatic sub-complications across our cohort, including pancreatic ductal strictures, pseudocysts, and necrosis, our limited sample size prevented reliable analysis of such factors. Given the proposed gut-pancreas axis, it should be of keen interest to determine signatures of gut microbiota translocation, or their metabolites, to such pancreatic tissues across CP [72]. Other lifestyle factors such as smoking status, were collected here but were exclusively co-correlated with the AIP group for the purpose of our multivariate approach. Additional exploration into smoking as a confounder may be warranted due to it impact upon CP and elements of the microbiota [11, 73].

Aside from the limitations of sample size inherent to the nature of pilot studies themselves, which here was further impacted by the COVID-19 pandemic, we also acknowledge some other caveats of this study. Firstly, we understand the predictive nature of microbiota functionality should be considered putative given the sole use of the 16S rRNA gene to infer this function. For this reason, we only utilised full-length 16S rRNA sequencing, as to resolve taxa more confidently down to the species level and use such ASVs for subsequent bioinformatic pipelines. Nonetheless, the sole use of shorter hypervariable 16S regions have previously been used to infer microbial function across the gut microbiome in CP and pancreatic cancer [10, 16–18]. Our approach for partitioning bacterial taxa into common core and satellite taxa has been successfully applied across prior human microbiota studies [34, 36]. The 75% threshold for core taxa has been successfully used in other studies consisting of limited pilot datasets [35, 44]. Here, it was best fit to the log-normal distribution for the healthy control group across both FL 16S and V4 16S datasets, whilst remaining statistically viable as a definitive threshold for core taxa across both CP. This threshold was therefore pragmatically selected, with FL-16S healthy control data being the main comparator to both CP groups and further utilised for downstream predictive metagenomics.

Whilst our inclusion criteria specifically excluded participants with recent antibiotic usage, we understand that antibiotic administration across CP patients to treat infections associated with the aforementioned sub-complications [74, 75] are likely historical perturbation events within our older CP cohort. It may be of interest to retrospectively investigate antibiotic usage across participants moving forward, or rather investigate microbiome dynamics longitudinally with increased granularity, as to determine the impacts of such events upon intestinal microbial communities. Whilst our smaller sample size does limit statistical power for extensive investigations, it also highlights the principal strength of this study, which is its ability to highlight microbiota differences across CP subtypes based on both structure and predicted metagenome functions. Subsequent work will incorporate additional compositionally aware approaches to further refine and confidently identify physiologically relevant taxa and metagenome functions. Collectively, this work warrants the additional investigation into more specific relationships across GI physiology and the microbiota in CP.

Conclusions

Overall, we implemented full length 16S rRNA sequencing for the first time to demonstrate that gut microbiota structure and predicted function differs between not only healthy controls and CP, but across CP subtypes themselves. This may be in part due to inherent differences across intestinal and pancreatic physiology, alongside lifestyle factors relating to disease. Differences in predicted functionality of the core and satellite taxa are evident, relating to microbial processes that may have downstream physiological consequences at the site of the GI tract. Future studies will employ larger cohorts with the incorporation of additional omics-based approaches. This will better elucidate differences in microbiota structure and functionality across CP subtypes, including relationships with pancreatic-based clinical outcomes.

Supplementary Information

Additional file 1.

Bibliography7

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Whitcomb DC, Group for the NAPS, Pancreatitis. TIGAR-O version 2 Risk/Etiology checklist with topic Reviews, Updates, and use primers. Clin Transl Gastroenterol. 2019;10(6).10.14309/ctg.0000000000000027 PMC 661386331166201 · doi ↗ · pubmed ↗
2Ma Z (Sam), editor. Testing the Anna Karenina Principle in Human Microbiome-Associated Diseases. i Science. 2020;23(4):101007.10.1016/j.isci.2020.101007 PMC 716332432305861 · doi ↗ · pubmed ↗
3Enaud R, Hooks KB, Barre A, Barnetche T, Hubert C, Massot M et al. Intestinal inflammation in children with cystic fibrosis is associated with Crohn’s-Like microbiota disturbances. J Clin Med. 2019;8(5):645 .10.3390/jcm 8050645 PMC 657224331083321 · doi ↗ · pubmed ↗
4Yang J, Ma Y, Tan Q, Zhou B, Yu D, Jin M et al. Gut Streptococcus is a microbial marker for the occurrence and liver metastasis of pancreatic cancer. Front Microbiol. 2023;14:1184869 .10.3389/fmicb.2023.1184869 PMC 1030644137389332 · doi ↗ · pubmed ↗
5Wang Ljuan, Jin Y ling, Pei W long, Li Jcong, Zhang R lin, Wang Jju et al. Amuc_1100 pretreatment alleviates acute pancreatitis in a mouse model through regulating gut microbiota and inhibiting inflammatory infiltration. Acta Pharmacol Sin. 2024;45(3):570–80.10.1038/s 41401-023-01186-4PMC 1083444838012292 · doi ↗ · pubmed ↗
6Lee AA, Baker JR, Wamsteker EJ, Saad R, Di Magno MJ. Small intestinal bacterial overgrowth is common in chronic pancreatitis and associates with Diabetes, chronic pancreatitis Severity, low zinc Levels, and opiate use. Am J Gastroenterol. 2019;114(7):1163-171 .10.14309/ajg.0000000000000200 PMC 661075331008737 · doi ↗ · pubmed ↗
7Aghdassi AA, Mayerle J, Kraft M, Sielenkämper AW, Heidecke CD, Lerch MM. Pancreatic pseudocysts - when and how to treat? HPB (Oxford). 2006;8(6):432–41.10.1080/13651820600748012 PMC 202075618333098 · doi ↗ · pubmed ↗