# Sequence‐Based and Functional Analysis for the Discovery of N‐Glycan Degrading Glycosidases From the Microbial Metagenome of the Infant Gut

**Authors:** Irene Boscá‐Sánchez, Jesús Rodríguez‐Díaz, María J. Yebra

PMC · DOI: 10.1002/mbo3.70264 · 2026-03-11

## TL;DR

This study identifies and characterizes glycosidases in the infant gut microbiome that break down N-glycans, revealing their potential role in shaping the microbiota.

## Contribution

The paper discovers and functionally characterizes novel glycosidases from the infant gut metagenome that act on N-glycans.

## Key findings

- Nine β-galactosidases were identified, with two (Gal1b and Gal99) capable of removing galactose from glycoconjugates.
- Exo360 uniquely hydrolyzes GlcNAc at both α1,3 and α1,6 branches of N-glycans, while Endo38 and Endo358 release full N-glycan structures.
- The infant gut microbiota shows a diverse range of glycosidases with distinct substrate preferences for N-glycan degradation.

## Abstract

The role of bacterial glycosyl hydrolases (GHs) in degrading free human milk oligosaccharides is well documented. However, their activity on glycoconjugates is less well known. Here, an in silico analysis of the metagenome of the fecal microbiome of breastfed infants was employed to identify GH2 β‐galactosidases, GH20 exo‐N‐acetylglucosaminidases and GH18 endo‐N‐acetylglucosaminidases active on N‐glycans. A total of nine β‐galactosidases were recombinantly expressed and two of them, Gal1b and Gal99, were able to remove galactose from the G2 peptide and asialofetuin. Gal1b, Gal25, Gal37c, Gal99 and Gal296 hydrolyzed lactose and N‐acetyllactosamine, indicating specificity for galactose β1,4‐linked to glucose or GlcNAc. All of the exo‐β‐N‐acetylglucosaminidases studied here (Exo10a, Exo18, Exo38, Exo39b, Exo360 and Exo399) hydrolyzed the disaccharide N‐acetylglucosaminyl‐β1,2‐mannose, which forms part of the N‐glycan structures. Exo10a, Exo38 and Exo360 hydrolyzed N‐acetylglucosamine (GlcNAc) from the G2 peptide pretreated with Gal1b. Notably, Exo360 hydrolyzed GlcNAc at both the α1,3 and α1,6 branches of the G2 peptide core mannose simultaneously, whereas Exo10a showed a preference for GlcNAc at one branch. Exo38 and Exo360 also release GlcNAc from asialofetuin once galactose has been removed. The whole structures of N‐glycans were liberated from glycoproteins by the action of the endo‐N‐acetylglucosaminidases Endo38 and Endo358. These enzymes hydrolyze the N,N’‐diacetylchitobiose core of N‐linked glycans of the high‐mannose and non‐sialylated complex types, respectively. Overall, these results provide insight into the range of glycosyl hydrolases present in the infant gut microbiota that act on glycoconjugates, which may play a role in the establishment and composition of the newborn microbiota.

This research offers valuable understanding of how N‐glycans are broken down by the gut microbiota of infants, focusing on glycoside hydrolase families GH2, GH20, and GH18. It reveals that the gut microbiota of breastfed infants has a diverse array of genes coding for these enzymes. The functional analysis of GH2 β‐galactosidases, GH20 exo‐N‐acetylglucosaminidases, and GH18 endo‐N‐acetylglucosaminidases shows distinct substrate preferences, highlighting their involvement in the degradation of dietary and host‐derived glycans.

## Linked entities

- **Proteins:** lgals2b (galectin 2b)
- **Chemicals:** galactose (PubChem CID 6036), lactose (PubChem CID 6134), N-acetyllactosamine (PubChem CID 439271), N-acetylglucosamine (PubChem CID 439174), GlcNAc (PubChem CID 439174), N,N’-diacetylchitobiose (PubChem CID 439544)

## Full-text entities

- **Genes:** MLEC (malectin) [NCBI Gene 9761] {aka KIAA0152}, SNAI1 (snail family transcriptional repressor 1) [NCBI Gene 6615] {aka SLUGH2, SNA, SNAH, SNAIL, SNAIL1, dJ710H13.1}, OGA (O-GlcNAcase) [NCBI Gene 10724] {aka MEA5, MGEA5, NCOAT}, IGKV3-31 (immunoglobulin kappa variable 3-31 (pseudogene)) [NCBI Gene 28910] {aka A16, A16a, IGKV331}, AHSG (alpha 2-HS glycoprotein) [NCBI Gene 280988], IGKV2D-18 (immunoglobulin kappa variable 2D-18 (pseudogene)) [NCBI Gene 28888] {aka A13, IGKV2D18}, GLB1 (galactosidase beta 1) [NCBI Gene 2720] {aka EBP, ELNR1, MPS4B}, GGH (gamma-glutamyl hydrolase) [NCBI Gene 8836] {aka GATD10, GH}, RCC1 (regulator of chromosome condensation 1) [NCBI Gene 1104] {aka CHC1, IIAAN, RCC1-I}, MAL (mal, T cell differentiation protein (MAL blood group)) [NCBI Gene 4118] {aka HLD28, MVP17, VIP17}, LTF (lactotransferrin) [NCBI Gene 280846] {aka Lf}, GHS (Goldenhar syndrome) [NCBI Gene 7971], GH2 (growth hormone 2) [NCBI Gene 2689] {aka GH-V, GHB2, GHL, GHV, hGH-V}, GLB1 (galactosidase beta 1) [NCBI Gene 507188], CTBS (chitobiase) [NCBI Gene 1486] {aka CTB}
- **Diseases:** inflammatory bowel disease (MESH:D015212), infection (MESH:D007239), cancer (MESH:D009369)
- **Chemicals:** SCFAs (MESH:D005232), trisaccharide (MESH:D014312), fucose (MESH:D005643), Glc (MESH:D005947), PVDF (MESH:C024865), lacto-N-triose II (MESH:C098555), Tween 20 (MESH:D011136), N,N'-diacetylchitobiose (MESH:C010913), glycopeptide (MESH:D006020), sialic acids (MESH:D012794), ampicillin (MESH:D000667), GlcNAc (MESH:D000117), carbohydrate (MESH:D002241), Glycoconjugates (MESH:D006001), glycerol (MESH:D005990), Oligosaccharides (MESH:D009844), Gal296 (-), NaOH (MESH:D012972), valerate (MESH:D014631), Gal (MESH:D005690), Asn (MESH:D001216), SDS (MESH:D012967), CaCl2 (MESH:D002122), Peptide (MESH:D010455), disaccharide (MESH:D004187), N,N',N"-triacetylchitotriose (MESH:C019117), water (MESH:D014867), Sialic acid (MESH:D019158), Monosaccharides (MESH:D009005), agar (MESH:D000362), Coomassie brilliant blue (MESH:C004692), N (MESH:D009584), lactose (MESH:D007785), Glycans (MESH:D011134), sugar (MESH:D000073893), Man (MESH:D008358), GlcNAcbeta1-4GlcNAc (MESH:C097320), N-acetyllactosamine (MESH:C000458)
- **Species:** Bacillus infantis (species) [taxon 324767], Bifidobacterium pseudolongum (species) [taxon 1694], Bos taurus (bovine, species) [taxon 9913], Homo sapiens (human, species) [taxon 9606], Bifidobacterium dentium (species) [taxon 1689], Klebsiella oxytoca (species) [taxon 571], Escherichia coli str. K-12 substr. DH10B (no rank) [taxon 316385], Bacteroides thetaiotaomicron (species) [taxon 818], Bifidobacterium adolescentis (species) [taxon 1680], Megasphaera massiliensis (species) [taxon 1232428], Enterococcus faecalis (species) [taxon 1351], Bacteroides caccae (species) [taxon 47678], Klebsiella pneumoniae (species) [taxon 573], Escherichia coli (E. coli, species) [taxon 562]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12976823/full.md

---
Source: https://tomesphere.com/paper/PMC12976823