# In-Depth Multi-Assembler Venom-Gland Transcriptomics of Three Medically Important Colombian Snakes Highlights Diversity of Accessory, Low-Abundance Protein Families

**Authors:** Mónica Saldarriaga-Córdoba, Claudia Clavero-León, Paola Rey-Suárez, Vitelbina Núñez-Rangel, Sebastián Estrada-Gómez

PMC · DOI: 10.3390/toxins18030118 · Toxins · 2026-02-25

## TL;DR

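This study reconstructs the venom-gland transcriptomes of three medically important Colombian snakes (Bothrops asper, Crotalus durissus cumanensis, and Micrurus mipartitus) and shows that integrating several assemblers recovers accessory, low-abundance protein families that analyses centered on dominant toxins tend to overlook.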

## Contribution

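The study compares four assemblers (Trinity, SPAdes, and SOAPdenovo-Trans with k = 31 and k = 97) and integrates their output into a non-redundant meta-assembly to uncover low-abundance and accessory venom proteins in medically important snakes.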

## Key findings

- Using multiple assemblers improved the discovery of diverse venom-related protein variants.
- The curated toxinomes of the three snake species revealed substantial diversity in both major and accessory protein families.
- Assembler choice strongly influenced transcript variant recovery and overall transcriptome completeness.
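
One way to picture the assembler effect is to tally, per assembler, how many curated variants it recovered and how many it alone recovered. The sketch below is illustrative only: the function name `recovery_summary`, the variant labels, and the toy counts are hypothetical placeholders, not data or code from the paper.

```python
def recovery_summary(variants_by_assembler):
    """Summarize variant recovery per assembler.

    `variants_by_assembler` maps an assembler label to the set of variant
    identifiers it recovered (identifiers here are made-up placeholders).
    """
    all_variants = set().union(*variants_by_assembler.values())
    summary = {}
    for name, found in variants_by_assembler.items():
        # Variants recovered by any assembler other than the current one.
        others = set().union(
            *(v for k, v in variants_by_assembler.items() if k != name)
        )
        summary[name] = {
            "recovered": len(found),
            "unique": len(found - others),
            "fraction_of_union": len(found) / len(all_variants) if all_variants else 0.0,
        }
    return summary


# Hypothetical toy input: labels v1..v4 stand in for curated variants.
toy_variants = {
    "Trinity": {"v1", "v2", "v3"},
    "SPAdes": {"v1", "v4"},
    "SOAPdenovo-Trans_k31": {"v1", "v2"},
    "SOAPdenovo-Trans_k97": {"v1"},
}
toy_summary = recovery_summary(toy_variants)
```

In this toy input, two assemblers each contribute a variant that no other assembler found, which is the kind of complementarity that motivates merging assemblies rather than picking one.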

## Abstract

Typically, most omics analyses (proteomic and transcriptomic) of snakes focus on the dominant enzymatic proteins used for evolutionary analysis or on those involved in envenoming symptoms. This study presents a comprehensive multi-assembler transcriptomic analysis focused on the non-dominant, enzymatic or non-enzymatic, putative proteins of the venom glands of three medically significant Colombian snake species. Together, these results highlight how continued improvements in modern omics workflows, coupled with extensive manual curation, enable more complete discovery of putative protein variants when multiple assemblers are integrated. Here, we reconstructed the toxinomes of the viperids Bothrops asper and Crotalus durissus cumanensis, and the elapid Micrurus mipartitus, by comparing four assemblers (Trinity, SPAdes, and SOAPdenovo-Trans with k = 31 and k = 97) and integrating them into a non-redundant meta-assembly. Protein-candidate alignments were extensively inspected, and the validation of conserved domains and functional motifs is discussed. The curated toxinomes revealed substantial diversity across major and accessory families, and assembler choice strongly affected transcript variant recovery. Together, these results provide a more comprehensive view of venom-gland transcriptome analysis and diversity, expanding the set of candidate venom components for future functional and proteomic validation, with potential implications for venom composition studies and antivenom development.
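
The abstract does not say how redundancy was removed when the assemblies were integrated. As a minimal sketch of the idea, the code below assumes the simplest possible rule: contigs are collapsed only when they are identical or exact reverse complements, and each surviving sequence keeps a record of which assemblers produced it. The names `canonical` and `merge_assemblies` and the toy contigs are hypothetical. Real pipelines typically also collapse near-identical or contained sequences, which this sketch does not attempt.

```python
from collections import defaultdict

# Translation table for complementing unambiguous DNA bases plus N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical(seq):
    """Strand-independent key: the lexicographically smaller of the two strands."""
    seq = seq.upper()
    return min(seq, reverse_complement(seq))


def merge_assemblies(assemblies):
    """Collapse exact duplicates across assemblies (assumed rule, not the paper's).

    `assemblies` maps an assembler label to a list of contig sequences.
    Returns a dict mapping each canonical sequence to the set of assembler
    labels that produced it.
    """
    merged = defaultdict(set)
    for assembler, contigs in assemblies.items():
        for contig in contigs:
            merged[canonical(contig)].add(assembler)
    return dict(merged)


# Hypothetical toy contigs; "GGCCAT" is the reverse complement of "ATGGCC".
toy_contigs = {
    "Trinity": ["ATGGCC", "ATGAAA"],
    "SPAdes": ["GGCCAT", "ATGTTT"],
    "SOAPdenovo-Trans_k31": ["ATGAAA"],
    "SOAPdenovo-Trans_k97": [],
}
meta_assembly = merge_assemblies(toy_contigs)
```

Keeping the provenance set for each sequence means the merged assembly can still be queried for which assembler contributed which transcript.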

## Linked entities

- **Species:** Bothrops asper (taxon 8722), Crotalus durissus cumanensis (taxon 184542), Micrurus mipartitus (taxon 430902)

## Full-text entities

- **Genes:** BCHE (butyrylcholinesterase) [NCBI Gene 590] {aka BCHED, CHE1, CHE2, E1}, WFDC2 (WAP four-disulfide core domain 2) [NCBI Gene 10406] {aka BENP, EDDM4, HE4, WAP5, dJ461P17.6}, SPINT1 (serine peptidase inhibitor, Kunitz type 1) [NCBI Gene 6692] {aka HAI, HAI1, MANSC2}, PMCH (pro-melanin concentrating hormone) [NCBI Gene 5367] {aka MCH, ppMCH}, PLA2G4A (phospholipase A2 group IVA) [NCBI Gene 5321] {aka GURDP, PLA2G4, cPLA2, cPLA2-alpha}, GDF10 (growth differentiation factor 10) [NCBI Gene 2662] {aka BIP, BMP-3b, BMP3B}, FCN2 (ficolin 2) [NCBI Gene 2220] {aka EBP-37, FCNL, P35, ficolin-2}, SRPX2 (sushi repeat containing protein X-linked 2) [NCBI Gene 27286] {aka BPP, CBPS, PMGX, RESDX, SRPUL}, ENPP3 (ectonucleotide pyrophosphatase/phosphodiesterase 3) [NCBI Gene 5169] {aka B10, CD203c, NPP3, PD-IBETA, PDNP3}, MASP1 (MBL associated serine protease 1) [NCBI Gene 5648] {aka 3MC1, CRARF, CRARF1, MAP-1, MAP1, MASP}, ALDH7A1 (aldehyde dehydrogenase 7 family member A1) [NCBI Gene 501] {aka ATQ1, EPD, EPEO4, PDE}, F10 (coagulation factor X) [NCBI Gene 2159] {aka FX, FXA}, ENPP4 (ectonucleotide pyrophosphatase/phosphodiesterase 4) [NCBI Gene 22875] {aka NPP4}, KNG1 (kininogen 1) [NCBI Gene 3827] {aka BDK, BK, HAE6, HK, HMWK, KNG}, F2 (coagulation factor II, thrombin) [NCBI Gene 2147] {aka PT, RPRGL2, THPH1}, FLT4 (fms related receptor tyrosine kinase 4) [NCBI Gene 2324] {aka CHTD7, FLT-4, FLT41, LMPH1A, LMPHM1, PCL}, FHIT (fragile histidine triad diadenosine triphosphatase) [NCBI Gene 2272] {aka AP3Aase, FRA3B}, AP2B1 (adaptor related protein complex 2 subunit beta 1) [NCBI Gene 163] {aka ADTB2, AP105B, AP2-BETA, CLAPB1}, NGF (nerve growth factor) [NCBI Gene 4803] {aka Beta-NGF, HSAN5, NGFB}, FLT1 (fms related receptor tyrosine kinase 1) [NCBI Gene 2321] {aka FLT, FLT-1, VEGFR-1, VEGFR1}, NPR1 (natriuretic peptide receptor 1) [NCBI Gene 4881] {aka ANP-A, ANPRA, ANPa, GC-A, GUC2A, GUCY2A}, ENPP6 (ectonucleotide pyrophosphatase/phosphodiesterase 6) [NCBI Gene 133121] {aka NPP6}, PGF (placental growth factor) [NCBI Gene 5228] {aka D12S1900, PGFL, PIGF, PLGF, PlGF-2, SHGC-10760}, NTRK1 (neurotrophic receptor tyrosine kinase 1) [NCBI Gene 4914] {aka MTC, TRK, TRK1, TRKA, Trk-A, p140-TrkA}, NRP1 (neuropilin 1) [NCBI Gene 8829] {aka BDCA4, CD304, NP1, NRP, VEGF165R}, EGF (epidermal growth factor) [NCBI Gene 1950] {aka HOMG4, URG}, FGB (fibrinogen beta chain) [NCBI Gene 2244] {aka HEL-S-78p}, PLA2G2A (phospholipase A2 group IIA) [NCBI Gene 5320] {aka MOM1, PLA2, PLA2B, PLA2L, PLA2S, PLAS1}, NPR2 (natriuretic peptide receptor 2) [NCBI Gene 4882] {aka AMDM, ANPRB, ANPb, ECDM, GC-B, GCB}, LIPA (lipase A, lysosomal acid type) [NCBI Gene 3988] {aka CESD, LAL}, VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, CNP (2',3'-cyclic nucleotide 3' phosphodiesterase) [NCBI Gene 1267] {aka CN37, CNP1, HLD20}, PI3 (peptidase inhibitor 3) [NCBI Gene 5266] {aka ESI, SKALP, WAP3, WFDC14, cementoin}, NT5E (5'-nucleotidase ecto) [NCBI Gene 4907] {aka CALJA, CD73, E5NT, NT, NT5, NTE}, VEGFC (vascular endothelial growth factor C) [NCBI Gene 7424] {aka Flt4-L, LMPH1D, LMPHM4, VRP}, VEGFB (vascular endothelial growth factor B) [NCBI Gene 7423] {aka VEGFL, VRF}, KDR (kinase insert domain receptor) [NCBI Gene 3791] {aka CD309, FLK1, VEGFR, VEGFR2}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, ACHE (acetylcholinesterase (Yt blood group)) [NCBI Gene 43] {aka ACEE, ARACHE, N-ACHE, YT}, FCN1 (ficolin 1) [NCBI Gene 2219] {aka FCNM}, IL4I1 (interleukin 4 induced 1) [NCBI Gene 259307] {aka FIG1, LAAO, LAO, hIL4I1}, SLPI (secretory leukocyte peptidase inhibitor) [NCBI Gene 6590] {aka ALK1, ALP, BLPI, HUSI, HUSI-1, HUSI-I}, VEGFD (vascular endothelial growth factor D) [NCBI Gene 2277] {aka FIGF, VEGF-D}
- **Diseases:** pain (MESH:D010146), cardiovascular diseases (MESH:D002318), PD (MESH:D010300), SV (MESH:C000719210), coronary artery disease (MESH:D003324), neurodegenerative conditions (MESH:D019636), peripheral arterial disease (MESH:D058729), QPD (MESH:C536260), CTL (OMIM:211750), carotid atherosclerosis (MESH:D002340), injury to (MESH:D014947), brain injury (MESH:D001930), depression (MESH:D003866), hypotensive (MESH:D007022), pulmonary hypertension (MESH:D006976), drop in blood pressure (MESH:D006973), aortic aneurysms (MESH:D001014), snakebite envenomation (MESH:D012909), AD (MESH:D000544), neuro- or hemotoxicity (MESH:C536203), hypoxic ischemic (MESH:D020925), inflammation (MESH:D007249), snake venom accidents (MESH:D000081084), dissection (MESH:D000784)
- **Chemicals:** heparin (MESH:D006493), Arg (MESH:D001120), hyaluronic acid (MESH:D006820), ADP (MESH:D000244), Zn+ (MESH:D015032), lipid (MESH:D008055), galactose (MESH:D005690), glycan (MESH:D011134), calcium (MESH:D002118), captopril (MESH:D002216), L-type Ca+2 (-), Ser (MESH:D012694), GPI (MESH:D017261), cGMP (MESH:D006152), disintegrins (MESH:D019483), AMP (MESH:D000249), disulfide (MESH:D004220), Asp (MESH:D001224), FAD (MESH:D005182), phosphoserine (MESH:D010768), ATP (MESH:D000255), carbohydrate (MESH:D002241), His (MESH:D006639), vitamin-K (MESH:D014812), glycerophospholipids (MESH:D020404), Natriuretic peptides (MESH:D045265), mannose (MESH:D008358), NAD (MESH:D009243), phenylephrine (MESH:D010656), Lys (MESH:D008239), peptide (MESH:D010455), sialic acids (MESH:D012794), Glu (MESH:D018698), GPC (MESH:D005997), H2O2 (MESH:D006861), ACh (MESH:D000109), L-amino acids (MESH:D000596), Cys (MESH:D003545), asparagine (MESH:D001216), phospholipids (MESH:D010743), Gla (MESH:D017965), aminoglycoside (MESH:D000617)
- **Species:** Bothrops atrox (barba amarilla, species) [taxon 8725], Homo sapiens (human, species) [taxon 9606], Naja naja (Indian cobra, species) [taxon 35670], Crotalus durissus terrificus (cascabel, subspecies) [taxon 8732], Micrurus altirostris (species) [taxon 129457], Crotalus durissus (cascabel, species) [taxon 8731], Bitis arietans (African puff adder, species) [taxon 8692], Trypanosoma cruzi (species) [taxon 5693], Pseudechis australis (king brown snake, species) [taxon 8670], Micrurus fulvius (eastern coral snake, species) [taxon 8637], Echis coloratus (species) [taxon 64175], Philodryas olfersii (species) [taxon 120305], Austrelaps labialis (species) [taxon 471292], Bos taurus (bovine, species) [taxon 9913], Micrurus tener (Texas coral snake, species) [taxon 1114301], Erythrolamprus poecilogyrus (species) [taxon 338838], Micrurus mipartitus (species) [taxon 430902], Crotalus durissus cumanensis (subspecies) [taxon 184542], Mus musculus (house mouse, species) [taxon 10090], Naja nigricollis (black-necked spitting cobra, species) [taxon 8654], Bothrops insularis (golden lancehead, species) [taxon 8723], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Crotalus adamanteus (eastern diamondback rattlesnake, species) [taxon 8729], Bothrops jararaca (jararaca, species) [taxon 8724], Serpentes (snakes, infraorder) [taxon 8570], Oxyuranus microlepidotus (species) [taxon 111177], Vipera ammodytes ammodytes (western sand viper, subspecies) [taxon 8705], Demansia vestigiata (black whip snake, species) [taxon 412038], Drysdalia coronoides (species) [taxon 66186], Sus scrofa (pig, species) [taxon 9823], Laticauda semifasciata (broad-banded blue sea krait, species) [taxon 8631], Walterinnesia aegyptia (species) [taxon 64182], Bothrops asper (terciopelo, species) [taxon 8722], Naja atra (Chinese cobra, species) [taxon 8656], Echis (genus) [taxon 8699], Tropidechis carinatus (species) [taxon 100989], Bungarus fasciatus (banded krait, species) [taxon 8613], Ophiophagus hannah (king cobra, species) [taxon 8665], Bungarus bungaroides (species) [taxon 282164], Naja kaouthia (monocled cobra, species) [taxon 8649], Notechis scutatus (common tiger snake, species) [taxon 8663], Rhabdophis tigrinus tigrinus (subspecies) [taxon 193080], Notechis scutatus scutatus (subspecies) [taxon 70142], Pantherophis guttatus (species) [taxon 94885], Bothrops jararacussu (jararacussu, species) [taxon 8726], Protobothrops flavoviridis (habu, species) [taxon 88087], Agkistrodon piscivorus piscivorus (Eastern cottonmouth, subspecies) [taxon 8716], Rynchops (genus) [taxon 227183], Hydrophis hardwickii (Hardwick's sea snake, species) [taxon 8781], Cerberus rynchops (species) [taxon 46267], Camponotus atrox (species) [taxon 604843], Bothrops cotiara (cotiara, species) [taxon 8727], Crotalus atrox (western diamondback rattlesnake, species) [taxon 8730], Rattus norvegicus (brown rat, species) [taxon 10116]
- **Mutations:** C46939-C, C388168-C, serine at position 419, Asn67 and Asn153 were replaced by Lys and Thr, N153T, N67K
- **Cell lines:** DN2890 — Homo sapiens (Human), Parkinson disease, Induced pluripotent stem cell (CVCL_C0K6), DN2597i1 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_A547), DN359 — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_6226)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13030222/full.md

## Figures

25 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13030222/full.md

## References

143 references — full list in the complete paper: https://tomesphere.com/paper/PMC13030222/full.md

---
Source: https://tomesphere.com/paper/PMC13030222