Promoter Region and Regulatory Elements of IGF and VIP Genes Associated With Reproductive Traits in Chicken
Bosenu Abera, Hunduma Dinka, Hailu Dadi, Habtamu Abera

TL;DR
This study explores the promoter regions and regulatory elements of chicken IGF and VIP genes linked to reproductive traits using computational tools.
Contribution
The study identifies key regulatory elements and transcription factors associated with IGF and VIP genes in chickens, particularly their connection to the Wnt signaling pathway.
Findings
The highest promoter prediction scores were found for IGFP4, VIP, and VIPR1 genes.
Motif II resembles three major transcription factor families and accounts for 79.17% of the motif similarity.
Key regulatory elements like CPEB1 and TCF7L2 are linked to embryonic and nervous system development.
Abstract
This study investigates the promoter region and regulatory elements of chicken insulin-like growth factor (IGF) and vasoactive intestinal polypeptide (VIP) genes associated with reproductive traits. Several in silico tools, such as Neural Network Promoter Prediction (NNPP), Multiple Expectation maximizations for Motif Elicitation (MEME-Suite), GC-Profiles, microsatellite prediction (MISA-web), CLC Genomics, Multiple Association Network Integration Algorithm (GeneMANIA), and Gene Ontology for Motifs (GOMO), were used to characterize the promoter regions and regulatory elements of IGF and VIP genes. The in silico analysis showed that the highest promoter prediction scores (1.0) for TSS were obtained for three gene sequences (IGFP4, VIP, and VIPR1), while the lowest promoter prediction score (0.8) was obtained for IGF1. The present analysis revealed that the best common motif, Motif II,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals · Cancer-related molecular mechanisms research
1. Introduction
The chicken insulin-like growth factor (IGF) and vasoactive intestinal peptide (VIP) genes are physiological candidates associated with reproductive traits [1, 2]. These genes are evolutionarily well-conserved peptides with sequence similarity among animals, including chickens, and play a key role in regulating signalling pathways in different animals [3]. The IGF system comprises IGF1 and IGF2 (Insulin-Like Growth Factor 2), along with their receptors IGF1R and IGF2R, IGF-Binding Proteins 1–6 (IGFBP1–6), and IGF2-Binding Proteins 1–3 (IGF2BP1–3). The VIP gene possesses two significant receptors, VPAC1 and VPAC2, which are extensively present in the central nervous system and peripheral tissues [4]. The biological functions of IGFs and the VIP gene are realized through their interaction with receptors [5–7]. The IGFs function as the major mediator of growth hormone (GH)–stimulated somatic growth [8], whereas the VIP gene regulates several ovarian functions such as follicular development, ovulation, steroidogenesis, and granulosa cell apoptosis [9].
Studies revealed that the process of chicken reproduction is regulated by the hypothalamic-pituitary-gonad axis [10]. Those findings imply that there are numerous genes associated with reproductive traits that are affecting the performance of chicken. Among others, the IGF systems and VIP gene are expressed in various organs in early embryonic developmental stages and play important fundamental roles in chicken embryo development [11, 12]. These genes also affect the reproductive traits of chicken via influencing gonadotropin-releasing hormone secretion [13–18]. Reproductive traits play a vital role in the genetic improvement of laying and broiler chickens. They dictate the fertility and feasibility of chicken production in the poultry industry [19]. Thus, there is a growing interest in identifying and characterizing specific genes and genomic regions that play a role in the regulation of reproductive processes.
Regulatory elements are DNA sequences that control gene expression through transcription initiation (promoters) and by enhancing transcription at distant regions (enhancers). Promoters are one of the key regulatory elements that belong to noncoding regions and determine the direction of transcription and regulate gene expression [20–22]. The promoter sequences are typically located directly upstream or at the 5′ end of the transcription initiation site [23]. These regions contain CpG islands that are known to regulate gene expression through transcriptional silencing of the corresponding gene. About half of all CGIs self-evidently contain TSSs, as they coincide with promoters of annotated genes [24].
Studies on chickens identified many CpG islands and tissue-specific regulatory elements that could affect downstream genes [25–27]. Those findings indicate that an accurate identification of regulatory elements is fundamental for annotating genomes and understanding gene expression patterns. However, the regulatory mechanisms of these genes in the chicken reproduction traits have not yet been reported. In silico tools have been an integral part of the biological research designed to uncover meaningful information from biological data within a very short time. Therefore, the aim of this study was to predict promoter and regulatory elements of IGF and VIP genes associated with reproductive traits and thereby contribute to the improvement of reproductive performance and fertility in chickens.
2. Results
2.1. Identification of TSS and Promoter Regions of IGF and VIP Genes
The study of promoter regions for IGF and VIP genes associated with reproductive traits in chickens revealed a notable variation in the number of transcription start sites (TSSs), with 73.33% or 11 out of 15 sequences containing multiple TSSs (Table 1). Additionally, the study found that seven (46.7%) TSSs are located at a distance below −500 bp from the start codon. The TSSs were located in the upstream region ranging from −41 to −4626 bp (base pairs).
2.2. Common Candidate Motifs and Associated Transcription Factors in the Promoter Regions of IGF and VIP Genes
The current analysis discovered five binding motifs, of which two (I and III) were equally shared (73.3%) by all IGF and VIP genes in chicken, as shown in Table 2. Furthermore, Motif II was identified as the best common promoter motif for 80% of chicken IGF and VIP genes associated with chicken reproductive traits that serve as binding sites for TFs involved in the expression regulation of these genes.
In the present study, the majority (64.29%) of the candidate motifs were located and distributed between –800 and –200 bp, and the motifs were equally distributed in the positive and negative strands (Figure 1). The MEME search tool produced a sequence logo for Motif II, which is recognized as the most common motif to represent the information content. The TOMTOM web tool explored matched motifs for Motif II. As a result, Motif II matched with 24 known motifs found in databases (Table 3).
The sequence logo for Motif II generated by MEME is presented in Figure 2. In this visualization, the size of each letter reflects how strongly that nucleotide is conserved at each position in the motif.
The current analysis indicates that the best common motif, Motif II, resembles three major transcription factor families: the zinc finger family, homeobox transcription factors, and high-mobility group domain factors, accounting for about 79.17% (19/24) (Table 3). This study also revealed that about 62.5% (15/24) of identified transcription factors were associated with the Wnt signalling pathway. The findings from the UniProt database show that candidate TFs such as CPEB1, MAFB, SOX15, TCF7L2, TCF3, and TCF7 are pivotal in activating or repressing transcription, playing essential roles in embryonic and nervous system development. Likewise, the transcription factor LEF1 is implicated in hair cell differentiation and follicle morphogenesis. The study also highlights that the Bcl11B transcription factors suppress transcription and are crucial in normal T-cell development, as identified from the UniProt database.
2.3. Investigation for CpG Islands and Tandem Repeats
CpG islands were investigated in the promoter and gene body regions of 12 IGFs and 3 VIP genes associated with reproductive traits in chicken using different computational tools. In this study, we analysed each gene promoter and discovered CpG islands in 11 out of the 15 promoters examined, with the exception of IGF1, IGF2, IGFBP1, and IGFBP7. Similarly, 13 CpG islands were identified within the gene body regions, as shown in Table 4. This indicated that IGF and VIP genes associated with chicken reproductive traits have very rich CGIs in their promoter and gene body regions. The highest length of CpG island was observed in the VIP gene promoter (2000 bp) showing similarity in length to the targeted query (2000 bp). Multiple CpG islands were observed in the promoter regions of the IGF2BP3, IGFBP3 (Insulin-Like Growth Factor–Binding Protein 3), and IGFBP4 (Insulin-Like Growth Factor–Binding Protein 4) genes, as well as in the body regions of the IGF1R, IGF2BP1, and IGF2BP2 genes (Table 4).
In order to investigate tandem repeats, 2 kb upstream promoter sequences of chicken IGF and VIP genes were used. Tandem repeats having one to four nucleotides were identified in the promoter regions of 10 IGFs and VIP genes and the gene body region of two genes (Table 5).
Analysis for CpG islands on both promoter region and gene body region using restriction enzyme MspI was also conducted (Tables 6 and 7). The in silico analysis results showed more CpG islands in gene body region compared to promoter region. In the present analysis, about 13 CGIs and 12 CGIs were found in gene body region and promoter region, respectively. The results indicated that chicken IGF and VIP genes were highly rich in CpG islands which is in agreement with the first GC-Profiles 2.0 searching tool.
2.4. Analysis of Gene–Gene Interactions
The GeneMANIA server has predicted the functional gene–gene interaction network for IGF and VIP genes, as depicted in Figure 3. The results indicated that IGF has physical and genetic interactions with Insulin-Like Growth Factor–Binding Protein 2 (IGFBP2) and shows coexpression primarily with IGFBP4, Protein Kinase D1 (PRKD1), insulin-like growth factor–binding protein acid labile subunit (IGFALS), Relaxin 2 (RLN2), IGFBP3, and IGF2, as shown in Figure 3.
Similarly, the VIP gene interacts physically and genetically with the transcription factor PU.1 (SPI1), and is coexpressed mostly with the calcitonin receptor (CALCR), Adenylate Cyclase–Activating Polypeptide 1 (ADCYAP1), ADAM-Like Decysin 1 (ADAMDEC1), Ephrin Type A receptor 7 (EPHA7), GH-releasing hormone (GHRH), Solute Carrier Family 6 Member 3 (SLC6A3), gastric inhibitory polypeptide (GIP), and glucagon (GCG) (Figure 4).
3. Discussion
In mammals, IGF and VIP genes are evolutionarily conserved peptides that function as endocrine hormones and autocrine/paracrine factors. They are extensively present in the central and peripheral nervous systems, as well as in the digestive, respiratory, reproductive, and cardiovascular systems, serving as neurotransmitters and neuroendocrine release factors [30]. Given their widespread expression in mammalian tissues, a deep understanding of their regulatory mechanisms is essential. In this study, we employed various online bioinformatics tools to perform in silico analysis of promoter and gene body sequences of 15 IGF and VIP genes related to reproductive traits in chickens, aiming to identify characteristics that may serve as exemplary for other avian species. Furthermore, the results of our analysis provided vital information on the upstream regulatory elements and their corresponding TFs involved in regulatory mechanisms that influence the IGF and VIP genes during reproduction and embryo development in chickens.
TSS identification is a crucial step in gene expression and occurs at various promoter positions with different efficiencies [31]. In the present in silico analysis, the majority of the promoter sequences (73.33%) had multiple TSSs. This result is in agreement with Samuel and Dinka [32] who reported the presence of multiple TSSs in olfactory receptor genes in cattle, whereas it is contrary to Abera and Dinka [33] who reported the presence of single transcription sites in the MAGE gene encoding for embryonic development in cattle. This indicates the presence of alternative promoters of the genes, and it contributes to transcript isoform diversity in mammals.
The current study also revealed that approximately half of the examined TSSs (46.7%) are located at a distance below −500 bp from the start codon. This result is in agreement with Mu et al. [34] who reported a TSS location of −515 bp for the ovine DKK1 gene, Abera and Dinka [33] who reported 42.1% TSS location below –500 bp for the MAGE gene, and Pokhriyal et al. [35] who reported TSS locations at 235, 156, and 92 bp for BICP0, BICP4, and BICP22 in bovine genes, respectively.
The present analysis uncovers multiple binding motifs for IGF and VIP genes, which is significant to find all possible binding motifs including the cofactor binding motifs [36]. The majority of the discovered candidate motifs are located and distributed between –800 and –200 bp with reference to the TSS region. This is in line with Halees et al. [37] who stated that the majority of motifs are positioned proximately upstream of a TSS. The candidate motifs were equally distributed in the positive and negative strands.
The present analysis revealed that the best common motif, Motif II, bears resemblance to three major transcription factor families: zinc finger family, homeobox transcription factors, and high-mobility group domain factors, accounting for about 79.17% of the candidate transcription factors. This study found that 62.5% of the identified transcription factors interacted with the Wnt signalling pathway to coregulate transcription. This is in agreement with reports of Wang et al. [38] who described the high-mobility group transcription factors as mediators of the Wnt signalling pathway and Yu et al. [39] who reported homeobox transcription factors as coregulators. The Wnt signalling pathways have critical roles in axis patterning, cell fate specification, cell proliferation, and cell migration [40].
The current findings revealed that CPEB1, MAFB, SOX15, TCF7L2, TCF3, and TCF7 transcription factors had dual regulatory functions and have a crucial role in embryonic and nervous system development. Hrckulak et al. [41] reported that T-cell factor/lymphoid enhancer–binding factor proteins are the main downstream effectors of the Wnt signalling pathway. Furthermore, transcription factor CREB-1 is highly involved in the control of ovarian cell proliferation and steroidogenesis [42]. Evidence suggested that transcription factors like TCF and LEF play key regulatory functions in mammalian cells [43–45]. The current study also shows that Bcl11B transcription factors repress transcription and play key roles in normal T-cell development, as we revealed from the UniProt database. In agreement with this result, Lennon et al. [46] have shown the fundamental roles of Bcl11B in fetal development.
Studies identified that CGIs are highly involved in dual-mode gene regulatory processes in higher eukaryotic species [47]. In this study, IGF and VIP genes associated with chicken reproductive traits have very rich CGIs in their promoter and gene body regions. In agreement with this finding, Vinoth et al. [48] and Ummah et al. [49] found rich CpG islands in the HSP90 gene promoter region in chickens. GC content in genes is significantly associated with gene expression patterns and could be one of the important regulatory factors in the chicken genome [50].
The in silico analysis results also revealed very rich CpG islands in IGF and VIP genes, which is in agreement with the first method, CpG-Profile algorithm. A similar finding was reported by Abera and Dinka [33] who indicated slightly rich CpG islands for MAGE genes encoding for embryonic development in cattle. However, Wagari et al. [51] reported poor CGIs using MspI enzyme digestion for sheep KAP genes. CpG islands are frequently linked with the promoters of most housekeeping genes and numerous tissue-specific genes, thereby playing significant regulatory roles and serving as markers for genes [52].
It has been reported that genes containing tandem repeats in their promoters exhibit higher rates of transcriptional divergence [53], and it indicates a greater tendency for mutation [54]. The current study revealed that many tandem repeats were identified in 10 gene promoters out of 15, suggesting that they have an increased potential for accumulation of mutations during replication. The presence of tandem repeats in these promoters can be used for mutational study, and it might also participate in gene expression regulation.
The gene–gene interaction network revealed that approximately 77.64% of genes show physical interactions and 8.01% of genes show coexpression in both IGF and VIP networks. The 20 most frequently altered genes were involved in the IGF network, including pregnancy-associated plasma protein A2 (PAPPA2), RLN2, and Kallikrein-Related Peptidase 3 (KLK3), which has a key role in regulating biological functions in reproduction. Guo et al. [1] reported that the IGF system is highly conserved and is involved in the regulation of egg production, growth, and carcass traits in chickens. Similarly, VIP genes interact with important traits such as CALCR, ADCYAP1, ADAMDEC1, EPHA7, GHRH, SLC6A3, and GCG. Based on the analysis of interaction patterns, we may understand that the IGF and VIP genes might affect the function of the related genes, which highlights the importance of these linked and coexpressed genes in complex traits like reproduction.
In conclusion, the current findings showed that regulatory elements found in the promoter region of IGF and VIP genes may play important roles in the chicken reproductive traits and coregulate transcription through interacting with the Wnt pathways. Understanding these gene regulatory elements in the promoter regions could be beneficial in the poultry industry to improve reproductive performances. However, further experimental studies are needed to validate the potential role of identified transcription factors and their motifs in regulating IGF and VIP genes in chickens. Thus, the findings should be essentially validated using experimental studies such as gene expression assays.
4. Materials and Methods
4.1. Retrieval of the IGF and VIP Genes
The IGF and VIP gene sequences were retrieved from the NCBI database via a web server (https://www.ncbi.nlm.nih.gov (accessed on January 22, 2024). In this analysis, a total of 15 functional, protein-coding genes that have a coding sequence (CDS) and start codon at the beginning of the sequence were considered (Table 8). Full-length sequences of these genes, as well as their CDSs, were retrieved. All gene sequences were cross-checked with the UniProtKB database (https://www.uniprot.org).
4.2. Determination of Transcription Start Sites and Promoter Regions
In order to determine TSSs, we extracted 1 kb sequences upstream of the start codon from each gene [55]. The Neural Network Promoter Prediction (NNPP Version 2.2) tool was used to determine the TSSs by setting the minimum standard predictive score (between 0 and 1) with a cut-off value of 0.8 [56]. This tool has the ability to pinpoint precisely the position of a TSS for a given gene. The TSS with the highest prediction score was considered statistically significant and accurate for sequences containing multiple TSSs. In addition, we excised 1 kb sequences upstream of each TSS to determine the promoter regions, as previously described by Michaloski et al. [57] for mouse odorant and vomeronasal receptor (V1R) gene.
4.3. Identification of Common Candidate Motifs and Transcription Factors
To discover common candidate motifs, the predicted promoter sequences of IGF and VIP genes were analysed using the MEME Version 5.5.4 searches [58]. Then, MEME searches for statistically significant candidate motifs in the input sequence set. Buttons on the MEME HTML output allow one or all of the candidate motifs to be forwarded for further analysis by TOMTOM [59] for TF. The output of TOMTOM includes LOGOS representing the alignment of the candidate motif and TF with the p value and q-value (a measure of false discovery rate) of the match and links back to the parent transcription database for more detailed information about it [28].
4.4. Search for CpG Islands and Tandem Repeats
A 2 kb sequence upstream of the start codon was analyzed for each gene to detect tandem repeats and CpG islands. GC-Profiles 2.0 software was used to search for CpG islands with the search criteria: GC content ≥ 50%, Obs CpG/Exp CpG ≥ 0.6, and length ≥ 200 bp [60]. The MISA-web search tool [61] was used to search tandem repeats in the promoter and coding regions of each gene. Furthermore, MspI restriction enzyme cutting sites were searched using CLC Genomics Workbench Version 23.0.5, with fragment sizes ranging from 40 to 220 bps. This search is important for the detection of CGIs, and it recognizes CCGG sites [62].
4.5. Analysis of Gene–Gene Interactions
The gene–gene interaction network of the IGF and VIP gene was predicted by GeneMANIA (http://www.genemania.org). It predicts gene function and generates information such as gene interaction, coexpression, colocalization, shared protein domains, and pathways involved [29].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Guo Y. Zhang K. Geng W. Evolutionary Analysis and Functional Characterization Reveal the Role of the Insulin-Like Growth Factor System in a Diversified Selection of Chickens (Gallus gallus) Poultry Science 2023102310241110.1016/j.psj.2022.10241136587453 PMC 9816805 · doi ↗ · pubmed ↗
- 2Tran B. N. Nguyen D. H. Vu Q. D. Polymorphism Candidate Genes of Indigenous Lien Minh Chickens Vietnam Journal of Agricultural Sciences 20181217418110.31817/vjas.2018.1.2.07 · doi ↗
- 3Li X. H. Bai J. J. Ye X. Hu Y. C. Li S. J. Yu L. Y. Polymorphisms in the 5′ Flanking Region of the Insulin-Like Growth Factor I Gene Are Associated With Growth Traits in Largemouth Bass Micropterus salmoides Fisheries Science 200975235135810.1007/s 12562-008-0051-32-s 2.0-67849090572 · doi ↗
- 4Igarashi H. Fujimori N. Ito T. Vasoactive Intestinal Peptide (VIP) and VIP Receptors-Elucidation of Structure and Function for Therapeutic Applications International Journal of Clinical Medicine 20112450050810.4236/ijcm.2011.24084 · doi ↗
- 5Kansaku N. Tobari Y. Hiyama G. Effects of Vasoactive Intestinal Polypeptide and Forskolin on m RNA Expression of Prolactin and Prolactin Regulatory Element-Binding Protein in the Anterior Pituitary Gland of Chicken Embryo and Laying Hens The Journal of Poultry Science 201653431331710.2141/jpsa.01600392-s 2.0-8499244535532908399 PMC 7477168 · doi ↗ · pubmed ↗
- 6Juengel J. L. Smith P. R. Quirke L. D. French M. C. Edwards S. J. The Local Regulation of Folliculogenesis by Members of the Transforming Growth Factor Superfamily and Its Relevance for Advanced Breeding Programmes Animal Reproduction 201815318019010.21451/1984-3143-AR 2018-00552-s 2.0-8506200306234178140 PMC 8202455 · doi ↗ · pubmed ↗
- 7Payet L. D. Wang X. H. Baxter R. C. Firth S. M. Amino- and Carboxyl-Terminal Fragments of Insulin-Like Growth Factor (IGF) Binding Protein-3 Cooperate to Bind IG Fs With High Affinity and Inhibit IGF Receptor Interactions Endocrinology 200314472797280610.1210/en.2003-01022-s 2.0-003867535812810533 · doi ↗ · pubmed ↗
- 8Al-Samerria S. Radovick S. The Role of Insulin-Like Growth Factor-1 (IGF-1) in the Control of Neuroendocrine Regulation of Growth Cells 20211010 p. 266410.3390/cells 1010266434685644 PMC 8534318 · doi ↗ · pubmed ↗
