GPXplore: an intelligent computational framework for precise gene promoter extraction
Shruti Godara, Samarth Godara, Shbana Begam, Anil Kumar Singh

TL;DR
GPXplore is a new tool that quickly and accurately extracts gene promoter regions, helping researchers study gene regulation more efficiently.
Contribution
GPXplore introduces a scalable and user-friendly computational framework for precise gene promoter extraction.
Findings
GPXplore uses vectorized data processing to significantly reduce processing time for large-scale promoter extraction.
Validation on eight diverse genomic datasets confirmed the tool's high accuracy and reliability.
The tool supports customizable parameters and offers both command-line and graphical interfaces for accessibility.
Abstract
Efficient and precise extraction of gene promoter regions is vital for understanding gene regulation, with broad implications in gene editing, functional genomics, and disease research. However, existing tools often fall short in scalability, usability and performance. To address these limitations, we present “GPXplore,” a computational tool designed for the precise and user-friendly extraction of gene promoters from genomic data. It leverages vectorized data processing techniques to significantly reduce data processing time, enhancing speed and efficiency in large-scale promoter extraction tasks. GPXplore retrieves upstream and downstream sequences relative to gene loci and supports customizable parameters, enabling users to define region lengths based on specific research needs. The tool is implemented in Python, features both a command-line and graphical user interface, and is…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · CRISPR and Genetic Engineering · Genomics and Rare Diseases
