ensemblQueryR: fast, flexible and high-throughput querying of Ensembl LD API endpoints in R
Aine Fairbrother-Browne, Sonia Garc\'ia-Ruiz, Regina H Reynolds, Mina, Ryten, Alan Hodgkinson

TL;DR
ensemblQueryR is an R package that enables fast, flexible, and high-throughput querying of Ensembl LD API endpoints, optimizing performance and resource usage for genomic research workflows.
Contribution
It introduces a user-friendly R interface with optimized functions and parallelization for efficient large-scale LD data retrieval from Ensembl.
Findings
10-fold speed increase over existing tools
Uses one-third of the RAM compared to similar software
Compatible across operating systems via Docker and Singularity
Abstract
We present ensemblQueryR, a package providing an R interface to the Ensembl REST API that facilitates flexible, fast, user-friendly and R workflow integrable querying of Ensembl REST API linkage disequilibrium (LD) endpoints, optimised for high-throughput querying. ensemblQueryR achieves this through functions that are intuitive and amenable to custom code integration, use of familiar R object types as inputs and outputs, code optimisation and optional parallelisation functionality. For each LD endpoint, ensemblQueryR provides two functions, permitting both single-query and multi-query modes of operation. The multi-query functions are optimised for large query sizes and provide optional parallelisation to leverage available computational resources and minimise processing time. We demonstrate that ensemblQueryR has improved performance in terms of random access memory (RAM) usage and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Data Storage Technologies · Distributed and Parallel Computing Systems
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
