mgikit: demultiplexing toolkit for MGI fastq files
Ziad Al Bkhetan, Sen Wang

TL;DR
mgikit is a customizable and efficient toolkit for processing and analyzing MGI sequencing data, offering improved performance over standard tools.
Contribution
mgikit introduces a highly customizable and optimized solution for demultiplexing and analyzing MGI fastq files.
Findings
mgikit outperforms the standard MGI demultiplexer in performance and memory usage.
The toolkit provides quality reports and supports various dataset types through customization.
mgikit is publicly available with comprehensive documentation for users.
Abstract
MGI sequencing is reported to be an inexpensive solution to obtain genomics information. There is a growing need for software and tools to analyse MGI’s outputs efficiently. mgikit is a tool collection to demultiplex MGI fastq data, reformat it effectively and produce visual quality reports. mgikit overcomes several limitations of the standard MGI demultiplexer. It is highly customizable to suit different kinds of datasets and is designed to achieve high performance and optimal memory utilization. The tool and its documentation are available at: https://sagc-bioinformatics.github.io/mgikit/.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Algorithms and Data Compression · Parallel Computing and Optimization Techniques
