featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features
Yang Liao, Gordon K Smyth, Wei Shi

TL;DR
featureCounts is a fast, memory-efficient program for counting sequencing reads aligned to genomic features, significantly improving speed over existing methods and supporting various sequencing applications.
Contribution
It introduces a novel, highly efficient read summarization tool that outperforms existing methods in speed and memory usage for genomic feature counting.
Findings
Achieves an order of magnitude faster performance than previous tools
Uses less computer memory during read summarization
Supports both single and paired-end sequencing reads
Abstract
Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
