From features to expression: High-density oligonucleotide array analysis revisited
Felix Naef, Daniel A. Lim, Nila Patil, and Marcelo O. Magnasco

TL;DR
This paper introduces a new statistical algorithm for analyzing high-density oligonucleotide array data, improving gene expression measurement accuracy and increasing the number of detectable genes compared to standard methods.
Contribution
The authors present a novel algorithm that enhances the analysis of GeneChip array data by removing the need for mismatch probes and increasing detection sensitivity.
Findings
Significant improvement over current standard algorithms.
Larger number of genes identified above noise floor.
Enhanced accuracy in gene expression levels.
Abstract
One of the most popular tools for large scale gene expression studies are high-density oligonucleotide (GeneChip(R)) arrays. These currently have 16-20 small probe cells (``features'') for evaluating the transcript abundance of each gene. In addition, each probe is accompanied by a mismatched probe designed as a control for non-specificity. An algorithm is presented to compute comparative expression levels from the intensities of the individual features, based on a statistical study of their distribution. Interestingly, MM probes need not be included in the analysis. We show that our algorithm improves significantly upon the current standard and leads to a substantially larger number of genes brought above the noise floor for further analysis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Single-cell and spatial transcriptomics · Molecular Biology Techniques and Applications
