# Tissue gene expression analysis approach in a context of high technical and biological heterogeneity

**Authors:** Evan Bagley, Souvik Seal, Lauren R. Fanning, Jean-Sebastian Anoma, Tami Crawford, Benjamin G. Vincent, Elizabeth L. Barry, Martha J. Shrubsole, Erin Kirk, John A. Baron, Dale C. Snover, David N. Lewin, Todd A. Mackenzie, Xiaohua Gao, Melissa A. Troester, Alexander V. Alekseyenko, Kristin Wallace

PMC · DOI: 10.21203/rs.3.rs-6116991/v1 · Research Square · 2025-04-07

## TL;DR

This paper introduces a new method for analyzing gene expression in colorectal lesions with high heterogeneity, improving sample quality and detection of immune-related genes.

## Contribution

A NanoString nCounter-based approach optimized for FFPE samples with high biological heterogeneity and lower quality RNA.

## Key findings

- Incorporating a colon-specific QC set reduced sample exclusions by 63% compared to standard methods.
- RUVg and NSDiff detected significant DE genes in TVs vs. TAs, showing under-expression of immune-related genes.
- NSDiff was more sensitive but had higher Type I error rates and computational demands compared to RUVg.

## Abstract

Immune expression profiling in colorectal lesions may provide insights into the origins of antitumor immunity and senescence. Optimal approaches for analyzing samples with lower quality RNA from molecularly diverse lesions are lacking. Therefore, we developed a NanoString nCounter-based approach for quality control (QC), normalization, and differential expression (DE) analysis, optimized for FFPE samples in contexts of high biologic heterogeneity.

The approach incorporates a colon specific positive control gene set (11 genes) to minimize sample exclusions. We evaluated three normalization methods Removal of Unwanted Variation (RUVg), NanoStringDiff (NSDiff), and nSolver using a 277 gene immune panel to compare 100 samples, including sessile serrated lesions (SSLs) (n = 25), tubulovillous and villous adenomas (TVs) (n = 27), and tubular adenomas (TAs) (n = 48) We assessed Type I error rates, computational efficiency, and gene significance via FDR-corrected q-values.

Incorporating the colon-specific QC set reduced sample exclusions by 63% compared to standard methods (13 vs 35 sample exclusions). All three normalization approaches identified DE genes between SSLs and TAs (e.g., TFF1, MUC5AC, MUC6). For TVs vs. TAs, only RUVg and NSDiff detected significant DE genes, revealing wide-spread under-expression of innate and adaptive genes. While NSDiff labeled twice as many significant genes as RUVg, suggesting greater sensitivity, it also exhibited higher Type I error rates and increased computational demand.

RUVg achieved a balance between computational efficiency and low Type I error, while NSDiff was more sensitive but computationally demanding and exhibited higher Type I error. Our approach provides a robust framework for profiling immune genes in heterogeneous lesions.

## Linked entities

- **Genes:** TFF1 (trefoil factor 1) [NCBI Gene 7031], MUC5AC (mucin 5AC, oligomeric mucus/gel-forming) [NCBI Gene 4586], MUC6 (mucin 6, oligomeric mucus/gel-forming (gene/pseudogene)) [NCBI Gene 4588]

## Full-text entities

- **Genes:** MUC6 (mucin 6, oligomeric mucus/gel-forming (gene/pseudogene)) [NCBI Gene 4588] {aka MUC-6}, TFF1 (trefoil factor 1) [NCBI Gene 7031] {aka BCEI, D21S21, HP1.A, HPS2, pNR-2, pS2}, MUC5AC (mucin 5AC, oligomeric mucus/gel-forming) [NCBI Gene 4586] {aka MUC5, TBM, leB, mucin}
- **Diseases:** TVs (MESH:D018253), TAs (MESH:D000236), SSLs (MESH:D009059), colorectal lesions (MESH:D015179)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12036464/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12036464/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12036464/full.md

---
Source: https://tomesphere.com/paper/PMC12036464