# RNA sequencing of transcriptomes in human brain regions: protein-coding and non-coding RNAs, isoforms and alleles

**Authors:** Amy Webb, Audrey C. Papp, Amanda Curtis, Leslie C. Newman, Maciej Pietrzak, Michal Seweryn, Samuel K. Handelman, Grzegorz A. Rempala, Daqing Wang, Erica Graziosa, Rachel F. Tyndale, Caryn Lerman, John R. Kelsoe, Deborah C. Mash, Wolfgang Sadee

PMC · DOI: 10.1186/s12864-015-2207-8 · 2015-11-23

## TL;DR

This study uses RNA sequencing to analyze gene expression in human brain regions, revealing differences in coding and non-coding RNA, isoforms, and allele expression.

## Contribution

The study provides a comprehensive transcriptome database for brain regions, including non-coding RNA and allele-specific expression.

## Key findings

- Non-coding RNAs show greater diversity between brain regions compared to mRNAs.
- Over 400 genes exhibit allelic RNA ratios deviating from unity, indicating cis-acting regulatory variants.
- Mathematical modeling identified stably expressed RNAs for potential normalization in expression studies.

## Abstract

We used RNA sequencing to analyze transcript profiles of ten autopsy brain regions from ten subjects. RNA sequencing techniques were designed to detect both coding and non-coding RNA, splice isoform composition, and allelic expression. Brain regions were selected from five subjects with a documented history of smoking and five non-smokers. Paired-end RNA sequencing was performed on SOLiD instruments to a depth of >40 million reads, using linearly amplified, ribosomally depleted RNA. Sequencing libraries were prepared with both poly-dT and random hexamer primers to detect all RNA classes, including long non-coding (lncRNA), intronic and intergenic transcripts, and transcripts lacking poly-A tails, providing additional data not previously available. The study was designed to generate a database of the complete transcriptomes in brain region for gene network analyses and discovery of regulatory variants.

Of 20,318 protein coding and 18,080 lncRNA genes annotated from GENCODE and lncipedia, 12 thousand protein coding and 2 thousand lncRNA transcripts were detectable at a conservative threshold. Of the aligned reads, 52 % were exonic, 34 % intronic and 14 % intergenic. A majority of protein coding genes (65 %) was expressed in all regions, whereas ncRNAs displayed a more restricted distribution. Profiles of RNA isoforms varied across brain regions and subjects at multiple gene loci, with neurexin 3 (NRXN3) a prominent example. Allelic RNA ratios deviating from unity were identified in > 400 genes, detectable in both protein-coding and non-coding genes, indicating the presence of cis-acting regulatory variants. Mathematical modeling was used to identify RNAs stably expressed in all brain regions (serving as potential markers for normalizing expression levels), linked to basic cellular functions. An initial analysis of differential expression analysis between smokers and nonsmokers implicated a number of genes, several previously associated with nicotine exposure.

RNA sequencing identifies distinct and consistent differences in gene expression between brain regions, with non-coding RNA displaying greater diversity between brain regions than mRNAs. Numerous RNAs exhibit robust allele selective expression, proving a means for discovery of cis-acting regulatory factors with potential clinical relevance.

The online version of this article (doi:10.1186/s12864-015-2207-8) contains supplementary material, which is available to authorized users.

## Linked entities

- **Genes:** NRXN3 (neurexin 3) [NCBI Gene 9369]

## Full-text entities

- **Genes:** NLGN2 (neuroligin 2) [NCBI Gene 57555], PDE1A (phosphodiesterase 1A) [NCBI Gene 5136] {aka CAM-PDE 1A, CAM-PDE-1A, HCAM-1, HCAM1, HSPDE1A}, BRWD1 (bromodomain and WD repeat domain containing 1) [NCBI Gene 54014] {aka C21orf107, CILD51, DCAF19, N143, WDR9, WRD9}, RAB21 (RAB21, member RAS oncogene family) [NCBI Gene 23011], GJA1 (gap junction protein alpha 1) [NCBI Gene 2697] {aka AVSD3, CMDR, CX43, EKVP, EKVP3, GJAL}, FDFT1 (farnesyl-diphosphate farnesyltransferase 1) [NCBI Gene 2222] {aka DGPT, ERG9, SQS, SQSD, SS}, ASTN1 (astrotactin 1) [NCBI Gene 460] {aka ASTN}, CPSF7 (cleavage and polyadenylation specific factor 7) [NCBI Gene 79869] {aka CFIm59}, ZNF391 (zinc finger protein 391) [NCBI Gene 346157] {aka dJ153G14.3}, CIART (circadian associated repressor of transcription) [NCBI Gene 148523] {aka C1orf51, CHRONO, GM129}, ERV3-1 (endogenous retrovirus group 3 member 1, envelope) [NCBI Gene 2086] {aka ERV-R, ERV3, ERVR, HERV-R, HERVR, envR}, PILRB (paired immunoglobin like type 2 receptor beta) [NCBI Gene 29990] {aka FDFACT1, FDFACT2}, SRSF11 (serine and arginine rich splicing factor 11) [NCBI Gene 9295] {aka NET2, SFRS11, dJ677H15.2, p54}, MAP1LC3B (microtubule associated protein 1 light chain 3 beta) [NCBI Gene 81631] {aka ATG8F, LC3B, MAP1A/1BLC3, MAP1LC3B-a}, PRPF8 (pre-mRNA processing factor 8) [NCBI Gene 10594] {aka HPRP8, PRP8, PRPC8, RP13, SNRNP220}, AKAP12 (A-kinase anchoring protein 12) [NCBI Gene 9590] {aka AKAP250, SSeCKS}, STON2 (stonin 2) [NCBI Gene 85439] {aka STN2, STNB, STNB2}, ZEB1 (zinc finger E-box binding homeobox 1) [NCBI Gene 6935] {aka AREB6, BZP, DELTAEF1, FECD6, NIL2A, PPCD3}, Apold1 (apolipoprotein L domain containing 1) [NCBI Gene 444983] {aka Verge}, MECP2 (methyl-CpG binding protein 2) [NCBI Gene 4204] {aka AUTSX3, MRX16, MRX79, MRXS13, MRXSL, PPMX}, FGF1 (fibroblast growth factor 1) [NCBI Gene 2246] {aka AFGF, ECGF, ECGF-beta, ECGFA, ECGFB, FGF-1}, SNHG14 (small nucleolar RNA host gene 14) [NCBI Gene 104472715] {aka 115HG, IC-SNURF-SNRPN, IPW, LNCAT, NCRNA00002, NCRNA00214}, GSTA4 (glutathione S-transferase alpha 4) [NCBI Gene 2941] {aka GSTA4-4, GTA4}, POLR3F (RNA polymerase III subunit F) [NCBI Gene 10621] {aka C34, IMD101, RPC39, RPC6}, DNAAF11 (dynein axonemal assembly factor 11) [NCBI Gene 23639] {aka CILD19, LRRC6, LRTP, TSLRP, tilB}, APOL2 (apolipoprotein L2) [NCBI Gene 23780] {aka APOL-II}, CDK5R1 (cyclin dependent kinase 5 regulatory subunit 1) [NCBI Gene 8851] {aka CDK5P35, CDK5R, NCK5A, p23, p25, p35}, APBA1 (amyloid beta precursor protein binding family A member 1) [NCBI Gene 320] {aka D9S411E, LIN10, MINT1, X11, X11A, X11ALPHA}, PDXP (pyridoxal phosphatase) [NCBI Gene 57026] {aka CIN, PLP, dJ37E16.5}, DDX39B (DExD-box helicase 39B) [NCBI Gene 7919] {aka BAT1, D6S81E, UAP56}, RALGAPB (Ral GTPase activating protein non-catalytic subunit beta) [NCBI Gene 57148] {aka KIAA1219, RalGAPbeta}, VCAM1 (vascular cell adhesion molecule 1) [NCBI Gene 7412] {aka CD106, INCAM-100}, SPTBN1 (spectrin beta, non-erythrocytic 1) [NCBI Gene 6711] {aka DDISBA, ELF, HEL102, SPTB2, betaSpII}, SPEN (spen family transcriptional repressor) [NCBI Gene 23013] {aka HIAA0929, MINT, RATARS, RBM15C, SHARP}, GPR37 (G protein-coupled receptor 37) [NCBI Gene 2861] {aka EDNRBL, PAELR, hET(B)R-LP}, EXOC5 (exocyst complex component 5) [NCBI Gene 10640] {aka HSEC10, PRO1912, SEC10, SEC10L1, SEC10P}, ATP1A2 (ATPase Na+/K+ transporting subunit alpha 2) [NCBI Gene 477] {aka DEE98, FARIMPD, FHM2, MHP2}, ATP6V1G2 (ATPase H+ transporting V1 subunit G2) [NCBI Gene 534] {aka ATP6G, ATP6G2, NG38, VMA10}, NTRK2 (neurotrophic receptor tyrosine kinase 2) [NCBI Gene 4915] {aka DEE58, EIEE58, GP145-TrkB, OBHD, TRKB, trk-B}, NDRG3 (NDRG family member 3) [NCBI Gene 57446], SEPTIN3 (septin 3) [NCBI Gene 55964] {aka SEP3, SEPT3, bK250D10.3}, NUDT5 (nudix hydrolase 5) [NCBI Gene 11164] {aka YSA1, YSA1H, YSAH1, hNUDT5}, CHN1 (chimerin 1) [NCBI Gene 1123] {aka ARHGAP2, CHN, DURS2, NC, RHOGAP2}, SPP1 (secreted phosphoprotein 1) [NCBI Gene 6696] {aka BNSP, BSPI, ETA-1, OPN}, SEPTIN8 (septin 8) [NCBI Gene 23176] {aka SEP2, SEPT8, Septin-8}, BAZ1B (bromodomain adjacent to zinc finger domain 1B) [NCBI Gene 9031] {aka WBSCR10, WBSCR9, WSTF}, DCP2 (decapping mRNA 2) [NCBI Gene 167227] {aka NUDT20}, CCT5 (chaperonin containing TCP1 subunit 5) [NCBI Gene 22948] {aka CCT-epsilon, CCTE, HEL-S-69, HSNSP, PNAS-102, TCP-1-epsilon}, TUNAR (transmembrane neural differentiation associated intracellular calcium regulator) [NCBI Gene 100507043] {aka BNLN, HI-LNC78, LINC00617, TUNA, pTUNAR}, INPP5F (inositol polyphosphate-5-phosphatase F) [NCBI Gene 22876] {aka MSTP007, MSTPO47, SAC2, hSAC2}, RBM25 (RNA binding motif protein 25) [NCBI Gene 58517] {aka NET52, RED120, RNPC7, S164, Snu71, fSAP94}, SLC8A1 (solute carrier family 8 member A1) [NCBI Gene 6546] {aka NCX1}, POLR1D (RNA polymerase I and III subunit D) [NCBI Gene 51082] {aka AC19, RPA16, RPA9, RPAC2, RPC16, RPO1-3}, SYNJ1 (synaptojanin 1) [NCBI Gene 8867] {aka DEE53, EIEE53, INPP5G, PARK20}, DRD2 (dopamine receptor D2) [NCBI Gene 1813] {aka D2DR, D2R}, DLG2 (discs large MAGUK scaffold protein 2) [NCBI Gene 1740] {aka PPP1R58, PSD-93, PSD93, chapsyn-110}, SRPK2 (SRSF protein kinase 2) [NCBI Gene 6733] {aka SFRSK2}, SNU13 (small nuclear ribonucleoprotein 13) [NCBI Gene 4809] {aka 15.5K, FA-1, FA1, NHP2L1, NHPX, OTK27}, TXN2 (thioredoxin 2) [NCBI Gene 25828] {aka COXPD29, MT-TRX, MTRX, TRX2, TXN}, ZNF91 (zinc finger protein 91) [NCBI Gene 7644] {aka HPF7, HTF10}
- **Diseases:** DM (MESH:D009223), death (MESH:D003643), psychiatric disorders (MESH:D001523), cardiac sudden deaths (MESH:D016757), stroke (MESH:D020521), AEI (MESH:D001039), depression (MESH:D003866), ischemia (MESH:D007511), addiction (MESH:D019966), cancer (MESH:D009369), Alcoholism (MESH:D000437), FPKM (MESH:D012892), Central nervous system (MESH:D002493), subcortical cysts (MESH:D003560), leukoencephalopathy (MESH:D056784), tobacco use disorder (MESH:D014029), Schizophrenia (MESH:D012559), CNS disorder (MESH:D002494), smoking (MESH:D015208), Huntington's Disease (MESH:D006816)
- **Chemicals:** K+ (MESH:D011188), chloroform (MESH:D002725), SPIA (MESH:C047235), nicotine (MESH:D009538), poly-dT (MESH:D011071), Trizol (MESH:C411644), -dT (MESH:D013936), BA10 (-), Na+ (MESH:D012964), poly-A (MESH:D011061)
- **Species:** Rattus norvegicus (brown rat, species) [taxon 10116], Homo sapiens (human, species) [taxon 9606], Nicotiana tabacum (American tobacco, species) [taxon 4097]
- **Mutations:** rs509276, rs901732, rs1386687
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232), BA22 — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_U803)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC4657279/full.md

---
Source: https://tomesphere.com/paper/PMC4657279