Large scale comparison of gene expression levels by microarrays and RNAseq using TCGA data.

Guo Y, Sheng Q, Li J, Ye F, Samuels DC, Shyr Y
PLoS One. 2013 8 (8): e71462

PMID: 23977046 · PMCID: PMC3748065 · DOI:10.1371/journal.pone.0071462

RNAseq and microarray methods are frequently used to measure gene expression level. While similar in purpose, there are fundamental differences between the two technologies. Here, we present the largest comparative study between microarray and RNAseq methods to date using The Cancer Genome Atlas (TCGA) data. We found high correlations between expression data obtained from the Affymetrix one-channel microarray and RNAseq (Spearman correlations coefficients of ∼0.8). We also observed that the low abundance genes had poorer correlations between microarray and RNAseq data than high abundance genes. As expected, due to measurement and normalization differences, Agilent two-channel microarray and RNAseq data were poorly correlated (Spearman correlations coefficients of only ∼0.2). By examining the differentially expressed genes between tumor and normal samples we observed reasonable concordance in directionality between Agilent two-channel microarray and RNAseq data, although a small group of genes were found to have expression changes reported in opposite directions using these two technologies. Overall, RNAseq produces comparable results to microarray technologies in term of expression profiling. The RNAseq normalization methods RPKM and RSEM produce similar results on the gene level and reasonably concordant results on the exon level. Longer exons tended to have better concordance between the two normalization methods than shorter exons.

MeSH Terms (11)

Exons Gene Expression Profiling Gene Expression Regulation, Neoplastic Genes, Neoplasm Genome, Human Humans Neoplasms Oligonucleotide Array Sequence Analysis Reference Standards Sequence Analysis, RNA Statistics, Nonparametric

Connections (2)

This publication is referenced by other Labnodes entities:

Links