BACKGROUND - Triple negative breast cancer (TNBC) is a heterogeneous disease that lacks unifying molecular alterations that can guide therapy decisions. We previously identified distinct molecular subtypes of TNBC (TNBCtype) using gene expression data generated on a microarray platform using frozen tumor specimens. Tumors and cell lines representing the identified subtypes have distinct enrichment in biologically relevant transcripts with differing sensitivity to standard chemotherapies and targeted agents. Since our initial discoveries, RNA-sequencing (RNA-seq) has evolved as a sensitive and quantitative tool to measure transcript abundance.
METHODS - To demonstrate that TNBC subtypes were similar between platforms, we compared gene expression from matched specimens profiled by both microarray and RNA-seq from The Cancer Genome Atlas (TCGA). In the clinical care of patients with TNBC, tumor specimens collected for diagnostic purposes are processed by formalin fixation and paraffin-embedding (FFPE). Thus, for TNBCtype to eventually have broad and practical clinical utility we performed RNA-seq gene expression and molecular classification comparison between fresh-frozen (FF) and FFPE tumor specimens.
RESULTS - Analysis of TCGA showed consistent subtype calls between 91% of evaluable samples demonstrating conservation of TNBC subtypes across microarray and RNA-seq platforms. We compared RNA-seq performed on 21-paired FF and FFPE TNBC specimens and evaluated genome alignment, transcript coverage, differential transcript enrichment and concordance of TNBC molecular subtype calls. We demonstrate that subtype accuracy between matched FF and FFPE samples increases with sequencing depth and correlation strength to an individual TNBC subtype.
CONCLUSIONS - TNBC subtypes were reliably identified from FFPE samples, with highest accuracy if the samples were less than 4 years old and reproducible subtyping increased with sequencing depth. To reproducibly subtype tumors using gene expression, it is critical to select genes that do not vary due to platform type, tissue processing or RNA isolation method. The majority of differentially expressed transcripts between matched FF and FFPE samples could be attributed to transcripts selected for by RNA enrichment method. While differentially expressed transcripts did not impact TNBC subtyping, they will provide guidance on determining which transcripts to avoid when implementing a gene set size reduction strategy.
TRIAL REGISTRATION - NCT00930930 07/01/2009.