- Open Access
Isoform level expression profiles provide better cancer signatures than gene level expression profiles
© Zhang et al.; licensee BioMed Central Ltd. 2013
- Received: 15 January 2013
- Accepted: 17 April 2013
- Published: 17 April 2013
The majority of mammalian genes generate multiple transcript variants and protein isoforms through alternative transcription and/or alternative splicing, and the dynamic changes at the transcript/isoform level between non-oncogenic and cancer cells remain largely unexplored. We hypothesized that isoform level expression profiles would be better than gene level expression profiles at discriminating between non-oncogenic and cancer cellsgene level.
We analyzed 160 Affymetrix exon-array datasets, comprising cell lines of non-oncogenic or oncogenic tissue origins. We obtained the transcript-level and gene level expression estimates, and used unsupervised and supervised clustering algorithms to study the profile similarity between the samples at both gene and isoform levels.
Hierarchical clustering, based on isoform level expressions, effectively grouped the non-oncogenic and oncogenic cell lines with a virtually perfect homogeneity-grouping rate (97.5%), regardless of the tissue origin of the cell lines. However, gene levelthis rate was much lower, being 75% at best based on the gene level expressions. Statistical analyses of the difference between cancer and non-oncogenic samples identified the existence of numerous genes with differentially expressed isoforms, which otherwise were not significant at the gene level. We also found that canonical pathways of protein ubiquitination, purine metabolism, and breast-cancer regulation by stathmin1 were significantly enriched among genes thatshow differential expression at isoform level but not at gene level.
In summary, cancer cell lines, regardless of their tissue of origin, can be effectively discriminated from non-cancer cell lines at isoform level, but not at gene level. This study suggests the existence of an isoform signature, rather than a gene signature, which could be used to distinguish cancer cells from normal cells.
- Ingenuity Pathway Analysis
- Transcript Variant
- Human Mammary Epithelial Cell
- Silhouette Width
- Cluster Purity
The past decade has witnessed unprecedented developments in high-throughput technologies, and their application has led to the molecular classification of many cancers . Molecular profiling of gene expression, using microarrays, has shown that heterogeneity in outcome and survival of patients with cancer can be explained, in part, by genomic variation within the primary tumor. These technologies have helped identify many genetic and epigenetic modifications involved in the initiation and progression of various cancers, but their precise molecular mechanisms remain unclear. Furthermore, novel drugs have been developed to target some of the molecular pathways underlying the carcinogenic processes and maintenance of cancer phenotypes [2, 3] yet, these drugs have provided limited survival benefits to only a small subset of patients with cancer, and only a small number of practically useful biomarkers are presently available. Improved molecular classification of cancers is essential to identify highly sensitive and specific biomarkers and therapeutic targets that reflect the molecular mechanisms functionally involved in tumor type-specific survival, drug resistance, tumor relapse, and patient outcome .
One of the reasons for the limited success in the quest for genomic-based, personalized medicine is the assumption of a 'one gene → one protein → one functional pathway' paradigm in most of the studies investigating molecular classification or therapeutic targets for cancer . Recently, by making use of chromatin immunoprecipitation sequencing (ChIP-seq) and mRNA sequencing (mRNA-seq) approaches, we and others have discovered widespread use of alternative promoters and alternative splicing in mammalian genes in various tissues, developmental stages, and cell lines [6–9]. In fact, numerous genes displaying complex gene regulation via use of alternative promoters and alternative splicing, have been known for some time, and recent evidence suggests that almost all multi-exon human genes have more than one mRNA isoform. During alternative splicing, the coding and non-coding regions of a single gene are rearranged to generate several messenger RNA transcripts, yielding distinct protein isoforms with differing biological functions. Notably, there is growing evidence linking aberrant use of alternative mRNA isoforms with cancer formation; several oncogenes and tumor-suppressor genes (for example, LEF1, TP63, TP73, HNF4A, RASSF1, and BCL2L1) are already known to have multiple promoters and alternative splice forms [10–16]. Moreover, it is known that the aberrant use of one isoform over another in some of these genes is directly linked to cancer cell growth . Although the prevalence of alternative splicing in cancer genomes has been discussed in the literature [18–20], and it has been shown that use of splice forms provides better classification of normal and cancerous prostate tissue, it is not clear whether the use of genome-wide isoform level gene-expression profiles can provide a better global discriminative signature for cancer and normal cells.
Microarray expression profiling remains a powerful tool for identifying different subtypes of cancers. However, almost all microarray-based studies reported to date have measured the expression of the gene at gene level in a given locus, although a few exceptions in recent years have used exon arrays to measure differences at the exon and/or at transcript variant level. The recent application of exon arrays  and the advent of massive parallel sequencing is allowing whole cancer genomes and transcriptomes to be sequenced with extraordinary speed and accuracy, providing insight into the bewildering complexity of isoform level expression of gene transcripts . The Encyclopedia of DNA Elements (ENCODE) consortium, a collective effort to facilitate and accelerate the annotation of functional elements in the human genome, is generating genome-wide expression data in various human cell lines through the use of exon microarrays . Among the available data are gene-expression datasets, generated by the ENCODE consortium using an Affymetrix platform (GeneChip Human Exon 1.0 ST Array), across various cell lines that can be classified as either oncogenic (tumor/cancer) or non-oncogenic (normal). The arrays interrogate transcripts across their entire length, which can detect splicing differences between various types of samples [22–24]. Exons within a gene are represented on the microarray by multiple probe sets. The exon expression can thus be obtained by summarizing all the probe sets for this exon on the microarray. Once the exon-level expressions are obtained, the individual transcript expression of the gene and the total expression of the gene itself then can be inferred from the calculated exon expression, based on assumptions that the isoform structures and number of isoforms of the gene are known beforehand.
With genome-wide isoform level and gene level expression profiles in hand, it is natural to ask how the isoform level expression profiles of different oncogenic and non-oncogenic samples will cluster together, and whether isoform level expression profiles can lead to more accurate discriminators between oncogenic and non-oncogenic samples compared with gene level expression profiles. If the answer is yes, it is important to know which genes and pathways contribute to the improvement of discrimination at isoform level compared with gene level.
In the present study we analyzed Affymetrix exon-array data-sets collected from the public domain, primarily the ENCODE project from the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) database, which comprises 160 datasets from various cell lines of either non-oncogenic or oncogenic tissue origin. These data-sets were used to test the hypothesis that isoform level expression analysis provides abetter discriminator between non-oncogenic and oncogenic cell types than gene level expression analysis.
Summary of exon-array datasets
Unprocessed gene-expression datasets, generated using a whole-transcript GeneChip platform (Human Exon 1.0 ST Array; Affymetrix Inc., Santa Clara, CA, USA), were downloaded from the GEO public data depository, deposited mainly by the ENCODE project. The GEO records GSE15805 , GSE17778, GSE19090  and GSE17349  contain, respectively, 79, 36, 83, and 8 samples of various cell lines. After excluding samples that were related to blood, progeria fibroblast, and stem cells, we had a total of 160 exon-array datasets, corresponding to 87 non-oncogenic and 73 oncogenic cell lines of various tissue origins. From the 160 datasets included in the analysis, we used 8 melanoma samples and 4 non-oncogenic melanocyte samples to form the first matched non-oncogenic and oncogenic pair, and used 4 datasets representing non-oncogenic human mammary epithelial cells (HMEC) and 8 datasets from a human breast adenocarcinoma cell line (MCF7) to form the second matched non-oncogenic and oncogenic pair. The complete classification and labeling information of cell lines used in this study are summarized in the supplementary information (see Additional file 1: Table S1).
Estimation of isoform level and gene level expression values from exon-array data
The isoform level (transcript-level) and gene level expression estimates were obtained by the Multi-Mapping Bayesian Gene eXpression (MMBGX) algorithm for Affymetrix whole-transcript arrays , based on the Ensembl database (version 56) , which contains a total of 114,930 different transcript annotations that correspond to 35,612 different gene models. We set the burn-in iteration at 8,192 and real iteration at 16,384 for both gene and isoform levels. All other parameters were set to their default values in the stand-alone algorithm. The algorithm gave a stable estimation of both gene level and isoform level expressions. For example, two independent runs on the same sample provided almost identical expression levels even with different seeds for the algorithm (correlation coefficient > 0.999, data not shown), whereas runs on different samples gave comparable results, but with much lower correlation (correlation coefficient of about 0.97). Expression estimates across all the samples were then normalized using the locally weighted scatterplot smoothing (loess) algorithm [30, 31], also incorporated in the package.
Clustering and pathway analyses
We used the general hierarchical cluster algorithm to cluster the samples, using Euclidean distance as a measurement for dissimilarities . We also applied consensus hierarchical clustering to assess the stability of the clustering results by multiple runs of the clustering algorithm on resampled data [32, 33], and calculated consensus index as reported previously . Briefly, the consensus index is defined for each pair of samples, that is, the consensus index of sample pair (i, j) records the number of times that samples i and j are assigned to the same cluster, divided by the total number of times both samples are selected. To find the differential genes between two conditions, we used the limma method [34, 35]. An isoform or gene was selected if both its fold change was greater than a cut-off value of 2, and the false discovery rate (FDR)-adjusted P value was smaller than a cut-off value of 0.01 for all comparisons between the two conditions. Ingenuity Pathway Analysis (IPA)  was used to associate the identified gene sets with biological functions, canonical pathways, and networks. To identify pathway differences arising from gene sets identified at either isoform or gene level, we used the counting method on the P values of pathways from the IPA analysis; the P values were used as an indicator of association strength between the gene sets and pathways. For the three pairwise oncogenic/non-oncogenic comparisons (all oncogenic cell lines versus non-oncogenic cell lines, melanocyte versus melanoma, HMEC versus MCF7), a pathway was selected and reported if it was significantly associated with the gene sets identified at isoform-level in all three pairs of comparisons, but was not significantly associated with the gene sets identified at gene-level in all three pairs of comparisons, or vice versa. The significance level was set at P < 0.05 for all comparisons. All calculations were performed using Bioconductor (version 2.8 or above; Open Source software for bioinformatics, http://www.bioconductor.org) and R platform (version 2.10; The R Project for Statistical Computing, http://www.r-project.org) .
The study protocol was approved by the institutional review board, and all data collection and analyses adhered to the protocols approved by the institutional review board. Informed consent was obtained from all participants.
Clinical characteristics of study cohort
Women with primary operable breast cancer undergoing breast surgery at the Hospital of the University of Pennsylvania were asked to participate in our tissue-banking protocol. The study cohort included four women diagnosed with breast cancer between 2010 and 2011. Clinical characteristics, including age at diagnosis, ethnicity, histology, tumor size, tumor grade, and number of involved (positive) axilla nodes are provided (see Additional file 2: Table S2A).
After completion of surgery, the breast cancer within the surgical specimen was examined by surgical pathologists. Upon completion of gross examination and inking of the tumor specimen, fresh tumor tissue was taken from the center of the tumor without interfering with margin assessment as determined by the pathologists. A small portion of the tumor tissue and a small portion of normal adjacent breast tissue were collected, then immediately immersed in liquid nitrogen and stored at -80°C. RNA was isolated using this snap-frozen tumor tissue.
RNA isolation and reverse transcriptase-quantitative PCR experiment
Expression of transcripts/isoforms for seven genes in HMEC, MCF7, MDA-MBA-231, and T47D cell lines and expression of two TPM4 isoforms in primary breast-cancer tissues were measured by reverse transcriptase -quantitativePCR (RT-qPCR). Total RNA from cells and tissues were using TRI reagent (Sigma-Aldrich Inc., St. Louis, MO, USA) in accordance with the manufacturer's instructions. For breast-cancer and normal breast tissues, up to 50 mg of frozen tissue was transferred to 1 ml of TRI reagent, then the tissue was immediately homogenized and RNA extraction protocol was followed. Briefly, 0.5 μg of total RNA was reverse-transcribed in a 20 μl reaction with SuperscriptII reverse transcriptase (Invitrogen Inc.) in accordance with the manufacturer's instructions. RT-qPCR was performed using a master mix (Power SYBR Green; Applied Biosystems Inc., Foster City, CA, USA) and fold expression was calculated using the 2-ΔΔCT method. RT-qPCR results were normalized based on the expression of GAPDH for TPM4 and TBP for the other transcripts. The measured isoforms and the primers used for the isoform-specific PCR are presented (see Additional file 2: Table S2B).
Clustering of samples using isoform level expression estimates provided more homogeneous grouping than gene level expression estimates of oncogenic and non-oncogenic cell lines gene level
We expected the clustering of samples to result in a first-level grouping of different tissues, followed by a second-level grouping of cancer and non-oncogenic cell lines within each tissue type. Unexpectedly, however, we found almost uniform grouping of cancer and non-oncogenic cell lines into two large clusters, with an overall cluster purity of 97.5% at isoform level (Figure 1A). Further, the samples belonging to same cell/tissue type within each cancer/non-oncogenic group were clustered together, confirming the discriminatory power of isoform level gene-expression profiles. For example, the paired non-oncogenic melanocyte and cancerous melanoma samples, and the matched pairs of MCF7 (breast-cancer cell line) and the HMEC samples (non-oncogenic origin) were separated correctly into either non-oncogenic or cancer groups, with one exception from MCF7 samples (Figure 1A). Overall, only four samples, two each from non-oncogenic and cancer cell lines, were clustered into the wrong group. The cancer cell lines that were clustered into the non-oncogenic group were one MCF7/mammary gland adenocarcinoma (GEO accession number GSM472936) and one pancreatic carcinoma cell line (GEO GSM472939), and the non-oncogenic samples that were assigned to the cancer group were one adult non-oncogenic human epidermal keratinocyte (NHEK) sample (GEO GSM472937) and one non-oncogenic umbilical vein endothelial cell (HUVEC) sample (GEO GSM472935).
Although the clustering at gene-level showed some power of discrimination between non-oncogenic and cancer cell lines, the overall grouping was significantly less efficient than the clustering at isoform-level. The gene level cluster purity was 75%, with 20 samples from the non-oncogenic and cancer cell lines clustered into the wrong group (Figure 1B). The better separation of non-oncogenic and cancer cell lines at isoform-level (97.5% cluster purity) compared with gene -(75% cluster purity) implies that gene-expression profiles in cancer cells are more specifically altered at isoform-level for numerous genes, which could not be detected using gene level analysis.
We next focused our analysis on two specific cancers, breast cancer, and melanoma, for which we had matched oncogenic and non-oncogenic cell lines, in addition to the comparison of all oncogenic versus all non-oncogenic cell lines.
Transcript variants of numerous genes were differentially expressed between non-oncogenic and cancer cell lines
Experimental validation of differentially expressed transcript variants in breastcancer cell lines and samples
Isoform expression in breast-cancer cell lines as measured by reverse transcriptase -quantitative PCR (RT-qPCR).
Supervised analysis identified an isoform set able to separate the tumor lines from normal lines in an almost perfect pattern
These 14 genes are WNT5B, CCND2, SERPINB7, GPR176, INHBA, EFNB1, PTRF, CDH11, ZBTB16, GJA1, COL5A2, NID1, PRDM1, and TCF4. Except for CCND2 and GPR176, all other genes in our database have more than one isoform. Four genes (SERPINB7, INHBA, GJA1, and NID1) have two isoforms that have significantly different expression between the cancer and non-oncogenic groups. Interestingly, 12 of these 14 genes belong to the same gene network, involved in hematological system development and function, tissue morphology, and cellular development. According to the Ingenuity Pathway Knowledge Base, the network consists of a total of 27 different genes, which suggests that almost 50% of the genes belonging to this network are dysregulated either at the gene or isoform level between non-oncogenic and cancer cells (Figure 5C). Moreover, most of these genes have already been implicated both in tumorigenesis and in several developmental processes [43–48]. For example, it was shown that the phosphorylation-dependent interaction between c-Jun and TCF4 regulates intestinal tumorigenesis by integrating c-Jun kinase (JNK) and adenomatous polyposis coli (APC)/β-catenin, two distinct pathways activated by Wnt signaling . Multiple alternatively spliced transcript variants that may encode different protein isoforms of these genes (for example, TCF4, WNT5B) have been described. Therefore, it would be interesting to evaluate the components of this gene network in different cancers at isoform level.
Gene level and isoform level analyses identified interesting pathways associated with cancer
Human cancer is a complex disease. It is known that most of the genes in mammalian genomes generate different transcript variants and protein isoforms, which often function in a cell/tissue type-specific and developmental stage-specific manner in non-oncogenic cells. Cancer results from the sequential acquisition of a number of genetic and epigenetic alterations, and these mutations may alter the expression of specific isoforms but not the others of a gene. Despite this growing knowledge, most biomarker and drug-discovery studies still evaluate expression differences and study gene regulatory mechanisms at gene level rather than at isoform level. In this study, we have shown that oncogenic cell lines could be more accurately discriminated from non-oncogenic cell lines, regardless of their cells of origin, by gene-expression profiling at isoform level compared with gene level. In spite of the differences in tissues of origin, the cell lines were broadly clustered into two groups, non-oncogenic and oncogenic, by isoform level gene-expression profiles. We noted that numerous genes showed differential expression at isoform level but not at gene level. For some of these genes, the differential expression of alternative transcripts occurred in the opposite direction; while some of the isoforms of the same gene were upregulated, others were downregulated, resulting in them cancelling each other out and producing insignificant expression differences at gene level between cancer and non-oncogenic groups. Our findings are in agreement with a previous study on prostate cancer that investigated the expression of 1532 splice forms for 364 prostate cancer-related genes, using data from a customized exon junction array . The authors found that many genes were differentially expressed at splice-form level but not at gene level and this increase in the number of differentially expressed variables at splice-form level contributed to a 92% accuracy for a 128 splice-form-based classifier for normal and cancerous prostate tissue, whereas the accuracy was 87% using a classifier based on 32 genes. That study profiled 1532 mRNA splice forms from 364 potential prostate cancer-related genes, whereas in the current study, we used genome-wide exon-array data that identified the expression of 114,930 transcripts/isoforms corresponding to 35,612 different genes, including all known non-coding genes in the Ensembl database. In addition, our study focused on discriminating oncogenic and non-oncogenic cells in general, irrespective of their tissue of origin. Using genome-wide isoform level versus gene-level expression information, we found that oncogenic and non-oncogenic cells could be segregated using isoform level information with 97.5% accuracy versus 75% accuracy for gene level information, and even a smaller signature of 18 isoforms was effective in separating the two groups, with equal accuracy. These subtle differences at isoform level in discriminating non-oncogenic and oncogenic cell lines reflect the fact that gene level expression measurements, whose estimates are generally the summation of all the isoform level expression estimates of individual genes, are less accurate in characterizing cancer and non-oncogenic cells.
The pathway-enrichment analysis of genes that are differentially expressed in cancer cell lines at isoform level but not at gene level produced three interesting pathways that have been implicated in various cancers. It is well known that protein phosphorylation and protein ubiquitination regulate most aspects of cell life, and defects in these control mechanisms cause cancer and many other diseases . Similarly, abnormalities in purine metabolism and over-expression of Stathmin 1 (STMN1) are characteristic features of many human tumors [51, 52]. The key genes of these pathways (for example, STMN1, PNP, RPS27A and UBA52) transcribe different transcript variants, some of which encode different protein isoforms. It is therefore necessary to evaluate the gene-expression differences and to study gene regulatory mechanisms at isoform level rather than at gene level between non-oncogenic and disease conditions, such as cancer. Recent advances in cancer genomics have shown that gene-expression signatures are useful for biomarker identification and drug discovery . In this regard, the present study highlights the importance of studying gene-expression signatures at isoform level rather than at gene level, and makes a strong case for isoform level gene/protein-expression profiling methods for improved cancer biomarker and therapeutic discovery.
In conclusion, we have identified a common, isoform level signature that can be used to discriminate effectively between cancer and non-cancer cell lines. We found numerous genes for which the differential expression of alternative transcripts occurred in opposing directions, with some of the isoforms of the same gene being upregulated while others were downregulated, resulting in insignificant expression differences at gene level between cancer and non-oncogenic groups. This is supported by our experimental validation of opposing expression patterns for TPM4 gene isoforms in non-oncogenic and oncogenic tissue samples from breast cancer patients. The present study highlights the importance of studying gene-expression signatures at isoform level rather than at gene level in characterizing the cancer transcriptome.
This work was supported by American Cancer Society Research Scholar Grant (number RSG-07-097-01) to RD. RD holds a Philadelphia Healthcare Trust Endowed Chair Position; research in his laboratory is partially supported by the Philadelphia Healthcare Trust. The use of resources in the Bioinformatics Shared Facility of Wistar Institute Cancer Center (grant number P30 CA010815) is gratefully acknowledged. We thank Dr Zhiyan Fu for his reading of the manuscript.
- Heard E, Tishkoff S, Todd JA, Vidal M, Wagner GP, Wang J, Weigel D, Young R: Ten years of genetics and genomics: what have we achieved and where are we heading?. Nat Rev Genet. 2010, 11: 723-733. 10.1038/nrg2878View ArticlePubMedPubMed CentralGoogle Scholar
- Boran AD, Iyengar R: Systems approaches to polypharmacology and drug discovery. Curr Opin Drug Discov Devel. 2010, 13: 297-309.PubMedPubMed CentralGoogle Scholar
- Janga SC, Tzakos A: Structure and organization of drug-target networks: insights from genomic approaches for drug discovery. Mol Biosyst. 2009, 5: 1536-1548. 10.1039/b908147jView ArticlePubMedGoogle Scholar
- Swanton C, Caldas C: Molecular classification of solid tumours: towards pathway-driven therapeutics. Br J Cancer. 2009, 100: 1517-1522. 10.1038/sj.bjc.6605031View ArticlePubMedPubMed CentralGoogle Scholar
- Feero WG, Guttmacher AE, Collins FS: Genomic medicine--an updated primer. The New England journal of medicine. 2010, 362: 2001-2011. 10.1056/NEJMra0907175View ArticlePubMedGoogle Scholar
- Gupta R, Wikramasinghe P, Bhattacharyya A, Perez FA, Pal S, Davuluri RV: Annotation of gene promoters by integrative data-mining of ChIP-seq Pol-II enrichment data. BMC Bioinformatics. 2010, 11 (Suppl 1): S65- 10.1186/1471-2105-11-S1-S65View ArticlePubMedPubMed CentralGoogle Scholar
- Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB: Alternative isoform regulation in human tissue transcriptomes. Nature. 2008, 456: 470-476. 10.1038/nature07509View ArticlePubMedPubMed CentralGoogle Scholar
- Singer GA, Wu J, Yan P, Plass C, Huang TH, Davuluri RV: Genome-wide analysis of alternative promoters of human genes using a custom promoter tiling array. BMC Genomics. 2008, 9: 349- 10.1186/1471-2164-9-349View ArticlePubMedPubMed CentralGoogle Scholar
- Barrera LO, Li Z, Smith AD, Arden KC, Cavenee WK, Zhang MQ, Green RD, Ren B: Genome-wide mapping and analysis of active promoters in mouse embryonic stem cells and adult organs. Genome Res. 2008, 18: 46-59.View ArticlePubMedPubMed CentralGoogle Scholar
- Hovanes K, Li TW, Munguia JE, Truong T, Milovanovic T, Lawrence Marsh J, Holcombe RF, Waterman ML: Beta-catenin-sensitive isoforms of lymphoid enhancer factor-1 are selectively expressed in colon cancer. Nat Genet. 2001, 28: 53-57.PubMedGoogle Scholar
- Nekulova M, Holcakova J, Coates P, Vojtesek B: The role of p63 in cancer, stem cells and cancer stem cells. Cell Mol Biol Lett. 2011, 16: 296-327. 10.2478/s11658-011-0009-9View ArticlePubMedGoogle Scholar
- Tomasini R, Tsuchihara K, Wilhelm M, Fujitani M, Rufini A, Cheung CC, Khan F, Itie-Youten A, Wakeham A, Tsao MS, et al.: TAp73 knockout shows genomic instability with infertility and tumor suppressor functions. Genes Dev. 2008, 22: 2677-2691. 10.1101/gad.1695308View ArticlePubMedPubMed CentralGoogle Scholar
- Wilhelm MT, Rufini A, Wetzel MK, Tsuchihara K, Inoue S, Tomasini R, Itie-Youten A, Wakeham A, Arsenian-Henriksson M, Melino G, et al.: Isoform-specific p73 knockout mice reveal a novel role for delta Np73 in the DNA damage response pathway. Genes Dev. 2010, 24: 549-560. 10.1101/gad.1873910View ArticlePubMedPubMed CentralGoogle Scholar
- Eeckhoute J, Moerman E, Bouckenooghe T, Lukoviak B, Pattou F, Formstecher P, Kerr-Conte J, Vandewalle B, Laine B: Hepatocyte nuclear factor 4 alpha isoforms originated from the P1 promoter are expressed in human pancreatic beta-cells and exhibit stronger transcriptional potentials than P2 promoter-driven isoforms. Endocrinology. 2003, 144: 1686-1694. 10.1210/en.2002-0024View ArticlePubMedGoogle Scholar
- Richter AM, Pfeifer GP, Dammann RH: The RASSF proteins in cancer; from epigenetic silencing to functional characterization. Biochim Biophys Acta. 2009, 1796: 114-128.PubMedGoogle Scholar
- Akgul C, Moulding DA, Edwards SW: Alternative splicing of Bcl-2-related genes: functional consequences and potential therapeutic applications. Cell Mol Life Sci. 2004, 61: 2189-2199.View ArticlePubMedGoogle Scholar
- Rajan P, Elliott DJ, Robson CN, Leung HY: Alternative splicing and biological heterogeneity in prostate cancer. Nat Rev Urol. 2009, 6: 454-460. 10.1038/nrurol.2009.125View ArticlePubMedGoogle Scholar
- David CJ, Manley JL: Alternative pre-mRNA splicing regulation in cancer: pathways and programs unhinged. Genes Dev. 2010, 24: 2343-2364. 10.1101/gad.1973010View ArticlePubMedPubMed CentralGoogle Scholar
- Ghigna C, Valacca C, Biamonti G: Alternative splicing and tumor progression. Curr Genomics. 2008, 9: 556-570. 10.2174/138920208786847971View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang C, Li HR, Fan JB, Wang-Rodriguez J, Downs T, Fu XD, Zhang MQ: Profiling alternatively spliced mRNA isoforms for prostate cancer classification. BMC Bioinformatics. 2006, 7: 202- 10.1186/1471-2105-7-202View ArticlePubMedPubMed CentralGoogle Scholar
- Moller-Levet CS, Betts GN, Harris AL, Homer JJ, West CM, Miller CJ: Exon array analysis of head and neck cancers identifies a hypoxia related splice variant of LAMA3 associated with a poor prognosis. PLoS Comput Biol. 2009, 5: e1000571- 10.1371/journal.pcbi.1000571View ArticlePubMedPubMed CentralGoogle Scholar
- Clark TA: Discovery of tissue-specific exons using comprehensive human exon microarrays. Genome Biol. 2007, 8: R64- 10.1186/gb-2007-8-4-r64View ArticlePubMedPubMed CentralGoogle Scholar
- Gardina PJ: Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array. BMC Genomics. 2006, 7: 325- 10.1186/1471-2164-7-325View ArticlePubMedPubMed CentralGoogle Scholar
- Kwan T, Benovoy D, Dias C, Gurd S, Provencher C, Beaulieu P, Hudson TJ, Sladek R, Majewski J: Genome-wide analysis of transcript isoform variation in humans. Nat Genet. 2008, 40: 225-231. 10.1038/ng.2007.57View ArticlePubMedGoogle Scholar
- McDaniell R, Lee BK, Song L, Liu Z, Boyle AP, Erdos MR, Scott LJ, Morken MA, Kucera KS, Battenhouse A, et al.: Heritable individual-specific and allele-specific chromatin signatures in humans. Science. 2010, 328: 235-239. 10.1126/science.1184655View ArticlePubMedPubMed CentralGoogle Scholar
- Hansen RS, Thomas S, Sandstrom R, Canfield TK, Thurman RE, Weaver M, Dorschner MO, Gartler SM, Stamatoyannopoulos JA: Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci USA. 2010, 107: 139-144. 10.1073/pnas.0912402107View ArticlePubMedPubMed CentralGoogle Scholar
- Berger MF, Levin JZ, Vijayendran K, Sivachenko A, Adiconis X, Maguire J, Johnson LA, Robinson J, Verhaak RG, Sougnez C, et al.: Integrative analysis of the melanoma transcriptome. Genome Res. 2010, 20: 413-427. 10.1101/gr.103697.109View ArticlePubMedPubMed CentralGoogle Scholar
- Turro E, Lewin A, Rose A, Dallman MJ, Richardson S: MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays. Nucleic Acids Res. 2010, 38: e4- 10.1093/nar/gkp853View ArticlePubMedPubMed CentralGoogle Scholar
- Yates T, Okoniewski MJ, Miller CJ: X:Map: annotation and visualization of genome structure for Affymetrix exon array analysis. Nucleic Acids Res. 2008, 36: D780-786.View ArticlePubMedPubMed CentralGoogle Scholar
- Cleveland WS, Grosse E, Shyu WM: Local regression models. 1992, Wadsworth & Brooks/ColeGoogle Scholar
- Cleveland WS: Robust Locally Weighted Regression and Smoothing Scatterplots. Journal of the American Statistical Association. 1979, 74: 829-836. 10.1080/01621459.1979.10481038.View ArticleGoogle Scholar
- Gordon ADSE: Classification. 1999, London: Chapman and Hall/CRC,Google Scholar
- Monti S, Tamayo P, Mesirov J, Golub T: Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data. Mach Learn. 2003, 52: 91-118. 10.1023/A:1023949509487.View ArticleGoogle Scholar
- Smyth GK, Yang YH, Speed T: Statistical issues in cDNA microarray data analysis. Methods Mol Biol. 2003, 224: 111-136.PubMedGoogle Scholar
- Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: Article3,Google Scholar
- R Development Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing. Book R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing. 2010, (Editor ed.^eds.). City,Google Scholar
- Wilkerson MD, Hayes DN: ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics. 2010, 26: 1572-1573. 10.1093/bioinformatics/btq170View ArticlePubMedPubMed CentralGoogle Scholar
- Rousseeuw PJ: Silhouettes - a Graphical Aid to the Interpretation and Validation of Cluster-Analysis. J Comput Appl Math. 1987, 20: 53-65.View ArticleGoogle Scholar
- Ugurel S, Houben R, Schrama D, Voigt H, Zapatka M, Schadendorf D, Brocker EB, Becker JC: Microphthalmia-associated transcription factor gene amplification in metastatic melanoma is a prognostic marker for patient survival, but not a predictive marker for chemosensitivity and chemotherapy response. Clin Cancer Res. 2007, 13: 6344-6350. 10.1158/1078-0432.CCR-06-2682View ArticlePubMedGoogle Scholar
- Zheng B, Jeong JH, Asara JM, Yuan YY, Granter SR, Chin L, Cantley LC: Oncogenic B-RAF negatively regulates the tumor suppressor LKB1 to promote melanoma cell proliferation. Mol Cell. 2009, 33: 237-247. 10.1016/j.molcel.2008.12.026View ArticlePubMedPubMed CentralGoogle Scholar
- Brown JH, Senthil Kumar VS, O'Neall-Hennessey E, Reshetnikova L, Robinson H, Nguyen-McCarty M, Szent-Gyorgyi AG, Cohen C: Visualizing key hinges and a potential major source of compliance in the lever arm of myosin. Proc Natl Acad Sci USA. 2010,Google Scholar
- Pscherer A, Dorflinger U, Kirfel J, Gawlas K, Ruschoff J, Buettner R, Schule R: The helix-loop-helix transcription factor SEF-2 regulates the activity of a novel initiator element in the promoter of the human somatostatin receptor II gene. EMBO J. 1996, 15: 6680-6690.PubMedPubMed CentralGoogle Scholar
- Hata S, Emi Y, Iyanagi T, Osumi T: cDNA cloning of a putative G protein-coupled receptor from brain. Biochim Biophys Acta. 1995, 1261: 121-125. 10.1016/0167-4781(95)00002-XView ArticlePubMedGoogle Scholar
- Luska G, Huchzermeyer H, Seifert E, Stender HS: [The radiological diagnosis of non-calculous biliary duct obstruction (author's transl)]. Rofo. 1977, 126: 117-122. 10.1055/s-0029-1230546View ArticlePubMedGoogle Scholar
- Gavin BJ, McMahon JA, McMahon AP: Expression of multiple novel Wnt-1/int-1-related genes during fetal and adult mouse development. Genes Dev. 1990, 4: 2319-2332. 10.1101/gad.4.12b.2319View ArticlePubMedGoogle Scholar
- Kriegl L, Horst D, Reiche JA, Engel J, Kirchner T, Jung A: LEF-1 and TCF4 expression correlate inversely with survival in colorectal cancer. J Transl Med. 2010, 8: 123- 10.1186/1479-5876-8-123View ArticlePubMedPubMed CentralGoogle Scholar
- Sareddy GR, Panigrahi M, Challa S, Mahadevan A, Babu PP: Activation of Wnt/beta-catenin/Tcf signaling pathway in human astrocytomas. Neurochem Int. 2009, 55: 307-317. 10.1016/j.neuint.2009.03.016View ArticlePubMedGoogle Scholar
- Nateri AS, Spencer-Dene B, Behrens A: Interaction of phosphorylated c-Jun with TCF4 regulates intestinal cancer development. Nature. 2005, 437: 281-285. 10.1038/nature03914View ArticlePubMedGoogle Scholar
- Pollack IF, Hamilton RL, Burger PC, Brat DJ, Rosenblum MK, Murdoch GH, Nikiforova MN, Holmes EJ, Zhou T, Cohen KJ, Jakacki RI: Akt activation is a common event in pediatric malignant gliomas and a potential adverse prognostic marker: a report from the Children's Oncology Group. J Neurooncol. 2010, 99: 155-163. 10.1007/s11060-010-0297-3View ArticlePubMedPubMed CentralGoogle Scholar
- Levine AJ, Puzio-Kuter AM: The control of the metabolic switch in cancers by oncogenes and tumor suppressor genes. Science. 2010, 330: 1340-1344. 10.1126/science.1193494View ArticlePubMedGoogle Scholar
- Rana S, Maples PB, Senzer N, Nemunaitis J: Stathmin 1: a novel therapeutic target for anticancer activity. Expert Rev Anticancer Ther. 2008, 8: 1461-1470. 10.1586/14737220.127.116.111View ArticlePubMedGoogle Scholar
- Tan DS, Thomas GV, Garrett MD, Banerji U, de Bono JS, Kaye SB, Workman P: Biomarker-driven early clinical trials in oncology: a paradigm shift in drug development. Cancer J. 2009, 15: 406-420. 10.1097/PPO.0b013e3181bd0445View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.