Transcriptional dissection of pancreatic tumors engrafted in mice
Genome Medicine volume 6, Article number: 27 (2014)
Engraftment of primary pancreas ductal adenocarcinomas (PDAC) in mice to generate patient-derived xenograft (PDX) models is a promising platform for biological and therapeutic studies in this disease. However, these models are still incompletely characterized. Here, we measured the impact of the murine tumor environment on the gene expression of the engrafted human tumoral cells.
We have analyzed gene expression profiles from 35 new PDX models and compared them with previously published microarray data of 18 PDX models, 53 primary tumors and 41 cell lines from PDAC. The results obtained in the PDAC system were further compared with public available microarray data from 42 PDX models, 108 primary tumors and 32 cell lines from hepatocellular carcinoma (HCC). We developed a robust analysis protocol to explore the gene expression space. In addition, we completed the analysis with a functional characterization of PDX models, including if changes were caused by murine environment or by serial passing.
Our results showed that PDX models derived from PDAC, or HCC, were clearly different to the cell lines derived from the same cancer tissues. Indeed, PDAC- and HCC-derived cell lines are indistinguishable from each other based on their gene expression profiles. In contrast, the transcriptomes of PDAC and HCC PDX models can be separated into two different groups that share some partial similarity with their corresponding original primary tumors. Our results point to the lack of human stromal involvement in PDXs as a major factor contributing to their differences from the original primary tumors. The main functional differences between pancreatic PDX models and human PDAC are the lower expression of genes involved in pathways related to extracellular matrix and hemostasis and the up- regulation of cell cycle genes. Importantly, most of these differences are detected in the first passages after the tumor engraftment.
Our results suggest that PDX models of PDAC and HCC retain, to some extent, a gene expression memory of the original primary tumors, while this pattern is not detected in conventional cancer cell lines. Expression changes in PDXs are mainly related to pathways reflecting the lack of human infiltrating cells and the adaptation to a new environment. We also provide evidence of the stability of gene expression patterns over subsequent passages, indicating early phases of the adaptation process.
Patient-derived xenograft (PDX) models are becoming a common platform for research and clinical purposes. The establishment of PDX models to study cancer biology and pharmacology is a common practice that has been successfully applied to many cancer types[2–5]. Xenografting of human primary carcinomas is in fact the only method currently available that permits the propagation of a significant proportion of carcinomas[6–8] and has many advantages over tumor-derived cell lines maintained in vitro[9–11]. Both cell lines and PDX models permit the removal of contaminating non-neoplastic human cells from the human tumors. However, the tissue architecture is only partially maintained in PDXs[2, 11, 12] with mouse stromal cells substituting for human stromal cells. In general, the results obtained using PDX models in mice show better preclinical and clinical concordance than those from cell lines[11, 14].
Pancreas ductal adenocarcinoma (PDAC) is usually diagnosed in advanced stages after it has metastasized to regional lymph nodes, liver or lung and the median survival after diagnosis is approximately 8 months. PDAC is notorious for how difficult it is to obtain biological material to study the disease. In addition, standard treatments have a very low percentage of success and the short survival time of the patients makes it challenging to search for alternative therapies. For these reasons, PDX models are particularly attractive for studying PDAC.
Despite their advantages over cell lines, fresh tumors xenografted in mice show differences from the original primary tumors. For example, the proportion of murine stromal cells in PDAC PDXs is lower than the proportion of human stromal cells in the original primary tumors (our experimental observations). Thus, caution should be taken when interpreting the results obtained using these models. A study by Gadaleta et al. analyzed the so-called 'pancreas expression space' by combining publicly available gene expression array datasets studied with the Affymetrix Human Genome U133 Plus 2.0 Array platform. This pancreas expression space included healthy pancreatic tissue, human primary pancreas cancer, non-tumoral tissue adjacent to tumor, tumor-derived cell lines and PDX models. These authors applied a statistical method (principal component analysis) to explore how the different samples clustered in the two first principal components. Their main findings were that (1) non-tumoral tissue adjacent to tumor was different to healthy pancreatic tissue, (2) primary tumors and tumor adjacent samples clustered together, and (3) PDXs and cell lines clustered in two other groups. One of their main conclusions was 'that ectopic subcutaneous xenografts and cell line models do not effectively represent changes occurring in pancreatic cancer'. This work highlighted the importance of understanding better the extent to which the mouse environment is altering the gene expression of the implanted human tumoral cells.
Our central goal was to understand how the ectopic xenograft mouse environment affects the expression phenotype of PDAC cells. We developed a robust analysis protocol to explore the PDAC expression space, including human primary tumors, tumor-derived cell lines and PDX models. We used expression data from primary tumors, PDX models and cell lines derived from PDACs. In addition, we used hepatocellular carcinoma (HCC) to study if PDXs and cell lines share any characteristics with the original primary tumors. The study shows that the gene expression profiles of PDAC and HCC PDX models conserved substantial similarity with their original primary tissues, while all cell lines clustered together, independently of their tissues of origin. However, we also identified several biological pathways that showed differential expression between PDX models and tumors, such as extracellular matrix organization, likely due to replacement of the human stroma by murine elements, and up-regulation of cell cycle and DNA replication that are indicative of higher proliferation. We also show that these changes happened at initial stages after engrafting and were stable over different passages.
Materials and methods
Generation of PDX models and sample processing
We performed gene expression profiling for 35 new PDX models of PDACs xenografted in nude mice and 2 human primary tumor samples. PDX models were established following an already described protocol. Mice used in this research have been treated humanely according to the regulations laid down by the CNIO Bioethics Committee and the relevant EC guidelines (directive 86/609/EEC), with due consideration to the alleviation of distress and discomfort. RNA was extracted using Qiagen RNeasy Mini kits (QIAGEN Inc., Valencia, CA, USA) and was hybridized in GeneChip® Human Genome U133 Plus 2.0 Array, Affymetrix (Santa Clara, CA, USA). The data generated in this publication have been deposited in NCBI’s Gene Expression Omnibus (GEO) and are accessible through GEO Series accession number GSE51798.
In addition to the new gene expression profiles specifically obtained for this study, we retrieved data from GEO series[19, 20] that were hybridized in the same platform as our models (Human Genome U133 Plus 2.0 Array). We also added the data from a dataset with 18 expression profiles from PDX models (GSE9599) previously described. We downloaded expression data for 51 original tumor samples available from Badea et al. and Pei et al. published in datasets GSE15471 and GSE16515. We also used gene expression profiles of 22 cell lines from Maupin et al. and 19 from Collisson et al.. These two datasets have data from 11 cell lines in common. All samples corresponded or were derived from adenocarcinomas.
In addition, we compiled an HCC dataset with 182 expression profiles, including data from 42 HCC PDX models, 32 cell lines[26, 27] (these two datasets have data from 5 cell lines in common) and 108 primary HCC[26, 28–31] samples. All samples corresponded to carcinomas, except two cell lines (HuH-6 and NCI-H684) that were derived from hepatoblastomas. All datasets used to create gene expression spaces are summarized in Table 1.
We also downloaded two datasets of expression profiles of pancreatic primary tumors and metastases, GSE42952 and GSE34153. GSE42952 was normalized and processed with the same protocol as the rest of the samples in the HGU133 Plus 2.0 Array platform. The GSE34153 dataset corresponds to the Agilent 4x44 K whole human genome array platform and we used the normalized data available in GEO series.
We used the frozen robust multiarray analysis (fRMA) method for preprocessing and normalization of microarrays. fRMA pre-processes arrays individually and allows addition of new samples to a previously normalized dataset, a feature that is very convenient when new samples are included in a pre-existing dataset. We then utilized the Gene Expression Barcode method included in the fRMA R package that converts normalized expression intensities into expression binary calls (silenced or expressed). The main benefit of this approach is that it minimizes batch effects and reduces noise. As we were not using expression intensities but yes/no expression calls, we used the multiple correspondence analysis (MCA) method to explore the pancreas expression space. MCA is equivalent to principal component analysis (PCA) when working with qualitative data instead of continuous variables. In this work we have adapted a methodological protocol, previously applied to multiple sequence alignments, for the automatic extraction of relevant signatures from MCA results. In brief, our protocol performs a MCA on a vectorial representation of multiple 'barcoded' microarray data in a high dimensional space. MCA produces a new multidimensional space so that the accumulated variance of the coordinates of every probe in a subset of dimensions is optimal. The MCA space is reduced to a low dimensional one preserving most of the original information but filtering the main sources of noise. Our protocol establishes a low dimensional space composed of those dimensions that incorporate results in a statistically significant increment of the information. The number of informative dimensions (those that explain most of the total variance) is selected according to the P-value of a Wilcoxon test between contiguous axes. Additional axes are included in the expression space if the P-value is lower than 0.01. Robust unsupervised k-means clustering is performed iteratively on this reduced space (defined by the informative dimensions) for a range of pre-specified numbers of groups (from 2 to 50). Finally, optimal clustering solutions are detected as those maximizing the Calinsky’s and Harabsz’s (CH) index.
Estimation of non-tumoral component and correction for its effect
We measured the proportion of stromal and infiltrating immune cells using ESTIMATE, a gene expression signature-based method that estimates tumor purity from gene expression data. ESTIMATE scores were calculated using the original R code from the authors using the default parameters for the Affymetrix arrays. To remove the contribution of human non-tumoral infiltrating (hNTI) cells to the PDX primary tumor expression space, we first generated an expression space without the signature genes used by ESTIMATE. Then, we removed the gene expression signal directly relating to the effect of hNTI cells from remaining genes using simple linear regression models. We fitted two linear models, one for the first component and the other for the second component, using in both cases the ESTIMATE score as the explanatory variable (R2 = 0.04, P-value = 0.002 and R2 = 0.83, P-value <2.2 × 10-16, respectively; Additional file1). Next, we removed the non-tumoral trend of each component using the corresponding linear regression models. The regression residuals of each model were used to generate a new gene expression space corrected for the contribution of non-tumoral infiltrating cells.
Pathway enrichment analyses
To study differences between original patients’ tumors and PDX models we calculated the number of times that each gene was called as 'expressed' in each group, or its 'expression frequency'. Here we considered that a gene was expressed if any of the probes detecting that gene in the microarray was called as 'expressed' according to the Barcode method. The difference in expression frequency for each gene between each group was obtained by subtracting the gene expression frequency in PDX from the gene expression frequency in primary tumors. We ranked the values of differences in expression frequency and performed gene set enrichment analyses (GSEAs) with the GseaPreranked tool of the GSEA software using as gene sets the pathways annotated in Reactome. A total of 621 pathways were testable after the default filtering step (at least 15 genes and no more than 500 genes in each gene set). We grouped together similar significant pathways (false discovery rate (FDR) <0.05) as ‘functional groups’, guided by the Reactome hierarchy. For comparisons of primary tumors and metastases[32, 33], we used the normalized expression matrices as input for GSEA to test if functional groups were differentially expressed between metastases and primary tumors. We used t-test as the metric for ranking genes.
Differential expression between different passages
We used the Limma package to test differential gene expression between four patients’ tumors and PDX models developed from them at different passages. We created a linear model with information about number of passages: primary original tumor (F0), 5th passage (F5) and 10th passage (F0), using each patient as blocking information. Then we contrasted F0 versus F5, F5 versus F10 and F0 versus F10. After each contrast analysis, we ranked genes according to their t-statistic and used the GseaPreranked tool (see above) to analyze if the expression levels of the genes changed throughout passages.
Results and discussion
Pancreas and liver cancer gene expression spaces
We compiled a PDAC gene expression dataset comprising a total of 147 PDAC and PDAC-derived profiles, including 53 PDX models, 53 whole-tissue primary PDACs and 41 cell lines (Table 1), in part collected from databases and in part derived for this study. We processed all data using the same analysis protocol, transforming the expression intensities into expression calls[34, 35]. Then, we used our MCA implementation to explore quantitatively the transcriptional space of biological samples based on their gene expression binary profiles (expressed or not expressed; see Materials and methods for details). Figure 1A shows the first two dimensions of the PDAC expression space explaining 64% of the variance (49% by the first axis and 15% by the second). The distribution of the samples in the first and second axes is significantly different (P-value = 1.7 × 10-4, Wilcoxon test), while adding a third dimension to the expression space does not result in a significant gain of information (P-value = 0.08; significance threshold = 0.01).
The MCA results clearly show that the human tumor samples, PDX models and cell lines were three distinct groups (Figure 1A) based on the transcriptional profile of the complete genome as represented in the arrays. In fact, k-means clustering of the space created by the MCA found an optimal solution that corresponds to three clusters (k-means = 3, CH index = 199.541): (1) primary original tumors, (2) PDXs and (3) cell lines. Similar results were obtained when analyzing the normalized intensities with PCA or by hierarchical clustering of the expression calls (Figures S1A and S2A in Additional file1). Therefore, based on expression profiles, PDX models and cell lines, considered globally, are as different from the original tumors as they are from each other.
To compare our results for PDAC with other cancer types, we used publicly available HCC microarray data, for which we could gather a sufficiently large collection of human primary tumors, PDX models and cell lines. We were able to compile a dataset with a total of 182 expression profiles from HCC, including 42 HCC PDX models, 32 cell lines and 108 primary HCCs. We applied the same analysis protocol used for the PDAC data. Figure 1B shows the HCC transcriptional space generated by the first two dimensions of the MCA (Wilcoxon test P-value = 1.6 × 10-7), which explain 57% of the variance (41% by the first axis and 16% by the second). HCC PDX models and cell lines were also located in different areas of the HCC expression space (Figure 1B). Indeed, the k-means clustering of the space also had an optimal solution corresponding to three clusters (k-means = 3, CH index = 236.304): (1) human primary tumors, (2) PDX models and (3) cell lines. Similar clustering of the samples was obtained using alternative methods (PCA of expression intensities and hierarchical clustering of expression calls; Figures S1B and S2B in Additional file1). The results obtained with the HCC samples were in agreement with the previously shown PDAC clusters, where the global expression of primary tumors, PDX models and cell lines produced three distinct groups.
Our results in PDAC and HCC confirmed previous observations made in PDAC about the differences between primary tumors, cell lines and PDX models in terms of their expression profiles. This scenario highlights the influence of the cellular environment in the global expression of human primary tissue samples, cell lines and PDX models. We reasoned that the PDX-associated milieu may affect similarly to different types of human tumors, so we next investigated to what extent the gene expression of the human tumoral cells changes when engrafted in PDX models focusing on PDAC and HCC.
Conservation of primary tumor gene expression signals in PDXs but not in cell lines
It is reasonable to think that xenografted cells may keep a gene expression signature specific to their tumor of origin. However, if the mouse microenvironment induces similar changes in any cell type implanted into mice, then the gene expression of any cell type could acquire xenograft-specific characteristics and totally lose its original expression blueprint. We analyzed simultaneously expression profiles from HCC primary tumors, HCC PDX models, PDAC primary tumors and PDAC PDX models. Figure 2A shows the first two dimensions of the resulting tridimensional expression space (the third axis is included in Figure S3 in Additional file1). The four-group optimal solution of the k-means clustering (CH index = 228.706) classified human primary PDAC, PDAC PDX models, human primary HCC and HCC PDX models separately. It is important to note that the clusters were obtained in an unsupervised manner (that is, sample labels were not used). Thus, we could check how many samples were well classified in their original tissue according to an automatic group selection protocol based on the expression space. We found that the groups were reliably classified; in pancreas 100% of PDAC and PDX models were grouped into their respective groups, while in liver 88% of primary HCC and 95% of HCC PDXs were correctly classified. In total, 94% of 256 total samples were correctly classified.
The first component (Figure 2A, X-axis) explained 30% of the variance and clearly separated the samples by their tissue of origin, that is, human primary tumors and PDXs from pancreas and human primary tumors and PDXs from liver clustered separately (Figure 2B). The second component (Figure 2A, Y-axis) separated PDX models and primary samples and explained 27% of the variance. The third axis explains just 8% of the variance and its incorporation in the expression space supports that there are four distinct groups of samples. Our clustering results using MCA-based expression spaces were in agreement with additional analyses using PCA (Figure S4A in Additional file1) and hierarchical clustering (Figure S5 in Additional file1). It is important to keep in mind that each axis of a gene expression space represents an independent source of variance, corresponding to different underlying gene expression patterns. Moreover, the sample distributions in the X-axis and Y-axis show highly significant differences (Wilcoxon’s test P-value = 4.1 × 10-29). Additionally, different clustering of the samples is obtained when using distances based on either the X-coordinates (samples grouped according to the tumor of origin; Figure S5C in Additional file1) or the Y-coordinates (samples group according to their environment; Figure S5D in Additional file1).
These two significant components imply the co-existence of two complementary gene expression patterns. The first component would reflect a 'gene expression memory' of the tumor of origin (similar expression calls for cells from the same origin, different calls for those from different origins). In contrast, the second component highlights the effect of an environment-associated expression signal (similar expression in similar environment, different expression otherwise). As PDXs in this second axis are grouped separately from primary tumors, this axis might represent the loss of part of the gene expression memory associated with the change of host species.
We performed a similar analysis using cell lines instead of PDX models. Figure 2B shows the resulting bi-dimensional gene expression space obtained using expression profiles from primary PDACs, PDAC cell lines, primary HCCs and HCC cell lines and which explain 65% of the variance (43% by the first axis and 22% by the second, Wilcoxon test P-value = 3.99 × 10-13). Remarkably, while PDX models derived from PDACs and HCCs form two distinct clusters (Figure 2A), the corresponding cell lines were indistinguishable (Figure 2B; Figure S6 in Additional file1). A similar observation, regarding the higher similarity of cell lines from different origins, has been previously reported. Our data suggest that, at least for PDAC and HCC, the gene expression profiles of PDXs remain partially related to the original tumors, while cell lines’ profiles are not. Our results are consistent with previous observations in breast, kidney, small cell lung cancer and uveal melanomas where PDXs maintain key features of the original tumors, including functional activity and gene expression profiles. In addition, Daniel et al. found that genetic divergence between original tumors and cell lines was higher than genetic divergence between human primary tumors and PDX models.
Contribution of non-tumoral stromal and immune infiltrating cells
Substitution of the original stroma of primary tumors by the murine stroma in PDXs is an important factor that could partially account for observed differences between primary tumors and PDXs. In fact, it is widely acknowledged that tumor samples are virtually always 'contaminated' by non-tumoral stromal and immune cells. Interestingly, PDX expression data established using human microarrays should be mostly free from the contribution of these non-tumoral components. The combination of the platform species-specificity and the well-known reduction of non-tumoral components in PDXs are expected to strongly hinder the detection of this ever-present 'contamination' in primary tumors. Consequently, the absence of this human 'non-tumoral' contamination in PDXs with respect to primary tumors could lead to an overestimation of the differences between them and primary tumors, as well as of the similarities.
To address this point, we used the ESTIMATE method to infer the fraction of stromal and immune cells in the different samples. ESTIMATE is based on a gene signature characteristic of human tumor-infiltrating stromal and immune cells. According to this method, PDAC primary tumors show a higher proportion of hNTI cells than HCC primary tumors (Figure 3A). Both types of PDXs have very low (negative) ESTIMATE scores, which indicate that human non-tumoral cells are basically absent in PDXs, supporting our own experimental observations. Next, we generated a new gene expression space using the same samples as in Figure 2A but only using the 282 genes of this hNTI cell signature. Remarkably, a totally different gene expression space was obtained, where the first component is the only informative axis, explaining as much as 86% of the variance (Figure S7A in Additional file1). Interestingly, this first MCA component separates the samples in a similar way to the ESTIMATE scores (R2 = 0.94, P-value <2.2 × 10-16; Figure S7B in Additional file1).
To evaluate the effect of hNTI cells on our results, we repeated the gene expression space analysis of PDAC and HCC PDXs and primary tumors (as in Figure 2A) but excluding the probes mapping to the genes of this non-tumoral gene signature. Interestingly, removal of these 282 genes has a minor effect on the sample distribution in the resultant gene expression space (Figure 3B), showing that these genes were not the main contributors of the gene expression space. However, when we compared the Y-axis of the expression space generated without the hNTI cell signature genes (Figure 3B) with the ESTIMATE scores, we observed a striking and very significant correlation (R2 = 0.83, P-value <2.2 × 10-16; Figure S7D in Additional file1; for correlation with other axes and an equivalent analysis for cell lines see Figures S7 to S9 in Additional file1). These results indicate that differences in the contribution of hNTI cells are a main factor associated with the separated clustering of primary tumors and PDX models (Y-axis in Figure 2A and Figure 3B). These findings imply that similarities between PDX and primary tumors are strongly underestimated by gene expression experiments.
In view of this strong effect, we tried to remove (in silico) the contribution of non-tumoral cells to the global gene expression profiles. For this we used simple linear models for each component with ESTIMATE scores as a proxy of this contribution (see Materials and methods). After model-based removal of this trend associated with the hNTI cell signature, we generated a new gene expression space corrected for their effect (Figure 3C). In contrast to the original expression space, the corrected second component does not separate PDX models from primary tumors. This suggests that this separation was attributable to the differential contribution of hNTI cells to primary tumors and PDXs. In fact, in this corrected space, PDX models and primary PDAC samples are virtually indistinguishable, while PDXs and primary HCCs overlap and form a unique cluster.
Ideally, microdissection of the different cell types would be the best way to determine how each cell type contributes to the overall gene expression of the tissue. However, the microdissection of cells in fixed tissues is associated with higher levels of RNA degradation and the amplification of partially degraded RNA provokes potential systemic biases of gene expression[46, 47]. In any case, our data also suggest that the non-tumoral (murine) infiltrating cells in PDXs have a minor contribution to the expression profiles using the Affymetrix U133 Plus 2.0 human array platform. Other studies have tried to measure the species-specificity of human microarrays in mixtures of human and mouse cells and the general conclusion was that most of the signal detected is human-specific[46, 48, 49]. Current approaches performing simultaneous quantification by RNAseq of mouse-specific sequence reads (stromal cells) and human-specific reads (tumoral cells) will help to better understand the contribution of the murine stroma to these changes in gene expression[48, 50, 51].
Pathway analysis of PDAC PDX models
Because the gene expression of PDAC PDX models differs from that of the original PDAC tumors, it is critical to understand the similarities and differences between these two groups and, in particular, the extent to which these changes in expression can influence their validity as preclinical models of drug sensitivity[1, 8, 52]. Using the gene expression data, we calculated the expression frequency (number of times that each gene can be considered to be 'expressed'; see Materials and methods) of each gene in both the PDAC primary tumors and PDX models. Then, we tested if any of the biochemical and signaling pathways from the Reactome database were statistically enriched in differentially expressed genes (see Pathway enrichment analyses section in Materials and methods).
Figure 4 shows 72 pathways that presented significant accumulations of up-regulated or down-regulated genes (FDR <0.05) in PDXs. These pathways and genes represent a small number of related cellular functions, so we grouped them in functional groups. It is particularly interesting that genes in pathways related to cell cycle and DNA replication are up-regulated in PDX models while genes in pathways related to signaling transduction, hemostasis and extracellular matrix organization were significantly down-regulated (Figure 4; Additional file2). The genes involved in cell cycle and DNA replication functional groups are mainly related to chromosome segregation and regulation of cell division subgroups. The overexpression of these genes is in agreement with the higher proliferation rates of primary tumors engrafted in mice.
Signal transduction, hemostasis and extracellular matrix organization functional groups encompass down-regulated genes in the PDX models; some genes are even common members in signal transduction and hemostasis functional groups. Interestingly, many of these functions were previously found altered in the same direction in breast PDXs. Therefore, the differential expression of these genes could reflect tumor adaptation to the murine environment. However these down-regulated pathways could also reflect, to some extent, the substitution of human stroma by murine stroma. Indeed, we found that stromal genes are significantly enriched in hemostasis (P-value = 3.1 × 10-5), signal transduction (P-value = 1.6 × 10-7) and extracellular matrix organization (P-value <10-16). Immune genes are enriched in hemostasis (P-value = 0.02) and signal transduction (P-value = 6.5 × 10-3). Our results show that the functional changes detected are affected by the different contribution of hNTI cells (see above). Therefore, these and previous functional analyses of PDXs based on expression arrays should be interpreted with caution.
Many of the altered biological pathways in PDXs are typically deregulated in metastatic tumors compared with primary tumors. In fact, it has been proposed that PDX models mimic aggressive and/or metastatic tumors derived from the original primary tumors[42, 46, 54]. The tumor engraftment could be seen as a 'forced' metastatic situation, because the tumor needs to colonize a new environment to survive. This 'metastasis-like' phenotype of PDXs is coherent with our results, where proliferation is activated and signaling and extracellular matrix proteins change their expression profiles. To analyze this in more detail, we tested if the functional groups altered in PDX models were similarly altered in metastases (compared with primary tumors). We collected two datasets from public databases that contain data on primary pancreatic tumors and pancreatic metastases located in different organs[32, 33]. Considering the metastases in groups according the colonization tissue, we performed GSEA analyses comparing each metastatic group with the corresponding primary tumors. These results are shown in Additional file3. Although some differences were observed between colonization niches and datasets, pathways altered in PDX models were, in general, also significantly altered in metastases in the same direction. As metastatic cell clones represented only part of the primary tumor population, the cells that succeed engraftment in mice may actually represent one of the subpopulations of the primary tumor with higher metastatic potential. Future studies will be needed to better understand the mechanisms involved in the colonization of new niches by tumor cells, both in natural metastases in patients and in artificial PDXs. It will be particularly useful to understand if different locations of colonization may cause differences in gene expression.
Gene expression stability through passages
We analyzed gene expression profiles obtained from PDX models at different passages (Additional file4), where passage refers to each time a tumor has been re-implanted. As each passage could accumulate more expression changes, we tested if the functional changes previously described were more pronounced after a higher number of passages. To this aim, we used a previously published dataset with samples of primary tumor (F0), 5-passage PDXs (F5) and 10-passage PDXs (F10) from four patients. Using these data, we analyzed if the enriched functional groups described in the previous section (cell cycle, DNA replication, signaling transduction, hemostasis and extracellular matrix organization) showed significant differences between passages F5 and F10. Figure 5A shows the expression levels of core genes involved in the functional groups of altered pathways. Using GSEA, we confirmed that all functional groups were altered significantly after engraftment in PDX models (F0 versus F5 and F0 versus F10). However, none of the functional groups showed significant changes between F5 and F10 PDX passages (Figure 5B; Figure S10 in Additional file1).
Taken together, these data suggest that these functional changes occur in the initial adaptation to the new environment after engraftment but no major changes occur afterwards. In other words, the gene expression pattern is stable, in agreement with previous studies in systems such as small-cell lung cancer, uveal melanomas or colorectal cancer.
The mouse niche environment affects tissue xenografted in mice, with some alterations in the transcriptome. The lack of immune and stromal infiltrating cells in PDXs seems to be an important contributor to these changes. Thus, caution is needed when research results using PDX models are translated to patients. Still, the transcriptomes of PDX models of PDACs and HCCs retain important aspects of their tissue of origin.
Our work has three important implications. First, PDAC (and HCC) PDX models retain gene expression similarities with the primary tumors, while cell lines do not. These results are in agreement with the better results in drug sensitivity prediction generally obtained with PDX models[11, 44]. Second, some functional processes are altered in tumor cells after engraftment. Our results also show that metastases and PDX models share some functional alterations regarding primary tumors. Given that metastases frequently occur in pancreas cancer patients, our future work will focus on understanding the similarities and differences between primary and metastatic PDX models. Third, although these pathways show distinct profiles in PDX models compared with original primary tumors, they appear to be stable over passages. In our view, the stability in gene expression in PDAC PDX models over different passages is of major relevance and favors the validity of this preclinical model of disease.
false discovery rate
frozen robust multiarray analysis
Gene Expression Omnibus
gene set enrichment analysis
human non-tumoral infiltrating
multiple correspondence analysis
principal component analysis
pancreas ductal adenocarcinoma
Rubio-Viqueira B, Jimeno A, Cusatis G, Zhang X, Iacobuzio-Donahue C, Karikari C, Shi C, Danenberg K, Danenberg PV, Kuramochi H, Tanaka K, Singh S, Salimi-Moosavi H, Bouraoud N, Amador ML, Altiok S, Kulesza P, Yeo C, Messersmith W, Eshleman J, Hruban RH, Maitra A, Hidalgo M: An in vivo platform for translational drug development in pancreatic cancer. Clin Cancer Res. 2006, 12: 4652-4661. 10.1158/1078-0432.CCR-06-0113
DeRose YS, Wang G, Lin YC, Bernard PS, Buys SS, Ebbert MT, Factor R, Matsen C, Milash BA, Nelson E, Neumayer L, Randall RL, Stijleman IJ, Welm BE, Welm AL: Tumor grafts derived from women with breast cancer authentically reflect tumor pathology, growth, metastasis and disease outcomes. Nat Med. 2011, 17: 1514-1520. 10.1038/nm.2454
Sivanand S, Peña-Llopis S, Zhao H, Kucejova B, Spence P, Pavia-Jimenez A, Yamasaki T, McBride DJ, Gillen J, Wolff NC, Morlock L, Lotan Y, Raj GV, Sagalowsky A, Margulis V, Cadeddu JA, Ross MT, Bentley DR, Kabbani W, Xie XJ, Kapur P, Williams NS, Brugarolas J: A validated tumorgraft model reveals activity of dovitinib against renal cell carcinoma. Sci Transl Med. 2012, 4: 137ra75-
Dranoff G: Experimental mouse tumour models: what can be learnt about human cancer immunology?. Nat Rev Immunol. 2012, 12: 61-66.
Eichhorn PJ, Rodón L, Gonzàlez-Juncà A, Dirac A, Gili M, Martínez-Sáez E, Aura C, Barba I, Peg V, Prat A, Cuartas I, Jimenez J, García-Dorado D, Sahuquillo J, Bernards R, Baselga J, Seoane J: USP15 stabilizes TGF-β receptor I and promotes oncogenesis through the activation of TGF-β signaling in glioblastoma. Nat Med. 2012, 18: 429-435. 10.1038/nm.2619
Huynh H, Soo KC, Chow PK, Panasci L, Tran E: Xenografts of human hepatocellular carcinoma: a useful model for testing drugs. Clin Cancer Res. 2006, 12: 4306-4314. 10.1158/1078-0432.CCR-05-2568
Cook J: Tuveson: Predictive in vivo animal models and translation to clinical trials. Drug Discov Today. 2012, 17: 253-260. 10.1016/j.drudis.2012.02.003
Ruggeri BA, Camp F: Animal models of disease: pre-clinical animal models of cancer and their applications and utility in drug Discovery. Biochem Pharmacol. 2014, 87: 150-161. 10.1016/j.bcp.2013.06.020
Lowy AM, Leach SD: Pancreatic Cancer. 2008, New York, London: Springer
Valencia A, Hidalgo M: Getting personalized cancer genome analysis into the clinic: the challenges in bioinformatics. Genome Med. 2012, 4: 61-
Tentler JJ, Tan AC, Weekes CD, Jimeno A, Leong S, Pitts TM, Arcaroli JJ, Messersmith WA, Eckhardt SG: Patient-derived tumour xenografts as models for oncology drug development. Nat Rev Clin Oncol. 2012, 9: 338-350. 10.1038/nrclinonc.2012.61
Sausville EA, Burger AM: Contributions of human tumor xenografts to anticancer drug development. Cancer Res. 2006, 66: 3351-3354. 10.1158/0008-5472.CAN-05-3627
Sparrow S, Jones M, Billington S, Stace B: The in vivo malignant transformation of mouse fibroblasts in the presence of human tumour xenografts. Br J Cancer. 1986, 53: 793-797. 10.1038/bjc.1986.134
Budhu A, Forgues M, Ye QH, Jia HL, He P, Zanetti KA, Kammula US, Chen Y, Qin LX, Tang ZY, Wang XW: Prediction of venous metastases, recurrence, and prognosis in hepatocellular carcinoma based on a unique immune response signature of the liver microenvironment. Cancer Cell. 2006, 10: 99-111. 10.1016/j.ccr.2006.06.016
Subhi AL, Tang B, Balsara BR, Altomare DA, Testa JR, Cooper HS, Hoffman JP, Meropol NJ, Kruger WD: Loss of methylthioadenosine phosphorylase and elevated ornithine decarboxylase is common in pancreatic cancer. Clin Cancer Res. 2004, 10: 7290-7296. 10.1158/1078-0432.CCR-04-0972
Harada T, Chelala C, Bhakta V, Chaplin T, Caulee K, Baril P, Young BD, Lemoine NR: Genome-wide DNA copy number analysis in pancreatic cancer using high-density single nucleotide polymorphism arrays. Oncogene. 2008, 27: 1951-1960. 10.1038/sj.onc.1210832
Tan AC, Jimeno A, Lin SH, Wheelhouse J, Chan F, Solomon A, Rajeshkumar NV, Rubio-Viqueira B, Hidalgo M: Characterizing DNA methylation patterns in pancreatic cancer genome. Mol Oncol. 2009, 3: 425-438. 10.1016/j.molonc.2009.03.004
Gadaleta E, Cutts RJ, Kelly GP, Crnogorac-Jurcevic T, Kocher HM, Lemoine NR, Chelala C: A global insight into a cancer transcriptional space using pancreatic data: importance, findings and flaws. Nucleic Acids Res. 2011, 39: 7900-7907. 10.1093/nar/gkr533
Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30: 207-210. 10.1093/nar/30.1.207
Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Muertter RN, Holko M, Ayanbule O, Yefanov A, Soboleva A: NCBI GEO: archive for functional genomics data sets–10 years on. Nucleic Acids Res. 2011, 39: D1005-D1010. 10.1093/nar/gkq1184
Jimeno A, Tan AC, Coffa J, Rajeshkumar NV, Kulesza P, Rubio-Viqueira B, Wheelhouse J, Diosdado B, Messersmith WA, Iacobuzio-Donahue C, Maitra A, Varella-Garcia M, Hirsch FR, Meijer GA, Hidalgo M: Coordinated epidermal growth factor receptor pathway gene overexpression predicts epidermal growth factor receptor inhibitor sensitivity in pancreatic cancer. Cancer Res. 2008, 68: 2841-2849. 10.1158/0008-5472.CAN-07-5200
Badea L, Herlea V, Dima SO, Dumitrascu T, Popescu I: Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia. Hepatogastroenterology. 2008, 55: 2016-2027.
Pei H, Li L, Fridley BL, Jenkins GD, Kalari KR, Lingle W, Petersen G, Lou Z, Wang L: FKBP51 affects cancer cell response to chemotherapy by negatively regulating Akt. Cancer Cell. 2009, 16: 259-266. 10.1016/j.ccr.2009.07.016
Maupin KA, Sinha A, Eugster E, Miller J, Ross J, Paulino V, Keshamouni VG, Tran N, Berens M, Webb C, Haab BB: Glycogene expression alterations associated with pancreatic cancer epithelial-mesenchymal transition in complementary model systems. PLoS One. 2010, 5: e13002- 10.1371/journal.pone.0013002
Collisson EA, Sadanandam A, Olson P, Gibb WJ, Truitt M, Gu S, Cooc J, Weinkle J, Kim GE, Jakkula L, Feiler HS, Ko AH, Olshen AB, Danenberg KL, Tempero MA, Spellman PT, Hanahan D, Gray JW: Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy. Nat Med. 2011, 17: 500-503. 10.1038/nm.2344
Huynh H, Chow PKH, Palanisamy N, Salto-Tellez M, Goh BC, Lee CK, Somani A, Lee HS, Kalpana R, Yu K: Bevacizumab and rapamycin induce growth suppression in mouse models of hepatocellular carcinoma. J Hepatol. 2008, 49: 52-60. 10.1016/j.jhep.2008.02.022
Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, Wilson CJ, Lehár J, Kryukov GV, Sonkin D, Reddy A, Liu M, Murray L, Berger MF, Monahan JE, Morais P, Meltzer J, Korejwa A, Jané-Valbuena J, Mapa FA, Thibault J, Bric-Furlong E, Raman P, Shipway A, Engels IH, Cheng J, Yu GK, Yu J, Aspesi P, de Silva M, et al.: The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012, 483: 603-607. 10.1038/nature11003
Liao YL, Sun YM, Chau GY, Chau YP, Lai TC, Wang JL, Horng JT, Hsiao M, Tsou AP: Identification of SOX4 target genes using phylogenetic footprinting-based prediction from expression microarrays suggests that overexpression of SOX4 potentiates metastasis in hepatocellular carcinoma. Oncogene. 2008, 27: 5578-5589. 10.1038/onc.2008.168
Chiang DY, Villanueva A, Hoshida Y, Peix J, Newell P, Minguez B, LeBlanc AC, Donovan DJ, Thung SN, Solé M: Focal gains of VEGFA and molecular classification of hepatocellular carcinoma. Cancer Res. 2008, 68: 6779-6788. 10.1158/0008-5472.CAN-08-0742
Villanueva A, Hoshida Y, Battiston C, Tovar V, Sia D, Alsinet C, Cornella H, Liberzon A, Kobayashi M, Kumada H, Thung SN, Bruix J, Newell P, April C, Fan JB, Roayaie S, Mazzaferro V, Schwartz ME, Llovet JM: Combining clinical, pathology, and gene expression data to predict recurrence of hepatocellular carcinoma. Gastroenterology. 2011, 140: 1501-1512. 10.1053/j.gastro.2011.02.006
Toffanin S, Hoshida Y, Lachenmayer A, Villanueva A, Cabellos L, Minguez B, Savic R, Ward SC, Thung S, Chiang DY, Alsinet C, Tovar V, Roayaie S, Schwartz M, Bruix J, Waxman S, Friedman SL, Golub T, Mazzaferro V, Llovet JM: MicroRNA-based classification of hepatocellular carcinoma and oncogenic role of miR-517a. Gastroenterology. 2011, 140: 1618-1628. 10.1053/j.gastro.2011.02.009
Van den Broeck A, Vankelecom H, Van Eijsden R, Govaere O, Topal B: Molecular markers associated with outcome and metastasis in human pancreatic cancer. J Exp Clin Cancer Res. 2012, 31: 68- 10.1186/1756-9966-31-68
Chaika NV, Yu F, Purohit V, Mehla K, Lazenby AJ, DiMaio D, Anderson JM, Yeh JJ, Johnson KR, Hollingsworth MA, Singh PK: Differential expression of metabolic genes in tumor and stromal components of primary and metastatic loci in pancreatic adenocarcinoma. PLoS One. 2012, 7: e32996- 10.1371/journal.pone.0032996
McCall MN, Bolstad BM, Irizarry RA: Frozen robust multiarray analysis (fRMA). Biostatistics. 2010, 11: 242-253. 10.1093/biostatistics/kxp059
McCall MN, Uppal K, Jaffee HA, Zilliox MJ, Irizarry RA: The Gene Expression Barcode: leveraging public data repositories to begin cataloging the human and murine transcriptomes. Nucleic Acids Res. 2011, 39: D1011-D1015. 10.1093/nar/gkq1259
Greenacre MJ, Blasius J: Multiple Correspondence Analysis and Related Methods. 2006, Boca Raton: Chapman & Hall/CRC
Rausell A, Juan D, Pazos F, Valencia A: Protein interactions and ligand binding: from protein subfamilies to functional specificity. Proc Natl Acad Sci U S A. 2010, 107: 1995-2000. 10.1073/pnas.0908044107
Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, Treviño V, Shen H, Laird PW, Levine DA, Carter SL, Getz G, Stemke-Hale K, Mills GB, Verhaak RGW: Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013, 4: 2612-
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005, 102: 15545-15550. 10.1073/pnas.0506580102
Matthews L, Gopinath G, Gillespie M, Caudy M, Croft D, de Bono B, Garapati P, Hemish J, Hermjakob H, Jassal B, Kanapin A, Lewis S, Mahajan S, May B, Schmidt E, Vastrik I, Wu G, Birney E, Stein L, D'Eustachio P: Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res. 2009, 37: D619-D622. 10.1093/nar/gkn863
Smyth GK, Michaud J, Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics. 2005, 21: 2067-2075. 10.1093/bioinformatics/bti270
Garrido-Laguna I, Uson M, Rajeshkumar NV, Tan AC, de Oliveira E, Karikari C, Villaroel MC, Salomon A, Taylor G, Sharma R, Hruban RH, Maitra A, Laheru D, Rubio-Viqueira B, Jimeno A, Hidalgo M: Tumor engraftment in nude mice and enrichment in stroma- related gene pathways predict poor survival and resistance to gemcitabine in patients with pancreatic cancer. Clin Cancer Res. 2011, 17: 5793-5800. 10.1158/1078-0432.CCR-11-0341
Lukk M, Kapushesky M, Nikkilä J, Parkinson H, Goncalves A, Huber W, Ukkonen E, Brazma A: A global map of human gene expression. Nat Biotechnol. 2010, 28: 322-324. 10.1038/nbt0410-322
Daniel VC, Marchionni L, Hierman JS, Rhodes JT, Devereux WL, Rudin CM, Yung R, Parmigiani G, Dorsch M, Peacock CD, Watkins DN: A primary xenograft model of small-cell lung cancer reveals irreversible changes in gene expression imposed by culture in vitro. Cancer Res. 2009, 69: 3364-3373. 10.1158/0008-5472.CAN-08-4210
Laurent C, Gentien D, Piperno-Neumann S, Némati F, Nicolas A, Tesson B, Desjardins L, Mariani P, Rapinat A, Sastre-Garau X, Couturier J, Hupé P, de Koning L, Dubois T, Roman-Roman S, Stern MH, Barillot E, Harbour JW, Saule S, Decaudin D: Patient-derived xenografts recapitulate molecular features of human uveal melanomas. Mol Oncol. 2013, 7: 625-636. 10.1016/j.molonc.2013.02.004
Park B, Jeong BC, Choi Y, Kwon GY, Lim JE, Seo SI, Jeon SS, Lee HM, Choi HY, Lee K: Development and characterization of a bladder cancer xenograft model using patient-derived tumor tissue. Cancer Sci. 2013, 105: 631-638.
Harrell JC, Dye WW, Harvell DME, Sartorius CA, Horwitz KB: Contaminating cells alter gene signatures in whole organ versus laser capture microdissected tumors: a comparison of experimental breast cancers and their lymph node metastases. Clin Exp Metastasis. 2008, 25: 81-88. 10.1007/s10585-007-9105-7
Bradford JR, Farren M, Powell SJ, Runswick S, Weston SL, Brown H, Delpuech O, Wappett M, Smith NR, Carr TH, Dry JR, Gibson NJ, Barry ST: RNA-Seq differentiates tumour and host mRNA expression changes induced by treatment of human tumour xenografts with the VEGFR tyrosine kinase inhibitor Cediranib. PLoS One. 2013, 8: e66003- 10.1371/journal.pone.0066003
Samuels AL, Peeva VK, Papa RA, Firth MJ, Francis RW, Beesley AH, Lock RB, Kees UR: Validation of a mouse xenograft model system for gene expression analysis of human acute lymphoblastic leukaemia. BMC Genomics. 2010, 11: 256- 10.1186/1471-2164-11-256
Conway T, Wazny J, Bromage A, Tymms M, Sooraj D, Williams ED, Beresford-Smith B: Xenome–a tool for classifying reads from xenograft samples. Bioinformatics. 2012, 28: i172-i178. 10.1093/bioinformatics/bts236
Valdes C, Seo P, Tsinoremas N, Clarke J: Characteristics of cross-hybridization and cross-alignment of expression in pseudo-xenograft samples by RNA-seq and microarrays. J Clin Bioinforma. 2013, 3: 8- 10.1186/2043-9113-3-8
Uronis JM, Osada T, McCall S, Yang XY, Mantyh C, Morse MA, Lyerly HK, Clary BM, Hsu DS: Histological and molecular evaluation of patient-derived colorectal cancer explants. PLoS One. 2012, 7: e38422- 10.1371/journal.pone.0038422
Reyal F, Guyader C, Decraene C, Lucchesi C, Auger N, Assayag F, De Plater L, Gentien D, Poupon M, Cottu P, De Cremoux P, Gestraud P, Vincent-Salomon A, Fontaine J, Roman-Roman S, Delattre O, Decaudin D, Marangoni E: Molecular profiling of patient-derived breast cancer xenografts. Breast Cancer Res. 2012, 14: R11- 10.1186/bcr3095
Stratford JK, Bentrem DJ, Anderson JM, Fan C, Volmar KA, Marron JS, Routh ED, Caskey LS, Samuel JC, Der CJ, Thorne LB, Calvo BF, Kim HJ, Talamonti MS, Iacobuzio-Donahue CA, Hollingsworth MA, Perou CM, Yeh JJ: A six-gene signature predicts survival of patients with localized pancreatic ductal adenocarcinoma. PLoS Med. 2010, 7: e1000307- 10.1371/journal.pmed.1000307
Vanharanta S, Massagué J: Origins of metastatic traits. Cancer Cell. 2013, 24: 410-421. 10.1016/j.ccr.2013.09.007
This work was supported by grants from the Instituto de Salud Carlos III COMBIOMED (RD07/0067/0014), the Spanish MINECO (BIO2012-40205) and USA National Cancer Institute (R01-CA116554 and R01-CA129963). We are indebted to Fatima Al-Shahrour and Simone Marsili for advice on statistical methodology and helpful discussions. We thank Eduardo Andrés and Ángel Carro for technical support.
The authors declare that they have no competing interests.
RM, DR, AV and MH were responsible for conception and design. PPL and MH coordinated the experimental work. NB and CM generated animal models. MM processed the samples for microarray hybridization. RM, AR, DJ and DR were responsible for the development of the analysis protocol. RM, DJ, DR, MH and AV were responsible for interpretation of data. RM and DR drafted the initial manuscript. All authors critically revised and approved the manuscript for publication.
Electronic supplementary material
Additional file 2: Table S1: Core genes from each pathway with the expression frequency value in the PDX models and human primary tumors and presence or absence in each functional group. (XLS 78 KB)
Additional file 4: Table S3: Information about origin of PDXs and microarray hybridization, the publication status of expression data and number of passages of PDX models. (XLS 44 KB)
About this article
Cite this article
Martinez-Garcia, R., Juan, D., Rausell, A. et al. Transcriptional dissection of pancreatic tumors engrafted in mice. Genome Med 6, 27 (2014). https://doi.org/10.1186/gm544
- Uveal Melanoma
- Multiple Correspondence Analysis
- PDAC Cell
- Human Primary Tumor
- Expression Space