Genomic landscape of colorectal cancer in Japan: clinical implications of comprehensive genomic sequencing for precision medicine
- Masayuki Nagahashi†1,
- Toshifumi Wakai†1 (corresponding author),
- Yoshifumi Shimada1,
- Hiroshi Ichikawa1,
- Hitoshi Kameyama1,
- Takashi Kobayashi1,
- Jun Sakata1,
- Ryoma Yagi1,
- Nobuaki Sato2,
- Yuko Kitagawa3,
- Hiroyuki Uetake4,
- Kazuhiro Yoshida5,
- Eiji Oki6,
- Shin-ei Kudo7,
- Hiroshi Izutsu8,
- Keisuke Kodama8,
- Mitsutaka Nakada8,
- Julie Tse9,
- Meaghan Russell9,
- Joerg Heyer9,
- Winslow Powers9,
- Ruobai Sun9,
- Jennifer E. Ring9,
- Kazuaki Takabe10, 11,
- Alexei Protopopov9,
- Yiwei Ling12,
- Shujiro Okuda12 (corresponding author) and
- Stephen Lyle9, 13 (corresponding author)
© The Author(s). 2016
Received: 3 August 2016
Accepted: 1 December 2016
Published: 22 December 2016
Comprehensive genomic sequencing (CGS) has the potential to revolutionize precision medicine for cancer patients across the globe. However, to date large-scale genomic sequencing of cancer patients has been limited to Western populations. In order to understand possible ethnic and geographic differences and to explore the broader application of CGS to other populations, we sequenced a panel of 415 important cancer genes to characterize clinically actionable genomic driver events in 201 Japanese patients with colorectal cancer (CRC).
Using next-generation sequencing methods, we examined all exons of 415 known cancer genes in Japanese CRC patients (n = 201) and evaluated for concordance among independent data obtained from US patients with CRC (n = 108) and from The Cancer Genome Atlas-CRC whole exome sequencing (WES) database (n = 224). Mutation data from non-hypermutated Japanese CRC patients were extracted and clustered by gene mutation patterns. Two different sets of genes from the 415-gene panel were used for clustering: 61 genes with frequent alteration in CRC and 26 genes that are clinically actionable in CRC.
The 415-gene panel is able to identify all of the critical mutations in tumor samples as well as WES, including identifying hypermutated tumors. Although the overall mutation spectrum of the Japanese patients is similar to that of the Western population, we found significant differences in the frequencies of mutations in ERBB2 and BRAF. We show that the 415-gene panel identifies a number of clinically actionable mutations in KRAS, NRAS, and BRAF that are not detected by hot-spot testing. We also discovered that 26% of cases have mutations in genes involved in the DNA double-strand break repair pathway. Unsupervised clustering revealed that a panel of 26 genes can be used to classify the patients into eight different categories, each of which can optimally be treated with a particular combination therapy.
Use of a panel of 415 genes can reliably identify all of the critical mutations in CRC patients, and this CGS-derived information can be used to determine the optimal treatment for patients of all ethnicities.
Keywords: Colorectal cancer, Precision medicine, Ethnicity, Japanese, Comprehensive genomic sequencing, Actionable driver mutation, Hypermutation
Cancer remains the leading cause of death worldwide with colorectal cancer (CRC) among the most common indications, accounting for 700,000 deaths per year. Utilizing next-generation sequencing technology, projects such as The Cancer Genome Atlas (TCGA) and others have profiled genomic changes in several cancer types including CRC [2–9]. The ultimate goal of cancer genome profiling is to enable precision medicine, the tailoring of treatments based on unique genomic changes of each patient’s individual tumor. For instance, the importance of genomic evaluation of RAS and RAF for advanced CRC patients has been widely accepted, since it has been revealed that tumors with RAS or RAF mutations show resistance to anti-EGFR therapies. Initially, mutations in these genes were found to occur in “hot-spots” (i.e. KRAS codon 12, 13, or BRAF V600E) [11–13]; however, whole exome sequencing (WES) has revealed that mutations outside of hot-spots can also influence therapeutic responses [14, 15]. Yet, WES may not be practical in the clinical setting due to its high cost, shallow sequencing depth, and excessive information about variants/genes of unknown significance [16, 17]. Although sequencing studies of CRC have been reported [4, 18–20], tumors from Asian populations have not been the subject of comprehensive evaluation. We now report the results from the analysis of 201 Japanese CRC patients.
Since all of the reported studies examined the mutational spectrum using WES, and WES is clinically expensive and time-consuming, we hypothesized that sequencing a panel of cancer-associated genes would identify essentially all actionable genomic driver mutations and further determine mutational burden in CRC, both of which can enable development of personalized treatment strategies. In the current study, we tested this hypothesis utilizing a 415-gene panel designed for solid tumors at a very high depth of coverage (~500×) in Japanese patients (n = 201 tumors) and evaluated for concordance among independent data obtained from US patients with colon cancer (n = 108 tumors) (J-CRC and US-CRC, respectively) and from the TCGA-CRC WES database (n = 224 tumors). Here, we report that comprehensive genomic sequencing (CGS) with a 415-gene panel can accurately determine high mutation burden (somatic mutation rate) and that there are differences in the frequency of mutations in ERBB2 and BRAF. Hierarchical clustering of clinical data revealed that a subset of 26 genes can classify all of the CRC patients into eight categories, each of which can be effectively treated with available drugs or drugs in development.
Patient cohorts and sample inclusion criteria
A total of 201 patients diagnosed with stage I–IV CRC according to AJCC 7th edition who had curative surgery between 2009 and 2015 at Niigata University Medical and Dental Hospital or Niigata Cancer Center Hospital were enrolled (Additional file 1: Table S4). Patients with familial adenomatous polyposis, inflammatory bowel disease, or synchronous multiple CRCs were excluded.
A total of 108 patients with histologically confirmed diagnosis of primary colorectal adenocarcinoma (stage I–IV) between 2014 and 2016 submitted for CGS as part of routine medical examination were included in this study. All tumor samples that had > 50% tumor content after macrodissection, as determined through routine hematoxylin and eosin (H&E) stain by an independent pathologist, were included. A full waiver of authorization under the Health Insurance Portability and Accountability Act (HIPAA) was granted to enable retrospective analyses for samples obtained without prior consent. All data were de-identified prior to inclusion in this study.
Sequencing library preparation
For Japanese and US patient samples, archival tissue in the form of formalin-fixed, paraffin embedded (FFPE) tumor or unstained tissue sections obtained during routine biopsy and/or resection were used for analysis. An independent pathologist evaluated tumor content on H&E stained slides for each study sample to ensure > 50% tumor content was present. Where applicable, unstained slides were macro-dissected to enrich for tumor content and genomic DNA (gDNA) was extracted using BiOstic FFPE Tissue DNA Isolation Kit (Mo Bio Laboratories, Inc.). All sample prep, CGS, and analytics were performed in a CLIA/CAP-accredited laboratory (KEW Inc; Cambridge, MA, USA).
Comprehensive genomic sequencing
FFPE gDNA (50–150 ng) was converted into libraries and enriched for the 415 genes with CANCERPLEX (KEW Inc.; Cambridge, MA, USA). CANCERPLEX is a clinically validated 415-gene panel enriched for coding regions and selected introns of genes with known association with cancer. Sequencing was performed on the Illumina MiSeq and NextSeq platforms with an average sequencing depth of 500×. Genomic data were then processed through a proprietary bioinformatics platform and knowledge base to identify multiple classes of genomic abnormalities including single nucleotide substitutions (SNPs), small insertions/deletions (indels), copy number variations (CNV), and translocations in ALK, RET, and ROS1. A threshold of 10% allelic fraction was used for SNPs and indels, and thresholds of >2.5-fold (gains) and 0.5-fold (loss) were used for CNVs. To assess the somatic status of mutations in a tumor-only setting, we employed a filtering strategy similar to one recently published, with minor differences. In short, variants were deprioritized if they were present in a combination of the dbSNP, 1000 Genomes, and ExAC databases (at AF > 1%). Next, allele frequencies for each mutation were used to fit a model to determine whether the variant is likely germline heterozygous or somatic. Finally, results underwent manual review by a molecular pathologist to validate the somatic versus possible germline status of each variant. Based on published data and our experience, this approach allows correct discrimination between germline and somatic variants in more than 99% of cases. Mutation burden was determined from non-synonymous SNPs present in the tumor that have a population frequency of < 1% in the dbSNP and 1000 Genomes databases.
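The threshold-based parts of this filtering can be illustrated with a minimal Python sketch. It is not the proprietary pipeline: the `Variant` record, the function names, and the `panel_size_mb` parameter are hypothetical, the panel footprint in megabases is not stated in the text, and the model-fitting and manual review steps are not reproduced. The sketch also assumes that only variants passing the 10% allelic fraction threshold count toward mutation burden.

```python
from dataclasses import dataclass

MIN_ALLELIC_FRACTION = 0.10   # 10% threshold for SNPs and indels (from the text)
MAX_POPULATION_AF = 0.01      # variants seen at AF > 1% in population databases are deprioritized


@dataclass
class Variant:                # hypothetical record, for illustration only
    gene: str
    allelic_fraction: float   # fraction of tumor reads supporting the variant
    population_af: float      # highest frequency reported in the population databases
    nonsynonymous: bool


def is_candidate_somatic(v: Variant) -> bool:
    """Keep variants that reach the allelic fraction threshold and are rare in the population."""
    return v.allelic_fraction >= MIN_ALLELIC_FRACTION and v.population_af <= MAX_POPULATION_AF


def mutation_burden_per_mb(variants, panel_size_mb: float) -> float:
    """Non-synonymous, rare (< 1%) called variants per megabase of sequenced panel."""
    counted = [
        v for v in variants
        if v.nonsynonymous
        and v.allelic_fraction >= MIN_ALLELIC_FRACTION
        and v.population_af < MAX_POPULATION_AF
    ]
    return len(counted) / panel_size_mb


if __name__ == "__main__":
    # Invented example values, not study data.
    calls = [
        Variant("KRAS", 0.35, 0.0, True),
        Variant("APC", 0.05, 0.0, True),    # below the 10% allelic fraction threshold
        Variant("TP53", 0.40, 0.02, True),  # common in the population, deprioritized
        Variant("BRAF", 0.25, 0.0, False),  # synonymous, not counted toward burden
    ]
    print([v.gene for v in calls if is_candidate_somatic(v)])
    print(mutation_burden_per_mb(calls, panel_size_mb=2.0))
```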
Downsampling TCGA mutation data
COAD-READ mutation data for the TCGA-CRC samples (n = 224 samples) were downloaded from the Broad GDAC Firehose website (https://gdac.broadinstitute.org/). Similar to the 415-gene panel bioinformatics pipeline, silent mutations that were not protein altering were removed from the dataset. To compare mutation burden of the 415-gene panel to TCGA WES data, the dataset of SNPs was downsampled to the 415 genes in the panel and the mutation rate determined in the panel was calculated as mutations/Mb. To produce receiver operating characteristic (ROC) curves, genes were selected randomly to produce panels of 400, 300, 200, 100, and 50 genes. Mutation burden was calculated using only CGS panel genes and individual ROC curves were used to evaluate how well mutation burden predicted hypermutated samples. This process was repeated 100 times and average ROC curves were produced at each panel size. In addition, individual ROC curves were produced using all genes and only those genes in KEW’s CANCERPLEX panel.
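The random-panel ROC procedure described above might be sketched as follows. This is an illustration under assumptions, not the authors' code: each sample is represented as a hypothetical mapping from gene to mutation count, the hypermutation labels are taken as given, and the area under the ROC curve is computed with the rank (Mann-Whitney) formulation rather than by tracing the curve. For a fixed panel the raw count ranks samples identically to mutations/Mb, so the count is used directly.

```python
import random


def panel_burden(sample_mutations, panel_genes):
    """Number of mutations in a sample that fall within the given gene panel."""
    return sum(n for gene, n in sample_mutations.items() if gene in panel_genes)


def auc(scores, labels):
    """Area under the ROC curve: probability that a positive outranks a negative (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def mean_auc_for_panel_size(samples, labels, all_genes, size, repeats=100, seed=0):
    """Average AUC over `repeats` random panels of `size` genes drawn from `all_genes`."""
    rng = random.Random(seed)
    gene_list = sorted(all_genes)
    total = 0.0
    for _ in range(repeats):
        panel = set(rng.sample(gene_list, size))
        scores = [panel_burden(s, panel) for s in samples]
        total += auc(scores, labels)
    return total / repeats


if __name__ == "__main__":
    # Invented toy data: the first sample is labeled hypermutated.
    samples = [{"A": 5, "B": 4, "C": 6}, {"A": 0, "B": 1, "C": 0}, {"A": 1, "B": 0, "C": 0}]
    labels = [True, False, False]
    for size in (3, 2, 1):
        print(size, mean_auc_for_panel_size(samples, labels, {"A", "B", "C"}, size))
```

In the study the panel sizes were 400, 300, 200, 100, and 50 genes with 100 repetitions each; the toy sizes here only demonstrate the loop.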
Each single nucleotide variant (SNV) was classified in a matrix of the 96 possible substitutions based on the sequence context comprising the nucleotides 5′ and 3′ to the position of the mutation. Mutational signatures were extracted using non-negative matrix factorization analysis with the SomaticSignatures R package and plotted with the ggplot2 R package (http://ggplot2.org/). This analysis identified complex signatures that differed between hypermutated and non-hypermutated cases. Deconvolution of the complex profiles to identify components matching COSMIC mutational signatures was done using the deconstructSigs R package.
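The 96-category classification can be made concrete with a short Python sketch. It assumes the widely used convention of reporting each substitution from the pyrimidine (C or T) strand, which yields 6 substitution types × 16 flanking contexts = 96 categories; the function names are illustrative, and the non-negative matrix factorization step performed by the R packages is not reproduced.

```python
from collections import Counter

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def classify_snv(five_prime, ref, alt, three_prime):
    """Return a label such as 'T[C>A]T', normalized so the reference base is C or T."""
    if ref in "AG":
        # Purine reference: describe the same event on the opposite strand.
        context = reverse_complement(five_prime + ref + three_prime)
        five_prime, ref, three_prime = context[0], context[1], context[2]
        alt = COMPLEMENT[alt]
    return f"{five_prime}[{ref}>{alt}]{three_prime}"


# All 96 categories: 2 pyrimidine references x 3 alternatives x 4 x 4 flanking bases.
ALL_CHANNELS = [
    f"{left}[{ref}>{alt}]{right}"
    for ref in "CT"
    for alt in "ACGT" if alt != ref
    for left in "ACGT"
    for right in "ACGT"
]


def count_channels(snvs):
    """Tally SNVs given as (5' base, ref, alt, 3' base) tuples into the 96 categories."""
    counts = Counter(classify_snv(*snv) for snv in snvs)
    return {channel: counts.get(channel, 0) for channel in ALL_CHANNELS}


if __name__ == "__main__":
    profile = count_channels([("T", "C", "A", "T"), ("A", "G", "T", "A")])
    print(profile["T[C>A]T"])
```

Both example SNVs map to the same category because a G>T change in an AGA context is the reverse-strand view of C>A in a TCT context, the context the text associates with Signature 10.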
Mismatch repair immunohistochemistry (MMR-IHC)
Immunohistochemistry (IHC) staining was performed on the 40 samples of Japanese CRC with the highest mutation rates. Slides were stained for four mismatch repair (MMR) proteins, MLH1 (clone G168-15), MSH2 (clone FE11), MSH6 (clone BC/44), and PMS2 (clone A16-4), and were scored by two pathologists. For US clinical cases, clinical records were reviewed and results of MMR studies were recorded when available.
Mutation analysis and visualization
Genomic data for Japanese (n = 201) and US patients (n = 108) obtained from CGS were mined in OncoPrinter (www.cbioportal.org). Pathway genes were selected from previously published TCGA data and restricted to those included in the 415-gene panel. For TCGA analyses, genomic profiles were selected in cBioPortal for mutations and putative copy-number alterations from GISTIC for which tumor sequence data are available (n = 224). For each pathway, the number of total uniquely altered cases was determined. Statistical significance was determined by a two-tailed Fisher’s exact test with a 95% confidence interval. For dsDNA break repair pathway analysis, the statistical significance of the Japanese and US datasets was determined by comparison with TCGA.
To align mutations with their protein domains, genomic data for Japanese, US, and TCGA datasets were analyzed in Mutation Mapper (www.cbioportal.org). Lollipop figures were generated for select genes implicated in colorectal adenocarcinoma. For BRAF and KRAS, data were further segregated by hypermutation status (hypermutated versus non-hypermutated).
Gene clustering analysis
Mutation data from non-hypermutated J-CRC patients (n = 184 tumors) were extracted and clustered by gene mutation patterns. Two different sets of genes from the 415-gene panel were used for clustering: (1) 61 genes with frequent alteration in CRC; and (2) 26 genes that are clinically actionable in CRC. For this analysis, KRAS and NRAS were integrated into one gene as a RAS.
The number of mutated genes shared by donors i and j was represented as an element cij of an N × N matrix, where N is the number of non-hypermutated donors. To normalize the elements of this symmetric N × N matrix to values between 0 and 1, each original element was replaced by 1 / (cij + 1), which served as the measure of similarity between donors i and j. Under this normalization, donors sharing more mutated genes receive smaller values and are therefore more likely to fall into the same or a closely related group. Consequently, a matrix of normalized values between all donors was created. Hierarchical clustering of the matrix was performed with Euclidean distance and Ward’s method to classify donors into groups with different mutated-gene patterns. For the 26-gene set, donors were divided into eight groups based on the hierarchically clustered dendrogram, which clearly distinguished donors by their mutated-gene patterns. For the 61-gene set, donors were divided into 17 groups. Clustering was performed in R (https://www.r-project.org/).
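The construction of the normalized donor matrix can be sketched in Python as below. The donor identifiers and gene sets are invented for illustration, the handling of the diagonal (a donor compared with itself) is an assumption because the text does not specify it, and the subsequent Euclidean/Ward clustering, which the authors ran in R, is not reproduced.

```python
def merge_ras(genes):
    """Treat KRAS and NRAS as a single 'RAS' gene, as described for this analysis."""
    return {"RAS" if g in ("KRAS", "NRAS") else g for g in genes}


def shared_gene_matrix(donor_genes):
    """c[i][j] = number of mutated genes shared by donors i and j (symmetric N x N)."""
    donors = sorted(donor_genes)
    merged = {d: merge_ras(donor_genes[d]) for d in donors}
    matrix = [[len(merged[a] & merged[b]) for b in donors] for a in donors]
    return donors, matrix


def normalize(matrix):
    """Replace each element c by 1 / (c + 1); more shared genes give smaller values."""
    return [[1.0 / (c + 1) for c in row] for row in matrix]


if __name__ == "__main__":
    # Hypothetical donors, not study data.
    donor_genes = {
        "d1": {"APC", "TP53", "KRAS"},
        "d2": {"APC", "NRAS"},
        "d3": {"BRAF"},
    }
    donors, counts = shared_gene_matrix(donor_genes)
    for donor, row in zip(donors, normalize(counts)):
        print(donor, [round(v, 3) for v in row])
```

Each row of the normalized matrix then acts as the feature vector of one donor when Euclidean distances are computed for the dendrogram.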
Model selection of clustering
Clustering stability was evaluated with the R package clValid for statistical and biological validation of clustering results (https://cran.r-project.org/web/packages/clValid/index.html). This method produces four stability measures: APN (average proportion of non-overlap), AD (average distance), ADM (average distance between means), and FOM (figure of merit). For each index, a lower value means higher stability. We evaluated clustering stability for combinations of different numbers of clusters obtained by cutting the dendrogram (2–12 for the 26-gene set and 2–24 for the 61-gene set), distance methods (“Euclidean,” “maximum,” “manhattan,” “canberra,” and “minkowski”), and clustering methods (“ward.D,” “ward.D2,” “single,” “complete,” “average,” “mcquitty,” “median,” and “centroid”). All combinations of these three parameters were evaluated and the parameters with the lowest values of each stability index were extracted. Of these, the parameter sets common to the four stability indices with relatively low values were selected. The most appropriate cluster number, distance method, and clustering method were determined from the resulting parameter settings, taking into account that the number of donors assigned to clusters of more than five donors should be maximized as far as possible and that the primary mutated genes should be clear. The final selected parameter settings were the Euclidean distance method and ward.D clustering for both sets, with eight clusters for the 26-gene set and 17 clusters for the 61-gene set.
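The size of this parameter search can be illustrated with a small Python sketch that enumerates the grid and picks the lowest-scoring setting per stability index. The `scores` mapping is a hypothetical stand-in for values that clValid would report; the sketch does not call clValid or compute any stability measure itself.

```python
from itertools import product

DISTANCES = ["euclidean", "maximum", "manhattan", "canberra", "minkowski"]
METHODS = ["ward.D", "ward.D2", "single", "complete", "average",
           "mcquitty", "median", "centroid"]
INDICES = ["APN", "AD", "ADM", "FOM"]


def parameter_grid(min_clusters, max_clusters):
    """All (cluster number, distance, linkage method) combinations to evaluate."""
    return list(product(range(min_clusters, max_clusters + 1), DISTANCES, METHODS))


def best_per_index(scores):
    """For each stability index, return the parameter set with the lowest value.

    `scores` maps a parameter tuple to a dict of index name -> value (hypothetical input).
    """
    return {idx: min(scores, key=lambda params: scores[params][idx]) for idx in INDICES}


if __name__ == "__main__":
    print(len(parameter_grid(2, 12)))   # grid size for the 26-gene set
    print(len(parameter_grid(2, 24)))   # grid size for the 61-gene set
```

With 5 distance methods and 8 clustering methods, the grid holds 11 × 40 = 440 settings for the 26-gene set and 23 × 40 = 920 for the 61-gene set.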
Statistical analysis of clinical information
To estimate associations between mutated-gene patterns and clinical information such as sex, rectum/colon, and left/right, a two-tailed Fisher’s exact test was applied in each cluster. Additionally, in order to explore associations between mutated-gene patterns and tumor aggressiveness, seven clinical variables were dichotomized into less or more aggressive categories in the following manner: lymphatic invasion (absence/presence), vascular invasion (absence/presence), histopathological grade (G1/G2 or G3), size of primary tumor (T1/T2 or T3/T4), spread to regional lymph node (N0 or N1/N2), distant metastasis (M0 or M1), and tumor stage (I/II or III/IV). In each cluster, a two-tailed Fisher’s exact test was applied to all clinical categories by comparing the distribution in a cluster group to that of all the donors in the other groups. Note that for the 17 hypermutated donors, the two-tailed Fisher’s exact test was conducted against the 184 non-hypermutated donors as a reference set.
Patients were followed every 1–6 months at outpatient clinics. Medical records and survival data were obtained for all 104 Stage IV CRC patients. Among them, 46 patients received anti-EGFR therapies. Seven of the 46 patients, who had undergone surgical resection, were excluded and 39 patients were included for the analysis of clinical outcomes. Tumor assessments at baseline included a computed tomography (CT) scan of the abdomen as well as of other relevant sites of the disease. Follow-up scans to assess response were obtained after cycles 1 and 2 and every two cycles thereafter. Responses were determined using RECIST 1.0. Six patients who showed progressive disease before the first RECIST assessment were excluded and 33 patients were included for waterfall plot analysis. The best calculated responses on the basis of measurable lesions were analyzed by waterfall plot.
The follow-up period for progression-free survival was defined as the interval between the date of diagnosis of metastatic disease and that of disease progression. Survival curves were constructed using the Kaplan–Meier method and differences in survival were evaluated using the log-rank test. Three of the 39 patients were excluded from the cluster-based Kaplan–Meier analysis, since each of the three was the only patient classified into a different subtype. All statistical evaluations were performed using the SPSS 22 software package (SPSS Japan Inc., Tokyo, Japan). All tests were two-sided and a P value < 0.05 was considered statistically significant.
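For readers unfamiliar with the Kaplan–Meier method, a minimal product-limit estimator is sketched below in Python. The study itself used SPSS; the follow-up times and event flags here are invented, and the log-rank comparison between subgroups is not included.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of survival.

    times:  follow-up time for each patient (e.g. months to progression or censoring)
    events: 1 if progression was observed at that time, 0 if the patient was censored
    Returns a list of (time, survival probability) at each time with at least one event.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        same_time = [e for tt, e in data if tt == t]
        n_events = sum(1 for e in same_time if e)
        if n_events:
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        # Patients with an event or censored at t leave the risk set afterwards.
        at_risk -= len(same_time)
        i += len(same_time)
    return curve


if __name__ == "__main__":
    # Hypothetical progression-free survival data, not from the study.
    for t, s in kaplan_meier([2, 3, 3, 5, 8], [1, 1, 0, 1, 0]):
        print(t, round(s, 3))
```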
While conducting the two-tailed Fisher’s exact test as above, the statistical power of each test was also estimated with the R package statmod (https://cran.r-project.org/web/packages/statmod/index.html). Some clinical categories showing significant differences (p < 0.05) had insufficient power (power < 0.8). Power is known to depend on sample size; in other words, the power of a test can be increased by adjusting the sample size. Therefore, for these significant but low-power contingency tables, we predicted the number of donors needed to reach a sufficient power level under the premise that the hypothetical cross-tabulations had the same cell percentages as those of the 184 non-hypermutated donors. The prediction was performed for sample sizes in the range of 20–500 in increments of ten donors, and both the P value and the power of Fisher’s exact test were calculated for the assumed contingency table at each step. In this way, a minimum effective number of non-hypermutated donors was obtained, and this sample size could serve as a reference for future studies. The statistical power calculation and prediction for the above-mentioned Fisher’s exact test were simulated 1000 times for each cross-tabulation.
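A simulation-based power estimate and the stepwise sample-size search might look like the Python sketch below. It is a rough analogue of the procedure described, not the statmod implementation: the proportions `p1` and `p2`, the `cluster_fraction` parameter, and the function names are illustrative assumptions, and the P value at each assumed table is not tracked here.

```python
import random
from math import comb


def fisher_p(a, b, c, d):
    """Two-tailed Fisher's exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    low, high = max(0, col1 - row2), min(row1, col1)
    return min(1.0, sum(p for p in map(prob, range(low, high + 1))
                        if p <= p_obs * (1 + 1e-7)))


def simulated_power(p1, p2, n1, n2, alpha=0.05, nsim=1000, seed=0):
    """Fraction of simulated tables (binomial draws per group) with p < alpha."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(nsim):
        x1 = sum(rng.random() < p1 for _ in range(n1))
        x2 = sum(rng.random() < p2 for _ in range(n2))
        if fisher_p(x1, n1 - x1, x2, n2 - x2) < alpha:
            hits += 1
    return hits / nsim


def minimum_donors(p1, p2, cluster_fraction, target=0.8, nsim=1000):
    """Smallest total in 20..500 (step 10) reaching the target power, keeping group proportions."""
    for total in range(20, 501, 10):
        n1 = max(1, round(total * cluster_fraction))
        n2 = total - n1
        if simulated_power(p1, p2, n1, n2, nsim=nsim) >= target:
            return total
    return None


if __name__ == "__main__":
    # Hypothetical proportions of the "more aggressive" category in a cluster vs the rest.
    print(simulated_power(0.6, 0.3, 30, 150))
    print(minimum_donors(0.6, 0.3, cluster_fraction=0.2))
```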
Gene-based statistical analysis
To estimate associations between genes and tumor aggressiveness, we performed Fisher’s exact test for each gene in seven clinical categories. Subsequently, genes that were significant in at least one clinical category (p < 0.05) were extracted. A matrix between the genes and the clinical categories was created based on the log odds ratio for the extracted genes. Finally, the matrix was clustered by Euclidean distance and Ward’s method. In this clustering, positive and negative infinity values were replaced by 4 and −4 as pseudonumbers, respectively.
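The replacement of infinite log odds ratios by ±4 can be illustrated with a short Python sketch. The natural logarithm and the treatment of the undefined 0/0 case as 0 are assumptions, since the text specifies neither; the cell labels are likewise illustrative.

```python
import math


def log_odds_ratio(a, b, c, d, cap=4.0):
    """Log odds ratio for a gene in one clinical category.

    a: mutated, more aggressive      b: mutated, less aggressive
    c: wild-type, more aggressive    d: wild-type, less aggressive
    Infinite values are replaced by +cap / -cap, as described in the text.
    """
    numerator, denominator = a * d, b * c
    if numerator == 0 and denominator == 0:
        return 0.0            # assumption: an undefined ratio is treated as neutral
    if denominator == 0:
        return cap            # +infinity replaced by 4
    if numerator == 0:
        return -cap           # -infinity replaced by -4
    return math.log(numerator / denominator)


if __name__ == "__main__":
    # Hypothetical counts, not study data.
    print(log_odds_ratio(5, 0, 3, 10))
    print(log_odds_ratio(0, 5, 3, 10))
    print(round(log_odds_ratio(4, 1, 1, 4), 3))
```

Finite values are left unclipped here, because the text only describes substituting the infinite ones.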
Genomic alterations in cancer signaling pathways
Given the recent recognition that tumors with DNA double-strand break repair defects (most notably BRCA1/2 mutations) are more sensitive to PARP inhibitors and the recent approval of olaparib for advanced ovarian cancer, we undertook a comprehensive analysis of the DNA double-strand break repair pathway. Currently BRCA1/2 mutation status alone is used to identify patients for olaparib treatment; however, mutations in other genes can lead to DNA double-strand break repair defects [28, 29]. Therefore, those genes may also be useful in determining olaparib sensitivity. Excluding TP53, which is not used for selection of PARP inhibitors, we analyzed the five DNA repair pathway genes that are most commonly mutated in Japanese and US patients and compared with TCGA samples (Fig. 1d and e). We found genomic alterations in all five DNA repair genes, including BRCA2, which represent a significant proportion of CRC patients (26% of Japanese, 21% of US, and 19% of TCGA samples).
Mutation rates detected by targeted sequencing with cancer gene panel
We further explored the utility of CGS to provide clinically meaningful patterns of mutational signatures from the J-CRC cohort (Fig. 2e). Based upon the signatures described in COSMIC (http://cancer.sanger.ac.uk/cosmic), we found that Signatures 20 and 26 contributed the largest proportion of total somatic SNVs and were similar to previous findings. Both signatures were associated with defective DNA repair. Interestingly, only in the hypermutated cases did we identify Signature 10 (C > A SNVs at TpCpT context), previously shown to correlate with altered activity of DNA polymerase epsilon (termed “ultra-hypermutators” by COSMIC). Indeed, we determined that the two cases with the highest mutation burdens were MMR-intact with mutations in their POLE gene: V411L in the exonuclease (proofreading) domain in one case and P286R in the polymerase domain in the other, demonstrating the capacity of CGS to identify clinically useful mutational signatures.
Genomic evaluation of key driver genes
Recent updates in clinical guidelines, in both Japan and the US, have made the genomic evaluation of KRAS, NRAS, and BRAF essential for treatment planning. Most mutations in these genes cluster in “hot-spots” (i.e. KRAS codon 12, 13; NRAS codon 61; BRAF codon 600); however, data from large full-gene sequencing projects have identified additional mutations outside these hot-spots (e.g. KRAS codon 22, 33, 59, etc.). We compared the distribution of somatic mutations across these key genes between the Japanese and US cohorts and with the TCGA (Fig. 2f–h, Additional file 1: Figure S1). While the KRAS mutation patterns in different cohorts appeared similar, BRAF mutation patterns presented key differences. BRAF mutations present in TCGA-CRC samples were predominantly represented by V600E, which is often restricted to hypermutated tumors, in agreement with previous reports [35–37]. The TCGA database shows that BRAF mutations in non-hypermutated tumors were also significantly more frequent in right-sided tumors. In contrast to previous studies, both Japanese and US-CRC cases had a wide range of non-V600E mutations inside and outside the kinase domain including D594G, a kinase-dead BRAF that can drive tumor progression through interactions with CRAF. In addition, BRAF mutations were found in both left-sided and right-sided tumors (Additional file 1: Table S2). This finding may suggest unique therapeutic strategies for not only right-sided, but also left-sided tumors that were enriched for alternate BRAF mutations. Consistent with previous findings in TCGA-CRC cases, we found APC and RNF43 truncating mutations to be mutually exclusive in J-CRC and in US-CRC (Fig. 1) with significant enrichment of RNF43 alterations, particularly G659 mutations, in MMR-deficient tumors (Additional file 1: Figure S2). Analysis of additional key driver genes showed similar patterns of mutation between Japanese, US, and TCGA cohorts (Additional file 1: Figure S1).
Similar to TCGA results, no gene fusions were found in well-characterized driver genes ALK, RET, or ROS1.
Genomic alterations and tumor aggressiveness
Unlike earlier genomic profiling studies, this study also included clinical outcomes data that were used to determine the relationship between mutation profile and patient outcomes. CRC is a clinically diverse disease and it has long been considered that genomic heterogeneity is vital to understanding this diversity. Tumors can be classified by degree of lymphatic invasion, vascular invasion, histopathological grade, TNM classifications, and tumor stage. We therefore examined the association between gene alterations and clinical features. Among the 415 genes, we found that genes significantly enriched in at least one category (p < 0.05) were distinctly classified into more aggressive or less aggressive groups (Additional file 1: Figure S3 and Table S3). For example, mutations in genes such as PTEN, SMAD2, TGFB2, and SRC, implicated in epithelial-mesenchymal transition, metastasis, and cancer progression [40, 41], were enriched in more aggressive groups while the other genes clustered in the less aggressive groups.
Cluster analysis for Japanese CRC mutations
Several approaches to identify genomic subtypes have been proposed to correlate genomic landscape with clinical features in CRC. Despite differing methods of classification, the hypermutated subtype has commonly emerged across various genomic profiling efforts. In agreement with these findings, we identified a subgroup of 17 Japanese patients with hypermutated tumors as characterized by CGS (Fig. 1). We therefore performed hierarchical clustering of mutations in a subset of genes frequently altered in CRC (n = 61 genes) in the Japanese cohort of non-hypermutated patients (n = 184 tumors) to further assess the association between gene alterations and clinical features in CRC (Additional file 1: Figure S4). We identified that all patients can be classified into 12 typical clusters (Additional file 1: Figure S4). We further examined associations between each of these clusters and clinicopathological features, such as sex, tumor location, and pathologic stage (Additional file 1: Figure S4B). Of note, patients in Cluster 7 (n = 49 tumors), whose primary mutated genes were APC and TP53, were significantly associated with left-sided location (p < 0.01), less lymph node metastasis (p < 0.05), and less distant metastasis (p < 0.05) compared with patients in all other clusters (Additional file 1: Figure S4B). These findings suggest that there are clear associations between mutation spectrum and clinical characteristics of Japanese CRC patients.
Outcome of Stage IV CRC patients and clinical potential of cluster analysis based on CGS platform
In the current study, we performed CGS with a 415-gene panel to probe actionable driver mutations at a very high depth of coverage in the largest series of Japanese patients (n = 201 tumors) and evaluated for concordance among independent data obtained from US patients with colon cancer (n = 108 tumors) and from the TCGA-CRC WES database (n = 224 tumors). We identified overall similarities and some distinct population differences in detecting clinically actionable oncogenic driver events. We correlated mutation burden with DNA mismatch repair status, obtained clear genomic mutational signatures, and identified genomic alteration patterns in Japanese and US CRC patients similar to those previously identified by WES in the TCGA. We also found statistically significant increases in ERBB2, APC, TP53, and NRAS mutations in Japanese patients as compared with US patients, which may reflect epidemiological differences between the two populations. Interestingly, we found that 11 of 24 BRAF mutations occurred outside the hot-spot V600E. Since mutations other than V600E are known to be activating, our results underscore the importance of sequencing all BRAF exons to assess the optimal therapeutic approach. Moreover, we report here a novel, significant correlation between APC and TP53 mutations and tumors presenting on the left side, emphasizing the utility of CGS as an invaluable resource for better understanding the genomic landscape of CRC.
To explore the clinical potential of CGS, we performed cluster analysis with the set of clinically actionable genes in CRC (n = 26 genes) related to targeted therapies either approved or in late-phase development in Japan and obtained eight typical subgroups in addition to the “hypermutated” subgroup. CRC patients in the “hypermutated” subgroup are expected to benefit most from treatment with immune checkpoint inhibitors. Patients in the “all wild-type” cluster (Cluster 1) may respond best to anti-EGFR therapies, such as cetuximab and panitumumab, given the lack of contraindicated KRAS mutations. However, patients in Clusters 2–5 had driver mutations downstream of the EGFR pathway, suggesting resistance to anti-EGFR therapies and hence better response to therapies targeting PIK3CA, ERBB2, RNF43/BRAF, or PTEN. Patients in Clusters 6–8 had KRAS mutations and therefore may benefit from chemotherapy + bevacizumab given their expected resistance to anti-EGFR therapy. Thus, these findings underscore the clinical potential of examining a smaller (26-gene) panel, by which we could identify suitable targeted therapies based on the clustering of actionable gene mutations.
Given the clinical significance of hot-spot KRAS mutations (codons 12 and 13) for anti-EGFR therapy resistance in patients with advanced CRC, KRAS mutation testing has become mandatory in Japanese patients before administering anti-EGFR therapy. Indeed, most of the patients treated with anti-EGFR therapies in this study had been found not to have hot-spot KRAS mutations (codons 12 and 13) and were thus considered KRAS wild-type, except for a few patients who had been treated before testing became required. Recent studies have identified alterations in genes downstream of EGFR (RTKs and RAS pathway) in addition to hot-spot KRAS mutations as likely indicators of primary and secondary resistance to anti-EGFR antibody therapies. We therefore probed the clinical relevance of gene alterations in RTKs and the RAS pathway in addition to KRAS mutations as identified by CGS in Japanese CRC patients. Interestingly, there were three patients with progressive disease on anti-EGFR therapy, and CGS revealed that two of the three had previously unidentified mutations downstream of EGFR, emphasizing that hot-spot testing alone is inadequate for guiding therapeutic strategies. Moreover, Kaplan–Meier analysis demonstrated that patients in the subgroup without alterations in RTKs and the RAS pathway showed significantly better progression-free survival than patients in subgroups with mutations, although most of the patients had previously been considered KRAS wild-type. Taken together, we have demonstrated that CGS captures broad actionable genomic driver mutations in Japanese patients with advanced CRC, satisfying a currently unmet critical need to better guide personalized therapeutic approaches in Japan.
We demonstrate concordance of CGS between Japanese and US patients with CRC and with WES in the TCGA database. We further illustrate how CGS testing captures broad actionable genomic driver mutations as well as high mutational burden and highlight its potential to impact clinical outcomes of patients. These findings emphasize the clinical potential of CGS for patients with CRC in Japan and warrant further clinical investigation through prospective randomized clinical trials to confirm the application.
CGS: Comprehensive genomic sequencing
CNV: Copy number variation
FFPE: Formalin-fixed, paraffin embedded
Mismatch repair deficiency
SNV: Single nucleotide variant
TCGA: The Cancer Genome Atlas
WES: Whole exome sequencing
We thank our colleagues Karen Dresser, Yosuke Tajima, Takuma Okamura, Hitoshi Nogami, Satoshi Maruyama, Yasumasa Takii, Takashi Kawasaki, and Keiichi Homma.
This project was supported by funding from Denka Co., Ltd. M. Nagahashi is supported by the Japan Society for the Promotion of Science (JSPS) Grant-in-Aid for Scientific Research Grant Numbers 15H05676 and 15K15471, the Uehara Memorial Foundation, Nakayama Cancer Research Institute, Takeda Science Foundation, and Tsukada Medical Foundation. T. Wakai is supported by the JSPS Grant-in-Aid for Scientific Research Grant Numbers 15H04927 and 16K15610. S. Okuda is supported by the JSPS Grant-in-Aid for Scientific Research Grant Number 26700029. K. Takabe is supported by NIH/NCI grant R01CA160688 and Susan G. Komen Investigator Initiated Research Grant IIR12222224. S. Lyle is supported by a grant from the Massachusetts Life Sciences Center.
Availability of data and materials
The datasets generated during and/or analyzed during the current study are not publicly available due to data and privacy protection considerations but may be available on justified request. The raw data (COAD-READ) for the 224 TCGA samples were downloaded from the Broad GDAC Firehose website (https://gdac.broadinstitute.org/).
M. Nagahashi, TW, SO, and SL were project leaders. M. Nagahashi, TW, YS, SO, JER, KT, AP, and SL wrote the manuscript. H. Ichikawa, HK, TK, JS, RY, NS, YK, HU, KY, EO, and SK analyzed clinical data. H. Izutsu, KK, M. Nakada, JT, MR, JH, WP, RS, JER, AP, YL, SO, and SL examined genomic alterations and pathways. YL and SO performed bioinformatics analysis. All authors read and approved the final manuscript.
H. Izutsu, KK, and M. Nakada are employees of Denka Co., Ltd. M. Nakada holds stock in Denka Co., Ltd. JT, MR, JH, RS, JER, AP, and SL are employees of KEW Inc., who have been granted stock options by KEW Inc. The remaining authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Collection and use of all specimens in this study were approved by the Institutional Review Boards of Niigata University Graduate School of Medical and Dental Sciences (#772) and Niigata Cancer Center Hospital (#642). The protocol for this study was approved by the Western Institutional Review Board in the US. Informed consent was obtained from all participants.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.