Visualizing multidimensional cancer genomics data
© BioMed Central Ltd 2013
Published: 31 January 2013
Cancer genomics projects employ high-throughput technologies to identify the complete catalog of somatic alterations that characterize the genome, transcriptome and epigenome of cohorts of tumor samples. Examples include projects carried out by the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). A crucial step in the extraction of knowledge from the data is the exploration by experts of the different alterations, as well as the multiple relationships between them. To that end, the use of intuitive visualization tools that can integrate different types of alterations with clinical data is essential to the field of cancer genomics. Here, we review effective and common visualization techniques for exploring oncogenomics data and discuss a selection of tools that allow researchers to effectively visualize multidimensional oncogenomics datasets. The review covers visualization methods employed by tools such as Circos, Gitools, the Integrative Genomics Viewer, Cytoscape, Savant Genome Browser, StratomeX and platforms such as cBio Cancer Genomics Portal, IntOGen, the UCSC Cancer Genomics Browser, the Regulome Explorer and the Cancer Genome Workbench.
Oncogenomics data and their dimensions
Cancer genomics benefits from high-throughput technologies that allow the comparison of the genomic sequences, epigenomic profiles, and transcriptomes of tumor cells with those of normal cells. These technologies often characterize different types of somatic alterations (or variations) in a tumor cell population that are absent from normal cells - including copy number alterations (CNAs), mutations, gene expression changes and methylation changes [1–4]. Together, these somatic alterations constitute multidimensional oncogenomics datasets that describe the variations that coexist in common elements (for example, the genes) of the genome (or transcriptome) of a particular cohort of tumor cells. Such data are currently being used to identify cancer-driver genes and pathways, to discover molecular targets for new therapies, and to define molecular profiles that characterize clinically meaningful patient categories. An array of analytical methods are currently used to exploit the information contained within this multidimensional layout [5–12].
Tools and resources for visualizing multidimensional cancer genomics data
Data that can be visualized
cBio Cancer Genomics Portal 
Resource for visualizing TCGA and other data sets with many features, of which the network viewer and OncoPrint are of special interest. In the network viewer, the portal overlays multidimensional genomics data onto all nodes that are representing genes. This provides the frequency of mutations and copy number alterations (and optionally, mRNA up-/downregulation). OncoPrint shows the same alteration data in a matrix heatmap
Pre-calculated TCGA and other data sets
Tool that produces heatmaps with a circular layout. Different data sets coming from the same samples can be plotted as different layered circles that form a node. The data layers are plotted maintaining the sample order, which can be adjusted by the user
Command line application
Any user-prepared data
Tool for visualizing data and information in a circular layout. It allows intuitive exploration of the relationships between genomic positions, which are depicted as ribbons. Different genomic data types can be represented in different layers of the circle. To a great extent, the color code and plot style for each layer (or data set) can be adjusted by the user
Circular genomic coordinates
Command line application
Any user-prepared data
Tool prepared for the visualization of interdependencies between multiple datasets. It allows exploration of relationships between multiple groupings and different datasets. It can cluster genomics data of different alterations and represents them as matrix heatmaps. The different groupings are connected by ribbons whose width corresponds to the number of samples shared by the connected clusters. Clinical data and pathway maps can be integrated to characterize the clusters
Matrix heatmap with option to visualize pathway maps
Desktop application (Java)
Any user-prepared data (matrices, clusterings). Prepared TCGA data available at http://compbio.med.harvard.edu/tcga/stratomex
Software for visualizing complex networks and integrating these with any type of attribute data such as genomics data and clinical patient information. An extensive library of community-developed plugins is available, some of which (for example, Reactome FIs) focus on cancer data analysis 
Desktop application (Java)
The stand-alone application supports any user-prepared network or attribute data. Additional data are available via various plugins (for example, GeneMANIA  for networks)
Genomica  http://genomica.weizmann.ac.il
Tool that can be used to analyze and visualize genomic data. Data can be visualized as heatmaps or along genomic coordinates. Module maps and module networks can be created from expression data and can integrate gene expression data, DNA sequence data, and gene and experiment annotations
Matrix heatmap Genomic coordinates
Desktop application (Java)
Tool for analysis and visualization of genomic data using interactive heatmaps. It allows loading of multidimensional matrices (with several values per cell), and thus is very well suited for the visualization and exploration of multidimensional cancer genomics data. It contains several analyses and options that are specifically designed for the exploration of cancer genomics data
Matrix heatmap with interactive features
Desktop application (Java)
Visualization tool for interactive exploration of integrated genomics datasets, with a focus on good performance when working with large data sets. All tracks can be annotated with color-coded sample and clinical information; genomic regions can be annotated with text labels. All of the common genomic file formats are supported, including array-based data, next-generation sequence data formats and genomic annotations
Desktop application (Java)
User-prepared data and data from the IGV server, including some TCGA data. In addition, IGV can be accessed from external tools such as GenePattern 
Resource that is used to analyze and visualize cancer genomics data, including expression, copy number variation and somatic mutation data from cancer genomic projects. Various visualization options are offered, of which web-interactive heatmaps (using jheatmap ) are of special interest. These are used to display alterations per gene in a cohort of tumor samples or in a set of tumor types
Matrix heatmaps with interactive features
Pre-calculated data from more than 300 cancer genomic experiments and user-prepared data for somatic mutations in tumors
Tool for visualizing and analyzing protein-protein interaction networks (Network Analysis, Visualization and Graphing TORonto). The network visualization options can be customized to represent genomic data properties by automatically mapping attribute values to visual properties
Desktop application (Java)
Tool for the integrative exploration of associations between clinical and molecular features of data from the TCGA project. The visualization is interactive and the displayed data can be filtered according to different criteria. Visualization options include circular and linear genomic coordinates and networks
Circular and linear genomic coordinates
Pre-calculated TCGA data
Desktop visualization and analysis browser for genomics data. This tool was primarily developed for the effective visualization of large sets of high-throughput sequencing data, similar to IGV. Multiple visualization modes enable the exploration of genome-based sequence, points, intervals, or continuous datasets. Plugins are available, amongst which is the WikiPathways  plugin, which aids the navigation of the data by the integration of pathways
Desktop application (Java)
User-prepared data or data that can be downloaded through plugins such as the USCS Explorer plugin
The Cancer Genome Workbench (CGWB) 
Host for mutation, copy number, expression, and methylation data from a number of projects. It has tools for visualizing sample-level genomic and transcription alterations in various cancers. The main viewers in CGWB are Integrated tracks view, Heatmap view and Bambino, an alignment viewer. The interface of CGWB is based on the UCSC Genome Browser 
The user can also add custom data tracks for visualization
UCSC Cancer Genomics Browser 
Tool for hosting, visualizing, and analyzing cancer genomics datasets. The browser can display genome-wide experimental measurements for multiple samples, which can originate from multiple data sets alongside their associated color-coded clinical information. The browser provides interactive views of data from genomic regions to annotated biological pathways and user-contributed collections of genes. Integrated statistical tools provide quantitative analysis within all available datasets
TCGA data and data from independent publications available from the UCSC server. In addition to open access to public datasets, the browser provides controlled access to private project data
To aid our review of the tools, we describe four case studies that illustrate their use: the visual exploration of 1) alterations in cancer-driver genes per tumor through a representation based on OncoPrint (described below); 2) cause-effect relationships between different alteration types in tumor samples, through the use of Gitools and the Network viewer from the cBio Cancer Genomics Portal; 3) the stratification of tumor samples based on clinical annotations, using CircleMap, the Integrative Genomics Viewer (IGV) and Gitools; and 4) dramatic structural alterations that encompass the rearrangement of large chromosomal regions, employing the Circos tool and data obtained from the Catalogue of Somatic Mutations in Cancer (Cosmic).
Types of genomic data visualization
Numerous methods have been developed to automate the analysis of genomic data [13–15]. Nonetheless, the visual exploration of alterations in cancer genomes, epigenomes and transcriptomes in multidimensional datasets, and of the relationships between these alterations, presents specific challenges. This review focuses on the visualization principles, methods and tools employed to analyze these multidimensional oncogenomics datasets. (For general reviews on omics data visualization, see [16–19].)
A common way to visualize oncogenomics data is to show alterations tied to their genomic loci. This approach is well suited to provide answers to questions about the genomic topography of alterations or to inspect particular genome loci. We distinguish between two main visualization approaches that use genomic coordinates: Genome Browsers and Circular Plots. Three of the most popular genome browsers employed to visualize cancer alterations are the Integrative Genomics Viewer (IGV) , the UCSC Cancer Genomics Browser , and the Savant Genome Browser . All three support multiple data formats that are used to represent various types of alterations. They display the alterations in each tumor sample as genomic tracks, which can be loaded onto the browser and navigated by zooming and by scrolling to particular genomic regions.
The web-based UCSC Cancer Genomics Browser offers an easy-to-use interface that can be used to browse cancer genomics datasets, such as those of The Cancer Genome Atlas (TCGA), which have been pre-analyzed with various tools and include clinical information. The user can choose between different plotting types: heatmaps, box plots and proportions. The features are shown in either the classic representation bound to genomic locations or in a gene-set visualization, analogous to the IGV split-screen view, resulting in a browser-like heatmap (Figure 2). Unlike IGV and Savant, the UCSC browser does not allow users to upload data.
Circos  is a flexible and popular tool that can be used in many different research fields to plot circular ideograms. In the case of multidimensional oncogenomics data, the genomic coordinates of all chromosomes are represented in a circular layout (Figure 3f). This tool aptly illustrates relationships between distinct alterations, represented as data tracks outside the ideogram, that take place at different locations within the genome. These relationships between regions are normally depicted as ribbons. Intra- and inter-chromosomal translocations are particularly well represented in Circos.
Genome browser tools in general have limited capacity to display relationships between genomic features that are independent of location, such as the coordinated expression of genes. The IGV and UCSC Cancer Genomics Browser attempt to tackle this problem using the split-screen and heatmap approaches, respectively. Another issue with visualization-based genomic reference is that it falls short in visualizing extensive genomic rearrangements. The circular layout of Circos can compensate for this deficit, or it can be resolved by the use of specific tools such as Gremlin . Many other tools also perform specific tasks, exploiting the genomic coordinates representation scheme. For example, putative translocation events can be verified by the command-line tool Pairoscope , which generates relational diagrams of paired-end sequencing reads to aid in the discovery of translocation events. To view and analyze single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array alteration data tools and methods such as VAMP  and waviCGH  are options based on web technologies, whereas Genome Alteration Print  is a desktop application. Furthermore, it has been proposed that there should be a move towards visualizing genomic rearrangements, such as gene fusions, graphically to emphasize the order of the rearranged segments rather than the genomic distance between the breakpoints .
Heatmaps are graphical representations that are frequently used to describe transcriptomics and genomics data stored in the form of matrices. In oncogenomics datasets, the columns in a heatmap usually correspond to tumor samples, whereas the rows are genes, transcripts, microarray probes, or other genomic elements (Figure 1). The color of each cell represents a value indicating a measurement of, let's say, for simplicity, the gene in the tumor, such as its expression level or mutational status. As matrices, heatmaps impose no restriction on the order of the data. This allows data from distant genome loci to be grouped and visualized together for comparison. For example, genes in the same pathway or genes that are associated with certain tumor types might be grouped together. In other words, rows or columns can be clustered according to molecular or clinical features. It is precisely this flexibility to explore visually patterns within the alterations that are correlated to external characteristics, such as the function of genes or the features of the tumor samples, that make heatmaps so popular as a way of representing multidimensional oncogenomics data.
Many tools and programs generate heatmaps from numerical or categorical matrices. We focus here on tools that have features that are particularly well suited to the visual exploration of multidimensional oncogenomics data.
Gitools  is an open-source java application for the analysis and visualization of matrices using interactive heatmaps. The heatmaps in Gitools can contain multiple dimensions, that is, multiple values in each cell, which makes it especially well suited to the exploration of multidimensional cancer genomics data. Its interactive capabilities allow the user to filter, sort, move, and hide rows and columns in the heatmap and to launch several common exploratory analyses (such as correlation, clustering, enrichment and differential expression analyses). Multi-value data matrices, which can contain all types of alterations detected across a cohort of cancer samples, can be explored visually in Gitools, either focusing on a single dimension (that is, one type of alteration) or fixing one dimension to explore its influence on others. Gitools also allows the integration of these data with clinical information.
The cBio Cancer Genomics Portal  is a web resource for visualization of oncogenomics datasets that uses heatmap representation, among other options. The OncoPrint heatmaps display alterations in arrays of genes across tumor samples. Individual genes are represented as rows, and individual cases or patients as columns. Different colors and shapes are used to show different alteration types, so that multiple alterations in a patient's gene can be distinguished easily.
IntOGen is a resource that can be used to analyze and visualize oncogenomics data . It presents different values, estimating the accumulation of somatic mutations, CNA or transcriptional alterations in genes and pathways across tumor samples. Pre-computed data for more than 300 cancer genome experiments are currently available. Web-interactive heatmaps are used to explore gene and pathway alterations across samples and tumor types.
Caleydo StratomeX  is a visualization tool built upon the Caleydo framework , with a focus on exploring interdependencies between different stratifications of cancer samples within a given study. Genomics data on different alterations can be clustered and visualized as matrix heatmaps. The clusters of different alterations are connected by ribbons whose widths correspond to the number of samples shared by the connected clusters. Clusters can also be visualized as pathway diagrams, allowing the researcher to observe the impact of alterations on pathway function (Figure 2).
Heatmaps can also be represented not as rectangles but as circles, as with CircleMap  (Figure 2). With this command-line tool, dimensions can be aligned in a circular plot accompanying a gene, which is represented as a circle that can be attached to other genes in a network layout (Figure 3d).
A general limitation of the heatmap visualization is that structural relationships between genes are difficult to grasp. For instance, it is very hard to discern whether the coincidence of CNA in several genes reflects a possible synergy or is simply a result of their location within a recurrently amplified or deleted chromosomic fragment. Gitools tries to solve this problem by offering the possibility of adding genomic annotations to the rows that can encode functional or structural information. Caleydo StratomeX solves this problem by incorporating pathway diagrams displaying functional relationships between the genes, and CircleMap plots can also be used as nodes to construct a network diagram for this purpose.
Networks represent functional relationships between different entities, such as genes. This type of information is difficult to represent in heatmaps and non-circular visualizations of genomic coordinates. Genetic features can be coded in node attributes such as color, size, or shape. Different alterations can be displayed as additional halos around the node. The network arrangement allows the researcher to explore visually clusters of nodes representing highly interconnected altered genes that can constitute driver pathways or subnetworks.
Cytoscape , a collaborative open-source project, is a widely used and intuitive network visualization and analysis tool in genomics research. No special bioinformatics knowledge is needed to use Cytoscape. The properties of the nodes and the edges and the network layout are customizable, and the comprehensive array of plugins constitutes an added value for researchers. This tool has proven useful for integrating expression data into a gene network , as well as for mapping genes with cancer somatic alterations directly to a functional interactions (FI) network  that identifies subnetworks of altered genes in order to find cancer drivers. A web version, Cytoscape-web , is compatible with common internet browsers and facilitates interaction with the networks displayed. The cBio Cancer Genomics Portal  implements an adaption of this tool optimized for visually exploring multidimensional oncogenomics data from TCGA . Node colors and halos encode the alteration status of cancer genes.
Representation of the genomic alterations present in individual tumor samples in network viewers presents a challenge. As a consequence, many details about the individual tumor samples are normally left out of network figures. In the case of the cBio Cancer Genomics Portal network viewer, this problem is alleviated by the inclusion of plots that show the proportions of samples with different genomic alterations. Similar effects can be achieved with plugins for Cytoscape that transform nodes into pie charts (such as GoogleChartFunctions  and nodeCharts ).
The case studies presented here elaborate on four different oncogenomic research questions that can be answered visually with the available tools and resources. The description of the case studies focuses on their biological interpretation. Supporting documentation on how to generate images corresponding to those in Figure 3 is included in the 'Additional file 1 and 2'. Learning to use most of these tools requires a certain investment of time, and tutorials provided by the developers are highly recommended as a starting point.
Visual exploration of cancer drivers
Distinguishing the alterations that give cancer cells a selective advantage (drivers) from those that are merely side effects (passengers) of the destabilization of the cancer genome is a major problem in oncogenomics research. Several new methodologies [5–8, 11, 38, 43–46] address this problem by exploiting the properties of driver genes. For example, the mutually exclusive alteration of genes in a pathway is a characteristic of cancer drivers [5, 6, 47]. One plausible explanation of this behavior is that an alteration that targets an affected pathway does not confer further selective advantage to the cancer cell. A built-in Gitools option sorts genes and samples within a heatmap to present the pattern of mutually exclusive alterations, which is one approach to visual exploration of driver genes that are involved in the same pathway (Figure 3a) . Oncoprint (cBio Cancer Genomics Portal) uses the same principle to display the alterations across TCGA datasets of a gene set provided by the user.
An alternative approach to identify cancer drivers involves mapping altered genes to a FI network (Figure 3b) [7, 38, 46]. The Reactome FI Cytoscape plugin offers this functionality. After a gene list is submitted, a FI network is constructed using so-called linker genes: genes that are not in the user-submitted list but that can connect two of the submitted genes. Usually, this approach identifies network regions in which recurrently altered genes, which are thought to point to driver genes and subnetworks, are enriched. The visualization of genes and their alterations in the form of FI networks is thus very useful (see Figure 3b for an example).
Visualizing cause-effect relationships between different types of alterations
The effect of genomic alterations can be manifested at the genome, transcriptome or proteome level. Single nucleotide variants (SNVs) might not directly influence transcription of the mutated gene but usually affect protein functionality. On the other hand, CNA and changes in methylation status frequently perturb the expression levels of the altered genes or other genes under their control. Determining the cause-effect relationships of such alterations is important to our understanding of cancer mechanisms. One approach is to plot one type of alteration (for example, CNAs) in a heatmap, sorting the tumor samples to separate diploid genes from altered genes. Changes in gene expression values, presented in another heatmap, can then be readily compared between these two groups (Figure 3a), allowing the detection of any significant differences.
Gitools can load a multidimensional data matrix containing different alterations for each sample, and a simple switch between the values shown in the heatmap cells easily changes the display from one heatmap to the other .
Networks offer another way of visualizing cause-effect relationships. The interactions between genes in a network can represent their functional relationships, for example, one gene might regulate the expression of another. Overlaying the alterations within a cohort of tumors on top of each node of the network might illustrate the effect of a gene alteration on the expression of other genes in the network (analogous to Figure 3b).
The network viewer of the cBio Cancer Genomics Portal supports the visualization of expression data, if available. Similar visual effects could be achieved in Cytoscape by mapping data onto node properties.
Visualizing cancer patient stratifications
Cancer is a complex disease. Tumors that seem very similar when examined through conventional diagnostic methods might look markedly different from the molecular viewpoint, which can lead to different outcomes or treatment responses. Therefore, the molecular features of tumors can be used to stratify patients to support more accurate clinical and therapeutic decisions. Over the past decade, molecular stratification of tumors using expression microarrays has been an important area of cancer research [50–53]. The visualization of molecular alteration patterns in a heatmap is often used to explore subgroups of tumors and to associate them with particular clinical features. These heatmaps usually portray the expression patterns of genes or transcripts across samples, but the benefit of data analysis at the level of gene groups, for example pathways [54–56], is increasingly evident. Stratification and visualization can also be done at the level of pathways or other gene modules (Figure 3c), for example using sample level enrichment analysis (SLEA) [57, 58], which analyzes the transcriptional status of pathways (or other gene sets) in each tumor sample.
In the case of multidimensional oncogenomics data, various clinical features and alterations such as CNA or changes in mRNA or microRNA expression can be used to cluster or stratify tumors, leading to different groupings of samples. In Figure 3c,d, we show ways of representing the results of applying SLEA to the TCGA glioblastoma dataset, with the samples grouped by the corresponding glioblastoma subtype. The alterations are visualized using both Gitools and CircleMap. Please see the 'Additional 1 and 2' for a more detailed description of this process.
Stratifications can also be meaningful when exploring a single locus. Figure 3e illustrates the same grouping of samples by glioblastoma subtype, employing copy number and expression data from the TCGA glioblastoma study using IGV (Figure 3e).
Caleydo StratomeX is especially well suited to exploring relationships between groups of samples (Figure 2). These relationships are visualized as ribbons of varying width drawn between neighboring columns. Wide ribbons encode a high co-occurrence of samples in different groupings, whereas their absence indicates mutual exclusion. This coding provides a straightforward and scalable overview of the consistency of group memberships of tumor samples across different data types.
Visualizing global alteration profile patterns
Various alteration phenotypes have been observed in cancer cells. One of the most conspicuous of these is the mutator phenotype : tumor cells typically have an abnormally high mutational burden. Tumor samples with chromothripsis [60, 61] or many chromosomal translocations are also common. Categorization of the alteration events in a cancer cell population could influence the therapeutic decision, and requires a simultaneous exploratory view of all the alteration events.
One approach to exploring visually all the alterations of a sample is the circular genome mapping proposed by Krzywinski et al. using their tool Circos . Several cancer studies [59, 62–64] have used Circos to show the landscape of alterations. This tool is highly configurable, which is evident from the figures in the cited publications. One compact figure can represent all somatic alteration events in a given tumor sample. Data from different alteration types can be organized in layered circles while rearrangement events occupy the innermost space. Figure 3f is composed of three Circos plots of breast cancer samples  as they are represented on the Cosmic website . The outer-most circle of each diagram represents the human chromosomes, followed by a plot of ticks showing point mutations. The next layer plots CNA along all the chromosomes; the links in the middle visualize the structural rearrangements.
The recently developed ggbio package  for the R programming environment allows, among other things, the creation of circular genome plots, and supports a variety of data formats for sequencing data.
Interfacing of tools
Researchers often need to use several of the complementary tools described here to explore their datasets. Nevertheless, the landscape of visualization tools for multidimensional oncogenomics data seems rather fragmented. This is the result of different groups focusing on the development of tools optimized to solve one particular visualization issue, which is probably a more efficient way of investing resources instead of engineering one single monolithic tool that has all possible visualization capabilities. Unfortunately, this fragmentation makes the use of different tools problematic: they accept very different data formats, they look different to users and so on. Thus, users need to spend time learning how to use each tool and reformatting their data to each tool's requirements. This extra effort could be alleviated if developers were to facilitate the combined use of tools.
One of the major efforts to develop a universal interface that will bridge the gap between different bioinformatic tools is the GenomeSpace project . GenomeSpace allows the user to store data in a common repository and the same web interface guides users to execute the integrated tools, load data, and store results. Conveniently, it contains several built-in converters for some often-used data formats. Several tools listed in Table 1 (IGV, Genomica, Cytoscape and Gitools) are included in this pilot project. This platform interface approach is promising and possibly the most user-friendly option for users who lack a background in bioinformatics.
Another approach to facilitate the use of several tools is the creation of direct tool-to-tool interfaces. These are possible when a tool offers an API that defines the form of communication between the tool and the rest of the world. There are different kinds of APIs, which allows the implementation of different approaches. If the API offers external control, it can send the tool a command and indicate whether the execution of this command has been successful or not. This is the case, for example, with IGV and Gitools: both offer a set of commands that the other application can use. Gitools has a built-in link that sends a 'find locus' command to IGV, whereas IGV exports data into a matrix format and commands Gitools to load it. In practical terms, this means that the user can explore the same data with two complementary visualization tools that can communicate with each other.
Another kind of API can be used for plugin development. This is a general way of creating new capabilities for established tools. As mentioned above, Cytoscape and Savant support plugability, meaning that they possess internal commands that can be used by an application to extend the functions of the tool.
Unidirectional APIs are typically employed by databases and allow easy data transfer between the data source and tools. For example, IGV's external control of the software allows the cBio Cancer Genomics Portal and GenePattern  to load data directly into IGV, and Gitools accepts imported data for all BioMart  databases.
Conclusions and future directions
The cancer genomics research field is rapidly evolving in parallel with advances in high-throughput genomics technologies. This evolution of the field requires continuous advancement in visualization techniques and tools. For instance, the amount of data it is possible to generate for an oncogenomics project continues to increase, requiring visualization tools that very efficiently load and process large amounts of data.
As this rapid scientific evolution continues, cancer researchers are highly dependent on computational scientists and bioinformatics professionals to help them manage, analyze and visualize data. To speed up research advances, the barrier between the large amount of data generated in oncogenomics projects and the effective exploration of these data by cancer researchers must be minimized. Visualization and exploration tools should be intuitive and easy to use, not requiring computational or bioinformatics expertise. Not all tools currently meet these standards, as some programming or even technological knowledge is required of the user. In recent years, however, there has been an important effort to facilitate access by 'non-bioinformaticians' to visualization tools for the analysis of oncogenomics data [20, 31, 32]. Continued work to improve the usability of visualization software is highly important, but requires great effort from developers for low scientific reward when compared to the development of new methods or visualization techniques. Funding agencies must understand that increased investment in personnel dedicated to the development and maintenance of new tools, as well as user training and support, is crucial to the achievement of improvements in the field.
The complexity of oncogenomics data and the multitude of questions to be addressed ensure that a static plot is often insufficient for data visualization. The user needs to explore the data interactively in order to address a wide range of questions. Several tools listed in Table 1 (including IGV, Gitools and Caleydo) make use of interactive visualization techniques to make this possible. Other web frameworks with various visualization and some optional analysis possibilities are being developed, including the cBio Cancer Genomics Portal , IntOGen  and Regulome Explorer . Open source and plug-in architecture facilitates quick adoption of these new platforms.
Although not discussed at length in this review, the use of cancer genomics data visualization in the clinical setting is likely to become a key topic in the near future, as the results of cancer genome projects begin to be translated into personalized cancer medicine. Clinicians will be the main users of this information as they make decisions regarding patient treatment. In this regard, simple, efficient tools that support the visual stratification of tumor genomic profiles and that highlight their relationships to known drugs or treatments will be more useful than the existing research-oriented tools. As a result, it will probably be necessary to develop specialized clinical tools or to adapt existing ones to the clinical setting. This has been achieved in the case of the MedSavant Browser , a clinical adaption of the Savant Genome Browser.
In summary, visualization of multidimensional oncogenomics data is essential for the extraction of useful knowledge from the vast amount of data generated by high-throughput technologies. Important efforts have been made in recent years to create visualization tools that can explore these datasets. Further efforts are needed to develop those resources and to create new tools to meet the changing needs of the field. Long-term investment and funding are needed to guarantee the maintenance, improvement, and evolution of visualization tools beyond their first publication.
List of abbreviations
application programming interface
Catalogue of Somatic Mutations in Cancer
Integrative Genomics Viewer
The Cancer Genome Atlas.
We acknowledge funding from the Spanish Ministry of Science and Technology (grant number SAF009-06954 and FPI fellowship assigned to MPS), and from the Spanish National Institute of Bioinformatics (INB). We are thankful to Elaine M Lilly for assistance with language editing.
- Parsons DW, Jones S, Zhang X, Lin JC-H, Leary RJ, Angenendt P, Mankoo P, Carter H, Siu I-M, Gallia GL, Olivi A, McLendon R, Rasheed BA, Keir S, Nikolskaya T, Nikolsky Y, Busam DA, Tekleab H, Diaz LA, Hartigan J, Smith DR, Strausberg RL, Marie SKN, Shinjo SMO, Yan H, Riggins GJ, Bigner DD, Karchin R, Papadopoulos N, Parmigiani G, et al: An integrated genomic analysis of human glioblastoma multiforme. Science. 2008, 321: 1807-1812. 10.1126/science.1164382.PubMed CentralView ArticlePubMedGoogle Scholar
- The Cancer Genome Atlas Consortium: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008, 455: 1061-1068. 10.1038/nature07385.View ArticleGoogle Scholar
- The Cancer Genome Atlas Research Network: Integrated genomic analyses of ovarian carcinoma. Nature. 2011, 474: 609-615. 10.1038/nature10166.PubMed CentralView ArticleGoogle Scholar
- International Cancer Genome Consortium, Hudson TJ, Anderson W, Artez A, Barker AD, Bell C, Bernabé RR, Bhan MK, Calvo F, Eerola I, Gerhard DS, Guttmacher A, Guyer M, Hemsley FM, Jennings JL, Kerr D, Klatt P, Kolar P, Kusada J, Lane DP, Laplace F, Youyong L, Nettekoven G, Ozenberger B, Peterson J, Rao TS, Remacle J, Schafer AJ, Shibata T, Stratton MR, et al: International network of cancer genome projects. Nature. 2010, 464: 993-998. 10.1038/nature08987.View ArticleGoogle Scholar
- Ciriello G, Cerami EG, Sander C, Schultz N: Mutual exclusivity analysis identifies oncogenic network modules. Genome Res. 2012, 22: 398-406. 10.1101/gr.125567.111.PubMed CentralView ArticlePubMedGoogle Scholar
- Vandin F, Upfal E, Raphael BJ: De novo discovery of mutated driver pathways in cancer. Genome Res. 2012, 12: 375-385.View ArticleGoogle Scholar
- Vandin F, Upfal E, Raphael BJ: Algorithms for detecting significantly mutated pathways in cancer. J Comput Biol. 2011, 18: 507-522. 10.1089/cmb.2010.0265.View ArticlePubMedGoogle Scholar
- Vaske CJ, Benz SC, Sanborn JZ, Earl D, Szeto C, Zhu J, Haussler D, Stuart JM: Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics. 2010, 26: i237-i245. 10.1093/bioinformatics/btq182.PubMed CentralView ArticlePubMedGoogle Scholar
- Louhimo R, Lepikhova T, Monni O, Hautaniemi S: Comparative analysis of algorithms for integration of copy number and expression data. Nat Methods. 2012, 9: 351-355. 10.1038/nmeth.1893.View ArticlePubMedGoogle Scholar
- Akavia UD, Litvin O, Kim J, Sanchez-Garcia F, Kotliar D, Causton HC, Pochanard P, Mozes E, Garraway LA, Pe'er D: An integrated approach to uncover drivers of cancer. Cell. 2010, 143: 1005-1017. 10.1016/j.cell.2010.11.013.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamborero D, Lopez-Bigas N, Gonzalez-Perez A: Oncodrive-CIS: a method to reveal likely driver genes based on the impact of their copy number changes on expression. PLoS ONE.
- Zhang S, Liu C-C, Li W, Shen H, Laird PW, Zhou XJ: Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Res. 2012, 40: 9379-9391. 10.1093/nar/gks725.PubMed CentralView ArticlePubMedGoogle Scholar
- Medvedev P, Stanciu M, Brudno M: Computational methods for discovering structural variation with next-generation sequencing. Nat Methods. 2009, 6: S13-S20. 10.1038/nmeth.1374.View ArticlePubMedGoogle Scholar
- Rueda OM, Diaz-Uriarte R: Finding recurrent copy number alteration regions: a review of methods. Curr Bioinform. 2010, 5: 1-17.View ArticleGoogle Scholar
- Eifert C, Powers RS: From cancer genomes to oncogenic drivers, tumour dependencies and therapeutic targets. Nat Rev Cancer. 2012, 12: 572-578. 10.1038/nrc3299.View ArticlePubMedGoogle Scholar
- Gehlenborg N, O'Donoghue SI, Baliga NS, Goesmann A, Hibbs MA, Kitano H, Kohlbacher O, Neuweger H, Schneider R, Tenenbaum D, Gavin A-C: Visualization of omics data for systems biology. Nat Methods. 2010, 7: S56-S68. 10.1038/nmeth.1436.View ArticlePubMedGoogle Scholar
- Nielsen CB, Cantor M, Dubchak I, Gordon D, Wang T: Visualizing genomes: techniques and challenges. Nat Methods. 2010, 7: S5-S15. 10.1038/nmeth.1422.View ArticlePubMedGoogle Scholar
- O'Donoghue SI, Gavin A-C, Gehlenborg N, Goodsell DS, Hériché J-K, Nielsen CB, North C, Olson AJ, Procter JB, Shattuck DW, Walter T, Wong B: Visualizing biological data - now and in the future. Nat Methods. 2010, 7: S2-S4. 10.1038/nmeth.f.301.View ArticlePubMedGoogle Scholar
- Quinlan AR, Hall IM: Characterizing complex structural variation in germline and somatic genomes. Trends Genet. 2012, 28: 43-53. 10.1016/j.tig.2011.10.002.PubMed CentralView ArticlePubMedGoogle Scholar
- Thorvaldsdóttir H, Robinson JT, Mesirov JP, Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2012, doi: 10.1093/bib/bbs017Google Scholar
- Sanborn JZ, Benz SC, Craft B, Szeto C, Kober KM, Meyer L, Vaske CJ, Goldman M, Smith KE, Kuhn RM, Karolchik D, Kent WJ, Stuart JM, Haussler D, Zhu J: The UCSC Cancer Genomics Browser: update 2011. Nucleic Acids Res. 2011, 39: D951-959. 10.1093/nar/gkq1113.PubMed CentralView ArticlePubMedGoogle Scholar
- Fiume M, Smith EJM, Brook A, Strbenac D, Turner B, Mezlini AM, Robinson MD, Wodak SJ, Brudno M: Savant Genome Browser 2: visualization and analysis for population-scale genomics. Nucleic Acids Res. 2012, W615-W621. 40(Web Server)
- Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010, 26: 139-140. 10.1093/bioinformatics/btp616.PubMed CentralView ArticlePubMedGoogle Scholar
- Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA: Circos: an information aesthetic for comparative genomics. Genome Res. 2009, 19: 1639-1645. 10.1101/gr.092759.109.PubMed CentralView ArticlePubMedGoogle Scholar
- O'Brien TM, Ritz AM, Raphael BJ, Laidlaw DH: Gremlin: an interactive visualization model for analyzing genomic rearrangements. IEEE Trans Vis Comput Graph. 2010, 16: 918-926.View ArticlePubMedGoogle Scholar
- Ding L, Ellis MJ, Li S, Larson DE, Chen K, Wallis JW, Harris CC, McLellan MD, Fulton RS, Fulton LL, Abbott RM, Hoog J, Dooling DJ, Koboldt DC, Schmidt H, Kalicki J, Zhang Q, Chen L, Lin L, Wendl MC, McMichael JF, Magrini VJ, Cook L, McGrath SD, Vickery TL, Appelbaum E, Deschryver K, Davies S, Guintoli T, Lin L, et al: Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature. 2010, 464: 999-1005. 10.1038/nature08989.PubMed CentralView ArticlePubMedGoogle Scholar
- Rosa PL, Viara E, Hupé P, Pierron G, Liva S, Neuvial P, Brito I, Lair S, Servant N, Robine N, Manié E, Brennetot C, Janoueix-Lerosey I, Raynal V, Gruel N, Rouveirol C, Stransky N, Stern M-H, Delattre O, Aurias A, Radvanyi F, Barillot E: VAMP: visualization and analysis of array-CGH, transcriptome and other molecular profiles. Bioinformatics. 2006, 22: 2066-2073. 10.1093/bioinformatics/btl359.View ArticlePubMedGoogle Scholar
- Carro A, Rico D, Rueda OM, Díaz-Uriarte R, Pisano DG: waviCGH: a web application for the analysis and visualization of genomic copy number alterations. Nucleic Acids Res. 2010, 38: W182-W187. 10.1093/nar/gkq441.PubMed CentralView ArticlePubMedGoogle Scholar
- Popova T, Manié E, Stoppa-Lyonnet D, Rigaill G, Barillot E, Stern MH: Genome Alteration Print (GAP): a tool to visualize and mine complex cancer genomic profiles obtained by SNP arrays. Genome Biol. 2009, 10: R128-10.1186/gb-2009-10-11-r128.PubMed CentralView ArticlePubMedGoogle Scholar
- Nielsen C, Wong B: Points of view: representing genomic structural variation. Nat Methods. 2012, 9: 631-10.1038/nmeth.2018.View ArticlePubMedGoogle Scholar
- Perez-Llamas C, Lopez-Bigas N: Gitools: analysis and visualisation of genomic data using interactive heat-maps. PLoS ONE. 2011, 6: e19541-10.1371/journal.pone.0019541.PubMed CentralView ArticlePubMedGoogle Scholar
- Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E, Antipin Y, Reva B, Goldberg AP, Sander C, Schultz N: The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012, 2: 401-404. 10.1158/2159-8290.CD-12-0095.View ArticlePubMedGoogle Scholar
- Gundem G, Perez-Llamas C, Jene-Sanz A, Kedzierska A, Islam A, Deu-Pons J, Furney SJ, Lopez-Bigas N: IntOGen: integration and data mining of multidimensional oncogenomic data. Nat Methods. 2010, 7: 92-93. 10.1038/nmeth0210-92.View ArticlePubMedGoogle Scholar
- Lex A, Streit M, Schulz H-J, Partl C, Schmalstieg D, Park PJ, Gehlenborg N: StratomeX: visual analysis of large-scale heterogeneous genomics data for cancer subtype characterization. Comput Graph Forum. 2012, 31: 1175-1184. 10.1111/j.1467-8659.2012.03110.x.View ArticleGoogle Scholar
- Streit M, Lex A, Kalkusch M, Zatloukal K, Schmalstieg D: Caleydo: connecting pathways and gene expression. Bioinformatics. 2009, 25: 2760-2761. 10.1093/bioinformatics/btp432.PubMed CentralView ArticlePubMedGoogle Scholar
- Shannon P: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.PubMed CentralView ArticlePubMedGoogle Scholar
- Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, Hanspers K, Isserlin R, Kelley R, Killcoyne S, Lotia S, Maere S, Morris J, Ono K, Pavlovic V, Pico AR, Vailaya A, Wang P-L, Adler A, Conklin BR, Hood L, Kuiper M, Sander C, Schmulevich I, Schwikowski B, Warner GJ, et al: Integration of biological networks and gene expression data using Cytoscape. Nat Protoc. 2007, 2: 2366-2382. 10.1038/nprot.2007.324.PubMed CentralView ArticlePubMedGoogle Scholar
- Wu G, Feng X, Stein L: A human functional protein interaction network and its application to cancer data analysis. Genome Biol. 2010, 11: R53-10.1186/gb-2010-11-5-r53.PubMed CentralView ArticlePubMedGoogle Scholar
- Lopes CT, Franz M, Kazi F, Donaldson SL, Morris Q, Bader GD: Cytoscape Web: an interactive web-based network browser. Bioinformatics. 2010, 26: 2347-2348. 10.1093/bioinformatics/btq430.PubMed CentralView ArticlePubMedGoogle Scholar
- The Cancer Genome Atlas. [http://cancergenome.nih.gov/]
- Smoot ME, Ono K, Ruscheinski J, Wang P-L, Ideker T: Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011, 27: 431-432. 10.1093/bioinformatics/btq675.PubMed CentralView ArticlePubMedGoogle Scholar
- nodeCharts Cytoscape Plugin. [http://www.cgl.ucsf.edu/cytoscape/utilities/index.html#nodeCharts]
- Gonzalez-Perez A, Lopez-Bigas N: Functional impact bias reveals cancer drivers. Nucleic Acids Res. 2012, 40: e169-10.1093/nar/gks743.PubMed CentralView ArticlePubMedGoogle Scholar
- Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G: GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 2011, 12: R41-10.1186/gb-2011-12-4-r41.PubMed CentralView ArticlePubMedGoogle Scholar
- Getz G, Höfling H, Mesirov JP, Golub TR, Meyerson M, Tibshirani R, Lander ES: Comment on "The consensus coding sequences of human breast and colorectal cancers.". Science. 2007, 317: 1500-View ArticlePubMedGoogle Scholar
- Cerami E, Demir E, Schultz N, Taylor BS, Sander C: Automated network analysis identifies core pathways in glioblastoma. PLoS ONE. 2010, 5: e8918-10.1371/journal.pone.0008918.PubMed CentralView ArticlePubMedGoogle Scholar
- Thomas RK, Baker AC, DeBiasi RM, Winckler W, LaFramboise T, Lin WM, Wang M, Feng W, Zander T, MacConaill LE, Lee JC, Nicoletti R, Hatton C, Goyette M, Girard L, Majmudar K, Ziaugra L, Wong K-K, Gabriel S, Beroukhim R, Peyton M, Barretina J, Dutt A, Emery C, Greulich H, Shah K, Sasaki H, Gazdar A, Minna J, Armstrong SA, et al: High-throughput oncogene mutation profiling in human cancer. Nat Genet. 2007, 39: 347-351. 10.1038/ng1975.View ArticlePubMedGoogle Scholar
- Visualizing mutually exclusive alteration patterns in cancer with Gitools. [http://bg.upf.edu/blog/2012/03/visualizing-mutually-exclusive-alteration-patterns-in-cancer-with-gitools/]
- Exploring the effect of cancer genomic alteration on expression with Gitools. [http://bg.upf.edu/blog/2012/03/exploring-the-effect-of-cancer-genomic-alteration-on-expression-with-gitools/]
- van 't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AAM, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415: 530-536. 10.1038/415530a.View ArticlePubMedGoogle Scholar
- van 't Veer LJ, Bernards R: Enabling personalized cancer medicine through analysis of gene-expression patterns. Nature. 2008, 452: 564-570. 10.1038/nature06915.View ArticleGoogle Scholar
- Parker JS, Mullins M, Cheang MCU, Leung S, Voduc D, Vickery T, Davies S, Fauron C, He X, Hu Z, Quackenbush JF, Stijleman IJ, Palazzo J, Marron JS, Nobel AB, Mardis E, Nielsen TO, Ellis MJ, Perou CM, Bernard PS: Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009, 27: 1160-1167. 10.1200/JCO.2008.18.1370.PubMed CentralView ArticlePubMedGoogle Scholar
- Sørlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, Hastie T, Eisen MB, van de Rijn M, Jeffrey SS, Thorsen T, Quist H, Matese JC, Brown PO, Botstein D, Lønning PE, Børresen-Dale AL: Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA. 2001, 98: 10869-10874. 10.1073/pnas.191367098.PubMed CentralView ArticlePubMedGoogle Scholar
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005, 102: 15545-15550. 10.1073/pnas.0506580102.PubMed CentralView ArticlePubMedGoogle Scholar
- Khatri P, Sirota M, Butte AJ: Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol. 2012, 8: e1002375-10.1371/journal.pcbi.1002375.PubMed CentralView ArticlePubMedGoogle Scholar
- Bild AH, Potti A, Nevins JR: Linking oncogenic pathways with therapeutic opportunities. Nat Rev Cancer. 2006, 6: 735-741. 10.1038/nrc1976.View ArticlePubMedGoogle Scholar
- Gundem G, Lopez-Bigas N: Sample level enrichment analysis (SLEA) unravels shared stress phenotypes among multiple cancer types. Genome Med. 2012, 4: 28-10.1186/gm327.PubMed CentralView ArticlePubMedGoogle Scholar
- Sample Level Enrichment Analysis (SLEA) tutorial and Gitools 1.6.2. [http://bg.upf.edu/blog/2012/04/sample-level-enrichment-analysis-slea-tutorial-and-gitools-1-6-2/]
- Stephens PJ, McBride DJ, Lin M-L, Varela I, Pleasance ED, Simpson JT, Stebbings LA, Leroy C, Edkins S, Mudie LJ, Greenman CD, Jia M, Latimer C, Teague JW, Lau KW, Burton J, Quail MA, Swerdlow H, Churcher C, Natrajan R, Sieuwerts AM, Martens JWM, Silver DP, Langerød A, Russnes HEG, Foekens JA, Reis-Filho JS, van 't Veer L, Richardson AL, Børresen-Dale A-L, et al: Complex landscapes of somatic rearrangement in human breast cancer genomes. Nature. 2009, 462: 1005-1010. 10.1038/nature08645.PubMed CentralView ArticlePubMedGoogle Scholar
- Stephens PJ, Greenman CD, Fu B, Yang F, Bignell GR, Mudie LJ, Pleasance ED, Lau KW, Beare D, Stebbings LA, McLaren S, Lin M-L, McBride DJ, Varela I, Nik-Zainal S, Leroy C, Jia M, Menzies A, Butler AP, Teague JW, Quail MA, Burton J, Swerdlow H, Carter NP, Morsberger LA, Iacobuzio-Donahue C, Follows GA, Green AR, Flanagan AM, Stratton MR, et al: Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell. 2011, 144: 27-40. 10.1016/j.cell.2010.11.055.PubMed CentralView ArticlePubMedGoogle Scholar
- Rausch T, Jones DTW, Zapatka M, Stütz AM, Zichner T, Weischenfeldt J, Jäger N, Remke M, Shih D, Northcott PA, Pfaff E, Tica J, Wang Q, Massimi L, Witt H, Bender S, Pleier S, Cin H, Hawkins C, Beck C, von Deimling A, Hans V, Brors B, Eils R, Scheurlen W, Blake J, Benes V, Kulozik AE, Witt O, Martin D, et al: Genome sequencing of pediatric medulloblastoma links catastrophic DNA rearrangements with TP53 mutations. Cell. 2012, 148: 59-71. 10.1016/j.cell.2011.12.013.PubMed CentralView ArticlePubMedGoogle Scholar
- Kloosterman WP, Hoogstraat M, Paling O, Tavakoli-Yaraki M, Renkens I, Vermaat JS, van Roosmalen MJ, van Lieshout S, Nijman IJ, Roessingh W, van 't Slot R, van de Belt J, Guryev V, Koudijs M, Voest E, Cuppen E: Chromothripsis is a common mechanism driving genomic rearrangements in primary and metastatic colorectal cancer. Genome Biol. 2011, 12: R103-10.1186/gb-2011-12-10-r103.PubMed CentralView ArticlePubMedGoogle Scholar
- Ellis MJ, Ding L, Shen D, Luo J, Suman VJ, Wallis JW, Tine BAV, Hoog J, Goiffon RJ, Goldstein TC, Ng S, Lin L, Crowder R, Snider J, Ballman K, Weber J, Chen K, Koboldt DC, Kandoth C, Schierding WS, McMichael JF, Miller CA, Lu C, Harris CC, McLellan MD, Wendl MC, DeSchryver K, Allred DC, Esserman L, Unzeitig G, et al: Whole-genome analysis informs breast cancer response to aromatase inhibition. Nature. 2012, 486: 353-360.PubMed CentralPubMedGoogle Scholar
- Nik-Zainal S, Alexandrov LB, Wedge DC, Van Loo P, Greenman CD, Raine K, Jones D, Hinton J, Marshall J, Stebbings LA, Menzies A, Martin S, Leung K, Chen L, Leroy C, Ramakrishna M, Rance R, Lau KW, Mudie LJ, Varela I, McBride DJ, Bignell GR, Cooke SL, Shlien A, Gamble J, Whitmore I, Maddison M, Tarpey PS, Davies HR, Papaemmanuil E, et al: Mutational processes molding the genomes of 21 breast cancers. Cell. 2012, 149: 979-993. 10.1016/j.cell.2012.04.024.PubMed CentralView ArticlePubMedGoogle Scholar
- Bamford S, Dawson E, Forbes S, Clements J, Pettett R, Dogan A, Flanagan A, Teague J, Futreal PA, Stratton MR, Wooster R: The COSMIC (catalogue of somatic mutations in cancer) database and website. Br J Cancer. 2004, 91: 355-358.PubMed CentralPubMedGoogle Scholar
- Yin T, Cook D, Lawrence M: ggbio: an R package for extending the grammar of graphics for genomic data. Genome Biol. 2012, 13: R77-10.1186/gb-2012-13-8-r77.PubMed CentralView ArticlePubMedGoogle Scholar
- GenomeSpace. [http://www.genomespace.org/]
- Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP: GenePattern 2.0. Nat Genet. 2006, 38: 500-501. 10.1038/ng0506-500.View ArticlePubMedGoogle Scholar
- Haider S, Ballester B, Smedley D, Zhang J, Rice P, Kasprzyk A: BioMart Central Portal -unified access to biological data. Nucleic Acids Res. 2009, 37: W23-W27. 10.1093/nar/gkp265.PubMed CentralView ArticlePubMedGoogle Scholar
- Network TCGA: Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012, 487: 330-337. 10.1038/nature11252.View ArticleGoogle Scholar
- MedSavant. [http://genomesavant.com/medsavant/]
- Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q: GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008, 9: S4-PubMed CentralView ArticlePubMedGoogle Scholar
- Segal E, Friedman N, Koller D, Regev A: A module map showing conditional activity of expression modules in cancer. Nat Genet. 2004, 36: 1090-1098. 10.1038/ng1434.View ArticlePubMedGoogle Scholar
- jHeatmap. [http://bg.upf.edu/jheatmap/]
- Brown KR, Otasek D, Ali M, McGuffin MJ, Xie W, Devani B, van Toch IL, Jurisica I: NAViGaTOR: Network Analysis, Visualization and Graphing Toronto. Bioinformatics. 2009, 25: 3327-3329. 10.1093/bioinformatics/btp595.PubMed CentralView ArticlePubMedGoogle Scholar
- Joshi-Tope G: Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 2004, 33: D428-D432. 10.1093/nar/gki072.PubMed CentralView ArticleGoogle Scholar
- Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 1999, 27: 29-34. 10.1093/nar/27.1.29.PubMed CentralView ArticlePubMedGoogle Scholar
- Kelder T, van Iersel MP, Hanspers K, Kutmon M, Conklin BR, Evelo CT, Pico AR: WikiPathways: building research communities on biological pathways. Nucleic Acids Res. 2012, D1301-D1307. 40 Database
- Zhang J, Finney R, Edmonson M, Schaefer C, Rowe W, Yan C, Clifford R, Greenblum S, Wu G, Zhang H, Liu H, Nguyen C, Hu Y, Madhavan S, Ding L, Wheeler DA, Gerhard DS, Buetow KH: The Cancer Genome Workbench: identifying and visualizing complex genetic alterations in tumors. NCI Nature Pathway Interaction Database. 2010, doi: 10.1038/pid.2010.1Google Scholar
- Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: The Human Genome Browser at UCSC. Genome Res. 2002, 12: 996-1006.PubMed CentralView ArticlePubMedGoogle Scholar
- Shoemaker RH: The NCI60 human tumour cell line anticancer drug screen. Nat Rev Cancer. 2006, 6: 813-823.View ArticlePubMedGoogle Scholar