Understanding and Using Information about Cancer Genomes

Published on 09/04/2015 by admin

Filed under Hematology, Oncology and Palliative Medicine

Last modified 22/04/2025

Figure 24-1 Schematic illustrations of the types of genome aberrations found in human cancers. ¹⁸

Table 24-1

Cancer Gene Census Summary

Aberration Type	Number of Aberrations	Examples of Prominent Affected Genes
Amplification	16	ERBB2, EGFR, MYCN, MDM2, CCND1
Frameshift mutation	100	APC, RB1, ATM, MLH1, NF1
Germline mutation	76	BRCA1/2, TP53, ERCC2, RB1, VHL
Missense mutation	141	ARID1A, ATM, PIK3CA, IDH1, KRAS
Nonsense mutation	92	CDKN2A, FANCA, PTCH, PTEN
Other mutation	26	BRAF, PDGFRA, PIK3R1, SOCS1
Splicing mutation	63	GATA3, MEN1, MSH2, TSC1
Translocation	326	ABL1, ALK, BCL2, TMPRSS2, MYC

For more details see http://www.sanger.ac.uk/genetics/CGP/Census.

One important observation from many genomic studies is the existence of recurrent molecular features that allow cancers that occur in specific anatomic regions to be organized into subtypes. The subtypes likely arise in distinct cell types within each tissue and are different diseases that differ in clinical outcome and/or response to therapy. Early genomic studies relied on expression patterns for cancer subtype definition, but current strategies use multiple data types (e.g., genome copy number, mutation, and expression) for subtype definition. Interestingly, epithelial and mesenchymal subtypes appear to be present in tumors that are of epithelial origin. The mesenchymal-like cancers tend to be more rapidly proliferating and motile and associated with reduced survival duration. Some tumor types show remarkably high transcriptional similarity, for example, in triple-negative breast cancer and high-grade serous ovarian cancers. ⁶ Many genomic aberrations also appear in multiple tumor subtypes. Some of the most common aberrations observed in multiple tumor types include amplifications of MYC and EGFR, deletion of CDKN2A and PTEN, and mutation of TP53 and PIK3CA. For a more comprehensive assessment, Kim and colleagues summarize recurrent genome copy number aberrations in 8000 cancers. ²⁰ Efforts are now under way to combine data types (e.g., expression, genome copy number, and mutations) to increase the number of subtypes in order to increase the precision with which patients can be stratified according to outcome and/or therapeutic response. ²¹ Of course, this divides cancers into increasingly smaller subpopulations, so very large numbers of samples are needed to establish subtype differences in treatment response or overall outcome.

Table 24-2

Candidate Cancer Hallmark–Associated Aberrant Genes

Cancer Hallmark	Aberrant Gene
Resisting cell death	BCL2, BAX, FAS
Genome instability and mutation	TP53, BRCA1/2, MLH1
Inducing angiogenesis	CCK2R
Activating invasion and metastasis	ADAMTSL4, ADAMTS3
Tumor-promoting inflammation	IL32
Enabling replicative immortality	TERT
Avoiding immune destruction	HLA loci, TAP1/2, B2M
Evading growth suppressors	RB1, CCND1, CDKN2A
Sustaining proliferative signaling	KRAS, ERBB2, MYC
Deregulating cellular energetics	PIK3CA, PTEN

The number of aberrations that are present in an individual tumor can be remarkably high. The somatic mutation rate in human cancers varies between cancer types from about 0.1 to 10 mutations per megabase,^22,23 but individual tumors may carry as few as a hundred to more than a million somatic aberrations. High genomic instability occurs because of loss of telomere function during progression in the absence of telomerase,^24,25 diminished DNA repair capacity resulting from genomic and epigenomic deregulation of DNA repair pathways, ²⁶ increased damage resulting from oncogene-induced oxidative stress, ²⁷ and toxic environmental exposures.^28,29 In some cases, the exact DNA sequence change in a mutation reflects the type of agent that causes the cancer. For example, mutations in sun-related cancers show CC to TT mutations caused by UV-induced cytosine dimers, whereas smoking-induced cancers in the lung are characterized by G→T transversions caused by the polycyclic aromatic hydrocarbons in tobacco smoke.^30,31 Ultimately, the functions and/or expression levels of hundreds to thousands of genes may be altered in an individual tumor. An unknown number of these will be drivers. Among these, some will have a strong, possibly dominant influence on an individual tumor, whereas others may have a more modest or near-negligible impact. So far, most attention in the field has focused on the strong drivers. However, it seems likely that the ensemble of aberrations will have to be taken into account in explaining the overall behavior of an individual tumor, which is addressed in a later section.

The same drivers of genome instability that enable tumor development also operate during tumor progression. As a result, individual tumors become increasingly heterogeneous as distinct clonal populations within the tumor evolve in diverse microenvironments, producing highly branched lineages. For example, events that enable metastasis may occur late during the genetic evolution, ³² whereas mutation of TP53, a key player in genome stability, can be an early event. ³³ These instabilities and the resultant intratumor heterogeneity in an individual tumor are likely responsible for the rapid evolution of therapeutic resistance. This heterogeneity complicates clinical decision making because the importance of a low-frequency but actionable aberration remains unclear. One possible way forward is to focus treatment on aberrations that occur early during tumor development. The order in which aberrations occur can be inferred by examining a tissue at various stages of disease progression, ³⁴ by serial sampling of clinical tissue from individual patients, ¹⁸ by computational methods that examine mutation frequency,^35–37 or in some cases by analysis of the interactions between mutations and copy-number abnormalities. ³³

Functional Assessment of Cancer Genomes

Transforming cancer genomic data into interpretable knowledge consists of finding the parts and learning how they work together to enable aspects of cancer pathophysiology. Hypothesis-driven research has gone quite far in this process, but full understanding will require systematic analysis, both computational and experimental, of the aberrations that occur within a tumor genome.

Computational Approaches

Computational strategies to identify candidate driver aberrations begin with the cataloging of all aberrations and then move to the selection of high-priority candidate drivers.

Cataloging Approaches

Identification of genes that enable aspects of cancer pathophysiology (driver genes) is complicated by the high genomic heterogeneity within and between tumors. Nearly all cancer genomes analyzed to date appear to have at least one driving oncogenic point mutation, and the vast majority show copy number changes over both large chromosomal segments and smaller, more targeted regions of the genome. The evidence for structural rearrangements being a primary cause in most tumor types is less clear, but diseases including many leukemias, lymphomas, sarcomas, and prostate cancers all incontrovertibly show that rearrangements can be critical (http://atlasgeneticsoncology.org). Changes to chromatin state also are partly responsible for many cancers.^38–40 Over the past 20 years a number of technologies (predominantly microarray based) have been successfully used to catalog cancer genome aberrations, but nearly all efforts now depend on nucleic acid sequencing technology (Mardis, chapter on “The Technology of Analyzing Nucleic Acids”).

Point mutations are identified by aligning DNA sequences obtained from cancer samples to normal genomes using tools such as BWA. ⁴¹ The requirement for the normal genome sequence is paramount because of private single-nucleotide polymorphisms (SNPs) that occur about once every 100,000 base pairs, ⁴² a rate that is about 10 times higher than the mutation rate in most epithelial tumors and 100 times higher than the rate of mutations in childhood cancers such as neuroblastoma.^3,6,22 Read depth and read quality are critical factors in determining how well mutations can be called within each patient’s cancer genome. Read quality is the error rate per thousand base pairs of sequence. High quality is usually defined as having fewer than 1 error per 1000 bases of sequence. Read depth (the number of times a position in the genome has been sequenced) for high-quality bases then governs both the false-positive rate, caused by sequencing errors and by misidentifying private variants as mutations, and the false-negative rate, caused by not generating sufficient data to observe mutations reliably. The greater the depth, the more confident mutation calls will be. Typically, 30× coverage of the normal genome and 40× to 80× coverage of the tumor produces high-quality results. Increased read depth is needed for analysis of samples in which the tumor fraction is low because the presence of normal DNA reads dilutes the aberrant reads. Mutation detection is further complicated by intratumor heterogeneity that causes some aberrations to be present in only a small fraction of the tumor cells. Many groups find value in exome sequencing, that is, targeting the small fraction of the genome that is coding, at even deeper levels (for example, 150×). Verifying the sensitivity of mutation calling remains difficult because there are no good true mutation standards.
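The interplay among read depth, tumor fraction, and subclonality described above can be illustrated with a small calculation. The sketch below is not taken from any mutation caller; it assumes a heterozygous mutation in a diploid region, error-free reads, and simple binomial sampling of reads, and the function names and the three-read calling threshold are hypothetical choices made for illustration.

```python
from math import comb


def expected_variant_fraction(purity, clonal_fraction=1.0):
    """Expected fraction of reads carrying a heterozygous mutation (diploid region assumed).

    purity: fraction of cells in the sample that are tumor cells.
    clonal_fraction: fraction of tumor cells that carry the mutation.
    """
    return 0.5 * purity * clonal_fraction


def detection_probability(depth, variant_fraction, min_variant_reads=3):
    """P(at least min_variant_reads mutant reads) under Binomial(depth, variant_fraction)."""
    p_too_few = sum(
        comb(depth, k) * variant_fraction**k * (1.0 - variant_fraction) ** (depth - k)
        for k in range(min_variant_reads)
    )
    return 1.0 - p_too_few


if __name__ == "__main__":
    # Compare a fairly pure sample with a low-purity one, for a mutation
    # present in half of the tumor cells, at three sequencing depths.
    for purity in (0.8, 0.3):
        fraction = expected_variant_fraction(purity, clonal_fraction=0.5)
        for depth in (40, 80, 150):
            print(purity, depth, round(detection_probability(depth, fraction), 3))
```

Under these assumptions the probability of seeing enough mutant reads rises with depth and falls as normal DNA or subclonality dilutes the mutant allele, which is the qualitative behavior the text describes.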

Detection of insertions and deletions (indels) remains challenging. In principle, the same sequence coverage necessary to find point mutations can be used to identify indels. Unfortunately, the algorithmic methods for indel identification are much more computationally intense. ⁴³ No good estimates exist on how well indel detection software works because of the lack of gold standards against which to measure algorithm performance. In general, indel detection is even more difficult than evaluating the substitution mutations.

Copy number and structural aberrations are identified using a combination of microarray and sequencing approaches. Microarrays and whole-genome shotgun sequencing are capable of identifying changes in DNA copy number that are as small as 1000 base pairs in length. This resolution is sufficiently good that nearly all gene-level aberrations can be detected. Microarray approaches look for differences in hybridization signal, whereas sequencing approaches detect changes in read depth. Sequencing of genomic DNA is the most direct way to identify the breakpoints for structural rearrangements, but the methodology is challenging, requiring a high-coverage, high-quality DNA sequence. Often, structural rearrangements cannot be detected with the standard technologies because the sequencing approaches used cannot span the length of repetitive sequences in the human genome. Once a whole-genome shotgun sequence is generated, methods such as BreakDancer ⁴⁴ and Delly ⁴⁵ can be used to find the chromosome junctions. Other structural aberration detection technologies are emerging, so it is likely that we will be able to identify the majority of structural breakpoints in the near future.

Detection of promoter methylation is usually accomplished using microarray technologies. Microarrays that can measure methylation at more than 485,000 sites are now commonly used by groups such as TCGA. ⁷ In principle, DNA sequencing can be used for this purpose, but this is currently economically impractical, with costs 10 to 50 times greater than for microarray approaches. In addition, sequencing approaches currently require unreasonably large quantities of tumor DNA.

RNAseq is now the standard for measuring gene expression. RNA is depleted of ribosomal RNA (rRNA) by either polyA+ selection or any number of rRNA depletion steps and fragmented before complementary DNA (cDNA) production. Short cDNA fragments are sequenced and mapped to the human genome reference. Algorithms to estimate which transcripts are being produced and their relative abundances ⁴⁶ are used to interpret the fragment data. One strength of RNAseq analysis is that it does not require that the transcriptome be known, and thus it has enabled the study of noncoding RNAs, including lincRNAs and, with adapted protocols, miRNAs.^47,48 RNAseq methods are still being refined, with improvements in molecular and algorithmic approaches regularly being developed.

Integrating Information

A central challenge in cancer genomics today is in distinguishing the causal components of disease from the effects of the disease, or even more importantly from the random aberrations that occur during progression and are carried along by chance association with driver mutations. Suites of tools have been developed to answer these key questions.

The major focus of efforts such as TCGA and ICGC has been to identify the recurrently mutated genes in specific cancer types. For example, in serous ovarian cancer 95% of all tumors have point mutations in TP53. Statistics are not needed for the average scientist to decide that TP53 is a critical gene. In most cases, however, the process for deciding if a gene is recurrently mutated in a specific tumor type is much more complicated, even after one has identified the mutations. First, not all genes are of the same length; longer genes should have more mutations by chance if mutations are equally likely at each position. Failure to control for gene size often leads to the identification of genes encoding long proteins such as Titin, whose coding sequence is 100 times longer than that of the average human gene. Second, mutations within a tumor type are not evenly split among all possibilities. For example, tumors caused by UV light will show high rates of C→T mutations in general, especially at CC dinucleotides. Further, we now know that mutations are not randomly distributed over the genome. For example, regions of the genome near late replication forks can have mutation rates 10 times higher than the average rate. Without accounting for this, many genes will be identified as showing more mutations than expected by chance when in fact they do not. ⁴⁹ Identifying driver genes based on patterns of recurrence is partly about understanding the mutagenic processes as a whole and performing appropriate statistical tests to incorporate them.^5,6
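The gene-length argument can be made concrete with a deliberately simplified test. The sketch below assumes a single uniform background mutation rate per base per sample and uses a Poisson approximation for the number of mutations expected by chance; it ignores the mutation-type and replication-timing effects discussed above, which dedicated tools model explicitly. All names and numbers are hypothetical.

```python
from math import exp, lgamma, log


def poisson_upper_tail(observed, expected):
    """P(X >= observed) for X ~ Poisson(expected), summed directly over the upper tail."""
    if observed <= 0:
        return 1.0
    if expected <= 0.0:
        return 0.0
    # First tail term, computed in log space to avoid overflow.
    term = exp(-expected + observed * log(expected) - lgamma(observed + 1))
    total = 0.0
    i = observed
    while True:
        total += term
        i += 1
        term *= expected / i
        # Past the mode the terms shrink quickly; stop once they no longer matter.
        if i > expected and term <= total * 1e-16:
            break
    return min(1.0, total)


def recurrence_p_value(n_mutations, coding_length_bp, n_samples, rate_per_bp):
    """Chance of seeing at least n_mutations in a gene given its length and cohort size."""
    expected = coding_length_bp * n_samples * rate_per_bp
    return poisson_upper_tail(n_mutations, expected)


if __name__ == "__main__":
    # Same mutation count, same cohort, very different gene lengths.
    print(recurrence_p_value(20, 1_200, 300, 1e-6))    # short gene: far above expectation
    print(recurrence_p_value(20, 100_000, 300, 1e-6))  # very long gene: below expectation
```

With these illustrative inputs, 20 mutations in a short gene are highly unexpected, whereas the same count in a gene of titin-like length is fewer than the roughly 30 expected by chance.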

Many genes have hotspots where mutations occur preferentially. For example, mutations in the HRAS gene have a bias to alter the 12th amino acid to valine from glycine. When these events occur repeatedly, the same statistics used for overall mutation rate can be applied, but constrained to the specific event. Thus, with far fewer examples, a specific gene mutation can be associated with cancer because of the increased power from decreasing the search space. Similarly, mutations that are clustered in a specific protein domain can be identified. Finally, if a variant is known to be oncogenic in one tumor type (for example, the canonical BRAF mutations found in 50% of melanomas), then when it occurs in other tumor types, it is parsimonious to assume that it is oncogenic there as well, even if it is rare.

At least a dozen methods have now been developed to identify genes (or sets of genes) that are under selection through copy number change. The principles for the detection of these genes are simple even if the implementations differ. First, copy number data are segmented to identify the locations of copy number change points using an algorithm such as CBS. ⁵⁰ Once segmented, the data are normalized, and germline copy number differences relative to the reference are removed. Finally, the data are analyzed to locate the genetic elements that are present in copy number aberrations more often than expected by chance (e.g., STAC ⁵¹ ). Copy number aberrations are thought to follow two distinct distributions: broad events that cover whole (or nearly whole) chromosome arms, and narrow events targeting much smaller regions (often fewer than 10 genes). ⁵² These software tools provide a list of the genes and chromosome arms that are frequently included in both broad and narrow events across many tumors. Although specific types of tumors have specific biases for (or against) specific genes/chromosome arms, many copy number aberrations are present in a diverse set of tumor types. ²⁰ Methods to identify structural changes in the genome increasingly are based on the application of genome sequencing to both ends of genomic clones or fragments. The ends of each clone are then mapped onto a representation of the normal genome sequence. Structural aberrations are inferred when the paired ends of a clone map too far apart (signaling a deletion), too close together (signaling an insertion), or to different chromosomes (signaling a translocation). This approach was initially proposed for analysis of cloned sequences ⁵³ but has become routine with the advent of massively parallel sequencing. ⁴⁴ Once individual events are identified, standard statistical principles are then used to estimate the likelihood of seeing similar aberrations more frequently than expected by chance.
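The paired-end logic can be sketched in a few lines. This is not the algorithm of BreakDancer, Delly, or any other tool; it assumes that each fragment has a known expected length, that ends mapping farther apart on the reference than the fragment length allows indicate sequence missing from the sample (a deletion), that ends mapping closer together indicate extra sequence in the sample (an insertion), and that ends on different chromosomes indicate a candidate translocation. The function name, expected fragment length, and tolerance are hypothetical.

```python
def classify_pair(chrom1, pos1, chrom2, pos2, expected_span=400, tolerance=100):
    """Label one read pair by how its two ends map onto the reference genome."""
    if chrom1 != chrom2:
        return "translocation_candidate"
    span = abs(pos2 - pos1)
    if span > expected_span + tolerance:
        # Reference holds more sequence between the ends than the fragment did.
        return "deletion_candidate"
    if span < expected_span - tolerance:
        # Fragment held more sequence between the ends than the reference does.
        return "insertion_candidate"
    return "concordant"


if __name__ == "__main__":
    pairs = [
        ("chr1", 1_000, "chr1", 1_400),
        ("chr1", 1_000, "chr1", 9_000),
        ("chr1", 1_000, "chr1", 1_100),
        ("chr1", 1_000, "chr8", 5_000),
    ]
    for pair in pairs:
        print(pair, classify_pair(*pair))
```

In practice a single discordant pair is weak evidence; calls are normally supported by clusters of pairs pointing to the same junction, followed by the statistical assessment of recurrence described in the text.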

Organization into Pathways

A major challenge in cancer genomics is to understand how the ensemble of driver aberrations in an individual tumor influences its clinical and biological behavior. The remarkable genomic heterogeneity that exists in individual tumors can be managed to some extent by mapping aberrations onto pathways that influence the development of cancer hallmarks. The goal of these approaches is to reduce a dauntingly large number of functional genomic aberrations by mapping these onto a manageably small number of important pathways. Several approaches have been developed to organize omic information in ways that enable identification of pathways. We discuss gene-set enrichment approaches, pathway enrichment methods, and newer approaches that extend the repertoire of tools for pathway identification.

One of the most popular approaches is to use statistical tests on gene sets to implicate pathways that are deregulated by changes in their member genes. A score is used to measure the degree to which each gene aberration is associated with the disease process, and then an enrichment analysis is performed using a large database of gene sets. For example, genes can be scored based on their length-normalized mutation frequency in a cohort, or assessed with more sophisticated analyses such as MutSig ⁵⁴ or OncoDriveFM ⁵⁵ to gauge how likely it is that mutations in the gene provide a selective advantage to tumor cells. Once an appropriate score is applied to rank the genes, statistical tests can be used to identify enriched pathways. One approach is to threshold the list of genes to obtain those that are ranked toward the top of the list. These top-ranked genes then can be overlapped with each candidate pathway and a Fisher’s exact or hypergeometric test used to assess whether the overlap is significantly larger than expected by chance. Overlap methods are implemented in web servers such as the DAVID ⁵⁶ resource.
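The overlap test can be written directly from the hypergeometric distribution. The sketch below is a generic illustration rather than the procedure of DAVID or any other server; the function name and the example counts are hypothetical, and no multiple-testing correction across pathways is included.

```python
from math import comb


def overlap_p_value(n_universe, n_pathway, n_selected, n_overlap):
    """P(overlap >= n_overlap) when n_selected genes are drawn at random from n_universe.

    n_universe: number of genes that were scored.
    n_pathway:  number of those genes that belong to the candidate pathway.
    n_selected: number of top-ranked genes kept after thresholding.
    n_overlap:  number of top-ranked genes that fall in the pathway.
    """
    largest_possible = min(n_pathway, n_selected)
    favorable = sum(
        comb(n_pathway, k) * comb(n_universe - n_pathway, n_selected - k)
        for k in range(n_overlap, largest_possible + 1)
    )
    return favorable / comb(n_universe, n_selected)


if __name__ == "__main__":
    # 200 top-ranked genes out of 20,000 scored; 10 of them fall in a 50-gene pathway.
    print(overlap_p_value(20_000, 50, 200, 10))
```

Here only about 0.5 pathway genes would be expected among the top 200 by chance, so an overlap of 10 yields a very small p-value.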

Gene Set Enrichment Analysis ⁵⁷ (GSEA) compares the entire distribution of scores against a random background using a Kolmogorov-Smirnov–inspired test. Implicated pathways contain significantly more gene members with extreme (either high or low) scores. Gene set–based approaches are used frequently to test for enriched sets of genes, revealing important biological themes. However, the approach makes no use of known interactions between the tested genes. Thus, it is possible for a small but still significant subnetwork of genes to have significantly high scores and go undetected by these set-based approaches. In addition, all genes in a set are treated uniformly. However, some genes in the network may control many other genes while others are specialized effectors performing a specific cellular task in a limited set of conditions. Such genes may be weighted differently in the enrichment analysis to improve the sensitivity of the approach. Methods that incorporate notions of the local network organization of the scored genes can incorporate such intuitions and are discussed next.
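A running-sum statistic of the Kolmogorov-Smirnov type can be sketched as follows. This is the simple unweighted form, written for illustration only: the published GSEA method weights each step by the gene's score and assesses significance by permutation, neither of which is shown here. The function name and toy gene labels are hypothetical.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score for one gene set.

    Walk down the ranked list, stepping up at set members and down at
    non-members; return the running sum's largest deviation from zero.
    """
    members = set(gene_set) & set(ranked_genes)
    n_hits = len(members)
    n_misses = len(ranked_genes) - n_hits
    if n_hits == 0 or n_misses == 0:
        return 0.0
    running = 0.0
    extreme = 0.0
    for gene in ranked_genes:
        if gene in members:
            running += 1.0 / n_hits
        else:
            running -= 1.0 / n_misses
        if abs(running) > abs(extreme):
            extreme = running
    return extreme


if __name__ == "__main__":
    ranked = ["A", "B", "C", "D", "E", "F", "G", "H"]  # highest score first
    print(enrichment_score(ranked, {"A", "B"}))  # members at the top: positive score
    print(enrichment_score(ranked, {"G", "H"}))  # members at the bottom: negative score
    print(enrichment_score(ranked, {"A", "H"}))  # members at both ends: weaker score
```

A set whose members cluster at either end of the ranking produces a score near +1 or -1, while a set scattered through the list stays closer to zero.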

“Master Regulator” algorithms attempt to identify genes residing at the logical “top” of predictive pathways whose manipulation would be expected to change the expression of downstream genes. ⁵⁸ Signaling Pathway Impact Analysis (SPIA), ⁵⁹ MARINa, ⁶⁰ and GeneRank ⁶¹ are examples of algorithms in this class. The principle behind these algorithms can be likened to identifying authoritative pages on the Internet. A web page is considered authoritative if many other authoritative pages reference the page. The definition is necessarily recursive, forcing the algorithms to propagate information through the network to determine a solution. For master regulators, the links in the network are reversed so that the methods home in on genes that control many other control genes, again in an iterative fashion. The approach has been used to propose master regulators for B-cell lymphoma. ⁶⁰
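The web-page analogy can be illustrated with a PageRank-style iteration run on reversed regulatory links. This sketch is not the SPIA, MARINa, or GeneRank algorithm; it assumes a plain directed network of regulator-to-target links, and the damping factor, iteration count, and toy gene names are hypothetical.

```python
def master_regulator_scores(regulates, damping=0.85, iterations=100):
    """Score genes by authority over other genes, PageRank-style.

    regulates: dict mapping each regulator to a list of its direct targets.
    Links are reversed so that each target "votes" for its regulators, and a
    regulator's vote counts for more when it in turn controls many genes.
    """
    nodes = set(regulates)
    for targets in regulates.values():
        nodes.update(targets)
    nodes = sorted(nodes)

    votes_for = {gene: [] for gene in nodes}
    for regulator, targets in regulates.items():
        for target in targets:
            votes_for[target].append(regulator)

    score = {gene: 1.0 / len(nodes) for gene in nodes}
    for _ in range(iterations):
        updated = {gene: (1.0 - damping) / len(nodes) for gene in nodes}
        for voter in nodes:
            # A gene with no known regulator spreads its vote over all genes.
            recipients = votes_for[voter] or nodes
            share = damping * score[voter] / len(recipients)
            for recipient in recipients:
                updated[recipient] += share
        score = updated
    return score


if __name__ == "__main__":
    network = {"TF1": ["TF2", "TF3"], "TF2": ["G1", "G2"], "TF3": ["G3", "G4"]}
    scores = master_regulator_scores(network)
    for gene in sorted(scores, key=scores.get, reverse=True):
        print(gene, round(scores[gene], 3))
```

In the toy network, TF1 ranks highest because it controls two genes that are themselves regulators, which mirrors the recursive definition of authority given above.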

Another strategy is to search through large background networks for smaller subnetworks with a concentrated number of altered genes. Such subnetworks could represent pathways where disruptions in any of several gene members could interfere with the functioning of the pathway. These approaches make use of networks derived from high-throughput studies such as the collections of protein-protein interactions in BioGRID, ⁶³ HPRD, ⁶⁴ iREF, ⁶⁵ and STRING ⁶⁶ to identify novel pathways involved in tumorigenesis. These high-throughput sources can be used either alone or together with curated and directed signaling pathways found in resources like Reactome ⁶⁷ and NCI’s Protein Interaction Database. ⁶⁸ Integrating somatic alterations and protein-protein interactions has the potential to provide a powerful means for cutting down false-positive rates present in either dataset because the sources of error are independent. Whether the subnetworks produced from these analyses are physiologically relevant is largely an open question but an area of intense activity.

HotNet ⁶⁹ is a method for identifying enriched subnetworks, given a set of genes that are frequently mutated in a cohort. HotNet uses a heat-diffusion approach in which a mutated gene is considered to be a heat “source.” The heat is allowed to dissipate on the background network for a short time interval so that genes neighboring the sources also heat up. Genes residing close to multiple sources receive more heat than genes far away, with heat falling off as an exponentially decaying function of the distance in the network. The algorithm then uses a hierarchical statistical test to identify significantly hot subnetworks. HotNet has been used to identify Notch-related pathways implicated in ovarian cystadenocarcinoma ³ and chromatin-remodeling pathways in clear-cell kidney carcinoma. ^69a These methods are especially well suited to the identification of subtype-specific subnetworks both within and across tumor types.
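The heat-diffusion idea can be illustrated with a few explicit diffusion steps on a small undirected network. This is a toy sketch, not the HotNet implementation: it omits HotNet's statistical test and uses a simple discrete update, and the function name, rate, step count, and gene labels are hypothetical.

```python
def diffuse_heat(edges, sources, rate=0.1, steps=5):
    """Spread heat from mutated genes over an undirected interaction network.

    edges:   iterable of (gene_a, gene_b) interactions.
    sources: set of mutated genes, each starting with one unit of heat.
    rate:    fraction of each heat difference exchanged per step; rate times the
             largest node degree should stay below 1 for a stable update.
    Genes that do not appear in any edge are ignored.
    """
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

    heat = {gene: (1.0 if gene in sources else 0.0) for gene in neighbors}
    for _ in range(steps):
        # Each gene gains heat from hotter neighbors and loses it to cooler ones.
        heat = {
            gene: heat[gene] + rate * sum(heat[nb] - heat[gene] for nb in nbs)
            for gene, nbs in neighbors.items()
        }
    return heat


if __name__ == "__main__":
    edges = [("M1", "X"), ("M2", "X"), ("X", "Y"), ("Y", "Z")]
    heat = diffuse_heat(edges, sources={"M1", "M2"})
    for gene in sorted(heat, key=heat.get, reverse=True):
        print(gene, round(heat[gene], 4))
```

Gene X, which touches both mutated genes, ends up warmer than Y and Z, which lie progressively farther from the sources; a subnetwork search would then look for connected groups of hot genes.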

The Mutually Exclusive Modules (MEMo) algorithm ⁷⁰ identifies novel networks from perturbation patterns observed across samples. This approach is based on the concept of mutual exclusivity—that is, mutation of a second gene in a cancer-related pathway provides no advantage in fitness beyond that provided by the first. The MEMo algorithm takes advantage of this mutual exclusivity property and builds an exhaustive graph of all approximate mutually exclusive gene pairs. Although the statistical significance of any two genes exhibiting such a mutually exclusive pattern is tenuous even in cohorts of hundreds of samples, the observation of a set of genes that all transitively share this property can be significant if the gene set is large enough (e.g., greater than three). MEMo leverages the significance of groups by exhaustively searching its network for subnetworks representing approximate cliques of sufficient size. Identified subnetworks are considered as candidate novel networks. New approaches in this vein, such as DENDRIX, ⁷¹ are also available that include additional statistical associations between genes beyond mutual exclusivity, such as the co-occurrence of mutational events.
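The mutual exclusivity idea can be sketched as a pairwise test followed by a search for fully connected groups. This is a toy illustration and not the MEMo implementation, which also draws on prior network knowledge and permutation-based significance; here pairwise exclusivity is assessed with a one-sided hypergeometric test on co-occurrence, and the function names, threshold, and sample sets are hypothetical.

```python
from itertools import combinations
from math import comb


def exclusivity_p_value(samples_a, samples_b, n_samples):
    """P(co-occurrence <= observed) if the two genes were mutated independently."""
    n_a, n_b = len(samples_a), len(samples_b)
    observed_both = len(samples_a & samples_b)
    lower_tail = sum(
        comb(n_a, k) * comb(n_samples - n_a, n_b - k)
        for k in range(observed_both + 1)
    )
    return lower_tail / comb(n_samples, n_b)


def exclusive_modules(mutations, n_samples, size=3, alpha=0.05):
    """Return gene groups of the given size in which every pair looks mutually exclusive.

    mutations: dict mapping gene name to the set of sample indices where it is mutated.
    """
    genes = sorted(mutations)
    exclusive_pairs = {
        frozenset(pair)
        for pair in combinations(genes, 2)
        if exclusivity_p_value(mutations[pair[0]], mutations[pair[1]], n_samples) <= alpha
    }
    return [
        group
        for group in combinations(genes, size)
        if all(frozenset(pair) in exclusive_pairs for pair in combinations(group, 2))
    ]


if __name__ == "__main__":
    cohort = {
        "A": set(range(0, 30)),
        "B": set(range(30, 60)),
        "C": set(range(60, 90)),
        "D": set(range(0, 30)),  # always co-mutated with A
    }
    print(exclusive_modules(cohort, n_samples=100))
```

In the toy cohort, A, B, and C never co-occur and form a candidate module, whereas A and D are never grouped together because they are always mutated in the same samples.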

The PARADIGM network analysis tool^72,73 uses information from multiple profiling measurements (copy number, mutations, transcription, etc.) to calculate inferred pathway activity levels (IPLs) for more than 1300 curated cell signaling pathways associated with specific recurrent aberrations, cancer types, or cancer subtypes. These data can be further combined into “superpathways” to identify subpathways therein whose activities differ between comparator populations (e.g., between transcriptional subtypes or between populations that differ in drug sensitivity). This approach has the advantage that it draws on community knowledge of pathway architecture but has the disadvantage that the pathways may be inaccurate in some situations. PARADIGM has been used in several analyses,^3,4,6,73 demonstrating the power of inferred activities for identifying important tumor subtypes.

An extension of PARADIGM, PARADIGM-SHIFT ⁷⁴ (PS), infers the impact of mutational events using network inference. Many mutations in advanced tumors are neutral passenger events resulting from the loss of genome integrity. Against this background of myriad spurious genomic perturbations, it is of interest to identify those that increase tumor fitness or that drive tumorigenesis forward. Several sequence-based methods are available to attack this important problem. However, an additional very important aspect, which has eluded computational analysis until very recently, is to predict whether the driving mutation causes a gain of function (GOF) or loss of function (LOF) to the protein. GOF mutations can lead to therapeutic manipulation because our biomedical tools often fare better at shutting down erroneously activated oncogenes than at introducing functional copies to rescue lost tumor suppressor activity. Pathway-based approaches offer promise in this area because the predicted activity of proteins in the pathway neighborhood can be inspected for signals of GOF and LOF. This is the approach taken by PS. PS predicts the impact of a mutation on the function of a protein by estimating the effects in the protein’s pathway context. It uses two runs of the PARADIGM algorithm ⁷² (a “Targets-only” run and a “Regulators-only” run) to make this assessment. In the “Regulators-only” run, PS uses PARADIGM to infer the protein’s activity after leaving intact only the protein’s upstream connections. In the “Targets-only” run, it estimates the activity of the protein with PARADIGM after leaving only the downstream connections intact. The difference, or “shift,” between these two estimates provides an estimate of the loss or gain of function in the protein. PS has been successfully used to predict several known positive controls in glioblastoma multiforme, lung squamous, and breast carcinomas. ⁷⁴ One critical aspect for these network-based approaches is to select an informative local neighborhood around the protein, which can significantly influence overall accuracy. Thus machine-learning–based approaches such as the one described next could provide important synergies with these mutation-impact approaches.

Network-Induced Classification Kernels (NICK ⁷⁵ ) use networks to train support-vector machines to predict patient outcomes. Supervised machine learning is a well-established field that has contributed classification approaches for predicting discrete outcomes, and regression-based approaches for predicting continuous-valued outcomes. These methods face the “curse of dimensionality” problem when attempting to use the available large feature spaces (e.g., gene expression vectors) of high-throughput functional genomics to predict outcomes in a relatively small set (e.g., less than a thousand) of samples. Classifiers can suffer problems of robustness, reproducibility, and accuracy and can also misassess the importance of any single feature in the classification task. Only recently have approaches been developed to make use of a priori pathway knowledge for this task. NICK encodes the gene-gene interactions found in a network into the formulation of a support-vector machine classifier. The resulting method rewards selection of features that are adjacent in the network, thus resulting in solutions that are more robust, while maintaining classification accuracy. Methods such as NICK promise to stabilize solutions determined when the same task, such as predicting recurrence of disease, is applied to different datasets because the use of the same network should steer the solutions toward being comparable.

In summary, pathway- and network-based approaches represent a highly active area of current research in the analysis of cancer genomics datasets. New methods are still sorely needed to use the results of these approaches in a worthwhile effort to translate the findings to patient treatment. For example, the networks identified by these approaches could provide important insights into “Achilles’ heel” attack points for cancer cells. We therefore need methods that can predict how a tumor might respond to a drug by simulating manipulations on such networks. An important antecedent to this, of course, is to prove that the networks capture enough of the salient features of a patient’s tumor for it to be used as an “avatar” for in silico testing.

Experimental Approaches

The computational approaches just described attempt to predict functional genes based on their frequency, association with behavior, activation of pathways, and so forth. However, such approaches are limited by the number of samples available for computational assessment, the high heterogeneity within and between human tumors, and our imperfect understanding of the regulatory mechanisms that govern normal and malignant cell behavior. Thus, they serve to generate hypotheses that guide experimental validation in laboratory models.

Tumor Intrinsic Assessments

A wide range of in vitro and in vivo experimental systems are now available for functional assessment of the effects of genomic aberrations that occur in tumors and their impacts on therapeutic response. Given the extremely large number of aberrant genes and networks now being discovered, this summary focuses on methods that are sufficiently high throughput to allow “first pass” assessment of function. In general, these strategies assess the impact of manipulating cancer genes or networks on aspects of growth or immortalization and less frequently other aspects of cancer biology such as differentiation, angiogenesis, senescence, motility, and DNA repair activity. Biological systems now in widespread use for this purpose include well-characterized collections of immortalized cancer cell lines grown in two- or three-dimensional cultures,^73,76,77 cell lines such as IL-3–dependent Ba/F3 hematopoietic cells that proliferate and survive in the absence of IL-3 when transfected with a constitutively active oncogene,^78,79 tumor xenograft collections,^80,81 genetically engineered murine models of cancer,^82,83 and mice subjected to transposon-mediated gene alteration leading to tumor formation. ⁸⁴

One powerful strategy for the manipulation of gene function introduces inhibitory RNA (RNAi) oligonucleotides into model organisms^85–87 to downregulate candidate genes or activated cancer regulatory networks. These RNAi precursors include short hairpin RNA (shRNA) oligonucleotides that are delivered through viral or bacterial vectors^87,88 and double-stranded RNA molecules, 20 to 25 base pairs in length, called small interfering RNAs (siRNAs)^85,89 that are transfected directly into target cells. Two general strategies are now commonly used to test the impact of RNAis in model organisms. One is to introduce libraries of RNAis that have been individually “barcoded” with unique nucleic acid sequences that can be identified by hybridization to oligonucleotide microarrays^89,90 or by massively parallel DNA sequencing. ⁹¹ The loss (selected against) or gain (selected for) of specific RNAis during growth is taken as evidence of the importance of the targeted genes during growth. This approach has the advantage of enabling genome-wide screens at low cost but has the disadvantage of assessing only those effects of gene manipulation that alter cell growth. Another approach is to test the impact of siRNAs that target individual genes in cells grown in microwells ⁹² or on cell spot microarrays. ⁹³ The biological responses can be assessed by measuring changes in cancer-related properties relative to a control using assays that estimate cell number, or by using high-content imaging of cancer phenotypes such as DNA repair activity, differentiation, senescence, and motility after immunofluorescent staining for molecular surrogates for these phenotypes,^94–96 with dynamic responses measured using time-lapse imaging. ⁹⁷ These approaches have been useful in assessing the activity of specific pathways, ⁸⁹ identifying genomic vulnerabilities that might be attacked therapeutically with single agents, ⁹² and developing strategies to combine therapeutic agents.^98,99
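The readout of a pooled barcode screen can be summarized as a change in each barcode's relative abundance. The sketch below is a generic illustration, not the analysis used in the cited studies; it assumes raw barcode counts from sequencing before and after growth, normalizes by total count, and adds a pseudocount to avoid division by zero. The function name and shRNA labels are hypothetical.

```python
from math import log2


def barcode_log2_fold_change(before, after, pseudocount=1.0):
    """Log2 change in relative abundance of each barcode over the growth period.

    before, after: dicts mapping barcode to read count at the start and end.
    Negative values mean the barcode was depleted (selected against);
    positive values mean it was enriched (selected for).
    """
    total_before = sum(before.values())
    total_after = sum(after.values())
    changes = {}
    for barcode, count_before in before.items():
        count_after = after.get(barcode, 0)
        fraction_before = (count_before + pseudocount) / total_before
        fraction_after = (count_after + pseudocount) / total_after
        changes[barcode] = log2(fraction_after / fraction_before)
    return changes


if __name__ == "__main__":
    start = {"shRNA_1": 1000, "shRNA_2": 1000, "shRNA_3": 1000}
    end = {"shRNA_1": 100, "shRNA_2": 1000, "shRNA_3": 1900}
    for barcode, change in barcode_log2_fold_change(start, end).items():
        print(barcode, round(change, 2))
```

In this toy example shRNA_1 drops out during growth, suggesting that its target is needed for proliferation, while shRNA_3 is enriched; real screens use replicate measurements and several RNAis per gene before drawing such conclusions.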

Manipulation of gene function by transfection of cDNA libraries into nonmalignant cells also has been used to identify genes that enable the development of malignant phenotypes such as immortalization or colony-forming potential. ¹⁰⁰ Another approach to cancer gene identification takes advantage of the tumorigenic integration of transposons into specific genes in murine model systems. The genomic locations in which transposons integrate are mapped by DNA sequencing approaches. Recurrent sites of integration identify genes that may contribute to tumor formation when activated or inactivated.^84,101
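The idea of recurrent integration sites can be illustrated with a small Python sketch. It assumes that insertion sites have already been mapped to genomic coordinates by sequencing; the gene names, coordinates, tumor identifiers, and the minimum of two tumors are all hypothetical, and real analyses also correct for gene length and local insertion bias, which this sketch does not.

```python
from collections import defaultdict

def recurrent_insertion_genes(insertions, genes, min_tumors=2):
    """Count, per gene, the number of distinct tumors carrying an insertion in it.

    insertions: list of (tumor_id, chromosome, position)
    genes: dict mapping gene name -> (chromosome, start, end), inclusive
    Returns only genes hit in at least min_tumors independent tumors.
    """
    tumors_per_gene = defaultdict(set)
    for tumor_id, chrom, pos in insertions:
        for name, (g_chrom, start, end) in genes.items():
            if chrom == g_chrom and start <= pos <= end:
                tumors_per_gene[name].add(tumor_id)
    return {
        name: len(tumors)
        for name, tumors in tumors_per_gene.items()
        if len(tumors) >= min_tumors
    }

# Hypothetical gene intervals and mapped insertion sites (illustrative only).
genes = {
    "GeneA": ("chr1", 1000, 5000),
    "GeneB": ("chr2", 2000, 3000),
}
insertions = [
    ("tumor1", "chr1", 1500),
    ("tumor1", "chr1", 1600),  # second hit in the same tumor counts once
    ("tumor2", "chr1", 4200),
    ("tumor3", "chr1", 2500),
    ("tumor2", "chr2", 2100),
    ("tumor3", "chr3", 9999),  # falls in no annotated gene
]

recurrent = recurrent_insertion_genes(insertions, genes)
print(recurrent)
```

Counting distinct tumors rather than raw insertions is a deliberate choice here, since multiple insertions within one tumor may reflect a single clonal event.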

Information about gene network function also can be inferred from measurements of responses of well-characterized cancer models to treatment with therapeutic agents that target specific genes or networks. Treating large collections of well-characterized cancer cell lines with compounds, for example, enables links to be established between specific aberrant genes or networks and biological responses using machine learning or pathway-based correlative strategies. The NCI’s Developmental Therapeutics Program pioneered the use of cell lines to link omic features to response by measuring molecular features and responses to more than 100,000 compounds in a collection of about 60 cancer cell lines. ¹⁰² However, the NCI60 panel has limited power to detect subtype-specific responses because of the relatively sparse representation of specific cancer subtypes in the collection. This has led to the development of large collections of cell lines that represent the diversity within individual tumor types.^73,76 The Cancer Cell Line Encyclopedia (CCLE) and Sanger Cancer Cell Line (SCCL) projects have taken this approach to a higher level by assessing associations between genomic features and responses to compounds in collections of approximately 800 cancer cell lines.^77,103 Several studies support the utility of in vitro testing in cell line panels. For example, in vitro model systems accurately show that (1) lung cancers with EGFR mutations respond to gefitinib, ¹⁰⁴ (2) breast cancers with HER2/ERBB2 amplification respond to trastuzumab and/or lapatinib,^76,105 and (3) tumors with mutated or amplified BCR-ABL respond to imatinib mesylate. ¹⁰⁶ Panels of xenografts also are now being developed for this purpose. ¹⁰⁷
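A simple correlative link between one genomic feature and drug response across a cell line panel might be sketched as follows. This is not the method used by any of the cited projects; it is a minimal exact permutation test, assuming a hypothetical panel of eight cell lines, made-up log10 GI50 values (lower means more sensitive), and a hypothetical mutation call for each line.

```python
from itertools import combinations
from statistics import mean

def sensitivity_association(response, mutant_lines):
    """Compare drug response between mutant and wild-type cell lines.

    response: dict mapping cell line name -> log10 GI50 (lower = more sensitive)
    mutant_lines: set of cell line names carrying the aberration
    Returns (difference in means, one-sided exact permutation p-value) for the
    hypothesis that mutant lines are more sensitive than wild-type lines.
    """
    lines = sorted(response)
    mutant = [name for name in lines if name in mutant_lines]
    wild_type = [name for name in lines if name not in mutant_lines]
    observed = mean(response[n] for n in mutant) - mean(response[n] for n in wild_type)

    # Enumerate every way of assigning the "mutant" label to the same number of lines.
    as_extreme = 0
    total = 0
    for group in combinations(lines, len(mutant)):
        in_group = set(group)
        rest = [n for n in lines if n not in in_group]
        diff = mean(response[n] for n in group) - mean(response[n] for n in rest)
        total += 1
        if diff <= observed + 1e-12:  # small tolerance for floating-point ties
            as_extreme += 1
    return observed, as_extreme / total

# Hypothetical panel: four mutant and four wild-type lines (illustrative values only).
response = {
    "line1": -7.2, "line2": -6.9, "line3": -7.5, "line4": -6.8,
    "line5": -5.1, "line6": -5.6, "line7": -4.9, "line8": -5.3,
}
mutant_lines = {"line1", "line2", "line3", "line4"}

difference, p_value = sensitivity_association(response, mutant_lines)
print(difference, p_value)
```

With only a handful of lines per group, the smallest achievable p-value is bounded by the number of possible label assignments, which illustrates why sparse representation of a subtype limits the power of a panel.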

Interaction with the Microenvironment

Much of cancer genomics research focuses on the tumor-intrinsic effects generated by aberrations in the tumors as discussed earlier. However, it is now apparent that the cancer-inducing functions of these aberrations are modified by signals from the microenvironments in which the cancer cells reside. Early research by Bissell and colleagues demonstrated that some extracellular microenvironments can counter the cancer-associated phenotypes generated by genomic aberrations; ¹⁰⁸ Folkman and colleagues demonstrated the key role that angiogenesis plays in cancer progression. ¹⁰⁹ Since then an explosion of research has illuminated many ways in which the microenvironment can affect aspects of cancer progression. These studies of the tumor-microenvironment interaction have been reviewed recently by Coussens and Hanahan. ¹¹⁰ They suggest that three general classes of cells from the microenvironment modulate cancer behavior in important ways: angiogenic vascular cells (AVCs), infiltrating immune cells (IICs), and cancer-associated fibroblastic cells (CAFs), as illustrated in Figure 24-2. They further suggest that these microenvironments influence aspects of cancer cell behavior including proliferation, growth, cell death, replicative immortality, induction of angiogenesis, energy metabolism, invasion, and metastasis. It is also apparent that the microenvironment influences responses to therapeutic agents, for example, by rendering cancer cells dormant so that they do not respond to cell-cycle active agents or by activating signaling pathways that counter the effects of therapy. A challenge for the future will be to determine how the diverse microenvironments experienced by metastatic cells influence the biological behavior of these cells, especially their responses to therapeutic interventions. Several model systems are now being developed to facilitate the study of the effects of the microenvironment on cancers. These include three-dimensional Matrigel cultures,^111,112 two-dimensional systems engineered to carry many different proteins and growth factors from diverse microenvironments,^113,114 xenografts engineered to mirror important aspects of the human stroma, ¹¹⁵ and genetically engineered mice that model specific tumor intrinsic and extrinsic properties. ¹¹⁶

Clinical Applications

Diagnosis and Detection

The manner in which normal tissue changes to malignant at the omic level is now being documented for a variety of cancers by international efforts. These efforts will provide the basis for improved precision in cancer diagnosis and are showing that most tumor types can be divided into subtypes that vary in outcome and often in response to therapy. For example, for more than a decade breast cancers have been treated according to estrogen receptor status and according to whether HER2 is amplified. The advent of transcriptional profiling enabled breast cancers to be divided into six major transcriptional groups,^117,118 and adding information about genome copy number allows the definition of 10 subtypes. ¹¹⁹ Adding information about recurrent mutations or functional mutations will further subdivide these groups. Some of the associations with outcome are so strong that changes in cancer management practices have resulted. For example, several commercial assays that measure expression levels of multiple genes are now marketed to predict therapeutic benefit in breast cancer patients.^120–122 Since then, potentially useful diagnostic signatures have been developed for many cancer types including leukemia^123,124 and colorectal, ¹²⁵ pancreatic, ¹²⁶ and lung cancer. ¹²⁷ More recently, expression levels of noncoding RNAs have been shown to be prognostic in cancers of the colon, ¹²⁸ lung, ¹²⁹ and bladder. ¹³⁰ In some cases, these signatures are cancer type specific and as a result can be used to classify cancers of unknown origin.^131,132 Although most of these diagnostic signatures focus on molecular events that arise in the cancer, some reflect molecular features of the environments in which the tumors reside, for example, molecular signatures that originate in invading immune cells that influence tumor outcome.^133,134

Figure 24-2 Interactions between tumor intrinsic and extrinsic features that influence cancer cell behavior and clinical outcome. Cell image provided by Juha Rantala.

Figure 24-3 Schematic illustration of a genome-based approach to early cancer detection. IHC, Immunohistochemistry; MRI, magnetic resonance imaging; PET, positron emission tomography.

The identification of molecular features that are unique to cancers and associated with poor outcome also provides the basis for the development of assays that may identify cancers at high risk of progressing to metastatic disease before they have metastasized, when they can still be treated successfully. Development of such assays would improve outcomes in patients afflicted with cancers of high metastatic potential and would reduce overtreatment of patients with low propensity for recurrence. These assays likely will be composed of a tiered combination of blood-based, anatomic, or histopathological assays with increasing sensitivity, specificity, and cost, as illustrated in Figure 24-3.
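The arithmetic behind a tiered strategy can be illustrated with Bayes' rule. The sketch below is only a worked example under stated assumptions: the prevalence, sensitivities, and specificities are hypothetical numbers chosen for illustration, and the tiers are assumed to err independently of one another given true disease status, which real assays may not do.

```python
def positive_predictive_value(prevalence, sensitivity, specificity):
    """Probability of disease given a positive test, by Bayes' rule."""
    true_positive = prevalence * sensitivity
    false_positive = (1.0 - prevalence) * (1.0 - specificity)
    return true_positive / (true_positive + false_positive)

def tiered_ppv(prevalence, tiers):
    """Probability of disease after testing positive at each successive tier.

    tiers: list of (sensitivity, specificity), applied only to people who
    tested positive at the previous tier. Assumes test errors are independent
    across tiers given true disease status.
    """
    probabilities = []
    p = prevalence
    for sensitivity, specificity in tiers:
        p = positive_predictive_value(p, sensitivity, specificity)
        probabilities.append(p)
    return probabilities

# Hypothetical values: a low-cost blood assay followed by a more specific imaging assay.
prevalence = 0.001
tiers = [(0.80, 0.90), (0.90, 0.95)]

after_each_tier = tiered_ppv(prevalence, tiers)
print(after_each_tier)
```

In this illustration, an inexpensive first tier enriches the tested population for disease, so that the more costly second tier is applied to fewer people and yields a higher fraction of true positives.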

Blood-based assays to date have focused on the detection of cancer-specific proteins and are low cost but also relatively low in sensitivity and specificity. Assays of prostate-specific antigen (PSA) for prostate cancer and CA-125 for ovarian cancer are prototypical, but omic analyses are now revealing a wide range of cancer-specific changes in gene expression and/or splicing that might increase the specificity of these tests. For example, powerful mass spectrometry techniques and computational analyses of genomic changes are revealing increasing numbers of cancer-specific proteins that may be detected in blood.^135,136 In addition, it is now apparent that the ongoing process of tumor cell death leads to the appearance of tumor DNA fragments or microRNAs in peripheral blood or urine. Some of these tumor-derived DNA fragments carry aberrations such as mutations, translocations, and changes in methylation that are unique or very specific to the tumor. As a consequence, sensitive blood-based assays are now being developed to detect the presence of these sequences as an indication of the presence of cancer. Recent examples include an epigenetic marker panel for detecting lung cancer using cell-free serum DNA, ¹³⁷ analysis of mutations in DNA isolated from plasma and stool of cancer patients,^138,139 detection of translocations as an indication of cancers of the prostate ¹⁴⁰ or ovary, ¹⁴¹ and detection of genome copy-number changes as an indication of the presence of metastatic breast cancer. ¹³⁹

Anatomic strategies that use positron emission tomography (PET) and magnetic resonance imaging (MRI) to detect specific molecular species are now being developed to reveal cancer-specific genomic features. This requires the development of contrast reagents that make tumors and the aberrant microenvironments they produce visible when the tumors are still small and locally contained. ¹⁴² Genome profiling studies are revealing molecular features that are unique to early cancers. A variety of contrast reagents that target these are now being developed. These include reagents for the detection of estrogen receptor ¹⁴³ and PSA; ¹⁴⁴ a range of nanoparticles carrying affinity molecules that detect cancer-associated proteins;^145–147 and reagents that target molecular features of cancer-associated stroma. ¹⁴⁸

Histological assessment of tissue samples taken from cancerous lesions has long been the gold standard for cancer detection and diagnosis. However, routine analyses of tissue sections stained with hematoxylin and eosin (H&E) currently do not provide sufficient information to distinguish between lesions of high and low malignant potential. Genome studies such as those described earlier are increasingly able to define molecular features associated with the most aggressive malignant lesions. This information is fueling the development of multiplex immunohistochemical assays and/or histologically targeted genomic assays that are better able to identify lesions at high risk of progressing.^149,150 These same assays also offer the potential of detecting isolated cancer cells that might be otherwise missed during an assessment of H&E-stained sections.

Therapeutic Targets and Predictive Markers

Discovery of strong driver aberrations that can be attacked with therapeutic benefit was an early motivating factor in the development of international genomics efforts. ¹⁵¹ Early discoveries showed that chronic myelogenous leukemias driven by the BCR-ABL tyrosine kinase could be effectively targeted by imatinib mesylate ¹⁵² and that breast tumors driven by amplification of HER2 could be effectively treated with trastuzumab. ¹⁵³ Table 24-3 ¹⁵⁴ summarizes more recent driver genomic aberrations, the cancers in which they occur, and the successful therapeutic agents that attack them. This list will continue to expand as additional therapeutic agents for recurrent genomic aberrations are tested. Additional genes harboring genomic aberrations for which therapies are now being tested include AKT1, PIK3CA, PTEN, MYC, VHL, and HRAS. ¹⁵¹

These studies are stimulating the development of a wealth of new therapeutic agents. Almost 900 small-molecule inhibitors and biological therapeutics are now under development for the treatment of human malignancies. ¹⁵⁵ These agents range from broad-specificity conventional therapeutics to inhibitors that selectively target specific molecular aberrations and deregulated pathways. The general trend in drug development today is toward pathway-targeted agents. ¹⁵⁶

Table 24-3

Genomic Aberrations, Therapeutic Agents, and Relevant Cancers ¹⁵⁴

ALL, Acute lymphocytic leukemia; AML, acute myeloid leukemia; CML, chronic myelogenous leukemia; GIST, gastrointestinal stromal tumor.

The traditional path to the clinic for new cancer drugs is to test them in phased trials in the metastatic setting, followed by testing in randomized Phase III registration trials in the adjuvant setting. This approach requires a substantial investment in time, number of patients, and money. The U.S. Food and Drug Administration (FDA) has published draft guidance for using pathological complete response in neoadjuvant treatment for accelerated approval in high-risk breast cancer, which would dramatically accelerate the approval process. ¹⁵⁷ Although a step forward, this approach has the weakness that drugs that are effective only in a small population of patients may be discarded because of lack of apparent efficacy. Biomarkers that predict response to therapy would enable identification of these small subpopulations so that they can be targeted early in the clinical trials. As described earlier, this can be accomplished by developing initial insights about subpopulation specificity using preclinical models of aspects of tumor-intrinsic and tumor-extrinsic heterogeneity that influence responses.

It is also becoming clear that specific regulatory pathways can differ among cancer subtypes so that these subtypes respond differently to targeted and nontargeted therapies. It has long been recognized, for example, that estrogen-receptor–positive (ER⁺) breast cancers will respond well to selective estrogen receptor modulators ¹⁵⁸ and that a subset of prostate cancers is responsive to androgen receptor inhibitors. ¹⁵⁹ However, it now appears that most anticancer agents will be preferentially active in cancer subtypes defined according to their genomic characteristics. ⁷³ The explanation for this seems to be that the use of molecular pathways that regulate cell behavior (and response to therapy) differs among subtypes. The TCGA project and other international genomics efforts are defining subtypes in most anatomically defined cancers that can be considered for stratification of therapeutic response. Full use of this information will require the development of approved molecular assays that can stratify patients according to subtype.

Summary

International efforts are now defining the genomic and epigenomic landscapes of most major tumor types. The first set of cross-tumor (a.k.a. “Pan-Cancer”) studies is now emerging to help delineate core and lineage-specific contributors to the disease. ¹⁶⁰ These studies are revealing a few strong driver aberrations in each cancer type and many (sometimes thousands of) aberrations of unknown consequence. Much work remains to determine which of these contribute to the pathophysiology of each cancer type, but it is already clear that these analyses will have a profound effect on the way most cancers are managed. Aspects of cancer management that will benefit include early detection of the most lethal cancers, identification of recurrently aberrant genes and networks for high-priority therapeutic attack, and development of molecular markers that predict response to gene- or network-targeted therapies.

References

1. Bhat K.P. et al. The transcriptional coactivator TAZ regulates mesenchymal differentiation in malignant glioma . Genes Dev . 2011 ; 25 : 2594 – 2609 .

2. Cancer Genome Atlas Research Network . Comprehensive genomic characterization defines human glioblastoma genes and core pathways . Nature . 2008 ; 455 : 1061 – 1068 .

3. Cancer Genome Atlas Research Network . Integrated genomic analyses of ovarian carcinoma . Nature . 2011 ; 474 : 609 – 615 .

4. Cancer Genome Atlas Network . Comprehensive molecular characterization of human colon and rectal cancer . Nature . 2012 ; 487 : 330 – 337 .

5. Cancer Genome Atlas Research Network . Comprehensive genomic characterization of squamous cell lung cancers . Nature . 2012 ; 489 : 519 – 525 .

6. Cancer Genome Atlas Network . Comprehensive molecular portraits of human breast tumours . Nature . 2012 ; 490 : 61 – 70 .

7. Wood L.D. et al. The genomic landscapes of human breast and colorectal cancers . Science . 2007 ; 318 : 1108 – 1113 .

8. Shah S.P. et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers . Nature . 2012 ; 486 : 395 – 399 .

9. Russnes H.G. et al. Genomic architecture characterizes tumor progression paths and fate in breast cancer patients . Sci Transl Med . 2010 ; 2 : 38ra47 .

10. Stephens P.J. et al. Complex landscapes of somatic rearrangement in human breast cancer genomes . Nature . 2009 ; 462 : 1005 – 1010 .

11. Jones S. et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses . Science . 2008 ; 321 : 1801 – 1806 .

12. Prensner J.R. et al. Transcriptome sequencing across a prostate cancer cohort identifies PCAT-1, an unannotated lincRNA implicated in disease progression . Nat Biotechnol . 2011 ; 29 : 742 – 749 .

13. Liu J. et al. Genome and transcriptome sequencing of lung cancers reveal diverse mutational and splicing events . Genome Res . 2012 ; 22 : 2315 – 2327 .

14. Dalgliesh G.L. et al. Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes . Nature . 2010 ; 463 : 360 – 363 .

15. Berger M.F. et al. Melanoma genome sequencing reveals frequent PREX2 mutations . Nature . 2012 ; 485 : 502 – 506 .

16. Chapman M.A. et al. Initial genome sequencing and analysis of multiple myeloma . Nature . 2011 ; 471 : 467 – 472 .

17. Ding L. et al. Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing . Nature . 2012 ; 481 : 506 – 510 .

18. Chin L. , Gray J.W. Translating insights from the cancer genome into clinical practice . Nature . 2008 ; 452 : 553 – 563 .

19. Hanahan D. , Weinberg R.A. Hallmarks of cancer: the next generation . Cell . 2011 ; 144 : 646 – 674 .

20. Kim T.M. et al. Functional genomic analysis of chromosomal aberrations in a compendium of 8000 cancer genomes . Genome Res . 2013 ; 23 : 217 – 227 .

21. Curtis C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups . Nature . 2012 ; 486 : 346 – 352 .

22. Pugh T.J. et al. The genetic landscape of high-risk neuroblastoma . Nat Genet . 2013 ; 45 : 279 – 284 .

23. Hodis E. et al. A landscape of driver mutations in melanoma . Cell . 2012 ; 150 : 251 – 263 .

24. Chin K. et al. In situ analyses of genome instability in breast cancer . Nat Genet . 2004 ; 36 : 984 – 988 .

25. Artandi S.E. et al. Telomere dysfunction promotes non-reciprocal translocations and epithelial cancers in mice . Nature . 2000 ; 406 : 641 – 645 .

26. You J.S. , Jones P.A. Cancer genetics and epigenetics: two sides of the same coin? Cancer Cell . 2012 ; 22 : 9 – 20 .

27. Sharpless N.E. , DePinho R.A. p53: good cop/bad cop . Cell . 2002 ; 110 : 9 – 12 .

28. Venkatesan R.N. , Bielas J.H. , Loeb L.A. Generation of mutator mutants during carcinogenesis . DNA Repair (Amst) . 2006 ; 5 : 294 – 302 .

29. Ting A.H. , McGarvey K.M. , Baylin S.B. The cancer epigenome—components and functional correlates . Genes Dev . 2006 ; 20 : 3215 – 3231 .

30. Davies H. et al. Somatic mutations of the protein kinase gene family in human lung cancer . Cancer Res . 2005 ; 65 : 7591 – 7595 .

31. Pleasance E.D. et al. A small-cell lung cancer genome with complex signatures of tobacco exposure . Nature . 2010 ; 463 : 184 – 190 .

32. Yachida S. et al. Distant metastasis occurs late during the genetic evolution of pancreatic cancer . Nature . 2010 ; 467 : 1114 – 1117 .

33. Durinck S. et al. Temporal dissection of tumorigenesis in primary cancers . Cancer Discov . 2011 ; 1 : 137 – 143 .

34. Vogelstein B. et al. Genetic alterations during colorectal-tumor development . N Engl J Med . 1988 ; 319 : 525 – 532 .

35. Attolini C.S. et al. A mathematical framework to determine the temporal sequence of somatic genetic events in cancer . Proc Natl Acad Sci U S A . 2010 ; 107 : 17604 – 17609 .

36. Cheng Y.K. et al. A mathematical methodology for determining the temporal order of pathway alterations arising during gliomagenesis . PLoS Comput Biol . 2012 ; 8 : e1002337 .

37. Nik-Zainal S. et al. The life history of 21 breast cancers . Cell . 2012 ; 149 : 994 – 1007 .

38. Fullgrabe J. , Kavanagh E. , Joseph B. Histone onco-modifications . Oncogene . 2011 ; 30 : 3391 – 3403 .

39. van Engeland M. , Derks S. , Smits K.M. , Meijer G.A. , Herman J.G. Colorectal cancer epigenetics: complex simplicity . J Clin Oncol . 2011 ; 29 : 1382 – 1391 .

40. Luijsterburg M.S. , van Attikum H. Chromatin and the DNA damage response: the cancer connection . Mol Oncol . 2011 ; 5 : 349 – 367 .

41. Li H. , Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform . Bioinformatics . 2009 ; 25 : 1754 – 1760 .

42. Abecasis G.R. et al. An integrated map of genetic variation from 1,092 human genomes . Nature . 2012 ; 491 : 56 – 65 .

43. Ye K. , Schulz M.H. , Long Q. , Apweiler R. , Ning Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads . Bioinformatics . 2009 ; 25 : 2865 – 2871 .

44. Chen K. et al. BreakDancer: an algorithm for high-resolution mapping of genomic structural variation . Nat Methods . 2009 ; 6 : 677 – 681 .

45. Rausch T. et al. DELLY: structural variant discovery by integrated paired-end and split-read analysis . Bioinformatics . 2012 ; 28 : i333 – i339 .

46. Trapnell C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks . Nat Protoc . 2012 ; 7 : 562 – 578 .

47. Weng L. et al. MicroRNA profiling of clear cell renal cell carcinoma by whole-genome small RNA deep sequencing of paired frozen and formalin-fixed, paraffin-embedded tissue specimens . J Pathol . 2010 ; 222 : 41 – 51 .

48. Lee C. , Kikyo N. Strategies to identify long noncoding RNAs involved in gene regulation . Cell Biosci . 2012 ; 2 : 37 .

49. Liu L. , De S. , Michor F. DNA replication timing and higher-order nuclear organization determine single-nucleotide substitution patterns in cancer genomes . Nat Commun . 2013 ; 4 : 1502 .

50. Olshen A.B. , Venkatraman E.S. , Lucito R. , Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data . Biostatistics . 2004 ; 5 : 557 – 572 .

51. Diskin S.J. et al. STAC: A method for testing the significance of DNA copy number aberrations across multiple array-CGH experiments . Genome Res . 2006 ; 16 : 1149 – 1158 .

52. Beroukhim R. et al. The landscape of somatic copy-number alteration across human cancers . Nature . 2010 ; 463 : 899 – 905 .

53. Volik S. et al. End-sequence profiling: sequence-based analysis of aberrant genomes . Proc Natl Acad Sci U S A . 2003 ; 100 : 7696 – 7701 .

54. Cibulskis K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples . Nat Biotechnol . 2013 ; 31 : 213 – 219 .

55. Gonzalez-Perez A. , Lopez-Bigas N. Functional impact bias reveals cancer drivers . Nucleic Acids Res . 2012 ; 40 : e169 .

56. Huang da W. , Sherman B.T. , Lempicki R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources . Nat Protoc . 2009 ; 4 : 44 – 57 .

57. Subramanian A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles . Proc Natl Acad Sci U S A . 2005 ; 102 : 15545 – 15550 .

58. Page L. , Brin S. , Motwani R. , Winograd T. The PageRank citation ranking: bringing order to the web . http://infolab.stanford.edu/~backrub/pageranksub.ps . 1999 .

59. Tarca A.L. et al. A novel signaling pathway impact analysis . Bioinformatics . 2009 ; 25 : 75 – 82 .

60. Lefebvre C. et al. A human B-cell interactome identifies MYB and FOXM1 as master regulators of proliferation in germinal centers . Mol Syst Biol . 2010 ; 6 : 377 .

61. Morrison J.L. , Breitling R. , Higham D.J. , Gilbert D.R. GeneRank: using search engine technology for the analysis of microarray experiments . BMC Bioinformatics . 2005 ; 6 : 233 .

62. REFERENCE DELETED IN PROOFS.

63. Chatr-Aryamontri R. et al. The BioGRID interaction database: 2013 update . Nucleic Acids Res . 2013 ; 41 : D816 – D823 .

64. Mathivanan S. et al. Human Proteinpedia enables sharing of human protein data . Nat Biotechnol . 2008 ; 26 : 164 – 167 .

65. Turner B. et al. iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence . Database (Oxford) . 2010 ; 2010 : baq023 .

66. Franceschini A. et al. STRING v9.1: protein-protein interaction networks, with increased coverage and integration . Nucleic Acids Res . 2013 ; 41 : D808 – D815 .

67. Matthews L. et al. Reactome knowledgebase of human biological pathways and processes . Nucleic Acids Res . 2009 ; 37 : D619 – D622 .

68. Schaefer C.F. et al. PID: the Pathway Interaction Database . Nucleic Acids Res . 2009 ; 37 : D674 – D679 .

69. Vandin F. , Upfal E. , Raphael B.J. Algorithms for detecting significantly mutated pathways in cancer . J Comput Biol . 2011 ; 18 : 507 – 522 .

69a. Cancer Genome Atlas Research Network . Comprehensive molecular characterization of clear cell renal cell carcinoma . Nature . 2013 ; 499 : 43 – 49 .

70. Ciriello G. , Cerami E. , Sander C. , Schultz N. Mutual exclusivity analysis identifies oncogenic network modules . Genome Res . 2012 ; 22 : 398 – 406 .

71. Vandin F. , Upfal E. , Raphael B.J. De novo discovery of mutated driver pathways in cancer . Genome Res . 2012 ; 22 : 375 – 385 .

72. Vaske C.J. et al. Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM . Bioinformatics . 2010 ; 26 : i237 – i245 .

73. Heiser L.M. et al. Subtype and pathway specific responses to anticancer compounds in breast cancer . Proc Natl Acad Sci U S A . 2012 ; 109 : 2724 – 2729 .

74. Ng S. et al. PARADIGM-SHIFT predicts the function of mutations in multiple cancers using pathway impact analysis . Bioinformatics . 2012 ; 28 : i640 – i646 .

75. Lavi O. , Dror G. , Shamir R. Network-induced classification kernels for gene expression profile analysis . J Comput Biol . 2012 ; 19 : 694 – 709 .

76. Neve R.M. et al. A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes . Cancer Cell . 2006 ; 10 : 515 – 527 .

77. Barretina J. et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity . Nature . 2012 ; 483 : 603 – 607 .

78. Loriaux M.M. et al. High-throughput sequence analysis of the tyrosine kinome in acute myeloid leukemia . Blood . 2008 ; 111 : 4788 – 4796 .

79. Warmuth M. , Kim S. , Gu X.J. , Xia G. , Adrian F. Ba/F3 cells and their use in kinase drug discovery . Curr Opin Oncol . 2007 ; 19 : 55 – 60 .

80. Kerbel R.S. Human tumor xenografts as predictive preclinical models for anticancer drug activity in humans: better than commonly perceived-but they can be improved . Cancer Biol Ther . 2003 ; 2 : S134 – S139 .

81. Sausville E.A. , Burger A.M. Contributions of human tumor xenografts to anticancer drug development . Cancer Res . 2006 ; 66 : 3351 – 3354 discussion 3354 .

82. Chen Z. et al. A murine lung cancer co-clinical trial identifies genetic modifiers of therapeutic response . Nature . 2012 ; 483 : 613 – 617 .

83. Nardella C. , Lunardi A. , Patnaik A. , Cantley L.C. , Pandolfi P.P. The APL paradigm and the “co-clinical trial” project . Cancer Discov . 2011 ; 1 : 108 – 116 .

84. Copeland N.G. , Jenkins N.A. Harnessing transposons for cancer gene discovery . Nat Rev Cancer . 2010 ; 10 : 696 – 706 .

85. Chang K. , Marran K. , Valentine A. , Hannon G.J. RNAi in cultured mammalian cells using synthetic siRNAs . Cold Spring Harb Protoc . 2012 : 957 – 961 .

86. Dow L.E. et al. A pipeline for the generation of shRNA transgenic mice . Nat Protoc . 2012 ; 7 : 374 – 393 .

87. Muerdter F. et al. Production of artificial piRNAs in flies and mice . RNA . 2012 ; 18 : 42 – 52 .

88. Silva J.M. et al. Second-generation shRNA libraries covering the mouse and human genomes . Nat Genet . 2005 ; 37 : 1281 – 1288 .

89. Berns K. et al. A large-scale RNAi screen in human cells identifies new components of the p53 pathway . Nature . 2004 ; 428 : 431 – 437 .

90. Brummelkamp T.R. , Bernards R. , Agami R. A system for stable expression of short interfering RNAs in mammalian cells . Science . 2002 ; 296 : 550 – 553 .

91. Sims D. et al. High-throughput RNA interference screening using pooled shRNA libraries and next generation sequencing . Genome Biol . 2011 ; 12 : R104 .

92. Cheung H.W. et al. Systematic investigation of genetic vulnerabilities across cancer cell lines reveals lineage-specific dependencies in ovarian cancer . Proc Natl Acad Sci U S A . 2011 ; 108 : 12372 – 12377 .

93. Rantala J.K. et al. A cell spot microarray method for production of high density siRNA transfection microarrays . BMC Genomics . 2011 ; 12 : 162 .

94. Bjorkman M. et al. Systematic knockdown of epigenetic enzymes identifies a novel histone demethylase PHF8 overexpressed in prostate cancer with an impact on cell proliferation, migration and invasion . Oncogene . 2012 ; 31 : 3444 – 3456 .

95. Krausz E. , Korn K. High-content siRNA screening for target identification and validation . Expert Opin Drug Discov . 2008 ; 3 : 551 – 564 .

96. Krausz E. High-content siRNA screening . Mol Biosyst . 2007 ; 3 : 232 – 240 .

97. Neumann B. et al. High-throughput RNAi screening by time-lapse imaging of live human cells . Nat Methods . 2006 ; 3 : 385 – 390 .

98. Rehman F.L. , Lord C.J. , Ashworth A. Synthetic lethal approaches to breast cancer therapy . Nat Rev Clin Oncol . 2010 ; 7 : 718 – 724 .

99. Turner N.C. et al. A synthetic lethal siRNA screen identifying genes mediating sensitivity to a PARP inhibitor . EMBO J . 2008 ; 27 : 1368 – 1377 .

100. Wan D. et al. Large-scale cDNA transfection screening for genes related to cancer development and progression . Proc Natl Acad Sci U S A . 2004 ; 101 : 15724 – 15729 .

101. Copeland N.G. , Jenkins N.A. Deciphering the genetic landscape of cancer—from genes to pathways . Trends Genet . 2009 ; 25 : 455 – 462 .

102. Bussey K.J. et al. Integrating data on DNA copy number with gene expression levels and drug sensitivities in the NCI-60 cell line panel . Mol Cancer Ther . 2006 ; 5 : 853 – 867 .

103. Garnett M.J. et al. Systematic identification of genomic markers of drug sensitivity in cancer cells . Nature . 2012 ; 483 : 570 – 575 .

104. Paez J.G. et al. EGFR mutations in lung cancer: correlation with clinical response to gefitinib therapy . Science . 2004 ; 304 : 1497 – 1500 .

105. Konecny G.E. et al. Activity of the dual kinase inhibitor lapatinib (GW572016) against HER-2-overexpressing and trastuzumab-treated breast cancer cells . Cancer Res . 2006 ; 66 : 1630 – 1639 .

106. Scappini B. et al. Changes associated with the development of resistance to imatinib (STI571) in two leukemia cell lines expressing p210 Bcr/Abl protein . Cancer . 2004 ; 100 : 1459 – 1471 .

107. Julien S. et al. Characterization of a large panel of patient-derived tumor xenografts representing the clinical heterogeneity of human colorectal cancer . Clin Cancer Res . 2012 ; 18 : 5314 – 5328 .

108. Rizki A. et al. A human breast cell model of preinvasive to invasive transition . Cancer Res . 2008 ; 68 : 1378 – 1387 .

109. Kieran M.W. , Kalluri R. , Cho Y.J. The VEGF pathway in cancer and disease: responses, resistance, and the path forward . Cold Spring Harb Perspect Med . 2012 ; 2 : a006593 . doi: 10.1101/cshperspect.a006593 .

110. Hanahan D. , Coussens L.M. Accessories to the crime: functions of cells recruited to the tumor microenvironment . Cancer Cell . 2012 ; 21 : 309 – 322 .

111. Nelson C.M. , Bissell M.J. Modeling dynamic reciprocity: engineering three-dimensional culture models of breast architecture, function, and neoplastic transformation . Semin Cancer Biol . 2005 ; 15 : 342 – 352 .

112. Debnath J. , Brugge J.S. Modelling glandular epithelial cancers in three-dimensional cultures . Nat Rev Cancer . 2005 ; 5 : 675 – 688 .

113. Lin C.H. , Lee J.K. , LaBarge M.A. Fabrication and use of microenvironment microarrays (MEArrays) . J Vis Exp . 2012 ( 68 ) .

114. Wilson T.R. et al. Widespread potential for growth-factor-driven resistance to anticancer kinase inhibitors . Nature . 2012 ; 487 : 505 – 509 .

115. Kuperwasser C. et al. Reconstruction of functionally normal and malignant human breast tissues in mice . Proc Natl Acad Sci U S A . 2004 ; 101 : 4966 – 4971 .

116. Marks C. Mouse Models of Human Cancers Consortium (MMHCC) from the NCI . Dis Model Mech . 2009 ; 2 : 111 .

117. Prat A. , Parker J.S. , Fan C. , Perou C.M. PAM50 assay and the three-gene model for identifying the major and clinically relevant molecular subtypes of breast cancer . Breast Cancer Res Treat . 2012 ; 135 : 301 – 306 .

118. Perou C.M. et al. Molecular portraits of human breast tumours . Nature . 2000 ; 406 : 747 – 752 .

119. Stephens P.J. et al. The landscape of cancer genes and mutational processes in breast cancer . Nature . 2012 ; 486 : 400 – 404 .

120. NSABP study confirms oncotype DX predicts chemotherapy benefit in breast cancer patients . Oncology (Williston Park) . 2006 ; 20 : 789 – 790 .

121. Ross J.S. , Hatzis C. , Symmans W.F. , Pusztai L. , Hortobagyi G.N. Commercialized multigene predictors of clinical outcome for breast cancer . Oncologist . 2008 ; 13 : 477 – 493 .

122. Glas A.M. et al. Converting a breast cancer microarray signature into a high-throughput diagnostic test . BMC Genomics . 2006 ; 7 : 278 .

123. Dave S.S. et al. Molecular diagnosis of Burkitt’s lymphoma . N Engl J Med . 2006 ; 354 : 2431 – 2442 .

124. Litvinov I.V. , Jones D.A. , Sasseville D. , Kupper T.S. Transcriptional profiles predict disease outcome in patients with cutaneous T-cell lymphoma . Clin Cancer Res . 2010 ; 16 : 2106 – 2114 .

125. Nannini M. et al. Gene expression profiling in colorectal cancer using microarray technologies: results and perspectives . Cancer Treat Rev . 2009 ; 35 : 201 – 209 .

126. Collisson E.A. et al. Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy . Nat Med . 2011 ; 17 : 500 – 503 . doi: 10.1038/nm.2344 .

127. Roepman P. et al. An immune response enriched 72-gene prognostic profile for early-stage non-small-cell lung cancer . Clin Cancer Res . 2009 ; 15 : 284 – 290 .

128. Chen H.Y. et al. miR-103/107 promote metastasis of colorectal cancer by targeting the metastasis suppressors DAPK and KLF4 . Cancer Res . 2012 ; 72 : 3631 – 3641 .

129. Boeri M. , Pastorino U. , Sozzi G. Role of microRNAs in lung cancer: microRNA signatures in cancer prognosis . Cancer J . 2012 ; 18 : 268 – 274 .

130. Rosenberg E. et al. Predicting progression of bladder urothelial carcinoma using microRNA expression . BJU Int . 2013 ; 112 : 1027 – 1034 .

131. Varadhachary G.R. et al. Prospective gene signature study using microRNA to identify the tissue of origin in patients with carcinoma of unknown primary . Clin Cancer Res . 2011 ; 17 : 4063 – 4070 .

132. Bender R.A. , Erlander M.G. Molecular classification of unknown primary cancer . Semin Oncol . 2009 ; 36 : 38 – 43 .

133. Hsu D.S. et al. Immune signatures predict prognosis in localized cancer . Cancer Invest . 2010 ; 28 : 765 – 773 .

134. DeNardo D.G. et al. Leukocyte complexity predicts breast cancer survival and functionally regulates response to chemotherapy . Cancer Discov . 2011 ; 1 : 54 – 67 .

135. Anderson N.L. et al. A human proteome detection and quantitation project . Mol Cell Proteomics . 2009 ; 8 : 883 – 886 .

136. Drake P.M. et al. Lectin chromatography/mass spectrometry discovery workflow identifies putative biomarkers of aggressive breast cancers . J Proteome Res . 2012 ; 11 : 2508 – 2520 .

137. Carvalho A.L. et al. Detection of promoter hypermethylation in salivary rinses as a biomarker for head and neck squamous cell carcinoma surveillance . Clin Cancer Res . 2011 ; 17 : 4782 – 4789 .

138. Diehl F. et al. Analysis of mutations in DNA isolated from plasma and stool of colorectal cancer patients . Gastroenterology . 2008 ; 135 : 489 – 498 .

139. Forshew T. et al. Noninvasive identification and monitoring of cancer mutations by targeted deep sequencing of plasma DNA . Sci Transl Med . 2012 ; 4 : 136ra168 .

140. Morris D.S. , Tomlins S.A. , Montie J.E. , Chinnaiyan A.M. The discovery and application of gene fusions in prostate cancer . BJU Int . 2008 ; 102 : 276 – 282 .

141. Salzman J. et al. ESRRA-C11orf20 is a recurrent gene fusion in serous ovarian carcinoma . PLoS Biol . 2011 ; 9 : e1001156 .

142. Nahrendorf M. et al. Hybrid PET-optical imaging using targeted probes . Proc Natl Acad Sci U S A . 2010 ; 107 : 7910 – 7915 .

143. Lee J.H. et al. Synthesis and biological evaluation of two agents for imaging estrogen receptor beta by positron emission tomography: challenges in PET imaging of a low abundance target . Nucl Med Biol . 2012 ; 39 : 1105 – 1116 .

144. Evans M.J. et al. Noninvasive measurement of androgen receptor signaling with a positron-emitting radiopharmaceutical that targets prostate-specific membrane antigen . Proc Natl Acad Sci U S A . 2011 ; 108 : 9578 – 9582 .

145. Condeelis J. , Weissleder R. In vivo imaging in cancer . Cold Spring Harb Perspect Biol . 2010 ; 2 : a003848 .

146. Holland J.P. et al. Annotating MYC status with 89Zr-transferrin imaging . Nat Med . 2012 ; 18 : 1586 – 1591 .

147. Jagoda E.M. et al. Immuno-PET of the hepatocyte growth factor receptor Met using the 1-armed antibody onartuzumab . J Nucl Med . 2012 ; 53 : 1592 – 1600 .

148. Nabavizadeh N. et al. Topographic enhancement mapping of the cancer-associated breast stroma using breast MRI . Integr Biol (Camb) . 2011 ; 3 : 490 – 496 .

149. Cuzick J. et al. Prognostic value of a combined estrogen receptor, progesterone receptor, Ki-67, and human epidermal growth factor receptor 2 immunohistochemical score and comparison with the Genomic Health recurrence score in early breast cancer . J Clin Oncol . 2011 ; 29 : 4273 – 4278 .

150. Bordeaux J.M. et al. Quantitative in situ measurement of estrogen receptor mRNA predicts response to tamoxifen . PLoS One . 2012 ; 7 : e36559 .

151. Dancey J.E. , Bedard P.L. , Onetto N. , Hudson T.J. The genetic basis for cancer treatment decisions . Cell . 2012 ; 148 : 409 – 420 .

152. Druker B.J. et al. Efficacy and safety of a specific inhibitor of the BCR-ABL tyrosine kinase in chronic myeloid leukemia . N Engl J Med . 2001 ; 344 : 1031 – 1037 .

153. Pegram M.D. et al. Phase II study of receptor-enhanced chemosensitivity using recombinant humanized anti-p185HER2/neu monoclonal antibody plus cisplatin in patients with HER2/neu-overexpressing metastatic breast cancer refractory to chemotherapy treatment . J Clin Oncol . 1998 ; 16 : 2659 – 2671 .

154. Mills G.B. An emerging toolkit for targeted cancer therapies . Genome Res . 2012 ; 22 : 177 – 182 .

155. Castellan J. http://www.phrma.org/sites/default/files/1000/medicinesindevelopmentcancer2011_0.pdf 2011 .

156. Sawyers C. Targeted cancer therapy . Nature . 2004 ; 432 : 294 – 297 .

157. Prowell T.M. , Pazdur R. Pathological complete response and accelerated drug approval in early breast cancer . N Engl J Med . 2012 ; 366 : 2438 – 2441 .

158. Peng J. , Sengupta S. , Jordan V.C. Potential of selective estrogen receptor modulators as treatments and preventives of breast cancer . Anticancer Agents Med Chem . 2009 ; 9 : 481 – 499 .

159. Friedlander T.W. , Ryan C.J. Targeting the androgen receptor . Urol Clin North Am . 2012 ; 39 : 453 – 464 .

160. PMID:24071849.
