Cancer genomics and evolution
William P. D. Hendricks, PhD Aleksandar Sekulic, MD, PhD Alan H. Bryce, MD Muhammed Murtaza, MBBS, PhD Pilar Ramos, PhD Jeffrey M. Trent, PhD
Overview
Over 100 years ago, the Nobel Prize in Physiology or Medicine was given to Paul Ehrlich for postulating that “magic bullets” could specifically target and kill cells such as cancer cells based on their unique molecular features. The completed Human Genome Project and the cancer genomics revolution have now mapped the specific genetic changes underlying unique features of many common malignancies. However, although cancer has long been recognized to be heterogeneous in its clinical presentation, course, and pathology, we now recognize that it is far more molecularly heterogeneous than anticipated and that such variability will require an individualized approach to patient care. Rather than a single magic bullet, we will need an arsenal and this arsenal will require precise delivery. While inter- and intratumoral genomic heterogeneity present significant challenges to cancer management and drug discovery, a number of developments foster hope for accelerated progress in the war on cancer. First, our catalog of genomic targets underlying diverse cancers is rapidly growing. Second, we are beginning to understand how diverse mutations converge on a small number of druggable pathways. Third, we continue to develop drugs and biologic agents (e.g., immune checkpoint inhibitors) that target an increasingly wide array of genomic subtypes of cancer. Finally, advances in next-generation sequencing technologies now enable far earlier detection of disease recurrence than ever before. This chapter will focus on these developments and how their integration is aiding in the precise delivery of “magic bullets.”
Precis
Cancer is a genetic disease in which the stepwise accumulation of oncogene and tumor suppressor gene (TSG) mutations in clonally expanding cell populations drives malignant transformation. Throughout the twentieth century, overwhelming evidence steadily accumulated in support of this model while a growing understanding of the mutations underlying cancer subtypes helped to guide innovation in clinical cancer management. The genomics revolution begun in the twenty-first century has now transformed our understanding of cancer by generating exhaustive catalogs of cancer-driving mutations and identifying the great genomic diversity existing between and within individual tumors. These data are enabling targeted drug development, genomics-guided clinical management, and new approaches to early disease detection. Yet, gaps nonetheless remain in our knowledge of the causative mutations underlying some cancers, in our understanding of the biology of the mutations we have identified, in developing drugs capable of targeting many of these diverse mutations, and in fending off the inevitable emergence of drug resistance. In this chapter, we review the history and methods of cancer genetics and genomics, summarize current knowledge of the spectrum of mutations in cancers, discuss emerging concepts in cancer evolution and genomic heterogeneity, and present a case study on the clinical impact of melanoma genomics.
Introduction
As detailed in the previous chapters, cancer is a genetic disease resulting from the stepwise accumulation of mutations that confer upon cells a selective growth advantage.1 Such mutations alter the birth and death rate as well as the genomic stability of cells as they undergo waves of clonal expansion. With the accumulation of mutations in cancer genes, cell populations acquire increasingly malignant phenotypes in successive generations. These hallmark cancer phenotypes include sustained proliferation, evasion of growth suppression, evasion of the immune system, promotion of inflammation, activation of invasion and metastasis, induction of genomic instability, immortal replication, induction of angiogenesis, deregulation of cellular energetics, and resistance to cell death.2 Cancer genes in which mutation confers a growth advantage are classified as oncogenes or TSGs. Oncogenes, when activated through mutation in recurrent hotspots, drive constitutive activation of cell signaling pathways accelerating the cell birth rate. TSGs, when mutationally inactivated, are no longer able to promote cell death and thereby decrease the cell death rate. This model has been well characterized in colorectal cancer in which mutations in the APC tumor suppressor have been shown in most cases to drive adenoma formation. Additional mutations in cancer genes comprising proliferative, cell cycle, and apoptosis pathways then result in the formation of malignant tumors capable of invasion and metastasis.3, 4 Multiple other cancers have also been shown to follow this model, although we now know that there are many disparate evolutionary paths to malignancy even within the same cancer type (Figure 1).
While some mutations in cancer genes occur in the germ line and are heritable, the majority (90%) arise sporadically in somatic tissues over an individual’s lifetime (i.e., they are tumor specific) owing to replication error, genotoxic stress, and/or environmental damage.5 These mutations may result from subtle sequence alterations (single-base substitutions and insertions or deletions of one or a few bases), changes in chromosome copy number (amplification, deletion, chromosome loss, or duplication), or changes in chromosome structure (inter- and intrachromosomal translocation, inversion, or other types of rearrangement). For the remainder of this chapter, we will refer to these alterations as somatic single nucleotide variants (SNVs), copy number variants (CNVs), or structural variants (SVs). Classic examples of SNVs, CNVs, and SVs are shown in Figure 2 and include an activating missense SNV in NRAS (one of the most commonly mutated oncogenes), an inactivating homozygous deletion CNV in one of the most commonly inactivated tumor suppressors TP53, and an activating translocation SV involving BCR and ABL. Epigenomic modifications of cancer genes can also significantly alter gene function. These modifications of DNA, while not altering the DNA sequence itself, can include DNA methylation changes, chromatin modifications, or noncoding RNA-dependent gene regulation that may be heritable and play a role in tumorigenesis. This increasingly important area of study will be discussed in other chapters. We will focus here on DNA sequence-altering mutations.6–9 In this chapter, we use specific examples to highlight the state of cancer genomics knowledge by describing current methods in genomic analysis, compiling results from landmark cancer genomic studies, and assessing the role of cancer evolution in determining its clinical management. We then conclude with a detailed case study of melanoma genomics, evolution, and genomics-guided medicine.
The history and methods of cancer genomics
The early history of cancer genetics
The elucidation of cancer’s genetic etiology and breadth of its diversity has paralleled seminal advances in genetics and genetic technology (Figure 3a). Many landmark studies in the late nineteenth and early twentieth century laid the groundwork for cancer genomics—development of the theory of evolution by natural selection,10 discovery of heritable biological units,11 identification of chromosomes,12 chromosomal heredity13, 14 and chromosomal genes,15 and modern evolutionary synthesis linking biology, genetics, and evolutionary theory.16 The subsequent discovery of DNA’s structure then provided a framework for understanding the genetic code and the mechanism for molecular transmission of genetic information.17 Meanwhile, reports of heritable breast and colorectal cancers had arisen18 and cancer was increasingly understood to be able to be caused by chemical agents such as coal tar19 and cigarette smoke,20 physical agents such as radiation,21 and biological agents such as viruses.22 Discovery that these carcinogenic agents were also mutagenic drove speculation that cancer was a disease of mutation. Ultimately, although cancer had been suggested by some to be a chromosomal disease or a disease of mutation, the concept of cancer as a genetic disease did not gain traction until later in the twentieth century when cytogenetic and sequencing technology alongside experimental transformation of normal cell lines with oncogenes enabled assessment of cancer gene mutations at the molecular level and establishment of their causal role in tumorigenesis. Some of these landmark studies are shown in Figure 3a.
Molecular cytogenetics: Identification of recurrent chromosomal alterations in cancer
Mid-twentieth century developments in the field of cytogenetics such as chromosome staining in fixed leukocytes from peripheral blood finally enabled empirical observation of the chromosomal complement of a cell.23, 24 Using this approach, Nowell was the first to discover a specific recurrent genetic change associated with a cancer type—the “Philadelphia chromosome” fragment occurring in nearly all evaluated cases of chronic myelogenous leukemia (CML).25 Over a decade later, Rowley then used chromosome banding to map the components of this fragment, determining the Philadelphia chromosome to be a balanced reciprocal translocation between chromosomes 9 and 22.26, 27 These discoveries drove a search for recurrent structural aberrations both within and across cancer types. They led to identification of characteristic translocations in sarcomas, leukemias, and lymphomas. However, most carcinomas were found to possess few recurrent alterations, instead displaying great variety in the number and type of chromosome aberrations.28
After Nowell and Rowley’s discoveries, rapid innovation in chromosomal banding and spectral karyotyping (SKY) allowed progressively detailed views of chromosome aberrations in cancer cells using techniques based on fluorescence in situ hybridization (FISH), which relies on fluorescent DNA probes that hybridize to specific chromosomal regions.29–38 Approaches for genome-wide cytogenetic analysis have also been developed including SKY,29 multiplex-fluorescence in situ hybridization (M-FISH),39 chromosome microdissection,38 and comparative genomic hybridization (CGH).40 CGH is a particularly powerful fluorescent molecular cytogenetic technique for screening genome-wide chromosomal copy number changes in tumor genomes.
Novel approaches to printing nucleic acids on slides (microarrays) soon revolutionized molecular cytogenetics by reducing cost and increasing throughput. Array comparative genomic hybridization (aCGH) originally involved spotting large probes (large-insert clones or artificial chromosomes)41, 42 onto slides in intervals (1–3 Mb resolution) across chromosomal regions. Now, high-resolution genome-wide platforms consisting of small, customizable oligonucleotide probes43, 44 have demonstrated increased throughput and sensitivity in characterizing cancers such as breast,45–47 melanoma,48–50 B-cell lymphoma,51, 52 and many others. Similarly, oligonucleotide arrays designed to genotype thousands of single nucleotide polymorphisms (SNPs) are useful for characterizing tumor genome complexity with rapid genotyping of over millions of SNPs now possible (Affymetrix, Illumina). Similar to CGH arrays, SNP arrays quantitate locus-specific hybridization signal and can be used to estimate copy number or loss of heterozygosity (LOH).53–55 Notable cancer studies incorporating SNP arrays include genotyping of the NCI-60 cell line panel56 as well as lung cancer,57 AML,55, 58 neuroblastoma,59 melanoma,56, 60, 61 basal cell carcinoma,54 breast cancer,62 colorectal cancer,62–64 glioblastoma,65, 66 and pancreatic cancer.67
Both SVs and CNVs can be characterized using the tools of molecular cytogenetics and cytogenomics. Translocations are some of the most common SVs observed in cancer and, as in the case of the Philadelphia Chromosome in CML, are signature aberrations for many liquid cancers and sarcomas. They involve the movement of a chromosomal segment from one position to another, either within the same chromosome (intrachromosomal) or a different chromosome (interchromosomal). Other types of SV include chromosomal deletions, duplications, inversions, insertions, rings, and isochromosomes. These SVs may result from errors in double-stranded break (DSB) repair or other means of intra- or interchromosomal recombination.68 Translocations and other SVs can either activate oncogenes (such as when the ABL tyrosine kinase is constitutively activated by fusion with the BCR gene) or inactivate TSGs (such as when translocations involving the TP53 tumor suppressor interrupt the gene and lead to loss of function). CNVs involving a gain or loss of an entire chromosome are some of the simplest and most common chromosomal aberrations. They result from defective chromosomal segregation. The functional consequence of CNVs can be difficult to establish because the aberrations may extend over tens of thousands of megabases and may affect hundreds to thousands of genes. It has been easier to establish the cancer relevance of more limited regions of chromosomal gain and loss, created by amplification or deletion, as these smaller aberrations have been shown to alter the dosage of known oncogenes or TSGs. Classic examples of oncogene amplification in solid tumors discovered through molecular cytogenetics include ERBB2 in breast cancers and MYC in a variety of tumor types, while loss of specific regions of the genome is often associated with loss of TSGs such as TP53, RB1, PTEN, and CDKN2A. Such deletions of TSGs are critical for promotion of tumorigenesis both in cases of heritable cancers in which germline mutation of a single TSG allele must be coupled with loss of the wild-type allele as well as in sporadic cancers in which both alleles must be inactivated.
Molecular genetics: Identification of recurrent sequence alterations in cancer
While some molecular cytogenetic techniques are capable of detecting chromosomal breakpoints at base pair resolution, advances in molecular genetics were required to allow detection of subtle sequence changes in DNA itself. These advances were enabled by developments in molecular cloning and invention of the polymerase chain reaction (PCR) capable of generating the millions of DNA copies necessary for detection of these mutations.69 Coupled with the Sanger sequencing method based on dideoxynucleotide chain termination and agarose gel electrophoresis,70 progressive advances in this methodology, including fluorescently labeled nucleotides, capillary electrophoresis, paired-end sequencing, shotgun sequencing, and improved laboratory automation have allowed scaling to sequence entire genomes. This technology enabled the Human Genome Project (HGP), launched in 1990 as a $3 billion dollar publicly funded international collaboration. The rough draft of the human genome, covering 94% of the genome, was published in 200171, 72 with the complete draft finished ahead of schedule in 2003.73 In addition to establishing genome-wide ground truth for the locations, organization, and sequences of all human genes, the technical and informatic lessons learned in the course of this project made possible the first ambitious cancer mutation screens on the level of entire gene families74–77 and the majority of the protein-coding genome.62, 65–67 Nonetheless, genome sequencing projects using Sanger-based sequencing continued to be expensive and logistically challenging and thereby limited the application of this approach toward large-scale characterization of cancer genomes until the advent of massively parallel sequencing.
Gene expression microarrays: Cancer signatures and pathways
Microarrays such as those employed in aCGH studies were originally developed for high-throughput gene expression studies. Complementary DNA (cDNA formed from reverse transcription of RNA) and later oligonucleotide microarrays were implemented in high-throughput quantitation of mRNA transcripts. Advances in this technology paralleled improvements in automated Sanger sequencing in the 1990s and the turn of the millennium. They provided some of the first insights into gene signatures and pathways correlating to specific cancer subtypes. A parallel method, serial analysis of gene expression (SAGE) was developed to enable high-throughput digital quantitation of transcripts.78 While originally designed to assay the expression of individual genes, gene expression profiling methods were later adapted to profile and map global transcription on the exon and genome levels,79 as well as to profile the expression of mature human micro-RNAs (miRNAs).80 Gene expression patterns have been widely used to subclassify cancers into homogeneous entities not easily discernible using traditional histopathologic or cytogenetic techniques, for instance, in diffuse large B-cell lymphoma (DLBCL),81, 82 lung cancer,83 melanoma,84 and breast cancer.85, 86 In some instances, subgroups have been shown to represent distinct disease states responding differently to standard therapies. Gene expression profiles have also been widely mined to specifically identify sets of genes predictive of disease progression, response to therapy, or metastasis,87–89 as well as the presence of specific recurrent cytogenetic abnormalities,90–92 or gene mutations.93–96 Although advances in massively parallel sequencing have largely supplanted microarrays in sensitivity and robustness, if not cost and clinical readiness, gene expression profiling continues to play a significant role in cancer biology and medicine. In fact, commercially available clinical tests based on microarray signatures (OncoType DX and MammaPrint) are available for breast cancer profiling.
Massively parallel sequencing: Charting cancer genome landscapes
Explosive growth in our understanding of cancer genomics has been built on the backbone of the advances that emerged from the completion of the human genome at the turn of the twenty-first century. However, although these advances had driven the cost of one genome down from an estimated $1 billion to the $1 million mark at the time of the HGP’s completion, this extraordinary cost nonetheless held back large-scale genomic studies.97 Thus, the National Human Genome Research Institute initiated a $70 million sequencing technology program intended to drive the price down to the $1000 mark. The cost of a human genome sequence now lies slightly below $2000 and is steadily approaching $1000 (Figure 3b).
The quest for the $1000 genome required fundamental rethinking of the approach to DNA sequencing. Sanger sequencing, even in its most expert execution, faced insurmountable bottlenecks including (1) the need for molecular cloning steps and/or individual PCR reactions to generate sequencing templates and (2) every sequencing reaction can only accommodate a relatively small genomic region. Therefore, cloning/amplifying and sequencing many regions, even when run in parallel in multiwell plates, requires large amounts of materials, equipment, and analysis time. The paradigm shift in sequencing cost and throughput was brought about by the invention of approaches that allowed generation of sequencing templates without cloning or individual PCR reactions, immobilization of entire sequencing libraries in a single reaction chamber, and massively parallel sequencing of millions of templates in that chamber. The ability of this approach to collect data on clonal populations derived from single DNA molecules additionally provides great power for detection of rare variants from heterogeneous tumor samples. These rapid, accurate, and inexpensive approaches are also referred to as next-generation sequencing (NGS).
Key inventions in the 1990s and early 2000s such as sequencing-by-synthesis, pyrosequencing, colony sequencing, and emulsion PCR enabled development of the first NGS platforms.98–101 The first such platform was developed by Lynx Therapeutics (later acquired by Solexa) in 2000, although it did not achieve widespread adoption.102 454 Life Sciences offered the first commercially available system—the 454 FLX Pyrosequencer—in 2004,103 followed by the Solexa (now Illumina) Genome Analyzer in 2006, and the Supported Oligonucleotide Ligation and Detection (SOLiD) System from Applied Biosystems in 2007 that also incorporated polony sequencing.104, 105 The Solexa system was used in 2008 to sequence James Watson’s genome within a single laboratory in only 2 months.106 These platforms at last made the possibility of sequencing and assembling multiple cancer genomes a reality as evidenced by the characterization of genome-wide SVs in two lung cancer genomes.107 This study utilized a paired-end approach, demonstrating strikingly complex genomic rearrangements including those resulting in previously unreported fusion transcripts. Importantly, this study also demonstrated that DNA copy number can be estimated by the relative local abundance of genomic fragments sequenced, providing sensitivity comparable to aCGH platforms while at the same time providing DNA sequence information. Massively parallel sequencing methods have now dramatically decreased the cost of sequencing while greatly increasing throughput. Presently, sequencing of 500 billion bases (e.g., sequencing the equivalent of 156 human genomes or 156X average genome coverage) within a single 6-day commercial instrument run is standard, with throughput still steadily increasing.
Many massively parallel sequencing platforms exist today and are reviewed in more detail elsewhere.108 Table 1 provides an overview of some of the most common platforms in use today. They have been adapted to a variety of purposes ranging from diverse DNA sequence, copy number, and structural analyses to transcriptomics and epigenomics. DNA sequencing applications include de novo sequencing, such as the first such application in determining the previously unknown genomic sequence of Acinetobacter baumannii.109 In addition, resequencing of human genomes is performed in order to characterize normal variation or variation in specific disease states such as cancer. Resequencing is undertaken by mapping NGS sequencing reads to the reference genome and identifying differences between the sample and reference.65 Of course, it is not always necessary or informative to assess whole genomes, and thus, a number of target-enrichment strategies have been conceived that enable more cost-effective and focused analyses. The first of these—direct genomic selection—was based on hybridizing defined genomic regions to biotinylated BACs and capturing these sequences on streptavidin beads before sequencing.110 New enrichment methods such as hybridization-based capture (using oligonucleotides either in solution or on microarrays), highly multiplexed PCR, and microdroplet PCR are common.111–114 A relatively small fraction of the human genome has been functionally characterized, and therefore, it is often most expedient to assess the 1% of the genome containing coding regions (all exons—the exome). Thus, a number of commercially available human exome and other targeted capture kits (such as panels of common cancer genes) are available. This ability to target specific genes to enable rapid, accurate, affordable sequencing is aiding not only in biological discovery but also in clinical laboratory testing by enabling practical complex testing.
Table 1 Massively parallel sequencing platforms
Company | System platforma | Latest release | Template preparation | Sequencing chemistry | Maximum read length | Bases per run (Gb) | Accuracy | Run time | Application |
Illumina | Illumina HiSeq 4000 | 2015 | Emulsion PCR | Reversible terminator | 150 | 1500 | >99% | <1–3.5 days | WGS, E-S, RNA-S, T-S, C-S, MG |
Illumina HiSeq 3000 | 2015 | Emulsion PCR | Reversible terminator | 150 | 750 | >99% | <1–3.5 days | WGS, E-S, RNA-S, T-S, C-S, MG | |
Illumina HiSeq 2500 | 2014 | Solid-phase | Reversible terminator | 125 | 500 | > 99% | 6 days | WGS, E-S, RNA-S, T-S, C-S, MG, DN-S | |
Illumina HiSeq X (Ten/Five) | 2014 | Solid-phase | Reversible terminator | 150 | 1800 | >99% | <3 days | WGS | |
Illumina MiSeq | 2014 | Solid-phase | Reversible terminator | 300 | 15 | >99% | 4 h | E-S, RNA-S, T-S, C-S, DN-S | |
Illumina NextSeq 500 | 2014 | Solid-phase | Reversible terminator | 150 | 120 | >99% | 30 h | WGS, E-S, RNA-S, T-S, C-S, MG | |
Life Technologies | IonTorrent Personal Genome Machine | 2014 | Emulsion PCR | Proton detection | 400 | 2 | >99% | 2 h | Microbial WGS, E-S, RNA-S, T-S, C-S, MG |
IonTorrent Proton | 2014 | Emulsion PCR | Proton detection | 200 | 10 | >99% | 2–4 h | Microbial WGS, E-S, RNA-S, T-S, C-S, MG | |
SOLiD | 2014 | Emulsion PCR | Sequencing by ligation | 75 | 120 | 99.99% | 8 days | WGS, DN-S, E-S, RNA-S, T-S, C-S, MG | |
Pacific Biosciences | PacBio RS II | 2014 | Single molecule | Real-time sequencing | 15,000 | 3 | 85% | 20 min | DN-S, T-S |
Roche | 454 GS Junior | 2014 | Emulsion PCR | Pyro-sequencing | 400 | 0.07 | >99% | 10 h | Microbial DN-S, E-S |
454 GS FLX | 2014 | Emulsion PCR | Pyro-sequencing | 700 | 0.7 | 100% | 24 h | Microbial DN-S, E-S |
WGS, whole genome sequencing; E-S, exome sequencing; RNA-S, RNA sequencing; T-S, targeted sequencing; C-S, ChIP-sequencing; MG, MetaGenomics; and DN-S, de novo sequencing.
a Additional versions are available for some models.
Cancer genome landscapes
Landmark cancer genomic studies
Despite commonalities observed within and between individual types of cancer in historic genetic studies, the high-throughput genomic studies of the past decade have provided a much more complex view of cancer. The first large-scale cancer mutation screens implemented high-throughput Sanger sequencing of individual genes such as BRAF in large cohorts115, 116 or focused on gene families such as protein kinases in diverse cancer types.74–77, 117, 118 Notably, such screens have led to the identification of activating BRAF mutations in a large percentage of human melanomas115 and nevi,119 as well as ERBB2 mutations in human breast cancer.118 The first cancer gene panels (110 exons from 12 target genes) were completed in 140 acute myeloid leukemia (AML) samples in 2003.120 This study identified six previously described and, strikingly, seven previously unknown coding mutations in AML genomes. Soon after, results of Sanger-based exome sequencing of 13,023 genes in 11 breast and 11 colorectal cancers were published in 2006.121 This study revealed a surprising total of 189 frequently mutated genes with an average of 90 mutant genes per tumor, although only a subset was thought to be causally related to tumorigenesis or progression (an average of 11 per tumor). In a follow-up analysis of an additional 7000 genes in the same tumors, it was confirmed that a handful of commonly mutated genes (the “mountains” in the genomic landscape) existed among a large number of less frequently mutated “hills”. In 2008, four Sanger-based exome projects additionally characterized over 20,000 genes in glioblastoma multiforme62, 65, 66 and pancreatic cancer.67 Amidst an average of 63 and 47 mutations per pancreatic tumor or glioblastoma, these studies identified novel genes bearing low-frequency mutations such as IDH1 and PIK3R1 mutations in glioblastoma. Strikingly, they uncovered few unknown genes recurrently mutated at high rates. In these and future studies, it was found that the high-frequency mountains had already largely been identified using the low-throughput genetic technologies of the past decades. The growing challenge has since been interpretation and characterization of the many hills. However, these studies all show that multiple genes within core signaling pathways, rather than single genes, tend to be mutated in these cancers, suggesting an avenue for treating tumors based on aberrant pathways instead of individual aberrant genes.
Following these landmark studies, the number, breadth, and size of cancer genome projects rapidly accelerated in keeping with the advent of massively parallel sequencing. The first cancer whole genome sequences generated by massively parallel sequencing were those of two lung cancers published in 2008107 followed shortly by those of individual AML cases published in 2008 and 2009.122, 123 These studies focused on assessment of genome-wide SVs and missense mutations, respectively. The first analysis of paired primary and metastatic tumors enabling genome-wide assessment of evolution was conducted in breast cancer in 2009, identifying both shared and exclusive mutations in each sample.124 The first studies to systematically assess genome-wide mutations of all classes, published in 2010, were conducted in the melanoma cell line COLO-829125 and a small cell lung cancer cell line.126 These studies uncovered an extraordinary mutation burden (over 33,000 mutations in melanoma and 23,000 in lung) and mutational signatures supporting ultraviolet (UV) radiation etiology and tobacco smoke etiology in melanoma and lung, respectively. Discovery has since continued at a rapid pace and, as discussed later, large genomic analyses having been completed in all common cancers, many rare cancers, primary/recurrent/metastatic matched cohorts, and even cancers undergoing treatment.
Large-scale cancer genomic studies have also been powered by national and international consortia. The US National Cancer Institute (NCI) and National Human Genome Research Institute (NHGRI) launched a large collaborative project in 2005, the Cancer Genome Atlas (TCGA; see http://cancergenome.nih.gov/). Using multiple state-of-the-art genomic technologies to comprehensively identify all genomic alterations associated with multiple cancer types, the TCGA effort began with large pilot studies of glioblastoma multiforme65 ovarian cancer,127 and squamous cell lung cancer.128 TCGA studies have since included colorectal,129 breast,130 endometrial,131 AML,132 clear cell renal cell,133 expanded glioblastoma,134 urothelial bladder,135 lung adenocarcinoma,136 gastric,137 chromophobe renal cell,138 papillary thyroid,139 head and neck squamous cell,140 low-grade glioma,141 cutaneous melanoma,142 and a suite of 18 pan-cancer studies.143 Across 34 cancer types, data have been collected from more than 11,000 cases. Similarly ambitious studies have been undertaken by members of the International Cancer Genome Consortium (ICGC; see http://icgc.org)144 launched in 2010 to comprehensively catalog all genomic aberrations associated with at least 50 different cancers. This consortium currently encompasses 55 projects covering 33 cancer types with over 13,000 donors and has produced publications on bisulfite sequencing techniques,145, 146 DLBCL,147 Burkitt lymphoma,148 X chromosome hypermutation,149 signatures of mutational processes,150 and primary CNS lymphoma.151 The ultimate success of these comprehensive, large-scale projects will continue to rapidly advance our understanding of cancer genetics and genomics and will potentially revolutionize our approach to the diagnosis and treatment of cancer.
Cancer genomic data repositories and analysis tools
Data generated by the TCGA, ICGC, and other curated genomic repositories are publicly available. Systematic analysis of these data remains challenging, but improved web-based tools for both novices and experienced bioinformaticians are greatly democratizing cancer genomics research (Table 2). One of the oldest and most comprehensive resources for mining such data is the Catalogue of Somatic Mutations in Cancer (COSMIC; see http://www.sanger.ac.uk/genetics/CGP/cosmic) maintained by the Wellcome Trust Sanger Institute Cancer Genome Project.152 COSMIC is manually curated from publications and therefore entails a broad reach across targeted studies and genome projects (currently more than 20,000). The current build (August 2014) describes over 2 million mutations in more than 1 million samples encompassing 12,000 cancer genomes. In addition to SNVs, it details over 6 million noncoding mutations, 10,000 fusions, 61,000 SVs, 700,000 CNVs, and 60 million expression variants. These data are easily queried using key words or by gene or cancer type. COSMIC also includes more advanced tools that are in some cases linked to other databases. These include a detailed census of all human genes that have been causally linked to tumorigenesis (Cancer Gene Census; see http://www.sanger.ac.uk/genetics/CGP/Census)5, 153 as well as tools to assess mutation signatures and drug sensitivity. Another highly versatile data portal is the cBioPortal for Cancer Genomics maintained by Memorial Sloan-Kettering Cancer Institute.154 This portal primarily contains highly processed Cancer Cell Line Encyclopedia (CCLE) and TCGA data sets but also provides a powerful, intuitive web interface enabling queries by gene or cancer type as well as more advanced analysis through an application programming interface and also integration with R and MATLAB. A similar portal is available for ICGC data (see http://dcc.icgc.org).
Table 2 Cancer genomics databases
Name | Detail | Link |
canEvolve | Analysis of mRNA, miRNA, protein expression, and CNV data in 10,000 patient samples from TCGA, GEO, and Array Express | www.canevolve.org |
canSAR | Integrated analysis of biological, chemical, and pharmacological data from COSMIC, chEMBl, UniProt, BindingDB, Array Express, and STRING | https://cansar.icr.ac.uk |
cBioPortal | TCGA data portal; graphical visualization and analysis | http://www.cbioportal.org |
CGAP | Graphical summary and analysis of gene expression; integration of cytogenetic data | http://cgap.nci.nih.gov |
CGHub | Secure, comprehensive data repository; TCGA, CCLE, and TARGET projects | https://cghub.ucsc.edu |
CPRG | Integrative analysis tools for cancer research | http://www.broadinstitute.org/software/cprg |
COSMIC | Largest genomics data repository; manually curated publications and output from large sequencing studies | http://www.sanger.ac.uk/genetics/CGP/cosmic |
EBI Array Express | Annotated functional genomics data; data generated via microarray and high-throughput sequencing projects | http://www.ebi.ac.uk/microarray-as/ae |
EGA | Comprehensive data repository; restricted access; ICGC output; SNV and CNV data | https://www.ebi.ac.uk/ega |
GDAC | Pipelines for genomic analysis; user-friendly interface | http://gdac.broadinstitute.org |
GEO | Gene expression microarray and functional genomics data repository | http://www.ncbi.nlm.nih.gov/geo |
ICGC | Visualization tool; genomic, transcriptomic, and epigenomic characterization of 50 tumor types | http://dcc.icgc.org |
MethylCancer | Methylation database; interprets the correlation of methylation, gene expression, and cancer biology | http://methycancer.psych.ac.cn |
SomamiR | Archive of experimentally validated somatic mutations in noncoding RNA | http://compbio.uthsc.edu/SomamiR |
UCSC Cancer Genome Browser | Multipurpose data viewer incorporating multiple data types including clinical information | https://genome-cancer.soe.ucsc.edu |
While the above-mentioned portals enable advanced analysis with only modest bioinformatics training, additional repositories exist for advanced users to access various levels of raw data. The Cancer Genomics Hub (CGHub), hosted by the University of California Santa Cruz (UCSC), is a secure central repository for data generated through the NCI including TCGA, CCLE, and Therapeutically Applicable Research to Generate Effective Treatments (TARGET).155 The European Genome-phenome Archive (see http://ega.crg.eu) similarly collects and distributes sequencing and genotyping data, primarily cancer data emerging from the ICGC. Data repositories and analysis tools are also available for other data types. Cytogenetic aberrations and fusions observed in over 65,000 human tumors have been compiled and are maintained online (The Mitelman Database of Chromosome Aberrations in Cancer at the US National Cancer Institute [NCI] Cancer Genome Anatomy Project [CGAP] Web site: http://cgap.nci.nih.gov).28 Data from numerous gene expression microarray studies are warehoused by multiple entities, including the US National Center for Biotechnology Information (NCBI Gene Expression Omnibus; see http://www.ncbi.nlm.nih.gov/geo/) and the European Bioinformatics Institute (EBI Array Express; see http://www.ebi.ac.uk/microarray-as/ae/). Additional data repositories and web resources are outlined in Table 2.
The vast and varied landscapes of human cancers
Although few new mutational “mountains” have been discovered in the past 10 years, systematic study of cancer genomes has revealed many new “hills” as well as mutational processes and dysregulated pathways. The total mutation burden itself can reflect cancer etiology and have bearing on clinical course. This burden is much more variable than originally anticipated. In some pediatric cancers such as rhabdoid tumors156 and small cell carcinoma of the ovary,157 cases with only a single coding SNV have been discovered, while leukemias such as AML bear a median of nine coding SNVs (Figure 4). Conversely, cancers with mutagenic etiology such as bladder, lung, and melanoma bear high mutation rates with medians of 148, 217, and 254 coding SNVS, respectively, while tumors with mismatch repair defects can contain thousands of coding SNVs. Distinct mutational signatures can also reflect these external and internal mutagenic factors such as enrichment for C > T transition in dipyrimidines as seen in cancers from UV-damaged sites.150 Overall, coding SNVs are generally more common in cancers than coding CNVs and SVs, although broad variability in CNV and SV profiles also exists across cancer types. Thirty-seven percent of cancers experience whole-genome doublings with a quarter of most solid tumor genomes containing large-scale chromosomal variations and 10% containing focal CNVs. Over 140 genomic regions have been found to contain recurrent CNVs, only 38 of which contain known cancer genes.158–160 Most solid tumors also contain dozens of SVs although many of these occur in regions that do not contain genes and are likely passengers. Whole genome sequencing has also recently uncovered “chromothripsis” events, occurring in 2–3% of cancers when errors in chromosome segregation during mitosis lead to shattering of a single or a few chromosomes and massive rearrangement in a single cell generation.161–163
In addition to mutagenic etiology, variability in mutation burden is the likely result of mutation timing, patient age, and the number of divisions that have occurred in precursor cells with mutations accumulating in dividing cells over time, although a “punctuated evolution” model has also been proposed.164–166 Ultimately, only a few such mutations can “drive” cancer (driver mutations) insofar as they confer a selective growth advantage on the cells that carry them. Other mutations are “passengers” with no phenotypic effect on cell growth.167 Several methods have been developed to classify mutations as drivers or passengers based on the pattern and frequency with which they arise across cancers,4, 168, 169 but experimental validation is required to confirm the role of these mutations in cancer.
It has been estimated that more than 138 cancer driver genes exist4 with 571 genes causally implicated in cancer catalogued in the Cancer Gene Census (see http://cancer.sanger.ac.uk/census).5 The top 10 most commonly mutated cancer genes are given in Table 3, while cancer genes associated with the most lethal cancers in the United States are given in Table 4. While this vast array of mutations may seem overwhelmingly complex, particularly from the therapeutic perspective, there is nonetheless some hope. For example, dramatic clinical responses are seen with agents targeting mutant proteins in highly complex cancers such as mutant BRAF and ALK in melanoma and lung cancer.194, 195 Further, multiple cancer genes often converge on an individual pathway and may be functionally equivalent. Thus, specific pathways may collectively be subject to genomic aberrations at a frequency far higher than any of their individual gene components. Indeed, most driver genes converge on the hallmark pathways and can ultimately be reduced to those impacting cell differentiation, cell proliferation, cell death, and genomic maintenance.2 Thus, the integration of multiple types of genomic data will clearly be necessary to unravel the complexity of somatic cancer genetics.
Table 3 Ten most commonly mutated oncogenes and tumor suppressor genes (TSG) in the COSMIC database
Gene | Classificationa | SNVs in COSMIC | Homozygous deletions (TSGs) or amplifications (oncogenes) in COSMICb | Fusions in COSMIC |
TP53 | TSG | 28,253 | 17 | 1 |
NPM1 | TSG | 5224 | 1 | 319 |
CDKN2A | TSG | 4836 | 1046 | 0 |
APC | TSG | 4351 | 10 | 0 |
PTEN | TSG | 3296 | 275 | 0 |
VHL | TSG | 2443 | 2 | 0 |
TET2 | TSG | 2085 | 3 | 0 |
NOTCH1 | TSG | 2009 | 12 | 0 |
NF2 | TSG | 1076 | 12 | 0 |
CEBPA | TSG | 706 | 3 | 0 |
JAK2 | Oncogene | 42,963 | 109 | 35 |
BRAF | Oncogene | 41,637 | 105 | 625 |
KRAS | Oncogene | 32,021 | 231 | 1 |
EGFR | Oncogene | 24,189 | 646 | 0 |
FLT3 | Oncogene | 15,789 | 102 | 0 |
IDH1 | Oncogene | 8508 | 11 | 0 |
PIK3CA | Oncogene | 7983 | 425 | 0 |
KIT | Oncogene | 7413 | 106 | 0 |
CTNNB1 | Oncogene | 5372 | 12 | 22 |
NRAS | Oncogene | 4278 | 31 | 0 |
a Oncogene and TSG status were determined by the 20/20 rule4 using sample data in the COSMIC database (see http://www.sanger.ac.uk/genetics/CGP/cosmic/).
b Amplifications shown for samples in which average genome ploidy ≤ 2.7 and total gene copy number ≥ 5 or average genome ploidy > 2.7 and total gene copy number ≥ 9. Homozygous deletions shown for samples in which the average genome ploidy ≤ 2.7 and total gene copy number = 0 or average genome ploidy > 2.7 and total copy number < (average genome ploidy − 2.7).
Table 4 Commonly mutated cancer genes in the deadliest cancers
Cancera | Familial cancer genes | Common somatically mutated genes | References |
Breast | BRCA1, BRCA2, PTEN, TP53 | PIK3CA, TP53, MAP3K1, GATA3, MLL3, CDH1, PTEN, ERBB2, MAP2K4, CDKN2A, PTEN, RB1 | 62, 121, 130, 170–172 |
Colorectal | APC, MSH2, MLH1, MSH6, PMS2, MUTYH, LKB1, SMAD4, BMPR1A, PTEN, KLLN | APC, TP53, KRAS, PIK3CA, FBXW7, SMAD4, TCF7L2, NRAS, ARID1A, SOX9, FAM123B, ERBB2, IGF2, NAV2, TCF7L1 | 62, 121, 129, 173, 174 |
Liver | HFE, SLC25A13, ABCB11, FAH, HMBS, UROD | TP53, CTNNB1, AXIN1, RPS6KA3, RB1, FAM123A, CDKN2A, MYC, RSPO2, CCND1, FGF19, ARID1A, ARID1B, ARID2, MLL, MLL3 | 175–178 |
Lung | EGFR, BRAF, KRAS, TP53 | TP53, KRAS, STK11, EGFR, ALK, BRAF, AKT1, DDR2, HER2, MEK1, NRAS, PIK3CA, PTEN, RET, ROS1, EML4, NTRK1, FHIT, FRA3B, FGFR1, HER2 | 57, 75, 116, 118, 126, 128, 136, 179 |
Pancreatic | BRCA2, PALB2 | KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFRB2, ARID1A, SF3B1, ROBO2, KDM6A, PREX2 | 67, 180–183 |
Prostate | BRCA2, BRCA1, HOXB13 | EPHB2, ERG, TMPRSS2, PTEN, TP53, SPOP, FOXA1, MED12, NKX3-1 | 166, 184–188 |
Ovarian | STK11, BRCA1, BRCA2 | FBXW7, AKT2, ERBB2, TGFBR1, TGFBR2, BRAF, KRAS, PIK3CA, PTEN, ARID1A, BRCA1, MMP-1, BRCA1, BRCA2, MLPA, MAPH | 127, 189–192 |
a The seven cancers estimated to cause the greatest number of deaths in the United States in 2015.193
While it seems possible that the majority of cancer genes have now been discovered, new genes are occasionally implicated in cancer, although at low frequency across many cancer types or at high frequency in a rare and previously uncharacterized cancer. However, most cancers require 5–8 driver mutations based on epidemiological studies196 and only 3–6 drivers have been found in most cancers with notable exceptions (such as pediatric cases) in which three or fewer drivers have been discovered.4 Remaining knowledge gaps are likely due to both technical limitations of sequencing technology and study design as well as limitations in our growing understanding of mutations occurring in noncoding regions of DNA or through epigenomic mechanisms. Clearly, the catalogue of cancer-driving mutations is still incomplete and although the maps of cancer genome landscapes have been broadly charted, much work remains to characterize these vast landscapes in detail in order to guide clinical cancer management.
Clinical implications of cancer genome landscapes
Exhaustive information now available from cancer genomics projects has greatly improved our comprehension of the development, progression, and clinical behavior of human neoplasms. These data and the technology used to generate them are also impacting cancer screening, diagnostics, and treatment. As sequencing costs continue to decline, exome- and genome-wide population screening becomes increasingly viable and it is likely that each patient’s personal genome sequence will comprise a key component of their medical record. Such screening may reveal de novo mutations that would otherwise go undiagnosed until disease presentation and the power of this approach will grow exponentially as we continue to connect genetic variants to disease phenotypes. The NIH has acknowledged the importance of this new healthcare approach through the establishment of the Precision Medicine Initiative.197 Targeted Sanger sequencing of small gene panels is already commonly used in familial cancer screening. In cases such as germline BRCA1/2 or MLH1/MSH2 mutations, which dramatically increase lifetime risk for breast/ovarian or colorectal cancer, genomic testing can guide clinical decisions about surveillance and prophylactic surgery. Now, NGS-based sequencing of hundreds of genes can be completed at a similar cost and thereby enable detection of lower penetrance rare variants not detectable with smaller panels. For example, in a study of 141 BRCA1/2-negative patients with a family or personal history of breast cancer, a panel of 40 additional genes identified 16 patients with pathogenic variants in nine genes other than BRCA1/2 such as TP53 and PTEN.198 These findings have shaped new guidelines from the National Comprehensive Cancer Network recommending targeted panels for patients’ breast or ovarian cancer family history who are negative for common hereditary mutations.199 Targeted NGS panels have also shown greater sensitivity and specificity than traditional diagnostic tools in genetic testing of familial cancer genes such as BRCA1/2, TP53, and APC in additional studies.170, 200–202 NGS panels may also facilitate differential diagnosis and patient stratification given that the diagnostic yield of NGS panels, exomes, or even whole genomes has been shown in multiple studies to outperform small Sanger-based panels.203, 204
Cancer genetics discoveries have driven targeted drug development since the first report of a recurrent chromosomal abnormality—the BCR-ABL translocation in CML. The selective tyrosine kinase inhibitor imatinib, designed to target the BCR-ABL chimeric fusion gene and its constitutively active tyrosine kinase protein product in CML, was the first successful targeted therapy.205–207 The paradigm of targeting cancer-specific mutations to vastly improve the therapeutic index of chemotherapy has now been repeated with dramatic success. Examples include expansion of imatinib to PDGFR– and KIT-mutated gastrointestinal stromal tumors (GIST) and hypereosinophilic syndromes208–210; dasatinib and nilotinib in primary and imatinib-resistant CML211–213; trastuzumab, a neutralizing antibody targeted to the Her2/ErbB2 tyrosine kinase receptor whose encoding gene ERBB2 is amplified and overexpressed in 25–30% of breast carcinomas214, 215; sunitinib in renal cell carcinoma, GIST, and pancreatic neuroendocrine tumors216–218; gefitinib and erlotinib with striking efficacy in the 5–10% of lung adenocarcinoma patients with European ancestry and 25–30% of Japanese patients who harbor activating mutations in EGFR219, 220; crizotinib in ALK-rearranged lung cancer221, 222; vismodegib in basal-cell carcinoma bearing hedgehog pathway mutations223; and vemurafenib in BRAF-mutant melanoma.224 Cancer gene mutations have also been associated with innate or acquired resistance to these targeted therapies. For example, amplification of the BCR-ABL gene is common in CML patients resistant to imatinib.205, 225 Preclinical studies and clinical trials are ongoing for numerous novel therapeutic agents directed against genomic targets in cancer in addition to agents capable of circumventing or treating drug-resistant disease.
The fundamental goal of oncology practice is recommendation of the most effective treatment supported by scientific evidence for an individual patient. Genomic data would be expected to provide just such data. However, although the HGP was completed in 2003, genomic medicine’s entrance into the care stream has been painstakingly slow. In 2010, results from one of the first pilot studies using personalized molecular profiling to guide treatment selection in refractory metastatic cancer were published. Despite considerable challenges including the absence of a precedent for the novel trial design, overall patient attrition, and a diverse pharmacopeia, it was found that in 27% of 68 patients, treatment selection guided by molecular profiling (gene expression microarray analysis) resulted in a longer progression-free survival than that during the most recent regimen on which the patient had progressed.226 The next major step in bringing genomic tools to the patient was a novel study combining whole genome sequencing and comprehensive RNA sequencing to individualize treatment of metastatic triple-negative breast cancer (TNBC) patients.227 TNBC is characterized by the absence of expression of estrogen receptor (ER), progesterone receptor (PR), and HER-2. This study identified somatic mutations in the RAS/RAF/MEK and PI3K/AKT/mTOR signaling pathways that have led to clinical trials combining agents targeting MEK and mTOR genes with encouraging results.
New trial designs, such as those above and including adaptive, basket, and umbrella trials, are aimed at extending individualized therapy to all cancer patients. Multiple large academic observational and interventional trials have now appeared in publication228–233 or are ongoing (see http://clinicaltrials.gov). These include adaptive trials such as I-SPY 2 (Investigation of Serial Studies to Predict your therapeutic response with imaging and molecular analysis) in which treatment arms are modified in the course of the study based on patient response.234 Basket trials such as the NCI MATCH trial assign patients with various advanced cancers to treatment baskets based on mutational profiles, then treat with agents matched to their tumor’s mutations.231 Umbrella trials are also ongoing in which multiple drugs are studied in a single disease. For example, the ongoing Stand Up To Cancer and the Melanoma Research Alliance Dream Team Clinical trial is assessing molecularly guided therapy in a single-histology patient population that lacks a universal therapeutic target—non-V600 BRAF metastatic melanoma. In a nontreatment pilot study (now a randomized treatment study), a combination of whole genome and whole transcriptome sequencing was used to identify molecular aberrations matched to an appropriate clinical treatment from a pharmacopeia comprised standard of care, FDA-approved and investigational agents.230 The challenge of returning actionable information during a clinically relevant time period is considerable. This includes patient consent, tumor biopsy, quality DNA/RNA extraction, DNA- and RNA sequencing, data integration, report generation, and tumor board review to formulate a treatment plan (Figure 5). Streamlining this process will be critical to expansion of these tools beyond major academic research centers.
Large-scale genomic characterization of tumors in the context of clinical trials has allowed identification of predictive biomarkers—recurrent genomic alterations that identify patients most likely to benefit from a particular drug. These provide tremendous clinical value by allowing patient stratification to the most relevant treatment option and have been rapidly commercialized. A broadening array of sequencing tests that may aid in diagnosis and treatment decisions is available to oncologists and their patients. Over 100 academic centers and 50 commercial laboratories have made such tests available. Examples of such commercial tests are provided in Table 5. Although cost is still prohibitive for widespread use, expanded panel-, exome-, or whole genome-based sequencing will continue to transform screening and diagnostics in coming years. However, routine clinical use will not only require further cost reduction but also more comprehensive data supporting genomics-correlated clinical outcomes to illustrate the benefit of such testing and off-label drug use. Accordingly, commercial sequencing laboratories will need to acquire FDA and Medicare approval. These data will ensure that insurance companies will cover these tests. Promising signs already include the addition of procedural terminology codes and a preliminary list of 21 sequencing tests included in the fee schedule of the Centers for Medicare and Medicaid Services. Finally, germline genomic analysis (whether part of hereditary cancer testing or conducted in tumor-matched normal tissue) entails the high likelihood of incidental secondary findings of putatively pathogenic genomic variants with unclear disease association and therefore presents a challenge. Widespread controversy exists over the balance between patient autonomy and the perception that patients knowledge of this information would result in physical or mental harm.235
Table 5 Commercial cancer genome tests
Provider | Product | Descriptiona |
Ambry Genetics | Exome Next | Full exome + mitochondrial genome, SNVs, CNVs |
Ambry Genetics | BRCA1 and BRCA2 gene sequencing | SNVs, CNVs, BRCA1/2 SVs |
Arup Laboratories | Gastrointestinal hereditary cancer panel | 15 genes + intron/exon junctions, SNVs, CNVs |
Ashion Analytics | GEM Cancer Panel | 562 genes, tumor/germline, SNVs, CNVs |
GEM GW | Whole exome, tumor/germline, SNVs, CNVs, SVs | |
RNA sequencing | RNA, tumor/germline. Gene expression, fusions, alternative splicing, SNVs | |
Cancer Genetics Incorporated (CGI) | FOCUS:CLL | Seven actionable CLL targets |
FOCUS:Myeloid | 54 genes, prognostic and therapeutic assessment | |
Caris | MI Profile | 47 genes, SNVs |
Foundation Medicine | Foundation One | 236 genes, 47 introns from 19 genes associated with SV. SNVs, CNVs, SVs |
Foundation One Heme | 405 genes, 31 introns associated with SV, and RNA-seq of 265 genes. SNVs, CNVs, SVs, fusions, and gene expression in hematologic tumors | |
GeneDx | XomeDx | Full exome, SNVs |
XomeDx Plus | Full exome + mitochondrial genome, SNVs | |
XomeDx Slice | Targeted exome, SNVs | |
Comprehensive cancer panel | 29 genes, SNVs, CNVs | |
GPS@WUSTL | Comprehensive Cancer Gene Set Analysis | Sequencing of 42 genes. Mutation analysis |
Agendia | Mammaprint | Microarray gene expression analysis of 70 genes. Predicts chemotherapy benefit and risk of recurrence of breast cancer |
Myriad Genetics | BRACAnalysis | Sanger, BRCA1/2, breast and ovarian cancer risk |
COLARIS | Sanger, six genes, hereditary colorectal cancer | |
COLARIS AP | Sanger, APC and MYH genes, adenomatous polyposis colon cancer risk | |
MELARIS | Sanger, CDKN2A, hereditary melanoma | |
PANEXIA | Sanger, PALB2 + BRCA2 genes, pancreatic cancer risk | |
Myriad myRisk Hereditary Cancer | Sanger, 25 genes, breast, ovarian, gastric, colorectal, pancreatic, melanoma, prostate, and endometrial cancer risk | |
NeoGenomics Laboratories | NeoTYPE Cancer Exome Profile | 4813 genes, identify SNVs |
NeoTYPE Profiles | Custom gene panels, SNVs | |
OncotypeDX | OncotypeDX Breast Cancer Assay | RT-PCR, 21 genes, gene expression, chemotherapy benefit, and likelihood of recurrence for invasive breast cancer |
OncotypeDX Colon Cancer Assay | RT-PCR, 18 genes, gene expression, recurrence risk in stage II and stage III colon cancer | |
OncotypeDX Prostate Cancer Assay | RT-PCR <17 genes, gene expression, treatment selection for prostate cancer | |
Paradigm | PCDx | Number of genes not specified, DNA and RNA analysis, SNVs, CNVs, SVs, gene expression, fusions, alternative splicing |
Personal Genome Diagnostics (PGDX) | Whole Genome Analysis | Whole genome, tumor or ctDNA and germline, SNVs, CNVs, SVs |
CancerXome | Full exome, SNVs, CNVs, SVs | |
Cancer Select (R88, R203) | 88 or 203 genes, SNVs, CNVs, SVs | |
ImmunoSelect-R | Exome, ineoantigen prediction to asses utility of immunotherapy | |
PlasmaSelect-R | 58 genes in ctDNA, SNVs | |
Quest Diagnostics | OncoVantage | 34 genes, SNVs |
a These descriptions are derived from company web sites and may not be complete. They also do not constitute endorsement of any particular product.
Unlike relative differences in mRNA or protein expression between tumor and normal tissue, somatic mutations in cancer genes are exclusive and specific markers of cancer cells. This fact has been recently leveraged to use cell-free tumor-specific DNA in plasma (ctDNA) as an accurate circulating biomarker of tumor burden. The value of ctDNA for monitoring tumor burden, treatment response, and recurrence was first recognized in colorectal cancer.236 This study identified somatic mutations in patient tumors in recurrently mutated colorectal cancer genes (TP53, PIK3CA, APC, and KRAS) and retrospectively analyzed plasma samples using highly sensitive assays specifically design for each patient. The results showed ctDNA levels reflect changes in tumor burden during treatment when compared with imaging and carcinoembryonic antigen. The challenges in implementation of ctDNA as a routine biomarker for tumor monitoring include the need to identify somatic alterations in each patient, design of patient-specific molecular assays, and low abundance of mutations in circulation, particularly in early stages of disease. Another recent study used NGS of tumors and digital PCR and found ctDNA levels during treatment reflected disease progression in metastatic breast cancer.237 In these results, ctDNA was found to be more responsive and applicable to the largest fraction of patients when compared with enumeration of circulating tumor cells or CA-125, a glycoprotein biomarker of breast cancer. Similar results supporting the role of serial analysis of ctDNA for monitoring tumor burden have been published for lung cancer,238, 239 melanoma,240 and osteosarcoma.241
Advances in molecular methods including the use of high-depth noise-corrected targeted NGS assays are enabling the study of patients with localized, potentially resectable cancers.242, 243 Recent work in localized breast cancer shows ctDNA can predict postoperative recurrence of cancer a median of 8 months before tumors become detectable on imaging.244 This study and additional papers describing ctDNA in localized cancers demonstrate novel opportunities for optimizing treatment of cancer with curative intent by individualizing therapeutic strategies.245
Prospective trials are needed to establish benefit of guiding clinical cancer management using ctDNA as a biomarker. Nevertheless, observational studies reported so far show superior performance of ctDNA to monitor tumor burden compared to a number of circulating tumor cell or glycoprotein biomarkers.246 With some variations between cancer types, ctDNA has wide applicability across patients as shown in a recent comprehensive survey of 640 patients.246 The same study found ctDNA was detectable in 55% of patients with potentially curable disease. These results suggest that improved molecular assays may enable detection of ctDNA in presymptomatic individuals with cancer, potentially leading to a screening test with higher specificity than traditional methods.247–250
Cancer evolution
The ability to sequence cancer genomes has shed light on the process of molecular evolution and its importance for understanding cancer biology and treatment. Although not new, historical consideration of this concept has largely been confined to a process of linear clonal evolution driving the development of an increasingly malignant cancer phenotype.1 However, it is increasingly clear that cancer evolution generates tremendous heterogeneity by continuously generating clonal diversity. As a result, multiple related, but distinct clonal lineages may coexist within a single patient.251 These lineages differ in their ability to progress and metastasize252 as well as to respond to or resist therapy.253 The implications of this complex, branched evolution model will be critically important for understanding cancer biology and developing better treatment approaches.
The cornerstone of cancer evolution lies in the accumulation of genomic mutations, ultimately impacting cellular phenotype. Although the majority of acquired cancer mutations are likely to be passengers, a small proportion will impact critical cellular pathways and processes, thereby acting as disease drivers. However, the exact molecular mechanisms leading to increased mutagenesis in cancer are incompletely understood. Recent analysis, carried out on thousands of cancer genomes, identified at least 20 distinct mutational signatures pointing to diverse processes driving mutagenesis in various cancers, with most cancers showing evidence of more than one such process at play.150 Although the factors leading to several identified mutational signatures such as UV, smoking, or exposure to chemotherapy agents are well known, many are not associated with a known causative processes. Elucidating such novel mechanisms of mutagenesis will be important for facilitating development of better prevention and treatment strategies.
The accumulation of mutations clearly provides the raw material (population diversity) for cancer evolution. However, this accumulation often contributes to the process of clonal evolution within temporal and spatial constrains. The order in which mutations are acquired can dramatically impact cell fate. For example, a loss of BRCA1 or BRCA2 leads to cell cycle arrest in the context of normal TP53 but not in the setting of TP53 loss.254 Consistent with this observation, a loss of the second copy of BRCA1 in breast cancer typically occurs after the loss of TP53. The order of mutational events may also depend on the cell type and state of differentiation. This is seen with KRAS or NRAS mutations, which occur early in colon cancer development, whereas similar NRAS mutations are seen mainly at a late stage in myelodysplastic syndrome.255, 256 Spatial constraints of clonal diversification, on the other hand, have been noted in clear cell renal cell carcinoma and non-small-cell lung cancer,257, 258 where divergent clones develop in specific geographic locations of the tumor. In fact, such spatial separation could lead to “parallel evolution” characterized by distinct mutations arising independently in individual clones, but targeting the same gene or the same pathway, as observed in a variety of cancers.259 An important practical implication of such parallel evolution is that an effective targeted therapy would have to be capable of simultaneously inhibiting independent and molecularly distinct clones within a single patient.
Although clonal diversification is crucial, recent data indicates that simple outgrowth of the most aggressive clones does not fully account for the extent of clonal evolution in cancer. Indeed, similar to the principles of evolutionary biology in general, where no species evolves in isolation, cancers seem to evolve within an ecosystem of interactions not only with the host but also with other coexisting tumor clones. For example, in an experimental model of glioblastoma, a minor clone harboring an EGFR mutation appears to support the major EGFR-wt clone through paracrine mechanisms.260 Similarly, within heterogeneous tumors in zebrafish melanoma xenograft models, a phenomenon of “cooperative invasion” is seen. Here, the presence of an invasive clone enables invasion by another, otherwise poorly invasive clone.261 Such clonal cooperation does not only apply to tumor survival and progression but also extends into the realm of therapeutic resistance. In a model of colon cancer, for instance, the presence of an EGFR inhibitor-resistant, KRAS-mutated minor clone appears to support the survival of a drug-sensitive, KRAS-wt clone through paracrine mechanisms involving transforming growth factor α and amphiregulin.253
On the basis of the above-mentioned temporal and spatial constraints as well as clonal interactions, several models of clonal evolution have been proposed.259 These range from (1) a simple model of linear evolution, where successive accumulation of mutations leads to increasingly aggressive clones that outcompete their predecessors, to (2) models of allopatric speciation, where subpopulations evolve in geographically distinct areas of the tumor, to the models incorporating clonal interactions including (3) a model of clonal competition with distinct clones competing for growth advantages in the form of antagonistic evolution and (4) a model of clonal cooperation with a symbiotic relationship between individual clones (Figure 6).
Understanding of the molecular spectrum and mechanisms of cancer evolution impacts our ability to treat cancer. Molecular heterogeneity and clonal evolution limit the benefits derived from targeted treatments. The repertoire of clones with distinct molecular signatures within each patient’s tumor(s) allows for treatment-driven selection, manifesting as acquired therapeutic resistance even in patients who initially respond to treatment. Pre-existing somatic mutations (even passenger events) or new mutations acquired during therapy can drive therapeutic resistance. While improvements in experimental models have enhanced our ability to understand and perturb complex patterns of clonal evolution in the laboratory, development of methods such as ctDNA analysis in plasma can allow monitoring of clonal evolution in the clinic.262 In patients who progress on targeted therapies, multiple studies have shown that somatic mutations driving resistance are detectable in plasma. For instance, lung cancer patients treated with erlotinib or gefitinib (an EGFR inhibitor) most often develop acquired treatment resistance owing to a second mutation in EGFR p.T790M that affects drug binding. This resistance-driving mutation is detectable in ctDNA up to 16 weeks before disease progression is evident on routine imaging.239 Similar data has been reported for patients with colorectal cancer who progress on cetuximab (an anti-EGFR antibody) where KRAS mutations are evident weeks before disease progression.263 A recent study of colorectal cancer patients confirmed that if targeted therapy is discontinued upon disease progression (a drug holiday) and replaced with chemotherapy, circulating levels of the KRAS mutation in ctDNA recede (suggesting recession of KRAS-mutation bearing tumor subclone).264 While these studies used deep targeted sequencing strategies to investigate hypothetical genes driving treatment resistance, hypothesis-free genome-wide analysis to discover novel drivers of treatment resistance has also been described.265 These proof of principle results describing ctDNA analyses to track clonal evolution warrant investigation of ctDNA-based, personalized adaptive, sequential, or combination treatment strategies.
Cancer genomics and evolution in clinical practice: A case study in melanoma
Background
Malignant melanoma is the sixth most common cancer in the United States and is one of few that is increasing in incidence.266 Although detection and surgical resection of early-stage disease can be curative, a tumor thickness of merely 4 mm portends metastatic disease and an abysmal prognosis. Dramatic recent progress in melanoma precision medicine powerfully illustrates the potential for genomics to impact clinical cancer management. In this section, we will focus on this progress and its impact on melanoma treatment while clinical features will be discussed in detail in Chapter 111. Melanoma is a disease of transformed melanocytes and is clinically subdivided by anatomic location, histopathology, sun exposure, and progression (using tumor-node-metastasis staging).267 By site, cutaneous melanoma most commonly originates on sun-exposed skin and is further categorized based on type of sun exposure of the primary site including areas with chronic sun exposure (e.g., face and forearms) and those with intermittent sun exposure (e.g., back). Less common subtypes include acral melanoma, occurring on non-hair-bearing areas of skin on hands and feet; ocular melanoma, which is predominantly uveal with a smaller proportion of conjunctival melanomas; mucosal (nasopharyngeal, bowel, anorectal, or vulvovaginal) and primary CNS melanoma. This categorization has implications for prognosis and treatment, although it is sometimes unable to predict outcome or treatment response. Further, until the discoveries of the past decade, metastatic melanoma was uniformly treated with cytotoxic chemotherapy to limited effect.268
The genetic basis of melanoma
Where variation in clinical course by melanoma histopathologic subtype is confounding, genomic characterization is now helping to clarify etiology, biology, and optimal treatment. Just as breast cancer can best be characterized according to ER, PR, and Her2 status, genotype should now take a prominent role in the clinical approach to melanoma owing to its implications for treatment. Although melanomagenesis is still poorly understood, melanoma is known to occur through the stepwise accumulation in melanocytes of mutations in cancer genes (Figure 7).269 Most melanomas are sporadic with 5–10% of cases due to familial predisposition predominantly driven by germline variants in the first familial melanoma gene identified—the tumor suppressor CDKN2A (40% of familial cases).270–272 Additional rare variants in other cancer genes have since been found in familial cases (CDK4, BRCA2, BAP1, TERT promoter, MITF, and POT1).273–278 Beyond heredity, UV radiation is recognized as the greatest risk factor for cutaneous melanomas, although not for sun-shielded mucosal, acral, or ocular disease.279 In keeping with UV etiology, additional melanoma risk factors include fair skin, increased freckling and benign nevi, MC1R germline variants, and tanning.280–283
Early genetic and functional characterization of melanoma determined that it is driven both by mutations that inactivate cell cycle gatekeepers and by those that activate cell proliferation pathways. Building on discovery of familial mutations in CDKN2A, a locus that regulates the p53 and RB proteins, these tumor suppressors were found to be frequently disrupted by disparate mutations that enable unconstrained cell growth such as CDKN2A, CDK4, RB1, TP53, and MDM2 mutations.284–288 Meanwhile frequent mutations in the proliferation pathway genes BRAF, KIT, NRAS, and PTEN were identified and found to cooperate with inactivation of cell cycle tumor suppressors to promote malignancy.115, 119, 289–302 Cytogenetic and molecular studies further pointed to as-yet unidentified genes in recurrently altered regions of chromosomes 1p, 6q, 7p, and 11q.288, 303–311 As with many early genetic analyses of human cancers, these studies were typically performed in disparate small cohorts or model systems, focused on only one or several genes in a single clinical subtype, and often produced inconclusive results.
Melanoma genomic landscapes
The striking genetic complexity between subtypes and within individual tumor genomes revealed by the above analyses was soon confirmed at high resolution through a succession of genomic studies. Genome-wide aCGH profiling and targeted sequencing of BRAF and NRAS in 126 tumors first revealed distinct patterns of somatic mutation based on subtype. BRAF and NRAS mutations were found to be enriched in intermittently sun-exposed and cutaneous melanoma but rare in chronically sun-exposed, mucosal, and acral melanoma. Patterns of mutation also varied by subtype with CNVs enriched in mucosal melanoma. Subsequent analyses identified patterns of CNV correlating with poor prognosis49, 289 as well as subtype-specific KIT mutations occurring in 10–20% of mucosal and acral as well as at low frequency in cutaneous melanoma in sun-damaged sites.48, 312 The existence of molecular subsets of disease was also borne out by gene expression microarray studies84, 313, 314 and such molecular subsetting of clinical melanoma categories seeded hope that these types could be tied to clinical outcome or exploited for therapeutic targeting.315
A spate of massively parallel sequencing studies beginning in 2011 spurred explosive growth in the genomic understanding of cutaneous melanoma, recently culminating in two large multiplatform studies that jointly span over 500 cases.142, 275, 315–325 These studies not only corroborate prior identification of important melanoma drivers (BRAF, NRAS, NF1, TP53, CDKN2A, and RB1) but also point to new genes not previously linked to the disease including RAC1, PREX2, PPP6C, ARID2, TACC1, GRM3, MAP3K4, MAP3K9, IDH1, MRPS31, RPS27, and the TERT promoter. Melanoma is now predominantly categorized according to BRAF, RAS, and NF1 status, all of which, when mutated, constitutively activate the mitogen-activated protein kinase (MAPK) pathway, a central regulator of cell proliferation (Figure 8).142 These mutations tend to be mutually exclusive and account for all but 10% of cases, suggesting four genomic subtypes: BRAF, NRAS, NF1, and triple wild type (TWT). Genome-wide sequencing has also enabled characterization of mutation signatures in melanoma, shedding further light on the role of UV radiation in various subtypes. Melanomas on sun-exposed skin are characterized by UV signature mutations—cytosine to thymine (C > T) substitutions occurring in dipyrimidines as more than 60% of the total mutation burden.142, 318, 326 Curiously, most common mutations in BRAF and NRAS are not C > T substitutions despite their driver roles in the majority of sun-exposed melanomas.317 While it is often noted that melanoma has the highest average mutational burden of any cancer type, the mutation frequency can range from 0.1–100 mutations per megabase.169 This marked heterogeneity tracks with the clinical heterogeneity of the disease, with the highest mutational rate in melanoma on sun-exposed skin at >100/Mb, intermediate rates in mucosal melanoma at 2–5/Mb, and the lowest rates in uveal and CNS melanoma at <0.1/Mb.316, 327, 328
Cutaneous BRAF melanomas account for over 50% of tumors and predominantly bear the hotspot missense mutation V600E, although other activating BRAF mutations, amplifications, and even fusions do occur. They are characterized by incidence in younger patients, UV signature in 91%, relatively fewer CNVs compared to TWT, amplifications of BRAF, MITF, PD-1, and PD-L1, enrichment for TP53 mutations, and more frequent PTEN deletions. RAS melanomas are the second most common (30%), typically NRAS Q61R/K/L and with occasional mutations in K– or H-RAS. These mutations are nearly always mutually exclusive with BRAF mutations although molecular features of this category otherwise resemble the BRAF subtype. NF1 melanomas comprise 15% of cases, the majority of which contain inactivating NF1 mutation. This subtype bears the highest SNV burden, occurs in older patients, and contains a UV signature. It can co-occur with BRAF and NRAS mutations and tends to show a pattern of co-occurrence with other mutated “RASopathy” genes such as RASA2.319 Similar to BRAF and RAS mutations, most NF1 mutations drive RAS and MAPK pathway activation. Finally, TWT melanoma is a heterogeneous subtype that lacks recurrent driver mutations although KIT, CTNNB1, GNAQ, GNA11, and EZH2 mutations have been found at low frequency. Notable characteristics of this subtype include higher numbers of SVs, fusions and CNVs, only 30% containing UV signatures, amplifications containing KIT, PDGFRA, KDR, CDK4, CCND1, MDM2, and TERT, and only few (7%) with TERT promoter mutations.142 It is important to highlight that despite diversity across genomic subtypes, the vast majority of melanoma driver mutations converge on the MAPK and PI3K pathways, such that approximately 91% of cutaneous melanomas depend on aberrant activation of this single pathway.329
Despite exhaustive characterization of cutaneous melanoma, genomic landscapes of less-common clinical subtypes remain poorly characterized. Several small studies of uveal, acral, desmoplastic, and mucosal melanoma have identified recurrent mutations, but much still remains unknown. Uveal melanoma, the most common ocular tumor, has been shown in studies encompassing nearly 80 cases to have one of the lowest melanoma mutation burdens and to be characterized by mutations in GNAQ (50%) as well as frequent BAP1, EIF1AX, and SF3B1 mutations in tumors with high metastatic propensity.324, 325, 330–332 Acral melanoma occurs on non-hair-bearing skin of the palms, soles, and nail beds, is the most common melanoma subtype in non-Caucasians, and has a poorer prognosis than cutaneous melanoma on hair-bearing skin owing to later diagnosis.333 Although known for nearly a decade to be largely BRAF wild type, it is rarely included in larger cutaneous genomics studies and limited focused data sets are available. In total, 38 acral melanomas have been characterized across six studies that have identified the following characteristics: about 30% KIT-mutant, 20% BRAF-mutant, 10% NRAS-mutant, low SNV burden, high CNV and SV burden, and absence of UV signature.49, 316–318, 332, 334, 335 Data on mucosal melanoma, another rare and aggressive subtype that is predominantly BRAF wild type, is more sparse with only 10 cases comprehensively assessed in a single study.327 This study largely confirmed prior targeted and cytogenomic analyses, showing a low SNV frequency, high CNV and SV burden, absence of UV signature, and recurrent mutations in KIT, PTEN, and other putative cancer genes in two or fewer samples. Desmoplastic melanoma, a rare and invasive fibrous form of dermal melanoma has also recently been characterized (62 cases) and preliminary results reveal a notably high SNV burden (one of the highest to date at 62 mutations/Mb), UV signatures, absence of BRAF or NRAS mutations, and a diverse array of mutations that activate the MAPK and PI3K pathways.336 Notable similarities and differences between these rare tumor types support their potentially distinct etiology and may help guide therapeutic management.
Clinical implications of melanoma genomics
Although no definitive correlation with outcome has been identified based on the above-mentioned genomic subtyping or other molecular classifications in melanoma, these genomic subtypes have direct bearing on treatment of advanced metastatic melanoma (Table 6). The BRAF subtype carries the greatest array of clinical options. Mutant BRAF has now proven to be an effective therapeutic target, with BRAFV600E/K and K601 targetable by inhibitors of BRAF or MEK. As of 2015, three such small molecule inhibitors have demonstrated an overall survival (OS) benefit in randomized clinical trials. Vemurafenib and dabrafenib are BRAF inhibitors (BRAFi) with demonstrated response rates greater than 90% in BRAF melanoma.224, 337 Trametinib is a MEK inhibitor (MEKi) which, while less dramatically effective than BRAFi, nevertheless triples progression-free survival (PFS) versus chemotherapy.338 The best results to date from trials of targeted agents in BRAF melanoma have come from combined BRAFi and MEKi with dabrafenib and trametinib, a combination with marked improvement in PFS and OS with an overall reduction in toxicity.339, 340
Table 6 Therapeutic options in melanoma by genomic subtype
Genotype | Approved treatments | Selected phase I or II data |
BRAF mutant | Targeted—vemurafenib, dabrafenib, and trametinib Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
Cobimetinib |
RAS mutant | Targeted therapy—none Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
Trametinib, MEK162 |
NF1 mutant | Targeted therapy—none Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
None |
Triple wild type (TWT)b | Targeted therapy—none Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
None |
TWT — KIT mutant or overexpressed | Targeted therapy—none Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
Imatinib, nilotinib |
TWT — GNAQ/GNA11 mutant | Targeted therapy—none Immunotherapya—ipilimumab, nivolumab, and pembrolizumab |
Selumetinib, trametinib, MEK 162 |
a Phase III studies for immunotherapy have not included genotyping beyond BRAF. FDA approval for these does not specify genotype and thus is inclusive of all genotypes.
b No trials to date have specifically identified TWT patients, although KIT and GNAQ/GNA11 cases do occur in this category.
Development of targeted agents for RAS, NF1, TWT, and rare histological melanoma subtypes has lagged behind that for BRAF melanoma, with no completed phase III studies. Nevertheless, early phase studies have shown sufficient efficacy to encourage ongoing efforts at developing agents for these patients. RAS melanoma has been treated with a variety of MEK inhibitors in clinical trials including trametinib,341 MEK162,342 and selumetinib.343 Of these, only MEK162 has shown a significant response rate (around 20%), with the others showing a best response of stable disease. No trials have been conducted specifically in NF1 melanoma, in part due to the emerging understanding of this subtype and overlap with RAF and RAS subtypes. Conflicting preclinical reports have suggested NF1 mutation and loss confers sensitivity or resistance to MEKi and that this relationship is also dependent on RAF/RAS status.319, 344–346 KIT-mutant melanoma (which is common in TWT cutaneous as well as acral and mucosal subtypes) has been shown in two randomized phase II studies to have an overall response rate to imatinib of 19–29%, with the best responses in cases with canonical KIT SNVs rather than amplifications.286, 347 The most robust data in GNAQ/GNA11-mutant disease comes from a randomized phase II study with the MEKi selumetinib.348 Selumetinib induced tumor regression in 49% of patients and led to a nonstatistically significant improvement in OS to 11.8 versus 9.1 months. Additional agents in development that are relevant to many subtypes and/or may motivate additional subtyping include CDK, MDM2, and PI3K/Akt/mTOR inhibitors in clinical trials in addition to ERK, IDH1, EZH2, and aurora kinase inhibitors in preclinical development.142
Little is known about the impact of genomics on immunotherapy. Immunotherapy with the CTLA-4 inhibitor ipilimumab and the PD1 inhibitors pembrolizumab and nivolumab have revolutionized melanoma care with long-term survival even in advanced disease, but the role of genomics as predictive or prognostic for immunotherapy remains to be fully elucidated.349–351 While there is interest in mutational burden as a predictor of response to immunotherapy based on the hypothesis that immunogenicity is a probabilistic aggregate that increases in proportion to total mutation burden, to date no prospective data bears this out. While the hypothesis is supported by the fact that PD1 inhibitors were first approved in melanoma and then lung cancer (the two cancers with the highest mutational burdens), renal cell carcinoma has a very low mutational burden but has demonstrated responsiveness to PD1 inhibitors in phase II studies. Only a minority of patients on the registration studies for pembrolizumab and nivolumab had noncutaneous melanoma and ocular melanomas were excluded. While 20–35% of patients had BRAF V600 mutations, genotyping was not routinely done so that the rate of NRAS mutant and TWT patients is unknown.
Drug resistance and evolution in melanoma
Despite the activity of targeted agents in BRAF melanoma, drug resistance (both innate and acquired) remains a key challenge. Only about 50% of patients respond to BRAFi and in those who do respond, acquired resistance to BRAFi invariably develops.224 BRAFi resistance is driven by a diverse range of mechanisms, most of which reactivate MAPK activity or redirect proliferative signaling through the PI3K pathway. Resistance can occur through further mutation in NRAS,352 MEK1,353, 354 MEK2,354 NF1,344 or BRAF (via amplification).355 BRAF splice variants,356 EGFR activation,357, 358 and COT activation359 have also been implicated. Activation of other receptor tyrosine kinases such as PDGFR-β and IGF-1R or loss of PTEN has also been demonstrated in laboratory models.354, 360–364 Clinical trials are ongoing to test whether identification of these resistance mechanisms in the clinical setting will allow for targeted therapeutic approaches that may circumvent resistance to BRAFi. Hypothetical approaches to circumvent specific resistance mechanism after progression on dual inhibition with BRAFi and MEKi are given in Table 7.
Table 7 Hypothetical treatment approaches to acquired BRAFi/MEKi resistance
Mechanism of resistance | Therapeutic strategy |
EGFR activation | EGFR inhibitor—erlotinib, gefitinib, afatinib, and dasatinib PIK3 inhibitor—in trials |
IGF-1R activation
|
PIK3Ca inhibitor—in trials |
cMET activation | cMET inhibitor—crizotinib |
PDGFR-β activation | RTK inhibitor—dasatinib, sunitinib |
FGFR activation | FGFR inhibitors—ponatinib, dasatinib |
COT activation | ERK inhibitor—in trials |
With the recent approval of new targeted and immunologic therapies for metastatic disease, it is now critically important that the clinician clearly identify the genetic signature of the patient’s disease in order to optimize outcomes. Similarly, genomic testing can accelerate new drug development and target identification for the less common subtypes of melanoma and potentially lead to identification of resistance pathways to guide drug selection in individual patients. Thus, the genomic landscape of melanoma conforms to the clinical heterogeneity of the disease, with melanomas of different locations having markedly different genotypic signatures that speak to distinct etiologies. The challenge, then, is to use available genomic technologies in a real time clinical setting. In theory, one could identify the evolutionary mechanisms active in the individual patient to guide treatment selection.
In addition to molecular diversity across melanoma subtypes and even between individual patients with the same disease subtype, a higher order of complexity is becoming apparent. As outlined in the previous section on cancer evolution, recent data indicates that development of melanoma cannot be fully explained by a linear model of accumulating mutations leading to an increasingly aggressive phenotype. Instead, an emerging picture points to cancer as a complex ecosystem driven by the process of branched clonal evolution. As a result, multiple related, but distinct clonal lineages may exist within a single patient,251 with some lineages being more likely to progress and metastasize.252 Furthermore, this diversity may provide an escape mechanism leading to therapeutic resistance. This is well illustrated by almost universal development of resistance to BRAF inhibition in melanoma, where rapidly relapsing disease can harbor multiple resistance mechanisms differing between individual subclones even within a single patient.365 This challenge is potentiated by the fact that, while some mechanisms of resistance are driven by mutations, others appear to be mediated through epigenetic or posttranslational processes.366 The obvious and critical question raised by these observations is whether a single tumor biopsy in a metastatic setting can capture the full spectrum of disease heterogeneity, its drivers, and potential therapeutic targets. The solution to this conundrum may lie in approaches with the potential to monitor the clonal complexity on a larger scale, such as “liquid biopsies” that attempt to molecularly characterize circulating tumor DNA in the patient’s bloodstream. Emerging data describing detection of ctDNA and concordance of BRAF mutation status between tumor and plasma samples is encouraging.246, 367, 368 In addition, anecdotal evidence suggests that serial ctDNA analysis may be a useful marker of treatment response to immunotherapy.240
Summary
Detailed characterization of the genomic lesions underlying cancer has led to identification of biological pathways driving tumorigenesis, improved cancer diagnosis through molecular classification, enhanced selection of therapeutic targets for drug development, promoted the development of faster and more efficient clinical trials using targeted agents and sophisticated biomarkers, and created markers for early detection and recurrence surveillance. Yet, gaps nonetheless remain in our knowledge of the causative mutations underlying some cancers, in our understanding of the biology of the mutations we have identified, in developing drugs capable of targeting many of these diverse mutations, and in fending off the inevitable emergence of drug resistance. As summarized in this chapter, genomics has already made extraordinary contributions to our understanding of cancer biology and cancer medicine. Nevertheless, the enormous potential of cancer genomics has only just begun to be realized.