Early Detection Research Network
Phase	Category	Descriptor
1	Preclinical/exploratory	Promising directions identified
2	Clinical assay and validation	Clinical assay detects established disease
3	Retrospective/longitudinal	Biomarker detects disease early, before it becomes clinical, and a screen positive rule is defined
4	Prospective screening	Extent and characteristics of disease detected by the test and the false referral rate are identified
5	Cancer control	Impact of screening on reducing the burden of disease on the population is quantified

As will be seen, biomarker development for early detection has been uneven across the various organ sites; successes and challenges faced for three exemplary organs are placed in context in this chapter. These three organs were chosen because they exemplify organ sites for which early detection trials have demonstrated reduction in disease morbidity and mortality but for which success of early detection biomarker development has been variable. The organ site for which biomarker development is most advanced is the cervix; a clear viral etiology has been identified for this site, and virus constituents can be used for risk assessment, early detection, and diagnosis, triggering colposcopic intervention and vaccination, and thus achieving the phase 5 status of changing clinical practice described by Pepe and colleagues.

The Cervix and a Validated Biomarker

Invasive cervical carcinoma is one of the few tumors that results from well-defined premalignant lesions that can be identified and effectively treated, reducing morbidity and mortality from a tumor that is the third most common in women worldwide.⁴ More than 60 years ago, it was recognized that cellular abnormalities in cervical epithelium detectable in exfoliated cells could predict the presence of carcinoma.⁵ By 1988, premalignant cervical cytology had been codified as the Bethesda System for reporting cervical/vaginal cytologic diagnoses.⁶ Population-based light microscopic screening for atypical cells in cervical smears with surgical removal of detected abnormal epithelium in Great Britain is estimated to have reduced the prevalence of invasive squamous carcinoma by 80% after 1988.⁷ Conventional cervical cytology is thus one of the most effective early detection biomarkers in use today.

Delineation of molecular carcinogenesis in the cervix (Figs. 17-1 and 17-2) has provided alternative methods for detection, risk assessment, and diagnosis. Nearly all cervical carcinomas and their precursor lesions are caused by human papillomavirus (HPV).⁸ HPV is a DNA tumor virus that in 1934 was shown by Peyton Rous to produce squamous tumors in rabbits.^11–11 It was not until 1974 that zur Hausen recognized the association between HPV and cervical cancer.^14–14 Persistent infection with high-risk strains of HPV, most commonly types 16 and 18,¹⁵ is an essential step in cervical carcinogenesis (reviewed in reference 16).¹⁶ The virus encodes three proteins that stimulate cell proliferation and survival: E5, E6, and E7 (reviewed in reference 17).¹⁷ The viral proteins E6 and E7 interfere with p53 and Rb, respectively,^18,19 resulting in resistance to apoptosis and chromosomal instability. The HPV genome may be integrated into the host cell genome, accounting for long-term persistence of the virus. Integration of the E6 and E7 genes^20,21 is crucial to progression of the infected cell to a malignant state, whereas other viral genes may be lost.

Figure 17-1 Three potentially premalignant lesions are depicted. 1, *Normal cervical mucosa* at the squamocolumnar junction is composed of a basilar zone from which squamous cells progressively mature and ultimately give way to flattened cells on the mucosal surface. The p16 immunostain in the lower frame is negative, reflecting the absence of expression of p16 protein by normal cells. 2, *Low grade squamous dysplasia* may occur as a consequence of human papillomavirus infection. In this lesion, the basilar zone is expanded through the lower third of the mucosa and cells have become more disorderly, with irregular spacing between cells, variable nuclear size and density, and clearing of cytoplasm in some of the cells in the upper mucosa (koilocytosis). p16 immunostains show strong expression in the lower third of the mucosa. 3, Progression to *high-grade squamous dysplasia* is indicated by the near-complete absence of cellular maturation toward the surface. Cells contain large, misshapen, and irregularly distributed nuclei. In immunostained sections, *CDKN2A* (*p16*) is strongly expressed through the full thickness of the epithelium. 4, Finally, *invasive carcinoma* occurs when precursor cells acquire the capacity to invade the underlying cervical stroma as shown in this photomicrograph. Invasive cervical squamous cell carcinoma strongly expresses p16 as shown in the lower frame. Vaccination may prevent the earliest changes in the progression to invasive carcinoma. Low-grade squamous dysplasia (CIN I) usually regresses and does not require ablation. High-grade in situ dysplastic changes can be effectively treated by excision of dysplastic epithelium, but if invasive carcinoma occurs, more aggressive surgery and/or radiation therapy may be required.

Figure 17-2 Molecular changes that follow infection with high-risk subtypes of human papillomavirus. In most cases the virus is cleared, but in persistent infection, the virus may lead to dysplasia. The virus may also clear from the dysplastic epithelium, but in a subset of cases, persistent infection ultimately leads to carcinoma.

Detection of HPV DNA in clinical samples can be accomplished by a number of methods, with the dominant techniques based on polymerase chain reaction (PCR) or hybridization with signal amplification (reviewed in reference 22).²² In the United States, the predominant assay is the Food and Drug Administration (FDA)-approved Hybrid Capture 2 method (Qiagen, Germantown, Md.). In this technique, viral DNA hybridizes with an HPV RNA probe cocktail that detects multiple high-risk HPV types. The RNA : DNA hybrid is captured by hybrid-specific antibody that coats the surface of a microplate well. The hybrids are detected by a second hybrid-specific antibody conjugated to alkaline phosphatase. Each hybrid may bind to multiple antibodies, amplifying signal. Antibody-conjugated alkaline phosphatase cleaves a chemiluminescent substrate, emitting a light signal that is quantified with use of a luminometer. Light emitted above a standardized threshold intensity indicates the presence of virus. The test covers 11 HPV types in addition to HPV 16 and 18 and has a reported analytic sensitivity of 1.09 picograms/mL. A reengineered version of this hybrid capture test for low resource settings, careHPV, covers 14 HPV types, requires no running water, and has a turnaround time of 2.5 hours. The test has been successfully used in rural China with a clinical sensitivity of 90% for the detection of cervical intraepithelial neoplasia types 2 and 3.²³
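As an illustration only, the final signal-calling step of a chemiluminescent hybrid capture assay can be sketched in a few lines of Python. The function name, the relative light unit values, and the handling of a reading exactly at the threshold are assumptions made for this sketch and are not taken from the assay's documentation.

```python
def call_hpv_result(sample_rlu: float, threshold_rlu: float) -> str:
    """Return a qualitative call from a luminometer reading.

    Hypothetical sketch: light emitted above the standardized threshold
    is reported as positive; anything else is reported as negative.
    """
    if threshold_rlu <= 0:
        raise ValueError("threshold must be positive")
    return "positive" if sample_rlu > threshold_rlu else "negative"


# Made-up readings in relative light units (RLU), for illustration only.
for reading in (120.0, 480.0):
    print(reading, call_hpv_result(reading, threshold_rlu=300.0))
```

A real assay would derive the threshold from calibrators run on the same plate; here it is simply passed in as a number.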

Understanding of the molecular oncogenesis of cervical squamous carcinoma has provided a standardized biomarker of risk. More than 95% of HPV infections clear up without further consequences. However, persistent infection for months to years with HPV type 16 or 18 carries a 40% risk of squamous intraepithelial lesions (SILs),²⁴ and untreated high-grade SILs have been estimated to result in invasive carcinoma in approximately 30% of persons over 5 to 10 years.^24,25 Persistent HPV infection thus indicates high risk for eventual invasive carcinoma and represents an additional biomarker that could be useful in the prevention of invasive carcinoma.

Testing for HPV DNA has been found to be more sensitive than cytologic screening for detection of high-grade SILs^26,27 but somewhat less specific. In view of the number of infections that are self-limited, relying on a single HPV test may result in greater numbers of negative colposcopy examinations. To address this problem, co-testing strategies have been evaluated whereby both cytology and HPV testing are performed on initial examination.²⁸ Negative results have a high negative predictive value, and the rescreening interval can be extended from 1 to 3 years, reducing overall screening expense. Persons with positive results of a viral DNA test but negative cytologic findings have been rescreened at 1 year and examined with colposcopy if positive results are found. Whether this strategy or other strategies for cervical cancer risk management will be universally adopted remains to be determined.^22,29 What is clear is that HPV testing provides a means for improving sensitivity for detection of early cervical lesions, further reducing mortality from invasive cervical carcinoma while potentially reducing screening expense.
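The relationship between sensitivity, specificity, and predictive value that underlies this discussion can be made concrete with a short calculation. The sketch below applies Bayes' rule to a single screening test; the input values are hypothetical and are not drawn from the studies cited in this chapter.

```python
def predictive_values(sensitivity: float, specificity: float, prevalence: float):
    """Return (PPV, NPV) for a screening test using Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_neg = (1.0 - sensitivity) * prevalence
    true_neg = specificity * (1.0 - prevalence)
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Hypothetical inputs, NOT taken from the chapter or its references.
ppv, npv = predictive_values(sensitivity=0.95, specificity=0.90, prevalence=0.02)
print(f"PPV = {ppv:.3f}, NPV = {npv:.4f}")
```

With a low-prevalence lesion, even a modest loss of specificity produces many false-positive referrals while the negative predictive value stays high, which is the trade-off that co-testing is meant to manage.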

Finally, HPV is not only a biomarker for risk assessment and colposcopic intervention but is an immunogen that has been successfully incorporated into bivalent (types 16 and 18, Cervarix) and quadrivalent (types 6, 11, 16, and 18, Gardasil) vaccines to prevent infection and carcinogenesis (reviewed in references 30 and 31).^30,31 An in-depth discussion of HPV vaccines for tumor prevention is beyond the scope of this chapter. However, the many facets of HPV expression in cervical carcinoma and the emerging role of HPV in other tumors, including vulvar and vaginal condylomata, certain skin cancers, and head and neck carcinoma, illustrate how a valuable biomarker that is integral to oncogenesis can be used to improve the accuracy of estimating tumor risk and the specificity of treatment. Unfortunately, such clearly defined and easily accessible biomarkers are rare. Rather, the more usual case is described in the next sections for colon and lung, where potential biomarkers are defined but have not found a role in early detection, diagnosis, prevention, or treatment.

Colon and Multistep Carcinogenesis with Hereditary Components

Like cervical carcinoma, colonic adenocarcinoma can be effectively prevented by extirpation of premalignant epithelium, which in the colon is represented by polyps. Several recent trials have indicated that removal of polyps by colonoscopy 34–34 reduces the frequency of colorectal carcinoma (CRC), and consensus opinion is that endoscopic polypectomy has a significant impact on the incidence of CRC. Recent data have also suggested that colonoscopic screening reduces CRC mortality by more than 50%³⁴ and that the less extensive sigmoidoscopy reduces mortality from distal CRC by a nearly equal percentage (50%).³⁵ The latter sigmoidoscopic study was a large, randomized, population-based screening trial. Colonoscopic screening for colon polyps beginning at age 50 years has become a standard of care.

Molecular biomarkers to date have not played a direct role in reduction of morbidity and mortality in persons with CRC. However, molecular studies have facilitated understanding of colon carcinogenesis (Figs. 17-3 and 17-4) and provided a rationale for colonoscopic surveillance and polypectomy. In the mid twentieth century, an association between benign polyps and CRC was suspected, but at that time no direct evidence was available to show that polyps undergo transition to CRC.³⁶ Suspicions were finally confirmed in the 1980s, when the clonal nature of both sporadic polyps and CRC was documented.³⁷ Activating mutations in the KRAS oncogene and inactivating mutations in several tumor suppressor genes were described.³⁸ The same mutations were demonstrated in both polyps and carcinoma, and the number of mutations was found to increase with the histologic grade of the adenoma. This finding suggested a progression to invasive carcinoma through the accumulation of genetic and epigenetic changes. These changes provide a proliferative and survival advantage to mutant cells that continue to proliferate and mutate until a malignant phenotype is reached. The combined data from these studies led to the proposal of a multistep model of colon carcinogenesis³⁹ that continues as the standard model.⁴⁰

Figure 17-3 Progression from (1) normal mucosa through (2) low-grade and (3) high-grade polyps to (4) invasive carcinoma, the last represented by poorly formed clusters of cells pushing aside the normal intestinal crypts visible on the right of the frame. Sporadic polyps shown in the upper row of hematoxylin and eosin–stained sections are typically pedunculated and composed of proliferating cells extending to the mucosal surface. As cellular atypia increases, an overlapping of cells occurs along the irregular mucosal surface of the high-grade polyp. 5, Polyps of familial adenomatous polyposis (FAP), attenuated FAP, and MUTYH-associated polyposis are morphologically similar to sporadic polyps but occur at an earlier age. 6, Serrated adenomas are flat, and individual crypts are preserved without the convolutions present in sporadic polyps. 7, Juvenile polyps are characterized by the presence of mucous cysts (visible on the left side of the frame) and an edematous and inflamed stroma. The surface of the juvenile polyp may be denuded of epithelium, and there may be granulation tissue in the mucosa. Finally, (8) Peutz-Jeghers polyps have a complex lobulated appearance and contain smooth muscle bundles in the stalk. Removal of sporadic polyps has been found to reduce the frequency of progression to invasive carcinoma. The presence of invasive carcinoma may require vigorous intervention (discussed elsewhere in this volume).

Figure 17-4 The accumulation of molecular changes that precede colorectal carcinoma (CRC). Sporadic tumors are initiated by a serial accumulation of somatic mutations that may eventuate in CRC. In the hereditary polyposes, the initiating event is the mutation present at birth. Second hits may include a variety of molecular changes including tumor suppressor gene promoter methylation, mutations, and copy number changes. Mismatch repair (*MMR*) includes *hMLH1, hMLH2, hMSH2, hPMS2*, and *EpCAM* genes.

Further insight into colon carcinogenesis has been provided by studies of hereditary forms of colon cancer and the molecular changes that underlie them. A detailed discussion of hereditary colon cancer is provided elsewhere in this volume and is beyond the scope of this chapter. However, a brief description of the major forms of hereditary tumors will provide insight into colon carcinogenesis and perspective on the strengths and limitations of the role of genetic data in early detection of this prevalent tumor type.

Several forms of hereditary colon cancer are recognized (reviewed in reference 41)⁴¹ and are listed in Table 17-2.

Table 17-2

Genetics of Polyposis

Syndrome (Abbreviation, Synonym)	Gene(s)	Molecular Phenotype	Noncolonic Organs Affected
Sporadic adenomatous polyp	Unknown	Chromosomal instability (aneuploidy)	None known
Familial adenomatous polyposis (FAP)	APC	Chromosomal instability (aneuploidy)	Duodenum, stomach, pancreas, thyroid, liver, central nervous system
Attenuated familial adenomatous polyposis (AFAP)	APC	Chromosomal instability (aneuploidy)	Duodenum, thyroid, liver, central nervous system
Hereditary nonpolyposis colon carcinoma (HNPCC, Lynch syndrome)	hMLH1, hMSH2, hMSH6, hPMS2	Microsatellite instability	Endometrium, stomach, ovary, biliary and urinary tracts, small bowel, central nervous system
MUTYH-linked adenomatous polyposis (MAP)	MUTYH	Chromosomal instability (aneuploidy)	Duodenum
Peutz-Jeghers polyposis (PJS)	STK11 (70%)	Unknown	Breast, pancreas, stomach, ovary, lung, small bowel, uterus, testis
Juvenile polyposis syndrome	SMAD4, BMPR1A	Unknown	Blood vessels (hereditary hemorrhagic telangiectasia)
Hyperplastic polyposis	SMAD4, BMPR1A (40%), PTEN	Unknown	Stomach, pancreas, small bowel

Familial Adenomatous Polyposis

The first hereditary colon carcinoma to be described, familial adenomatous polyposis (FAP), is characterized in its classic form by the occurrence of hundreds to thousands of polyps in the colon of affected persons with the formation of invasive carcinomas at an early age. Polyps may also sometimes be found in the upper gastrointestinal tract. As long ago as 1882, siblings were described with numerous and recurring polyps,⁴² and by the mid twentieth century, FAP was established as a hereditary autosomal-dominant disorder.⁴³ A gene mutation associated with FAP was identified on chromosome 5q by linkage analysis^44,45 and positional cloning studies.^46–49 The gene, adenomatous polyposis coli (APC), is now known to encode a tumor suppressor that modulates Wnt signaling through binding and stabilization of a complex of proteins that includes GSK3β, β-catenin, and axin family members. Mutations in APC truncate the protein at its carboxyl terminus and disrupt the protein signaling complex, which results in deregulation of cell cycle control, cell migration, differentiation, and apoptosis (reviewed in reference 50).⁵⁰ In addition to being responsible for FAP, APC mutation is present in approximately 60% of sporadic colon adenocarcinomas^51,52 and an approximately equal proportion of sporadic polyps,^51,53 suggesting that APC mutation is an early event in sporadic carcinogenesis, as well as in hereditary disease. An attenuated form of FAP with smaller numbers of tumors (<100) has been described.^54,55 This condition is referred to as attenuated FAP⁵⁴ and may be driven by mutations at the 3′ and 5′ ends of APC rather than in the central portion of the gene that is affected in classical FAP.⁵⁵

Hereditary Nonpolyposis Colon Cancer

The most common form of hereditary colon cancer is hereditary nonpolyposis colon cancer (HNPCC). HNPCC is a familial syndrome that differs from FAP in that it manifests fewer polyps. The earliest description of this disorder can again be traced back to the early twentieth century,⁵⁶ but the hereditary character and syndromic features of the disease were firmly established by the diligent kindred studies of Lynch and Lynch,⁵⁷ after whom the syndrome is named (Lynch syndrome). In brief, the syndrome consists of CRC that occurs in close relatives, often younger than 50 years,^58,59 without adenomatous polyposis. HNPCC is also associated with an increased frequency of tumors in the endometrium, stomach, ovary, hepatobiliary tract, upper urinary tract, small bowel, and central nervous system.⁴¹ In contrast to FAP polyps, which are frequently found on the left side of the colon, persons with HNPCC frequently have flat lesions on the right side of the colon with a distinct histologic appearance referred to as “serrated adenoma” (Fig. 17-3).

The molecular genetic underpinnings of HNPCC were deciphered beginning with the linkage to microsatellite markers on chromosome 2p.⁶⁰ The genetic basis of the disease was established when it was found that patients with the syndrome harbored mutations in the mismatch repair (MMR) genes hMLH1,^61,62 hMSH2,^63,64 hMSH6,⁶⁵ and hPMS2,^66,67 with mutations in hMLH1 and hMSH2 accounting for more than 90% of HNPCC cases. The germline mutations are heterozygous, and colon carcinogenesis results from silencing of the nonmutant allele by methylation^68,69 or by acquired allelic loss.

The phenotypic consequence of gene silencing is loss of MMR protein and the resulting inability to repair DNA replication errors, ultimately leading to accumulation of mutations contributing to carcinogenesis. At a clinical level, protein loss can be demonstrated by immunohistochemistry and optimally requires four separate stains. The inability of affected cells to repair DNA replication errors results in insertions or deletions in short tandem repeats (microsatellites) that exist throughout the genome and is referred to as “microsatellite instability” (MSI).^70,71 Currently, MSI is documented by testing of a standard set of five microsatellite markers recommended by a consensus panel.⁷² Results are scored on the basis of the number of unstable loci and are classified as MSI-High (MSI-H), MSI-Low (MSI-L), or MSI-Stable (MSI-S). More than 90% of HNPCC tumors are MSI-H, but approximately 15% of all sporadic tumors are also MSI-H,⁷³ and thus the test is not specific for hereditary disease.
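Scoring by the number of unstable loci can be expressed as a simple rule. The sketch below is illustrative only: the function name is hypothetical, and the cutoffs (two or more unstable markers for MSI-H, one for MSI-L, none for MSI-S) follow the commonly cited convention for a five-marker panel rather than anything stated explicitly in this chapter.

```python
def classify_msi(unstable_flags):
    """Classify a tumor from per-marker calls (True = marker is unstable).

    Assumed cutoffs for a five-marker panel:
    >= 2 unstable -> MSI-H, exactly 1 -> MSI-L, 0 -> MSI-S.
    """
    if len(unstable_flags) != 5:
        raise ValueError("this sketch assumes a five-marker panel")
    n_unstable = sum(bool(flag) for flag in unstable_flags)
    if n_unstable >= 2:
        return "MSI-H"
    if n_unstable == 1:
        return "MSI-L"
    return "MSI-S"


# Hypothetical marker calls for three tumors.
print(classify_msi([True, True, False, True, False]))
print(classify_msi([False, True, False, False, False]))
print(classify_msi([False, False, False, False, False]))
```

Because sporadic tumors can also be MSI-H, a result from such a rule would point toward further workup rather than establish hereditary disease.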

MUTYH-Associated Polyposis

A third hereditary variant of colon carcinoma is MUTYH-associated polyposis (MAP), which is the newest polyposis syndrome (first described in 2002).⁷⁴ MAP is a disorder similar to attenuated FAP and is characterized by adenomatous polyps that may number in the hundreds but not in the thousands, usually ranging from 10 to 1000.^77–77 Polyps may also occur in extracolonic sites, usually the duodenum. Adenomas tend to occur in the proximal colon.⁷⁶ MUTYH is a glycosylase and part of the base excision repair pathway. Oxidative damage to DNA frequently results in the modification of guanine (G) to 8-oxoG. 8-oxoG can pair with adenine (A) during DNA replication, a mismatch that is recognized by MUTYH, which removes the mismatched A so that the correct base, cytosine (C), can be restored. The damaged G (8-oxoG) is then replaced with an intact G. Mutation of MUTYH at specific hot spots hinders recognition of the 8-oxoG : A mispairing, resulting in G : C to T : A transversion at sites of oxidative damage. The mutational hot spots are found in exon 7, resulting in amino acid substitution Y179C, and in exon 13, resulting in substitution G396D. Germline mutation at both hot spots (biallelic mutation) occurs in 0.3% of patients with CRC. Biallelic mutation confers a 28-fold relative risk for CRC compared with control subjects. Monoallelic mutation is more common, with a frequency of approximately 2% in CRC, but occurs at approximately the same frequency in control populations. Most studies suggest that monoallelic mutation confers little or no increased risk of CRC.^75,76 Among control patients without polyposis, biallelic mutation is rare.⁷⁶
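The way an unrepaired 8-oxoG : A mispair becomes a fixed mutation can be followed through two rounds of replication in a toy model. The sketch below is a deliberate simplification: it uses the letter "o" as a stand-in for 8-oxoG, assumes that every 8-oxoG mispairs with A, writes both strands position by position (ignoring strand orientation), and collapses the repair steps into a single flag. All names in it are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
OXO_G = "o"  # stand-in symbol for 8-oxoG in this toy model


def replicate_damaged(template: str, mutyh_active: bool) -> str:
    """First round: copy a template that may contain 8-oxoG ('o').

    Assumption: 8-oxoG always mispairs with A. If MUTYH is active, the
    mispaired A is removed and C ends up opposite the lesion instead.
    """
    daughter = []
    for base in template:
        if base == OXO_G:
            daughter.append("C" if mutyh_active else "A")
        else:
            daughter.append(COMPLEMENT[base])
    return "".join(daughter)


def replicate(strand: str) -> str:
    """Second round: ordinary copying of an undamaged strand."""
    return "".join(COMPLEMENT[base] for base in strand)


damaged = "AToCA"  # position 2 was originally G, now oxidized
for active in (True, False):
    daughter = replicate_damaged(damaged, mutyh_active=active)
    print("MUTYH active:", active, "->", replicate(daughter))
```

With the flag set to true, the strand copied from the daughter carries G at the damaged position again; with it set to false, that position carries T, so the original G : C pair has become a T : A pair.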

Hamartomatous Polyposes

The fourth hereditary form of colon cancer is the hamartomatous polyposes, a category that includes Peutz-Jeghers syndrome (PJS) and juvenile polyposis syndrome. First described in the 1890s,^78,79 PJS was named in 1954 by Bruwer and colleagues⁸³ in recognition of the work of Peutz⁸⁰ and Jeghers.^81,82 PJS is an autosomal-dominant condition characterized by multiple histologically distinct polyps (Fig. 17-3) that occur in the large and small bowel in association with melanin pigmentation of the lips, mouth, nose, anus, and extremities.⁸⁴ The condition is associated with a high risk for cancer not only in the large and small intestine but also in the esophagus, stomach, pancreas, breast, uterus, ovary, and lung.^87–87 Gene mapping has identified a locus on chromosome 19p13.3^88,89 that encodes a serine-threonine kinase, STK11 (LKB1), a gene that is mutated in more than 90% of affected individuals.^90–94 A variety of mutations including deletions, truncations, and missense point mutations may occur over nearly the full coding region of the gene.⁸⁶ STK11 protein is multifunctional, playing a role in cell cycle arrest, Wnt signaling, and the tuberous sclerosis complex pathway, with effects on mammalian target of rapamycin signaling.⁸⁴ Mutation, often associated with loss of heterozygosity,^95,96 results in loss of function, and the gene is thus considered to be a tumor suppressor gene. The overall frequency of PJS is estimated at 0.5 to 2.0 per 100,000 live births.⁹⁷

Juvenile Polyposis

Juvenile polyps are characteristically solitary pedunculated lesions that contain tortuous, often cystically dilated crypts, stromal granulation tissue and inflammation, and denudation of the surface epithelium.⁹⁸ These lesions frequently occur in children, and there is no evidence that solitary juvenile polyps pose an increased risk for carcinoma.^100–100 However, multiple juvenile polyps may occur in young persons and their family members,^103–103 and the presence of >5 juvenile polyps or a single juvenile polyp in a person with a family member with juvenile polyposis now defines the condition, termed “juvenile polyposis.” This form of polyposis carries a lifetime risk of colon carcinoma of 39% and a relative risk of 34.¹⁰⁴ Fifty percent to 60% of affected individuals carry germline mutations in SMAD4 or BMPR1A.^105–107 Nearly all patients with SMAD4 mutations also have hereditary hemorrhagic telangiectasia¹⁰⁸ (overlap syndrome). Both SMAD4 and BMPR1A are involved in the transforming growth factor–β (TGF-β)/bone morphogenetic protein pathway.

Other Polyposes

Finally, a variety of polyposis syndromes with potential genetic linkage have been described and are best exemplified by hyperplastic polyposis syndrome (serrated polyposis syndrome).^109,110 This condition consists of the occurrence of multiple sessile polyps variously referred to as hyperplastic polyps or serrated adenomas, frequently on the right side of the colon and numbering from 5 to >100.¹¹¹ Although familial associations of the syndrome¹¹² and linkage to high risk of colon carcinoma¹¹³ have been reported, the genetic basis is complex and associated with mutations in several driver genes, including BRAF and KRAS.¹¹⁰ No single genetic mechanism has been linked to the syndrome, and the true frequency is not established. To date, the number of reported cases is small.

Biomarkers for Colon Cancer Screening

Genetic tests

Although the number of highly penetrant mutations that predispose to colon carcinoma is large, the proportion of colon carcinomas that can be accounted for by these mutations is approximately 5%⁴¹ and has been overestimated in the past.⁷³ Markers of hereditary predisposition and molecular progression in sporadic tumors that have been delineated to date have not been easily converted into an early detection test suitable for broad population-based applications for several reasons. First, the types of accessible specimens that can be used for tests for early detection consist primarily of blood and fecal DNA. Although DNA from invasive tumors can be detected in the blood by PCR-based mutation testing, it is not as useful at earlier polyp stages. In addition, collection and processing of fecal specimens for a biomarker that is expected to be present at a low level and heavily contaminated by nonneoplastic cells is a logistical challenge. Second, some of the genetic abnormalities identified in colon carcinogenesis do not readily lend themselves to screening assays. Mutations in APC, for example, consist mainly of truncations and deletions that may occur over a wide region of the gene, and thus designing a sensitive clinical assay has been difficult. Finally, the wide availability of colonoscopy services that combine polyp detection and excision reduces the cost and improves the attractiveness of direct and systematic colonoscopic population-based surveillance.

Other Screening Biomarkers

Biomarkers that have been most successful for colon cancer screening are fecal tests such as the guaiac fecal occult blood test (FOBT), the fecal immunochemical test, and the stool DNA (sDNA) test. The greatest experience has been with the FOBT. This test is based on a color change whereby α-guaiaconic acid extracted from the wood of the tropical Guaiacum tree is rapidly oxidized by hydrogen peroxide to a blue-colored quinone in the presence of heme derived from blood. A wide variability in results occurs¹¹⁴ as a consequence of the level of compliance with collection methods, the type or brand of kit used, the frequency of testing, user interpretation, and other factors. Despite a lack of standardization, FOBT screening with follow-up colonoscopy is reported to reduce both incidence¹¹⁵ and mortality^116,117 from colon carcinoma. The former is attributed to polyp removal. The fecal immunochemical test is a second test for occult blood and is reported to be more sensitive and specific than the guaiac assays. It has been shown to have a cancer detection rate similar to colonoscopy but has a lower sensitivity for polyps.¹¹⁸

Occult blood tests measure host response (hypervascularity and bleeding) to a tumor. The sDNA test, however, aims to measure a direct product of tumor cells. A recent version of the sDNA test measures multiple markers in stool using an array-based platform ¹¹⁹ and for this reason has been referred to as “next generation,” although this technology does not use deep sequencing methods and should not be confused with “next generation sequencing.” The biomarkers that are incorporated into the platform include four methylation probes, as well as mutated KRAS and hemoglobin. Patterns of gene methylation have been identified in colon tumors that have been characterized as CpG island methylation high and CpG island methylation low (reviewed in reference 120).¹²⁰ When applied to a preliminary cohort of 678 subjects with polyps or colorectal carcinoma, the combination test including methylation markers, KRAS mutation, and occult blood had a sensitivity of 85% for carcinoma and 54% for polyps.

Despite the availability of these fecal-based tests, the target in early colon cancer intervention appears to be shifting from detection of invasive carcinoma to prevention by detection and thorough clearing of adenomatous polyps from the colon. Defining the biology of colon carcinogenesis and confirming that polyps are the precursor lesions along with improvements in colonoscopic methods has led to reduction in mortality from colon cancer. Colonoscopy enables direct identification of potential precursor lesions and at the same time provides a means for ablating them, which is strikingly evident in the new sigmoidoscopy findings,³⁵ indicating that although sigmoidoscopy reduces mortality from visible distal colon tumors, mortality from proximal lesions is unaffected. It seems likely that colonoscopy will be the main vehicle for early detection and intervention in colon cancer for the foreseeable future.

Lung Carcinogenesis and a Known Carcinogen

For several decades, lung carcinoma has had the highest mortality of all tumors, accounting for more than twice as many deaths as the next most lethal tumor.^121,122 Initial large-scale efforts during the 1970s and 1980s to reduce mortality by early detection using chest radiographs and/or sputum cytology were not successful. Chemoprevention using β-carotene and/or alpha tocopherol dietary supplementation also failed.^123–126 Recently, however, early detection efforts have begun to yield some success in reducing mortality even in this problematic tumor. Low-dose computed tomography (LDCT) has a higher clinical sensitivity for detecting smaller tumors than does a chest radiograph. Early studies in smokers documented a 2.7% prevalence of malignant disease detected by LDCT versus 0.7% in the same patients screened by chest radiograph.¹²⁷ In a multicenter study of more than 30,000 smokers, LDCT detected lung cancer in 1.3% at baseline and an additional 0.3% at annual follow-up examinations, for an overall detection rate of approximately 1.5%.¹²⁸ Although this study was not randomized and had no unscreened control arm, it showed a survival advantage in comparison with historical control subjects. Recently, a large randomized trial of CT versus radiography conducted annually in three rounds found that LDCT detected carcinoma in 2.4% of the screened high-risk population and resulted in a 20% reduction in lung cancer mortality that was not observed in the chest radiograph control group.¹²⁹ The weight of evidence supporting improved long-term mortality rates in smokers has prompted the publication of multisociety consensus guidelines recommending annual LDCT screening for smokers and former smokers ages 55 to 74 years with access to appropriate follow-up.¹³⁰

Premalignant Lesions in the Lung

Unlike mortality reductions that have been achieved in the colon and cervix, where preinvasive and therefore nonmalignant lesions are targeted, the benefits of LDCT screening for lung cancer have been achieved through early detection of invasive tumors. A strategy to identify in situ premalignant lesions could potentially reduce lung cancer mortality to a much lower rate. Unfortunately, what are thought to be premalignant lesions in the lung are difficult to detect and may occur at unpredictable locations anywhere in the central and alveolar airways (Figs. 17-5 and 17-6), exemplifying the challenges confronting prevention efforts in some organ systems.

Figure 17-5 Multistep carcinogenesis in lung is complicated by the heterogeneity of tumor types that arise in separate locations in the respiratory tract. Squamous tumors typically form in the central airways where they are thought to arise from the (1) respiratory mucosa. The earliest premalignant lesion (2), low-grade squamous dysplasia, exhibits a polarity in which dark basaloid cells mature into large cells that flatten along the surface, as seen in the cervix (Fig. 17-1). 3, High-grade squamous dysplasia exhibits less distinct maturation and a higher level of cytological diversity. 4, Invasive squamous cell carcinoma occurs when tumor cells extend into the underlying stroma, sometimes forming a mass visible on low-dose computed tomography (*LDCT*). Adenocarcinomas typically arise peripherally in the (5) alveoli and terminal bronchioles. The best-defined precursor lesion of adenocarcinoma is (6) atypical adenomatous hyperplasia (AAH); these are small lesions (<5 mm) in which the alveolar architecture is preserved. 7, Adenocarcinoma in situ (AIS) is a further progression of peripheral airway epithelium toward malignancy. This category represents a redefinition of the low-grade adenocarcinoma previously known as “bronchiolo-alveolar carcinoma.” These lesions are solitary, small (≤3 cm) lesions consisting of a single cell layer that spreads in a nondestructive pattern of growth along the alveolar septa that is referred to as “lepidic” and is detectable as a “ground glass” nodule by LDCT. 8, Invasive adenocarcinoma spreads beyond the alveolar surface, forming a mass that appears solid on LDCT. The presence of a mass visible by LDCT suggests tumor. However, by the time this occurs, the tumor is often invasive and must be treated aggressively.

Figure 17-6 The molecular pathology of smoking-related lung cancer begins with the direct interaction between metabolites of carcinogenic polyaromatic hydrocarbons in cigarette smoke with host DNA, which creates bulky adducts that affect DNA repair and create transcription errors. The result is a wide range of genomic losses, gains, translocations, and point mutations that account for the high degree of molecular heterogeneity in lung tumors with diverse physiological effects.

Squamous Dysplasia

The predominant premalignant lesion of the central airways is squamous dysplasia¹³¹ (also referred to as “atypia”). Squamous dysplasia was documented more than 50 years ago in association with carcinoma,¹³² as well as in smokers without concurrent carcinoma.¹³³ Central squamous lesions are flat, inconspicuous, and only rarely identified by white light fiberoptic bronchoscopy. They are somewhat better visualized by laser-induced fluorescence emission bronchoscopy.¹³⁴ This method relies on a loss of autofluorescence, and signal from dysplasias may be mimicked by fluorescence loss caused by mucosal hemorrhage and other reactive changes. Thus, although sensitivity of laser-induced fluorescence emission bronchoscopy is high, specificity is low.

Prospective studies on the power of dysplasia to predict future central airway malignancy are few and face the forbidding challenges of inaccessibility of the bronchi and the prolonged follow-up that is required to achieve significant results in a sufficiently powered cohort. Sputum studies indicate that atypia of cells exfoliated from the central airways is associated with elevated risk for incident carcinoma. The hazard ratio increases with the proximity of the sputum collection date to lung cancer diagnosis, but sputum atypia even 3 years prior to lung cancer diagnosis is associated with increased risk of lung carcinoma.¹³⁵ Confirmation of sputum analyses in biopsy studies to identify sources of dysplastic epithelium will require a large and disciplined multicenter trial involving bronchial mapping and revisiting dysplastic sites to evaluate the long-term disposition of discovered lesions.

The corollary question of whether ablation of premalignant central airway lesions could reduce lung cancer mortality remains an open one. A recent chemoprevention study¹³⁶ using iloprost, a prostacyclin agonist, has suggested that reversal of premalignant dysplasia morphology may be possible, but the study was a small one, and the end point was purely histologic. The long-term effect on mortality of agents that may alter the premalignant morphology of central airway epithelium remains unknown.

Atypical Adenomatous Hyperplasia

The predominant premalignant lesion in the peripheral airways is thought to be atypical adenomatous hyperplasia. This lesion is usually beyond the reach of the bronchoscope and is not easily accessible for monitoring and treatment. Data regarding the formation and natural history of these lesions are based entirely on prevalence studies of lesions discovered at autopsy or discovered incidentally in surgical resections for carcinoma.

Molecular Changes during Lung Carcinogenesis

As in other organs, most notably the colon, molecular changes that accompany the morphologic changes of dysplasia have been investigated in considerable detail. These studies are informed by the well-established role of tobacco smoke as a carcinogen in lung cancer. The dominant carcinogenic compounds found in cigarette smoke are the polyaromatic hydrocarbons and nicotine-derived nitrosamine ketone.¹³⁷ These compounds are metabolized to highly reactive intermediates that are inactivated largely by mitochondrial glutathione transferases. However, repair is imperfect, and a small proportion of the intermediates react directly with DNA, forming bulky adducts. Unless removed by the DNA repair system, adducts can create errors of replication. Over time, molecular alterations accumulate, providing a selective advantage to mutant cells for growth and survival.

Components of this pathway have been associated with increased risk for future lung cancer. Case control studies indicate that current smokers with tumors have higher levels of adducts in blood or tissue but that nonsmokers and former smokers do not have these higher levels.¹³⁸ Bulky adducts appear to reflect ongoing smoking-related DNA damage rather than biological changes specifically associated with malignant transformation. Clinically applicable adduct levels predictive of lung cancer have not been established.

Glutathione transferase and DNA repair enzyme mutations and polymorphisms that affect DNA repair activity could represent a risk factor for lung cancer because efficient deactivation of reactive carcinogens and repair of DNA damage caused by bulky adducts are important in protecting potential precursor cells from critical mutational changes. However, single-nucleotide polymorphism (SNP) association studies in many carcinogen metabolizing and DNA repair enzymes have demonstrated only weak and inconsistent associations and have not been sufficiently informative to guide intervention in potentially affected persons.^139–143

Tumor Suppressor Gene Methylation

DNA damage in lung premalignancy engenders a number of genetic and epigenetic changes that provide a cellular survival or proliferative advantage to affected bronchial epithelial cells. The earliest molecular events may be epigenetic and include DNA methylation, histone modification, and alterations in micro (mi)RNA expression (reviewed in references 144 and 145).^144,145 The greatest experience regarding the use of epigenetic changes in the context of early detection of lung cancer has been obtained in regard to methylation. Methylation refers to the conversion of cytosine to 5-methylcytosine by DNA methyltransferase. In lung carcinogenesis, methylation preferentially occurs in CpG-rich regions (CpG islands) that are present in normally unmethylated promoter regions of critical genes (hypermethylation) and may silence gene expression. In lung tumors, interest has focused on genes such as p16,¹⁴⁶ which are silenced in tumors but expressed in normal control tissue and are considered to be tumor suppressor genes (TSGs). The development of a simple and sensitive assay based on the conversion of unmethylated cytosine to uracil in the presence of bisulfite, which leaves methylated cytosine intact, followed by PCR with primers specific for the methylated sequence, termed methylation-specific PCR,¹⁴⁷ has greatly facilitated the assessment of methylation status as a risk marker for lung carcinoma.
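The chemical principle behind methylation-specific PCR can be illustrated with a short sketch. Bisulfite treatment deaminates unmethylated cytosine to uracil, which is read as thymine after PCR, whereas 5-methylcytosine is protected and remains cytosine. The function name, the toy sequence, and the methylated positions below are hypothetical and chosen only for illustration; they do not correspond to any real promoter or primer design.

```python
def bisulfite_convert(sequence, methylated_positions):
    """Return the sequence as read after bisulfite treatment and PCR.

    Unmethylated C is converted (C -> U, read as T); methylated C is kept.
    methylated_positions holds 0-based indices of methylated cytosines.
    """
    converted = []
    for index, base in enumerate(sequence.upper()):
        if base == "C" and index not in methylated_positions:
            converted.append("T")  # unmethylated cytosine is converted
        else:
            converted.append(base)  # methylated C and all other bases are unchanged
    return "".join(converted)


# Hypothetical 8-base fragment with CpG cytosines at indices 1 and 4.
fragment = "ACGTCGCC"
methylated_read = bisulfite_convert(fragment, {1, 4})
unmethylated_read = bisulfite_convert(fragment, set())
print(methylated_read)    # CpG cytosines retained
print(unmethylated_read)  # every cytosine converted
```

Because the two reads differ at the CpG positions, primers complementary to one read will not anneal efficiently to the other, which is the basis on which the assay discriminates methylated from unmethylated templates.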

Methylation of key TSG promoter regions in cells exfoliated from bronchial surfaces into the sputum is nearly universally associated with prevalent lung carcinomas¹⁴⁸ and frequently occurs in sputa of smokers without a tumor as well. TSG methylation is not present in sputum of people who have never smoked. This finding initially led to optimism that methylation could be used to screen selected populations at risk for lung carcinoma. However, the predictive power of single gene methylation or methylation of a small number of genes is limited by the ubiquity of methylation in smokers and the consequent high rate of false-positive results. This problem has led to the strategy of testing promoter regions in large numbers of genes to estimate overall methylation burden.¹⁴⁹ Identification of panels of genes to optimize the predictive power of methylation testing is a process that is still under way. In receiver operating characteristic curve analyses, a seven-gene methylation profile selected from a 31-gene panel achieved a predictive accuracy of 71% to 77%. However, with a sensitivity set at 75%, the false-positive rate was 75%.¹⁵⁰ Although TSG promoter methylation may prove problematic as a general screening tool, it may be useful in specific niche populations or to supplement other methods of screening such as LDCT, and ongoing studies aim to further evaluate the usefulness of this approach.
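The effect of a high false-positive rate on screening performance can be made concrete with a small calculation of positive predictive value (the proportion of positive tests that are true positives). The sketch below takes the sensitivity and false-positive rate quoted above at face value; the prevalence of 2.4% is an assumption borrowed from the LDCT detection rate cited earlier in this chapter and is used only to illustrate the arithmetic.

```python
def positive_predictive_value(sensitivity, false_positive_rate, prevalence):
    """Proportion of positive results that are true positives (Bayes' rule)."""
    true_positives = sensitivity * prevalence
    false_positives = false_positive_rate * (1.0 - prevalence)
    return true_positives / (true_positives + false_positives)


# Sensitivity and false-positive rate as quoted in the text;
# the prevalence is an illustrative assumption, not a study result.
ppv = positive_predictive_value(sensitivity=0.75,
                                false_positive_rate=0.75,
                                prevalence=0.024)
print(f"Positive predictive value: {ppv:.3f}")
```

Whenever sensitivity and false-positive rate are equal, the positive predictive value equals the prevalence, so a positive result leaves the estimated risk unchanged. Lowering the false-positive rate while holding sensitivity constant is what raises the predictive value.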

Aneuploidy

A second molecular consequence of tobacco exposure is chromosomal instability, which is detected as aneuploidy by fluorescence in situ hybridization (FISH) in bronchial epithelial cells in situ and in sputum. Aneuploidy is nearly universal in lung carcinoma. It is also frequently observed in high-grade dysplastic lesions in the central airways.¹⁵¹ Aneuploidy in sputum has been associated with prevalent carcinoma^152,153 and has recently been shown to have a sensitivity of 76% within a detection window of 18 months before the appearance of invasive carcinoma.¹⁵⁴ However, FISH performed on sputum is labor intensive and difficult to perform both in the specimen preparation and slide interpretation. FISH requires a high level of expertise at every step, which has led to the development of automated methods of FISH screening. However, to date, these automated methods have not been broadly applied. It seems likely that FISH may most successfully be applied in restricted settings in conjunction with other technologies.

TP53 Mutation

Mutations in several tumor-associated genes have been documented in adjacent nonmalignant airways or in subjects at high risk for carcinoma but without concurrent malignancy. Chief among these genes is TP53, which marks clonal populations that may spread laterally across the bronchial surfaces without invading stroma. TP53 mutation appears late in lung carcinogenesis, and although it is the single most commonly mutated gene in lung carcinoma, it is mutated in only half of tumors and in fewer benign airways. It seems unlikely that single gene mutation analysis will be sufficiently sensitive for use as a screening tool.

High-Throughput Technologies

High-throughput methodologies have been suggested as possible early detection tools. Recently, genome-wide association studies have identified an SNP variant at 15q15.2 that is significantly associated with cancer risk, with an overall odds ratio of 1.15.¹⁵⁵ The low odds ratio suggests that the SNP variant itself will not represent a useful predictor. However, this region of the genome still needs to be explored for more powerful biomarkers.

Gene expression microarrays have identified consistent changes in the transcriptome of epithelium brushed from nonmalignant bronchial surfaces that distinguish smokers from nonsmokers.¹⁵⁶ An additional set of changes is reported to distinguish subjects with cancer from smokers without cancer at an accuracy of 83%.¹⁵⁷ Finally, large-scale genomic sequencing studies have identified differences in the “genomic landscape” between smokers and nonsmokers.¹⁵⁸ It is expected that manageable numbers of biomarkers and clinically applicable testing platforms will emerge from these exploratory studies that will improve risk assessment and management of high-risk smokers, as well as patients with already invasive carcinomas.

Other Organ Sites

Premalignant cellular and molecular changes are described in several other tumors, including breast, prostate, bladder, and esophagus. A detailed review of these changes may be found in appropriate chapters of this text (see Chapters 74, 83, 84, and 91). However, identification of these changes has not led to mortality reductions.

Conclusion: Cells and Molecules in Early Detection

The examples previously described illustrate the uneven progress that has been made and the challenges that remain in reducing morbidity and mortality from cancer. What emerges from this review is that progress depends on improved understanding of site-specific carcinogenesis and intervention strategies that are informed by basic biology. Relatively simple technologies such as cytology or HPV testing can profoundly reduce cancer burden. Understanding the biology of premalignancy provides a lead time advantage that can be translated into greatly reduced cancer morbidity and mortality. Appropriate molecular strategies may supplement and eventually replace morphology but will require thorough validation.

Diagnosis

Histology of tumor tissue remains the bedrock upon which diagnosis is established and is the basis for selecting appropriate testing to guide treatment. Histologic diagnosis, although simple in concept, is extremely complex in practice and crucially depends on the expertise of the examining pathologist. The primary tool of the pathologist remains the hematoxylin and eosin stained section but is now supplemented by an arsenal of immunohistochemical (IHC) stains and other tissue testing procedures that permit much greater specificity of diagnosis than was previously possible. A detailed discussion of tumor histology is beyond the purview of this chapter. However, the relevance of IHC to specific diagnoses and selected testing algorithms is briefly discussed in the next section.

Diagnostic Tests for Tumor Classification

Diagnostic IHC tests are relatively simple, inexpensive, and can be assessed in conjunction with in situ cellular architecture, which ensures a high level of specificity and permits an assessment of the relevance of results in relation to the overall clinical and anatomic features. Interpretation of IHC data is now assisted by online databases that provide frequencies of IHC results in specific tumors and organ sites. The requirements for satisfactory results in IHC testing are discussed in the Best Practices section later in this chapter.

IHC analysis is semiquantitative at best, and interpretation may be subjective. More quantitative methods are currently being evaluated, including automated quantitative analysis, which is based on quantification of fluorescent or chromogen signals generated from IHC preparations. The automated quantitative analysis platform has the advantages of the histologic section in addition to its quantitative features. However, the technology is not inexpensive and requires specialized operator training.

Quantitative messenger RNA (mRNA) testing platforms that compare RNA signatures with a database of results from similar tumors have also been proposed as diagnostic aids. RNA testing platforms are multiple and include quantitative PCR and microarrays (see the Best Practices section in this chapter). mRNA and microRNA quantification systems have been modified to accommodate formalin-fixed paraffin-embedded (FFPE) tissue and may be useful in the identification of tumor sites of origin when it cannot be determined through morphologic and IHC evaluation. However, this technology requires specialized equipment, and in situ microscopic detail is lost. Despite their many conceptual advantages, RNA diagnostic applications will require extensive standardization and validation before they find a place in the diagnostic armamentarium.

Prognosis and Prediction

Tissue biomarkers may be used to assess prognosis (natural history) or to predict response to therapy. The former predicts outcome without intervention and the latter measures the power of a biomarker to predict the effect of specific treatment on natural history. These two aspects of biomarker testing are not always easily separated, and it is for this reason that appropriate control arms are critical to clinical trials.

Protein Biomarkers for Prediction of Outcome

Protein biomarkers may be assessed in situ by immunohistochemistry or in fluid by several methods that now allow for the measurement of multiple markers simultaneously. Cellular morphology is retained in IHC methods, which adds to specificity and sensitivity over protein measurement in fluids or homogenates of disrupted cells.

IHC Markers

IHC staining for estrogen receptors (ERs) and progesterone receptors (PRs) was one of the earliest applications to find a place in patient management. Until the early 1980s, antibodies to these proteins were not widely available. However, with the development of monoclonal antibody technology, antibodies that could identify hormone receptors were created¹⁵⁹ and found to reliably label receptors even in FFPE sections of breast carcinomas.^160,161 Unexpected results of this testing were that these receptors are typically found in the cell nucleus and that the intratumoral and intertumoral range of expression is large. Because of this finding, semiquantitative scoring that takes into account intensity of staining and percentage of cells stained has been used to establish expression levels that define positive and negative results.^162,163 These scoring systems continue to prove their prognostic value to the present time.¹⁶⁴

A second IHC test that has been successfully applied to the management of breast carcinoma is the test for HER2/neu (ERBB2) expression. HER2 is a tyrosine kinase receptor (TKR) and a member of the epidermal growth factor receptor (EGFR) family of signaling molecules. Its role in human carcinogenesis was initially discovered through DNA screening, where it was found to be amplified in 30% of the tested breast cancers.¹⁶⁵ RNA and protein were shown to be overexpressed in tumors that are amplified, as well as in a subset of nonamplified tumors.¹⁶⁶ HER2 testing became crucial when it was found that HER2 protein overexpression was a determining factor for in vitro and xenograft responsiveness to anti-HER2 monoclonal antibody treatment.^167,168 In early clinical trials, anti-HER2 IHC was used as the method for determination of HER2 status and determined eligibility for treatment with the anti-HER2 antibody drug, Herceptin.¹⁶⁹ A standardized approach to HER2 IHC testing was established several years later when expert panels agreed upon criteria for determining HER2 status (Fig. 17-7).¹⁷⁰

Figure 17-7 Scoring of the HER2/neu immunohistochemical predictive marker in breast carcinoma is illustrated as follows:

1+: faint partial membrane labeling (brown staining) of >10% of tumor cells

2+: weak to moderate complete membrane labeling of >10% of tumor cells

3+: strong complete membrane labeling of >10% of tumor cells
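The three scoring criteria listed in this legend can be expressed as a simple decision rule. The sketch below is a simplification for illustration only: the function name and input encoding are hypothetical, a score of 0 for cases meeting none of the listed criteria is an assumption (the legend does not define it), and the sketch is not a substitute for published scoring guidelines or pathologist interpretation.

```python
def her2_ihc_score(intensity, complete_membrane, percent_cells):
    """Map staining observations to a HER2 IHC score per the legend above.

    intensity: "faint", "weak", "moderate", or "strong"
    complete_membrane: True if labeling is circumferential, False if partial
    percent_cells: percentage of tumor cells showing the labeling
    """
    if percent_cells <= 10:
        return 0  # assumption: below the >10% threshold nothing is scored
    if intensity == "strong" and complete_membrane:
        return 3
    if intensity in ("weak", "moderate") and complete_membrane:
        return 2
    if intensity == "faint" and not complete_membrane:
        return 1
    return 0  # assumption: combinations not listed in the legend


print(her2_ihc_score("strong", True, 40))    # 3
print(her2_ihc_score("moderate", True, 25))  # 2
print(her2_ihc_score("faint", False, 15))    # 1
```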

Recently, tests for multiple IHC markers have been combined into panels that include ER, PR, HER2, and the proliferation marker Ki-67,¹⁷¹ as well as several other markers thought to be of prognostic importance.¹⁷² These panels have been found to provide prognostic information that is comparable with that of multiplex mRNA expression assays discussed later and have the added advantage of accessibility in local laboratories and ready correlation with histologic features.

A substantial subset (~15%) of breast tumors do not express ER, PR, or HER2 and are now referred to as triple-negative tumors.¹⁷³ These lesions are notable for their aggressive histology and poor long-term outcome. This category overlaps but is not identical to the basal-like breast cancer category defined by microarray-based expression profiling¹⁷⁴ and includes nonhereditary tumors that frequently show perturbations of BRCA1 pathway gene expression. The triple-negative category thus has increased the importance of accuracy in both positive and negative IHC testing and reemphasizes the need for local validation and controls for accurate IHC procedures (see the Best Practices section in this chapter).

Finally, EGFR is expressed at a high level in most non–small cell lung carcinomas, including both adenocarcinoma and squamous carcinoma, and expression correlates with gene amplification and copy number.^175,176 When a scoring system was applied in a recent phase 3 trial testing the effectiveness of the anti-EGFR chimeric (mouse/human) monoclonal antibody cetuximab in advanced cases of non–small cell carcinoma, only patients with the highest levels of EGFR expression (scores >200) experienced a survival benefit.¹⁷⁷ This result suggests that EGFR expression levels determined by IHC may be a reasonable selection criterion for anti-EGFR antibody treatment and that properly scored EGFR IHC expression may prove to be a valuable predictor of response.
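Semiquantitative IHC scores that combine staining intensity with the percentage of cells stained are often computed as a weighted sum. The sketch below assumes a score of the H-score type, in which the percentages of weakly, moderately, and strongly stained cells are weighted 1, 2, and 3 to give a range of 0 to 300. The text does not state which scoring formula was used in the trial, so both the formula and the function names here are assumptions; only the cutoff of >200 is taken from the passage above.

```python
def h_score(percent_weak, percent_moderate, percent_strong):
    """Weighted sum of staining percentages; range 0 to 300 (assumed formula)."""
    total = percent_weak + percent_moderate + percent_strong
    if total > 100:
        raise ValueError("stained percentages cannot exceed 100 in total")
    return 1 * percent_weak + 2 * percent_moderate + 3 * percent_strong


def is_high_expressor(score, cutoff=200):
    """True if the score exceeds the cutoff quoted in the text (>200)."""
    return score > cutoff


# Hypothetical tumor: 10% weak, 20% moderate, 60% strong membrane staining.
score = h_score(10, 20, 60)
print(score, is_high_expressor(score))  # 230 True
```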

Measurement of Gene Expression for Prognosis and Prediction of Treatment Response

The analysis of gene expression in cancer was accelerated greatly during the 1990s with the introduction of microarray technology that allowed simultaneous measurement of expression of large numbers of genes.¹⁷⁸ The underlying technology of microarrays involves the binding of gene probes to specific addresses on a solid matrix. A fluorescence-labeled complementary DNA library is then hybridized to the solid matrix, and the level of expression at each address is measured by fluorescence microscopic scanning of the array. Many variations on this basic concept have resulted in high-density arrays that can be used to evaluate expression levels of virtually all genes in a single tissue sample. Large-scale gene expression profiling has been used to cluster tumors according to similarities in gene expression patterns and to identify new subsets of lung and breast tumors^174,179–182 with prognostic implications. However, this method creates unwieldy amounts of data, and to produce a clinically relevant result, it has been necessary to filter expression data to identify profiles that can be used for clinical decision making.

This refinement has been most thoroughly applied in breast cancer, where dominant competing platforms—Mammaprint (Agendia, Amsterdam, The Netherlands) and Oncotype DX (Genomic Health, Redwood City, Calif.)—have been created primarily for predicting likelihood of benefit from chemotherapy in early-stage breast cancer (reviewed in reference 183).¹⁸³ To create these platforms, RNA markers are tested in tissue samples and the most powerful and efficient combinations of markers are assembled into a panel to provide an estimate of the likelihood of benefit from therapy. Available platforms have varying tissue requirements, with some platforms needing unfixed frozen tissue and others effective with FFPE tissues (see the Best Practices section). Preanalytical processing is thus a major consideration in the selection of a microarray platform. The use of this kind of gene expression testing is growing, but the role these tests will ultimately play in the dawning era of high-throughput molecular profiling and specifically targeted treatment remains to be determined.

DNA Sequencing as a Biomarker Predicting Response to Treatment

The large number of mutations found in tumors of all types during the past few decades has prompted comprehensive sequencing and gene expression analysis of virtually all solid tumor types through The Cancer Genome Atlas (TCGA).¹⁸⁴ A number of genetic aberrations discovered by the TCGA and earlier efforts have been exploited for treatment of patients with cancer, with the primary focus of interest being the tyrosine kinase signaling pathway. TKRs (Fig. 17-8) are transmembrane immunoglobulin-like molecules¹⁸⁵ containing an extracellular ligand-binding region, a transmembrane region, and an intracellular tyrosine kinase region that contains tyrosine residues whose phosphorylation regulates signal transduction. Specific mutations in tyrosine kinase domains enhance phosphorylation and stimulate downstream intracellular signaling. The physiological result is accelerated cell proliferation, extended cell survival, and increased angiogenesis and cellular migration, properties that are hallmarks of carcinogenesis.¹

Figure 17-8 Tyrosine kinase receptors of many types including KIT, EGFR, HER2/neu, HGFR (MET), IGFR1, ROS1, RET, and PDGFRA have similar molecular structures comprising extracellular, transmembrane, and tyrosine kinase domains. Physiological activation of receptor by binding of ligand induces conformational changes in the intracellular tyrosine kinase domain that permit binding of adenosine triphosphate (ATP) and phosphorylation of signaling proteins that ultimately may lead to cell proliferation, increased motility, and cell survival. Activating mutations (*zipper line*) affect the ATP binding pocket of the tyrosine kinase domain, causing phosphorylation of signaling proteins in the absence of ligand binding. Receptor mutations may ultimately lead to uncontrolled cell proliferation and tumor formation. Treatment of mutated tumors with small-molecule tyrosine kinase inhibitors (TKIs) blocks ATP binding and phosphorylation of signaling proteins. Resistance to TKIs may be caused by an additional mutation (*double zipper line*) affecting the tyrosine kinase domain. Cells carrying the resistance mutations may exist in low numbers in untreated tumors but selectively proliferate in the presence of inhibitor.

A number of small-molecule tyrosine kinase inhibitors (TKIs; Table 17-3) specific for these receptors have been designed to block phosphorylation and suppress tumor growth and spread. Also, the extracellular domains of these molecules have proven to be antigenic, and therefore the extracellular ligand binding domain is also a target for therapeutic antibody development. Several of these drugs have successfully transferred to the clinic, and the number of such drugs available for cancer treatment is likely to continue to expand, creating an ongoing demand for more comprehensive molecular testing of malignancies of virtually all types.

Table 17-3

Treatable Molecular Lesions in Solid Tumors

Tumor(s)	Gene(s)/Protein	Molecular Lesion(s)	Drug
Breast	ER	Overexpression	Fulvestrant/tamoxifen
Breast	ER/PGR	Overexpression	Exemestane/letrozole
Breast/stomach	HER2/NEU	Amplification/overexpression	Everolimus/lapatinib/pertuzumab/trastuzumab
Colon	KRAS	Absence of mutation	Cetuximab/panitumumab
Colon	KRAS	Activating mutations	MEK inhibitors (AZD6244, others)
Colon/liver/melanoma/NSCLC	CTNNB1	Activating mutation (wnt pathway)	Porcupine inhibitors (LGK974, others)
Colon/NSCLC	NRAS	Activating mutations	MEK inhibitors (AZD6244, others)
Colon/NSCLC	PIK3CA	Activating mutations	PIK3CA inhibitor (BKM120, others)
Colon/NSCLC	EGFR	Overexpression/increased gene copy number	Cetuximab/panitumumab
GIST	C-KIT/PDGFRα	Activating mutations	Imatinib/midostaurin (PKC412)
Lymphoma	CD20 antigen	Express/overexpression	Rituximab/tositumomab/ofatumumab/ibritumomab-tiuxetan (radiolabeled)
Lymphoma	CD25 antigen	Express/overexpression	Denileukin-diftitox (antibody-toxin conjugate)
Lymphoma	CD30 antigen	Express/overexpression	Brentuximab-vedotin (antibody-toxin conjugate)
Melanoma	KIT	Activating mutations	Imatinib
Melanoma/colon/lung	BRAF	Activating mutations	Vemurafenib, dabrafenib
NSCLC	ALK	Gene rearrangement (multiple fusion partners)	Crizotinib
NSCLC	EGFR	Activating mutation (small deletion/point mutation)	Erlotinib/gefitinib
NSCLC	RET	Translocation	Vandetanib/cabozantinib/sunitinib
NSCLC	ROS	Gene rearrangement (multiple fusion partners)	Crizotinib/sunitinib

GIST, Gastrointestinal stromal tumor; NSCLC, non–small cell lung cancer/carcinoma.

One of the earliest successes of this mutation-targeted approach to cancer treatment was the discovery of mutations in the KIT gene in gastrointestinal stromal tumor (GIST), a rare abdominal malignancy, about which there was little information on cell of origin or pathogenesis.

Gastrointestinal Stromal Tumors

GISTs were, until recently, a rare and ill-defined neoplasm of somewhat mysterious origin. In the early 1990s these tumors were defined as spindle cell malignancies that exhibited no evidence of cell of origin.¹⁸⁶ The status of these tumors began to change as it was recognized in the mid 1990s that GIST tumors express CD34 but not desmin¹⁸⁷ and thus displayed a distinct immunophenotype. A watershed discovery was reported by Hirota et al.¹⁸⁸ in 1998. These investigators described nearly uniform expression of KIT (CD117) in GIST tumors and suggested that they may arise from the immunophenotypically similar paraganglionic interstitial cells of Cajal. Most importantly, they identified deletions and point mutations in the juxtamembrane portion of the KIT TKR gene. These mutations were shown to activate KIT signaling, and KIT was thus a potential therapeutic target. KIT is a member of the TKR signaling family and is encoded at chromosome 4q11-q12. In GIST tumors, mutations in the KIT gene (reviewed in references 189, 190, and 191)^189–191 are located predominantly in the juxtamembrane domain (exon 11), with two thirds of tumors harboring mutations in this region of the gene. Exon 11 mutations allow adenosine triphosphate to phosphorylate receptor in the absence of ligand (stem cell factor). Phosphorylation of the receptor triggers downstream signaling of RAF, MEK, MAPK, PIK3CA, and STAT3, transforming mutated cells to malignant status. Approximately 10% of mutations are found in exon 9, which encodes the extracellular domain of the molecule. These mutations induce a change in receptor conformation mimicking that seen in the presence of ligand and resulting in receptor phosphorylation. Exon 9 mutations are commonly seen in large and small intestinal tumors but rarely in gastric neoplasms. Finally, mutations in exons 13 (tyrosine kinase domain) and 17 (activation loop) are occasionally found in GIST tumors. The effects of exon 13 and 17 mutations on the receptor are less well established, but the net result of all these mutations is receptor activation and oncogenic transformation of mutant cells. KIT mutations occur in approximately three fourths of GIST tumors.^189–191

Tumors that do not contain a KIT mutation may contain a mutation in the closely homologous tyrosine kinase receptor gene, platelet-derived growth factor receptor (PDGFR), alpha polypeptide (PDGFRA).^192,193 PDGFR protein has two isoforms, α and β, and it is the α form that is mutated. Approximately 10% of GIST tumors contain mutations in PDGFRA.^189–191 The mutated loci in PDGFRA are distributed similarly to KIT, but frequencies differ. The most frequent mutation site (exon 18) encodes a portion of the tyrosine kinase domain (TK2) and occurs in 6% of all GISTs. Juxtamembrane mutations in PDGFRA are relatively infrequent, occurring in <1% of GISTs.

Introduction of TKIs that block signaling by mutant receptor has had a profound effect on the outlook for this tumor. Prior to 2000, the outlook for patients with an advanced GIST was bleak. Most patients experienced a recurrence after surgical resection, and responses to chemotherapy were poor. Only 10% of patients were disease free after extended follow-up.^194,195 A dramatic reversal of this situation occurred when it was realized that GIST is exquisitely sensitive to tyrosine kinase blockade. The drug successfully used to treat GIST was originally designed to block signaling by the BCR-ABL fusion protein in chronic myelogenous leukemia. Structural similarities between the intended target of the drug and KIT led to the discovery that the TKI now known as imatinib strongly inhibits mutated KIT.¹⁹⁶ Subsequent clinical trials demonstrated high response rates and prolonged progression-free survival (PFS),^197–199 leading to FDA approval of the drug as discussed elsewhere in this volume. KIT testing is now used routinely to identify persons likely to respond to the drug.

PDGFRA mutations are not equivalent to those in KIT. More than half of PDGFRA mutations encode a D842V amino acid substitution that is resistant to imatinib, whereas other mutations in PDGFRA are sensitive.²⁰⁰ Alternative drugs that show activity against the D842V resistance mutation such as midostaurin (PKC412)²⁰¹ are under investigation.

Colon Carcinoma

KRAS as a Prognostic Marker in CRC

KRAS is a member of a superfamily of guanosine-5′-triphosphatase proteins that also includes NRAS and HRAS and a number of less well known proteins. The role of these proteins is to transduce stimuli from cell surface growth factor receptors. Upon activation, RAS undergoes prenylation (the addition of a 15-carbon chain) of a CAAX (C, cysteine; A, aliphatic amino acid; X, serine or methionine) motif by a farnesyl transferase. This process makes RAS more hydrophobic and adherent to the inner aspect of the cytoplasmic membrane, where it activates transducers in an intracellular signaling cascade involving numerous effector molecules, including PI3K and MAPK.

Consistent mutations in several components of the RAS signaling pathway occur in human solid tumors. Mutations in KRAS are particularly common. They permit stimulus-independent activation of intracellular signaling. Hydrolysis of guanosine triphosphate bound to RAS must occur for RAS to return to an inactive state. Typical mutations in codon 12 that normally encodes glycine result in substitution with an amino acid that sterically interferes with guanosine triphosphate hydrolysis, which perpetuates independent intracellular signaling and permanently stimulates proliferation and evasion of apoptosis. Because activation of the KRAS gene is stimulus independent, targeting of the upstream TKRs has proven to be ineffective in countering the proliferative and invasive properties of tumors that are driven by KRAS mutation.

KRAS mutation occurs in approximately 40% of colon carcinomas. The independent effect of KRAS mutation on outcome in colon cancer is still unclear and remains controversial.²⁰² In a series of 3,439 patients with CRC, it was found that of the 12 possible mutations on codons 12 and 13, only the glycine to valine mutation on codon 12 (8.6%) had a statistically significant impact on outcome.²⁰³ Smaller series have shown similar results, but retrospective data from other large randomized studies have failed to consistently demonstrate a meaningful effect of KRAS mutation on outcome in CRC. In a study of a large number of untreated patients, there was no difference in PFS (7.3 weeks) between mutant and nonmutant tumors.²⁰⁴

KRAS as a Predictor of Treatment Response in CRC

In 2007, a study published in the New England Journal of Medicine²⁰⁵ reported that administration of the monoclonal antibody cetuximab to patients with advanced CRC expressing immunohistochemically detectable EGFR resulted in improved survival compared with untreated control subjects. Patients in both the treatment and control arms had previously been treated with a fluoropyrimidine, irinotecan, and oxaliplatin or had contraindications to treatment with these drugs. The difference in survival, although not large, suggested that this specific treatment could be useful in managing CRC and prompted a closer look at responders versus nonresponders. Several subsequent studies indicated that KRAS mutation status dictates response to anti-EGFR monoclonal antibody treatment in persons with CRC. In the randomized studies summarized in Table 17-4,^{204,206–208} wild-type tumors treated with anti-EGFR antibody (cetuximab or panitumumab) have a PFS advantage of several months and a significantly reduced hazard ratio in comparison with mutated tumors in first-line and third-line settings.

Table 17-4

KRAS Mutation as a Predictive Biomarker in Clinical Trials of Metastatic Colorectal Carcinoma

Study	Treatment	No. Patients	KRAS Mutant*	Hazard Ratio (Mutant)	KRAS Wild Type*	Hazard Ratio (Wild Type)	Reference
Amado	Panitumumab vs. BSC† (third line)	427	7.4 wk	0.99	12.3 wk	0.45	Amado et al., 2008²⁰⁴
Karapetis	Cetuximab vs. BSC† (third line)	572	1.8 mo	0.99	3.7 mo	0.40	Karapetis et al., 2008²⁰⁶
Van Cutsem (CRYSTAL)	FOLFIRI ± cetuximab (first line)	540	7.6 mo	1.07	9.9 mo	0.68	Van Cutsem et al., 2009²⁰⁷
Bokemeyer (OPUS)	FOLFOX ± cetuximab (first line)	233	5.5 mo	1.83	7.7 mo	0.57	Bokemeyer et al., 2009²⁰⁸

^*Progression-free survival.

^†Best supportive care.

It has recently been suggested that not all mutations are equally predictive of poor response to monoclonal anti-EGFR therapy. Patients with codon 13 mutation (p.G13D) appear to respond to such treatment with a significantly longer overall survival and reduced hazard ratio in comparison with patients with other KRAS mutations.²⁰⁹

BRAF Mutation and Prognosis

A second gene of clinical importance that is mutated in colon carcinoma is BRAF. BRAF mutations were first identified in colon cancer through a survey of BRAF mutations in all human tumors.²¹⁰ The reported frequency of BRAF mutation in clinical samples ranges from approximately 5% to 15%.^211–214 BRAF mutation is more frequent in tumors on the right side of the colon.^217–217 MSI-H occurs in approximately half of BRAF-mutated tumors.^214,217,218 BRAF mutation is associated with aggressive tumor behavior and shortened survival, but only in tumors that are MSI stable^{214,217,219,220}; mutation in MSI-H tumors carries no survival disadvantage. MSI in BRAF-positive tumors is attributed to methylation of MMR genes rather than mutation and appears to be unrelated to HNPCC and germline mutation of MMR genes.^221,222 BRAF mutation in colon tumors confers resistance to anti-EGFR monoclonal antibody inhibitors,^{213,223–225} and BRAF-mutated tumors respond poorly to BRAF inhibitors. Resistance to BRAF inhibitors is thought to be due to activation of EGFR in response to BRAF inhibition.²²⁶ It has been suggested that BRAF-positive tumors may therefore respond to dual inhibition of both EGFR and BRAF.

Other Mutations

Additional mutations of predictive importance occur in NRAS and PIK3CA, at frequencies of <5%^211,213,227 and 10% to 15%,^213,225,227 respectively, in colon carcinoma. PIK3CA mutations overlap those found in other genes, but NRAS mutations are nonoverlapping with mutations in other genes, including KRAS and BRAF. Mutations in either PIK3CA or NRAS are associated with poor response to EGFR monoclonal antibody therapy.^213,225 Finally, initial results of comprehensive molecular analysis by the TCGA have been reported and identify large numbers of mutations that are likely to become targets of specific molecular inhibitors.²²⁷ Comprehensive molecular testing is expected to contribute greatly to colon carcinoma management in the near future.

Lung Cancer—A Heterogeneous Tumor

For many years, molecular characterization of lung cancer was impeded by the extreme heterogeneity of this disease as reflected in the most recent revision of the anatomic classification of lung carcinoma.²²⁸ Since the 1920s, lung tumors have been divided into small cell (“oat cell”) lung carcinoma (SCLC) and non–small cell lung carcinoma (NSCLC) that differ not only morphologically but express different classes of cell surface molecules, possess different mutational profiles, and exhibit different biological behavior. Small cell carcinomas express a number of markers that are also found on neural cells and in current classifications are categorized as one of several different types of “neuroendocrine” carcinomas. The vast majority of NSCLC does not express neuroendocrine markers but instead expresses large quantities of Erb-B cell surface TKR. SCLC is highly genetically unstable and appears to be driven predominantly by seven transmembrane receptors. In NSCLC, the level of genetic instability is more variable, and the TKRs are major tumor cell drivers. Finally, SCLCs are uniformly aggressive and are particularly sensitive to mitotic inhibitors.

More recently, a high degree of heterogeneity has been observed within the category of NSCLC. The two main histologic categories are adenocarcinoma, a tumor characterized by the formation of acinar or papillary structures, and squamous carcinoma, which is characterized by the formation of keratin pearls and intercellular bridges. These tumor types are also distinguished by variation in expression of specific keratin subgroups and other lineage specific markers, most notably TTF1, which is expressed by adenocarcinoma but not squamous carcinoma. Finally, within histologic categories, there is molecular variation that is reflected by differences in driver protein expression and activation that can be therapeutically exploited. Most of these differences involve activation of TKRs and their signal transducers. The first of these receptors to be defined as a therapeutic target was EGFR.

EGFR

The original isolation of EGF from mouse salivary gland in 1962,²²⁹ the demonstration of its binding to cell membrane receptor,²³⁰ and sequencing of the receptor²³¹ established the role of intercellular signaling in epithelial cell growth and maintenance. By the 1990s, overexpression of EGFR had been demonstrated in squamous carcinoma. Monoclonal antibodies to EGFR that were effective in localizing EGFR in situ became available in the late 1990s and documented the broad expression of EGFR in adenocarcinoma.¹⁷⁵ EGFR expression in NSCLC was found to be closely linked to gene copy number.¹⁷⁶

Further studies of EGFR were driven by the discovery of a subset of patients, mostly nonsmokers with adenocarcinoma, who responded dramatically to EGFR TKIs (Fig. 17-9). Simultaneously, two groups reported that mutations in the EGFR tyrosine kinase domain were present in a subset of adenocarcinomas and that their presence correlated with sensitivity to TKI.^232,233 The mutations most commonly consisted of either a single base substitution in exon 21 (c.2573T>G, p.L858R) or an in-frame deletion within exon 19 ranging from 9 to 18 base pairs in the region coding for the adenosine triphosphate binding pocket of the tyrosine kinase domain. These mutations resulted in strong constitutive activation of the encoded EGFR protein.

Figure 17-9 A, Schematic view of epidermal growth factor receptor and its key domains, including the extracellular ligand-binding domain, transmembrane domain, tyrosine kinase domain, and autophosphorylation domain. The tyrosine kinase domain (exons 18 through 24) is expanded, showing the sites of described point mutations at G719, S752, R776, H835, L858, and L861, and in-frame deletions at L747-A750. B, Three-dimensional ribbon structure of the intracellular domain: the positions of the three most common mutations are shown in relation to the activation loop.

Activating mutations of EGFR result in tumor susceptibility to TKI therapy.²³⁴ However, responses to therapy are temporary, and hopes of a long-lasting therapy are generally unrealized because of resistance to targeted therapy that inevitably arises in treated patients. Two common mechanisms of resistance to EGFR TKI are MET amplification^237–237 and a point mutation in exon 20 of EGFR (c.2369C>T, p.T790M),²³⁸ the effect of which is depicted in Figure 17-8. As a result of the point mutation in EGFR exon 20, a bulky methionine side chain is thought to sterically hinder binding of TKIs. Tumor cells harboring this mutation may exist in low numbers in untreated tumors and emerge through selective pressure exerted by targeted therapy. In addition, insertions in exon 20 of EGFR are associated with primary resistance to TKI therapy,²³⁹ which also is thought to be due to steric hindrance of TKIs.

KRAS

KRAS mutations are more common in lung cancer than are EGFR mutations and, unlike EGFR mutations, they occur predominantly in smokers.^240,241 Co-existence of KRAS and EGFR mutation is uncommon but has been reported.²⁴² Because of its position “downstream” of EGFR signaling, the hypothesis that an activating mutation in KRAS would render a tumor nonsusceptible to TKI therapy in lung tumors has been extensively investigated with conflicting results.²⁴³

ALK and Other Large-Scale Rearrangements

Point mutations and small deletions are not the only molecular mechanisms driving lung cancer. Overexpression of oncogenes may also be driven by gene activation through chromosomal rearrangements. Rearrangements that fuse gene promoters to the tyrosine kinase domain of growth factors, growth factor receptors, or transcription factors may result in inappropriate signaling from the fusion products. Examples of activation by gene fusion in lung cancer involve ALK, ROS1, and RET. In 2007, Soda et al.²⁴⁴ discovered a transcript derived from a patient with NSCLC that could transform 3T3 fibroblasts. Further investigation demonstrated the transcript to be a fusion of a gene called echinoderm microtubule-associated protein-like 4 (EML4) with the anaplastic lymphoma kinase (ALK) gene (Fig. 17-10), which is also rearranged in large cell lymphomas. This rearrangement has been detected in 4% to 7% of pulmonary adenocarcinomas, with an increased proportion seen when cohorts are selected based on clinicopathological features including lack of smoking history.^245–248 The driver of tumorigenesis in this rearrangement is thought to be ALK, which is not typically expressed in adult lung tissue. Juxtaposition of a 5′ partner with a promoter results in high-level expression of the kinase domain of ALK and leads to inappropriate signaling by this molecule.^244,249

Figure 17-10 The EML4-ALK fusion is generated by a paracentric inversion in the short arm of chromosome 2. The break-apart fluorescence in situ hybridization probe includes DNA sequences centromeric to the breakpoint (5′ ALK) labeled in SpectrumGreen and sequences telomeric to the breakpoint (3′ ALK) labeled in SpectrumOrange (A). Normal cells carry only fused 3′ ALK-5′ ALK signals, indicated by yellow arrows (B). Cells with rearrangement in the ALK gene exhibit split 3′ (*orange*) and 5′ (*green*) signals (C) or additional copies of single 3′ (*orange*) signals (D). Red arrows indicate 3′ ALK signals, and green arrows indicate 5′ ALK signals.

Less than 5 years after its initial discovery, more than 10,000 lung cancers have been tested for the presence of this rearrangement. More than 13 molecular variants of EML4–ALK fusion have been detected,^246,250 and three other rare activating gene partners have been identified, including TRK-fused gene (TFG),²⁵¹ KIF5B,²⁵² and KLC1.²⁵³ A novel targeted therapy agent, crizotinib, has been found to be effective against tumors driven by this molecular marker²⁵⁴ and received approval for treatment of these specific tumors by the U.S. Food and Drug Administration. Numerous other ALK inhibitors are in initial clinical trials.

Summary

These three examples, EGFR, KRAS, and ALK, underscore the diversity of molecular alterations in NSCLC, and with ongoing extensive efforts, “driver mutations” have been identified in a significant proportion of lung cancers. Molecular testing can provide guidance for prioritizing therapies to patients who are likely to receive the greatest benefit and, conversely, can identify patients proven not to benefit from a targeted therapy. Finally, molecular testing can provide clinicians with a rationale to guide certain patients toward clinical trials of targeted therapeutics. These selected examples of targets are by no means the only ones of interest in NSCLC, in which a growing number of molecularly targeted agents are under investigation in association with specific molecular alterations. These alterations include mutations in BRAF, HER2, PIK3CA, and rearrangements in RET, among others.²⁵⁵

Melanoma

Several targetable mutations have been found in melanoma in recent years involving the BRAF, NRAS, and KIT genes (see Table 17-3). Of these, the most common is BRAF.²⁵⁶

BRAF

BRAF protein is a serine-threonine kinase that is encoded on chromosome 7q34. It is an important signal transduction molecule that mediates signals from RAS to MEK, ultimately promoting cell proliferation and mobility. Activating missense mutations in BRAF were first identified in a screening of more than 500 cell lines from multiple organ sites.²¹⁰ Mutations were found in 60% of melanoma lines and tumors and at lower frequencies in carcinomas of the colon (15%), lung (3%), breast (3%), and ovary (4%), as well as in glioma (11%). BRAF mutation is now thought to occur in approximately half of primary and metastatic melanoma clinical samples,^257–260 with some variation according to melanoma subtype. Activating BRAF mutations largely occur in exon 15, which encodes the catalytic domain of BRAF. By far the most frequent single base change (c.1799T>A) results in an amino acid change, V600E.^210,260 This mutation activates continuous intracellular signaling, resulting in cell proliferation, increased tumor cell survival, and enhanced mobility. Two compounds, vemurafenib and dabrafenib, selectively block the kinase activity of mutated BRAF. In clinical trials, both compounds have produced high response rates and improved PFS or overall survival in tumors with BRAF mutations.^261–264 These studies reinforce the need for accurate molecular characterization of melanoma to allow for selection of effective treatment.
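The relationship between the nucleotide notation (c.1799T>A) and the protein notation (V600E) follows from simple codon arithmetic, which can be sketched in a few lines of Python. This is a minimal illustration rather than a clinical annotation tool: the function name `protein_change` is hypothetical, and the wild-type codons passed in (GTG for BRAF codon 600 and ACG for EGFR codon 790, the site of the c.2369C>T change described earlier for lung cancer) are assumptions taken from the reference coding sequences, not from this chapter.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def protein_change(cdna_pos, ref_base, alt_base, ref_codon):
    """Convert a coding-sequence substitution (c.<pos><ref>><alt>) to p. notation.

    ref_codon is the wild-type codon that contains cdna_pos. It is supplied by
    the caller (assumed here to come from the reference coding sequence).
    """
    codon_number = (cdna_pos - 1) // 3 + 1  # c.1-3 is codon 1, c.4-6 is codon 2, ...
    offset = (cdna_pos - 1) % 3             # position within the codon: 0, 1, or 2
    if ref_codon[offset] != ref_base:
        raise ValueError("reference base does not match the supplied codon")
    alt_codon = ref_codon[:offset] + alt_base + ref_codon[offset + 1:]
    return f"p.{CODON_TABLE[ref_codon]}{codon_number}{CODON_TABLE[alt_codon]}"


print(protein_change(1799, "T", "A", "GTG"))  # BRAF c.1799T>A -> p.V600E
print(protein_change(2369, "C", "T", "ACG"))  # EGFR c.2369C>T -> p.T790M
```

In both examples the altered nucleotide is the middle base of its codon, which is why a single base change is enough to replace the encoded amino acid.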

KIT

A second mutation target in melanoma is KIT, reviewed in references 256 and 265 (previously discussed in regard to GIST). KIT mutations differentially affect subsets of melanoma. Mutation frequencies in sun-damaged cutaneous, acral, and mucosal melanomas range from 2% to 20%, but KIT mutations are absent from cutaneous tumors arising in non–sun-damaged skin.^256,266 In addition, a different spectrum of mutations affects melanoma compared with GIST.^268–268 Whereas KIT mutations in GIST are often short deletions or insertions in exon 11, the vast majority of mutations in melanoma reported to date are point mutations. Mutations in exon 9, which constitute more than 10% of GIST mutations, have not been found in melanoma. Mutations in exon 13, which encodes the kinase domain of KIT, are more common in melanoma than in GIST. Finally, KIT gene amplification is frequent in melanoma^268–268 but is not described in GIST.¹⁸⁹

Therapeutic trials testing the effectiveness of KIT inhibitors on KIT-mutated tumors have had limited success. In a recent phase 2 trial enrolling 51 patients with metastatic melanoma who had genetic alterations of KIT, durable complete responses to imatinib have been reported in two patients with mutations in exon 11 and partial durable responses have been reported in four others for an overall response rate of 16%.²⁶⁸ Another study has reported similar findings (23% overall partial response rate) in tumors with exon 11 and 13 mutations.²⁶⁹ Imatinib blockade of KIT may be a life-extending alternative for patients with BRAF wild-type/KIT-mutated metastatic tumors of specific subtype. This new clinically relevant development will add urgency to the implementation of multiplex testing at many institutions that are equipped to perform molecular testing.

NRAS

NRAS, which is encoded on chromosome 1, is a member of the RAS family of signaling molecules and was originally identified as an oncogene in neuroblastoma cells.^270,271 Mutation in melanoma was reported shortly after discovery of the gene.²⁷² NRAS is altered in approximately 20% to 30% of melanomas of all types^256,260 except for uveal melanoma. Mutations are restricted mainly to exon 3, codon 61 (>80%), with a smaller proportion occurring at exon 2, codon 13.²⁶⁰ These mutations are thought to lock NRAS in its activated state. However, NRAS mutation by itself may not be sufficient for malignant transformation because both sporadic and congenital nevi of several types have been shown to harbor NRAS mutation without evidence of malignant transformation.^273,274 To date, no drugs are known to directly and effectively target NRAS signaling. However, downstream signal transduction has been the subject of ongoing preclinical investigation.^256,265

GNAQ/GNA11/BAP1

Mutations in GNAQ, GNA11, and BAP1 are confined largely to uveal melanomas, blue nevi, and meningeal melanoma. The GNAQ and GNA11 genes encode membrane-bound guanosine-5′-triphosphatases that regulate signaling through seven transmembrane receptors. Activating mutations in either GNAQ or GNA11 are reported in more than 80% of uveal melanomas and blue nevi^275,276 at the codon Q209 and R183 hot spots. BRCA1-associated protein 1 (BAP1) is encoded on chromosome 3p21.1 and is mutated in more than 80% of metastatic uveal melanomas.²⁷⁷ BAP1 mutation has also recently been described in a familial syndrome with development of clinically and histologically characteristic tumors with some features of melanoma. This mutation also occurs in sporadic tumors with similar features.²⁷⁸ BAP1 mutations cause protein truncation and therefore inactivation of the protein. For this reason, BAP1 is thought to represent a tumor suppressor gene. BAP1 mutations are of limited prognostic value and are not yet therapeutic targets but illustrate the highly specific relationships that may exist between morphology and molecular pathology.

Current Technologies and Best Practices for Solid Tumors

Best practices for clinical laboratory testing are regulated under the Clinical Laboratory Improvement Amendments passed by Congress in 1988 and facilitated by the College of American Pathologists through its rigorous laboratory accreditation program. Although other inspecting agencies exist, the College of American Pathologists is the only one of its kind that utilizes practicing laboratory professionals as inspectors and is essential for the delivery of standardized, uniformly high-quality testing services. The aim of this section is to provide an understanding of components of tumor testing services that are essential for accurate and efficient laboratory operations.

Biopsies

Improved imaging and tissue sampling techniques in recent years, exemplified by advances in lung sampling methods,^279,280 have resulted in increasing reliance on small biopsies for diagnosis and monitoring of treatment. The primary objective of these tissue acquisition methods has been to obtain tissue for unequivocal diagnosis, which can require multiple immunohistochemical stains in addition to conventionally stained sections. Meeting this objective can leave little remnant tissue behind for ancillary studies. When tumor is of an advanced stage and surgery is not appropriate, remnant tissue from these diagnostic biopsies may be the only tissue available for molecular testing. Because morbidity from biopsy procedures has declined and the value of information derived from molecular testing has become more critical, performing biopsies solely for molecular analysis has gained increasing acceptance by clinicians and patients alike. The major limitation of biopsy specimens for molecular testing is small tumor cell yield. Also, testing of DNA extracted from paraffin sections must take into account shearing of DNA caused by sectioning. Despite these challenges, isolation of tumor DNA from small specimens of many types including tissue cores, cell block preparations, smears, and imprinted slides has been well documented.^281–284 Individual laboratories must validate testing protocols for each specimen type.

Enriching for Tumor Cells

Because nonmalignant cells invariably accompany (“contaminate”) all clinical samples, specimens must be enriched for tumor cells to a level that is consistent with the analytical sensitivity of the platform that will be used for testing. As a rule of thumb, a minimum of 100 viable tumor cells in a 10-µm section is required for a single PCR-based assay. The quality of the biopsy specimen may be more important than quantity for critical molecular studies.^285,286
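The requirement that tumor content match the analytical sensitivity of the platform can be made concrete with a small calculation. The sketch below is illustrative only: it assumes that tumor and nonmalignant cells are both diploid and that each tumor cell carries one mutant allele, and the function names and the limit-of-detection values in the usage lines are hypothetical. Each laboratory would substitute figures from its own validation.

```python
def expected_mutant_allele_fraction(tumor_cell_fraction, mutant_copies=1, ploidy=2):
    """Expected fraction of mutant alleles in DNA extracted from a mixed specimen.

    Assumes tumor and nonmalignant cells share the same ploidy and that every
    tumor cell carries `mutant_copies` mutant alleles (both are simplifications).
    """
    if not 0.0 <= tumor_cell_fraction <= 1.0:
        raise ValueError("tumor_cell_fraction must be between 0 and 1")
    return tumor_cell_fraction * mutant_copies / ploidy


def needs_enrichment(tumor_cell_fraction, assay_limit_of_detection):
    """True when the expected mutant allele fraction is below the assay's limit."""
    expected = expected_mutant_allele_fraction(tumor_cell_fraction)
    return expected < assay_limit_of_detection


# A specimen with 30% tumor cells, checked against two hypothetical assay limits.
print(needs_enrichment(0.30, 0.20))  # True: 15% mutant alleles is below a 20% limit
print(needs_enrichment(0.30, 0.05))  # False: 15% is above a 5% limit
```

Under these assumptions a heterozygous mutation contributes only half as many mutant alleles as there are tumor cells, so a specimen that looks adequate by cell count can still fall below the detection limit of a less sensitive platform.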

Although tumor enrichment methods may vary from laboratory to laboratory, it is essential that tumor enrichment methodology be appropriately matched to the testing platform. At least five different procedures may be used for tumor cell enrichment depending on the amount of contaminating normal tissue that is present in the specimen: (1) en face sectioning of a trimmed block; (2) coring of paraffin blocks in regions enriched for tumor cells; (3) macrodissection; (4) manual microdissection; and (5) laser capture microdissection. Macrodissection can refer to the practice of scraping tumor cells from a glass slide, which typically is done without direct microscopic visualization. Manual microdissection, by contrast, involves the use of some form of microscopic assistance to identify tumor cells. Manual microdissection is typically performed using a dissecting microscope and usually involves counterstaining of slides to identify regions of interest, which can be removed from the slide with a scalpel, needle, or glass micropipette. Laser capture microdissection involves melting of supporting matrix using a laser beam created with an epifluorescence lens and removal of the target cells for extraction and quantitative testing of DNA and other analytes. Although it is more precise than manual microdissection, laser capture microdissection is slower, more costly, and generates lower yields of extraction products.

Choosing Methods of Analysis

A large number of testing methods are now available for molecular analysis, and choosing an appropriate platform can be challenging. For mutation determinations, several testing platforms are available. These platforms include Sanger sequencing, which remains the gold standard for mutation detection; high-resolution melting curve analysis; allele-specific PCR; the amplification refractory mutation system (ARMS), often coupled with a bifunctional self-probing primer (Scorpions); pyrosequencing; and allele-specific tests that may involve placement of a lock on mutant DNA and digestion of wild type. In addition, multiplex tests can detect multiple mutations, and Next Generation (“NextGen”) sequencing permits mutational testing for expanded panels, or potentially, the entire genome or exome. Each of these platforms must be independently evaluated and validated in the specific setting of the testing laboratory. Among the parameters that must be assessed are (1) amount and quality of starting material (DNA), (2) assay sensitivity, (3) specificity, (4) coverage of genome, (5) turnaround time, (6) simplicity, (7) start-up costs, (8) cost per test, and (9) clinical validation. The approach to laboratory validation of any individual method is highly complex and requires appropriate evaluation and documentation of a host of performance characteristics, including accuracy, reproducibility, robustness, interfering substances, analytic sensitivity, analytic specificity, and limit of detection.²⁸⁷ Several methods currently used for the identification of mutations in clinical samples are briefly discussed in the following sections.

Sanger Sequencing

Sanger sequencing has been the gold standard for many years and, like many other assays, is based on the dideoxy-chemistry illustrated in Figure 17-11. The sequencing reaction is read as a color code that distinguishes oligonucleotides of variable length with four specific color labels. A ladder of nucleotides is created that can be identified by their electrophoretic mobility and color of terminal fluorescent nucleotide. The identification of abnormal (mutant) peaks in chromatograms of this ladder can be facilitated by computer programs that not only create the chromatogram but compare them with reference sequences (RefSeq) and identify abnormalities that represent mutations and polymorphisms.

Figure 17-11 Controlled blockage of DNA synthesis is the basis for sequencing reactions of many types. Shown here, elongation of the nucleotide chain is blocked by the presence of a hydrogen at the 3′ position in a dideoxynucleoside (ddN) instead of a hydroxyl group (*circle*) shown on the incoming unmodified nucleotide. The terminal ddN is labeled with a fluorescent tag (+) and the reaction mixture of labeled, truncated nucleotide chains is separated, usually by capillary gel electrophoresis, forming a color-coded sequencing ladder that is detected fluorometrically. This elongation termination reaction has been adapted in single nucleotide extension protocols for multiplex single-nucleotide polymorphism analyses as applied in SNaPshot and Sequenom platforms. The tracing on the right shows a representative sequencing chromatogram for BRAF with a mutation peak at nucleotide 1799 (c.1799T>A, p.V600E).

Sanger sequencing is readily accomplished in DNA extracted from FFPE tissue that has been amplified by PCR provided that the amplified sequence is short (<300 base pairs). This method has the advantage that it provides unbiased sequence results that will detect virtually any mutation in the targeted region. However, the analytic sensitivity is limited, with tumor cell concentration of approximately 50% required for accurate results. Because of the PCR amplification, the amount of DNA starting material required is usually small, depending on the number of DNA templates selected for amplification.

PCR-Based Mutation-Specific Assays

Mutation-specific techniques depend largely on the observation that mismatch of a 3′ terminal nucleotide in a PCR primer results in failure of PCR amplification, so that primers can be created that are highly selective for a single nucleotide. This is the basis for ARMS, which can be used to detect mutations in KRAS codons 12 and 13. In ARMS, mismatch at the 3′ end of the primer results in minimal amplification of the normal base but successful amplification of the sequence containing the mutant base. Elongation of the amplified segment displaces a fluorophore from its quencher, resulting in a quantitative increase in signal that can be read on a variety of real-time PCR instruments. The number of PCR cycles needed to detect signal above background, referred to as the cycle threshold (Ct), is used to quantify fluorescence. Differences in Ct between exogenous control DNA and test DNA (ΔCt) reflect the quantity of mutant DNA in each sample. In one variation of this process, ARMS/Scorpions, primer and probe are combined in a single molecule.
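The ΔCt calculation described above can be sketched as follows. This is a hedged illustration, not the scoring rule of any commercial kit: ΔCt is taken here as the Ct of the mutation-specific reaction minus the Ct of the control reaction, the conversion to a relative quantity assumes perfect doubling of product in every cycle, and the cutoff in the usage lines is a hypothetical value that a laboratory would establish during validation.

```python
def delta_ct(ct_mutation, ct_control):
    """Delta Ct: mutation-specific reaction Ct minus control reaction Ct."""
    return ct_mutation - ct_control


def relative_mutant_quantity(d_ct, efficiency=2.0):
    """Approximate mutant signal relative to control.

    Assumes product is multiplied by `efficiency` in every cycle
    (2.0 corresponds to ideal doubling, which real assays only approximate).
    """
    return efficiency ** (-d_ct)


def is_mutation_detected(d_ct, cutoff):
    """A sample is called positive when its delta Ct is at or below the cutoff."""
    return d_ct <= cutoff


d = delta_ct(ct_mutation=30.0, ct_control=26.0)
print(d)                                    # 4.0
print(relative_mutant_quantity(d))          # 0.0625
print(is_mutation_detected(d, cutoff=6.5))  # True (6.5 is a hypothetical cutoff)
```

A smaller ΔCt means the mutation-specific signal appears soon after the control signal, which corresponds to a larger proportion of mutant template in the sample.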

Only a single mutation per amplification reaction is detected with PCR-based assays, resulting in the need for a considerable amount of DNA and increasing the cost of the overall analysis. In currently available kits for KRAS analysis, seven primer/probe sets are needed to test each target allele at codons 12 and 13. Thus, although only a small amount of DNA is needed for each reaction, the amount of DNA required for the entire analysis can escalate quickly, and an upper limit of clinical feasibility is quickly reached, especially with smaller specimens. The advantages of simplicity and speed of turnaround make this platform desirable for performing large numbers of tests for a small number of mutations.

High-Resolution Melting Analysis

High-resolution melting analysis (Fig. 17-12) measures differences in melting-point temperatures between matched and mismatched DNA duplexes. Mismatching causes a reduction in melting-point temperatures that can be quantified using software that is readily available in quantitative PCR instrumentation. The advantage of melting-point analysis is that it is sensitive and requires relatively small amounts of DNA (<5 ng). It also has excellent coverage, detecting any mutational abnormality that may be present in the amplified segment. However, any perturbation of DNA structure, including untargeted SNPs and adducts, may produce aberrant melting-point curves, and nonspecific background for this method is relatively high. The specificity of this reaction is therefore low without confirmation by a second technology, typically direct sequencing, which can negate the rapid turnaround and relative low cost of the assay. Many investigators regard this test as a screening assay that requires confirmation of abnormal results by a second assay.

Figure 17-12 An example of a melting-point difference plot. The top frame (A) shows normalized high-resolution melt curves. Polymerase chain reaction (PCR) products are labeled with a fluorescent dye, and fluorescent signal is plotted as the temperature increases. Strand melting results in a decrease in fluorescent signal. Below, in frame B, a difference plot displays the melting curve of each sample subtracted from the reference curve to visually accentuate the melt curve differences and aid in genotype grouping. Products of three mutant templates are shown in red and pink. Products of wild-type templates are shown in blue. The upper curve (*pink*) represents the codon 12 mutation Gly12Ser, whereas the red curves represent amplification products for a single mutation at codon 12 (Gly12Val) at two different concentrations. The latter two curves are similar in shape but differ in height because of variation in the quantity of PCR amplicon.
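The normalization and difference-plot steps described for Figure 17-12 can be sketched in a few lines. This is an illustrative model only: the sigmoid melt curve, the melting temperatures, and the transition width are assumptions chosen to mimic a mismatch-induced drop in melting point, not data from the figure or the behavior of any instrument software.

```python
import math


def melt_curve(temps, tm, width=0.5):
    """Idealized fluorescence (fraction of duplex remaining) at each temperature.

    A logistic curve centered on the melting temperature `tm`; the shape and
    `width` are modeling assumptions, not measured values.
    """
    return [1.0 / (1.0 + math.exp((t - tm) / width)) for t in temps]


def normalize(signal):
    """Rescale a melt curve so that it runs from 1 (fully duplex) to 0 (melted)."""
    lo, hi = min(signal), max(signal)
    return [(s - lo) / (hi - lo) for s in signal]


def difference_curve(reference, sample):
    """Subtract the sample curve from the reference curve, point by point."""
    return [r - s for r, s in zip(reference, sample)]


temps = [75.0 + 0.1 * i for i in range(151)]          # 75 to 90 degrees C
reference = normalize(melt_curve(temps, tm=83.0))      # wild-type reference
wild_type = normalize(melt_curve(temps, tm=83.0))      # matched duplex
mutant = normalize(melt_curve(temps, tm=82.2))         # mismatch lowers Tm

wt_diff = difference_curve(reference, wild_type)
mut_diff = difference_curve(reference, mutant)

print(max(abs(d) for d in wt_diff))   # flat line for the wild-type sample
print(max(mut_diff))                  # clear peak for the mutant sample
```

In this toy model the wild-type sample yields a flat difference curve, whereas the mutant produces a distinct peak near the melting temperature, which is the feature used for genotype grouping.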

Pyrosequencing

Pyrosequencing does not rely on truncation of the extension reaction but rather on the release of pyrophosphate during nucleotide chain elongation, which initiates a chemiluminescent response that can be detected with appropriate instrumentation. For this technology, DNA is immobilized, and the reagents needed for the addition of successive nucleotides are added to the template. Each nucleotide addition is registered by the release of a pyrophosphate, which induces a light signal in a reaction catalyzed by luciferase.

Pyrosequencing is relatively inexpensive and sensitive and is largely unbiased in that it detects all possible mutations in a target sequence, provided the dispensation of nucleotides is cycled to identify all possible alterations. Sensitivity is higher than that of Sanger sequencing and approximates the sensitivity of allele-specific and multiplex methods (~5%). However, the technology is somewhat complicated by short read lengths and by difficulties in interpretation when multiple nucleotides of the same species are added in sequence (homopolymers). Turnaround times are relatively short, making this technology popular.
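The relationship between nucleotide dispensation, signal height, and homopolymers can be illustrated with a toy simulation. The sketch below assumes an idealized reaction in which signal is exactly proportional to the number of nucleotides incorporated; the function name, the short sequences, and the dispensation order are illustrative choices and are not taken from any assay protocol.

```python
def pyrogram(synthesized: str, dispensation: str):
    """Return (nucleotide, signal) for each dispensation.

    `synthesized` is the strand being built, written 5' to 3'. Each dispensed
    nucleotide is incorporated as many times as it matches the next positions,
    so a homopolymer run yields one proportionally taller peak.
    """
    position = 0
    peaks = []
    for nucleotide in dispensation:
        incorporated = 0
        while position < len(synthesized) and synthesized[position] == nucleotide:
            incorporated += 1
            position += 1
        peaks.append((nucleotide, incorporated))
    return peaks


# Illustrative six-base window: GGT GGC (Gly-Gly) versus GTT GGC (Val-Gly).
print(pyrogram("GGTGGC", "GTGC"))  # [('G', 2), ('T', 1), ('G', 2), ('C', 1)]
print(pyrogram("GTTGGC", "GTGC"))  # [('G', 1), ('T', 2), ('G', 2), ('C', 1)]
```

The mutant is recognized only by a change in relative peak heights (G falls from 2 to 1 and T rises from 1 to 2), which shows why distinguishing runs of identical nucleotides depends on accurate signal quantification.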

Multiplex Assays

MASSarray (Sequenom, San Diego, Calif.) and SNaPshot (ABI, New York, NY) are based on similar chemistry but use different methods to measure the outcome of sequencing reactions. For MASSarray, a mass-biased probe is used, which is detected by mass spectrometry.²⁸⁸ For SNaPshot, a fluorescent probe is used, and standard sequencing capillary gel electrophoresis is used to detect mutant or wild-type peaks. PCR reactions are performed with multiplex primers, and panels of primer sets are constructed to detect mutations at multiple sites. The chemistry is based on single base extension from a probe complementary to the region of interest. Primers are positioned immediately adjacent to the mutated base, and addition of a dideoxynucleotide at the mutation site results in the formation of a labeled oligonucleotide. Probe sets can be designed at different lengths for specific alleles so that they can be multiplexed into panels of up to eight or nine analytes per panel. Amplified probe sets can be measured by the appropriate instrumentation, either mass spectrometry (MASSarray) or capillary gel electrophoresis (SNaPshot, Fig. 17-13). The amplification incorporated into this technology results in a high level of sensitivity, so that a tumor cell concentration of only 10% is required for mutation detection.
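The single base extension chemistry described above can be modeled schematically. The following sketch is an assumption-laden illustration: the `Probe` type, the toy sequences, and the tail lengths are hypothetical, strands are written in the orientation of the extended product for simplicity, and no attempt is made to reproduce either commercial platform.

```python
from typing import NamedTuple


class Probe(NamedTuple):
    name: str
    binding: str      # sequence ending immediately before the queried base
    tail_length: int  # non-binding 5' tail that only shifts product size


def single_base_extension(target: str, probe: Probe):
    """Return (product_length, incorporated_base) for one probe.

    The probe is extended by exactly one terminating nucleotide, so the
    product identifies the base at the queried position and its length
    identifies which probe produced it.
    """
    start = target.find(probe.binding)
    if start < 0:
        raise ValueError(f"{probe.name}: binding site not found")
    query_index = start + len(probe.binding)
    if query_index >= len(target):
        raise ValueError(f"{probe.name}: no base available to extend")
    product_length = probe.tail_length + len(probe.binding) + 1
    return product_length, target[query_index]


# Toy sequences (made up for illustration); the mutant differs at one base.
wild_type = "ACGTTAGCTGGTGGCGTA"
mutant = "ACGTTAGCTGTTGGCGTA"

panel = [
    Probe("site_1", binding="TTAGCTG", tail_length=0),
    Probe("site_2", binding="GGCG", tail_length=5),
]

print([single_base_extension(wild_type, p) for p in panel])  # [(8, 'G'), (10, 'T')]
print([single_base_extension(mutant, p) for p in panel])     # [(8, 'T'), (10, 'T')]
```

Because each probe yields a product of distinct length, several sites can be read from one reaction, with the incorporated base at each length indicating wild-type or mutant sequence.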

Figure 17-13 A, Multiplex testing is illustrated by a SNaPshot platform adopted as part of a program to identify selected mutations in 1000 pulmonary carcinomas (the Lung Cancer Mutation Consortium). The platform is based on single base extension polymerase chain reaction with labeled dideoxynucleotides (see Fig. 17-10). Mutational status was determined for 15 genes (*upper panel*) using 69 primer pairs. Up to eight primer pairs of appropriate length could be amplified in a single reaction and separated by capillary gel electrophoresis using an ABI 3730 sequencer. A representative chromatogram showing a mutant peak for BRAF (*arrow*) is shown in the lower panel, as well as wild-type alleles for EGFR, KRAS, NRAS, PIK3CA, TP53, and CTNNB1 (two alleles). This program demonstrated the feasibility of multiplex testing of formalin-fixed paraffin-embedded clinical samples at affordable cost to select targeted agents with potential for therapeutic success. The technology used in this demonstration project is rapidly being supplanted by higher throughput platforms. B, Application of the SNaPshot platform described in Figure 17-13, A, and fluorescence in situ hybridization testing for ALK identified driver mutations in 54% (280/516) of tumors tested. Ninety-seven percent of the mutations were mutually exclusive. Targeted agents were not available for most of the tumors with mutations and for all without mutations, emphasizing an ongoing need for a larger inventory of appropriately targeted agents in lung cancer.

NextGen Sequencing

Advances in sequencing technology are making possible the complete sequencing of the cancer genome¹⁵⁸,²⁸⁹,²⁹⁰ through such programs as TCGA, described elsewhere in this volume. It is now nominally possible to sequence the entire genome of a single tumor for individual patients at an accessible cost. Demand for genome-based selection of targeted anticancer agents is expected to increase rapidly. There is thus a developing need for flexible deep sequencing platforms to provide high-density sequencing data with broad coverage in a clinical setting. Two such platforms are currently available, Ion Torrent and MiSeq. Ion Torrent exploits the release of a hydrogen ion by DNA polymerase during sequencing (see Fig. 17-11) and measures changes in pH using an integrated semiconductor array for nonoptical sequencing.²⁹¹ The alternative MiSeq platform is a “sequence by synthesis” approach that uses serial additions of fluorescent-labeled nucleotides to immobilized products across a two-dimensional array, allowing parallel sequencing of thousands of nucleotide strands simultaneously. Published performance characteristics of MiSeq compare favorably with those of Ion Torrent.²⁹² Both platforms must be thoroughly validated for each target mutation before introduction into a clinical setting. These systems are being rapidly introduced in kit format and are expanding the sequencing capabilities of clinical laboratories from single-gene or small multiplex sequencing to large-scale parallel sequencing of hundreds of genes simultaneously. It is expected that in the near future, complete sequence coverage of all relevant genomic treatment targets will be available to treating physicians at local clinical laboratories.

Assays for Gene Rearrangements

Large-scale genomic rearrangements are not easily detected by conventional gene sequencing, and specialized methods have been used for decades to demonstrate innumerable complex abnormalities for which nomenclature is now standardized²⁹³ and uniformly applied in searchable databases (http://cgap.nci.nih.gov/Chromosomes/Mitelman). Large-scale genomic changes could potentially be identified by molecular methods such as array CGH, SNP arrays, or NextGen sequencing. However, at the present time, array CGH and SNP arrays test only copy number,²⁹⁴ and NextGen sequencing is not yet sufficiently refined to routinely detect rearrangements in a clinical setting. Only FISH is validated for application to clinical samples to identify complex rearrangements. FISH has the advantages of maintaining histologic architecture, accuracy on small sample sizes, and a wide availability in clinical laboratories. The large number of FISH tests currently in active use cannot be fully addressed here because of space limitations. However, lung cancer provides an example of the relevance of discovery and clinical testing for large-scale gene rearrangements to targeted therapy, as discussed in the next section.

FISH Assays and ALK Rearrangement

Copy number changes and chromosomal rearrangements are ubiquitous in lung carcinoma, but to date, only a small number of these changes are recurrent. One exception is the recently discovered ALK rearrangement. In these rearrangements, ALK is the oncogenic driver, switched on by the rearrangement. The EML4-ALK fusion is generated by a paracentric chromosomal inversion in the short arm of chromosome 2, between bands p21 and p23.2 (see Fig. 17-10). These two genes are originally separated by approximately 12.8 Mb. The other described ALK fusions in lung cancer, TFG-ALK,²⁵¹ KIF5B-ALK,²⁵² KLC1-ALK,²⁵³ and PTPN3-ALK,²⁹⁵ are caused by chromosomal translocations t(2;3)(p23.2;q12.1), t(2;10)(p23.2;p11.22), t(2;14)(p23.2;q32.1), and t(2;9)(p23.2;q31.3), respectively.

Numerous technical platforms are available for molecular diagnosis of ALK rearrangements in lung cancer, but the most commonly used in the clinical setting is FISH. The standard FISH assay uses a dual-color break-apart (BA) probe set encompassing the 3′ and the 5′ sequences immediately adjacent to the breakpoint area in ALK (intron 19), labeled with orange and green fluorophores, respectively.²⁹⁶ A chromogenic in situ hybridization BA assay for ALK rearrangements is also commercially available. Despite interesting results and a high level of concordance with FISH,²⁹⁷,²⁹⁸ the ALK chromogenic in situ hybridization assay has not yet been comprehensively explored. The ALK BA FISH probe was used in the initial clinical studies with crizotinib²⁵⁴,²⁹⁹ and is approved by the FDA as a companion diagnostic test. The ALK BA FISH assay is DNA based and is thus robust and resistant to technical artifacts. It has multiple other advantages: it is highly suitable for FFPE tissue sections, and it detects all variants of EML4-ALK, as well as fusions with any of the other reported or uncharacterized partners. On the other hand, the FISH platform requires specialized resources (e.g., a fluorescence microscope and laboratory space protected from light), along with highly trained personnel.
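The logic of scoring a break-apart assay can be sketched as a simple tally of signal patterns per nucleus. Everything specific in this example is an assumption made for illustration: the pattern labels, the choice to count isolated 3′ signals as positive, the nucleus counts, and the 15% cutoff passed in the usage lines. Laboratories apply the scoring criteria and cutoff validated for their own assay.

```python
from collections import Counter

# Assumed pattern labels for each scored nucleus.
FUSED = "fused"                      # 3' and 5' signals adjacent: intact ALK
SPLIT = "split"                      # 3' and 5' signals separated
ISOLATED_3PRIME = "isolated_3prime"  # 3' signal without its 5' partner

# Patterns counted as evidence of rearrangement (an assumption of this sketch).
POSITIVE_PATTERNS = {SPLIT, ISOLATED_3PRIME}


def score_break_apart(nuclei, positive_cutoff):
    """Return (fraction_positive, is_rearranged) for a list of scored nuclei.

    `positive_cutoff` is the fraction of positive nuclei at or above which
    the specimen is called rearranged; it must be supplied by the caller.
    """
    if not nuclei:
        raise ValueError("no nuclei were scored")
    counts = Counter(nuclei)
    positive = sum(counts[pattern] for pattern in POSITIVE_PATTERNS)
    fraction = positive / len(nuclei)
    return fraction, fraction >= positive_cutoff


# Hypothetical specimen: 50 nuclei scored.
nuclei = [FUSED] * 38 + [SPLIT] * 9 + [ISOLATED_3PRIME] * 3
fraction, rearranged = score_break_apart(nuclei, positive_cutoff=0.15)
print(fraction, rearranged)  # 0.24 True
```

Because the probes flank the ALK breakpoint rather than targeting a specific partner, this tally is the same regardless of which gene is fused to ALK, consistent with the partner-independent detection noted above.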

Recently, two other receptor kinase genes were identified as activated in lung cancers by fusions, ROS1²⁵¹ and RET.³⁰⁰ Integrated molecular and histopathology-based screening systems have been successful in detecting a large number of fusion partners in lung cancer for these two genes, and preclinical studies have confirmed their tumorigenic potential.²⁵² Clinical trials of targeted agents for ROS1 are currently under way, and rapid expansion of this field is expected.