S Phase and DNA Replication

Published on 28/02/2015 by admin

Filed under Basic Science

Last modified 22/04/2025

Print this page

rate 1 star rate 2 star rate 3 star rate 4 star rate 5 star
Your rating: none, Average: 2 (2 votes)

This article have been viewed 5557 times

CHAPTER 42 S Phase and DNA Replication

Accurate replication of DNA, which is crucial for cellular propagation and survival, occurs during the S phase (DNA synthesis phase) of the cell cycle. This chapter begins with a brief primer on the events of replication and then discusses its regulation. Next, the chapter covers the proteins that bind origins of replication and ensure that each region of DNA is replicated once and only once per cell cycle. It closes by discussing how the structure of the nucleus influences replication.

DNA Replication: A Primer

One of the most exciting predictions of the Watson-Crick model for the structure of DNA was a mechanism for DNA replication. Because DNA strand pairing is determined by complementary base pairing, it was logical to propose the existence of DNA polymerases, enzymes that would move along a single strand of DNA, recognize each base in turn, and insert the proper complementary base at the end of the growing chain. Thus, one might have surmised that only a single enzyme was required for DNA synthesis. In fact, DNA replication in eukaryotic cells involves a complex macromolecular machine.

In the basic reaction of DNA replication, the 3′ hydroxyl at the end of the growing DNA strand makes a nucleophilic attack on the α-phosphate of the incoming nucleoside triphosphate to form a phosphodiester bond. This incorporates the nucleotide into the growing chain and releases pyrophosphate (Fig. 42-1). Subsequent hydrolysis of the pyrophosphate provides the driving force for the reaction. This reaction requires the presence of a template strand of DNA that specifies, through base pairing, which of the four nucleoside triphosphates is added to the growing complementary strand.

Before discussing DNA replication and its regulation, an introduction to some terminology describing the geometry of replicating DNA is required. The exact site on the chromosomal DNA where replication begins is termed the origin of bidirectional replication. As the termbidirectional implies, two sets of DNA replication machinery head off in opposite directions from the origin. Each set of replication machinery, together with the DNA that it is replicating, is called a replication fork because at the site of replication, one parental DNA molecule splits into two (Fig. 42-2). It is not known whether replication forks move along the DNA like trains along a track or whether the fork sits at a stationary site (referred to as a replication factory) through which the DNA is “reeled in” as it is replicated.

The bidirectional nature of DNA replication causes a fundamental problem, as DNA synthesis invariably proceeds in a 5′ to 3′ direction. Replication of the so-called leading strand poses no problems. This is the strand along which the fork moves in a 3′ to 4′ direction, so the newly synthesized DNA is laid down smoothly in a 5′ to 3′ direction (Fig. 42-2). However, the other template strand faces in the opposite direction, apparently requiring DNA polymerase to synthesize DNA in the wrong direction as the replication fork progresses away from the origin (i.e., adding nucleotides in a 3′ to 4′ direction). No DNA polymerase with this polarity has been found. Instead, this lagging strand replicates in a series of short segments. Every time the DNA strands have been peeled apart (unwound) by 250 nucleotides or so, a polymerase/primase complex (see Fig. 42-11) initiates DNA synthesis on the lagging strand, with the polymerase running back toward the replication origin in a 5′ to 3′ direction. Locally, synthesis on the lagging strand proceeds in a direction opposite to the overall direction of fork movement. Synthesis of each lagging strand fragment stops when DNA polymerase runs into the 5′ end of the previous fragment. Thus, the lagging strand is copied in a highly discontinuous fashion into short fragments known as Okazaki fragments (named after their discoverer [Fig. 42-2]). Fig. 42-11 describes the enzymes and events at the replication fork in greater detail.

image

Figure 42-11 the main events of dna replication. For a more detailed description, see the text.

(PDB file for Fen1: 1A76. PDB file for RFC/PCNA: 1SXJ. PDB file for Cdt1: 1WLQ.)

Origins of Replication

Bacteria such as Escherichia coli replicate their circular chromosomes using two replication forks starting from a single origin of replication (Fig. 42-3A), but eukaryotes must use multiple origins of replication to duplicate their large genomes during a relatively short S phase, which can be limited to as little as a few minutes in some early embryos. These numerous origins are distributed along the chromosome: up to 400 in budding yeast and about 60,000 in human cells. These origins are positioned so that all of the DNA is replicated in the available time, and to be on the safe side, more origins are prepared than are actually needed.

The existence of multiple origins creates a potential hazard: If any origin were used more or less than once per cell cycle, genes would be duplicated or lost. How is the “firing” of all of these origins orchestrated so that each is used once and only once per S phase? Cells manage this problem by a mechanism termed licensing, which ensures that each origin is used once and only once per S phase. Each origin is licensed to replicate once and only once per cell cycle. Replication of the origin removes the license, which cannot normally be renewed until the cell has completely traversed the cycle and has passed through mitosis.

A unit of chromosomal DNA whose replication is initiated at a single origin is termed a replicon. The origin is defined genetically as a replicator element. The classic replicon is the E. coli chromosome (which is 4 × 106 base pairs [bp] in size); this has a single replicator site called oriC (Fig. 42-3). An initiator protein (product of the E. coli DnaA gene [Fig. 42-12]) binds to this origin and either directly or indirectly promotes melting of the DNA duplex, giving the replication machinery access to two single strands of DNA. Other factors bind to the initiator, and their concerted action produces a wave of DNA replication proceeding outward in both directions along the DNA (a replication “bubble”) at about 750 to 1250 bases per second.

An average human chromosome contains about 150 × 106 bp of DNA. Because the replication machinery in mammals moves only about 20 to 100 bases per second (probably reflecting the fact that the DNA is packaged into chromatin [see Chapter 13]), it would take up to 2000 hours to replicate this length of DNA from a single origin. In most human cells, the duration of the S phase is about eight hours. This means that at least 25 to 125 origins of replication would be required to replicate an average chromosome in the allotted time. In fact, origins of replication are much more closely spaced than this. It has been estimated that mammalian origins of replication are spaced about 100,000 to 150,000 bp apart. Thus, approximately 60,000 origins of replication participate in replication of the entire human genome.

To explain the events at origins of replication, the budding yeast Saccharomyces cerevisiae serves as a good example. Its DNA replication is better understood than that of any other eukaryote.

Replication Origins in S. Cerevisiae

About 400 origins of replication participate in replicating the budding yeast genome. A major breakthrough in understanding DNA replication in S. cerevisiae was the identification of short (100 to 150 bp) segments of DNA that act as replication origins in vivo when cloned into a yeast plasmid (circular DNA molecule). These autonomously replicating sequences (or ARS elements) allow yeast plasmids to replicate in parallel with the cellular chromosomes (Fig. 42-4). ARS elements are often, although not always, bona fide replication origins in their native chromosomal context. Replication always initiates within ARS elements, but not all ARS ele-ments act as origins of DNA replication in every cell cycle.

Yeast replication origins are spaced about every 30,000 bp, with a maximum separation of about 130,000 bp. Even this longest interval should replicate easily within the 30 minutes available during the S phase. Because the number of origins exceeds the number required to replicate the genome within the allotted time, some origins need not “fire” every cell cycle. The probability that any given origin will be used in a given cell cycle ranges from less than 0.2 to more than 0.9. It is important to note that replication of an origin by a fork coming from an adjacent origin inactivates it, thereby preventing excess replication during the cell cycle.

The ARS element does two things to establish an origin of replication. First, it has conserved sequences that act as binding sites for a protein complex that marks it as a potential origin. Second, it has nearby sequences that can readily be induced to unwind (become un-base-paired).

Budding yeast ARS elements share a common DNA sequence motif called the ARS core consensus sequence: 5′-(A/T)TTTAT(A/G)TTT(A/T)-3′ (Fig. 42-5). Single base mutations at several locations within this sequence completely inactivate ARS activity. Other, less well-conserved DNA sequences also contribute to the activity of the ARS as a replication origin. One of these, termed B1, together with the ARS core, forms the binding site for a complex of six proteins (five of which are AAA ATPases) termed the origin recognition complex (ORC [see later section]). The DNA unwinding element is thought to be another short sequence (B2) located a bit further along the DNA. DNA synthesis begins at an origin of bidirectional replication midway between the ORC binding site and the DNA unwinding element.

ORC was identified by its ability to bind the 11-bp ARS core sequence (Fig. 42-5). This binding has two noteworthy features. First, it requires adenosine triphosphate (ATP), which remains associated with the ORC complex. Second, in yeast, the ORC complex remains bound to the origins of replication across the entire cell cycle. Thus, something other than the presence of ORC must be responsible for regulating the periodic activation of origins in the S phase (see Fig. 42-14). In metazoans, ORC behavior is more complex; the largest subunit, Orc1, cycles on and off the DNA in a cell-cycle-regulated manner.

image

Figure 42-14 Measurement of the time of replication of particular chromosomal regions in Saccharomyces cerevisiae. A–C, This protocol is based on a classic density shift experiment of Messelson and Stahl that proved that DNA replication is semiconservative. S. cerevisiae cells are grown for several generations in a medium containing 13C and 15N heavy isotopes. As a result, their DNA is fully substituted with heavy isotopes. At the beginning of the experiment, the cells are synchronized so that they enter the S phase in a single wave. At the same time, the heavy (H) isotope medium is removed and replaced with “light medium” (L) containing 12C and 14N. At various times after the initiation of the S phase, aliquots of cells are removed, and the DNA is isolated. The DNA is then cleaved with restriction enzymes so that the chromosomes are cut into many fragments. DNA from each time point is then subjected to CsCl density gradient centrifugation. When any local region of DNA is replicated, its density alters from heavy/heavy to heavy/light. After very short incubations with light isotopes, only DNA near the origin of replication will be heavy/light; all other DNA will be heavy/heavy. These two populations of molecules are separated from one another by the density gradient centrifugation. To examine the timing of replication of a specific gene, a cloned segment of DNA corresponding to the region of interest is used to probe (by DNA hybridization) the heavy/heavy and heavy/light peaks from each gradient. The time of replication of each locus is the time at which the restriction fragment being detected by DNA hybridization moves from the heavy/heavy peak to the heavy/light peak. The numbers in panels B and C refer to the numbered regions of the chromosomes shown in A. D, Data from a replication timing experiment show that in budding yeast, centromeres replicate early in the S phase and telomeres replicate late. To generate curve a, fractions from a gradient like that shown in panel B were hybridized to a cloned centromere region. To generate curve b, fractions from the same gradient were hybridized to a cloned telomere region probe. Note that in mammalian cells, centromeres replicate late and telomeres replicate earlier. (This figure is based on the work of the laboratory of B. J. Brewer and W. L. Fangman.

ARS elements typically contain binding sites for other sequence-specific DNA binding proteins, such as transcription factors. For example, a transcription factor called ARS-binding factor 1 (ABF-1) binds to the B3 sequence within the ARS1 element (Fig. 42-5). Deletion of the ABF-1 binding site only slightly reduces the ability of ARS1 to act as a replication origin in vivo. Furthermore, substitution of DNA binding sequences for other transcription factors within the B3 sequence has little effect on replication efficiency.

In addition to their role in DNA replication, several ORC components also seem to regulate heterochromatin formation and transcription (see Chapters 13 and 15). This cross talk between the machinery used for transcription and DNA replication may explain why regions of chromosomes with actively transcribed genes typically replicate early in the S phase (see the discussion that follows). The Orc6 subunit also functions in mitosis at kinetochores and during cytokinesis. Its detailed role in those processes is not known.

Replication Origins in Mammalian Cells

Far less is known about the structure and function of mammalian origins of DNA replication than about ARS elements in budding yeast. Attempts to develop a mammalian equivalent to the yeast ARS assay have had few successes. It is now accepted that mammalian origins of replication are much more complex than those of their budding yeast counterparts. Mammalian origin activity is affected by DNA sequence, DNA modifications, chromatin structure, and nuclear organization.

At present, two types of mammalian replication origins are known. The first is exemplified by the origin of replication adjacent to the lamin B2 gene (Fig. 42-6A). This origin “fires” within the first several minutes of the S phase, and a variety of methods have succeeded in mapping it to a stretch of less than 500 bp. Within this region, a single origin of bidirectional replication appears to be used. Thus, the lamin B2 origin of replication appears to be analogous to the well-characterized budding yeast origins.

The second is exemplified by the widely studied replication origin lying just downstream of the hamster gene for dihydrofolate reductase, an enzyme that is essential for biosynthesis of thymidine. This origin is accessible to experimental study because it is possible to select for cells with this chromosomal region amplified as hundreds or even thousands of copies (Fig. 42-6B). By looking for the first regions of the amplified DNA to replicate, the origin of replication was initially located within a region of about 55,000 bp. It now appears that DNA replication can initiate with low efficiency at roughly 20 sites distributed throughout this broad zone. Two of these sites are used with relatively higher efficiency, accounting for about 20% of all initiation in the region. These sites, termed Ori-β and Ori-γ (Fig. 42-6), each encompass about 0.5 to 2 kb of DNA.

A third view of vertebrate DNA replication origins came from studies of DNA replication in Xenopus eggs. Once activated by fertilization or by various experimental tricks, Xenopus eggs divide about an hour later and then undergo a rapid sequence of cell cycles, each of which lasts about 30 minutes. Any DNA that is injected into these eggs is rapidly and efficiently replicated. Prokaryotic DNA and eukaryotic DNA are replicated with similar efficiency, and careful studies demonstrated that this replication initiates randomly with respect to DNA sequence (i.e., does not use defined origins). This promiscuous initiation of DNA replication appears to be a specialized adaptation by early embryos to permit replication of the chromosomes in the very brief temporal window available.

The emerging view is that the replication machinery is highly conserved between budding yeasts and vertebrates but that the location of replication origins is much more flexible in vertebrates. This might in part reflect the diverse range of cell cycles required to make a complex metazoan. The mix of conserved components with divergent uses is a recurring theme in DNA replication.

Assembly of the Prereplication Complex

To preserve the integrity of the genome, each origin of replication must “fire” only once per cell cycle. We now have a reasonable understanding of the various solutions to this problem that have been reached by differing model organisms and vertebrates.

Recall that yeast ORC is stably bound to replication origins throughout the cell cycle. However, ORC is not the trigger for DNA replication. Rather, it acts as a “landing pad” for assembly of a prereplication complex of other proteins that initiates DNA replication. During late anaphase or very early G1 phase, several proteins, including Cdc6p and Cdt1, bind to the ORC complex at origins of replication (Table 42-1). ORC-Cdc6p-Cdt1 then recruits a complex of Mcm proteins to the origin and loads it onto the DNA. This prereplication com-plex of ORC, Cdc6p, CDT1, and minichromosome maintenance (Mcm) proteins (Fig. 42-7) assembles at each replication origin before the onset of the S phase.

Table 42-1 BIOCHEMICAL ACTIVITIES REQUIRED FOR REPLICATION OF DNA IN EUKARYOTES

Activity Name of Protein
Origin recognition ORC (origin recognition complex; five of six subunits are AAA ATPases)
Pre-replication complex Cdc6 (recruits Mcm 2–7)
Cdt1 (recruits Mcm 2–7)
Mcm 10 (stimulates Cdc45 and polymerase α binding)
Origin activation Cdk2-cyclin A
Cdc7p-Dbf4p
Cdc45p (recruits RPA and polymerases. Needed for elongation of growing chain)
GINS complex (needed for polymerase binding and elongation of growing chain)
DNA unwinding (helicase) Mcm 2–7 proteins (precede other fork components)
Mcm 8 (controversial, may be elongation helicase)
Stabilization of single-stranded DNA RPA (binds single-stranded DNA)
Polymerase/primase DNA polymerase α (no editing function)
Replicative polymerases DNA polymerase δ
DNA polymerase ε (both have 3′–5′ exonuclease editing capability)
Processivity factor PCNA (ring-shaped clamp that slides along the DNA. Keeps polymerases δ and ε attached to the template strand so that they make longer chains; coordination of cell cycle control and replication; role in repair)
PCNA loader RF-C (Binds primer: template junction. AAA ATPase. Loading factor for PCNA, important for polymerase switch)
Closing Factors
Removal of RNA primer Fen1 5′  3′ exonuclease
RNase H
Ligation of discontinuous DNA fragments DNA ligase I
Releasing superhelical tension DNA topoisomerase I
Disentangling daughter strands DNA topoisomerase II

Mcm proteins were identified in a screen for genes of budding yeast that are required for the stability of small artificial chromosomes. Six of these Mcm genes encode a structurally related group of proteins, termed Mcm 2–7, that are required for DNA replication. Mcm 2–7 proteins form a hexameric complex that is thought to be shaped like a doughnut. Somehow, Cdc6p–Cdt1 uses ATP hydrolysis to thread DNA through the central hole of the Mcm doughnut. Although the function of the Mcm 2–7 complex is not known for certain, the predominant view is that it is a DNA helicase, an enzyme that uses ATP hydrolysis to separate DNA strands (see Fig. 42-11). It is currently thought that Mcm 2–7 binding to the prereplication complex is the key point of regulation at which origins are “licensed” so that they replicate only once per cell cycle.

In mammals, licensing occurs in several stages, all before passage of the restriction point (see Chapter 41). During telophase, Cdc6, Cdt1, and Mcm 2–7 bind to origins all across the chromosomes. Later, in the early G1 phase, these licensed origins are somehow processed to select the subset of origins that will fire in the subsequent S phase. A third step establishes the relative temporal order in which origins will fire.

At least three mechanisms regulate licensing. The first involves negative regulation of Cdc6p activity by Cdks, which inhibit Cdc6p–Cdt1 from loading Mcm proteins onto DNA. At the exit from mitosis, destruction of cyclins and synthesis of inhibitory proteins inactivates Cdks, creating a window of time between anaphase and the restriction point for licensing replication origins (see Fig. 40-18). Once mammalian cells pass the restriction point, the levels of Cdk2–cyclin E, and subsequently, Cdk2–cyclin A rise again (see Fig. 41-11) preventing the reassembly of prereplication complexes until after the next mitosis. In yeasts, the single Cdk that is complexed with B-type cyclins inhibits prereplication complex reassembly. Experimental inactivation of Cdk1 during the G2 phase in the fission yeast Schizosaccharomyces pombe demonstrated the importance of kinase activation: Cells lacking Cdk1 activity assembled prereplication complexes on already replicated DNA and then carried out further rounds of “illegal” DNA replication without division.

In vertebrates a protein called geminin is a critical regulator of origin “licensing.” Geminin binds to Cdt1 and prevents it from loading Mcm proteins onto DNA. The anaphase-promoting complex/cyclosome (APC/C) (see Fig. 40-16) degrades or inactivates geminin, keeping its concentration very low from anaphase through late G1 when prereplication complexes assemble. Accumulation of geminin starting in the S phase prevents the assembly of new prereplication complexes until after the next mitosis. Yeasts lack geminin, but in vertebrates, regulation of geminin and Cdt1 levels by proteolysis appears to be the primary method of controlling origin licensing.

A third way to regulate origin “licensing” involves sequestering molecules that are required to assemble the prereplication complex in the cytoplasm following the onset of the S phase. This was first suggested by studies of DNA replication in Xenopus egg extracts (see Fig. 40-8) in which nuclei replicate their DNA once and only once unless their nuclear envelopes are perforated, in which case the DNA replicates again. In living cells, this regulation by the nuclear envelope appears to be significant only in yeasts, in which factors that are excluded from the nucleus after replication include Mcm proteins.

Components of the prereplication complex are absent from differentiated (G0) cells. In fact, detection of these proteins with antibodies in cells from cervical smears is currently being developed as a sensitive method for the early detection of cancer cells (Fig. 42-8).

image

Figure 42-8 Sections of human cervix stained with antibodies to Mcm5. A, Normal G0 cells in this stratified epithelium lack Mcm5 and other replication proteins. B–C, Cancer cells express Mcm5 at higher levels as they become more malignant.

(Adapted from Williams GH, Romanowski P, Morris L, et al: Improved cervical smear assessment using antibodies against proteins that regulate DNA replication. Proc Natl Acad Sci U S A 95:14932–14937, 1998.)

Signals That Start Replication

A classic experiment (Fig. 42-9) demonstrated that (1) a cytoplasmic inducer triggers the transition into the S phase and (2) this inducer triggers DNA replication in a G1 nucleus but not in a G2 nucleus. The inducer is very likely a combination of protein kinases, including Cdk-cyclin pairs, as well as a specialized kinase, Cdc7p-Dbf4p. In mammals, Cdk2–cyclin E, whose activity is maximal at the G1/S transition (Fig. 42-10), phosphorylates Rb, thereby opening the restriction point “gate” and allowing the E2F/DP dimer to function as a transcription factor and stimulate the transcription of genes involved in DNA replication (see Chapter 41). In addition to cyclin E itself, genes targeted by E2F include cyclin A, Cdc25A, enzymes required for synthesis of DNA precursors (dihydrofolate reductase, thymidine kinase, and thymidylate synthase), origin-binding proteins Cdc6p, Orc1, Cdt1 and its inhibitor geminin, and two components of the replication machinery (DNA polymerase a and proliferating cell nuclear antigen [PCNA]; see Fig. 42-11).

In the S phase, the Cdk inhibitor p27Kip1 is a target for the SCFSkp2 ubiquitin ligase complex, which marks it for destruction by proteasomes (see Chapter 40). SCF gets its name from three of its components: Skp2, cullin, and F-box proteins (see Fig. 40-17). Skp2, which is short for “S-phase kinase-associated protein,” got its name because it was first identified in a complex with Cdk2–cyclin A. This kinase targets proteins for recognition by SCF, which recognizes and ubiquitinates its substrates only after they have been phosphorylated at certain key positions. E2F/DP and cyclin E are also degraded when cells enter the S phase, and this is apparently triggered by Cdk2–cyclin A (Fig. 42-10).

The second kinase involved with initiation of DNA replication is Cdc7p with its associated subunit Dbf4p. This kinase seems to act at the level of individual DNA replication origins. Careful analysis has revealed that Cdc7p-Dbf4p is required for firing of origins in both early and late S phase. Cdc7p is capable of phosphorylating several Mcm proteins. This phosphorylation may somehow trigger the start of replication fork movement.

Dbf4p, which is responsible for targeting Cdc7p to origins, is very unstable from anaphase through G1 phase. This period of Dbf4 instability coincides with the cell-cycle period during which prereplication complexes are assembled, and it may provide a mechanism to ensure that origins do not fire prematurely until the cell is ready to enter the S phase.

At the onset of DNA replication, each origin of replication has bound to it the ORC complex, Cdc6p-Cdt1, and multiple hexameric Mcm complexes. (See Table 42-1 for a description of the major activities involved in DNA replication. See Box 42-1 for an introduction to DNA replication in E. coli.)

BOX 42-1 DNA Replication in Escherichia Coli

The DNA replication system of Escherichia coli has been reconstituted entirely from purified components. Analysis of this system reveals many similarities with eukaryotic replication, indicating that this process is highly conserved. E. coli DNA replication can be subdivided into three phases: initiation, elongation, and termination. Thus far, at least 28 polypeptides are known to be involved.

Initiation: E. coli chromosomal DNA replication initiates within a 245-bp region, termed oriC. This region contains four 9-bp binding sites for the E. coli initiator protein, DnaA. Nearby are three repeats of a 13-bp A: T-rich sequence. oriC also contains specific binding sites for two small histone-like proteins called HU and IHF. Replication is initiated with the cooperative binding of 10 to 20 DnaA monomers to their specific binding sites (Fig. 42-12). To be active, these monomers must each have bound ATP. Binding of DnaA permits unwinding of the DNA at the 13-bp repeats, in a reaction that requires the histone-like proteins. Next, DnaC binds to DnaB and escorts it to the unwound DNA. DnaB is the key helicase that will drive DNA replication by unwinding the double helix, but it binds DNA poorly on its own in the absence of its DnaC escort. Once DnaB has docked onto the DNA, DnaC is released, and the helicase can then start to unwind the DNA, provided that ATP, SSB, and DNA gyrase are present. SSB is a single-stranded DNA binding protein that stabilizes the unwound DNA, and DNA gyrase is a topoisomerase (see Chapter 13) that removes the twist that is generated when the two strands of the double helix are separated.

Elongation: As in eukaryotes, E. coli DNA replication involves a leading strand, with the daughter DNA synthesized as a single continuous molecule, as well as a lagging strand, with the DNA synthesized as discontinuous Okazaki fragments. All daughter strands are started by an RNA primase that deposits primers of 11 ± 1 nucleotides. The enzyme that actually synthesizes the DNA is the polymerase III holoenzyme, which has at least 10 subunits. This contains polymerase and proofreading subunits and is held to the DNA by a doughnut-like “sliding clamp” (b). The b is loaded onto the DNA by a pentameric complex in a process that requires ATP. The parallel with PCNA and RFC in eukaryotes is striking. Activities specific for the lagging strand include RNase H, which removes the RNA primers; DNA polymerase I, which fills in the gaps left behind by primer removal; and DNA ligase, which links the Okazaki fragments together. DNA replication in E. coli is significantly faster than it is in eukaryotes, with the fork moving at a rate of about 1000 bp per second. This higher speed is presumed to be at least partially attributable to the absence of nucleosomes on the bacterial chromosome.

Termination: A specialized termination zone is found on the circular E. coli chromosome opposite oriC. This zone contains binding sites called ter sites, to which the ter binding protein binds. This protein appears to block the movement of DNA helicases, such as DnaB, thereby stalling the DNA replication fork. Following termination of replication, a specialized topoisomerase, the product of the parC and parE genes, is required to separate the daughter chromosomes from one another.

Mechanism of DNA Synthesis

For DNA replication to start, the paired strands of the double helix must be separated. This permits the DNA polymerase to bind and begin synthesizing the daughter strand. DNA strand separation is driven by a DNA helicase, an enzyme that uses ATP hydrolysis to peel apart the paired strands of the DNA double helix. Despite exhaustive efforts, the identity of this helicase is not firmly established in eukaryotes, but it is striking that both viral and bacterial helicases are hexameric protein complexes. This, plus limited experimental evidence, has led to the general belief that the hexameric Mcm complex is the eukaryotic DNA helicase. Other helicases may also participate.

Locally, DNA replication appears to start when the Cdk2-cyclin A and Cdc7p-Dbf4p kinases activate the prereplication complex. Key phosphorylated proteins include Mcm 2–7 and Cdc6 (Fig. 42-11A). Phosphorylation triggers a change in the binding of Cdc6p and Cdt1 to the DNA. Cdc6p remains bound to the chromatin throughout the S phase, while Cdt1 is released and degraded. Activation recruits to the origin a protein called Cdc45p together with a single-strand DNA-binding protein, RPA (Fig. 42-11B). Several other proteins also bind at this time, but their detailed functions are still being elucidated (Table 42-1).

Cdc45p appears to associate with the Mcm proteins and promote the binding of RPA, forming a complex that somehow activates the Mcm helicase. Cdc45p and RPA then recruit DNA polymerase to the origin (Fig. 42-11C). As the helicase starts to separate the DNA strands, moving outward in both directions from the origin of bidirectional replication, RPA stabilizes the separated strands, ensuring that they do not base-pair with one another again. Recent results suggest that the Mcm 8 protein may take over as the helicase once the replication fork has moved away from the replication origin (Fig. 42-11H), but this remains under investigation.

The separated DNA strands are ready for replication, but DNA synthesis always involves addition of an incoming nucleoside triphosphate to a free 3′ OH group at the terminus of a preexisting nascent polynucleotide (Fig. 42-1). In the absence of a nascent DNA chain, how does DNA polymerase get started? This problem is solved by a DNA-dependent RNA polymerase called a primase, which, like other RNA polymerases, can initiate synthesis de novo without the need for a 3′ OH group. In eukaryotes, all DNA chains are started by a complex of DNA polymerase a and a primase subunit, collectively known as Pol α/Primase. Primase synthesizes an RNA chain of about 10 nucleotides to which DNA polymerase a adds another 20 to 30 nucleotides of so-called initiator DNA (iDNA) (Fig. 42-11D–E). These initiating reactions are potentially hazardous, because DNA polymerase a lacks proofreading ability. Any errors in matching up an incoming base would create a mutation. Given the huge number of initiation events that are required to replicate an entire genome, this potential for errors is not acceptable. Therefore, the RNA primer and most or all of the initiator DNA laid down by Pol α/Primase are subsequently replaced.

Once Pol α/Primase has done its job, two further essential factors act. A pentameric protein complex called replication factor C (RFC) binds the 3′ end of the initiator DNA. RFC uses energy from ATP hydrolysis to load the trimeric protein PCNA onto the DNA (see Fig. 42-11F–G). The PCNA trimer is doughnut-shaped, and when the DNA is inserted into its central hole, it is topologically locked onto the DNA. RFC binding and PCNA loading displace Pol α/Primase from the DNA, and PCNA then recruits DNA polymerases δ and ε to the DNA. Moving along with the sliding platform of PCNA, these polymerases then process along the DNA, synthesizing DNA continuously on the leading strand (see Fig. 42-11H). On the lagging strand, they synthesize about 250 bp of DNA until they run into the next Okazaki fragment. Cdc45p may be a scaffolding factor that holds the Mcm hexamer and the replicative DNA polymerases together as the fork moves.

Both polymerases δ and ε have associated exonuclease activities. This enables them to proofread the newly synthesized DNA and correct any mistakes that they have made. This may explain the amazing fidelity of DNA replication, with typically only one error per 109 bp polymerized.

The final steps of DNA replication are removal of the RNA primer (and probably initiator DNA) and ligation of adjacent stretches of newly synthesized DNA. Removal of the primer can be accomplished in two ways (Fig. 42-11I). On one hand, an RNA exonuclease called RNase H can chew in from the 5′ end of the primer. However, this enzyme cannot remove the last ribonucleotide that is joined to initiator DNA. That requires a second nuclease, called Fen1. Alternatively, Fen1 can do the whole job itself if it gets help from a helicase. In this case, the helicase peels the RNA (and possibly the initiator DNA) away from the template, creating a sort of flap. Fen1 then cleaves at the junction where the flap is anchored to the DNA template, removing the oligomer of unwanted nucleotides in one step.

Following removal of initiator RNA, the Pol d/PCNA complex extends the upstream nascent chain until it runs into the 5′ end created by Fen1. DNA ligase I then joins the two stretches of DNA together (Fig. 42-11J).

Higher-Order Organization of DNA Replication in the Nucleus

A wide variety of experimental evidence revealed that the unit of replication in eukaryotic chromosomes is not the individual replicon but rather a replicon cluster. Evidence for this higher-order organization of DNA replication within the nucleus was first obtained by fiber autoradiography. Cells were fed radioactive precursors for DNA synthesis and then examined by electron microscopy. (For an explanation of this technique, see Fig. 40-3.) The spatial distribution of DNA replication during the S phase is more readily observed by using BrdU, a nucleotide base analog that is incorporated into DNA by the replication machinery in place of thymidine (this is called by its more correct name of Br-dUTP in Fig. 42-13). Incorporation of BrdU into DNA makes the newly synthesized daughter DNA strand heavier, allowing its separation from the parental DNA by centrifugation on a cesium chloride density gradient (see Chapter 6). In addition, specific antibodies that recognize DNA containing either BrdU or the related reagents IdU and CldU can be used to localize the changing patterns of DNA synthesis as cells traverse the S phase. More recently, analogs have been developed in which the Br in Fig. 42-13A has been replaced by a fluorescent group. This allows the newly replicated DNA to be observed directly in living cells.

image

Figure 42-13 visualization of dna replication within the nucleus. A, The protocol for fluorescent labeling of newly replicated DNA. BrdUTP is introduced into DNA in place of dTTP. The incorporated BrdU molecules are detected by fluorescence labeling with labeled antibodies. B, In a related technology, green-dUTP and red-dUTP, when added together, show the many sites of DNA replication in a cell nucleus. Because both UTP analogs are incorporated simultaneously into the DNA, the sites of replication appearyellow. C, Green-dUTP is followed by red-dUTP added three hours later. The later sites of DNA replication show very little overlap with the earlier sites. D, Mitotic chromosome from a cell that was labeled early in the S phase with IdU (green), and then four hours later with CldU (red). The late-replicating and early-replicating regions of the chromosome are segregated into discrete bands. E, CldU (green) added early in the S phase and IdU (red) added four hours later show little overlap. F, CldU (green) added early in the S phase and IdU (red) added six hours later show no overlap. The large red blocks of labeling seen with the IdU are characteristic of the pattern of replicating heterochromatin seen late in the S phase. Bodipy-TR-dUTP, a red fluorescent form of dUTP; BrdUTP, Bromo-deoxyuridine triphosphate; CldU, Chlorine-dUTP; Fluorescein-dUTP, a green fluorescent form of dUTP; IdU, Iodine-dUTP. All are used in place of dTTP (thymidine triphosphate) in DNA synthesis.

(B–C, Courtesy of P. R. Cook, University of Oxford, England; reproduced from Manders EMM, Kimura H, Cook PR: Direct imaging of DNA in living cells reveals the dynamics of chromosome formation. J Cell Biol 144:813–821, 1999. Copyright 1999 The Rockefeller University Press. D, Courtesy of A. I. Lamond, University of Dundee, Scotland; reproduced from Ferreira J, Paolella G, Ramos C, et al: Spatial organization of large-scale chromatin domains in the nucleus: A magnified view of single chromosome territories. J Cell Biol 139:1597–1610, 1997. Copyright 1997 The Rockefeller University Press. E–F, Reproduced from Ma H, Samarabandu J, Devdhar RS, et al: Spatial and temporal dynamics of DNA replication sites in mammalian cells. J Cell Biol 143:1415–1425, 1998. Copyright 1998 The Rockefeller University Press.)

These methods reveal up to 1000 sites of active replication, called replication foci, at any one time during the S phase in a mammalian cell nucleus (Fig. 42-13B–C and E–F). Given that each of these replication foci is active for only about one hour out of the eight- to ten-hour S phase, a cell will replicate DNA at about 10,000 of these foci. Given roughly 60,000 origins in a mammalian cell, each replication focus represents five or six replication origins that are activated coordinately. These replication foci may be associated with the nuclear matrix or nucleoskeleton (see Chapter 13).

Temporal Control of Replication during the S Phase

The term S phase gives the impression that all DNA replicates more or less synchronously, but this is far from true. At any given time during the S phase, only 10% to 15% of the replicons actively synthesize DNA. Some replicate earlier, others later. It is important to note that this pattern of replication is not random; some origins consistently replicate early in the S phase, whereas others consistently replicate late in the S phase. Overall, the human genome can be subdivided into at least 1000 “zones,” each of which replicates at a characteristic time during the S phase. The organization of replication zones corresponds roughly to the organization of chromosomes into banding patterns: early-replicating regions typically correspond to gene-rich R bands, whereas late-replicating regions typically correspond to gene-poor G-bands (Fig. 42-13D; compare with Fig. 13-14). A similar division of chromosomes into early- and late-replicating regions also holds true for budding yeast, although many fewer replication origins are involved.

BrdU labeling experiments show that the basic unit of chromosomal DNA replication is a cluster of roughly five replication origins that fire coordinately. What must now be superimposed on this view of the replicating chromosome is a second level of regulation: the time at which each replicon cluster fires during the S phase. This can be seen clearly by synchronizing cells at the beginning of the S phase, releasing them from cell-cycle arrest, and then exposing them to BrdU at various times thereafter. This experiment reveals very distinctive patterns of DNA synthesis occurring at different times during the S phase (Fig. 42-13B–C and E–F). Early on, euchromatin replicates throughout the nucleus. Later, replicating regions appear concentrated around nucleoli and other areas of more condensed chromatin. Toward the end of the S phase, replication is largely concentrated in blocks of heterochromatin. These observations show that DNA replication occurs throughout the nucleus, wherever DNA is located. DNA does not move to a small number of discrete sites to be replicated (as was previously thought).

The most striking aspect of these patterns of DNA synthesis is their reproducibility from one cell cycle to the next. For example, regions of DNA labeled early in the S phase overlap little or not at all with DNA labeled three hours later (Fig. 42-13C and E). However, DNA labeled at corresponding points of the S phase in two successive cell cycles superimposes almost entirely. Thus, the chromosomal substructure that gives rise to replication foci is stable from one cell cycle to the next. This strongly suggests that particular regions of chromosomes are organized into reproducible structural domains and that each domain has a particular “window” during the S phase during which it replicates. This is significant. In one study, chromosomal regions that replicated at the wrong time during the S phase as a result of a mutation in an ORC subunit had a defective condensed structure in the next mitosis.

The timing of replication of particular replication origins has been studied most carefully in budding yeast. First, a procedure was developed whereby all cells in a population could be induced to enter the S phase synchronously. Next, the shift in the density of the DNA following BrdU incorporation was used to distinguish between DNA that had replicated and DNA that had not (Fig. 42-14). It then became relatively simple to take DNA probes from different regions of the chromosome and determine when each replicated (changed its density) during the S phase. This protocol demonstrated that each ARS element replicates at a characteristic time during the S phase.

There are at least three possible explanations for the sequence of replication patterns seen for different chromosomal regions:

The Intra-S Checkpoint

A powerful group of three checkpoints, which we here collectively call the intra-S checkpoint, monitors the process of DNA replication and stops it if DNA breaks or stalled replication forks are detected (Fig. 42-15). A third aspect of these checkpoints is to delay the onset of mitosis until the replication of the genome is complete.

These checkpoints have similarities and differences compared to other DNA damage checkpoints. For example, unlike the G1 DNA damage checkpoint, p53-mediated transcription is not required. However, as is the case in the G1 and G2 phases, if DNA breaks are detected, the kinases ATM and ATR and their downstream effectors phosphorylate Cdc25A, triggering its rapid destruction mediated by SCFb-TrCP (see Fig. 41-12). The resulting inactivation of Cdks during the S phase prevents Cdc45p from loading onto prereplication complexes and blocks the initiation of new replication forks. This prevents replication forks from running across DNA breaks, which could lead to chromosome breaks and the loss of genetic material, with lethal consequences for the cell.

The intra-S checkpoint also detects stalled replication forks. Why would a replication fork stall? This could happen if, for example, the fork encounters a damaged DNA base or bases that it cannot “read.” Stopping the fork gives time for the DNA repair machinery to detect and repair the damage (see Box 43-1). Stalled forks activate the ATR kinase, leading to Cdc25A inactivation as described earlier and the cessation of new fork initiation. In addition, through an unknown mechanism, the intra-S checkpoint also has a mechanism to protect existing forks from disassembly. This is important because replication forks contain unwound and nicked DNA molecules that could be turned into breaks if the structure disassembled.

ATR kinase is activated by binding single-stranded DNA associated with RPA. As this is normally present at every replication fork during DNA replication, ATR signaling appears to be an intrinsic aspect of the replication process. It has been proposed that ATR normally limits excessive firing of replication origins by keeping the concentration of Cdc25A low and coordinates replication with other cell-cycle events. For example, Chk2, a kinase activated by ATM and ATR (Fig. 42-15), is required for the dependence of late origin firing on completion of early replication. This role during the normal S phase could explain why ATR is essential for the life of the cell.

Synthesis of the Histone Proteins

Chromatin contains approximately equal weights of DNA and core histones. Human cells require about 62 × 106 copies of each core histone, assuming a genome size of 6.2 × 109 bp and 200 bp per nucleosome. Because about 90% of histone transcription occurs during the S phase, enormous amounts of these proteins are made during a relatively brief period. Histone synthesis apparently keeps pace, in part, because there are about 40 sets of histone genes.

Synthesis of histones during the S phase is tightly coupled to ongoing DNA replication. If replication is blocked either by addition of drugs or by temperature-sensitive mutants, histone synthesis declines abruptly shortly thereafter. This link between histone synthesis and DNA replication appears to involve at least three components (Fig. 42-16).

First, transcription of the histone genes rises threefold to fivefold as cells enter the S phase. Each histone gene has a cell-cycle-responsive element in its promoter to which a transcription factor binds specifically during the S phase.

Second, theprocessing of histone mRNAs increases sixfold to 10-fold as cells enter the S phase. Histone mRNAs are not polyadenylated, and the primary transcripts are considerably longer than the mature forms. Processing of the 3′ end of histone pre-mRNAs involves the U7 snRNP (see Chapter 16), a portion of which recognizes histone mRNA and base-pairs with it during processing. Cell-cycle-dependent regulation of processing appears to involve changes in the accessibility of the necessary portion of U7 snRNA. This region is inaccessible in G0 cells but becomes accessible when cells that have reentered the cycle begin the S phase. The mechanism for this change in RNA conformation is not known.

Third, changes in the stability of the mRNA also regulate histone synthesis. Normally, the level of histone mRNA on free polysomes drops rapidly by about 35-fold as cells enter the G2 phase. If DNA synthesis is interrupted during the S phase, a region at the 3′ end of the mature message somehow targets the mRNA for degradation. If this region is removed from the 3′ terminus of the histone mRNA, the normal link between ongoing replication and mRNA stability is lost. Furthermore, this sequence, transposed onto the 3′ terminus of a globin mRNA, renders that mRNA sensitive to degradation if DNA synthesis is blocked. Degradation of histone mRNA requires ongoing protein synthesis, and it has been speculated that histones themselves participate in the control.

As discussed in Chapter 13, specialized variant forms of histones are synthesized and inserted into the chromatin outside of the S phase. These histones are encoded by mRNAs with introns and normal poly(A) tails and are therefore not processed by the specific S phase–associated pathway (see Chapter 16). Their insertion into chromatin is typically correlated with RNA transcription rather than DNA replication.