Advertisement
Perspective Series Free access | 10.1172/JCI19552
1National Institute on Aging, NIH, Baltimore, Maryland, USA2 Department of Therapeutic Radiology and Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
Address correspondence to: Peter M. Glazer, Department of Therapeutic Radiology, Yale University School of Medicine, P.O. Box 208040, New Haven, Connecticut 06520-8040, USA. Phone: (203) 737-2788; Fax: (203) 737-2630; E-mail: peter.glazer@yale.edu.
Find articles by Seidman, M. in: JCI | PubMed | Google Scholar
1National Institute on Aging, NIH, Baltimore, Maryland, USA2 Department of Therapeutic Radiology and Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
Address correspondence to: Peter M. Glazer, Department of Therapeutic Radiology, Yale University School of Medicine, P.O. Box 208040, New Haven, Connecticut 06520-8040, USA. Phone: (203) 737-2788; Fax: (203) 737-2630; E-mail: peter.glazer@yale.edu.
Find articles by Glazer, P. in: JCI | PubMed | Google Scholar
Published August 15, 2003 - More info
Triplex-forming oligonucleotides (TFOs) can bind to polypurine/polypyrimidine regions in DNA in a sequence-specific manner. The specificity of this binding raises the possibility of using triplex formation for directed genome modification, with the ultimate goal of repairing genetic defects in human cells. Several studies have demonstrated that treatment of mammalian cells with TFOs can provoke DNA repair and recombination, in a manner that can be exploited to introduce desired sequence changes. This review will summarize recent advances in this field while also highlighting major obstacles that remain to be overcome before the application of triplex technology to therapeutic gene repair can be achieved.
DNA triple helices form in a sequence-specific manner on polypurine:polypyrimidine tracts (1–3) which are widespread in mammalian genomes (4–6). The third strand lies in the major groove of an intact duplex (Figure 1) and is stabilized by two Hoogsteen hydrogen bonds between third strand bases and the purines in the duplex (3, 7). The third strand may consist of pyrimidines, or purines, depending on the nature of the target sequences (Figures 2 and 3). In the pyrimidine (or Y.R:Y) motif, a homopyrimidine oligonucleotide binds in a direction parallel to the purine strand in the duplex, with canonical base triplets of T.A:T and C.G:C. In the alternate purine motif (R.R:Y), a homopurine strand binds antiparallel to the purine strand, with base triplets of A.A:T and G.G:C (8, 9).
Diagrammatic depiction of a DNA triple helix, with the third strand binding in the major groove.
Diagram of the pyrimidine motif for triple helix formation. (a) Orientation of the third strand in the pyrimidine triple-helix motif. Note that the third strand is parallel in terms of 5′ to 3′ orientation with respect to the purine-rich strand of the duplex target. (b) Base triplets formed in the pyrimidine motif and illustration of the Hoogsteen hydrogen bonds that stabilize triple-helix formation.
Diagram of the purine motif for triple-helix formation. (a) Orientation of the third strand in the purine triple-helix motif. Note that the third strand is oriented antiparallel in terms of 5′ to 3′ direction with respect to the purine-rich strand of the duplex target. (b) Base triplets formed in the purine motif and illustration of the reverse Hoogsteen hydrogen bonds that stabilize triple-helix formation.
The demonstration that synthetic oligonucleotides could form stable triplexes (10, 11) suggested that TFOs could be developed as sequence-specific gene targeting reagents in living cells (12–17). However, a number of obstacles have been, and still must be, overcome. Triplex chemistry and biochemistry impose fundamental limitations to TFO activity in the nuclear environment, and target options are limited to polypurine:polypyrimidine sequences. In addition, it has been shown that nucleosomes can inhibit triplex formation (18–20). Consequently, accessibility of genomic targets is an important issue.
Recent developments in nucleoside and oligonucleotide analogue chemistry show great promise for solving problems of TFO bioactivity and target options. We will discuss the challenges posed by the cellular environment and target restrictions, and some of the chemistry that may address these issues. We will also consider the activity of TFOs in biological assays. We will not attempt an in depth review of the chemistry of TFOs. Instead, we will stress some of the strategic themes and cite a few specific examples. The interested reader should see (21–23) for excellent and comprehensive discussions of TFO chemistry.
Biological applications of TFOs are compromised by fundamental biophysical considerations, as well as limitations imposed by physiological conditions. Triplex formation involves the approach and binding of a negatively charged third strand to a double-negatively charged duplex. Neutralization of charge repulsion is typically provided experimentally by levels of Mg++ (5–10 mM) (24) that are much higher than what is thought to be available in cells (25). Furthermore, triplex formation involves conformational changes on the part of the third strand, and some distortion of the underlying duplex (26–28). Pyrimidine motif triplexes are unstable at physiological pH because of the requirement for cytosine protonation that occurs at relatively acidic pH (pKa = 4.5). This is necessary for the second Hoogsteen hydrogen bond, although the resultant positive charge apparently makes the more important contribution to triplex stability (29). Pyrimidine motif triplexes containing adjacent cytosines are often less stable than those with isolated cytosines. Traditionally this has been ascribed to charge-charge repulsion effects (30), although a recent study suggests incomplete protonation of adjacent cytosines may be the critical factor (31). In addition, purine motif third strands (which are G rich) may form G tetrads in physiological levels of K+, which inhibit triplex formation (32). All these factors impose kinetic barriers on triplex formation and reduce the stability of triplexes once formed (most triplexes, even under optimal conditions in vitro, are less stable than the underlying duplex).
The use of 5-methylcytosine partially alleviates the pH restriction of TFOs in the pyrimidine motif (33, 34). This is thought to be due to the contribution of the methyl group in the major groove to base stacking (35), and/or the exclusion of water molecules from the groove (36). Cytosine has also been replaced with analogues such as 8-oxoadenine (37), pseudoisocytidine (38), and a 6-keto derivative of 5-methylcytydine (39). These can form two hydrogen bonds at physiological pH, and may be useful for sequence targets with adjacent cytosines. Two groups have presented interesting data regarding 2-aminopyridine (2AP) (40, 41). This analogue is protonated at neutral pH and stable triplexes were formed by TFOs containing adjacent 2AP residues on targets with adjacent cytosines. These results are consistent with the argument that proton competition between adjacent cytosines is the basis for the difficulty of targets with adjacent cytosines (31). Although the biochemical data with these analogues are encouraging, only 5-methylcytosine has been tested in biological assays.
RNA third strands formed more stable pyrimidine-motif triplexes than the corresponding DNA strands (42), which prompted the use in third strands of 2′-O-Methoxy (OMe) sugar residues (27, 43), and recently, 2′, 4′ bridged ribose substitutions (44). These modifications preorganize the third strand in a conformation that is compatible with triplex formation and imposes minimal distortion on the underlying duplex (28, 44). Other modifications also improve stability. Intercalators linked to TFOs improve binding (45), propynyl-deoxyuridine reduces the Mg++ dependence (46), as does replacement of the ribose with a morpholino analogue (47).
Backbone modifications that replace the phosphate linkage (48), or a bridging oxygen atom with a nitrogen (49) improve TFO binding in vitro. Replacement of a nonbridging oxygen atom in the backbone with a charged amine reduces the likelihood of self-structure formation of purine TFOs in physiological K+ (50, 51). A positive charge on a thymidine analogue (52), or linkage of positively charged moieties to TFOs, also enhances triplex stability (53, 54).
A positive charge and an RNA-like sugar conformation have been joined in the 2′-O-(2-aminoethyl) (AE) ribose derivatives developed by Cuenoud and colleagues (55–57). TFOs carrying these substitutions show enhanced kinetics of triplex formation and greater stability of the resultant complex at physiological pH and low Mg++ concentration. NMR analysis indicates a specific interaction between the positively charged amines (at physiological pH) and phosphate groups in the purine strand of the duplex (58). A related approach has been described in a thymidine analogue containing both 5-aminopropylargyl and 2′-aminoethoxy moieties (59).
In early biological studies, TFOs were envisioned as tools to inhibit gene expression by blocking transcription initiation or elongation (the “antigene” strategy) (60). Triplex formation within promoter sites has been shown to block transcription factor access and inhibit gene activation in vitro, and several studies have demonstrated that TFOs can decrease gene expression in mammalian cells in a directed way (see ref. 61 for a recent description of this approach). As an alternative strategy for genetic manipulation, we, and others, have investigated the use of TFOs to mediate genome modification, resulting in a change in target sequence (13, 17, 62–70). This has the advantage of introducing permanent changes in the target sequence, which simplifies interpretation of the experiments. However, more importantly, it also has potential as a gene knockout tool and as a means for gene correction.
In our initial work, the third strand was conjugated to a mutagen, such as psoralen, so that the sequence specificity of the third strand binding could be conferred on the action of the mutagen (71, 72). Psoralen-conjugated TFOs transfected into monkey COS cells could induce base pair–specific mutations within a supF mutation reporter gene in a simian virus 40-based plasmid episome in the cells, at frequencies in the range of 1–5% (16). The key finding in this work was that the binding affinity of the TFO to its target site, as measured in vitro, was highly correlated with its intracellular activity. TFOs with equilibrium dissociation constants (Kd’s) of approximately 10–9 M were active; those with Kd’s of 10–6 M were not.
This work was subsequently extended to psoralen-TFO–mediated knockout of chromosomal genes. In one study, the supFG1 reporter integrated into the chromosome of mouse fibroblasts was used as a target (73). Again, only high-affinity TFOs were active, achieving targeted mutagenesis frequencies of 0.1%. In the supFG1 experiments, essentially unmodified G-rich oligonucleotides (except for 3′ end capping) designed to bind in the antiparallel purine motif were used, but it should be noted that in those experiments the 30-bp polypurine site in the supFG1 gene afforded the possibility of high affinity third-strand binding by G-rich TFOs synthesized with standard DNA chemistry (16).
We have determined the activity of variously modified pyrimidine motif psoralen-TFOs, designed to target a sequence in the endogenous hamster hypoxanthine phosphoribosyltransferase (HPRT) gene, in HPRT knockout assays (17). TFOs with uniform 2′-O-methyl substitutions were inactive, but addition of a pyrene intercalator conferred knockout activity with HPRT–4 clones recovered at frequencies in the range of 10–4 to 10–3. Sequence analysis of the mutant clones confirmed the localization of mutations to the target region (17). More recently, we have prepared psoralen-TFOs containing several AE residues as well as 2′-O-methyl substitutions. These mixed substitution TFOs form triplexes at lower levels of Mg++ than required for the TFOs with only the 2′-O-methyl residues. Thermal melting experiments showed that the AE triplexes were more stable than the 2′-O-methyl only triplexes, and, notably, were more stable than the underlying duplex at physiological pH. They were also more stable in the cellular compartment in which replication and mutagenesis occur (62, 74). Most importantly they are quite active in the HPRT knockout assay (62). These results indicate that the introduction of positive charge can confer biological activity on pyrimidine motif TFOs.
Other oligonucleotide modifications also support biological activity. For example, a psoralen-linked pyrimidine TFO with a phosphoramidate backbone (in this case, substitution of a bridging oxygen with nitrogen) was shown to have statistically significant activity in a yeast reversion assay (75). Psoralen-linked G-rich TFOs in the purine motif, with a nonbridging phosphoramidate substitution (N,N-diethyl-ethylenediamine), were active in targeted mutagenesis experiments at a short (10 bp) homopurine/homopyrimidine site in mammalian cells that otherwise is not susceptible to targeting by unmodified phosphodiester oligonucleotides (67). Improved binding activity in reduced Mg++ is probably the basis for the bioactivity of the TFOs carrying these modifications.
In addition to the use of psoralen-conjugated TFOs in site-directed mutagenesis studies in mammalian cells and in yeast, a few groups have used these reagents as tools for direct physical demonstrations of triplex formation in permeabilized mammalian cells. Such studies have used PCR and restriction enzyme protection assays to argue for the formation of TFO-targeted crosslinks in genomic DNA in chromatin (76–78). These results, and the mutagenesis experiments, show that the packaging of the DNA into chromatin is not an absolute barrier to gene targeting with TFOs.
In the course of our work with psoralen-TFOs, we observed that unconjugated TFOs were also capable of inducing mutations in the target gene in an SV40 vector, at least when the binding affinity was sufficiently high (79). This effect was shown to be a consequence of the stimulation of DNA repair by the formation of the triple helix, which seems to be recognized by the nucleotide excision repair complex as a lesion.
Recently, Vasquez et al. demonstrated that systemically administered TFOs (without psoralen or any other DNA reactive conjugate) could induce mutations at specific genomic sites in the somatic cells of adult mice (68). In this work, a 30-mer purine TFO, with a 3′ propanolamine group to prevent exonuclease-mediated degradation, was injected intraperitoneally for 5 consecutive days. The TFO was targeted to chromosomal copies of an integrated supF reporter gene. After an additional 10 days, the mice were sacrificed, and tissues were taken for mutation analysis. In general, mice treated with the sequence-specific TFO had a fivefold elevated mutation rate in the targeted supF gene, but not in the nontargeted cII gene. All tissues tested showed TFO-induced mutagenesis except the brain, which had no mutagenesis over background, consistent with TFOs inability to cross the blood-brain barrier. This work established that site-directed DNA binding molecules, upon systemic administration, can mediate gene targeting and gene modification in vivo in whole animals.
Based on the concept that third-strand binding, with or without psoralen coupling, can trigger DNA repair, we hypothesized that such binding might also be recombinogenic due to the production of repair-dependent DNA strand breaks. Using an SV40 vector containing two mutant copies of the supF gene, we found that both psoralen-TFOs (64) and non-psoralen TFOs (63) could trigger recombination within an SV40 virus genome. This induced recombination, in the case of the TFOs, without psoralen, was found to be dependent on the presence of functional Xeroderma pigmentosum group A protein (XPA) protein (63), the key recognition factor in nucleotide excision repair, a result consistent with our hypothesis.
These results were extended to a chromosomal target, in which two mutant thymidine kinase (TK) genes were integrated into a single chromosomal site in mouse fibroblasts (69). Transfection (via cationic liposomes) of the cells with high-affinity TFOs targeting a region between the two TK genes yielded recombination at a frequency of approximately 30 × 10–6, about sevenfold above background. When the TFOs were microinjected into the nuclei of the cells (about 70,000 copies/cell), the yield of recombinants increased to 1–2%, more than 1000-fold over background. Analysis of the recombinant clones revealed all the recombination events involved gene conversion rather than cross-over recombination. Similar work to use TFOs to stimulate recombination has been described by Wilson and colleagues using a locus in CHO cells containing duplicated APRT genes as a target (80). Besides providing proof-of-principle, the results obtained in the TK system highlight another challenge in using triplex technology for gene repair: achieving efficient delivery into the cell nucleus.
The observation of the ability of third strand binding to provoke DNA repair and stimulate recombination led us to develop a strategy to mediate targeted gene conversion using a TFO linked to a short DNA fragment homologous to the target site (except for the base pair to be corrected) (70). In this bifunctional molecule, the TFO domain mediates site-specific binding to target the molecule to the desired gene. This binding also triggers repair to sensitize the target site to recombination. The tethered homologous donor fragment can participate in recombination and/or gene conversion with the target gene to correct or alter the nucleotide sequence.
Using a bifunctional oligomer with a 40-mer donor domain and a 30-mer TFO domain, correction of a single base-pair mutation in the supF reporter gene within an SV40 vector in COS cells was achieved (70). Correction frequencies were in the range of 0.1–0.5% with the full bifunctional molecule. Oligomers consisting of either domain above or of either domain substituted with heterologous sequences reduced activity by tenfold or more. The donor domain alone consistently did mediate some gene correction, as would be expected, based on the known ability of short DNA fragments to mediate some level of recombination (81–83). However, there was a clear synergism due to combination with the TFO domain.
In vitro studies in human cell-free extracts further demonstrated that triplex formation could induce recombination of the target site with a short donor oligonucleotide, and they also revealed a requirement for XPA and Rad51 function in the pathway (84). Importantly, the in vitro work also revealed that the donor DNA does not need to be covalently linked to the TFO, suggesting that the ability of the third strand to stimulate repair and recombination is a key property, presumably because of the production of strand breaks as recombinogenic intermediates.
Several other in vitro studies have demonstrated synergistic binding of bifunctional oligonucleotides containing domains designed to form triplexes as well as duplexes with a target DNA (66, 85, 86). Interestingly, in one strategy, the triplex-forming domain was linked to the duplex-forming domain by annealing via a short stretch of complementary nucleotides (86), rather than via covalent linkage (the latter either by postsynthetic coupling or by cosynthesis as a single, long oligonucleotide).
Although we have just begun to explore the applications of TFOs for genome modification, we would argue that two fundamental issues have been resolved in recent years: that it is possible to prepare TFOs with gene-targeting activity in vivo, and that bioactivity implies at least some degree of target accessibility. Thus, although there is a great deal to be done to extend these conclusions, expansion of target options to mixed purine/pyrimidine sequences would appear to be the most important challenge facing the field. We will discuss some of the efforts to expand the triplex-binding code, although it should be pointed out that only a few compounds have been tested in bioassays. For thorough accounts of this aspect of the field the reader should consult recent reviews (21, 23).
Purine bases engaged in Watson Crick pairing can form two additional hydrogen bonds, while duplex pyrimidines can only form one additional hydrogen bond. These hydrogen bonding patterns are the basis for much of the stability of conventional triplexes, and one of the reasons for the instability of triplexes on mixed sequences. An additional problem presented by T:A inversions is the projection into the major groove by the 5-methyl group of T resulting in steric hindrance to groove occupancy by third strands. Candidate compounds for binding at inversion sites fall into different categories: natural bases, intercalators, analogues that make a single hydrogen bond with the inverted pyrimidine base, analogues that make hydrogen bonds with the purine base, and, perhaps, also with the pyrimidine base (base pair binders).
Natural bases in parallel triplexes have been shown to form triplets at inverted base pairs, specifically G.T:A, and T.C:G (87, 88). Although these have been incorporated in TFOs employed in biological experiments, triplexes containing them are not as stable as perfectly matched triplexes (89, 90). Thus we found that a TFO containing G, for triplet formation at a T:A inversion site, was not effective in our hprt gene knockout assay (17). The T.C:G triplet can form in both parallel and antiparallel triplexes. Use in parallel triplexes is compromised by the lack of sequence stringency (since T also forms triplets with A:T pairs). However this limitation does not apply to antiparallel triplexes and we have used T in a biologically active purine motif TFO (AG30, described above) for triplet formation at T:A sites in an otherwise perfect polypurine target (16).
Intercalators, incorporated into the TFOs and located adjacent to mismatch sites, have been used to stabilize natural base triplets with inverted base pairs. An acridine derivative stabilized either inversion depending on its position relative to the site (45). A naphthalene-based intercalator linked to 5-methylcytosine, designed to form one hydrogen bond and also intercalate, showed selectivity for C:G inversions (91). We introduced a pyrene derivative at a T:A inversion site in our active TFO in our initial demonstration of HPRT gene knockout (17). It is clear that intercalators can enhance triplex stability, however it is likely that this will be at the cost of some degree of sequence specificity. Precisely this observation was made with one of the earliest efforts to construct an analogue for expansion of target options. A benzamidophenyl derivative of imidazole formed stable triplets at both T:A and C:G base pairs by intercalation next to the target base pairs (92, 93).
There have been a number of attempts to synthesize compounds with hydrogen bonding potential that would form triplets with inverted pairs with good affinity and specificity. There have been some promising developments with the C:G inversion, particularly in the pyrimidine motif. A thymidine analogue designed to form a hydrogen bond with the C showed selectivity relative to the other base pairs (94). Similar results with a related compound have been reported in one of the few instances in which the new sugar chemistry has been combined with a base analogue (95). A base pair recognition strategy was pursued by Miller and colleagues who prepared several N4-substituted cytosine derivatives with side-chain extensions designed to make hydrogen bonds with both the cytosine and the guanine in the inverted base pair (96, 97). This approach is conceptually attractive because formation of hydrogen bonds with both bases should enhance the specificity of the interaction with the intended base pair, as indeed was observed with some of the derivatives (97). While these and other analogues show promise for solving the C:G inversion they do not form triplets as strong as the canonical C+.G:C. However, this is, in part, because they do not carry a positive charge. Combination with sugar and backbone modifications that introduce positive charge may overcome this limitation.
Analogues that bind C:G interruptions in antiparallel purine–motif triplexes have also been described. An example of an analogue that can reach across the major groove and form hydrogen bonds with the guanine base in the inverted C:G pair is 2′-deoxyformycin A (98). Another interesting approach involved the replacement of natural bases with azole derivatives. The use of the smaller aromatic ring was intended to overcome some of the steric problems associated with base pair inversions (99). They were able to demonstrate enhanced binding by the substituted TFOs to targets containing inversions, but base pair discrimination was poor. However, this may be a useful point of departure for side-chain derivatives that would improve selectivity.
Current efforts to overcome the T:A inversion have been less successful. A recent effort employed a post synthetic modification strategy, at an internal acyclic linker, to prepare a number of candidates (100). Although this approach has a great deal of potential, the most effective binders recovered in the study were intercalators, which failed to distinguish between T:A and C:G base pairs in the target sequence. This result is similar to those reported previously and suggests that in further work analogues with intercalative potential should be avoided. Finally, a pyridazine derivative on an acyclic linker has been shown to form triplets with T:A pairs (101). This analogue showed reasonable base pair discrimination, but was analyzed in the context of a peptide nucleic acid, and has not been examined in an oligonucleotide.
While this is a cursory overview, it is striking how much thoughtful chemistry has been described, some with considerable promise, and how rare has been the analysis of these developments in biological assays.
Overall, the work to date suggests that TFOs can be used to mediate site-specific genome modification. This capacity derives not only from the ability of TFOs to bind as third strands with sequence specificity but also from the ability of the resulting triple helices, or TFO-mutagen complexes, to provoke repair and recombination, leading to directed mutagenesis, recombination, and, potentially, gene correction. It seems likely that recent advances in oligonucleotide chemistry have considerable potential for the development of TFOs with robust gene targeting activity. This will require coordinated effort between the chemists and biologists, but recent data suggest that this effort will be rewarded.
B.J. Gold and colleagues have recently described a novel approach towards the development of TFOs that can bind general sequences (102).
This work was supported by the NIH (grants GM54731 and CA64186 to P.M. Glazer).
Conflict of interest: The authors have declared that no conflict of interest exists.
Nonstandard abbreviations used: triplex-forming oligonucleotide (TFO); 2-aminopyridine (2AP); 2′-O-methoxy (OMe); 2-aminoethyl (AE); equilibrium dissociation constant (Kd); hypoxanthine phosphoribosyltransferase (HPRT); Xeroderma pigmentosum group A protein (XPA); thymidine kinase (TK).
Genetic repair. Bruce A. Sullenger, Series Editor
Ribozyme-mediated revision of RNA and DNAMeredith B. Long et al.
Genetic repair. Bruce A. Sullenger, Series Editor
Targeted genetic repair: an emerging approach to genetic therapyBruce A. Sullenger
Messenger RNA reprogramming by spliceosome-mediated RNA trans-splicingMariano A. Garcia-Blanco
Therapeutic potential of antisense oligonucleotides as modulators of alternative splicingPeter Sazani et al.
The potential for gene repair via triple helix formationMichael M. Seidman et al.
Sequence-specific modification of genomic DNA by small DNA fragmentsDieter C. Gruenert et al.
Targeted gene repair – in the arenaEric B. Kmiec