|
|
||||||||
|
Plant Physiol, March 2001, Vol. 125, pp. 1293-1303 Mutator Transposase Is Widespread in the Grasses1Department of Plant and Microbial Biology, 111 Koshland Hall, University of California, Berkeley, California 94720
Although the Mutator (Mu) system is well characterized in maize (Zea mays), very little is known about this highly mutagenic system of transposons in other grasses. Mutator is regulated by the MuDR class of elements, which encodes two genes, one of which, mudrA, has similarity to a number of bacterial transposases. Experiments in our laboratory, as well as database searches, demonstrate that mudrA sequences are ubiquitous and diverse in the grasses. In several species it is clear that multiple paralogous elements can be present in a single genome. In some species such as wheat (Triticum aestivum) and rice (Oryza sativa), mudrA-similar sequences are represented in cDNA databases, suggesting the presence of active Mu transposon systems in these species. Further, in rice and in sorghum, mudrA-like genes are flanked by long terminal inverted repeats, as well as the short host sequence direct repeats diagnostic of insertion. Thus, there is ample evidence that systems related to Mu in maize are at least potentially active in a wide variety of grasses. However, the mudrB gene, though important for Mu activity in maize, is not necessarily a component of Mu elements in other grasses.
The Mutator
(Mu) system is a highly mutagenic transposable element
system that is used extensively for gene identification and mutagenesis
in maize (Zea mays). Although a great deal is known about
Mu in maize (for review, see Bennetzen, 1996 In maize the Mu system is composed of a number of different
variants, all of which have similar 200-bp terminal inverted repeats (TIRs), but each of which contains unrelated internal sequences (for
review, see Chandler and Hardeman, 1992 Because grass phylogeny has been the subject of intense scrutiny
(Clayton and Renvoize, 1986 Our PCR strategies are designed to amplify elements that are most closely related to mudrA in maize. Thus, we would only expect to identify a subset of these sequences, and variations in specific sequences could result in a lack of amplification in any given species. We use DNA-blot hybridization whenever possible to limit this bias. However, even with this control, this work does not provide a comprehensive view of all mudrA-similar sequences. Nevertheless, analysis of our limited data set makes clear that there are multiple families of mudrA-similar sequences in the grasses, each one of which diverges to varying degrees from mudrA in maize.
Potentially Functional mudrA-Homologous Sequences Are Widespread among the Grasses Our initial survey of the grasses employed DNA blots that were
probed with a conserved portion of mudrA from maize and
washed at low stringency (see "Materials and Methods"; Fig.
1A). DNA quantities were not generally
adjusted for C value, although a relatively low amount of DNA was used
for Coix lacryma and sorghum (Sorghum bicolor),
both of which have a lower C value than does maize (Kellogg, 1998
To confirm these results, DNA samples of Zea luxurians, Tripsacum dactyloides, C. lacryma (at two concentrations), Muhlenbergia macroura, and Sporabolus airoides (same subtribe as Muhlenbergia) were blotted and probed with a portion of mudrA at a medium stringency (Fig. 1B). The results clearly showed that hybridization at this stringency was restricted to Z. luxurians (lane 1), T. dactyloides (lane 2), and M. macroura (lane 5). These results indicate that there is incongruity between the species phylogeny and the distribution of mudrA-similar sequences. Conserved mudrB Sequences Are Less Widespread than mudrA Sequences In contrast to mudrA, mudrB-similar sequences do not appear to be widely distributed, and the degree of hybridization matches species phylogeny. At low stringency only the closely related Zeas hybridize well to mudrB (Fig. 1C, lanes 1-3). DNA from a genus in the same subtribe as maize, Tripsacum (lanes 4 and 5), hybridized less efficiently, and DNA from C. lacryma (lane 6) less still. These mudrB hybridization results are as expected for any genomic sequence in these species. In contrast to the results using mudrA as a probe, DNA from species of the Muhlenbergia genus (lanes 10 and 11) does not hybridize to mudrB probe at this stringency. PCR amplification using primer pairs designed to span the region between mudrA and mudrB were successfully used to produce amplicons that contain sequences of both genes in Z. luxurians and Z. diploperennis, demonstrating that both genes are found together in these species, presumably within MuDR transposons. It is interesting that although the genic regions are quite well conserved, the intergenic region, which contains a great deal of internally repetitive sequence, is poorly conserved (data not shown). Amplification from any species more distantly related to maize than the Zeas, using a variety of mudrB specific primers, has as yet been unsuccessful. PCR Amplification of mudrA Orthologs To gain a more detailed understanding of mudrA-similar
sequences in various grasses, PCR primers were designed that could amplify a rice mudrA ortholog with a good open reading
frame, as well as mudrA from maize. Selection of primer
sites was based on the high degree of conservation of certain amino
acids motifs present in previously identified rice (Ishikawa et al.,
1996 Using these primers it was possible to amplify product from all the major subfamilies of grasses and most species examined (Table I). In all cases, an amplicon of the expected size proved to contain sequences similar to mudrA. Additional fragments of other sizes were produced in several of these species, but in all cases those amplicons proved to be completely unrelated to mudrA (data not shown). Because our primers amplified only a portion of the mudrA gene in these species, it is not known whether any particular sequence we obtained lies between TIRs. However, in several species, DNA-blot analysis revealed the presence of multiple fragments, consistent with their identity as transposable elements (Fig. 1A).
Selective Constraints on Sequence Variation of mudrA Orthologs Although many of these nucleotide sequences are quite different
from mudrA in maize, ranging from 50% to 60% identity,
many of the inferred amino acids are conserved among all or nearly all
orthologs (Fig. 2), and the frequency of
similar and identical amino acids is considerably higher than that of
identical nucleotides in all sequences examined. The nucleotide
sequence of the mudrA sequence from M. macroura,
for instance, is 65% identical to mudrA in maize, but the
protein sequence is 78% similar to MURA in maize. Several of the
products we amplified encoded portions of intact open reading frames,
consistent with continued function (Table I). It is not surprising that
a number of the conserved amino acids are identical or similar to those
already identified as being part of a Mutator motif present
in mudrA and a large number of bacterial transposases (Eisen
et al., 1994
mudrA Sequences Are Organized into Distinct Subfamilies When these mudrA sequences are subjected to phylogenetic analysis, they fall into several distinct groups or classes (Fig. 3) that are often independent of species phylogenetic relationships. Class I includes mudrA from maize and its close relatives, as well as a sequence (MoPV-1) from P. virgatum (a different subtribe), and all of the sequences from the genus Muhlenbergia (a different subfamily). Class II includes sequences from both species of Setaria we examined (MoSaN-1and MoSS-1), as well as sequences from C. lacryma (MoCL-1), Z. diploperennis (MoZD-2), and V. zizanioides (MoVZ-1). Class III includes one of the sequences from C. lacryma (MoCL-2), one from S. anceps (MoSaN-2), one from P. virgatum (MoPV-2), as well an element from rice (MoOS-557, accession no. BAA89557). Sequences from the Pooidea generally group together, as do those from the three species of Bambusoideae we examined.
It is quite apparent that phylogenetic trees made from the sequences we amplified do not match that of the grasses in which they are found (Fig. 4). Most striking, the sequences amplified from three Muhlenbergia species (different subfamily than maize), as well as one sequence we have from P. virgatum (different tribe than maize) are consistently more similar to mudrA in maize than are sequences from species as closely related to maize as C. lacryma (same subtribe as maize) and even the sequences from Z. luxurians and Z. diploperennis (same genus as maize).
Individual genomes can carry more than one class of sequence. For instance, Z. dactyloides and has class I and class II elements, and P. verbatim has class I and class III elements. C. lacryma has representatives of class II and class III elements, but it appears to lack class I elements, the class to which mudrA from maize belongs. The lack of amplification of class I elements in C. lacryma could be due to incomplete sampling due to bias introduced by PCR. However, DNA from the Muhlenbergia species hybridizes more intensely to mudrA than does DNA from C. lacryma (Fig. 1B), suggesting that class I elements are indeed missing in this species. In a similar manner, we were unable to hybridize to sorghum DNA using mudrA from maize, and our PCR from this species using a several primer pairs was unsuccessful. Database searches revealed that there are at least two sorghum elements that have mudrA-similar sequences (MoSB-1 and MoSB-2; Table II), but they are much more distantly related to MuDR than any of the sequences we successfully amplified. It is interesting that one of the two elements, MoSB-1, is closer to a maize element recently identified in the Chandler laboratory, MuSE (M. Stam, D. Selenger, and V.L. Chandler, personal communication) than it is to MuDR in maize (data not shown). The second element, MoSB-2, is quite similar to the group that includes MoOS-557 (Fig. 5B). As in the case of C. lacryma, these data suggest that sorghum may have lost one class of mudrA-containing element, but retained others.
Our observation that species such as C. lacryma and P. vergatum can carry more than one distinct class of mudrA orthologous sequence is part of a larger pattern of inter- and intraspecific variation. Database searches reveal that there are multiple paralogous mudrA-similar sequences in rice and maize that are even more remotely related to mudrA than our PCR-amplified sequences. We refer to all elements carrying mudrA-similar sequences and long TIRs or their derivatives as MuLEs. Like the sequences we PCR amplified, phylogenetic analysis of these elements does not match that of the species in which they are found (Fig. 5). The presence of these elements suggests that MuLE elements represent a broadly diversified group of transposable elements that can coexist as potentially active elements within a single genome. Thus, the various classes identified in our analysis may represent an earlier stage in a continuous process of element diversification. In maize, in addition to MuDR, there is a
MuLE identified as part of a polycistronic message that
includes a Hox1A homolog (Comelli et al., 1999
A recently identified transposon, Jittery, in maize has
clear similarities to mudrA, and contains characteristic
long inverted repeats (H. Dooner; accession no. AF247646). However, it
is more similar to sequences in rice and Arabidopsis than to
MuDR. In rice, an element with TIRs and host sequence direct
repeats (MoOS-J1, accession no. AC078840) clusters with
Jittery in a phylogenetic tree (Fig. 5). It is interesting
that Jittery is also more closely related to a gene in
Arabidopsis, Far1 (accession no. AAD51282), than it is to
MuDR in maize. Mutations in Far1 are deficient in
the response to far-red light (Hudson et al., 1999 Rice contains an element, RMu1-IR36 (Ishikawa et al., 1996 The TIRs of the each of these widely disparate elements have few sequences in common (Fig. 7), although they are all longer on average that the TIRs of other plant transposons (Table II), suggesting a requirement by the mudrA class of transposases for relatively long TIRs. With the exception of Trap whose TIRs are clearly degenerate, the left-right homology of these TIRs range from 85% to 95% over their length. Further, in all cases there is evidence of host sequence direct repeats, consistent with insertion. Phylogenetic analysis of these TIRs reveals that although the tree derived from parsimony analysis is not well supported at all nodes, it is generally consistent with the tree derived from the transposases (Fig. 5B). It is notable that the MoSB-2 (from sorghum) TIRs are 76% identical to those of MoOS-557 (from rice) over the 185-bp length of the MoSB-2 TIR. It is interesting that this TIR sequence in sorghum is identified as a high-copy repeat sequence in the database ("HCSR-2," accession no. AF061282).
The database for rice and maize is as yet incomplete, and at least three distinct branches of MuLEs have already been identified in both of them. It is likely that additional subfamilies will be identified as additional sequences for these species are obtained. It is clear from the data already available that MuLE elements represent a broadly diversified and widely distributed group of transposable elements. It is interesting to note that of all the elements identified to date, only MuDR in maize contains the mudrB gene. The mudrB gene, which is required for germinal insertions of Mu elements in maize, matches nothing in the database. Analysis of the natural history of this gene will be the subject of a future report. Database searches also suggests that mudrA-similar transcripts may be
ubiquitous. Our wheat (Triticum aestivum) product is 96%
identical over 494 bp to a wheat cDNA (BE497524), suggesting that the
element in wheat is expressed. In rice, RMu1-A23 (accession no. AB023047) is 99% identical to a cDNA from rice (accession no.
C98506) and a rice cDNA with homology to mudrA has been isolated previously (Yoshida et al., 1998 The presence of such a diverse array of Mu-like transposons
whose distribution does not always match species phylogenies raises some interesting questions regarding the dynamics of Mu
element evolution. Functional transposons are only maintained in a
given genome if they can continue to duplicate themselves or if they can provide a benefit to their host. The Far1 gene suggests
that some MuLE sequences may provide such a benefit.
However, if it is assumed that most Mu transposons are
selectively neutral or negative, there are no selective forces
maintaining their integrity (Charlesworth, 1988 One mechanism that has been proposed for continued activity of
transposon lineages is horizontal transfer to transposons from "immunized" hosts in which regulatory mechanisms have evolved to
"naive" hosts, which lack such specific mechanisms (Kidwell, 1994 Given the presence of multiple paralogous lineages of mudrA-containing elements in the grasses, it is difficult to distinguish horizontal transfer from selective retention of particular subfamilies. Based on our phylogenetic analysis of PCR products, as well as our DNA-blot analysis, it appears that class I (the class to which mudrA belongs) sequences are missing in the Panicoids Coix, sorghum, Vetevaria, and Setaria, and in the Chlorinoid S. airoides. The class I sequences are present in the Chlorinoid Muhlenbergia species, and the Panicoid, P. virgatum. These data can be explained by two horizontal transfers (one to P. virgatum and one to Muhlenbergia), or two losses (once in the lineage leading to C. lacryma, sorghum, and V. zizanioides and once in S. airoides). Given the presence of multiple classes of elements that may predate the divergence of all of these species, we find the loss hypothesis to be more parsimonious. Additional sequences using degenerate primers will help to determined whether class I elements are truly missing in all of these species. Supporting the possibility of horizontal transfer of MuLE elements, some similarities from database searches are strikingly high for a transposable element sequence that would be expected to have a reasonably high mutation rate. For instance, a wheat cDNA (accession no. BE443436) is 86% identical at the nucleotide level and 85% identical and 91% similar at the amino acid level to a portion of MoOS-373 (accession no. BAA82373) from rice. In a similar manner, a class II element from C. lacryma is 96% identical at the nucleotide level to a class II element from V. zizanioides. These results suggests a very strong degree of selection or horizontal transfer between these species or their ancestors. A second mechanism for continued activity that one could hypothesize is the rapid production of subvarients of transposable elements. If we assume that the amplification of one subclass of elements provokes a response by the host specific in some way to that subclass, then transposon variants may emerge that are different enough to avoid that regulation. We have ample evidence that mudrA-containing elements have produced multiple subvarients that can coexist within the same genome. Despite having been only partially sequenced, rice and maize contain at least three distinct branches of MuLEs. In each case at least one member of each branch appears to have been active recently enough to have retained evidence of insertion in the form of host sequence direct repeats (Table II). The continuing selection for certain key amino acids in all branches (Figs. 2 and 6) suggests continued selection for activity over at least 70 million years. The conservation of amino acid sequence within specific branches such as Trap suggests that more than one active branch of MuLE elements can exist in a given genome. Further, if Mu is any guide, each branch or class of elements may have its own family of nonautonomous elements, as is suggested by the presence of an element in rice that has Mu-like TIRs flanking unrelated sequences (accession no. X16597). It will be interesting to see if and how nonautonomous variants derived from one branch of MuLEs interact with MuLEs from other branches. The presence of potentially active mudrA-containing transposons in many species, including Arabidopsis and maize, suggests that active Mu transposons have been a component of plant genomes at least since the origin of the monocots. Although we have no direct evidence for active Mu transposons in species other than maize, we do have strong circumstantial evidence for recent, or even current Mu activity in a number of the grasses. The presence of multiple branches and classes of MuLEs within individual genomes suggest that differentiation of elements within individual species lineages may be a common occurrence. A fascinating subject for future investigation will be whether and how and these distinct branches of elements interact with each other and with their hosts.
DNA Gel-Blot Analysis DNA extraction and DNA gel-blot analysis was performed according
to Dorweiller and coworkers (Dorweiler et al., 2000 Plant Material Plant material was collected from the University of California-Berkeley Botanical Garden, grown from seeds supplied by the Royal Botanical Gardens at Kew, and several commercial sources. Accession numbers and collection data are available upon request. PCR Primers PCR primers for the initial screen were designed to sequences
encoding well-conserved amino acids in rice (Oryza
sativa) and maize (Zea mays). One set of primers
was specific to mudrA in maize and one set was specific
to rice. The rice sequence of RMu1-IR36 was provided by
Ryuji Ishikawa, who isolated a MuDR-like element from
maize (Ishikawa et al., 1996 Letter designations refer to the species from which the primers was designed: R from rice, V from Vetevaria zizanioides, and S from Setaria anceps. PCR conditions were then 94°C for 30 s, 48°C for 45 s, and 72°C for 45 s. The samples were subjected to 35 rounds of amplification, and were then gel isolated, purified using the Qiaquick Gel Extraction kit (Qiagen, Valencia, CA), and subcloned using the TOPO TA Cloning kit (Invitrogen, Carlsbad, CA). Following transformation, when possible, several individual clones from each amplification reaction were sequenced. Sequencing was performed by the University of California, Berkeley sequencing facility, using an ABI sequencer. DNA Sequence Analysis Database searches were performed using the BLAST algorithm.
Multiple alignments were performed using a web site maintained by the
Belozersky Institute of Physico-Chemical Biology
(www.genebee.msu.su/services/malign reduced.html) and were corrected
by hand based on the correct reading frame of mudrA in
maize. The alignments shown in Figures 2, 6, and 7 were displayed using
the boxshade web site (www.ch.embnet.org/software/BOX form.html).
Phylogenetic analysis was performed using the PHYLIP package (version 3.5) available through the University of
Washington. Trees in Figures 3, 4, and 5 were derived from DNA
alignments analyzed using DNADIST for distances and NIEGHBOR for
tree inference. Distances were bootstrapped 100 times. The maximum
likelihood method was employed, using a transition/transversion ratio
of 1.4 based on the analysis of the coding sequences from several plant
species (Langdon et al., 2000 Nomenclature Devising nomenclature for transposable elements is rarely
clear-cut, and there are no accepted practices for doing so. However, in the interests of clarity, we have attempted where possible to name
these elements systematically. Individual mudrA-like
sequences are named with the prefix "Mo" followed by
a letters for the genus and species of origin. Thus, a sequence from
Zea luxurians is designated "MoZL,"
with successive numbers for additional sequences from the same species
(e.g. MoZL-1). In those cases where the transposase was
previously identified as such in the database, the numerical
designation is drawn from the last three letters of the protein
accession number of that transposase. Thus, an element from rice
carrying a transposase with the accession number BAA89557 is designated
"MoOS-557." Previously identified elements such as the Trap (Comelli et al., 1999 We refer to all Mu-like elements as MuLEs. This includes all elements with long TIRs and significant homology to mudrA, as well as derivatives and nonautonomous variants thereof. Major subdivisions, or branches, of this general group of MuLEs include the Jittery branch of elements, the Trap branch of elements, and the MuDR branch of elements. Within the MuDR branch of elements, we designate subclades as classes distinguished by Roman numerals. Thus, the clade that includes MuDR is class I of the MuDR branch of elements. This class includes sequences from maize, and its close relatives, as well as sequences from Muhlenbergia and Panicum.
Thanks to Randall Tyers and Nancy Nelson for a critical reading of the manuscript.
Received November 10, 2000; returned for revision December 12, 2000; accepted December 18, 2000. 1 This work was supported entirely by the Novartis Agricultural Discovery Institute Inc. (now Syngenta Agricultural Discovery Institute) University of California-Berkeley Strategic Alliance.
* Corresponding author; e-mail dlisch{at}uclink4.berkeley.edu; fax 510-642-4995.
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|