|
|
||||||||
|
First published online November 14, 2008; 10.1104/pp.108.129734 Plant Physiology 149:258-270 (2009) © 2009 American Society of Plant Biologists OPEN ACCESS ARTICLE
Analysis of Intraspecies Diversity in Wheat and Barley Genomes Identifies Breakpoints of Ancient Haplotypes and Provides Insight into the Structure of Diploid and Hexaploid Triticeae Gene Pools1,[OA]Institute of Plant Biology, University of Zurich, 8008 Zurich, Switzerland (T.W., S.G.K., L.R., B.K.); Commonwealth Scientific and Industrial Research Organization Plant Industry, Canberra, Australian Capital Territory 2601, Australia (E.S.L.); National Institute of Agrobiological Sciences, Tsukuba 305–8602, Japan (T.K., M.P., T.M.); Cereal Research Centre, Agriculture and Agri-Food Canada, Winnipeg, Manitoba, Canada R3T 2M9 (S.C.); Institute of Society for Techno-Innovation of Agriculture, Forestry, and Fisheries, Tsukuba 305–0854, Japan (H.K.); Research Institute for Bioresources, Okayama University, Okayama 710–0046, Japan (K.S.); and Leibniz Institute of Plant Genetics and Crop Plant Research, D–06466 Gatersleben, Germany (D.P., N.S.)
A large number of wheat (Triticum aestivum) and barley (Hordeum vulgare) varieties have evolved in agricultural ecosystems since domestication. Because of the large, repetitive genomes of these Triticeae crops, sequence information is limited and molecular differences between modern varieties are poorly understood. To study intraspecies genomic diversity, we compared large genomic sequences at the Lr34 locus of the wheat varieties Chinese Spring, Renan, and Glenlea, and diploid wheat Aegilops tauschii. Additionally, we compared the barley loci Vrs1 and Rym4 of the varieties Morex, Cebada Capa, and Haruna Nijo. Molecular dating showed that the wheat D genome haplotypes diverged only a few thousand years ago, while some barley and Ae. tauschii haplotypes diverged more than 500,000 years ago. This suggests gene flow from wild barley relatives after domestication, whereas this was rare or absent in the D genome of hexaploid wheat. In some segments, the compared haplotypes were very similar to each other, but for two varieties each at the Rym4 and Lr34 loci, sequence conservation showed a breakpoint that separates a highly conserved from a less conserved segment. We interpret this as recombination breakpoints of two ancient haplotypes, indicating that the Triticeae genomes are a heterogeneous and variable mosaic of haplotype fragments. Analysis of insertions and deletions showed that large events caused by transposable element insertions, illegitimate recombination, or unequal crossing over were relatively rare. Most insertions and deletions were small and caused by template slippage in short homopolymers of only a few base pairs in size. Such frequent polymorphisms could be exploited for future molecular marker development.
The Triticeae tribe contains some of the world's most important crops, among them wheat (Triticum aestivum) and barley (Hordeum vulgare). Wheat and barley diverged approximately 11.6 million years ago (MYA) and have further diversified since then into several subspecies (Chalupska et al., 2008
In contrast to wheat, barley is a diploid species and there is evidence for extensive gene flow from wild to cultivated forms (Pickering and Johnston, 2005
Intraspecific variation at the haplotype level has been studied to varying degrees in different plant species. The most comprehensive data are available for Arabidopsis (Arabidopsis thaliana), in which oligonucleotide resequencing arrays allowed comparison of multiple ecotypes at the whole-genome level (Clark et al., 2007
The high repeat content of more than 80% (Bennett and Smith, 1976
The studies cited above were focused on genes in the A genome, which, as part of tetraploid wheat, has been part of the polyploid wheat gene pool for a long time. The Lr34 locus, which is described in this study, is located on the D genome. This genome is the most recent addition to hexaploid wheat and therefore shows the lowest degree of polymorphism. Two recent studies showed a surprisingly high level of polymorphism between the D genome of hexaploid wheat and the diploid wheat Ae. tauschii, suggesting that the Ae. tauschii line that was used to produce the respective BAC library had diverged at least 1 MYA from the donor of the D genome (Chantret et al., 2005
Divergence times of sequences from different species (or varieties) can be estimated based on the number of nucleotide substitutions in intergenic regions. Triticeae species are especially suited for such analyses because of their high TE content. TEs and other intergenic regions are believed to be largely free from selection pressure (Petrov, 2001
When sequences diverge over time, they accumulate not only nucleotide substitutions but also insertions and deletions (InDels). In Triticeae, the most prominent source of insertions are TEs, which can change the size and sequence organization of a genomic region dramatically within a relatively short evolutionary time (SanMiguel et al., 2002 Here, we present a comparative analysis of the levels of intraspecific variation in two barley loci, Vrs1 and Rym4, and the wheat locus Lr34. The Vrs1 locus regulates spike morphology in barley, whereas Rym4 and Lr34 are involved in disease resistance. For the Rym4 and Lr34 loci, we had sequences from three varieties, and for Vrs1, we had sequences from two. Additionally, we analyzed the Lr34 locus in the diploid Ae. tauschii. At two of the studied loci, we found ancient haplotype fragments in the otherwise highly conserved sequences. We identified several major sequence differences that were caused by illegitimate recombination and unequal crossing over. Additionally, we found that template slippage in short homopolymers is the major source of small InDels in closely related sequences.
For our comparative analyses of intraspecific genome variability, we used large genomic sequences from the two barley loci Vrs1 (Komatsuda et al., 2007
The Barley Varieties Morex and Haruna Nijo Are Very Similar in Genomic Organization at the Vrs1 Locus But Show Hundreds of Sequence Differences
The recently published sequence of the Vrs1 locus from the barley variety Morex (Komatsuda et al., 2007
The two sequences contain a total of 61 InDels; 34 of them are deletions in the Morex sequence and 27 are deletions in the Haruna Nijo sequence. The vast majority (54) of them are InDels of 10 bp or less, while large InDels are rare. Thirty-five of them are 1-bp InDels. To get clues to the molecular mechanism that caused the InDels, we analyzed the surrounding sequences of all 61 InDels (Table II ). We found that template slippage is in 49 cases the most frequent cause for InDels in the sequence analyzed. We considered template slippage as the molecular mechanism whenever the inserted/deleted sequence was found duplicated or multiplied in its entirety in the immediately neighboring sequences; for example, when an InDel motif GGA is embedded in a GGA microsatellite sequence (Fig. 1B). The largest InDel we attributed to template slippage is 14 bp in size. The 14-bp motif is one unit of a tandem array of five and four units in Haruna Nijo and Morex, respectively (Fig. 1B). For five InDels ranging in size from 6 to 390 bp, we could identify short direct repeats flanking the breakpoints of the InDels and indicating illegitimate recombination as the molecular mechanism (Fig. 1C). The most striking differences in size between the two sequences were apparently caused by unequal crossing over. In one large event, the internal domain of an RLC_BARE1 element was eliminated in the Morex sequence (see below). The second event resulted in the presence/absence of two 70-bp units in a large tandem repeat array (Fig. 1D). For four InDels, it is not clear by which mechanism they were caused.
Comparison of Three Barley Varieties at the Rym4 Locus Reveals a Breakpoint between Two Ancient Haplotypes The three genomic sequences that cover the barley Rym4 locus from the three varieties Morex, Haruna Nijo, and Cebada Capa have a common overlap of approximately 87 kb. This region is highly repetitive and contains only two genes (EIF4E-1 and MCT-1; Table I), which are located tail to tail immediately adjacent to each other and are surrounded by a large number of transposable elements (Fig. 1E). Haruna Nijo and Cebada Capa are very similar to each other in the Rym4 region, as they show only 70 nucleotide substitutions and 28 InDels in a total of 92,305 aligned bases. Twenty-one InDels can be attributed to template slippage in SSRs. For six of them, the mechanism is unclear, and in one case, an unequal crossing-over event led to the expansion of a direct repeat array by two units in Haruna Nijo (Table II). Excluding the region containing the two genes plus 1 kb of upstream and downstream region, a total of 81,760 bp could be aligned. This fraction contains 69 nucleotide substitutions, corresponding to an estimated divergence time of 32,478 ± 3,525 years ago. The comparison of Morex and Haruna Nijo yielded much more complex results, indicating that the Rym4 locus of Morex is the most divergent of the three. At the level of sequence organization, Morex and Haruna Nijo differ in numerous InDels (Fig. 1E). The Morex sequence contains eight TEs that are absent in Haruna Nijo, while the latter contains two specific TEs (Fig. 1E). Three of the additional TEs in Morex are nested in other TEs (Fig. 1E). There are seven full-length retrotransposons for which the insertion times could be estimated. Only one of them is common to both sequences and was inserted approximately 1.87 MYA. The other six, which are all only present in the Morex sequence, were inserted between 0.87 and 0.05 MYA (Fig. 1E). These data imply that the two loci diverged sometime between 1.87 and 0.87 MYA, the time span between the youngest retrotransposon common to both and the oldest one that appears in only one variety. We studied the distribution of nucleotide substitutions between the loci by excising in silico from both sequences those regions (TEs and other InDels) that are present in only one of the two. This resulted in a hypothetical ancestor sequence that contains only those regions that are found in both varieties (Fig. 1E). Interestingly, SNP frequency in the left part of the sequence is more than five times higher than in the right part. The first approximately 25 kb of the sequence has an average of 23.7 SNPs per 1,000 bp. This value drops abruptly to 4.1 SNPs per 1,000 bp between positions 25,000 and 26,000 and remains at this low level for the rest of the sequence (Fig. 1E). We interpret this drop in SNP frequency as the breakpoint of two ancient haplotypes that were recombined in Morex but not in Haruna Nijo. When the TE fractions of the strongly conserved and the less conserved parts were used separately for molecular dating, their estimated divergence times were 159,621 ± 10,335 and 927,306 ± 39,229 years ago, respectively. Based on these numbers, we developed a model of the evolution of the two varieties (Fig. 1F). We propose that two different barley lineages diverged approximately 930,000 years ago, giving rise to the two different haplotypes 1 and 2. This divergence time estimate is based on the number of nucleotide substitutions found in the 25 kb at the left end of the hypothetical ancestor sequence. The lineages giving rise to Haruna Nijo and part of the Morex sequence diverged approximately 160,000 years ago (based on nucleotide substitutions on the right part of the sequence). The lineage leading to Morex later recombined with haplotype 2, resulting in a chimerical sequence that includes regions from both haplotypes. The breakpoint of the recombination event lies 3 to 5 kb upstream of the HvEIF4E-1 gene (Fig. 1E). The results of the comparison of Morex and Cebada Capa were basically identical, because Cebada Capa and Haruna Nijo are much more similar to each other at the Rym4 locus than to Morex (data not shown).
The largest wheat sequence we analyzed has a size of 207 kb, derived from the Lr34 locus of the hexaploid wheat variety Chinese Spring. The sequence contains eight genes, including one conserved noncoding sequence and one pseudogene (CYP-1; Table I), which was disrupted by the insertion of a DNA transposon of the Harbinger superfamily (Fig. 2 ). This sequence was compared with two sequences from the varieties Glenlea and Renan. The Glenlea sequence has a size of 147 kb and covers all of the right part of the Chinese Spring sequence but does not cover the first approximately 60 kb at the left end (Fig. 2). The Chinese Spring and Glenlea sequences are extremely similar to each other. Neither contains a major insertion or deletion that would clearly distinguish it from the other, and we detected a single nucleotide substitution that was confirmed by resequencing in 146,245 aligned bases. When all gene sequences (plus 1 kb of upstream and downstream regions) were removed from the alignment, 107,182 bp were left for a divergence time estimate. Using this figure, the two sequences were estimated to have diverged roughly within the past 700 years (359 ± 359 years). In contrast to the low number of SNPs, a total of 30 InDels were found. Twenty-eight of them are due to template slippage, one to unequal crossing over, and one has no clear cause.
Renan, at 91 kb, is the shortest sequence that was available. It is completely covered by the Chinese Spring sequence but overlaps in only 42 kb with the sequence of Glenlea (Fig. 2A). In total, we could align 81,862 bp between Chinese Spring and Renan and found 13 nucleotide substitutions. The Renan sequence contains four putative genes (total gene space of 27.2 kb, including 1 kb of upstream and downstream sequences). Nine nucleotide substitutions were found in the 54.6 kb of nongenic sequence, translating into a divergence time estimate of 6,339 ± 2,113 years ago.
We detected a total of 26 InDels between Renan and Chinese Spring and six InDels between Renan and Glenlea. All except two can be attributed to template slippage. The two other InDels are the products of illegitimate recombination. The main difference is a large 9.8-kb deletion affecting parts of an RLC_Angela retrotransposon plus part of its flanking region. The deletion is present in Glenlea and Chinese Spring, while Renan still contains the 9.8-kb fragment. In this respect, Renan is more similar to Ae. tauschii, which also does not contain that deletion (InDel
The Ae. tauschii sequence has a size of 180 kb and is completely covered by the Chinese Spring sequence. It contains one sequence gap of unknown size in a CACTA transposon of the TAT1 family (Fig. 2), but the order of the two sequence contigs could be inferred based on the alignment with the Chinese Spring sequence. Ae. tauschii, although overall similar, shows several major differences from the hexaploid wheat sequences. These differences include the insertions of two CACTA transposons and a non-LTR retrotransposon in hexaploid wheat (InDels Interestingly, the Ae. tauschii sequence also shows an uneven distribution of nucleotide substitutions, similar to the barley haplotypes described above: a large region of approximately 80 kb in the left half of the sequence is highly conserved between hexaploid wheat (i.e. Chinese Spring) and Ae. tauschii (Fig. 2B), as the two species differ in only 87 nucleotide positions. This region includes three of the genes and one non-LTR retrotransposon. However, the situation is more complex than at the barley Rym4 locus and more difficult to interpret. From the SNP density plot (Fig. 2B), it appears that there are at least three different levels of sequence conservation: a highly conserved region in the left half (region A), a highly variable segment in the middle (region D), and intermediate conservation in the center and the right half of the sequence (regions B and E). We interpret these data as combinations of at least two, maybe three, haplotypes of different ages. The exact breakpoints could not be determined, but one appears to be close to the LRK-1 gene (border between regions A and B, Fig. 2B). Comparisons of nongenic sequences to the left of that island produced divergence time estimates of roughly 57,000 years ago, while those to the right of it indicated a divergence time of approximately 400,000 to 500,000 years ago. In the right part, more TE insertions and other large InDels can be found, which fits the higher divergence time estimate. In addition, there is one region (region C) that is so divergent in the two species that sequences could not be aligned reliably. Therefore, this region was excluded from the overall sequence alignment and compared separately (Fig. 2C). Region C lies between the two LRK genes and has a size of 1,800 bp in Chinese Spring and 2,800 bp in Ae. tauschii. Some parts of region C are conserved between Chinese Spring and Ae. tauschii, although at a lower level of sequence identity. Additionally, it contains several unique fragments that indicate multiple InDel events (Fig. 2C).
In total, we identified nine unequal crossing-over events that caused differences in the compared sequences. These events must have occurred in the relatively recent evolutionary history, since the divergence of the sequences studied. All detected unequal crossing-over events occurred in non-genic regions. Seven of them affected relatively small arrays of tandem repeats, while two had a major impact on the size of the regions because they occurred between the LTRs of retrotransposons. One of them resulted in a solo-LTR, while the other produced a tandem element: the barley Haruna Nijo sequence at the Vrs1 locus contains a full-length RLC_BARE1 retrotransposon close to its left end, while the Morex sequence contains a solo-LTR at this position. Based on the LTR divergence of the full-length element, we estimated RLC_BARE1 to have inserted at its location approximately 190,000 ± 63,000 years ago. The SD is relatively high, as the two LTRs differ in only nine positions. Because the solo-LTR in the Morex sequence is a hybrid of the two LTRs of the full-length element, comparison of specific SNPs allowed us to narrow down the putative breakpoint of the unequal crossing over to a stretch of 252 bp (Fig. 3A ).
The tandem element resulting from unequal crossing over was found close to the left end of the region compared between Chinese Spring and Ae. tauschii. A large 14-kb LTR retrotransposon of the Gypsy superfamily was found in its original full-length form in Ae. tauschii, while in Chinese Spring it had undergone an unequal crossing-over event. The unequal homologous recombination occurred between the two LTRs, resulting in a tandem element with three LTRs and two internal domains. Additionally, one unit of the tandem element was subsequently partially deleted (Fig. 3B).
Methylation of cytosine in CG and CNG sites increases the likelihood of spontaneous transitions from C to T (reviewed in Walsh and Xu, 2006
Compared with other plant species such as Arabidopsis or rice, sequence information from Triticeae crops is still very limited. In addition, only relatively few BAC libraries from different varieties are available. The germplasm analyzed in this study represents a broad range of Triticeae species and varieties, allowing for comparisons within diploid and hexaploid species as well as for cross comparison. By including sequences from BAC libraries each from three wheat and barley varieties, we exploited all resources available to date for barley and hexaploid wheat. The D genome of wheat is especially interesting as it shows a low level of diversity and provides a background with low levels of gene flow. In addition, loci on the D genome can be easily compared with sequences from the diploid donor species Ae. tauschii. The analysis of intergenic sequences proved to be very efficient for understanding the evolutionary history of the loci studied. Since genes are under selection pressure, they do not diverge at a constant rate and cannot be used directly for divergence time estimates. Additionally, genes evolve more slowly and allow for many fewer genomic rearrangements than intergenic sequences. In contrast, TE and other nongenic sequences keep a trace record of all mutations and rearrangements. Therefore, the high repeat content of Triticeae genomes, although an obstacle for sequencing, makes them ideally suited to study the molecular mechanisms of genome evolution.
The extensive stretches of TE sequences allowed very precise estimates of divergence times. However, these have to be considered with caution for two reasons. First, for sequences that differ in only a few base pairs, all SNPs have to be carefully checked either by resequencing or at least by examining the quality of sequences and assembly in the respective positions. For this study, we resequenced the regions of all SNPs between the wheat varieties Chinese Spring, Glenlea, and Renan to exclude the possibility of sequencing errors. Second, because TE sequences are often heavily methylated in Triticeae, an increased mutation rate in CG and CNG sites has to be expected (Walsh and Xu, 2006
It is amazing that the comparison of only three loci from a few varieties resulted in the identification of multiple haplotypes. Our comparative analysis, therefore, suggests that haplotypes of distant evolutionary origin are common in the Triticeae gene pool. By analyzing SNP frequencies, we found that different parts of the sequences diverged at different time points. The identified haplotype divergences cover a relatively broad time span. Haplotypes 1 and 2 at the barley Vrs1 locus diverged approximately 160,000 and 930,000 years ago, while the two haplotype segments identified in Ae. tauschii diverged approximately 50,000 and 500,000 years ago. The ages of the two older haplotypes are comparable to the previously described ancient haplotypes at the wheat Lr10 locus (Isidore et al., 2005b
The evolutionary interpretation becomes more complex if the data are evaluated in more detail. For example, we estimated that the Vrs1 sequences of Haruna Nijo and Morex diverged approximately 120,000 years ago, which is significantly different from the 160,000 years ago divergence for the "younger" haplotype at the Rym4 locus. Thus, the sequences of the Vrs1 and Rym4 loci must represent yet again haplotypes that diverged at different points in time. The more divergent part of the Ae. tauschii sequence might also represent several distinct segments, as the SNP distribution along the sequence is very uneven when compared with Chinese Spring. Additionally, the Ae. tauschii sequence contains an extremely divergent DNA segment between the two LRK genes. Because of its short length, this is more likely to be the result of a gene conversion event rather than a double crossing over. It seems that this region was introgressed from yet another, more divergent haplotype or from a different locus in the genome (e.g. a homoeologous locus of the A or B genome). An example of the latter was recently published (Zhang and Dubcovsky, 2008
Interestingly, previous studies placed the divergence of the D genome of hexaploid wheat with Ae. tauschii approximately between 550,000 and 900,000 years ago (Chantret et al., 2005
The level of polymorphism observed between the three wheat varieties is much lower than that between the barley varieties. This is in perfect agreement with the fact that the hybridization event adding the D genome to tetraploid wheat occurred only about 8,000 years ago (Feldman, 2001
In contrast, the barley haplotypes are much older. There are two possible explanations that are not mutually exclusive. It has been hypothesized (Zohary, 1996 We conclude that the molecular analysis of intraspecies diversity in wheat and barley at the haplotype level confirms and extends our knowledge of the evolutionary history of these two crop species. Our data demonstrate that divergence time can only be determined for haplotypes or segments of them, but not for varieties. Therefore, we are confronted with an emerging picture of a gene pool that consists of a large number of haplotypes that have frequently recombined in the past. However, these events were not frequent enough to completely homogenize the entire gene pool, instead still allowing for distinct haplotype fragments to be identified.
The intraspecies comparative analyses provided an opportunity to study seven unequal crossing-over events in great detail. For all of them, obvious template sequences such as LTRs of retrotransposons or arrays of tandem repeats could be identified. The two unequal crossing-over events in LTR retrotransposons nicely illustrate the potential of unequal homologous recombination to either expand or contract a region. Solo-LTRs are reported frequently in plants (Shirasu et al., 2000
Five incidents of unequal crossing over affected arrays of tandem repeats. All of these arrays were found either in TEs or in noncoding intergenic sequences. From the available data, we conclude that such tandem arrays frequently expand and contract through unequal crossing over. As it was recently described for Leu-rich repeats of resistance gene analogs (Wicker et al., 2007
As in previous studies (Shirasu et al., 2000 SSRs or microsatellites have long been used as molecular markers and are formed by relatively long stretches of dinucleotide, trinucleotide, or tetranucleotide repeats that are polymorphic because of template slippage. Indeed, we also found that most InDels were apparently caused by template slippage, resulting in a short DNA motif being repeated in different numbers between two genomic sequences. Template slippage was by far the most abundant cause of InDels between sequences that diverged only recently (e.g. Glenlea and Chinese Spring). However, two observations were unexpected. First, we found that most of the motifs that were affected by template slippage were relatively short homopolymers (i.e. stretches of only three or four identical nucleotides). This is surprising because it is commonly found that SSRs are only polymorphic if they have a certain length. Second, InDels greatly outnumbered nucleotide substitutions in the Glenlea versus Chinese Spring comparison. This suggests that template slippage is a more frequent source of polymorphism than nucleotide substitution. This finding has implications for the design of highly polymorphic molecular markers for breeding germplasm, particularly for the wheat D genome, with its low degree of polymorphism: a strategy allowing the efficient isolation of short homopolymers (e.g. by resequencing arrays) might result in a very large number of potential molecular markers, as such short motifs are more frequent than the typically used SSRs and SNPs.
Although the regions studied in this work might not be fully representative for the whole genomes (e.g. because of a putatively strong selection pressure on these regions involved in disease resistance or spike morphology), there are several possible implications for the future genetic analysis of agronomical traits in Triticeae crops. The finding of recombined haplotypes has to be considered in association with mapping, as single, "haplotype-specific" markers might actually detect different, recombined haplotypes. As we have shown here, comparative analysis allows us to detect such recombined haplotypes. A future, large-scale analysis of haplotypes (including both genic and intergenic regions) in Triticeae gene pools will be greatly supported by the next generation of sequencing technologies, such as 454, Solexa, and oligonucleotide resequencing arrays similar to those used in Arabidopsis (Clark et al., 2007 imagefpc/IBSC Webpage/).
We have found that despite a great diversity and great age of the ancient haplotypes, sequence diversity within genes or differences in gene content were minimal. This contrasts with findings in maize, in which a large number of gene fragments distinguished varieties (Fu and Dooner, 2002
Shotgun Sequencing
BAC clones were obtained by screening of publicly available BAC libraries of the respective varieties (Table III
). Identified BACs were confirmed either by Southern-blot hybridization to fingerprint blots obtained after single restriction of BAC DNA or by PCR amplification of specific genic or genomic markers from the corresponding BACs. BAC shotgun sequencing was done on an ABI3730 automated sequencer (Applied Biosystems). Base calling and quality trimming of the sequences were done using PHRED version 0.020425.c (Ewing et al., 1998
Sequence Analysis
For sequence analysis, programs from the EMBOSS package (http://emboss.sourceforge.net/), ClustalW (Thompson et al., 1994 For sequences that differed in only very few base pairs, primers were designed and all regions containing SNPs were resequenced independently. The sequences described in this study were deposited at GenBank under accession numbers FJ436983, FJ436984 to FJ436986, and FJ477091 to FJ477093.
We thank Jelena Perovic for her excellent technical assistance. Received September 10, 2008; accepted November 12, 2008; published November 14, 2008.
1 This work was supported by the Swiss National Science Foundation (grant no. 3100–105620 to B.K.), by the Ministry of Agriculture, Forestry, and Fisheries of Japan (Genomics for Agricultural Innovation grant no. TRC1004 and Green Technology project no. GD3006), by the Grains Research and Development Corporation (grant nos. CSP00063 and 00099), the Bundesministerium für Bildung und Forschung (grant no. 0312280A; GABI-MAP), and Promotion of Basic Research Activities for Innovative Biosciences (to K.S.).
2 These authors contributed equally to the article.
3 Present address: Julius-Kuehn-Institute, Institute for Resistance Research and Stress Tolerance, Erwin-Baur-Strasse 27, D–06484 Quedlinburg, Germany. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Beat Keller (bkeller{at}botinst.uzh.ch).
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.108.129734 * Corresponding author; e-mail bkeller{at}botinst.uzh.ch.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and psi-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402 Bennett MD, Smith JB (1976) Nuclear DNA amounts in angiosperms. Philos Trans R Soc Lond B Biol Sci 274: 227–274 Brunner S, Fengler K, Morgante M, Tingey S, Rafalski A (2005) Evolution of DNA sequence nonhomologies among maize inbreds. Plant Cell 17: 343–360 Chalhoub B, Belcram H, Caboche M (2004) Efficient cloning of plant genomes into bacterial artificial chromosome (BAC) libraries with larger and more uniform insert size. Plant Biotechnol J 2: 181–188[CrossRef][Web of Science][Medline] Chalupska D, Lee HY, Faris JD, Evrard A, Chalhoub B, Haselkorn R, Gornicki P (2008) Acc homoeoloci and the evolution of wheat genomes. Proc Natl Acad Sci USA 105: 9691–9696 Chantret N, Salse J, Sabot F, Rahman S, Bellec A, Laubin B, Dubois I, Dossat C, Sourdille P, Joudrier P, et al (2005) Molecular basis of evolutionary events that shaped the hardness locus in diploid and polyploid wheat species (Triticum and Aegilops). Plant Cell 17: 1033–1045 Chao S, Zhang W, Akhunov E, Sherman J, Ma Y, Luo M-C, Dubcovsky J (2009) Analysis of gene-derived SNP marker polymorphism in US wheat (Triticum aestivum L.) cultivars. Mol Breed 23: 23–33[CrossRef] Clark RM, Schweikert G, Toomajian C, Ossowski S, Zeller G, Shinn P, Warthmann N, Hu TT, Fu G, Hinds DA, et al (2007) Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana. Science 317: 338–342 Devos KM, Brown JKM, Bennetzen JL (2002) Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res 12: 1075–1079 Dubcovsky J, Dvorak J (2007) Genome plasticity a key factor in the success of polyploid wheat under domestication. Science 316: 1862–1866; erratum Dubcovsky J, Dvorak J (2007) Science 318: 393 Ewing B, Hillier L, Wendl M, Green P (1998) Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 8: 175–185 Feldman M (2001) Origin of cultivated wheat. In AP Bonjean, WJ Angus, eds, The World Wheat Book: A History of Wheat Breeding. Lavoisier Publishing, Paris, pp 3–56 Fu H, Dooner HK (2002) Intraspecific violation of genetic colinearity and its implications in maize. Proc Natl Acad Sci USA 99: 9573–9578 Gaut BS, Morton BR, McCaig BC, Clegg MT (1996) Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcL. Proc Natl Acad Sci USA 93: 10274–10279 Grando S, Macpherson H (2005) Food barley: importance, uses and local knowledge. In Proceedings of the International Workshop on Food Barley Improvement, Vol 1. International Center for Agricultural Research in the Dry Areas, Aleppo, Syria, pp 121–126 Gu YQ, Salse J, Coleman-Derr D, Dupin A, Crossman C, Lazo GR, Huo N, Belcram H, Ravel C, Charmet G, et al (2006) Types and rates of sequence evolution at the high-molecular-weight glutenin locus in hexaploid wheat and its ancestral genomes. Genetics 174: 1493–1504 Han B, Xue Y (2003) Genome-wide intraspecific DNA-sequence variations in rice. Curr Opin Plant Biol 6: 134–138[CrossRef][Web of Science][Medline] Isidore E, Scherrer B, Bellec A, Budin K, Faivre-Rampant P, Waugh R, Keller B, Caboche M, Feuillet C, Chalhoub B (2005a) Direct targeting and rapid isolation of BAC clones spanning a defined chromosome region. Funct Integr Genomics 5: 97–103[CrossRef][Medline] Isidore E, Scherrer B, Chalhoub B, Feuillet C, Keller B (2005b) Ancient haplotypes resulting from extensive molecular rearrangements in the wheat A genome have been maintained in species of three different ploidy levels. Genome Res 15: 526–536 Komatsuda T, Pourkheirandish M, He C, Azhaguvel P, Kanamori H, Perovic D, Stein N, Graner A, Wicker T, Tagiri A, et al (2007) Six-rowed barley originated from a mutation in a homeodomain-leucine zipper I-class homeobox gene. Proc Natl Acad Sci USA 104: 1424–1429 Lai J, Li Y, Messing J, Dooner HK (2005) Gene movement by Helitron transposons contributes to the haplotype variability of maize. Proc Natl Acad Sci USA 102: 9068–9073 Lovett ST (2004) Encoded errors: mutations and rearrangements mediated by misalignment at repetitive DNA sequences. Mol Microbiol 52: 1243–1253[CrossRef][Web of Science][Medline] Ma J, Bennetzen JL (2004) Rapid recent growth and divergence of rice nuclear genomes. Proc Natl Acad Sci USA 101: 12404–12410 Ma J, Bennetzen JL (2006) Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice. Proc Natl Acad Sci USA 103: 383–388 Ma J, Wing RA, Bennetzen JL, Jackson SA (2007) Plant centromere organization: a dynamic structure with conserved functions. Trends Genet 23: 134–139[CrossRef][Web of Science][Medline] Morgante M, Brunner S, Pea G, Fengler K, Zuccolo A, Rafalski A (2005) Gene duplication and exon shuffling by Helitron-like transposons generate intraspecies diversity in maize. Nat Genet 37: 997–1002[CrossRef][Web of Science][Medline] Nilmalgoda SD, Cloutier S, Walichnowski AZ (2003) Construction and characterization of a bacterial artificial chromosome (BAC) library of hexaploid wheat (Triticum aestivum L.) and validation of genome coverage using locus-specific primers. Genome 46: 870–878[Medline] Petrov DA (2001) Evolution of genome size: new approaches to an old problem. Trends Genet 17: 23–28[CrossRef][Web of Science][Medline] Pickering R, Johnston PA (2005) Recent progress in barley improvement using wild species of Hordeum. Cytogenet Genome Res 109: 344–349[CrossRef][Web of Science][Medline] Ramakrishna W, Dubcovsky J, Park YJ, Busso C, Emberton J, SanMiguel P, Bennetzen JL (2002) Different types and rates of genome evolution detected by comparative sequence analysis of orthologous segments from four cereal genomes. Genetics 162: 1389–1400 Ravel C, Praud S, Murigneux A, Canaguier A, Sapet F, Samson D, Balfourier F, Dufour P, Chalhoub B, Brunel D, et al (2006) Single-nucleotide polymorphism frequency in a set of selected lines of bread wheat (Triticum aestivum L.). Genome 49: 1131–1139[Medline] Saisho D, Myoraku E, Kawasaki S, Sato K, Takeda K (2007) Construction and characterization of a bacterial artificial chromosome (BAC) library from the Japanese malting barley variety Haruna Nijo. Breed Sci 57: 29–38[CrossRef] SanMiguel P, Gaut BS, Tikhonov A, Nakajima Y, Bennetzen JL (1998) The paleontology of intergene retrotransposons of maize. Nat Genet 20: 43–45[CrossRef][Web of Science][Medline] SanMiguel PJ, Ramakrishna W, Bennetzen JL, Busso CS, Dubcovsky J (2002) Transposable elements, genes and recombination in a 215-kb contig from wheat chromosome 5A(m). Funct Integr Genomics 2: 70–80[CrossRef][Medline] Scherrer B, Isidore E, Klein P, Kim JA, Bellec A, Chalhoub B, Keller B, Feuillet C (2005) Large intraspecific haplotype variability at the rph7 locus results from rapid and recent divergence in the barley genome. Plant Cell 17: 361–374 Sentry JW, Smyth DR (1989) An element with long terminal repeats and its variant arrangements in the genome of Lilium henryi. Mol Gen Genet 215: 349–354[CrossRef][Web of Science][Medline] Shirasu K, Schulman AH, Lahaye T, Schulze-Lefert P (2000) A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res 10: 908–915 Sonnhammer EL, Durbin R (1995) A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 167: GC1–GC10[CrossRef][Web of Science][Medline] Stein N, Perovic D, Kumlehn J, Pellio B, Stracke S, Streng S, Ordon F, Graner A (2005) The eukaryotic translation initiation factor 4e confers multiallelic recessive bymovirus resistance in Hordeum vulgare (L.). Plant J 42: 912–922[CrossRef][Web of Science][Medline] Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W, improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673–4680 Vitte C, Panaud O (2003) Formation of solo-LTRs through unequal homologous recombination counterbalances amplifications of LTR retrotransposons in rice Oryza sativa L. Mol Biol Evol 20: 528–540 Walsh CP, Xu GL (2006) Cytosine methylation and DNA repair. Curr Top Microbiol Immunol 301: 283–315[Web of Science][Medline] Wicker T, Stein N, Albar L, Feuillet C, Schlagenhauf E, Keller B (2001) Analysis of a contiguous 211 kb sequence in diploid wheat (Triticum monococcum L.) reveals multiple mechanisms of genome evolution. Plant J 26: 307–316[CrossRef][Web of Science][Medline] Wicker T, Yahiaoui N, Guyot R, Schlagenhauf E, Liu ZD, Dubcovsky J, Keller B (2003) Rapid genome divergence at orthologous low molecular weight glutenin loci of the A and Am genomes of wheat. Plant Cell 15: 1186–1197 Wicker T, Yahiaoui N, Keller B (2007) Illegitimate recombination is a major evolutionary mechanism for initiating size variation in plant resistance genes. Plant J 51: 631–641[CrossRef][Web of Science][Medline] Wicker T, Zimmermann W, Perovic D, Paterson AH, Ganal M, Graner A, Stein N (2005) A detailed look at 7 million years of genome evolution in a 439 kb contiguous sequence at the barley Hv-EIF4e locus: recombination, rearrangements and repeats. Plant J 41: 184–194[CrossRef][Medline] Zhang W, Dubcovsky J (2008) Association between allelic variation at the Phytoene synthase 1 gene and yellow pigment content in the wheat grain. Theor Appl Genet 116: 635–645[CrossRef][Web of Science][Medline] Zeller G, Clark RM, Schneeberger K, Bohlen A, Weigel D, Rätsch G (2008) Detecting polymorphic regions in Arabidopsis thaliana with resequencing microarrays. Genome Res 18: 918–929 Zohary D (1996) The mode of domestication of the founder crops of Southwest Asian agriculture. In DR Harris, ed, The Origins and Spread of Agriculture and Pastoralism in Eurasia. University College London Press, London, pp 142–158 This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|