|
|
||||||||
|
First published online July 23, 2008; 10.1104/pp.108.121491 Plant Physiology 148:25-40 (2008) © 2008 American Society of Plant Biologists OPEN ACCESS ARTICLE
Genome-Wide Analysis of Transposon Insertion Polymorphisms Reveals Intraspecific Variation in Cultivated Rice1,[W],[OA]National Center for Gene Research and Institute of Plant Physiology and Ecology, Shanghai Institutes of Biological Sciences, Chinese Academy of Sciences, China
Insertions and precise eliminations of transposable elements generated numerous transposon insertion polymorphisms (TIPs) in rice (Oryza sativa). We observed that TIPs represent more than 50% of large insertions and deletions (>100 bp) in the rice genome. Using a comparative genomic approach, we identified 2,041 TIPs between the genomes of two cultivars, japonica Nipponbare and indica 93-11. We also identified 691 TIPs between Nipponbare and indica Guangluai 4 in the 23-Mb collinear regions of chromosome 4. Among them, retrotransposon-based insertion polymorphisms were used to reveal the evolutionary relationships of these three cultivars. Our conservative estimates suggest that the TIPs generated approximately 14% of the genomic DNA sequence differences between subspecies indica and japonica. It was also found that more than 10% of TIPs were located in expressed gene regions, representing an important source of genetic variation. Transcript evidence implies that these TIPs induced a series of genetic differences between two subspecies, including interrupting host genes, creating different expression forms, drastically changing intron length, and affecting expression levels of adjacent genes. These analyses provide genome-wide insights into evolutionary history and genetic variation of rice.
Transposons were first discovered and characterized in maize (Zea mays; McClintock, 1948
Different from DNA transposons (class II TEs), which can be deleted precisely at a relatively low frequency, the vast majority of retrotransposon insertions (class I TEs) are irreversible, rarely undergoing precise excision. Hence, the absence of retrotransposon is regarded to be the ancestral state. Moreover, the probability that different retrotransposons would independently insert into the exact same location is negligible. Consequently, retrotransposon-based insertion polymorphisms (RBIPs), as an important subset of TIPs, are very useful in the study of deeper phylogeny in wide germplasm pools. RBIPs were developed using the PCR-based method for retrotransposon isolation (Pearce et al., 1999
Although TIPs are abundant and also informative (Du et al., 2006
Recent studies have found that more than 10% of the structural genes contained TEs in rice (Sakai et al., 2007
The Abundance of TIPs between Cultivated Rice Genomes
To investigate the difference between japonica Nipponbare and indica Guangluai 4 genome sequences, we selected an indica-japonica collinear region on chromosome 4, where both cultivars have BAC-based sequences and differ substantially in size (Fig. 1
). The total length of this region is 492 kb in Nipponbare and 394 kb in Guangluai 4. We analyzed TEs and non-TE related genes and compared the differences between the two genome sequences in this region. Consistent with the conclusions from studies of other organisms (Britten et al., 2003
To identify the differences induced by the TIPs, we performed a systematic analysis of approximately 23-Mb sequences of chromosome 4. Our approach for detecting TIPs in rice involved identifying all indels of more than 100 bp between the two genomes and then screening these insert regions to identify de novo transposon insertions. We reasoned that this approach should be effective, because many indels were related to TIPs, and, also, the lengths of most transposon insertions were longer than 100 bp, as indicated in the orthologous region mentioned above. We aligned all of the orthologous regions between Nipponbare and Guangluai 4, and mined all indels of more than 100 bp. The results were the same when individual BACs and constructed contigs of Guangluai 4 were used for alignment. We found that there were 821 insertions (>100 bp) in Nipponbare relative to Guangluai 4 and 751 insertions (>100 bp) in Guangluai 4 relative to Nipponbare, with a total length of 3.2 Mb and 2.4 Mb, respectively (Table I ). Overall, the 1,572 insertions were distributed throughout these regions, ranging from 100 to 118,675 bp in length. Large indels of greater than 2 kb were primarily responsible for the different sizes of orthologous regions between Nipponbare and Guangluai 4 (Fig. 2 ). The homology-based approach was used to identify indels that were caused by de novo transposon insertions. We regarded an indel as a TE insertion by employing the following criteria: first, it should have similarity to a known TE family and possess the structure of a transposon; second, it should be bound by target site duplication (TSD). With these criteria, 691 insertions of transposons were identified in the approximately 23-Mb orthologous regions of Nipponbare and Guangluai 4 (Supplemental Table S1). Among them, the most abundant polymorphisms identified were Ty3/gypsy insertion polymorphisms. A total of 110 insertions of Ty3/gypsy retrotransposons were detected in Nipponbare, while 127 insertions were detected in Guangluai 4, equivalent to 0.95 and 0.89 Mb of the sequences investigated, respectively (Table II ). Other abundant transposon insertions included Ty1/copia, En-Spm/CACTA, and MULE, which were consistent with their content in the Nipponbare genome.
Although the total number of TE insertions is nearly equal in Nipponbare and Guangluai 4, the size of long terminal repeat (LTR)-retrotransposon insertions varies substantially between Nipponbare and Guangluai 4. The average length of LTR-retrotransposons is 7.5 kb in Nipponbare and 6.5 kb in Guangluai 4, which may suggest that internal deletions of LTR-retrotransposons occur more frequently in Guangluai 4. Moreover, some DNA transposon families seemed to have insertion bias in the two subspecies. The insertions of En-Spm/CACTA and MULE were more abundant in Nipponbare than in Guangluai 4, while Tourist/Harbinger insertions were more abundant in Guangluai 4.
In the approximately 23-Mb orthologous regions, there are at least 179 "young LTR-retrotransposons" in the Nipponbare genome (covering about 1.34-Mb sequences), which accumulated after the divergence of japonica and indica from a common ancestor. As the total length of the rice nuclear genome was calculated to be 389 Mb and chromosome 4 had a relatively modest retrotransposon content (International Rice Genome Sequencing Project, 2005
To determine the evolutionary history of three cultivated rice varieties, Nipponbare, Guangluai 4, and 93-11, whose genomic sequences are available, we tested for the presence/absence of RBIPs between Nipponbare/Guangluai 4 in the BGI 93-11 genome by searching against the BGI 93-11 contigs. An insertion of a TE was considered to be present in 93-11 rice when the corresponding region of 93-11 had the TE insertion. Alternatively, an insertion was judged to be absent in the 93-11 genome if the TE sequences did not exist in the orthologous region of 93-11 (see "Materials and Methods" for details).
In total, 163 retrotransposon insertions present in the Nipponbare genome and 165 retrotransposon insertions present in the Guangluai 4 genome were investigated in the 93-11 genome (Table III
). Of the 163 retrotransposon insertions present in Nipponbare, 148 insertions are absent in the 93-11 genome (consistent with Guangluai 4), while only 15 insertions are present in 93-11 (consistent with Nipponbare; Fig. 3
, type I). This result indicates that the radiation between gene pools of Guangluai 4 and 93-11 probably occurred after the divergence between indica and japonica. The 15 exceptions reflect introgression between the two genome pools that may have occurred hundreds of years ago, as reported previously (Feltus et al., 2004
In addition, we examined DNA transposon polymorphisms in 93-11. Because of the possibility of the excision of DNA transposons and the lack of ancestor information, we could not determine whether an individual TIP was an insertion or a precise excision event. However, it was found that, of 119 DNA transposon insertions present in Nipponbare, only 23 insertions were also present in 93-11 (Supplemental Table S5). The 23 insertions present in 93-11 can result from the introgression events or the excision events. Deducting from the introgression portion (9.2%), there were likely only 10.1% resulting from the excision events (23/119 – 9.2% = 10.1%). According to these results, we propose that the precise excision of DNA transposons is not frequent in rice.
With the availability of two rice whole genome sequences and whole genome alignment, we started our mining from the alignment result of BGI 93-11 contigs with The Institute for Genomic Research (TIGR) Nipponbare pseudomolecule 5.0 (Ouyang et al., 2007
TIPs are not randomly distributed on five of the 12 rice chromosomes (chromosomes 1, 3, 4, 5, and 8; P < 0.01; Supplemental Table S6). The uneven distribution is to some extent caused by the position bias of TE insertions. It was also found that some regions lacking TIPs are also the regions of low polymorphism between Nipponbare and 93-11. For instance, the longest region lacking TE insertions is also the longest single-nucleotide polymorphism (SNP)-poor region, which is located on chromosome 5, as shown in Figure 5 (9–13 Mb in the pseudomolecules [Feltus et al., 2004
Types of TE-Induced Genetic Variations TIPs have considerable effect on genome structure and size, as described above. Moreover, they also contribute to the variation of individual genes. Various ways have been discovered in which TIPs can affect the intraspecific variation of individual genes (Fig. 6 ). To explore the evolutionary significance of TIPs in genetic variation, we examined all of the TIPs in the expressed gene regions and determined whether any variation caused by TIPs existed between indica and japonica (Supplemental Tables S3 and S4). Since EST and cDNA sequences can provide direct evidence for gene expression and because they are currently the most important resources for transcriptome exploration in rice, we considered a TIGR gene locus as an expressed gene region if it had at least one corresponding EST (or cDNA) in the database. The variations in these regions were classified into three types: (1) the alteration of cDNA sequence; (2) the change of intron size; and (3) the rearrangement of the promoter region. We counted the number of TIPs that were associated with the three types of genetic variation and observed that at least 10% of TIPs occurred in the expressed gene regions leading to changes ranging from subtle to dramatic (Tables IV and V ).
Alteration of cDNA Sequence After the divergence, TIPs within gene regions are likely to result in a variety of outcomes, including the alteration of gene structure and expression. To investigate these TE-induced changes in transcription level, we searched transcripts (including Fl-cDNA and ESTs) around the insertion sites. If there was a cDNA or EST match, gene annotation was inspected in Nipponbare and 93-11 on the basis of rice transcript alignments and TIGR annotation release 5. Then, individual examinations were conducted to identify the difference caused by TE insertion. TE insertions into TE-related genes were excluded manually.
Overall, 4.3% of TIPs between Nipponbare and 93-11 and 3.9% of TIPs between Nipponbare and Guangluai 4 resulted in abnormal termination or alterative splicing, respectively. TEs that insert within coding regions are most likely to result in null mutations. For example, in hexaploid wheat (Triticum aestivum), the xylanase inhibitor protein I gene (XIP-I), whose crystal structure, expression pattern, and function have been studied in detail, was shown to function in plant defense against secreted fungal pathogen xylanases by its competitive inhibiting activity against fungal endo-1,4-β-D-xylanases (Elliott et al., 2002
It was found that 3' untranslated regions (UTRs) in exons are preferentially inserted, which can be easily understood because insertions in 3' UTRs seem to be less destructive than insertions in other locations of the coding region. On the other hand, they also provide the raw material for new protein-coding regions. For example, we found that TE insertions in 3' UTRs created an alternative spliceosome. OsWRKY8, a member of the WRKY gene family encoding transcription factors that are involved in the regulation of various biological processes (Xie et al., 2005 Insertions in introns could also have an influence on gene splicing sites. For instance, we found that a putative rice purine permease, which is a homologous gene of AtPUP11, shifted its transcription start site to the transposon hAT, thus generating a truncated ORF lacking its original first exon (Fig. 6C).
Change of Intron Size
Modification of Expression Level and Rearrangement of Promoter Region We then experimentally compared relative expression levels of 15 genes that possessed TIPs in the defined upstream regions between japonica Nipponbare and indica 93-11. The results of real-time RT-PCR analyses of 14-d-old seedlings are shown in Supplemental Figure S1. Of 15 genes examined, five genes showed greater than 2-fold differences in relative expression levels between Nipponbare and 93-11. In particular, two of them, Os01g49110 and Os12g23754, showed 23-fold down-regulation and 18-fold up-regulation with the TE insertion, respectively.
Interestingly, the majority of TIPs in the upstream region of expressed genes are DNA transposons (80.5%, i.e. 62 of 77 TIPs between Nipponbare and 93-11 in the promoter region [Supplemental Table S3]), significantly higher than the average proportion (49.9%, i.e. 1,018 of 2,041 TIPs between Nipponbare and 93-11). Of these, MULEs also account for a relatively higher portion (36.4% in the promoter region versus 13.8% on average). Given the report that the vast majority of Pack-MULE transcripts is initiated from promoters in element sequences (Jiang et al., 2004
Utility of the TIPs
We have identified 691 TIPs between Nipponbare and Guangluai 4 in the 23-Mb collinear regions of chromosome 4 and 2,041 TIPs between the Nipponbare and 93-11 genomes. These TIPs can be used to develop molecular markers. Of the transposon insertions, about half of them were less than 1.5 kb. For these small TE insertions, a single PCR would be feasible, using primers derived from its flanking regions, resembling simple sequence repeat polymorphisms. For larger TE insertions, two rounds of PCR need to be performed. In the first reaction, amplification is a test using primers flanking the insertion. In the second reaction, one primer is designed from the flanking sequence and the other recognizes the LTR/TIR sequence of the corresponding TE. Then, as a codominant marker system, the different allelic states (presence and absence of the transposon insertion) at a locus will be revealed (Flavell et al., 1998 Although the TIPs identified here are based on differences between only one japonica and two indica varieties, a large portion of those TIPs could be applicable to combinations of japonica and its related wild species (e.g. Oryza rufipogon) or other combinations of japonica and indica cultivars, because RBIPs and numerous DNA TIPs identified here can be regarded as events occurring in the recent past (after the divergence between indica and japonica). For example, among 2,041 TIPs between Nipponbare and 93-11, 94 are located in regions that have corresponding Guangluai 4 BAC sequences. After a comparison with Guangluai 4, we found that 85.1% (80 of 94 polymorphisms) were also polymorphic between Nipponbare and Guangluai 4.
The marker system based on TE insertions offers an ideal tool to evaluate the transposition history, frequency, and timing of mobile elements in rice. Since the patterns of the RBIPs can reveal the relationship among observed cultivars in a phylogenetically meaningful way, phylogenetic and biodiversity studies can be carried out using RBIPs. Vitte et al. (2004)
TE insertion polymorphisms distribute quite unevenly. It may reflect the local variation in TE insertions caused by differences in chromosome physiology (e.g. chromatin features, euchromatin region, or heterochromatin region). But we also observed that regions of low TE insertion polymorphism appeared to be correlated with regions of low SNPs. Occasional crosses between ancestors of 93-11 and Nipponbare may have happened, leading to the introgression of chromosomal segments. This may explain why there are 15 retrotransposon insertions absent in Guangluai 4 but shared by Nipponbare and 93-11. In this study, two indica varieties, 93-11 and Guangluai 4, were investigated; they were the paternal cultivar of a superhybrid and a cultivar widely grown in China several decades ago, respectively. To our surprise, the TIPs between them are not rare. We found that a number of de novo transposon insertions occurred only in Guangluai 4, most of which date back to more than 0.1 million years ago. Although there may exist limited introgression, it still cannot account for the deep divergence between 93-11 and Guangluai 4 genomes. Therefore, the radiation of the indica genomes occurred unambiguously earlier than the domestication of rice, supporting multiple domestications of O. sativa.
After the completion of rice genome sequencing, the content of all types of transposons in the rice genome is estimated to be 35%. Now, our mining provided an opportunity to measure the level of variation caused by TIPs in rice varieties. The 23-Mb collinear regions of Nipponbare and Guangluai 4 are both derived from high-quality BAC-based sequences; therefore, the number of TIPs identified between Nipponbare and Guangluai 4 can be used as a gold standard to estimate the number of TIPs in the rice genome. Because the 23-Mb regions of chromosome 4 represent about 6% of the rice genome, there would be more than 11,517 TIPs in the rice genome on average (691/6% = 11,517), accounting for 53.5 Mb of DNA sequence (3.21/6% = 53.5 Mb). Hence, more than 14% of the genomic DNA sequences, which are different between indica and japonica, are due to the movements of TE. We propose that the average density of TIPs is relatively comparable between genomes of different varieties, although the 2,041 polymorphic transposon insertions identified between Nipponbare and 93-11 account for about one-sixth of the expected number. This is mainly due to the shotgun assemblies of 93-11. Despite the 6.28x coverage, the International Rice Genome Sequencing Project estimated that the nonredundant coverage of the indica 93-11 assembly was 69%. Moreover, it consists of thousands of small pieces of contigs, and misassembly of large pieces is also likely to happen. In our study, we found that the same contigs of 93-11 can be aligned to different regions in the Nipponbare genome, and several polymorphic contigs of 93-11 can be aligned to one region of the Nipponbare genome. So we had to apply relatively strict selection criteria in order to improve the accuracy of our investigation; consequently, we missed some TIPs. These observations indicate that the draft sequences of 93-11, although providing a genome-wide survey of TIPs, fell short of ascertaining all variation between subspecies.
As described above, the approach we used to recognize transposons primarily relied on sequence similarity with known repeats, and the increased improvement of the rice TE database allowed the identification of most TEs. Although this homology-based method with TSD detection performed well here, the genome comparison followed by inner structure analysis provided an innovative and complementary method for TE discovery, especially in detecting new TE families and instances (Caspi and Pachter, 2006
In this study, we showed that more than 10% of TIPs occurred in expressed gene regions. We provided a number of cases to exemplify a wide spectrum of changes induced by transposon insertions, involving deleterious effects, alternative splicing, shift of the transcription initiation site, loss or gain of exons, and so on. We estimated that the alterations at the level of the cDNA sequences between rice subspecies could add up to more than 400 (approximately 1% of all rice genes; Table IV). This is still a conservative estimation, because the variations identified in our study were mainly based on rice Fl-cDNA or EST sequences, and those lacking transcript evidence in the database were not examined. Moreover, we used quantitative RT-PCR to examine the relative transcription levels of 15 genes that possessed TIPs in the upstream regions between Nipponbare and 93-11. At least two genes showed dramatic changes in expression levels between the two cultivars. Therefore, the TIP-influenced expression difference could potentially serve as an important source of genetic variation. An explicit experimental evaluation of the impact of TIPs on global gene expression, however, awaits full-scale transcriptional profiling in future work.
Among thousands of polymorphic TE insertions identified, we did not find any elements carrying a gene fragment and creating a new gene, if the TE-related genes, like transposases taken along by them, were neglected. To our surprise, two transcription factors regulating light signaling in Arabidopsis (Arabidopsis thaliana) were reported to be co-opted from a transposase (Lin et al., 2007
Genomic Sequence Alignments and Identification of Indels
Physical mapping of the rice (Oryza sativa) indica Guangluai 4 chromosome 4 was conducted by an integrated approach (Zhao et al., 2002 The alignment results of BGI 93-11 contigs and Nipponbare pseudomolecules, which were generated by the software nucmer, were downloaded using the GFF Dumper on the TIGR Genome Browser. We found that a small quantity of anchor results were self-contradictory; that is, two 93-11 contigs that localized on the same location yielded opposite patterns (insertion or no insertion in japonica). Hence, a perl script was written to wipe off all of these abnormal anchor results. We used only maximal exact matches that were unique in both the query and reference sequences as the alignment anchors to avoid potential errors caused by misassembly or inaccurate anchoring. Then, another script was developed to mine all of the indels of more than 100 bp based on the renewed anchor results. The indels of more than 100 bp were further confirmed by BLAST2. The indica Guangluai 4 BACs were obtained from http://www.ncgr.ac.cn/chinese/databasei.htm. The genomic sequences of japonica Nipponbare were downloaded from http://www.tigr.org/tdb/e2k1/osa1, and the indica 93-11 sequences were downloaded from ftp://ftp.genomics.org.cn.
For each insertion region identified above, the query sequence, composed of the insertion region and its flanking DNA (both 100 bp upstream and 100 bp downstream), was extracted and used to screen against all known TE sequences using RepeatMasker (open version 3.0.5). The known TE sequences included all transposons and transposon-like elements collected by Repbase (volume 12, issue 9; http://www.girinst.orgthe RTEdb (Juretic et al., 2004
To determine the states (presence or absence) of transposon insertions identified between Nipponbare and Guangluai 4 in the corresponding regions of indica 93-11, we conducted sequence comparisons targeting the transposon insertion sites. For each transposon insertion identified between Nipponbare and Guangluai 4, three unique 200-bp sequences were extracted and used to search against the assembled indica 93-11 contigs, using BLASTN with a threshold e-value of 10–20. For the first two unique 200-bp sequences, each was composed of 100 bp of one transposon terminal sequence and 100 bp of its flanking DNA, from one genome with the TE insertion. The third one was a 200-bp sequence free of transposon insertion, from the other genome without the TE insertion. An insertion of a TE was considered to be shared in the indica 93-11 genome when either of the first two unique sequences was found in assembled indica 93-11 contigs (a threshold identity percentage of 95%). Alternatively, an insertion was judged to be absent in indica 93-11 when the third unique sequence was found in 93-11 contigs, with the same threshold value. We regarded the insertion as not having its explicit target region if the BLAST search did not yield any expected result or yielded two equally perfect hits, indicating both the presence and absence of the insertion in the 93-11 genome. Then, the extracted region and its clear ortholog were aligned using BLAST2 to check for the presence or absence of the insertion. Meanwhile, we also used the anchor result mentioned above to seek the corresponding locations of 93-11 contigs to confirm the states of the TE insertions in 93-11 targeted regions and eliminate all potential artifacts.
The LTR-retrotransposon insertions, which were identified between Nipponbare and Guangluai 4, were taken out for further analysis. Sequence comparisons and structural analysis were used to classify solo LTRs, intact LTR elements, and other truncated elements. Intact LTR retrotransposons were identified by the LTR_Finder program (Xu and Wang, 2007
All publicly available rice ESTs were obtained from the National Center for Biotechnology Information EST database (http://www.ncbi.nlm.nih.gov/projects/dbEST/). Full-length cDNAs of both KOME (http://red.dna.affrc.go.jp/cDNA/; japonica Nipponbare; Rice Full-Length cDNA Consortium, 2003
BLASTp search against all of the annotated proteins in the whole rice genome at TIGR (release 5) was conducted using the wheat (Triticum aestivum) XIP-I protein (GenBank accession no. CAD19479) as the query. The search resulted in the identification of 30 proteins with an e-value cutoff of 1E-5. Among them, Os06g25010 and Os06g24990, as two gene fragments after TE insertion, were replaced by indica XIP-I. The protein is coded by the longest ORF within an indica rice full-length cDNA (GenBank accession no. CT836240), and there are no nucleotide differences between the indica cDNA and 93-11 genome sequences. Os12g18750 was removed because it shows an incomplete domain and low homology when checked individually. Those protein sequences were aligned using ClustalW (Thompson et al., 1994
From Rice Multi-platform Microarray Search, we got the two Affymetrix probe set identifiers, OsAffx-27816-1-S1_at and OsAffx.27815.1.S1_s_at, which represent the two gene fragments of indica XIP-I separated by a TE insertion. The probes in the two probe sets were remapped to the rice genomes, Nipponbare pseudomolecules and 93-11 contigs, by BLASTN. We downloaded the microarray data files of each experiment from the GEO Web site (http://www.ncbi.nlm.nih.gov/geo/). Overall, there are 57 chips of indica IR64 (45 from GSE6893 and 12 from GSE6901) and 45 chips of japonica Nipponbare (13 from GSE7951, 4 from GSE6908, 24 from GSE6719, and 4 from GSE6720). The signal intensity data were extracted using a perl script. Pearson's correlation coefficient was applied in linear correlation analysis. The significance of the slope of the regression line was determined by the R language package.
RNAs of japonica Nipponbare and indica Guangluai 4 were extracted from 14-d-old rice seedlings. First-strand cDNA was synthesized by SuperScript II reverse transcriptase (Invitrogen) at 42°C for 1 h. Genomic DNA of the 10 indica (Guangluai 4, 93-11, Teqing, Kasalath, TN1, Liantangzao, Zhefu 802, Nanjing 6, IR36, and Zhaiyeqing 8) and 11 japonica (Nipponbare, Yangguang, Qiuguang, Shiokari, Lansheng, Juangguang, Jinnanfeng, Suyunuo, Nongken 58, Yueguang, and Xueheaizao) varieties was prepared from 2-week-old rice seedling shoots as described (Liu et al., 2007
RNAs of 14-d-old seedlings of indica 93-11 and japonica Nipponbare were extracted as described above. Quantitative PCR was performed on the Applied Biosystems 7500 real-time PCR System using SYBR Premix Ex Taq (TaKaRa). The PCR thermal cycle conditions were as follows: denaturing at 95°C for 10 s and 40 cycles at 95°C for 5 s and 60°C for 34 s. The two rice genes used as internal reference genes to calculate relative transcript levels were UBQ5 (AK061988) and eEF-1
The following materials are available in the online version of this article.
We thank Yiwen Yan for taking care of rice plants and Yajun Hu for help in the experiments. We also thank Dr. Thomas Bureau and his students from McGill University for their helpful comments on the detection of transposon MULEs. Received April 21, 2008; accepted July 17, 2008; published July 23, 2008.
1 This work was supported by the Ministry of Science and Technology of China (grant nos. 2006AA10A102 and 2005CB120805) and the Chinese Academy of Sciences (grant no. KSCW2–YW–N–024). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Bin Han (bhan{at}ncgr.ac.cn).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.108.121491 * Corresponding author; e-mail bhan{at}ncgr.ac.cn.
Altschul SF, Madden TL, Schaffer AA, Zhang JH, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402 Batzer MA, Deininger PL (2002) Alu repeats and human genomic diversity. Nat Rev Genet 3: 370–379[CrossRef][Web of Science][Medline] Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SE (2004) Natural genetic variation caused by transposable elements in humans. Genetics 168: 933–951 Bennetzen JL (2000) Transposable element contributions to plant gene and genome evolution. Plant Mol Biol 42: 251–269[CrossRef][Web of Science][Medline] Bergman CM, Quesneville H (2007) Discovering and detecting transposable elements in genome sequences. Brief Bioinform 8: 382–392 Britten RJ, Rowen L, Williams J, Cameron RA (2003) Majority of divergence between closely related DNA samples is due to indels. Proc Natl Acad Sci USA 100: 4661–4665 Caspi A, Pachter L (2006) Identification of transposable elements using multiple alignments of related genomes. Genome Res 16: 260–270 Ding J, Araki H, Wang Q, Zhang P, Yang S, Chen JQ, Tian D (2007) Highly asymmetric rice genomes. BMC Genomics 8: 154[CrossRef][Medline] Du CG, Swigonova Z, Messing J (2006) Retrotranspositions in orthologous regions of closely related grass species. BMC Evol Biol 6: 62[CrossRef][Medline] Durand A, Hughes R, Roussel A, Flatman R, Henrissat B, Juge N (2005) Emergence of a subfamily of xylanase inhibitors within glycoside hydrolase family 18. FEBS J 272: 1745–1755[CrossRef][Medline] Elliott GO, Hughes RK, Juge N, Kroon PA, Williamson G (2002) Functional identification of the cDNA coding for a wheat endo-1,4-beta-D-xylanase inhibitor. FEBS Lett 519: 66–70[CrossRef][Web of Science][Medline] Felsenstein J (1985) Confidence intervals on phylogenies: an approach using the bootstrap. Evolution Int J Org Evolution 39: 783–791[CrossRef][Web of Science] Feltus FA, Wan J, Schulze SR, Estill JC, Jiang N, Paterson AH (2004) An SNP resource for rice genetics and breeding based on subspecies Indica and Japonica genome alignments. Genome Res 14: 1812–1819 Flatman R, McLauchlan WR, Juge N, Furniss C, Berrin JG, Hughes RK, Manzanares P, Ladbury JE, O'Brien R, Williamson G (2002) Interactions defining the specificity between fungal xylanases and the xylanase-inhibiting protein XIP-I from wheat. Biochem J 365: 773–781[Web of Science][Medline] Flavell AJ, Knox MR, Pearce SR, Ellis THN (1998) Retrotransposon-based insertion polymorphisms (RBIP) for high throughput marker analysis. Plant J 16: 643–650[CrossRef][Web of Science][Medline] Goesaert H, Gebruers K, Courtin CM, Delcour JA (2005) Purification and characterization of a XIP-type endoxylanase inhibitor from rice (Oryza sativa). J Enzyme Inhib Med Chem 20: 95–101[CrossRef][Web of Science][Medline] Han B, Xue YB (2003) Genome-wide intraspecific DNA-sequence variations in rice. Curr Opin Plant Biol 6: 134–138[CrossRef][Web of Science][Medline] Igawa T, Tokai T, Kudo T, Yamaguchi I, Kimura M (2005) A wheat xylanase inhibitor gene, Xip-I, but not Taxi-I, is significantly induced by biotic and abiotic signals that trigger plant defense. Biosci Biotechnol Biochem 69: 1058–1063[CrossRef][Medline] International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome. Nature 436: 793–800[CrossRef][Web of Science][Medline] Jain M, Nijhawan A, Tyagi AK, Khurana JP (2006) Validation of housekeeping genes as internal control for studying gene expression in rice by quantitative real-time PCR. Biochem Biophys Res Commun 345: 646–651[CrossRef][Web of Science][Medline] Jiang N, Bao ZR, Zhang XY, Eddy SR, Wessler SR (2004) Pack-MULE transposable elements mediate gene evolution in plants. Nature 431: 569–573[CrossRef][Web of Science][Medline] Jing RC, Knox MR, Lee JM, Vershinin AV, Ambrose M, Ellis THN, Flavell AJ (2005) Insertional polymorphism and antiquity of PDR1 retrotransposon insertions in Pisum species. Genetics 171: 741–752 Juretic N, Bureau TE, Bruskiewich RM (2004) Transposable element annotation of the rice genome. Bioinformatics 20: 155–160 Juretic N, Hoen DR, Huynh ML, Harrison PM, Bureau TE (2005) The evolutionary fate of MULE-mediated duplications of host gene fragments in rice. Genome Res 15: 1292–1297 Kang S, Lebrun MH, Farrall L, Valent B (2001) Gain of virulence caused by insertion of a Pot3 transposon in a Magnaporthe grisea avirulence gene. Mol Plant Microbe Interact 14: 671–674[Web of Science][Medline] Kashkush K, Feldman M, Levy AA (2003) Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nat Genet 33: 102–106[CrossRef][Web of Science][Medline] Kazazian HH (2004) Mobile elements: drivers of genome evolution. Science 303: 1626–1632 Kidwell MG, Lisch D (1997) Transposable elements as sources of variation in animals and plants. Proc Natl Acad Sci USA 94: 7704–7711 Kovach MJ, Sweeney MT, McCouch SR (2007) New insights into the history of rice domestication. Trends Genet 23: 578–587[CrossRef][Web of Science][Medline] Lin RC, Ding L, Casola C, Ripoll DR, Feschotte C, Wang HY (2007) Transposase-derived transcription factors regulate light signaling in Arabidopsis. Science 318: 1302–1305 Liu XH, Lu TT, Yu SL, Li Y, Huang YC, Huang T, Zhang L, Zhu JJ, Zhao Q, Fan DL, et al (2007) A collection of 10,096 indica rice full-length cDNAs reveals highly expressed sequence divergence between Oryza sativa indica and japonica subspecies. Plant Mol Biol 65: 403–415[CrossRef][Web of Science][Medline] Ma JX, Bennetzen JL (2004) Rapid recent growth and divergence of rice nuclear genomes. Proc Natl Acad Sci USA 101: 12404–12410 McClintock B (1948) Mutable loci in maize. Carnegie Inst Washington Year Book 47: 155–169 Meyers BC, Tingley SV, Morgante M (2001) Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome. Genome Res 11: 1660–1676 Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, et al (2007) The TIGR rice genome annotation resource: improvements and new features. Nucleic Acids Res 35: D883–D887[CrossRef][Web of Science][Medline] Park SM, Kim DH, Truong NH, Itoh Y (2002) Heterologous expression and characterization of class III chitinases from rice (Oryza sativa L.). Enzyme Microb Technol 30: 697–702[CrossRef][Web of Science] Payan F, Leone P, Porciero S, Furniss C, Tahir T, Williamson G, Durand A, Manzanares P, Gilbert HJ, Juge N, et al (2004) The dual nature of the wheat xylanase protein inhibitor XIP-I: structural basis for the inhibition of family 10 and family 11 xylanases. J Biol Chem 279: 36029–36037 Pearce SR, Stuart-Rogers C, Knox MR, Kumar A, Ellis THN, Flavell AJ (1999) Rapid isolation of plant Ty1-copia group retrotransposon LTR sequences for molecular marker studies. Plant J 19: 711–717[CrossRef][Web of Science][Medline] Pooma W, Gersos C, Grotewold E (2002) Transposon insertions in the promoter of the Zea mays a1 gene differentially affect transcription by the Myb factors P and C1. Genetics 161: 793–801 Rice Full-Length cDNA Consortium (2003) Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science 301: 376–379 Rice P, Longden I, Bleasby A (2000) EMBOSS: the European molecular biology open software suite. Trends Genet 16: 276–277[CrossRef][Web of Science][Medline] Saitou N, Nei N (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4: 406–425[Abstract] Sakai H, Tanaka T, Itoh T (2007) Birth and death of genes promoted by transposable elements in Oryza sativa. Gene 392: 59–63[CrossRef][Web of Science][Medline] Sang T, Ge S (2007) Genetics and phylogenetics of rice domestication. Curr Opin Genet Dev 17: 533–538[CrossRef][Web of Science][Medline] Shen YJ, Jiang H, Jin JP, Zhang ZB, Xi B, He YY, Wang G, Wang C, Qian LL, Li X, et al (2004) Development of genome-wide DNA polymorphism database for map-based cloning of rice genes. Plant Physiol 135: 1198–1205 Stoneking M, Fontius JJ, Clifford SL, Soodyall H, Arcot SS, Saha N, Jenkins T, Tahir MA, Deininger PL, Batzer MA (1997) Alu insertion polymorphisms and human evolution: evidence for a larger population size in Africa. Genome Res 7: 1061–1071 Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599 Tang T, Lu J, Huang J, He J, McCouch SR, Shen Y, Kai Z, Purugganan MD, Shi S, Wu CI (2006) Genomic variation in rice: genesis of highly polymorphic linkage blocks during domestication. PLoS Genet 2: e199[CrossRef][Medline] Thompson JD, Higgins DG, Gibson TJ (1994) Clustal-W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673–4680 Tokunaga T, Esaka M (2007) Induction of a novel XIP-type xylanase inhibitor by external ascorbic acid treatment and differential expression of XIP-family genes in rice. Plant Cell Physiol 48: 700–714 Vicient CM, Suoniemi A, Namthamat-Jonsson K, Tanskanen J, Beharav A, Nevo E, Schulman AH (1999) Retrotransposon BARE-1 and its role in genome evolution in the genus Hordeum. Plant Cell 11: 1769–1784 Vitte C, Ishii T, Lamy F, Brar D, Panaud O (2004) Genomic paleontology provides evidence for two distinct origins of Asian rice (Oryza sativa L.). Mol Genet Genomics 272: 504–511[CrossRef][Web of Science][Medline] Wang XS, Zhao XQ, Zhu J, Wu WR (2005) Genome-wide investigation of intron length polymorphisms and their potential as molecular markers in rice (Oryza sativa L.). DNA Res 12: 417–427 White SE, Habera LF, Wessler SR (1994) Retrotransposons in the flanking regions of normal plant genes: a role for copia-like elements in the evolution of gene structure and expression. Proc Natl Acad Sci USA 91: 11792–11796 Wu TD, Watanabe CK (2005) GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21: 1859–1875 Xie Z, Zhang ZL, Zou XL, Huang J, Ruas P, Thompson D, Shen QJ (2005) Annotations and functional analyses of the rice WRKY gene superfamily reveal positive and negative regulators of abscisic acid signaling in aleurone cells. Plant Physiol 137: 176–189 Xu Z, Wang H (2007) LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res 35: W265–W268 Yu J, Wang J, Lin W, Li SG, Li H, Zhou J, Ni PX, Dong W, Hu SN, Zeng CQ, et al (2005) The genomes of Oryza sativa: a history of duplications. PLoS Biol 3: 266–281[CrossRef][Web of Science] Zhao Q, Zhang Y, Cheng ZK, Chen MS, Wang SY, Feng Q, Huang YC, Li Y, Tang YS, Zhou B, et al (2002) A fine physical map of the rice chromosome 4. Genome Res 12: 817–823 Zuckerkandl E, Pauling L (1965) Evolutionary divergence and convergence in proteins. In V Bryson, HJ Vogel, eds, Evolving Genes and Proteins. Academic Press, New York, pp 97–166 This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|