|
|
||||||||
|
First published online May 11, 2007; 10.1104/pp.107.098640 Plant Physiology 144:1632-1641 (2007) © 2007 American Society of Plant Biologists OPEN ACCESS ARTICLE
Comparative Cross-Species Alternative Splicing in Plants1,[W],[OA]Department of Plant Sciences, Weizmann Institute of Science, Rehovot 76100, Israel (H.N.-G., N.L., R.F.); and Department of Microbiology and Immunology, Ben Gurion University of the Negev, Beer-Sheva 84105, Israel (E.R.)
Alternative splicing (AS) can add significantly to genome complexity. Plants are thought to exhibit less AS than animals. An algorithm, based on expressed sequence tag (EST) pairs gapped alignment, was developed that takes advantage of the relatively small intron and exon size in plants and directly compares pairs of ESTs to search for AS. EST pairs gapped alignment was first evaluated in Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and tomato (Solanum lycopersicum) for which annotated genome sequence is available and was shown to accurately predict splicing events. The method was then applied to 11 plant species that include 17 cultivars for which enough ESTs are available. The results show a large, 3.7-fold difference in AS rates between plant species with Arabidopsis and rice in the lower range and lettuce (Lactuca sativa) and sorghum (Sorghum bicolor) in the upper range. Hence, compared to higher animals, plants show a much greater degree of variety in their AS rates and in some plant species the rates of animal and plant AS are comparable although the distribution of AS types may differ. In eudicots but not monocots, a correlation between genome size and AS rates was detected, implying that in eudicots the mechanisms that lead to larger genomes are a driving force for the evolution of AS.
Alternative RNA processing pathways result in the combining of different splice junctions that are present in pre-mRNA transcripts. In this way, a genetic unit can have a variety of mRNA and protein products, thus expanding the potential informational content of eukaryotic genomes. Recent evidence indicates a high incidence (up to 60%) of alternative splicing (AS) is present in the human genome, predominantly in the form of exon skip while a minor form is of the type called intron retention (5%16%; Kan et al., 2002
Plants are thought to exhibit less AS and, unexpectedly, analysis in Arabidopsis (Arabidopsis thaliana) showed that intron retention is the most common type of AS, comprising 45% of the AS types (Iida et al., 2004
Comparison of AS between different species is of interest. At the genome level it can teach us about the evolution of AS, the conservation of mechanisms that control AS, and its biological consequences for a species. At the gene level, conservation of AS can teach us about gene function, the evolutionary history of particular genes, and of their gene families (Modrek and Lee, 2003
To carry out cross-species comparison on a global scale, rates of AS have been defined as the number of AS events that occur in a set number of loci or genes and by restricting the analysis to a constant number of genes, EST, or both. In this way, species with vastly different total EST coverage can be compared. It is of interest that using 650 cDNA in a comparison of AS rates from Arabidopsis, Caenorhabditis elegans, Drosophila melanogaster, and a few mammals, only small differences in species-specific rates were detected with the exception of Arabidopsis, which showed a much lower AS rate (Brett et al., 2002
It is of interest to expand the comparisons of AS rates to additional plant species, however, discovery of AS has relied on aligning EST contigs or cloned cDNA to annotated genomic sequence (Mironov et al., 1999
Types of AS Detected by EST Pairs Gapped Alignment
EST pairs gapped alignment (EPGA) is an algorithm meant to search for RNA transcripts that are the result of AS. EPGA looks for ESTs sharing two matching regions that flank a discontinuity in the alignment arising from an indel, as shown in Figure 1A
. In theory, EPGA can accurately detect exon skips, intron retention, and 5' and/or 3' alternative splice sites (Fig. 1B). In practice, to avoid indels that are a result of errors in EST sequencing, the minimum indel size was set empirically to 5 bp. It has been estimated that alternative 5' and 3' of 3 to 4 bp size make up less than 15% of the total alternative 5' and 3' type of AS (Campbell et al., 2006
Quality-Control Criteria for Selecting EST and dbESTs The accuracy by which EPGA detects exon skips, intron retention, and 5' and/or 3' alternative splice sites will be a function of the quality and uniformity of the dbESTs. Starting with a survey that included all available eudicot and monocot dbEST (Supplemental Table S1) standard quality-control parameters for accepting EST, i.e. elimination or tailoring of EST with high error, vector sequence, or poly A, were applied (see "Materials and Methods"). To control possible DNA contamination, in the absence of available genomes, we hypothesized that libraries that contained a high proportion of multiple indels may be contaminated with genomic sequence. Therefore, libraries containing more than 1% EST pairs with multiple indels were not used. Using these criteria, 57 out of 711 of the libraries were disqualified (Supplemental Table S1). Furthermore, for the same reason, in all our analyses using the EPGA algorithm we reject all EST pairs that contain multiple indels. To achieve uniformity between dbEST, the average length of ESTs in a database was required to be within 2 SDs of the average EST size of all the libraries (dbEST average size = 513). This is important as average EST size will dictate the number of exons/introns queried. Another basis for uniformity is requiring that dbEST represent cDNA made from diverse tissue types. This was examined in two ways. First, ESTs from each ecotype/cultivar were subjected to clustering as the number of clusters reflects the dbEST diversity. Low cluster number can represent enrichment for particular tissue types while high cluster number can represent genomic contamination. Thus, ecotype/cultivar with clusters that are more than 2 SDs from the average cluster number were not used (average dbEST cluster number = 2,929 using 20,000 ESTs). Second, only libraries containing a robust mixture of EST from diverse plant elements were selected. dbEST in which more than 20% of the libraries were from flowers or fruits were not used to avoid tissue-specific bias in AS (Supplemental Table S1).
EPGA analysis can be meaningful for comparative analysis of different species as long as the indel size, which represents a full or partial exon or intron, is shorter than the typical EST size. To establish this, a variety of species for which full or partial genomic sequence is available were examined for constitutive median intron and exon size by alignment of EST to genomic data (see "Materials and Methods"). As shown in Table I
for selected eudicot and monocot plants, the intron and exon median size was found to have a range of 100 to 200 bp. The values obtained here for Arabidopsis and rice exon and intron median size are similar to sizes found in recent analysis of whole genomes (Collins and Penny, 2006
Calibration of EPGA The EPGA algorithm was calibrated by comparing the results from direct EST comparisons to the results of EST genomic sequence alignments for the species Arabidopsis, rice (japonica group), and tomato. EST from Arabidopsis Columbia ecotype (National Center for Biotechnology Information [NCBI] ESTs databases 23/12/2006) yielded 111,142 ESTs after quality-control processing (see "Materials and Methods"; Supplemental Table S1). EPGA was then applied to the ESTs and detected 6,161 alternatively derived EST pairs. Additionally, all of the Arabidopsis ESTs were clustered in 25,616 clusters of which 11,466 contained more than one EST. Out of these, 467 clusters contained one or more alternatively derived EST pairs (Table II ).
The veracity of EST processing by EPGA was examined by comparison of each EST pair to the genome, at a minimum identity of 95% using the BLAT program. Pairs aligning to the same genome location indicate that EPGA identified transcripts originating from the same loci. Of the 467 splice sites, 76% (353 pairs) were found to be authentic AS candidates by the following criteria: Each EST in the pair aligned to the same genome location, with consensus intron signals (Table II). Another 64/467 (14%) pairs align to the same genome location but contained nonconsensus intron border (not GT-AG, GC-AG, AT-AC). It should be noted that nonconsensus intron borders are present in The Institute for Genomic Research (http://www.tigr.org/tdb/e2k1/ath1/Arabidopsis_nonconsensus_splice_sites.shtml) and in other databases (Larkin and Park, 1999 A similar process was carried out using EST from the rice (japonica group) database (Nipponbare strain). Of 892,016 ESTs available a random sample of 111,502 were chosen, comparable in size to that in Arabidopsis, and used for genome verification. In this case, the accuracy of EPGA is 70% without accepting noncanonical intron borders or 87% if noncanonical intron borders are accepted (Table II). For each pair the AS type was determined, as illustrated in Figure 1B, and the results are summarized in Table II. This process was also carried out for tomato for which only partial genomic sequence is available. The distribution of AS types was found to be similar to Arabidopsis and rice.
The proportion of intron retention reported by EPGA is high (above 60% of the total AS types) mainly due to the lack of retrieval of splicing events at the transcript termini. As shown previously (Ner-Gaon et al., 2004 The distribution of all indel sizes detected by EPGA was analyzed and a graphical presentation of all indel sizes retrieved from Arabidopsis and rice is illustrated together with constitutive introns and exons (Fig. 2 ). The distribution for indel sizes between 60 and 120 bp follows the distribution of the constitutive intron size in both species more closely than that of the exon sizes. Another peak in the indel size at below 40 bp in size is the result of 5' and 3' alternative splice sites. Taken together, these results indicate that EPGA can faithfully report genome-wide splicing events within the median distribution of exons and introns for the model eudicot and monocot species.
Comparative AS Rates and EST Database Size
To be able to compare AS rate in other plant species for which sequenced genomes are not available, we define the AS rate as the percentage of clusters that contain an AS event out of the total number of clusters retrieved with more than one EST. Thus, in Arabidopsis and rice the AS rate, using more than 110,000 ESTs, was 4.1 and 5.5, respectively (Table II). In tomato, the AS rate was 10.2, when using about 90,000 ESTs. Clusters do not necessarily indicate gene units, hence, the values of AS rates as defined do not specify the rate per gene (i.e. 10%20% in Arabidopsis and rice; Wang and Brendel, 2006
Estimation of AS Rates in Plant Species The EPGA algorithm was applied to other species to ascertain their relative AS rates. Species with recent polyploid ancestry (e.g. potato [Solanum tuberosum] and wheat [Triticum aestivum]) were avoided to prevent potential complications arising from comparisons of transcripts of genes arising from homologous chromosomes. To maintain sufficient sampling size, only ecotypes/cultivars that include at least 40,000 high quality ESTs were used. In addition, the different ecotypes/cultivars were processed separately to avoid polymorphisms that may originate from evolution-derived divergence rather than from AS and to enable cross-cultivar comparison. We first examined the variation in AS rate as a function of recurrent sampling size. As shown in Figure 3, A and B , for all monocot and eudicot species, the ratio of AS rates between species remains relatively constant. This indicates that the rate of AS discovery is similar within these sampling sizes and a sampling size of 20,000 was adopted. Table III summarizes the results for eudicots (groups Rosids and Asterids) that were applied to all species with sufficient EST. Due to their more recent evolutionary origin one may expect that different cultivars of the same species would exhibit similar AS rates. Indeed, the AS rates of the different strains were very similar, differing by 2% in tomato (micro-tom compared to ta496), 13% in soybean (Glycine max; Williams and williams82), and 30% in Arabidopsis (columbia versus wassilewskija). The relatively similar AS rates detected between cultivars lends credence to the significance of AS rates reported by the EPGA algorithm. In comparison, the differences in AS rates between species can be higher and reaches 3.6-fold (e.g. compare lettuce [Lactuca sativa] to Arabidopsis strain Columbia). Interestingly, when AS rates are graphed relative to the genome size, a linear correlation is obtained (R2 = 0.75, P value = 0.001; Fig. 4A ). Analysis of the lower sampling sizes of 15,000 and 10,000 ESTs showed a similar linear correlation with reduced R values of 0.67 and 0.53, respectively. Taken together, the results indicate that the majority of variance in AS rate can be attributed to the increase in eudicot genome size.
Similar analysis was carried out in monocot species and their cultivars for which sufficient EST are available (Table IV). Comparison of AS rates between monocot species (Poaceae) show 20% variance in rates between strains of the same species. Furthermore, AS rates in monocots are not correlated with genome size (Fig. 4B).
AS Rates and Genome Complexity
AS can increase the complexity of a genome, it is therefore expected that increased AS rates would be correlated with increased organism complexity. Indeed, simple eukaryotes like yeast show reduced introns and AS rate relative to animals (Brett et al., 2002
The species surveyed represent not only a large span of vascular plant evolutionary divergence, i.e. monocots and eudicots, but also includes more than a 40-fold range of genome sizes. Remarkably, the AS rate was correlated with eudicot genome size. Larger genome size does not necessarily indicate more genes. For example, soybean that has a genome 10-fold larger than Arabidopsis, is considered to have no more than 25% of additional genes (Young et al., 2005
A major exception to this trend is in monocots, in which the increased size seems to have saturated in many of the species for additional AS increment. With respect to the increase in genome size in cereals, it has been noted that the preponderance of expansion is due to nested retrotransposons, i.e. insertions within each other and not within genes (Shirasu et al., 2000
Inspection of AS rates between different cultivars of the same species reveals similar AS rates within the majority of eudicot species. As most cultivars are of very recent origin the similarity in rates is to be expected. When comparing AS rate in one species, the two ecotypes of Arabidopsis, Columbia and Wassilewskija, show the highest difference (30%) in their AS rate. Genetic diversity between Arabidopsis ecotypes as shown by amplified fragment length polymorphism analysis points to distinct subgroups or genotypes. The intraecotypic differences reflect natural variation that is fixed in discrete genotypes because of the self-fertilizing nature of Arabidopsis (Breyne et al., 1999
Despite the lack of a complex developmental style and sophisticated neuronal and adaptive immune systems that typify mammalians, we show here that plants can host a high degree of genome complexity. Plants, as sessile organisms, must adapt their growth and metabolic style to a changing environment. The great diversity in AS rates indicates that it can play a role in plant adaptation. Interestingly, environmental stress has been shown to activate plant retrotransposon mobility and activation of genes, suggesting an adaptive advantage to this type of genome restructuring (Kalendar et al., 2000
Data Sources Plant ESTs for eudicots and monocots were obtained from NCBI dbESTs 23/12/2006 (ftp://ftp.ncbi.nih.gov/genbank/). Based on their annotation, plant species were separated by cultivar to create a cultivar-specific database. For the Arabidopsis (Arabidopsis thaliana) genome sequence, the January 22, 2004 version was obtained from The Arabidopsis Information Resource center: ftp.arabidopsis.org/home/tair/Sequences/whole_chromosomes/. For the rice (Oryza sativa) genome sequence, IRGSP Releases Build 4.0, Pseudomolecules of the Rice Genome (rice sp. japonica Nipponbare) was used (http://rgp.dna.affrc.go.jp/E/IRGSP/Build4/build4.html). The Build 4.0 Pseudomolecules were constructed based on the data freeze on January 25, 2005. Programs to extract EST details and genomic location are available upon request from H.N.-G.
As ESTs are the bases for comparison it was important that the quality and the uniformity of the dbEST be rigorously examined according to the following criteria: (1) Poly A from the end of the ESTs was deleted. (2) ESTs were aligned to a vector database that was obtained from NCBI 26/11/03 (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/vector.gz) using BLAT (Kent, 2002 A summary of the above analysis is shown in Supplemental Table S1.
EPGA method was used to identify AS. Pairs of ESTs that share two sequentially matching regions that flank a discontinuity in the alignment were considered to be AS pair. For each dbESTs, BLAT (Kent, 2002 The ESTs of each cultivar were clustered by simple transitive clustering. An EST was considered to belong to a cluster if it aligned to one of the ESTs in that cluster with an overlap of at least 75 bp in total, with minimum identity of 95%. The clusters were defined as AS clusters if they contained at least one AS pair. The percent AS rate was calculated by dividing the number of AS clusters by the total number of clusters with more than one EST. EPGA algorithm was applied to six different random samples, each of 10,000, 15,000, and 20,000 ESTs. The programs for filtering and clustering were all written in PERL, and are available upon request from H.N.-G. The complete analysis of the 20,000 random samples is shown in Supplemental Table S3.
Data for 1C value of genome size were obtained from Plant DNA C-values Database (http://www.kew.org/cvalues/CvalServlet?querytype=2) or from direct sequencing results from NCBI Entrez Genome Project database (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?CMD=search&DB=genomeprj).
The list of all the plant genome accession numbers (GI) is available at ftp://ftp.ncbi.nlm.nih.gov/genomes/PLANTS/BLASTDB/. The tomato (Solanum lycopersicum) genome sequence was downloaded from http://www.sgn.cornell.edu/bulk/input.pl?mode=bac. The ESTs obtained after quality control were applied and were aligned to the genomic sequences using BLAT. Only ESTs that match the genome for at least 75 bp, with a minimum identity of 95%, and had less than 5 bp gap in the EST were used. Gaps of at least 5 bp length in the alignment were considered as constitutive introns. The sizes of the aligned regions between the gaps were registered as constitutive exons. The lengths of the exons and introns were extracted from the alignment results by a program written in PERL.
The following materials are available in the online version of this article.
Received February 27, 2007; accepted April 30, 2007; published May 11, 2007.
1 This work was supported by the Israel Science Foundation (grant no. 388/02) and the Binational Agriculture Research and Development (grant no. IS345403). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Robert Fluhr (robert.fluhr{at}weizmann.ac.il).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.107.098640 * Corresponding author; e-mail robert.fluhr{at}weizmann.ac.il; fax 97289344181.
Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796815[CrossRef][Medline] Aukerman MJ, Hirschfeld M, Wester L, Weaver M, Clack T, Amasino RM, Sharrock RA (1997) A deletion in the PHYD gene of the Arabidopsis Wassilewskija ecotype defines a role for phytochrome D in red/far-red light sensing. Plant Cell 9: 13171326[Abstract] Brett D, Pospisil H, Valcarcel J, Reich J, Bork P (2002) Alternative splicing and genome complexity. Nat Genet 30: 2930[CrossRef][Web of Science][Medline] Breyne P, Rombaut D, Van Gysel A, Van Montagu M, Gerats T (1999) AFLP analysis of genetic diversity within and between Arabidopsis thaliana ecotypes. Mol Gen Genet 261: 627634[CrossRef][Web of Science][Medline] Brown JW, Simpson CG, Thow G, Clark GP, Jennings SN, Medina-Escobar N, Haupt S, Chapman SC, Oparka KJ (2002) Splicing signals and factors in plant intron removal. Biochem Soc Trans 30: 146149[CrossRef][Web of Science][Medline] Buckler ES, Thornsberry JM, Kresovich S (2001) Molecular diversity, structure and domestication of grasses. Genet Res 77: 213218[CrossRef][Web of Science][Medline] Campbell MA, Haas BJ, Hamilton JP, Mount SM, Buell CR (2006) Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics 7: 327[CrossRef][Medline] Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al (2005) The transcriptional landscape of the mammalian genome. Science 309: 15591563 Collins L, Penny D (2006) Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005: investigating the intron recognition mechanism in eukaryotes. Mol Biol Evol 23: 901910[CrossRef][Web of Science][Medline] Croft L, Schandorff S, Clark F, Burrage K, Arctander P, Mattick JS (2000) ISIS, the intron information system, reveals the high frequency of alternative splicing in the human genome. Nat Genet 24: 340341[CrossRef][Web of Science][Medline] Iida K, Go M (2006) Survey of conserved alternative splicing events of mRNAs encoding SR proteins in land plants. Mol Biol Evol 23: 10851094 Iida K, Seki M, Sakurai T, Satou M, Akiyama K, Toyoda T, Konagaya A, Shinozaki K (2004) Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences. Nucleic Acids Res 32: 50965103 Isshiki M, Tsumoto A, Shimamoto K (2006) The serine/arginine-rich protein family in rice plays important roles in constitutive and alternative splicing of pre-mRNA. Plant Cell 18: 146158 Kalbina I, Strid A (2006) Supplementary ultraviolet-B irradiation reveals differences in stress responses between Arabidopsis thaliana ecotypes. Plant Cell Environ 29: 754763[CrossRef][Medline] Kalendar R, Tanskanen J, Immonen S, Nevo E, Schulman AH (2000) Genome evolution of wild barley (Hordeum spontaneum) by BARE-1 retrotransposon dynamics in response to sharp microclimatic divergence. Proc Natl Acad Sci USA 97: 66036607 Kalendar R, Vicient CM, Peleg O, Anamthawat-Jonsson K, Bolshoy A, Schulman AH (2004) Large retrotransposon derivatives: abundant, conserved but nonautonomous retroelements of barley and related genomes. Genetics 166: 14371450 Kalyna M, Lopato S, Voronin V, Barta A (2006) Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins. Nucleic Acids Res 34: 43954405 Kan Z, Garrett-Engele PW, Johnson JM, Castle JC (2005) Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles. Nucleic Acids Res 33: 56595666 Kan Z, Rouchka EC, Gish WR, States DJ (2001) Gene structure prediction and alternative splicing analysis using genomically aligned ESTs. Genome Res 11: 889900 Kan Z, States D, Gish W (2002) Selecting for functional alternative splices in ESTs. Genome Res 12: 18371845 Kashkush K, Feldman M, Levy AA (2003) Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nat Genet 33: 102106[CrossRef][Web of Science][Medline] Kent WJ (2002) BLATthe BLAST-like alignment tool. Genome Res 12: 656664 Kim E, Magen A, Ast G (2007) Different levels of alternative splicing among eukaryotes. Nucleic Acids Res 35: 125131 Kim H, Klein R, Majewski J, Ott J (2004) Estimating rates of alternative splicing in mammals and invertebrates. Nat Genet 36: 915916; author reply 916917[CrossRef][Web of Science][Medline] Lareau LF, Green RE, Bhatnagar RS, Brenner SE (2004) The evolving roles of alternative splicing. Curr Opin Struct Biol 14: 273282[CrossRef][Web of Science][Medline] Larkin PD, Park WD (1999) Transcript accumulation and utilization of alternate and non-consensus splice sites in rice granule-bound starch synthase are temperature-sensitive and controlled by a single-nucleotide polymorphism. Plant Mol Biol 40: 719727[CrossRef][Web of Science][Medline] Leprinc AS, Grandbastien MA, Christian M (2001) Retrotransposons of the Tnt1B family are mobile in Nicotiana plumbaginifolia and can induce alternative splicing of the host gene upon insertion. Plant Mol Biol 47: 533541[CrossRef][Web of Science][Medline] Li J, Li X, Guo L, Lu F, Feng X, He K, Wei L, Chen Z, Qu LJ, Gu H (2006) A subgroup of MYB transcription factor genes undergoes highly conserved alternative splicing in Arabidopsis and rice. J Exp Bot 57: 12631273 Lorkovic ZJ, Barta A (2002) Genome analysis: RNA recognition motif (RRM) and K homology (KH) domain RNA-binding proteins from the flowering plant Arabidopsis thaliana. Nucleic Acids Res 30: 623635 Mansilla A, Lopez-Sanchez C, de la Rosa EJ, Garcia-Martinez V, Martinez-Salas E, de Pablo F, Hernandez-Sanchez C (2005) Developmental regulation of a proinsulin messenger RNA generated by intron retention. EMBO Rep 6: 11821187[CrossRef][Web of Science][Medline] Marillonnet S, Wessler SR (1997) Retrotransposon insertion into the maize waxy gene results in tissue-specific RNA processing. Plant Cell 9: 967978 Meyers BC, Tingey SV, Morgante M (2001) Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome. Genome Res 11: 16601676 Mironov AA, Fickett JW, Gelfand MS (1999) Frequent alternative splicing of human genes. Genome Res 9: 12881293 Modrek B, Lee C (2002) A genomic view of alternative splicing. Nat Genet 30: 1319[CrossRef][Web of Science][Medline] Modrek B, Lee CJ (2003) Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet 34: 177180[CrossRef][Web of Science][Medline] Molina-Cano JL, Moralejo M, Igartua E, Romagosa I (1999) Further evidence supporting Morocco as a centre of origin of barley. Theor Appl Genet 98: 913918[CrossRef][Web of Science] Nagasaki H, Arita M, Nishizawa T, Suwa M, Gotoh O (2005) Species-specific variation of alternative splicing and transcriptional initiation in six eukaryotes. Gene 364: 5362[CrossRef][Web of Science][Medline] Nekrutenko A, Li WH (2001) Transposable elements are found in a large number of human protein-coding genes. Trends Genet 17: 619621[CrossRef][Web of Science][Medline] Ner-Gaon H, Halachmi R, Savaldi-Goldstein S, Rubin E, Ophir R, Fluhr R (2004) Intron retention is a major phenomenon in alternative splicing in Arabidopsis. Plant J 39: 877885[CrossRef][Web of Science][Medline] Sanderson MJ, Thorne JL, Wikstrom N, Bremer K (2004) Molecular evidence on plant divergence times. Am J Bot 91: 16561665 SanMiguel P, Gaut BS, Tikhonov A, Nakajima Y, Bennetzen JL (1998) The paleontology of intergene retrotransposons of maize. Nat Genet 20: 4345[CrossRef][Web of Science][Medline] SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, Melake-Berhan A, Springer PS, Edwards KJ, Lee M, Avramova Z, et al (1996) Nested retrotransposons in the intergenic regions of the maize genome. Science 274: 765768 Shirasu K, Schulman AH, Lahaye T, Schulze-Lefert P (2000) A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res 10: 908915 Singer SS, Mannel DN, Hehlgans T, Brosius J, Schmitz J (2004) From "junk" to gene: curriculum vitae of a primate receptor isoform gene. J Mol Biol 341: 883886[CrossRef][Web of Science][Medline] Sorek R, Ast G (2003) Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res 13: 16311637 Sorek R, Ast G, Graur D (2002) Alu-containing exons are alternatively spliced. Genome Res 12: 10601067 Tenaillon MI, Sawkins MC, Long AD, Gaut RL, Doebley JF, Gaut BS (2001) Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc Natl Acad Sci USA 98: 91619166 Terol J, Castillo MC, Bargues M, Perez-Alonso M, de Frutos R (2001) Structural and evolutionary analysis of the copia-like elements in the Arabidopsis thaliana genome. Mol Biol Evol 18: 882892 Varagona MJ, Purugganan M, Wessler SR (1992) Alternative splicing induced by insertion of retrotransposons into the maize waxy gene. Plant Cell 4: 811820 Vicient CM, Suoniemi A, Anamthawat-Jonsson K, Tanskanen J, Beharav A, Nevo E, Schulman AH (1999) Retrotransposon BARE-1 and its role in genome evolution in the genus Hordeum. Plant Cell 11: 17691784 Wang BB, Brendel V (2006) Genomewide comparative analysis of alternative splicing in plants. Proc Natl Acad Sci USA 103: 71757180 Wessler SR (2001) Plant transposable elements: a hard act to follow. Plant Physiol 125: 149151 Xiao YL, Malik M, Whitelaw CA, Town CD (2002) Cloning and sequencing of cDNAs for hypothetical genes from chromosome 2 of Arabidopsis. Plant Physiol 130: 21182128 Young ND, Cannon SB, Sato S, Kim D, Cook DR, Town CD, Roe BA, Tabata S (2005) Sequencing the genespaces of Medicago truncatula and Lotus japonicus. Plant Physiol 137: 11741181 Zhu W, Schlueter SD, Brendel V (2003) Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping. Plant Physiol 132: 469484 Related articles in Plant Physiol.:
This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|