|
|
||||||||
|
First published online January 12, 2007; 10.1104/pp.106.092460 Plant Physiology 143:1086-1095 (2007) © 2007 American Society of Plant Biologists OPEN ACCESS ARTICLE
Plant Gene and Alternatively Spliced Variant Annotator. A Plant Genome Annotation Pipeline for Rice Gene and Alternatively Spliced Variant Identification with Cross-Species Expressed Sequence Tag Conservation from Seven Plant Species1,[W],[OA]Division of Biostatistics and Bioinformatics, National Health Research Institute, Miaoli County 350, Taiwan (F.-C.C.); and Genomics Research Center (S.-S.W., Y.-T.H., T.-J.C.) and Research Center for Biodiversity (S.-M.C.), Academia Sinica, Taipei 115, Taiwan
The completion of the rice (Oryza sativa) genome draft has brought unprecedented opportunities for genomic studies of the world's most important food crop. Previous rice gene annotations have relied mainly on ab initio methods, which usually yield a high rate of false-positive predictions and give only limited information regarding alternative splicing in rice genes. Comparative approaches based on expressed sequence tags (ESTs) can compensate for the drawbacks of ab initio methods because they can simultaneously identify experimental data-supported genes and alternatively spliced transcripts. Furthermore, cross-species EST information can be used to not only offset the insufficiency of same-species ESTs but also derive evolutionary implications. In this study, we used ESTs from seven plant species, rice, wheat (Triticum aestivum), maize (Zea mays), barley (Hordeum vulgare), sorghum (Sorghum bicolor), soybean (Glycine max), and Arabidopsis (Arabidopsis thaliana), to annotate the rice genome. We developed a plant genome annotation pipeline, Plant Gene and Alternatively Spliced Variant Annotator (PGAA). Using this approach, we identified 852 genes (931 isoforms) not annotated in other widely used databases (i.e. the Institute for Genomic Research, National Center for Biotechnology Information, and Rice Annotation Project) and found 87% of them supported by both rice and nonrice EST evidence. PGAA also identified more than 44,000 alternatively spliced events, of which approximately 20% are not observed in the other three annotations. These novel annotations represent rich opportunities for rice genome research, because the functions of most of our annotated genes are currently unknown. Also, in the PGAA annotation, the isoforms with non-rice-EST-supported exons are significantly enriched in transporter activity but significantly underrepresented in transcription regulator activity. We have also identified potential lineage-specific and conserved isoforms, which are important markers in evolutionary studies. The data and the Web-based interface, RiceViewer, are available for public access at http://RiceViewer.genomics.sinica.edu.tw/.
Rice (Oryza sativa) is one of the most economically important cereal plants and a model organism for studies of crop plants. The high-quality sequencing of the entire rice genome was completed and publicly released in 2004 (International Rice Genome Sequencing Project, 2005
ESTs are direct evidence of gene expression. With suitable algorithms and well-curated ESTs, the inherent errors in EST information can be effectively reduced in gene/isoform annotations. Therefore, genes and alternative splicing (AS) transcripts can be simultaneously identified with high accuracy by use of experimental evidence (e.g. support from ESTs or microarray data; Zhu et al., 2003
In this study, we developed a plant genome annotation pipeline, Plant Gene and Alternatively Spliced Variant Annotator (PGAA), for gene/AS prediction in the rice genome. PGAA is a comparative method that first identifies AS variants and genes by use of the same-species-EST-to-genome comparison and then curates the results with cross-species EST data conserved in the annotated genome. ESTs from seven plant species, rice, wheat, maize, barley (Hordeum vulgare), sorghum (Sorghum bicolor), soybean (Glycine max), and Arabidopsis, are used. All of the selected species have approximately 40,000 EST entries in the TIGR EST database. The PGAA annotation results are compared with those deposited in the National Center for Biotechnology Information (NCBI), Rice Annotation Project (RAP; First Rice Annotation Project Meeting; Ohyanagi et al., 2006
PGAA involves three consecutive steps: gene identification by use of rice ESTs (metaannotation), transcript patching by use of nonrice ESTs (the patching process), and redundancy removal. The annotation procedure is summarized in Figure 1 .
For metaannotation, the Complexity Reduction Algorithm for Sequence Analysis aligner (Chuang et al., 2003 18 bp (from the Rice Gene Index [OGI] EST and TIGR databases) that exactly match the rice genome for identification of potential gene loci. If the gap between two successive matched EST fragments is not larger than 10 bp, the gap is patched by the corresponding rice genomic sequence. Then, several criteria are used to reduce potential noise. The rice ESTs each with fewer than three rice genome-matching fragments are discarded unless the EST has at least one matching fragment longer than 100 bp or the matching fragments are also matched by a nonrice EST (Chuang et al., 2003After the preliminary screening, the system patches the remaining EST matches with nonrice ESTs that are conserved in the rice genome. For example, as shown in Figure 2A (1), EST 1 (EST 2) is a rice (nonrice) EST with two (three) split fragments, e11 and e12 (e21, e22, and e23), which match the rice genomic sequence. However, e11 and e12 overlap with e21 and e23, respectively, with two possible results in our annotation. If the alignment between EST 1 and the rice genomic sequence has high quality (defined below), two isoforms will be annotated: isoform 1 with two exons (e11 and e12) and isoform 2 with three exons (e11, e22, and e12). Otherwise, only one isoform will be annotated (with three exons: e11, e22, and e12). The newly patched exon (e.g. e22) must be flanked by AG-GT/AG-GC legal splicing sites, not disrupt the reading frame, and contain no premature stop codons. Here, high-quality mapping means that e11 and e12 (which are originally contiguous on EST 1) match the rice genomic sequence in the correct order (i.e. no gap or mismatch exists between e11 and e12 on EST 1). Otherwise, the e11'-e12' match is considered low-quality mapping because of a mismatched EST segment, which thus results in a gap in the EST-to-genome alignment between e11' and e12' (also see Fig. 2A [2]). Note that e22 and e22' are not included in rice ESTs, which indicates that PGAA can identify potential missing exons or novel AS variants with use of nonrice ESTs. Also, all transcripts identified in this study are supported by evidence from expressed sequences (rice or nonrice ESTs).
After the patching process, two criteria are used to reduce potentially redundant isoforms identified. First, if two identified isoforms overlap and the overlapping regions are identical, these two isoforms are assembled and replaced by the newly assembled isoform (Fig. 2B, case 1). Second, if an isoform is identified by a low-quality-mapped EST and is completely included in another isoform, it is discarded (Fig. 2B, case 2). Furthermore, to avoid potentially transposable-element-related isoforms in the PGAA annotation, we filter out repetitive elements by use of RepeatMasker (http://www.repeatmasker.org/) and Rice Transposable Element database (RTEdb; Juretic et al., 2004 We then compare the PGAA annotated results with those from the three well-known rice annotation sources, RAP, TIGR, and NCBI (Build 2.1), and examine the four rice annotations for rice and nonrice ESTs that are conserved in the rice genome. The isoforms identified by nonrice ESTs in our annotation are singled out for analysis.
In PGAA, the ESTs are downloaded from the TIGR gene index project (see "DATA ACCESS"). Table I illustrates the numbers of EST/tentative consensus (TC) sequences of the seven plant species analyzed in the PGAA system and the numbers (percentages) that are mapped to the rice genome. Note that TC sequences are generated by assembling ESTs into virtual transcripts, which may contain full or partial cDNA sequences (see the definition of the TIGR gene index project at http://compbio.dfci.harvard.edu/tgi/definitions.html). A large number of nonrice ESTs can be mapped to the rice genome. Particularly, as high as 32% to approximately 44% of the monocot cereal (i.e. barley, maize, sorghum, and wheat) ESTs and 48% to approximately 56% of the TCs are conserved in the rice genome. Such conserved nonrice ESTs can provide the resources for identification of potentially novel genes/AS variants in the rice genome. In addition, only 6% of the dicot plant (i.e. Arabidopsis and soybean) ESTs/TCs are alignable against the rice genome. This result is consistent with the phylogenetic relationships of the studied plants.
The PGAA system annotates a total of 34,512 genes (56,460 isoforms). The average number of isoforms per PGAA-identified gene is 1.63, and the total length of the annotated exon is 53.94 Mb (also see Table II ). The average number of exons per annotated isoform is 4.1, whereas the average length of annotated exon and intron is 292 and 487 bp, respectively. For AS variant detection, 12,749 genes (36.9%) are annotated to be alternatively spliced, with a total of 34,697 isoforms that include 44,447 AS events (i.e. more than one AS event may occur to one isoform). Events include 10,131 (22.8%) exon skipping (or cassette on/off exon), 18,022 (40.5%) alternative donor/acceptor sites, and 16,294 (36.7%) intron retentions. As well, approximately 20% of the PGAA-annotated AS events are potentially novel, because they are not observed in the other three annotations (TIGR, NCBI, and RAP).
We then estimated the false-positive rate in the PGAA annotation. The emphasis here is that PGAA is an EST-based approach, which may be able to reduce false-positive predictions, thus reducing the searching scope for functional genes of rice. Of course, PGAA might yield false-positive predictions. In our early study on humans (Chuang et al., 2004
Table II shows the differences in rice genome annotation results among TIGR (Release 4), NCBI (Build 2.1), RAP (all RAP loci), and PGAA. TIGR annotates the largest number of genes and splicing isoforms, then PGAA, then NCBI, then RAP. Note that NCBI annotates only a small number of genes for the rice chromosome 11 and not chromosome 12 at all. Meanwhile, the average number of isoforms per annotated gene is the smallest in NCBI annotations but the largest in PGAA annotations. The isoform-to-gene ratio in the PGAA annotation is 14%, 43%, and 57% larger than those of the RAP, TIGR, and NCBI annotations, respectively. Because NCBI-, RAP-, and TIGR-annotated isoforms were identified with the aid of ab initio methods (e.g. FGENESH, Genscan, etc.) or full-length cDNAs, the number of AS variants identified is relatively limited. Of note, the number of TIGR-specific isoforms is high, 14,272. Because a large portion of these genes lack experimental evidence, many may be false-positive predictions (discussed in the next paragraph). However, although some ab initio predictions may be false positive, they provide an important source of potential rice genes for which expression data are still lacking. In some ab initio predictions, a considerable proportion ( 50%) of hypothetical genes were also experimentally verified (Xiao et al., 2005
Meanwhile, PGAA annotates genes with alignments between the rice genome and TIGR gene indices, which contain not only full-length cDNAs but also partial cDNAs and singleton ESTs (Liang et al., 2000
In addition, PGAA annotates 931 isoforms that are not annotated by TIGR, RAP, or NCBI (Table II). Of these 931 isoforms, 808 (87%) are supported by both rice and nonrice EST evidence. Using the Gene Ontology (GO; Gene Ontology Consortium, 2001 Figure 3A shows a Venn diagram to compare TIGR, NCBI, RAP, and PGAA annotation results in terms of gene number. Collectively, the four annotations annotated 71,029 genes, of which 51,763 (72.9%) are annotated by at least two methods. Note that two annotated genes/isoforms are considered the same gene in this analysis if they overlap with each other and share at least one exon. These four annotations give very different results, with only 16,460 (23%) genes in common. The TIGR-specific genes account for 20% of the collective total of annotated genes, whereas the RAP-, NCBI-, and PGAA-specific genes account for 4%, 3%, and 1%, respectively.
Because some TIGR-, NCBI-, and PGAA-annotated genes are not fully supported by rice ESTs, we compared these three annotations in terms of proportion of rice EST coverage. Figure 3B shows that 32.29 Mb (approximately 41%) and 6.37 Mb (approximately 17%) of TIGR- and NCBI-annotated exonic sequences, respectively, do not overlap with any rice ESTs. Despite the limitations in rice EST information, the high proportion of non-rice-EST-overlapping isoforms in the TIGR and NCBI databases is questionable. Figure 3B also shows that only a small fraction (approximately 0.1%, or 58 kb) of PGAA-identified isoforms do not overlap with any rice ESTs (i.e. are supported only by nonrice ESTs). Note that the RAP gene loci are identified on the basis of either full-length rice cDNA matches or ab initio prediction plus rice EST coverage (see the RAP annotation document at http://rapdownload.lab.nig.ac.jp/index.html). Therefore, all of the RAP-identified loci are rice-EST supported and are not illustrated in the figure.
Table III indicates the same-species and cross-species EST conservation in rice genes annotated in TIGR, NCBI, RAP, and PGAA. In terms of length, only 67%, 60%, and 6% of the annotated isoforms overlap with either rice or nonrice ESTs, rice ESTs only, and both rice and nonrice ESTs, respectively. The absence of overlap with nonrice ESTs most likely results from inadequate EST information. However, some of the isoforms might be rice specific. Therefore, the isoforms may have been conserved across species or have different splicing forms in the orthologous genes in different species. Further analyses are required to determine which of the above two explanations is true. However, about 1% of the isoforms overlap with only nonrice ESTs, and 33% overlap with no ESTs at all. The authenticity of these isoforms needs further validation. Note that, by the definitions of RAP and PGAA, all RAP- and PGAA-annotated isoforms are EST supported. Therefore, the annotated isoforms not supported by ESTs must come from either the TIGR or the NCBI annotations. Meanwhile, some of the nonrice ESTs conserved in the rice genome do not overlap with any annotated isoforms or the rice ESTs. These ESTs might represent some gain or loss events in the crop plants after domestication. Because crop plants are known to have undergone whole-genome duplications and large-scale gene loss events (Paterson et al., 2004
Figure 4A illustrates the lengths and numbers of PGAA-annotated rice isoforms that contain exonic regions supported by nonrice but not rice ESTs. The total length of those non-rice-EST-supported exonic regions is approximately 58 kb, of which 36.5 kb is annotated by PGAA only, 4.4 kb is also annotated by NCBI but not by TIGR, 13.6 kb by TIGR but not by NCBI, and 3.8 kb by both TIGR and NCBI. Notably, the 36.5 kb accounts for 2,657 isoforms, which contain at least one PGAA-unique exonic region. Because at least one of the exons (partial or complete) in each annotated gene locus is supported by nonrice EST(s), these isoforms are likely evolutionarily conserved. Moreover, such identified exons may also represent novel AS events. Also note that RAP annotations are not considered in Figure 4, because all of the RAP-annotated loci are supported by rice ESTs.
PGAA differs from other EST-based annotation tools in that it uses cross-species EST information to compensate for the insufficiency and correct the errors of the use of same-species ESTs. These rescued genes/isoforms may have important functions in terms of evolutionary conservation. We performed a GO-based functional analysis (Gene Ontology Consortium, 2001
All data generated in this study are accessed through a Web-based interactive interface, RiceViewer (http://RiceViewer.genomics.sinica.edu.tw/). The RiceViewer presents the structure and AS variants of rice genes on the basis of four annotation sources: NCBI, TIGR, RAP, and PGAA. For each annotated gene, EST matches from the seven studied species are also provided. The interface supports three types of queries. It can accept rice mRNA or protein accession numbers, rice genomic coordinates, and nucleotide sequences for BLAST searching against the rice genome (Fig. 5A ). In the first type of query, the coordinates of the queried gene are shown in the query results, whereas in the second type, the information of all annotated genes located within the specified region is displayed in small-to-large genomic coordinate order (Fig. 5B). By clicking on one of the gene regions, the gene structures and AS variants of the selected gene region are displayed according to TIGR, NCBI, RAP, and PGAA annotations (Fig. 5C). Note that if an annotation identifies no transcripts within the selected region, then the corresponding slot will be empty. Moreover, in the PGAA annotation, the accession identifier is highlighted if it also belongs to a clone of the Knowledge-based Oryza Molecular biological Encyclopedia (KOME; Fig. 5C). By clicking on the PGAA ID, the related descriptions of the isoforms (including KOME clone ID, nucleotide sequence, and GO annotation) are shown in a pop-up window (Fig. 5D). Users can click on the visit KOME button to link to the corresponding KOME report. Also note that the colors of exons in Figure 5C indicate the direction and level of EST coverage for the exons presented. Light gray represents exons that do not have EST evidence from rice or the six other species, likely predicted exons with no current EST evidence. Green and red indicate exons that are encoded in direct and complement strands, respectively. Each color is further divided into four shade levels according to EST coverage levels: dark green for exons with EST evidence from both rice and at least one of the six other species, light green for those with only rice EST evidence, and medium green for those with only EST evidence from at least one of the six other species but not from rice. The color-coding scheme thus enables users to distinguish exons with different directions and EST coverage levels at a glance. We also provide a comparison of the four annotations (see Fig. 5E). The interface also shows the sizes and proportions of ESTs from the seven species that overlap with the four annotations in the user-specified gene region. Moreover, the lengths of ESTs not overlapping with any one of the four annotations are also illustrated (Fig. 5E). These ESTs are likely noise and hence are filtered out in the PGAA annotation process.
In addition, the coordinates of each exon can be shown by pointing the cursor to the exon of interest. By double clicking on the exon of interest, the nucleotide sequence and supporting ESTs for the selected exon are displayed in a pop-up window (Fig. 5F). Users can click on the accession number shown in Figure 5C to link to the NCBI Entrez Gene database for more information about the gene. The genomic coordinates of the gene regions for which users have browsed the corresponding EST conservation are shown on the right side in the Collection column in Figure 5G for users to track their analysis. In the third type of query, the submitted nucleotide sequences are BLASTN aligned against the rice genome, and the alignment outputs are shown in a pop-up window. The interface simultaneously retrieves the coordinates of the three best hits from each of the three top-score rice chromosomes (if applicable) in the BLAST output file and displays genes located within these coordinates, as described above. Again, all the genes identified by the four annotations within the specified region are shown unless no annotated gene is available. Given such a condition, the gene display region (Fig. 5B) will be blank.
The RAP, TIGR, and NCBI annotations of the rice genome were downloaded from http://rapdownload.lab.nig.ac.jp (RAP1, based on the International Rice Genome Sequencing Project genome sequence Build 3), http://rice.tigr.org/tdb/e2k1/osa1/data_download.shtml (release 3.0), and http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=genomeprj&cmd=Retrieve&dopt=Overview&list_uids=122 (Build 2.1), respectively. RTEdb (Juretic et al., 2004
In this study, we designed a cross-species EST-based pipeline for rice genome annotations and provided a Web-based interface for comparative studies. We found remarkable differences in results from current annotations and identified a large number of potentially novel genes and AS isoforms in rice with our system. Many of the isoforms are supported by nonrice ESTs, so they are interesting targets for future functional and evolutionary studies. As the numbers of crop plant ESTs increase, our system can help with detailed investigation of the AS isoforms of rice and AS evolution in the grass family. Sequence data from this article can be found in the GenBank/EMBL/DDBJ data libraries under accession number AAAA00000000 or AACV00000000.
The following materials are available in the online version of this article.
We especially thank Dr. Nikoleta Juretic for providing RTEdb. In addition, we extend our deep gratitude to all the administrators of the cited sequencing centers and scientists who made their sequence data and annotation results available to the public. Without their efforts, this study would not be possible. We also gratefully acknowledge the critical and valuable criticism of the two anonymous reviewers. Received November 1, 2006; accepted January 4, 2007; published January 12, 2007.
1 This work was supported by the Genomics Research Center, Academia Sinica, Taiwan, by the National Health Research Institutes, Taiwan (contract no. NHRIEX959408PC to T.-J.C., National Health Research Institutes intramural funding to F.-C.C.), and by the Research Center for Biodiversity, Academia Sinica (to S.-M.C.). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Trees-Juen Chuang (trees{at}gate.sinica.edu.tw).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.106.092460 * Corresponding author; e-mail trees{at}gate.sinica.edu.tw; fax 886227898757.
Bonizzoni P, Rizzi R, Pesole G (2005) ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences. BMC Bioinformatics 6: 244[CrossRef][Medline] Bruskiewich RM, Cosico AB, Eusebio W, Portugal AM, Ramos LM, Reyes MT, Sallan MA, Ulat VJ, Wang X, McNally KL, et al (2003) Linking genotype to phenotype: the International Rice Information System (IRIS). Bioinformatics (Suppl 1) 19: i6365[Abstract] Chen FC, Chen CJ, Ho JY, Chuang TJ (2006) Identification and evolutionary analysis of novel exons and alternative splicing events using cross-species EST-to-genome comparisons in human, mouse and rat. BMC Bioinformatics 7: 136[CrossRef][Medline] Chen FC, Wang SS, Chen CJ, Li WH, Chuang TJ (2005) Alternatively and constitutively spliced exons are subject to different evolutionary forces. Mol Biol Evol 23: 675682[CrossRef][Web of Science][Medline] Chuang TJ, Chen FC, Chou MY (2004) A comparative method for identification of gene structures and alternatively spliced variants. Bioinformatics 20: 30643079 Chuang TJ, Lin WC, Lee HC, Wang CW, Hsiao KL, Wang ZH, Shieh D, Lin SC, Ch'ang LY (2003) A complexity reduction algorithm for analysis and annotation of large genomic sequences. Genome Res 13: 313322 Foissac S, Schiex T (2005) Integrating alternative splicing detection into gene prediction. BMC Bioinformatics 6: 25[CrossRef][Medline] Gene Ontology Consortium (2001) Creating the gene ontology resource: design and implementation. Genome Res 11: 14251433 Iida K, Seki M, Sakurai T, Satou M, Akiyama K, Toyoda T, Konagaya A, Shinozaki K (2004) Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences. Nucleic Acids Res 32: 50965103 International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome. Nature 436: 793800[CrossRef][Medline] Ito Y, Arikawa K, Antonio BA, Ohta I, Naito S, Mukai Y, Shimano A, Masukawa M, Shibata M, Yamamoto M, et al (2005) Rice Annotation Database (RAD): a contig-oriented database for map-based rice genomics. Nucleic Acids Res 33: D651655 Jaiswal P, Ni J, Yap I, Ware D, Spooner W, Youens-Clark K, Ren L, Liang C, Zhao W, Ratnapu K, et al (2006) Gramene: a bird's eye view of cereal genomes. Nucleic Acids Res 34: D717723 Juretic N, Bureau TE, Bruskiewich RM (2004) Transposable element annotation of the rice genome. Bioinformatics 20: 155160 Kan Z, Castle J, Johnson JM, Tsinoremas NF (2004) Detection of novel splice forms in human and mouse using cross-species approach. Pac Symp Biocomput 9: 4253 Karlowski WM, Schoof H, Janakiraman V, Stuempflen V, Mayer KF (2003) MOsDB: an integrated information resource for rice genomics. Nucleic Acids Res 31: 190192 Kikuchi S, Satoh K, Nagata T, Kawagashira N, Doi K, Kishimoto N, Yazaki J, Ishikawa M, Yamada H, Ooka H, et al (2003) Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science 301: 376379 Liang F, Holt I, Pertea G, Karamycheva S, Salzberg SL, Quackenbush J (2000) An optimized protocol for analysis of EST sequences. Nucleic Acids Res 28: 36573665 Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bradley P, Bork P, Bucher P, Cerutti L, et al (2005) InterPro, progress and status in 2005. Nucleic Acids Res 33: D201205 Ner-Gaon H, Fluhr R (2006) Whole-genome microarray in Arabidopsis facilitates global analysis of retained introns. DNA Res 13: 111121 Ner-Gaon H, Halachmi R, Savaldi-Goldstein S, Rubin E, Ophir R, Fluhr R (2004) Intron retention is a major phenomenon in alternative splicing in Arabidopsis. Plant J 39: 877885[CrossRef][Web of Science][Medline] Ohyanagi H, Tanaka T, Sakai H, Shigemoto Y, Yamaguchi K, Habara T, Fujii Y, Antonio BA, Nagamura Y, Imanishi T, et al (2006) The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa ssp. japonica genome information. Nucleic Acids Res 34: D741744 Ouyang S, Buell CR (2004) The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants. Nucleic Acids Res 32: D360363 Paterson AH, Bowers JE, Chapman BA (2004) Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci USA 101: 99039908 Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005) InterProScan: protein domains identifier. Nucleic Acids Res 33: W116120 Wang BB, Brendel V (2006) Genomewide comparative analysis of alternative splicing in plants. Proc Natl Acad Sci USA 103: 71757180 Xiao YL, Smith SR, Ishmael N, Redman JC, Kumar N, Monaghan EL, Ayele M, Haas BJ, Wu HC, Town CD (2005) Analysis of the cDNAs of hypothetical genes on Arabidopsis chromosome 2 reveals numerous transcript variants. Plant Physiol 139: 13231337 Yuan Q, Ouyang S, Liu J, Suh B, Cheung F, Sultana R, Lee D, Quackenbush J, Buell CR (2003) The TIGR rice genome annotation resource: annotating the rice genome and creating resources for plant biologists. Nucleic Acids Res 31: 229233 Yuan Q, Ouyang S, Wang A, Zhu W, Maiti R, Lin H, Hamilton J, Haas B, Sultana R, Cheung F, et al (2005) The institute for genomic research osa1 rice genome annotation database. Plant Physiol 138: 1826 Zhu W, Schlueter SD, Brendel V (2003) Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping. Plant Physiol 132: 469484 This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|