Plant Physiology 138:18-26 (2005)
© 2005 American Society of Plant Biologists
BIOINFORMATICS-PLANT DATABASES
The Institute for Genomic Research Osa1 Rice Genome Annotation Database1
Qiaoping Yuan2,
Shu Ouyang,
Aihui Wang,
Wei Zhu,
Rama Maiti,
Haining Lin,
John Hamilton,
Brian Haas,
Razvan Sultana,
Foo Cheung,
Jennifer Wortman and
C. Robin Buell*
The Institute for Genomic Research, Rockville, Maryland 20850
We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 nontransposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org.
1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no. DBI0321538 to C.R.B.) and the U.S. Department of Agriculture (grant no. 20033531713173 to C.R.B.).
2 Present address: Laboratory of Neurogenetics, NIAAA, NIH, 5625 Fishers Lane, Suite 3532, MSC 9412, Bethesda, MD 20892.
www.plantphysiol.org/cgi/doi/10.1104/pp.104.059063.
* Corresponding author; e-mail rbuell{at}tigr.org; fax 3018380208.
Received December 31, 2004;
returned for revision February 24, 2005;
accepted March 21, 2005.
This article has been cited by other articles:

|
 |

|
 |
 
N. O'Toole, M. Hattori, C. Andres, K. Iida, C. Lurin, C. Schmitz-Linneweber, M. Sugita, and I. Small
On the Expansion of the Pentatricopeptide Repeat Gene Family in Plants
Mol. Biol. Evol.,
June 1, 2008;
25(6):
1120 - 1128.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Qiu, J. Xiao, W. Xie, H. Liu, X. Li, L. Xiong, and S. Wang
Rice Gene Network Inferred from Expression Profiling of Plants Overexpressing OsWRKY13, a Positive Regulator of Disease Resistance
Mol Plant,
May 1, 2008;
1(3):
538 - 551.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Gao, Y. Hou, H. Ebina, H. L. Levin, and D. F. Voytas
Chromodomains direct integration of retrotransposons to heterochromatin
Genome Res.,
March 1, 2008;
18(3):
359 - 369.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. A. Campbell, W. Zhu, N. Jiang, H. Lin, S. Ouyang, K. L. Childs, B. J. Haas, J. P. Hamilton, and C. R. Buell
Identification and Characterization of Lineage-Specific Genes within the Poaceae
Plant Physiology,
December 1, 2007;
145(4):
1311 - 1322.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. S. Merchant, S. E. Prochnik, O. Vallon, E. H. Harris, S. J. Karpowicz, G. B. Witman, A. Terry, A. Salamov, L. K. Fritz-Laylin, L. Marechal-Drouard, et al.
The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions
Science,
October 12, 2007;
318(5848):
245 - 250.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. R. Bush and J. E. Leach
Translational Genomics for Bioenergy Production: There's Room for More Than One Model
PLANT CELL,
October 1, 2007;
19(10):
2971 - 2973.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. Zhu and C. R. Buell
Improvement of whole-genome annotation of cereals through comparative analyses
Genome Res.,
March 1, 2007;
17(3):
299 - 310.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
F.-C. Chen, S.-S. Wang, S.-M. Chaw, Y.-T. Huang, and T.-J. Chuang
Plant Gene and Alternatively Spliced Variant Annotator. A Plant Genome Annotation Pipeline for Rice Gene and Alternatively Spliced Variant Identification with Cross-Species Expressed Sequence Tag Conservation from Seven Plant Species
Plant Physiology,
March 1, 2007;
143(3):
1086 - 1095.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Spannagl, O. Noubibou, D. Haase, L. Yang, H. Gundlach, T. Hindemitt, K. Klee, G. Haberer, H. Schoof, and K. F. X. Mayer
MIPSPlantsDB--plant database resource for integrative and comparative plant genome research
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D834 - D840.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Chaparro, R. Guyot, A. Zuccolo, B. Piegu, and O. Panaud
RetrOryza: a database of the rice LTR-retrotransposons
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D66 - D70.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Ouyang, W. Zhu, J. Hamilton, H. Lin, M. Campbell, K. Childs, F. Thibaud-Nissen, R. L. Malek, Y. Lee, L. Zheng, et al.
The TIGR Rice Genome Annotation Resource: improvements and new features
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D883 - D887.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Johnson, L. Bowman, A. T. Adai, V. Vance, and V. Sundaresan
CSRDB: a small RNA integrated database and browser resource for cereals
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D829 - D833.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Yokoyama, T. Yamashino, Y.-I. Amano, Y. Tajima, A. Imamura, H. Sakakibara, and T. Mizuno
Type-B ARR Transcription Factors, ARR10 and ARR12, are Implicated in Cytokinin-Mediated Regulation of Protoxylem Differentiation in Roots of Arabidopsis thaliana
Plant Cell Physiol.,
January 1, 2007;
48(1):
84 - 96.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. A. Jackson, D. Rokhsar, G. Stacey, R. C. Shoemaker, J. Schmutz, and J. Grimwood
Toward a Reference Sequence of the Soybean Genome: A Multiagency Effort
Crop Sci.,
November 1, 2006;
46(Supplement_1):
S-55 - S-61.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Pareek, A. Singh, M. Kumar, H. R. Kushwaha, A. M. Lynn, and S. L. Singla-Pareek
Whole-Genome Analysis of Oryza sativa Reveals Similar Architecture of Two-Component Signaling Machinery with Arabidopsis
Plant Physiology,
October 1, 2006;
142(2):
380 - 397.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Yan, H. Ito, K. Nobuta, S. Ouyang, W. Jin, S. Tian, C. Lu, R.C. Venu, G.-l. Wang, P. J. Green, et al.
Genomic and Genetic Characterization of Rice Cen3 Reveals Extensive Transcription and Evolutionary Implications of a Complex Centromere
PLANT CELL,
September 1, 2006;
18(9):
2123 - 2133.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Xie, C. Wu, and L. Xiong
Genomic Organization, Differential Expression, and Interaction of SQUAMOSA Promoter-Binding-Like Transcription Factors and microRNA156 in Rice
Plant Physiology,
September 1, 2006;
142(1):
280 - 293.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B.-B. Wang and V. Brendel
Genomewide comparative analysis of alternative splicing in plants
PNAS,
May 2, 2006;
103(18):
7175 - 7180.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
U. Radetzki, U. Leser, S. C. Schulze-Rauschenbach, J. Zimmermann, J. Lussem, T. Bode, and A. B. Cremers
Adapters, shims, and glue--service interoperability for in silico experiments
Bioinformatics,
May 1, 2006;
22(9):
1137 - 1143.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. L. McNally, R. Bruskiewich, D. Mackill, C. R. Buell, J. E. Leach, and H. Leung
Sequencing multiple and diverse rice varieties. Connecting whole-genome variation with phenotypes.
Plant Physiology,
May 1, 2006;
141(1):
26 - 31.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Jaiswal, J. Ni, I. Yap, D. Ware, W. Spooner, K. Youens-Clark, L. Ren, C. Liang, W. Zhao, K. Ratnapu, et al.
Gramene: a bird's eye view of cereal genomes
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D717 - D723.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Nakano, K. Nobuta, K. Vemaraju, S. S. Tej, J. W. Skogen, and B. C. Meyers
Plant MPSS databases: signature-based transcriptional resources for analyses of mRNA and small RNA
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D731 - D735.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Ohyanagi, T. Tanaka, H. Sakai, Y. Shigemoto, K. Yamaguchi, T. Habara, Y. Fujii, B. A. Antonio, Y. Nagamura, T. Imanishi, et al.
The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa ssp. japonica genome information
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D741 - D744.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. A. Hayden, T. J. Wheeler, and R. A. Jorgensen
Evaluating and improving cDNA sequence quality with cQC
Bioinformatics,
December 15, 2005;
21(24):
4414 - 4415.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. K. Zolman, M. Monroe-Augustus, I. D. Silva, and B. Bartel
Identification and Functional Characterization of Arabidopsis PEROXIN4 and the Interacting Protein PEROXIN22
PLANT CELL,
December 1, 2005;
17(12):
3422 - 3435.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Zhang, C. Chen, L. Li, L. Meng, J. Singh, N. Jiang, X.-W. Deng, Z.-H. He, and P. G. Lemaux
Evolutionary Expansion, Gene Structure, and Expression of the Rice Wall-Associated Kinase Gene Family
Plant Physiology,
November 1, 2005;
139(3):
1107 - 1124.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Vandepoele, K. Vlieghe, K. Florquin, L. Hennig, G. T.S. Beemster, W. Gruissem, Y. Van de Peer, D. Inze, and L. De Veylder
Genome-Wide Identification of Potential Plant E2F Target Genes
Plant Physiology,
September 1, 2005;
139(1):
316 - 328.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Y. Rhee and B. Crosby
Biological Databases for Plant Research
Plant Physiology,
May 1, 2005;
138(1):
1 - 3.
[Full Text]
[PDF]
|
 |
|
|
|