|
|
||||||||
|
First published online November 12, 2008; 10.1104/pp.108.128926 Plant Physiology 149:111-116 (2009) © 2009 American Society of Plant Biologists
Poaceae Genomes: Going from Unattainable to Becoming a Model Clade for Comparative Plant GenomicsDepartment of Plant Biology, Michigan State University, East Lansing, Michigan 48824
Genomics has an immense potential for improving our understanding of critical issues in plant growth and development, some of which can be applied to improvement of crop production. Midway into the second decade of genomics, genome and transcriptome sequencing efforts with the Poaceae are impressive given the technical and fiscal challenges presented by the typically large, repetitive genomes found within the Poaceae (Smith and Flavell, 1975
The first genome-scale sequences generated for the Poaceae were ESTs that represent the transcribed portion of a genome and provide a rapid, economic approach to sampling the gene space of an organism. As early access sequence datasets, ESTs can be used for (1) development of genetic markers (for example, see Harushima et al., 1998
With continued advancements in technology and concomitant reductions in costs over the last decade, whole-genome sequences have been generated for multiple species within the Poaceae (Table I). Rice was not only the first crop species but also the second plant species with a genome sequence (Barry, 2001
In addition to transcriptome and whole-genome sequences, large sets of genomic sequences are available within the GSS, HTG, WGS, and PLN divisions of GenBank. Within the GSS division, which includes gene enrichment as well as BAC end sequences, 5,072,454 Poaceae sequences (3,337 Mb) are available. Although the maize and sorghum gene enrichment sequences within the GSS division have now been superceded by draft genome sequences, the gene enrichment approaches of methylation filtration and high Cot were highly successful in generating genic sequences for maize and sorghum, thereby providing early access to the gene space (Palmer et al., 2003
It should be noted that the majority of the sequence available currently for the Poaceae are derived from a few species of high agricultural importance (rice, maize, wheat, and sorghum). As shown in Figure 2
, although 13 of the 47 species with genome-scale datasets, resources, or initiatives listed in Table I have >100 Mb of total sequence in GenBank, three-quarters of the sequence are from maize or Oryza species reflective of the heavy bias in Poaceae genome sequencing projects to date. However, with access to the next generation of genome sequencing technologies (Margulies et al., 2005
Certainly, this is an exciting time to be engaged in Poaceae research as even if genomics is not your research discipline, access to not just one but multiple Poaceae genome sequences provides not only a robust set of resources for biological inquiries, but also provides a perspective of gene function in a phylogenetic context. With this deluge of genomic sequence data, the storage, handling, analysis, and use of the large-scale genomic sequence and annotation data becomes problematic for most researchers. Consequently, resources, databases, and analyses tools need to be developed to ensure these genome datasets can be used in a feasible and intelligent manner, thereby maximizing the return on the investment of obtaining the genome sequence. Certainly, Poaceae researchers are not alone in forging a path through the morass of genome sequence data in the early 21st century, and the tools, resources, software, and knowledge gained from other genomic research endeavors throughout the Tree of Life will be instrumental in obtaining a full understanding of the pan-Poaceae genome.
Efforts in phylogenetic tree construction by E. Kellogg are greatly appreciated. Work on rice genomics was supported by the National Science Foundation (grant nos. DBI–0321538 and DBI–0834043 to C.R.B.). Received September 1, 2008; accepted November 5, 2008; published November 12, 2008.
The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: C. Robin Buell (buell{at}msu.edu). www.plantphysiol.org/cgi/doi/10.1104/pp.108.128926
Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, et al (2007) Direct selection of human genomic loci by microarray hybridization. Nat Methods 4: 903–905[CrossRef][Web of Science][Medline] Arumuganathan K, Earle E (1991) Nuclear DNA content of some important plant species. Plant Mol Biol Rep 9: 208–218[CrossRef] Barry GF (2001) The use of the Monsanto draft rice genome sequence in research. Plant Physiol 125: 1164–1165 Bedell JA, Budiman MA, Nunberg A, Citek RW, Robbins D, Jones J, Flick E, Rholfing T, Fries J, Bradford K, et al (2005) Sorghum genome sequencing by methylation filtration. PLoS Biol 3: e13[CrossRef][Medline] Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ, Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G, et al (2008) Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods 5: 613–619[CrossRef][Web of Science][Medline] Ewing RM, Ben Kahla A, Poirot O, Lopez F, Audic S, Claverie JM (1999) Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression. Genome Res 9: 950–959 Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296: 92–100 Haas BJ, Delcher AL, Mount SM, Wortman JR, Smith RK Jr, Hannick LI, Maiti R, Ronning CM, Rusch DB, Town CD, et al (2003) Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res 31: 5654–5666 Harushima Y, Yano M, Shomura A, Sato M, Shimano T, Kuboki Y, Yamamoto T, Lin SY, Antonio BA, Parco A, et al (1998) A high-density rice genetic linkage map with 2275 markers using a single F2 population. Genetics 148: 479–494 Hodges E, Xuan Z, Balija V, Kramer M, Molla MN, Smith SW, Middle CM, Rodesch MJ, Albert TJ, Hannon GJ, et al (2007) Genome-wide in situ exon capture for selective resequencing. Nat Genet 39: 1522–1527[CrossRef][Web of Science][Medline] Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, Goodhead I, Rance R, Baker S, Maskell DJ, Wain J, et al (2008) High-throughput sequencing provides insights into genome variation and evolution in Salmonella typhi. Nat Genet 40: 987–993[CrossRef][Web of Science][Medline] Kim H, Hurwitz B, Yu Y, Collura K, Gill N, SanMiguel P, Mullikin JC, Maher C, Nelson W, Wissotski M, et al (2008) Construction, alignment and analysis of twelve framework physical maps that represent the ten genome types of the genus Oryza. Genome Biol 9: R45[CrossRef][Medline] Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, et al (2005) Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380[Medline] McNally KL, Bruskiewich R, Mackill D, Buell CR, Leach JE, Leung H (2006) Sequencing multiple and diverse rice varieties: connecting whole-genome variation with phenotypes. Plant Physiol 141: 26–31 Okou DT, Steinberg KM, Middle C, Cutler DJ, Albert TJ, Zwick ME (2007) Microarray-based genomic selection for high-throughput resequencing. Nat Methods 4: 907–909[CrossRef][Web of Science][Medline] Palmer LE, Rabinowicz PD, O'Shaughnessy AL, Balija VS, Nascimento LU, Dike S, de la Bastide M, Martienssen RA, McCombie WR (2003) Maize genome sequencing by methylation filtration. Science 302: 2115–2117 Rosenkranz R, Borodina T, Lehrach H, Himmelbauer H (2008) Characterizing the mouse ES cell transcriptome with Illumina sequencing. Genomics 92: 187–194[CrossRef][Web of Science][Medline] SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, Melake-Berhan A, Springer PS, Edwards KJ, Lee M, Avramova Z, et al (1996) Nested retrotransposons in the intergenic regions of the maize genome. Science 274: 765–768 Sarin S, Prabhu S, O'Meara MM, Pe'er I, Hobert O (2008) Caenorhabditis elegans mutant allele identification by whole-genome sequencing. Nat Methods 5: 865–867 Smith DB, Flavell RB (1975) Characterisation of the wheat genome by renaturation kinetics. Chromosoma 50: 223–242[Web of Science] The International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome. Nature 436: 793–800[CrossRef][Medline] Whitelaw CA, Barbazuk WB, Pertea G, Chan AP, Cheung F, Lee Y, Zheng L, van Heeringen S, Karamycheva S, Bennetzen JL, et al (2003) Enrichment of gene-coding sequences in maize by genome filtration. Science 302: 2118–2120 Yamamoto K, Sasaki T (1997) Large-scale EST sequencing in rice. Plant Mol Biol 35: 135–144[CrossRef][Web of Science][Medline] Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, et al (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296: 79–92 Yu J, Wang J, Lin W, Li S, Li H, Zhou J, Ni P, Dong W, Hu S, Zeng C, et al (2005) The genomes of Oryza sativa: a history of duplications. PLoS Biol 3: e38[CrossRef][Medline] This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|