Plant Physiol.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Plant Physiology 137:31-42 (2005)
© 2005 American Society of Plant Biologists

This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Web of Science (14)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Vandepoele, K.
Right arrow Articles by Van de Peer, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vandepoele, K.
Right arrow Articles by Van de Peer, Y.
Agricola
Right arrow Articles by Vandepoele, K.
Right arrow Articles by Van de Peer, Y.
BIOINFORMATICS

Exploring the Plant Transcriptome through Phylogenetic Profiling1,[w]

Klaas Vandepoele and Yves Van de Peer*

Department of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology (VIB), Ghent University, B–9052 Ghent, Belgium

Publicly available protein sequences represent only a small fraction of the full catalog of genes encoded by the genomes of different plants, such as green algae, mosses, gymnosperms, and angiosperms. By contrast, an enormous amount of expressed sequence tags (ESTs) exists for a wide variety of plant species, representing a substantial part of all transcribed plant genes. Integrating protein and EST sequences in comparative and evolutionary analyses is not straightforward because of the heterogeneous nature of both types of sequence data. By combining information from publicly available EST and protein sequences for 32 different plant species, we identified more than 250,000 plant proteins organized in more than 12,000 gene families. Approximately 60% of the proteins are absent from current sequence databases but provide important new information about plant gene families. Analysis of the distribution of gene families over different plant species through phylogenetic profiling reveals interesting insights into plant gene evolution, and identifies species- and lineage-specific gene families, orphan genes, and conserved core genes across the green plant lineage. We counted a similar number of approximately 9,500 gene families in monocotyledonous and eudicotyledonous plants and found strong evidence for the existence of at least 33,700 genes in rice (Oryza sativa). Interestingly, the larger number of genes in rice compared to Arabidopsis (Arabidopsis thaliana) can partially be explained by a larger amount of species-specific single-copy genes and species-specific gene families. In addition, a majority of large gene families, typically containing more than 50 genes, are bigger in rice than Arabidopsis, whereas the opposite seems true for small gene families.


1 This work was supported by the Instituut voor de aanmoediging van Innovatie door Wetenschap en Technologie in Vlaanderen (predoctoral fellowship to K.V.).

[w] The online version of this article contains Web-only data.

www.plantphysiol.org/cgi/doi/10.1104/pp.104.054700.

* Corresponding author; e-mail yves.vandepeer{at}psb.ugent.be; fax 32–9–33–13809.

Received October 11, 2004; returned for revision November 10, 2004; accepted November 10, 2004.




This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
C. Martens, K. Vandepoele, and Y. Van de Peer
Whole-genome analysis reveals molecular innovations and evolutionary transitions in chromalveolate species
PNAS, March 4, 2008; 105(9): 3427 - 3432.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
A. Rohde, T. Ruttink, V. Hostyn, L. Sterck, K. Van Driessche, and W. Boerjan
Gene expression during the induction, maintenance, and release of dormancy in apical buds of poplar
J. Exp. Bot., December 1, 2007; 58(15-16): 4047 - 4060.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. A. Campbell, W. Zhu, N. Jiang, H. Lin, S. Ouyang, K. L. Childs, B. J. Haas, J. P. Hamilton, and C. R. Buell
Identification and Characterization of Lineage-Specific Genes within the Poaceae
Plant Physiology, December 1, 2007; 145(4): 1311 - 1322.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ASPB Publications PLANT PHYSIOLOGY® THE PLANT CELL
Copyright © 2005 by the American Society of Plant Biologists