|
|
||||||||
|
Plant Physiology 134:951-959 (2004) © 2004 American Society of Plant Biologists Evaluation of Monocot and Eudicot Divergence Using the Sugarcane Transcriptome1,[w]Centro de Biologia Molecular e Engenharia Genética (M.V., F.R.d.S., A.N.C., L.G., A.L.V., E.L.K., P.A.) and Laboratório de Bioinformática, Instituto de Computação (F.A.A.C., V.K.O., G.L.P.), Universidade de Campinas, Caixa Postal 6010, 13083-970, Campinas SP, Brazil; Departamento de Bioquímica, Centro de Ciências da Saúde, Universidade Federal do Rio de Janeiro, Rio de Janeiro RJ, Brazil (A.S.H., P.C.F.); Departamento de Biotecnologia de Plantas Medicinais, Universidade de Ribeirão Preto, 14096-380, Ribeirão Preto SP, Brazil (M.M., S.C.F.); Centro de Tecnologia Copersucar, Copersucar Caixa Postal 162, 13400-970, Piracicaba SP, Brazil (W.L.B.); Centro APTA de Citros Sylvio Moreira, Instituto Agronômico de Campinas, Caixa Postal 04, 13490-970, Cordeirópolis SP, Brazil (M.L.P.T.); Centro de Genética, Biologia Molecular e Fitoquímica, Instituto Agronômico de Campinas, Caixa Postal 28, 13001-970, Campinas SP, Brazil (W.J.S.); Departamento de Defesa Fitossanitária, Faculdade de Ciências Agronômicas, Universidade Estadual Paulista, Caixa Postal 237, 18603-970, Botucatu SP, Brazil (E.E.K.); Departamento de Genética, Instituto de Biociências, Universidade Estadual Paulista, 18618-000, Botucatu SP, Brazil (C.L.M.); Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, 13418-900, Piracicaba SP, Brazil (L.E.A.C., H.C., L.L.C.); Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal, Universidade Estadual Paulista, 14884-900, Jaboticabal SP, Brazil (L.R.F.); Departamento de Biologia Aplicada à Agropecuária, Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal, Universidade Estadual Paulista, 14884-900, Jaboticabal SP, Brazil (M.V.F.L.); Núcleo Integrado de Biotecnologia, Universidade de Mogi das Cruzes, 08780-911, Mogi das Cruzes SP, Brazil (L.R.N.); Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, 05508-900, São Paulo, SP, Brazil (S.L.G., R.V.S.); Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, 14040-901, Ribeirão Preto SP, Brazil (M.H.G.); Centro de Estudos de Insetos Sociais, Instituto de Biociências, Universidade Estadual Paulista, 13506-900, Rio Claro SP, Brazil (M.B.); Departamento de Biotecnologia Vegetal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Caixa Postal 153, 13600-970, Araras SP, Brazil (E.A.G.); Instituto de Física de São Carlos, Universidade de São Paulo, 13560-970, São Carlos SP, Brazil (O.H.T.); Departamento de Genética e Evolução, Universidade Federal de São Carlos, 13565-905 São Carlos SP, Brazil (F.H.S.); Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, 05508-900, São Paulo SP, Brazil (M.-A.V.S.); Laboratório de Genética Molecular e Genomas, Instituto de Pesquisa e Desenvolvimento, Universidade do Vale do Paraíba, 12244-000, São José dos Campos SP, Brazil (F.G.N.); and Department of Microbiology, Institute of Biomedical Sciences, Universidade de São Paulo, 1374 Avenida Professor Lineu Prestes 1374, 05508-900, São Paulo SP, Brazil (C.F.M.M.)
Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.
Flowering plants (angiosperms) originated approximately 200 million years ago (MYA; Wolfe et al., 1989
Comparative genomics provides a starting point for understanding the genetic basis of the biological diversity among plant species. In this regard, the genome sequences of two model plants, the eudicotyledon Arabidopsis (family Brassicaceae; Arabidopsis Genome Initiative, 2000
One approach to obtain information about genome diversity among angiosperms is through comparative analysis of the available angiosperm sequences, which include the Arabidopsis and rice genomes and the large amount of expressed sequence tags (ESTs) that have been produced from several monocots and eudicots. Within this framework, we have generated approximately 240,000 ESTs from sugarcane, an economically important monocot crop belonging to the grass family (Grivet and Arruda, 2002
A total of 237,954 sugarcane ESTs were assembled into 42,982 SASs, which were estimated to represent over 30,000 unique genes (Vettore et al., 2003
The first step of the analysis was to estimate the set of SASs that was similar to sequences of the model eudicot organism Arabidopsis. This step revealed that 70.5% of the SASs matched the Arabidopsis sequences (denoted "Arabidopsis matches" in Table I and Fig. 1). Interestingly, 99.1% of the SASs belonging to the Arabidopsis matches class also had a significant match with the rice genome (Table I), so that these SASs may define a core set of conserved eudicot-monocot sequences. The remaining 29.5% of the SASs were further compared with the genome sequence of rice and with ESTs from other eudicots and monocots (Fig. 1). Approximately 2.0% of the SASs matched sequences from other eudicots ("other eudicots" in Table I and Fig. 1) and may represent genes that have been lost in the Arabidopsis genome. Fourteen percent of the SASs matched rice or other monocots, but not eudicot sequences, and, thus, were assumed to be monocot-specific genes ("monocot class" in Table I and Fig. 1). The remaining SASs (13.5%, "no matches" in Table I and Fig. 1) were only found in the sugarcane transcriptome (see also Supplemental Table I).
Of the 2% of SASs not present in Arabidopsis but with significant similarity with ESTs from other eudicots ("other eudicots" in Table I and Figs. 1 and 2A), 90% matched ESTs from monocots. The simplest interpretation for this is that the genes corresponding to these conserved angiosperm sequences were lost in Arabidopsis. To evaluate this possibility, the set of SASs in the "other eudicots" group that had a significant similarity (E values lower than e-10) with proteins in GenBank was investigated further. We identified 16 SASs, which could represent 13 gene loss events in Arabidopsis (Table II, see also Supplemental Table II). Of these, three encoded proteins involved in stress- and pathogen-induced responses in plants, one was similar to a bacterial protein from the family of atrazine and melanine chlorohydrolases, and another one was homologous to the human tRNA-guanine transglycosylase. The remaining eight genes encoded proteins with unknown functions (Table II). Interestingly, three of the hypothetical proteins were similar to proteins from cyanobacteria (Synechocystis sp). Therefore, these SASs may represent chloroplast proteins encoded by nuclear genes acquired from the ancestral cyanobacterial symbiont (Rujan and Martin, 2001
Phylogenetic analyses for alignments generated for each of these 13 genes and their homologs, which were retrieved from GenBank, were consistent with known species phylogeny, thus supporting the view of gene loss in the Arabidopsis lineage (data not shown). An example of the phylogeny analysis is shown in Figure 3 for the SASs encoding a polypeptide similar to the E. coli apaG (Table II). Two groups of sequences homologous to the bacterial apaG gene were identified in some angiosperms. One group included sequences from several eudicots and monocots but not from Arabidopsis and metazoans (group A, Fig. 3). The other group (group B, Fig. 3) suggested that in the ancestral lineage of plants and metazoans, an apaG homologous sequence was recruited to a protein containing an F box to form a new protein, which has been conserved in Arabidopsis. This evolutionary pattern of apaG homologous sequences can be explained most simply by differential gene loss events.
Two draft sequences of the full rice genome have been published and were predicted to encode 32,000 to 55,000 genes (Goff et al., 2002
A fraction of 27.5% of the SASs showed no significant similarity with any eudicot sequences. Approximately half of these SASs (5,996) had significant matches with monocot sequences, including the rice genome, and were designated "monocots" (Table I; Figs. 1 and 2A) because these SASs may correspond to sequences restricted to monocot species. Alternatively, some of them may correspond to fast-evolving portions of proteins belonging to families conserved among angiosperms. Such cases could be detected through anchor monocot homologous sequences that would link the portion evolving at high rates to the more conserved sequences i.e. important functional domains, that characterize these conserved protein families (Fig. 4). According to this rational, the monocot EST corresponding to the best match of each SAS was used as anchor query sequences in a further comparison with eudicot EST sequences. When significant matches were found, the SAS contained in the anchor sequence was interpreted to be a fast-evolving sequence. Using this approach, 21% (1,368) of the SASs in the "monocots" were found to be sequences possibly evolving at high rates. The remaining SASs (79%, 5,028) may represent sequences that can be defined more strictly as monocot specific and could, theoretically, represent evolutionary novelties, loss events in eudicots, or horizontal gene transfers.
To further evaluate the participation of these different evolutionary processes in the production of monocot-specific sequences, a manual validation of SASs that had a significant similarity with proteins in GenBank was undertaken. Most of these (164 of 215) could be organized into the four following categories: mobile genetic elements, transcription factors, stress and defense responses, and putative hypothetical rice proteins (Table III; see also Supplemental Table III). This pattern is likely to partly reflect the bias introduced by the more representative monocot proteins present in the databases. A large proportion (approximately 76%) of the 215 SASs were found to correspond to fast-evolving sequences, according to the rational described above and using the GenBank best matches as anchor query sequences (Table III).
These results indicate that accelerated evolution of specific sequences of conserved eudicot-monocot protein families is an important aspect of angiosperm evolution. Consistent with this idea, and considering the mobile genetic elements within the "monocots," the anchor analysis revealed eudicot relationships for the transposon and retrotransposon sequences. Thus, these mobile genetic elements represent highly divergent sequences of eudicot-monocot conserved transposons. Also of interest were the rapidly evolving sequences in the category of transcription factors (Table III), which may promote changes in transcriptional regulatory networks (Van der Hoeven et al., 2002
The proportion of SASs that did not match sequences from any other angiosperm was still high (5,812, corresponding to 13.5% of the SASs). This "no match" group (Table I; Fig. 2A) may correspond to highly variable sequences that either diverged significantly (evolving at high rates) from their homologs in other monocots or that are specific for sugarcane. Alternatively, they may represent 5'- or 3'-untranslated sequences that are likely to be under low selective pressure and, therefore, would not be detected by this comparison, which relied on open reading frames. The 68 SASs with significant matches to proteins of other organisms (non-plant) were analyzed further. Of these, 34 were similar to fungal proteins, 11 to bacterial proteins, 16 to proteins from other organisms (including human), and three to plant virus proteins. Some of these examples are most likely contaminants from endophytic organisms or other sources, but horizontal gene transfer cannot be excluded. The remaining four SASs matched DNA sequences from plants that were not included in the pipeline. The extent to which these 5,812 SASs of the "no match" class represent novel genes restricted to sugarcane remains to be determined.
ESTs are useful for locating and annotating potential genes in chromosomal sequences. This search is normally done using ESTs from the same organism because direct nucleotide sequence alignment allows the identification of the chromosomal region that is potentially transcribed and processed to produce the original mRNA (Seki et al., 2002 The fact that these SASs matched the Arabidopsis genome does not mean that they correspond to nonannotated genes. Limitations are expected with this approach because false hits and pseudogenes can be traced. However, some features were used as criteria to validate the matches as candidate genes in Arabidopsis. First, 759 of 871 SASs (87%) had significant matches with DNA from other plants, implying that they encode conserved proteins, also expected to be found in Arabidopsis. Another criterion was the fact that the similarity of several SASs along the Arabidopsis DNA was discontinuous, indicating the presence of several exons. We found that 294 (34%) of these SASs fulfilled this last criterion. However, single matches (one exon) could also locate potential genes.
Two examples of potentially new genes in Arabidopsis are illustrated in Figure 5. One of these (Fig. 5A) identified five exons of a putative protein involved in cell wall biogenesis. Two of the exons did not match the GenMark prediction (Lukashin and Borodovsky, 1998
A comprehensive genome analysis using the consensus sequences assembled from approximately 240,000 sugarcane ESTs has revealed a core set of conserved eudicot-monocot sequences corresponding to 70.5% of the SASs. This result agrees with that found in a maize versus Arabidopsis (Brendel et al., 2002
A set of sugarcane sequences was found to be conserved among angiosperms but was missing in Arabidopsis (Table II; Fig. 2). This finding suggests that the corresponding genes were present in the ancestor of monocots and eudicots and were subsequently lost in Arabidopsis. This conclusion agrees and complements the recent reports (Allen, 2002
Our comparative analysis revealed a set of sequences that appears to be specific to monocots. This monocot class of SAS is of particular interest because it may include sequences that could be related to monocot-specific traits and may represent true evolutionary novelties or gene losses in eudicots. A detailed analysis of these sequences indicated that a significant proportion corresponded to fast-evolving sequences found in members of conserved angiosperm gene families. A high rate of evolution can be related to low functional constraints and/or functional diversification. This latter possibility is more likely to be responsible for the production of new protein functions that may be involved in the differentiation of a specific evolutionary lineage. Gene duplication followed by sequence divergence is the main means for functional diversification (Lynch and Conery, 2000 Some SASs of the Monocot class appeared to define new protein architecture that resulted from the association of monocot-specific sequences with conserved eudicot-monocot protein domains. These cases may represent exon shuffling events that could possibly lead to new functions. Together, these data indicate that evolutionary events underlying the differentiation of eudicots and monocots relies on functional diversification (generating new proteins functions) from duplicated copies of conserved gene families and on the acquisition of novel protein sequences.
Recently, a rice versus Arabidopsis comparison indicated that almost one-half of the 53,398 genes predicted in the rice genome (Yu et al., 2002
The sugarcane EST collection used in this study represents one of the largest and most representative transcriptome data set for a monocotyledon species. The evolutionary proximity of rice and sugarcane means that a large number of homologous sequences are to be expected from these two species. Hence, the sugarcane ESTs provide an important contribution to studies of the published draft genome of rice (Goff et al., 2002
The sugarcane (Saccharum officinarum) EST sequences were from the SUCEST project, which has been described previously (Telles and da Silva, 2001
SASs with no similarity to Arabidopsis sequences were compared against eudicotyledon (tomato [Lycopersicon esculentum], soybean [Glycine max], and Lotus japonicus) and monocotyledon (barley [Hordeum vulgare], rice, sorghum [Sorghum bicolor], and maize [Zea mays]) EST sequences. All SASs were compared with the complete GenBank database for categorization (Telles et al., 2001
The presence of chloroplast target peptide was predicted with the ChloroP program (Emanuelsson et al., 1999
The authors thank the technicians and researchers who contributed to the sequencing effort and whose names are listed at the Web site http://sucest.lad.ic.unicamp.br/public. The authors also acknowledge the contribution of Dr. Nicolas Carels for critical reading of the manuscript. Received September 24, 2003; returned for revision October 23, 2003; accepted December 12, 2003.
http://www.plantphysiol.org/cgi/doi/10.1104/pp.103.033878.
1 This work was jointly supported by the Fundação de Amparo à Pesquisa do Estado de São Paulo (São Paulo, Brazil), by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq, Brasília, Brazil), and by COPERSUCAR (Piracicaba, Brazil).
[w] The online version of this article contains Web-only data.
2 This paper is dedicated to the memory of Roberto V. Santelli. * Corresponding author; e-mail cfmmenck{at}usp.br; fax 55-11-3091-7354.
Allen KD (2002) Assaying gene content in Arabidopsis. Proc Natl Acad Sci USA 99: 9568-9572
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389-3402 Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796-815[CrossRef][Medline]
Aravind L, Watanabe H, Lipman DJ, Koonin EV (2000) Lineage-specific loss and divergence of functionally linked genes in eukaryotes. Proc Natl Acad Sci USA 97: 11319-11324
Bergelson J, Kreitman M, Stahl EA, Tian D (2001) Evolutionary dynamics of plant R-genes. Science 292: 2281-2285
Braun EL, Halpern AL, Nelson MA, Natvig DO (2000) Large-scale comparison of fungal sequence information: mechanisms of innovation in Neurospora crassa and gene loss in Saccharomyces cerevisiae. Genome Res 10: 416-430 Brendel V, Kurtz S, Walbot V (2002) Comparative genomics of Arabidopsis and maize: prospects and limitations. Genome Biol 3: REVIEWS1005 Burge C, Karlin S (1997) Prediction of complete gene structures in human genomic DNA. J Mol Biol 268: 78-94[CrossRef][ISI][Medline] Copley R, Letunic I, Bork P (2001) Genome and protein evolution in eukaryotes. Curr Opin Chem Biol 6: 39-45 Cronk QC (2001) Plant evolution and development in a post-genomic context. Nat Rev Genet 2: 607-619[CrossRef][ISI][Medline]
Doebley J, Lukens L (1998) Transcriptional regulators and the evolution of plant form. Plant Cell 10: 1075-1082 Emanuelsson O, Nielsen H, von Heijne G (1999) ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites. Protein Sci 8: 978-984[Abstract] Felsenstein J (1993) PHYLIP (Phylogeny Inference Package) Version 3.5c. Department of Genetics, University of Washington, Seattle Felsenstein J (1997) An alternating least squares approach to inferring phylogenies from pairwise distances. Syst Biol 46: 101-111[CrossRef][ISI][Medline] Feng Q, Zhang Y, Hao P, Wang S, Fu G, Huang Y, Li Y, Zhu J, Liu Y, Hu X et al. (2002) Sequence and analysis of rice chromosome 4. Nature 420: 316-320[CrossRef][Medline]
Fulton TM, Van der Hoeven R, Eannetta NT, Tanksley SD (2002) Identification, analysis, and utilization of conserved ortholog set markers for comparative genomics in higher plants. Plant Cell 14: 1457-1467
Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296: 92-100 Grivet L, Arruda P (2002) Sugarcane genomics: depicting the complex genome of an important tropical crop. Curr Opin Plant Biol 5: 122-127[Medline]
Henikoff S, Greene E, Pietrokovski S, Bork P, Attwood T, Hood L (1997) Gene families: the taxonomy of protein paralogs and chimeras. Science 278: 609-614
Kellog E (2001) Evolutionary history of the grasses. Plant Physiol 125: 1198-1205
Kikuchi S, Satoh K, Nagata T, Kawagashira N, Doi K, Kishimoto N, Yazaki J, Ishikawa M, Yamada H, Ooka H et al. (2003) Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science 301: 376-379 Kondrashov F, Rogozin I, Wolf Y, Koonin E (2002) Selection in the evolution of gene duplication. Genome Biol 3: 1-9 Koonin EV, Aravind L, Kondrashov AS (2000) The impact of comparative genomics on our understanding of evolution. Cell 101: 573-576[CrossRef][ISI][Medline]
Ku HM, Vision T, Liu J, Tanksley SD (2000) Comparing sequenced segments of the tomato and Arabidopsis genomes: large-scale duplication followed by selective gene loss creates a network of synteny. Proc Natl Acad Sci USA 97: 9121-9126 Lander E, Linton L, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, Fitzhugh W et al. (2001) Initial sequencing and analysis of the human genome. Nature 409: 860-921[CrossRef][Medline]
Liu H, Sachidanandam R, Stein L (2001) Comparative genomics between rice and Arabidopsis shows scant collinearity in gene order. Genome Res 11: 2020-2026
Lukashin AV, Borodovsky M (1998) GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 26: 1107-1115
Lynch M, Conery JS (2000) The evolutionary fate and consequences of duplicate genes. Science 290: 1151-1155
Poethig RS (2001) Life with 25,000 genes. Genome Res 11: 313-316
Quackenbush J, Cho J, Lee D, Liang F, Holt I, Karamycheva S, Parvizi B, Pertea G, Sultana R, White J (2001) The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species. Nucleic Acids Res 29: 159-164
Rubin G, Yandell M, Wortman J (2000) Comparative genomics of the eukaryotes. Science 287: 2204-2215 Rujan T, Martin W (2001) How many genes in Arabidopsis come from cyanobacteria? An estimate from 386 protein phylogenies. Trends Genet 17: 113-120[CrossRef][ISI][Medline]
Salzberg SL, White O, Peterson J, Eisen JA (2001) Microbial genes in the human genome: lateral transfer or gene loss? Science 292: 1903-1906 Sankoff D (2001) Gene and genome duplication. Curr Opin Genet Dev 11: 681-684[CrossRef][ISI][Medline] Sasaki T, Matsumoto T, Yamamoto K, Sakata K, Baba T, Katayose Y, Wu J, Niimura Y, Cheng Z, Nagamura Y et al. (2002) The genome sequence and structure of rice chromosome 1. Nature 420: 312-316[CrossRef][Medline]
Schoof H, Zaccaria P, Gundlach H, Lemcke K, Rudd S, Kolesov G, Arnold R, Mewes HW, Mayer KF (2002) MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource based on the first complete plant genome. Nucleic Acids Res 30: 91-93
Seki M, Narusaka M, Kamiya A, Ishida J, Satou M, Sakurai T, Nakajima M, Enju A, Akiyama K, Oono Y et al. (2002) Functional annotation of a full-length Arabidopsis cDNA collection. Science 296: 141-145
Somerville C, Somerville S (1999) Plant functional genomics. Science 285: 380-383 Souza GM, Simoes ACQ, Oliveira KC, Garay HM, Fiorini LC, Gomes FdS, Nishiyama-Junior MY, Silva AM (2001) The sugarcane signal transduction (SUCAST) catalogue: prospecting signal transduction in sugarcane. Genet Mol Biol 24: 25-34 Stanhope MJ, Lupas A, Italia MJ, Koretke KK, Volker C, Brown JR (2001) Phylogenetic analyses do not support horizontal gene transfers from bacteria to vertebrates. Nature 411: 940-944[CrossRef][Medline] Telles GP, Braga MVD, Dias Z, Quitzau JAA, da Silva FR, Meidanis J (2001) Bioinformatics of the sugarcane EST project. Genet Mol Biol 24: 8-15 Telles GP, da Silva FR (2001) Trimming and clustering sugarcane ESTs. Genet Mol Biol l 24: 17-23
Van der Hoeven R, Ronning C, Giovannoni J, Martin G, Tanksley S (2002) Deductions about the number, organization, and evolution of genes in the tomato genome based on analysis of a large expressed sequence tag collection and selective genomic sequencing. Plant Cell 14: 1441-1456 Vettore AL, da Silva FR, Kemper EL, Arruda P (2001) The libraries that made SUCEST. Genet Mol Biol 24: 1-7
Vettore AL, da Silva FR, Kemper EL, Souza GM, Silva AM, Ferro MIT, Henrique-Silva F, Giglioti EA, Lemos MV, Coutinho LL et al. (2003) Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane. Genome Res 13: 2725-2735
Vision T, Brown D, Tanksley S (2000) The origins of genomic duplications in Arabidopsis. Science 290: 2114-2117 Wendel J (2000) Genome evolution in polyploids. Plant Mol Biol 42: 225-249[CrossRef][ISI][Medline] Wikstrom N, Savolainen V, Chase MW (2001) Evolution of the angiosperms: calibrating the family tree. Proc R Soc Lond B Biol Sci 268: 2211-2220[Medline]
Wolfe KH, Gouy M, Yang YW, Sharp PM, Li WH (1989) Date of the monocot-dicot divergence estimated from chloroplast DNA sequence data. Proc Natl Acad Sci USA 86: 6201-6205 Yang Y, Lai K, Tai P, Li W (1999) Rates of nucleotide substitution in angiosperm mitochondrial DNA sequences and dates of divergence between Brassica and other angiosperm lineages. J Mol Evol 48: 597-604[CrossRef][ISI][Medline]
Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296: 79-92 Related articles in Plant Physiol.:
This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY | THE PLANT CELL | |
|---|---|---|---|