|
|
||||||||
|
First published online August 20, 2008; 10.1104/pp.108.122457 Plant Physiology 148:993-1003 (2008) © 2008 American Society of Plant Biologists OPEN ACCESS ARTICLE
Importance of Lineage-Specific Expansion of Plant Tandem Duplicates in the Adaptive Response to Environmental Stimuli1,[W],[OA]Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824 (K.H., C.Z., M.D.L.-S., S.-H.S.); and Gene Discovery Research Group, RIKEN Plant Science Center, Yokohama, Kanagawa 230–0045, Japan (K.H., K.S.)
Plants have substantially higher gene duplication rates compared with most other eukaryotes. These plant gene duplicates are mostly derived from whole genome and/or tandem duplications. Earlier studies have shown that a large number of duplicate genes are retained over a long evolutionary time, and there is a clear functional bias in retention. However, the influence of duplication mechanism, particularly tandem duplication, on duplicate retention has not been thoroughly investigated. We have defined orthologous groups (OGs) between Arabidopsis (Arabidopsis thaliana) and three other land plants to examine the functional bias of retained duplicate genes during vascular plant evolution. Based on analysis of Gene Ontology categories, it is clear that genes in OGs that expanded via tandem duplication tend to be involved in responses to environmental stimuli, while those that expanded via nontandem mechanisms tend to have intracellular regulatory roles. Using Arabidopsis stress expression data, we further demonstrated that tandem duplicates in expanded OGs are significantly enriched in genes that are up-regulated by biotic stress conditions. In addition, tandem duplication of genes in an OG tends to be highly asymmetric. That is, expansion of OGs with tandem genes in one organismal lineage tends to be coupled with losses in the other. This is consistent with the notion that these tandem genes have experienced lineage-specific selection. In contrast, OGs with genes duplicated via nontandem mechanisms tend to experience convergent expansion, in which similar numbers of genes are gained in parallel. Our study demonstrates that the expansion of gene families and the retention of duplicates in plants exhibit substantial functional biases that are strongly influenced by the mechanism of duplication. In particular, genes involved in stress responses have an elevated probability of retention in a single-lineage fashion following tandem duplication, suggesting that these tandem duplicates are likely important for adaptive evolution to rapidly changing environments.
Plant genomes contain a higher proportion of recently duplicated genes compared with most other eukaryotes (Lockton and Gaut, 2005
Several evolutionary and population genetic models of duplicate gene fate have been proposed that provide the theoretical and mechanistic explanations for gene retention (Ohno, 1970 To study patterns of functional bias among genes derived from lineage-specific expansion events, we first classified genes from Arabidopsis, poplar (Populus trichocarpa), rice, and the moss Physcomitrella patens into orthologous groups (OGs) and determined the degree of expansion for each OG. After identifying genes in expanded versus nonexpanded OGs, we used Arabidopsis Gene Ontology (GO) annotations to examine the functional bias of retained duplicates in the Arabidopsis lineage. To better understand the relationship between the nature or types of stress responses (such as different biotic and abiotic conditions) and duplication mechanisms (tandem versus nontandem), we examined an Arabidopsis expression data set containing 15 abiotic and biotic stress treatments and identified stress conditions enriched in retained genes duplicated via tandem or nontandem mechanisms. Finally, we compared patterns of lineage-specific expansion and functional bias among tandem and nontandem genes to determine how duplication mechanism contributed to gene family expansion during land plant evolution.
Rate of Lineage-Specific Expansion Plants have substantially higher rates of gene duplication than other organisms. This, together with substantial functional bias in gene retention, has contributed to dramatic differences in the degree of lineage-specific expansion among plant gene families. To address the question if the rate of gene gains was constant throughout the evolutionary history of land plants, protein-coding genes from Arabidopsis, moss, rice, and poplar were classified into similarity clusters (referred to as gene families; see "Materials and Methods"). Among 14,745 gene families, 5,060 are shared among all four plant species. A gene tree was generated for each shared family (see "Materials and Methods"). The gene tree and the species tree of these four plants were reconciled for estimating ancestral gene numbers. These ancestral gene numbers were then used to determine gene-gain events in the lineage leading to Arabidopsis (Fig. 1A ). The rates of gene gain (total gain during a time period divided by the estimated duration) are not constant over the three time periods we examined (branches 1, 2, and 3 in Fig. 1A). The gain rates for branches linked together by older ancestral nodes are smaller than those linked by "younger" branches. For example, the gain rate in branch 1 (14.1–28.2 gains per million years) is approximately four times slower than that in branch 3 (44.3–53.2 gains per million years).
One explanation for this gain rate difference is that, early in vascular plant evolution, the duplication rate was substantially lower. If this is the case, one may expect plant lineages that diverged earlier in the vascular plant lineage to have low duplication rates. However, it is estimated that Ceratopteris richardii, a fern in the lineage that split from the flowering plant lineage early in vascular plant evolution, likely has a higher proportion of paralogs than the flowering plants (Nakazato et al., 2006
In addition to differences in the rates of gene gain over the course of vascular plant evolution, it is likely that the functional bias in duplicate retention may differ depending on the time period examined, due to differences in organismal complexity and environment. To determine if the patterns of functional biases differ depending on the timing of gene duplication, we evaluated the representation of duplicate genes in expanded versus nonexpanded OGs in various functional categories. Here, duplicates in expanded OGs are genes with elevated rates of retention/duplication compared with genes in nonexpanded OGs. By examining OGs, the ancestral gene numbers and subsequent gains can be estimated; therefore, all sorts of classifications, in particular functional categories of genes, can be compared in the same evolutionary period. Ideally, we would evaluate the functional categories for each internal or external node in the four-plant phylogeny to directly determine the functional bias of genes in expanded OGs. However, this is problematic due to the difficulty in inferring ancestral functions. Although we have gene-gain data for all four plant species, we focused on the comparison of Arabidopsis with other plants in a pair-wise fashion for two reasons. First, the functional annotation data for other plant species are either absent or not as comprehensive as those for Arabidopsis. Second, functional annotation criteria may differ between species and influence the interpretation of results. It should be noted that this approach resulted in analyzing functional biases in a somewhat nonindependent fashion, since some of the branches overlap.
We assessed if there is a functional bias among genes in expanded OGs by determining the GO categories with overrepresented numbers of genes in OGs that have expanded in the Arabidopsis lineage (Supplemental Fig. S1). For each shared family, we identified three "types" of OGs (Fig. 1): Arabidopsis-poplar (A-P), Arabidopsis-rice (A-R), and Arabidopsis-moss (A-M). Each OG represents the presence of one ancestral gene from the progenitor of the species pair and all duplicates generated and retained after speciation. Currently, orthology is determined either by reconciling species and gene trees (referred to as tree-based; Chen et al., 2000
Based on the three types of OGs defined (A-P, A-R, and A-M) encompassing different periods in the evolution of the Arabidopsis lineage, we could compare the overrepresented functional categories and determine which categories are specific to a time period. The number of gene gains in an OG is defined as the number of Arabidopsis genes in an OG minus one (because each OG indicates the presence of an ancestral gene). In each functional category, the number of gene gains was compared with the number of expected gene gains using the
Based on published reports, it is anticipated that tandem duplication will be more closely associated with stress-related genes than nontandem duplication (Parniske et al., 1997
Signaling networks can be partitioned into three major layers: transducers, transcriptional regulators, and regulatory targets (Doebley and Lukens, 1998
Assuming that OGs that have expanded via nontandem mechanisms are mostly derived from whole genome duplications, our finding is consistent with earlier reports that transcriptional regulation categories have overrepresented numbers of duplicates from polyploidization (Blanc and Wolfe, 2004
In this study, we focus on the properties of lineage-specific expanded OGs instead of the properties of paralogous genes, as in previous studies (Maere et al., 2005
Based on the analysis of functional categories with overrepresented numbers of genes in expanded OGs, one of the most notable differences between tandem and nontandem duplicates is their involvement in the response to environmental stimuli and biotic stress. However, it remains an open question if this is a property of stress genes in general or genes involved in certain types of stress conditions. To address this question, we examined the expansion patterns of stress-responsive genes in the Arabidopsis lineage using the AtGenExpress stress expression data set (Kilian et al., 2007
To determine whether stress-responsive genes found in expanded OGs tend to be derived from tandem or nontandem duplications, we asked if there is a relative enrichment of stress-responsive genes in OGs that expanded via tandem or nontandem duplication for each stress condition (the Tandem [T]/Nontandem [N] columns in Table I). Up-regulated genes in general belong to OGs with higher rates of expansion compared with the OGs containing nonresponsive genes. However, there are clear differences in how duplication mechanism contributed to the expansion of OGs containing stress-responsive genes. Genes in OGs that expanded via tandem duplication are more likely to be up-regulated under biotic stresses than those in OGs that expanded via nontandem duplication (the T/N columns for up-regulation in Table I), which is consistent with the GO-based analysis. In contrast, genes up-regulated by abiotic stress and down-regulated genes in general belong to OGs that are equally likely to have expanded via either tandem or nontandem mechanisms (Table I). Interestingly, the fact that biotic stress-responsive genes tend to be derived from tandem duplication does not affect the relationship between functional partitioning in signaling networks and duplication mechanism, as postulated in the previous section. For example, among genes up-regulated by biotic stress, molecular function categories, including DNA binding and transcription regulator activity, are still enriched in nontandem duplicates (Fig. 2C). Taken together, these findings indicate that expansion of biotic stress response genes has occurred more often via tandem duplication than expected. However, the position of a gene in the cellular signaling network has an overriding influence on the predominant duplication mechanisms that contribute to lineage-specific expansion.
Our study examines the functional bias of retained duplicates in the Arabidopsis lineage and shows that genes in OGs that expanded via tandem and nontandem duplication mechanisms are related to stress responses and intracellular regulatory roles, respectively. Since different plant lineages have very different life histories, the nature of selection pressure imposed by their environmental conditions is expected to be diverse. Therefore, if expansion of certain OGs with tandem duplicates were involved in adaptive responses to the environment specific to one species, their orthologous genes in another species will tend not to be retained. To test this prediction, we first evaluated if the number of OGs with a particular lineage-specific gene number combination (e.g. for an OG with two Arabidopsis and three moss genes; Fig. 3A ) is overrepresented or underrepresented (Fig. 3). Overrepresentation or underrepresentation of a gene number combination was evaluated using the log ratios between observed and expected numbers of gains generated based on the power distribution of branch size in a particular lineage. Interestingly, we found that the gene number combinations are obviously nonrandom, with two extreme patterns (examples comparing the Arabidopsis lineage against the other three lineages are shown in Fig. 3). The first is "convergent expansion," where two or more genes were gained independently in different lineages. The other is "single-lineage expansion," where gene gains have predominantly occurred in one lineage and are coupled with a loss in the other.
To determine whether either of the two expansion patterns is correlated with tandem or nontandem duplication, we classified genes into tandem and nontandem categories (see "Materials and Methods"). The ratios between the numbers of tandem and nontandem duplicated genes in OGs that expanded in either a convergent or single-lineage fashion were used to assess the significance of enrichment (Table II ). Here, OGs are regarded as convergently expanded if the number of genes is more than two in both lineages. OGs are defined as expanded in a single-lineage fashion if the number of genes is two or more in one lineage and zero in the other. Here, we show the results where the tandem/nontandem genes are defined solely based on that of the Arabidopsis genome in comparison with the other three lineages. We found that tandem duplicates tend to be in OGs that experienced single-lineage expansion ( 2 test; P < 10–12). In contrast, genes duplicated via nontandem mechanisms are mostly found in OGs that underwent convergent expansion ( 2 test; P < 10–12). These findings are consistent regardless of which types or sets of OGs (A-P, A-R, or A-M; similarity- or tree-based) were examined. We also observed that in OGs experiencing single-lineage expansion, the ratio of tandem to nontandem duplicated genes is highest in the A-P pairs and decreases as the divergence time between the species pair increases (Table II). One potential explanation for this trend is that some of these tandem genes may look younger because of gene conversion (Gao and Innan, 2004
Most highly duplicated genes have been reported to reside in OGs that expanded in parallel in different lineages (Oliver et al., 2000
We have also conducted analyses to identify functional categories enriched in OGs that expanded in a single-lineage fashion (Supplemental Fig. S4). A large number of these categories, including response to biotic stress and secondary metabolism, are similar to those identified as being overrepresented among tandem duplicates. Therefore, genes found in OGs that experienced single-lineage expansion are likely involved in adaptive evolution in a lineage-specific fashion and turn over rapidly. In an earlier study, it was shown that genes derived from small-scale (mostly tandem) duplication events tend to have low decay rates (Maere et al., 2005
Plant genes in OGs that expanded via tandem duplication tend to be involved in responses to biotic stress and environmental stimuli based on GO categories. Furthermore, by examining stress microarray data sets, we found that tandem duplicates are enriched in stress-responsive genes. Therefore, genes influencing stress response have an elevated probability of retention following tandem duplication. Why do these stress-responsive tandem genes tend to be retained? An important feature of tandem genes is their high rate of duplication per generation. As a result, new tandem gene paralogs are continuously generated, likely providing a pool of highly dynamic targets for selection. In addition, tandem genes are highly variable within species. For example, sequence variation between Arabidopsis ecotypes is enriched in regions containing tandem duplicates (Borevitz and Nordborg, 2003
Interestingly, gene families that have been shown to be important for responding to biotic stresses, such as Leu-rich repeat disease resistance gene and receptor kinase, have a higher proportion of members in polymorphic regions than other genes families (Clark et al., 2007
Defining Gene Families and Tandem Duplicate Clusters
The amino acid sequences of four plant species (Arabidopsis [Arabidopsis thaliana], TAIR6; poplar [Populus trichocarpa], version 1.1; rice [Oryza sativa japonica], version 2; and the moss Physcomitrella patens, version 1.1) were obtained from The Arabidopsis Information Resource (www.arabidopsis.org), The Institute for Genomic Research (www.tigr.org), and the Joint Genome Institute (www.jgi.doe.gov). To define gene families among sequences from these four species, all-against-all similarity searches were conducted using BLAST with an E-value cutoff of 1e-5 (Altschul et al., 1997 Tandem duplicated genes were defined as genes in any gene pair, T1 and T2, that (1) belong to the same gene family, (2) are located within 100 kb each other, and (3) are separated by zero, one or fewer, five or fewer, or 10 or fewer nonhomologous (not in the same gene family as T1 and T2) spacer genes. Therefore, there are four sets of tandem gene definitions. All analyses were conducted using all four sets, and we found that the results were consistent regardless of the criteria. Therefore, only the analysis results based on the 10 or fewer spacer gene criteria are reported.
We took two approaches to infer OGs between Arabidopsis and poplar, rice, and moss. In the first approach, protein sequences of members in each family were aligned with ClustalW (Thompson et al., 1994
In the second approach, we used a search algorithm similar to Inparanoid (Remm et al., 2001 To evaluate the consistency of OGs constructed with the tree- and similarity-based approaches, we determined the degree of overlap between these two OG data sets. Since each gene is assigned to tree- and similarity-based OGs, the number of overlapping genes was counted between the tree- and similarity-based OGs containing the gene. The proportion of the overlapping genes was calculated for tree, and similarity-based OGs. The average proportion was calculated for all genes.
GO assignments for Arabidopsis genes were obtained from The Arabidopsis Information Resource (http://www.arabidopsis.org/). Three top GO categories, cellular components, molecular functions, and biological processes, were analyzed as described earlier (Shiu et al., 2006
For functional categories with overrepresented numbers of genes in expanded OGs, the observed ratio of tandem to nontandem duplicated genes in each category was compared with the expected ratio to determine whether the overrepresentation is due mainly to the contribution of tandem or nontandem duplications. The expected ratio was estimated from the number of all Arabidopsis genes duplicated via tandem and nontandem mechanisms in expanded OGs. To correct for multiple testing, the moderated P value (q) was estimated from raw To determine if stress-responsive genes (based on gene expression data) in expanded OGs derived from tandem or nontandem duplication tend to have certain molecular functions, we used the GO molecular function assignments of stress response genes for the overrepresentation analysis. Genes with only IEA (Inferred from Electronic Annotation) and IEP (Inferred from Expression Pattern) evidence were excluded.
Gene expression data under eight abiotic and eight biotic stress conditions were obtained from AtGenExpress (http://www.uni-tuebingen.de/plantphys/AFGN/atgenex.htm). The array intensities were processed using the Bioconductor (www.bioconductor.org) affy package in the R software environment (www.r-project.org). After background correction and quantile normalization, significantly up- and down-regulated genes under each stress condition were identified by comparing the hybridization intensities of arrays hybridized with treated samples against their corresponding control with LIMMA (Wettenhall and Smyth, 2004
In every OG, there are two lineages (I and J), each with i and j genes. To determine if an OG with a particular gene number combination (i, j) is overrepresented or underrepresented, we determined a log ratio by the following equation:
The following materials are available in the online version of this article.
We thank Takeshi Itoh and Takashi Makino for reading the manuscript and for discussion. We also thank The Arabidopsis Information Resource, The Institute of Genome Research, and the Joint Genome Institute for providing annotations and sequences and the Arabidopsis Functional Genomics Network for making the stress expression data sets available. Received May 8, 2008; accepted August 16, 2008; published August 20, 2008.
1 This work was supported by the Michigan State University Intramural Research Grant Program (grant no. 06–IRGP–875) and the National Science Foundation (grant nos. DBI–0638591 and MCB–00749634) to S.-H.S. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Shin-Han Shiu (shius{at}msu.edu).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.108.122457 * Corresponding author; e-mail shius{at}msu.edu.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402 Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796–815[CrossRef][Web of Science][Medline] Aravind L, Dixit VM, Koonin EV (2001) Apoptotic molecular machinery: vastly increased complexity in vertebrates revealed by genome comparisons. Science 291: 1279–1284 Blanc G, Wolfe KH (2004) Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell 16: 1667–1678 Borevitz JO, Hazen SP, Michael TP, Morris GP, Baxter IR, Hu TT, Chen H, Werner JD, Nordborg M, Salt DE, et al (2007) Genome-wide patterns of single-feature polymorphism in Arabidopsis thaliana. Proc Natl Acad Sci USA 104: 12057–12062 Borevitz JO, Nordborg M (2003) The impact of genomics on the study of natural variation in Arabidopsis. Plant Physiol 132: 718–725 Chen F, Mackey AJ, Vermunt JK, Roos DS (2007) Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS ONE 2: e383[CrossRef] Chen K, Durand D, Farach-Colton M (2000) NOTUNG: a program for dating gene duplications and optimizing gene family trees. J Comput Biol 7: 429–447[CrossRef][Web of Science][Medline] Clark RM, Schweikert G, Toomajian C, Ossowski S, Zeller G, Shinn P, Warthmann N, Hu TT, Fu G, Hinds DA, et al (2007) Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana. Science 317: 338–342 Doebley J, Lukens L (1998) Transcriptional regulators and the evolution of plant form. Plant Cell 10: 1075–1082 Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J (1999) Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151: 1531–1545 Fortna A, Kim Y, MacLaren E, Marshall K, Hahn G, Meltesen L, Brenton M, Hink R, Burgers S, Hernandez-Boussard T, et al (2004) Lineage-specific gene duplication and loss in human and great ape evolution. PLoS Biol 2: E207[CrossRef][Medline] Gao LZ, Innan H (2004) Very low gene duplication rate in the yeast genome. Science 306: 1367–1370 Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296: 92–100 Heckman DS, Geiser DM, Eidell BR, Stauffer RL, Kardos NL, Hedges SB (2001) Molecular evidence for the early colonization of land by fungi and plants. Science 293: 1129–1133 Holub EB (2001) The arms race is ancient history in Arabidopsis, the wildflower. Nat Rev Genet 2: 516–527[CrossRef][Web of Science][Medline] Hughes AL, Friedman R (2003) Parallel evolution by gene duplication in the genomes of two unicellular fungi. Genome Res 13: 794–799 Kilian J, Whitehead D, Horak J, Wanke D, Weinl S, Batistic O, D'Angelo C, Bornberg-Bauer E, Kudla J, Harter K (2007) The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J 50: 347–363[CrossRef][Web of Science][Medline] Kimura M (1980) A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 16: 111–120[CrossRef][Web of Science][Medline] Kovalchuk I, Kovalchuk O, Kalck V, Boyko V, Filkowski J, Heinlein M, Hohn B (2003) Pathogen-induced systemic plant signal triggers DNA rearrangements. Nature 423: 760–762[CrossRef][Web of Science][Medline] Leister D (2004) Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance gene. Trends Genet 20: 116–122[CrossRef][Web of Science][Medline] Lespinet O, Wolf YI, Koonin EV, Aravind L (2002) The role of lineage-specific gene family expansion in the evolution of eukaryotes. Genome Res 12: 1048–1059 Lockton S, Gaut BS (2005) Plant conserved non-coding sequences and paralogue evolution. Trends Genet 21: 60–65[CrossRef][Web of Science][Medline] Lucht JM, Mauch-Mani B, Steiner HY, Metraux JP, Ryals J, Hohn B (2002) Pathogen stress increases somatic recombination frequency in Arabidopsis. Nat Genet 30: 311–314[CrossRef][Web of Science][Medline] Lynch M, Conery JS (2000) The evolutionary fate and consequences of duplicate genes. Science 290: 1151–1155 Lynch M, Force A (2000) The probability of duplicate gene preservation by subfunctionalization. Genetics 154: 459–473 Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, Kuiper M, Van de Peer Y (2005) Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci USA 102: 5454–5459 Meyers BC, Kozik A, Griego A, Kuang H, Michelmore RW (2003) Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell 15: 809–834 Michelmore RW, Meyers BC (1998) Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process. Genome Res 8: 1113–1130 Mondragon-Palomino M, Gaut BS (2005) Gene conversion and the evolution of three leucine-rich repeat gene families in Arabidopsis thaliana. Mol Biol Evol 22: 2444–2456 Moore RC, Purugganan MD (2003) The early stages of duplicate gene evolution. Proc Natl Acad Sci USA 100: 15682–15687 Moore RC, Purugganan MD (2005) The evolutionary dynamics of plant duplicate genes. Curr Opin Plant Biol 8: 122–128[CrossRef][Web of Science][Medline] Nakazato T, Jung MK, Housworth EA, Rieseberg LH, Gastony GJ (2006) Genetic map-based analysis of genome structure in the homosporous fern Ceratopteris richardii. Genetics 173: 1585–1597 Nei M, Rooney AP (2005) Concerted and birth-and-death evolution of multigene families. Annu Rev Genet 39: 121–152[CrossRef][Web of Science][Medline] Ohno S (1970) Evolution of Gene Duplication. Springer-Verlag, New York Oliver MJ, Tuba Z, Mishler BD (2000) The evolution of vegetative desiccation tolerance in land plants. Plant Ecol 151: 85–100[CrossRef] Parniske M, Hammond-Kosack KE, Golstein C, Thomas CM, Jones DA, Harrison K, Wulff BB, Jones JD (1997) Novel disease resistance specificities result from sequence exchange between tandemly repeated genes at the Cf-4/9 locus of tomato. Cell 91: 821–832[CrossRef][Web of Science][Medline] Paterson AH, Bowers JE, Chapman BA (2004) Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci USA 101: 9903–9908 Raes J, Vandepoele K, Simillion C, Saeys Y, Van de Peer Y (2003) Investigating ancient duplication events in the Arabidopsis genome. J Struct Funct Genomics 3: 117–129[CrossRef][Medline] Remm M, Storm CE, Sonnhammer EL (2001) Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol 314: 1041–1052[CrossRef][Web of Science][Medline] Rensing SA, Lang D, Zimmer AD, Terry A, Salamov A, Shapiro H, Nishiyama T, Perroud PF, Lindquist EA, Kamisugi Y, et al (2008) The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science 319: 64–69 Rizzon C, Ponger L, Gaut BS (2006) Striking similarities in the genomic distribution of tandemly arrayed genes in Arabidopsis and rice. PLoS Comput Biol 2: e115[CrossRef][Medline] Rostoks N, Borevitz JO, Hedley PE, Russell J, Mudie S, Morris J, Cardle L, Marshall DF, Waugh R (2005) Single-feature polymorphism discovery in the barley transcriptome. Genome Biol 6: R54[CrossRef][Medline] Rouquier S, Friedman C, Delettre C, van den Engh G, Blancher A, Crouau-Roy B, Trask BJ, Giorgi D (1998) A gene recently inactivated in human defines a new olfactory receptor family in mammals. Hum Mol Genet 7: 1337–1345 Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4: 406–425[Abstract] Seoighe C, Gehring C (2004) Genome duplication led to highly selective expansion of the Arabidopsis thaliana proteome. Trends Genet 20: 461–464[CrossRef][Web of Science][Medline] Shiu SH, Bleecker AB (2001) Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. Proc Natl Acad Sci USA 98: 10763–10768 Shiu SH, Byrnes JK, Pan R, Zhang P, Li WH (2006) Role of positive selection in the retention of duplicate genes in mammalian genomes. Proc Natl Acad Sci USA 103: 2232–2236 Shiu SH, Karlowski WM, Pan R, Tzeng YH, Mayer KF, Li WH (2004) Comparative analysis of the receptor-like kinase family in Arabidopsis and rice. Plant Cell 16: 1220–1234 Shiu SH, Shih MC, Li WH (2005) Transcription factor families have much higher expansion rates in plants than in animals. Plant Physiol 139: 18–26 Simillion C, Vandepoele K, Van Montagu MC, Zabeau M, Van de Peer Y (2002) The hidden duplication past of Arabidopsis thaliana. Proc Natl Acad Sci USA 99: 13627–13632 Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci USA 100: 9440–9445 Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673–4680 Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313: 1596–1604 van Dongen S (2000) Graph clustering by flow simulation. PhD thesis. University of Utrecht, Utrecht, The Netherlands Vision TJ, Brown DG, Tanksley SD (2000) The origins of genomic duplications in Arabidopsis. Science 290: 2114–2117 Walsh JB (1995) How often do duplicated genes evolve new functions? Genetics 139: 421–428[Abstract] Wapinski I, Pfeffer A, Friedman N, Regev A (2007) Natural history and evolutionary principles of gene duplication in fungi. Nature 449: 54–61[CrossRef][Web of Science][Medline] Wettenhall JM, Smyth GK (2004) limmaGUI: a graphical user interface for linear modeling of microarray data. Bioinformatics 20: 3705–3706 Wolfe KH, Gouy M, Yang YW, Sharp PM, Li WH (1989) Date of the monocot-dicot divergence estimated from chloroplast DNA sequence data. Proc Natl Acad Sci USA 86: 6201–6205 Young JM, Friedman C, Williams EM, Ross JA, Tonnes-Priddy L, Trask BJ (2002) Different evolutionary processes shaped the mouse and human olfactory receptor gene families. Hum Mol Genet 11: 535–546 Zhang L, Gaut BS (2003) Does recombination shape the distribution and evolution of tandemly arrayed genes (TAGs) in the Arabidopsis thaliana genome? Genome Res 13: 2533–2540 This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|