|
|
||||||||
|
First published online April 20, 2007; 10.1104/pp.107.097097 Plant Physiology 144:623-636 (2007) © 2007 American Society of Plant Biologists OPEN ACCESS ARTICLE
Molecular Evolution of Lysin Motif-Type Receptor-Like Kinases in Plants1,[W],[OA]Division of Plant Sciences and National Center for Soybean Biotechnology (X.-C.Z., X.W., S.F., J.W., M.L., H.T.N., G.S.) and Division of Biochemistry, Department of Molecular Microbiology and Immunology (G.S.), University of Missouri, Columbia, Missouri 65211; and United States Department of Agriculture-Agricultural Research Service and Department of Agronomy, Iowa State University, Ames, Iowa 50011 (S.B.C.)
The lysin motif (LysM) domain is an ancient and ubiquitous protein module that binds peptidoglycan and structurally related molecules. A genomic survey in a large number of species spanning all kingdoms reveals that the combination of LysM and receptor kinase domains is present exclusively in plants. However, the particular biological functions and molecular evolution of this gene family remain largely unknown. We show that LysM domains in plant LysM proteins are highly diversified and that a minimum of six distinct types of LysM motifs exist in plant LysM kinase proteins and five additional types of LysM motifs exist in nonkinase plant LysM proteins. Further, motif similarities suggest that plant LysM motifs are ancient. Although phylogenetic signals are not sufficient to resolve the earliest relationships, plant LysM motifs may have arisen through common ancestry with LysM motifs in other kingdoms. Within plants, the gene family has evolved through local and segmental duplications. The family has undergone further duplication and diversification in legumes, where some LysM kinase genes function as receptors for bacterial nodulation factor. Two pairs of homeologous regions were identified in soybean (Glycine max) based on microsynteny and fluorescence in situ hybridization. Expression data show that most plant LysM kinase genes are expressed predominantly in the root and that orthologous LysM kinase genes share similar tissue expression patterns. We also examined synteny around plant LysM kinase genes to help reconstruct scenarios for the evolution of this important gene family.
The lysin motif (LysM) is an ancient protein domain originally identified in bacterial autolysin (Joris et al., 1992 (1-4)-linked GlcNAc (GlcNAc) by short peptides and a major component of the cell walls of both Gram-positive and Gram-negative bacteria. The LysM motif is usually about 40 amino acids in length (http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF01476). Despite the fact that more than 1,500 LysM proteins are identified in Pfam (http://www.sanger.ac.uk/cgi-bin/Pfam/speciesdist.pl?acc=PF01476&id=LysM&depth=all), the three-dimensional structure of the LysM motif has been determined for only two bacterial proteins, Escherichia coli membrane-bound lytic murein transglycosulase D (Bateman and Bycroft, 2000![]() ![]() ![]() secondary structure with the two -helices stacking onto one side of a plate made up of a two-stranded antiparallel -sheet.
More recently, increasing amounts of transcript and genomic sequence have allowed identification of LysM-encoding proteins in a broad range of organisms spanning all kingdoms except archaea (Bateman and Bycroft, 2000
LysM proteins in plant species have attracted increasing attention since the identification of NFR1 and NFR5 in another model legume species, Lotus japonicus (Lj; Madsen et al., 2003
Although the LysM domain is associated with a variety of protein domains across a large number of organisms spanning all kingdoms, it is intriguing that the linkage between the LysM domain and the protein kinase domain appears to occur exclusively in plants (Bateman and Bycroft, 2000
Based on the molecular phylogeny of plant kinase domains, the LYK proteins in Arabidopsis and rice were categorized into two clades, LysM-I and LysM-II (Shiu et al., 2004 Here we report comprehensive characterization of plant LysM domains and the molecular evolution and comparative genomics of the plant LYK gene family. Our data show that plant genomes harbor a minimum of 11 distinct types of LysM motifs. Plant LYK genes have duplicated locally, due to whole-genome duplications, and have subsequently functionally diversified. Plant LYK proteins fall into three major clades: two are represented by LjNFR1 and LjNFR5 and the third major clade has remained undescribed. We determined orthologous and paralogous relationships of plant LYK genes based on sequence similarities, molecular phylogenies, nucleotide substitution rates, genomic microsynteny (conserved gene content and order), and tissue expression patterns. We observed strong microsynteny in LjNFR1 and LjNFR5 orthologous regions across multiple species and dispersed microsynteny in ancestral LYK genes in more distantly related plant species.
Genome-Wide Exploration of Plant LYK Genes Six plant species, Arabidopsis, rice, M. truncatula, L. japonicus, poplar (Populus trichocarpa), and soybean, are included in this study. The genomes of the first five species are either completed or close to being completed, whereas the random shotgun sequencing of the soybean genome is under way. We used the LysM domain sequences of LjNFR1 and LjNFR5 to search the public databases of Arabidopsis, rice, poplar, M. truncatula, and L. japonicus (see "Materials and Methods"). We identified soybean LYK genes by shotgun sequencing bacterial artificial chromosomes (BACs) with homologies to LysM-encoding ESTs (see "Materials and Methods"). The resulting putative LYK protein sequences from all species were then searched against the Pfam server to verify LysM and kinase domains. Collectively, a total of 48 LYK genes were identified in the six plant genomes (Supplemental Table S1).
In comparison to the LysM proteins in other kingdoms, plant LYK proteins possess unique features: (1) the combination of LysM and kinase domains exists exclusively in the plant lineage; (2) plant LYK proteins have no more than three LysM motifs; (3) if more than two LysM motifs exist within a single plant LYK protein, they are always distinct from each other at the protein sequence level; and (4) the LysM domain sequences in plant LYK proteins are highly diversified due to different combinations of heterogenous LysM motifs. These facts led us to investigate the evolution of this fascinating plant LYK gene family and the phylogenies of these diversified LysM motifs. Based on the sequence phylogenies, LysM motifs (named LYKa, LYKb, and LYKc from the N to the C terminus) in plant LYK proteins largely fall into five clades (Fig. 1A ; Supplemental Fig. S1). This distribution of LysM motifs was found in all six plant species studied (i.e. LysM motifs from dicots and rice are clustered together in each clade, suggesting that the diversification event of plant LysM motifs predated the divergence of monocot and dicot plants).
We further investigated the LysM motifs from nonkinase plant LysM proteins, retrieving these using BLAST searches against genomic sequence databases of Arabidopsis, rice, and poplar and EST sequences of soybean (see "Materials and Methods"). Based on their subcellular localization predictions and domain arrangements, nonkinase plant LysM proteins can be further categorized into three subgroups, including LysM-type receptor-like proteins (LYPs), extracellular LysM proteins (LysMe), and nonsecretory intracellular LysM proteins (LysMn; Fig. 1B). This grouping will be helpful in understanding the nature of each LysM protein and providing insightful clues to the biological functions. As predicted by Pfam, LYP proteins have exactly two LysM motifs and LysMe and LysMn proteins have only one LysM motif. Sequence alignments show that, among the 11 types of LysM motifs, motif sequences of LysMn (the motif within LysMn proteins, LysM motif type XI), one group of LysMe (the motif within LysMe proteins, LysM motif type X), and one group of LYPb (the second motif from the N terminus within LYP proteins, LysM motif VII) are extremely conserved (Supplemental Fig. S2). In these motifs, the amino acid identities averaged across the alignments are 91% for LysMe (type X), 86% for LysMn (type XI), and 75% for LYP (type VII; Supplemental Fig. S2). LysMn motif sequences always start with a His and end with a Pro. Similarly, LYPb motif sequences always end with a Pro. LYKa motifs are seven to 10 residues shorter. To determine the phylogenetic relationships of plant LysM motifs, we calculated a majority-rule parsimony tree (Fig. 1A) using every plant LysM motif sequence (one to three LysM motif sequences from individual LysM proteins; Supplemental Table S2). Neighbor-joining and maximum-likelihood trees also showed generally similar topologies (data not shown). Plant LysM motifs can be separated into 11 early diverging clades (Fig. 1A; Supplemental Fig. 1). Most of these clades contain sequences from both monocots and dicots, indicating that the duplication events leading to these LysM proteins occurred prior to the monocot-dicot split. Domain and motif arrangements are shown in Figure 1B.
To understand the origins of plant LysM motifs, we identified LysM motif sequences (Supplemental Table S2) from nonplant species, including bacteria, fungi, insects, and animals, and calculated phylogenetic trees using majority-rule parsimony (Fig. 2 ; Supplemental Fig. S3), neighbor joining (Supplemental Fig. S4), and maximum likelihood (Supplemental Fig. S5). Consistent with our notion that plant LysM motifs are highly diversified, they can be classified into several multikingdom clades characterized by distinct motifs (Fig. 2; Supplemental Figs. S3S5). Although bootstrap support is generally low, most of the indicated clades are rooted with bacterial LysM motifs and at least two bacterial-rooted clades (XI and the clade above V) include sequences from fungi, worms, insects, plants, and animals in expected taxonomic order. This suggests that several LysM motifs may be very ancient, with common origin predating the divergence of fungal, insect, plant, and animal lineages.
Phylogenies of Plant LYK Proteins
We calculated plant LYK phylogenies using either LysM domain sequences (all LysM motifs sequences + spacer sequences) or the full protein sequence (LysM + kinase domain). Notably, the two sets of LYK phylogenies calculated using the parsimony method matched each other quite well. Therefore, only the full-length sequence trees are represented in this study. The plant LYK phylogenies calculated using parsimony (Fig. 3
), distance, and maximum-likelihood methods showed similar topologies (Supplemental Figs. S6 and S7). A parsimony tree with maximum-likelihood branch lengths is shown in Figure 3, with supporting values calculated using the parsimony method shown for supporting branches. Generally, the plant LYK phylogeny reflects species phylogeny [i.e. in an evolutionary direction of (rice, (Arabidopsis, (poplar, (legume)))) in most of the clades]. Five well-supported, distinct, multiplant family clades are evident in Figure 3 (indicated by bold horizontal lines). Three of them contain more than 10 members, whereas the other two clades consist of only three genes each. LjNFR1 and LjNFR5 fall into separate clades, consistent with previous studies (Arrighi et al., 2006
MtLYK13 (MtNFP) and GmNFR5 cluster with LjNFR5 with very high bootstrap values, consistent with their functional similarities (Arrighi et al., 2006 are considered to be LjNFR1 orthologs based on the phylogeny and functional similarities (Limpens et al., 2003
Phylogenetic relationships alone are usually not sufficient to infer orthology, but can be strengthened by information derived from genomic contexts. The most strongly conserved gene collinearity (also synteny or microsynteny) lies in LjNFR5 orthologous regions. These involve 11 blocks across the six species studied (Fig. 4A
). Less-conserved synteny was observed in the LjNFR1 orthologous regions (Fig. 4D). More degraded synteny also exists in ancestral LYK regions involving mainly two species (Fig. 4, B and C). In all these cases, the conservation was not only observed in gene content and order, but also in the gene orientations, except that MtLYK3 and MtLYK4 genes are in reverse orientations. We took advantage of the gene orientation as one important criterion to determine orthology because most LYK genes are tandemly duplicated. As reflected by LYK phylogenies, we claim the following four sets of orthology: LjNFR5 orthology, including MtLYK13 (MtNFP), GmNFR5
The most strongly conserved microsyntenies are observed between the GmNFR1
We also noticed gene duplications, either large scale or local, followed by gene diversification or gene loss. Tandem duplications of LYK genes in the NFR5 syntenic regions are present in both legume and poplar genomes, but not in rice and Arabidopsis, suggesting that this duplication predates the split of poplar and legumes (both in Rosid I). It is also clear that LjNFR5 orthologous regions are remnants of segmental (probably whole-genome) duplications. At least two scenarios concerning the fates of duplicated LYK genes are possible. In the first scenario, LjNFR5 homologs in legumes evolved a new function involved in nodulation. In the second scenario, LjNFR5 homologs in MtLYK9 and PtLYK5 blocks were lost, most likely after the large-scale duplication. Gene duplications also occurred in the LjNFR1 syntenic regions, but seem to exist exclusively in legumes. Consistent with previous results (Limpens et al., 2003 region (Fig. 4).
We calculated rate changes in user-defined parsimony phylogenies based on synonymous (dS) sites, nonsynonymous (dN) sites, and dN/dS ratios. The topologies of the three trees agree for branches under purifying selection, but vary for branches that have undergone rapid changes. Figure 6
shows the dN/dS topology with average dN/dS ratios calculated for each clade and subclade only on terminal branches. As shown in the dN/dS tree (Fig. 6), the average terminal dN/dS ratios of the LjNFR5 clade and its sister clade are slightly greater than 1. However, the average dN/dS ratio of LjNFR5 orthologs is significantly less than 1. It is also notable that, in the codon alignments for calculating dN/dS ratios, all insertions and deletions were removed except for a gap of more than 30 nucleotides that was retained to demonstrate the lack of the p loop and the activation loop in the kinase domains of LjNFR5 orthologs (Limpens et al., 2003
The LYK genes under rapid nucleotide changes can be classified into two categories: duplicated LYKs in syntenic blocks and singleton LYKs that show no or degraded synteny. In the first category, GmNFR5 , MtLYK9, LjLYK4, and PtLYK9 lie in syntenic blocks (Fig. 4). The higher rates of nucleotide changes of these genes are likely due to the relaxation of selection pressure after duplications. This is consistent with the observations that duplicated genes that are retained evolve slower than their singleton partners (after an initial evolutionary rate increase), and that evolutionary rates for duplicated genes are inversely correlated with the copy of paralogs per gene (Jordan et al., 2004
For the six plant species in this study, tissue expression levels of LYK genes were only reported for M. truncatula (Limpens et al., 2003
Comparative Genomics of Plant LYK Genes
The M. truncatula, soybean, and poplar genomes harbor larger numbers of LYK genes than the Arabidopsis and rice genomes (Fig. 3; Supplemental Table S1). That poplar would harbor large numbers of LYK genes (11) is not surprising because it has changed more slowly (and lost duplicated genes more slowly) than Arabidopsis or Medicago since their common ancestries (Tuskan et al., 2006
Tandem duplications of plant LYK genes are common in legume and poplar plants at a percentage of more than 50% per LYK-containing region. In total, 10 pairs of tandemly duplicated LYK duos were identified in rice, soybean, M. truncatula, and poplar, and one LYK trio was identified in the Lotus genome (Figs. 3 and 4). A large cluster of LYK genes (MtLYK17) was identified (Limpens et al., 2003
There are two types of kinase domains, predicted to be either active or inactive, found in plant LYK proteins. It was predicted that LjNFR1 and its close homologs have an intact, functional kinase domain, whereas LjNFR5 and its orthologs each have a short, inactive kinase domain, due to the lack of the p loop and the activation loop (Limpens et al., 2003 LYK genes have various numbers of exons, ranging from one to 13 (Fig. 3). Comparisons of gene structures suggest that ancestral LYK genes, LjNFR1 orthologs, and paralogs have 10 to 13 exons, whereas the rest of the LYKs, including LjNFR5 orthologs and paralogs, have one to five exons. All LjNFR5 orthologs, except PtLYK11, which has two exons, have an intronless gene structure. PtLYK11 is unusually small compared to PtLYK2 and its orthologs. The presence of one intron and small size may be due to incorrect annotation, resulting from the low-quality genomic sequence generated via a random whole-genome shotgun approach.
Plant LysM and LYK genes are common and important in plants, yet their biological functions, molecular evolution, and comparative genomics have not been well understood. In this study, we performed comprehensive molecular evolutionary and comparative genomic studies of the LYK gene family. The goal was to reconstruct a plausible evolutionary scenario and to generalize genomic characteristics of the plant LYK gene family. Our focus is on kinase-type LysMs rather than nonkinase types for several reasons: (1) There are limited genomic sequences available in legumes so far; (2) biological functions have not yet been assigned to most nonkinase LysM genes; and (3) the existing nonkinase LysM genes in rice, Arabidopsis, and poplar are quite divergent from LYK genes, making it difficult to integrate them together with LYK genes for evolutionary and genomic analyses.
Compared to M. truncatula, L. japonicus, rice, and Arabidopsis, soybean and poplar have been the subject of relatively few evolutionary studies due to the lack of finished genomic sequence. Use of soybean and poplar in this study, by means of cloning soybean LYK genes in a near-saturated manner and utilizing the newly completed poplar genome sequence, allowed us to expand on previous studies of plant LYKs (Arrighi et al., 2006
In this study, we adapted the nomenclature of LYK from Limpens et al. (2003)
There are several distinct types of LysM motifs in nature (Fig. 2). Although the alignments are difficult and bootstrap values are low for some branches, our data suggest that LysM motifs fall into several distinct eukaryotic clades (Fig. 2), possibly with separate bacterial origins. However, given the great evolutionary distances and short and diverse sequences, it is not possible to rule out convergent evolution of motif patterns or misinformative groupings on the basis of alignment artifacts. Plant LysM domains have evolved differently from nonplant LysM domains in several ways. If there are two LysM motifs in an individual plant LYK or LYP protein, they are heterogenous from each other. This is a phenomenon predominantly observed in plants, but not other lineages. The LysM motifs, if more than two, in an individual nonplant LysM protein are similar or almost identical. An extreme example is the nematode gene (NP_504862), which contains 12 nearly identical tandem LysM motifs. The fact that LYK genes exist exclusively in plants suggests that plants gave birth to LYK genes via an ancient de novo event. The simplest explanation is that membrane-bound LYP proteins fused with plant-specific kinase domains to become LYK proteins. LYP recognize peptidoglycan-like molecules in the extracellular space of plant cells. It is likely that LYP transduces the signals into the inner part of plant cells via an intracellular kinase domain. We calculated a tree of the topology of plant LysM motifs (Fig. 1), but cannot ascertain whether LYP occurred prior to LYK proteins in plants. However, we are able to track the duplications and losses of LYK genes in plants (Figs. 3 and 4). As predicted from the phylogenetic tree (Fig. 3), there are probably multiple ancestral LYK genes in primitive plant species. These ancestral LYKs underwent multiple rounds of duplication and speciation in descendants and more recently evolved plant species (Fig. 4), producing a significant reservoir of genetic resources for neo- and subfunctionalization. One notable success resulting from the expansion of LYKs in plant genomes is the formation of Nod factor receptors (LjNFR1 and LjNFR5 and their legume orthologs) when legumes evolved nodulation functions approximately 60 to 70 million years ago.
Striking features about the LysM domains are their diversity and complexity in almost every organism. Besides the fact that we identify a minimum of 11 distinct types of LysM motifs in plants, the numbers and combinations of LysM motifs in individual plant LysM proteins vary from one to another. This probably reflects the complexity of the ligands, which are mostly peptidoglycan-like oligosaccharides. The presence of multiple, diverse LysM domains in a single protein may increase ligand affinity and allow for a wider range of ligand binding.
As stated above, LysM domains have undergone clearly different evolutionary paths in animals and plants. These differences likely reflect the differences in ligand chemistry encountered by the LysM motif. Mulder et al. (2006)
Clearly, genomic duplications (such as polyploidy) have played an essential role in genomic expansion and functional diversification of plant LYK genes. At least two rounds of duplication, one local tandem duplication and one large scale (maybe genome wide), occurred in LjNFR5 orthologous and paralogous I regions in Rosid I plants (Fig. 4). The timing of these two duplications is therefore critical to understanding the dynamics of LYK duplications and genomic expansion. We believe that the local tandem duplication occurred before the large-scale duplication because it is improbable that LYK genes within large-scale duplicated fragments underwent tandem duplication with high similarity and stringency (head to head, low-nucleotide substitution rate, and similar expression pattern). Another piece of evidence comes from the fact that three blocks syntenic to the LjNFR5 region were identified in poplar, suggesting a scenario of one local duplication followed by two large-scale duplications. Cannon et al. (2006)
Tandem duplications were also observed in LjNFR1 orthologous regions, but in a legume-specific manner. In contrast to the LjNFR5 orthologous regions, LjNFR1 regions underwent dynamic changes after duplication. The evidence for this includes: (1) GmNFR1
LjNFR1 and LjNFR5 both have orthologs in nonlegume species (Figs. 3 and 4) that are generally under purifying selection (Fig. 6), suggesting that LjNFR1, LjNFR5, and their legume orthologs might share additional functions with their nonlegume orthologs. For example, LjNFR1, LjNFR5, and their orthologs may play an important role in arbuscular mycorrhizal (AM) symbiosis. In our study, the LjNFR5 orthologous subclade is the only informative group that does not have a member from Arabidopsis, which is a nonmycorrhizal species (Figs. 3 and 4). Although AtLYK4 lies in the LjNFR5 syntenic regions, it actually is the ortholog to the duo partners that are tandemly duplicated from LjNFR5 orthologs as judged by sequence similarity (Fig. 3) and gene orientation (Fig. 4). These data suggest that the LjNFR5 orthologous group may be involved in AM symbiosis. However, this suggestion is contradicted by the fact that mutants of LjNFR1 and LjNFR5 are not impaired in the AM symbiosis (Madsen et al., 2003
Retrieval of Plant LYK Sequences and LysM Motif Sequences from All Kingdoms Plant LysM protein sequences were first searched using the key word LysM and BLASTp (1e-20) using the LysM domains of LjNFR1 and LjNFR5 against the following databases: Arabidopsis (Arabidopsis thaliana; www.arabidopsis.org); rice (Oryza sativa; http://www.tigr.org/tdb/e2k1/osa1); poplar (Populus spp.; http://genome.jgi-psf.org/Poptr1_1/Poptr1_1.home.html); Medicago truncatula (http://www.medicago.org/genome); and Lotus japonicus (http://www.kazusa.or.jp/lotus). Domain structures of the resulting potential LysM proteins were analyzed with Pfam (www.sanger.ac.uk/software/Pfam) and Inter-ProScan (http://www.ebi.ac.uk/Inter-ProScan) to identify LysM proteins. Soybean (Glycine max) LysM proteins were searched via tBLASTn (1e-5) using the same query sequences as above against the two EST databases: The Institute for Genomic Research (http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/gimain.pl?gudb=soybean) and Monsanto (http://www.soyseq.org). Primers were designed based on the resulting soybean EST sequences to probe a six-dimensional BAC pool for LysM-containing BACs via a PCR-based approach. The probed LYK-containing BACs were verified and shotgun sequenced to either finished phase (phase 3) at the Arizona Genome Sequencing Center or prefinished phase (phase 2) at the Washington University Genome Sequencing Center. BAC sequences were annotated using the dicot species model and Arabidopsis matrix of FGENESH (www.softberry.com). Annotated proteins were similarly analyzed to screen for LYK proteins. Signal peptides and transmembrane domains were predicted using SignalP (http://www.cbs.dtu.dk/services/SignalP) using both nearest-neighbor and hidden Markov model (HMM) algorithms and transmembrane HMM (http://www.cbs.dtu.dk/services/TMHMM-2.0), respectively. The GenBank accession numbers of soybean BACs are EF533702 for GMWb098N11; EF533695 for GMWb098N15; EF533696 for GMWb026N19; EF533701 for GMWb095P01; EF533697 for GMWb035N07; EF533699 for GMWb069O12; EF533700 for GMWb080D08; and EF533698 for GMWb042I24. LysM protein sequences from species spanning all kingdoms were extracted from Pfam (http://www.sanger.ac.uk/cgi-bin/Pfam/speciesdist.pl?acc=PF01476&id=LysM&depth=all) and searched for LysM motifs at an E-value cutoff of 0.1.
Sequence alignments were performed using ClustalX 1.83 (Thompson et al., 1997 To build microsynteny maps, genomic sequences surrounding each LYK gene, about 0.5 to 0.9 Mb in length, were extracted from the above databases and from soybean BAC sequences, which are about 100 to 170 kb in length. The genomic sequences were annotated using dicot species model and Arabidopsis matrix of FGENESH for the five dicot plants and monocot species model and rice matrix for rice. The annotated protein sequences were compiled together into a peptide sequence database. Repetitive sequences were excluded from the databases. BLASTp was used to compare proteins against the database with an E-value cutoff of 1e-20 and a percent identity cutoff of 35% between species and 40% within same species and legumes. BLASTp results were then filtered once to remove retroelements. The microsynteny maps were finally drawn in Adobe Illustrator 10.0.
Soybean seeds were germinated at 30°C for 2 to 4 d. The terminal 2 to 4 cm of four to eight root tips were treated with pressurized nitrous oxide gas (Kato, 1999
BACs were labeled for FISH using optimized nick-translation reactions as previously described (Kato et al., 2004
Soybean, M. truncatula, and rice plants were grown in the greenhouse at 28°C to 30°C with a 16-h light/8-h dark cycle. Roots and vegetative tissues were sampled about 3 weeks after planting and flowers were sampled about 3 months after planting. Total RNAs were extracted using Trizol (Invitrogen) followed by Turbo DNase (Ambion) treatment to remove genomic DNA contamination. First-strand cDNAs were synthesized using Moloney murine leukemia virus reverse transcriptase (Promega). Quantitative RT-PCR was performed using a 7500 real-time PCR system (Applied Biosystems) following standard procedures. The primer sequences are listed in Supplemental Table S3. Sequence data from this article can be found in the GenBank/EMBL data libraries under accession numbers EF533695 to EF533702.
The following materials are available in the online version of this article.
We thank Dr. Peter Gresshoff, Attila Kereszt, and Arief Indrasumunar for providing GmNFR1 , GmNFR1 , GmNFR5 , and GmNFR5 genomic and cDNA sequences and Dr. John Walker for providing rice seeds. We are also grateful to Dr. Fangpu Han, Dr. Scott Jackson, and Christian Hans for helping with FISH experiments. We thank Dr. Chris Pires and Dr. Sherri Sachdev for critical reading of the manuscript. Received January 31, 2007; accepted April 6, 2007; published April 20, 2007.
1 This work was supported by the U.S. Department of Energy, Energy Biosciences Program, Office of Basic Energy Sciences (grant no. DEFG0202ER15309 to G.S.) and the Missouri Soybean Merchandising Council, and by the U.S. Department of Agriculture-Cooperative State Research, Education, and Extension Service (grant to the National Center for Soybean Biotechnology). Bacterial artificial chromosome sequencing at the Arizona Genomics Institute was supported by the National Science Foundation, Plant Genome Program (grant no. 0501877). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Xue-Cheng Zhang (zhangxuec{at}missouri.edu).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.107.097097 * Corresponding author; e-mail zhangxuec{at}missouri.edu; fax 5738849676.
Arrighi J, Barre A, Ben Amor B, Bersoult A, Soriano L, Mirabella R, Carvalho-Niebel F, Journet E, Gherardi M, Huguet T, et al (2006) The Medicago truncatula lysine motif-receptor-like kinase gene family includes NFP and new nodule-expressed genes. Plant Physiol 142: 265279 Bateman A, Bycroft M (2000) The structure of a LysM domain from E. coli membrane-bound lytic murein transglycosylase D (MltD). J Mol Biol 299: 11131119[CrossRef][Web of Science][Medline] Ben Amor B, Shaw SL, Oldroyd GED, Maillet F, Penmetsa RV, Cook D, Long SR, Denarie J, Gough C (2003) The NFP locus of Medicago truncatula controls an early step of Nod factor signal transduction upstream of a rapid calcium flux and root hair deformation. Plant J 34: 495506[CrossRef][Web of Science][Medline] Bielnicki J, Devedjiev Y, Derewenda U, Dauter Z, Joachimiak A, Derewenda ZS (2006) B. subtilis ykuD protein at 2.0 A resolution: insights into the structure and function of a novel, ubiquitous family of bacterial enzymes. Proteins 62: 144151[CrossRef][Web of Science][Medline] Birkeland NK (1994) Cloning, molecular characterization, and expression of the genes encoding the lytic functions of lactococcal bacteriophage phi-Lc3a dual lysis system of modular design. Can J Microbiol 40: 658665[Web of Science][Medline] Cannon SB, Sterck L, Rombauts S, Sato S, Cheung F, Gouzy J, Wang X, Mudge J, Vasdewani J, Scheix T, et al (2006) Legume genome evolution viewed through the Medicago truncatula and Lotus japonicus genomes. Proc Natl Acad Sci USA 103: 1495914964 Clamp M, Cuff J, Searle SM, Barton GJ (2004) The Jalview Java alignment editor. Bioinformatics 20: 426427 Eddy SR (1998) Profile hidden Markov models. Bioinformatics 14: 755763 Felsenstein J (2000) PHYLIP (Phylogeny Inference Package), Ed 3.6. University of Washington, Seattle Frankel G, Lider O, Hershkoviz R, Mould AP, Kachalsky SG, Candy DCA, Cahalon L, Humphries MJ, Dougan G (1996) The cell-binding domain of intimin from enteropathogenic Escherichia coli binds to beta integrins. J Biol Chem 271: 2035920364 Jerse AE, Yu J, Tall BD, Kaper JB (1990) A genetic locus of enteropathogenic Escherichia coli necessary fro the production of attaching and effecting lesions on tissue culture cells. Proc Natl Acad Sci USA 87: 78397843 Jordan K, Wolf Y, Koonin EV (2004) Duplicated genes evolve slower than singletons despite the initial rate increase. BMC Evol Biol 4: 22[CrossRef][Medline] Joris B, Englebert S, Chu CP, Kariyama R, Daneomoore L, Shockman GD, Ghuysen JM (1992) Modular design of the Enterococcus hirae muramidase-2 and Streptococcus faecalis autolysin. FEMS Microbiol Lett 70: 257264[Medline] Kato A (1999) Air drying method using nitrous oxide for chromosome counting in maize. Biotech Histochem 74: 160166[Web of Science][Medline] Kato A, Lamb JC, Birchler JA (2004) Chromosome painting using repetitive DNA sequences as probes for somatic chromosome identification in maize. Proc Natl Acad Sci USA 101: 1355413559 Limpens E, Franken C, Smit P, Willemse J, Bisseling T, Geurts R (2003) LysM domain receptor kinases regulating rhizobial Nod factor-induced infection. Science 302: 630633 Madsen EB, Madsen LH, Radutoiu S, Olbryt M, Rakwalska M, Szczyglowski K, Sato S, Kaneko T, Tabata S, Sandal N, et al (2003) A receptor kinase gene of the LysM type is involved in legume perception of rhizobial signals. Nature 425: 637640[CrossRef][Medline] Michelmore RW, Meyers BC (1998) Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process. Genome Res 8: 11131130 Mulder L, Lefebvre B, Cullimore J, Imberty A (2006) LysM domains of Medicago truncatula NFP protein involved in Nod factor perception: glycosylation state, molecular modeling and docking of chitooligosaccharides and Nod factors. Glycobiology 16: 801809 Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO (1999) Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci USA 96: 42854288 Pfeil BE, Schlueter JA, Shoemaker RC, Doyle JJ (2005) Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families. Syst Biol 54: 441454 Ponting CP, Aravind L, Schultz J, Bork P, Koonin EV (1999) Eukaryotic signalling domain homologues in archaea and bacteria: ancient ancestry and horizontal gene transfer. J Mol Biol 289: 729745[CrossRef][Web of Science][Medline] Radutoiu S, Madsen LH, Madsen EB, Felle HH, Umehara Y, Gronlund M, Sato S, Nakamura Y, Tabata S, Sandal N, et al (2003) Plant recognition of symbiotic bacteria requires two LysM receptor-like kinases. Nature 425: 585592[CrossRef][Medline] Schlueter JA, Dixon P, Granger C, Grant D, Clark L, Doyle JJ, Shoemaker RC (2004) Mining EST databases to resolve evolutionary events in major crop species. Genome 47: 868876[Medline] Schmidt HA, Strimmer K, Vingron M, von Haeseler A (2002) TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18: 502504 Shiu SH, Karlowski WM, Pan RS, Tzeng YH, Mayer KFX, Li WH (2004) Comparative analysis of the receptor-like kinase family in Arabidopsis and rice. Plant Cell 16: 12201234 Stacey G, Libault M, Brechenmacher L, Wan JR, May GD (2006) Genetics and functional genomics of legume nodulation. Curr Opin Plant Biol 9: 110121[CrossRef][Web of Science][Medline] Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The Clustal-X windows interfaceflexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 48764882 Tuskan GA, DiFazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313: 15961604 Yang Z (1997) PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci 13: 555556 Zhu HY, Riely BK, Burns NJ, Ane JM (2006) Tracing nonlegume orthologs of legume genes required for nodulation and arbuscular mycorrhizal symbioses. Genetics 172: 24912499 Zmasek CM, Eddy SR (2001) ATV: display and manipulation of annotated phylogenetic trees. Bioinformatics 17: 383384 This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|