|
|
||||||||
|
Plant Physiology 134:43-58 (2004) © 2004 American Society of Plant Biologists Calcium Sensors and Their Interacting Protein Kinases: Genomics of the Arabidopsis and Rice CBL-CIPK Signaling Networks1,[w]Abteilung Pflanzenphysiologie, Universität Rostock, Albert-Einstein-Strasse 3, 18051 Rostock, Germany (Ü.K.); Molekulare Botanik, Universität Ulm, Albert-Einstein-Allee 11, 89069 Ulm, Germany (S.W., D.B., O.B., J.K.); and Institut für Botanik, Universität Münster, Schlossgarten 3, 48149 Münster, Germany (J.K.)
Calcium signals mediate a multitude of plant responses to external stimuli and regulate a wide range of physiological processes. Calcium-binding proteins, like calcineurin B-like (CBL) proteins, represent important relays in plant calcium signaling. These proteins form a complex network with their target kinases being the CBL-interacting protein kinases (CIPKs). Here, we present a comparative genomics analysis of the full complement of CBLs and CIPKs in Arabidopsis and rice (Oryza sativa). We confirm the expression and transcript composition of the 10 CBLs and 25 CIPKs encoded in the Arabidopsis genome. Our identification of 10 CBLs and 30 CIPKs from rice indicates a similar complexity of this signaling network in both species. An analysis of the genomic evolution suggests that the extant number of gene family members largely results from segmental duplications. A phylogenetic comparison of protein sequences and intron positions indicates an early diversification of separate branches within both gene families. These branches may represent proteins with different functions. Protein interaction analyses and expression studies of closely related family members suggest that even recently duplicated representatives may fulfill different functions. This work provides a basis for a defined further functional dissection of this important plant-specific signaling system.
All organisms use a network of signal transduction pathways to cope with their environment, to control their metabolism, and to realize their developmental programs. Calcium has emerged as an ubiquitous second messenger involved in many of these processes (Gilroy and Trewavas, 2001
In the case of plant calcium-dependent protein kinases (CDPKs), which harbor a calcium-binding domain as well as a catalytic Ser/Thr kinase domain, calcium signals can be sensed and transmitted by a single protein (Harmon et al., 2000
To date, a physiological function has been established only for a few CBL and CIPK proteins. The CBL calcium sensor SOS3 (AtCBL4) and the CIPK-type kinase SOS2 (AtCIPK24) have been identified in genetic screens. They appear to be part of a calcium-regulated signaling pathway that specifically mediates salt stress signaling and adaptation (Liu and Zhu, 1998
The biochemical analyses of CIPKs revealed that both deletion of the CBL-interaction domain and substitution of a critical Thr in the putative activation loop by Asp result in a constitutively hyperactive kinase (Guo et al., 2001 In this study, we present a systematic genome-wide analysis of the Arabidopsis and rice (Oryza sativa) CBL and CIPK genes and encoded proteins. Our data provide a framework for future functional dissection of this calcium-regulated plant signal transduction system in both model plants.
The Complement of CBLs and CIPKs Encoded in the Arabidopsis Genome
In our previous studies, we identified and analyzed six CBLs and 15 CIPKs from Arabidopsis (Kudla et al., 1999
We performed a systematic reverse transcriptase (RT)-PCR-based cDNA cloning of all potentially functional CBL and CIPK genes to verify their active expression and to determine the structure of the encoded mRNAs. Our cDNA cloning approach revealed that all genes are transcribed and in several cases led to a correction of the theoretical gene predictions. These data have been incorporated in public databases, and the corresponding nucleotide and protein identification numbers are depicted in Tables I and II. Our comparison of the genomic loci with their encoded cDNAs revealed that almost all identified Arabidopsis CBL genes harbor six or seven introns in their coding region (Fig. 1A). In the case of AtCBL10, an additional intron appears in its N-terminal coding region. This intron lacks any homolog in other AtCBLs. Four introns are absolutely conserved in phase and position in all 10 CBL genes. Three other introns are structurally conserved, but are absent in either one or two members of the CBL gene family, most likely due to intron loss events during evolution. Interestingly, the AtCBL7 gene, which is located in tandem orientation with AtCBL3 in the Arabidopsis genome (see below), contains an intron in a position not found in any other CBL locus. Remarkably, all CBL genes for which the full-length cDNA sequences have been determined (AtCBL1-4 and AtCBL9) harbor introns in their 5'-untranslated region (UTR); one in AtCBLs1 and 9, and two in AtCBLs2-4. Whether this unusual intron composition potentially reflects regulatory functions during gene expression remains to be investigated experimentally.
In contrast to the CBL genes, only eight of 25 AtCIPK genes harbor multiple intron sequences (Fig. 1B). Again, phase and position of these introns, if present, appear to be rather conserved. The position of seven introns is absolutely invariant. The other six introns are only absent in one to three intron-containing members of the CIPK gene family. One exception is represented by an additional intron in the AtCIPK23 gene, which is not shared by any other CIPK gene. Moreover, the AtCIPK16 gene harbors only a single intron not found in other CIPK genes. This might indicate a recent intron acquisition in this gene. In addition, we identified a single intron in the 5'-UTR of AtCIPK15, which otherwise does not harbor intron sequences in its coding region. Taken together, these findings suggest the preservation of a relatively constant exon-intron composition of the CBL and CIPK genes during the evolution of the Arabidopsis genome. In addition, we assayed the Arabidopsis CBLs and CIPKs for the occurrence of alternative mRNA splicing. To this end, the cDNA sequences obtained during this study were compared with all available full-length cDNA and expressed sequence tag (EST) sequence data. These investigations identified alternatively spliced mRNAs for AtCBL2, AtCBL4, and AtCBL10 as well as for AtCIPK3 and AtCIPK9. All alternatively spliced cDNA sequences have been deposited in the respective GenBank entries. The nucleotide identification numbers of the analyzed ESTs and cDNA sequences are listed in Supplemental Table SI. The differential splicing events of the AtCBL10 and AtCIPK3 transcripts are exemplarily depicted in Figure 2. In the case of AtCBL10, the processing of an alternative exon leads to two potential protein isoforms with different N-terminal sequences (AtCBL10-1, MEQVSSRSSSLT...; AtCBL10-2, MTTGRPNNILALKISTRSSSLT...). The differential splicing of CBL2 and CBL4 transcripts affects the composition of their mature 5'-UTR. Interestingly, one spliced intron in the 5'-UTR of AtCBL4/SOS3 encompasses 3.7 kb of genomic sequence, a fact that needs to be considered when attempting to study expression of this gene by the analyses of promoter-reporter gene constructs in transgenic plants. For At-CIPK3, we detected three alternatively spliced forms, which differ in the C-terminal part of the protein. An alignment of these three protein sequences is depicted in Supplemental Fig. S1. These isoforms arose by differential splicing of intron 11 within the otherwise completely processed full-length mRNAs. In isoform AtCIPK3-2, the recognition of an alternative splice site leads to an RNA lacking parts of exon 12 and the complete exons 13 and 14. Therefore, the potentially specified AtCIPK3-2 protein would suffer a truncation of 69 amino acids. In the AtCIPK3-3 isoform, retaining of intron 11 results in an in-frame stop codon within this intron sequence. We identified three differentially spliced transcript forms of AtCIPK9, which are generated by alternative splicing of introns 7 and 13. Compared with AtCIPK9-1, the resulting variants would encode proteins either lacking two amino acids (At-CIPK9-2) or harboring four additional amino acids (AtCIPK9-3). Only a relative low number of full-length ESTs or cDNAs have been available for our analyses of potential alternative splicing events. This limits the statistical reliability of any prediction about the ratios of certain transcripts in planta. Nevertheless, the only four cDNAs identified for AtCBL10 represent two alternatively spliced forms (AtCBL10-1, two cDNAs; AtCBL10-2, two cDNAs), and the six identified AtCIPK3 cDNAs represent three isoforms (AtCIPK3-1, three cDNAs; AtCIPK3-2, one cDNA; AtCIPK3-3, two cDNAs). The rather even distribution of differentially spliced mRNAs argues against a fortuitous detection of "mis-spliced" RNAs and makes it more likely that these alternatively spliced mRNAs are functional. Currently, it remains to be established how widespread the occurrence of alternative splicing is within the CBL and CIPK families and what the functional significance of the resulting protein isoforms could be. However, our identification of multiple differentially spliced isoforms is likely to add an additional level of complexity to the regulation of this signaling system.
Analyses of the almost completely sequenced Arabidopsis genome have revealed that segmental duplications of chromosomal regions during evolution have significantly shaped the current structure of this genome (Grant et al., 2000
To uncover locus duplication events, we performed whole-genome sequence comparisons of all CBLs and CIPKs covering 100 kb of additional sequence surrounding each gene (see "Materials and Methods"). For the CBLs, these analyses identified three interchromosomal translocations resulting in two duplicated CBL genes (Fig. 3A; Supplemental Fig. S2). Two independent duplications affecting different parts of chromosomes IV and V led to the formation of the AtCBL1/AtCBL9 and AtCBL2/AtCBL3 gene pairs, respectively (Supplemental Fig. S2). In contrast, a duplication affecting regions of chromosomes IV and I was followed by loss of one CBL gene copy (Supplemental Fig. S2). Therefore, the AtCBL8 gene, whose ancestor has been involved in this rearrangement, at present represents a singular CBL lacking a closely related counterpart. We also identified a single local tandem duplication leading to the formation of the AtCBL3/AtCBL7 gene pair. Interestingly, the two gene pairs AtCBL9 and AtCBL3 that arose by segmental duplications represent the most closely related CBL pairs. They exhibit a significantly higher degree of conservation than the AtCBL7 pair encoded by tandemly oriented genes. While both the AtCBL1/AtCBL9 and the AtCBL2/AtCBL3 protein pairs share 89% and 92% identical amino acid residues, AtCBL3 and AtCBL7 exhibit only 60% sequence identity (Supplemental Table SII). This might point to a relatively ancient origin of this tandemly oriented gene pair as compared with the identified segmental duplications.
Our whole-genome sequence comparison for the 25 CIPK loci uncovered altogether eight segmental duplication events (Fig. 3B; Supplemental Fig. S3). Six of these chromosomal rearrangements led to the presence of pairs of closely related CIPKs in the genome, whereas two loci duplications were followed by subsequent deletions of the affected kinases during the evolution of the Arabidopsis genome. In addition, we identified two CIPK gene pairs (AtCIPK14/AtCIPK15 and AtCIPK19/AtCIPK20), which arose by tandem duplications. As observed for the CBLs, all segmentally duplicated CIPK gene pairs (AtCIPK1/AtCIPK17, AtCIPK2/AtCIPK10, AtCIPK4/AtCIPK7, AtCIPK5/AtCIPK25, AtCIPK12/AtCIPK19, and AtCIPK13/AtCIPK18) encode the most closely conserved protein pairs, whereas proteins encoded by tandemly oriented genes (AtCIPK14/AtCIPK15 and AtCIPK19/AtCIPK20) exhibit a lower degree of conservation (Supplemental Table SIV). Taken together, our analysis of the chromosomal evolution of the CBL and CIPK gene families indicates that segmental duplications have predominantly contributed to the current complexity of both Arabidopsis gene families. Only two of the 10 Arabidopsis CBL genes and four of the 25 AtCIPKs are encoded in tandem orientation. In contrast, four AtCBLs and 12 AtCIPKs were identified as segmentally duplicated loci, indicating the importance of this evolutionary mechanism for the growth and diversification of these gene families. A recent genome-wide analysis of NBS-leurin rich repeat (LRR) genes in Arabidopsis (Meyers et al., 2003 To extend our studies on CBLs and CIPKs beyond Arabidopsis, we searched the publicly available EST (and genomic) databases for homologous sequences in other organisms. For CBLs, the applied criteria included a significantly higher similarity with the Arabidopsis CBLs than with CaM as well as the CBL-specific spacing of the EF hand motifs. Homologous CIPK-type kinases from other species were defined by both the similarity with the AtCIPKs and the presence of the NAF domain. Although these analyses systematically excluded partial ESTs covering only N-terminal parts of the encoded proteins, we nevertheless identified ESTs encoding CBLs and CIPKs in plant species like Medicago truncatula (nine different CBLs and 11 distinct CIPKs), wheat (Triticum aestivum; 11 CBLs and 29 CIPKs), barley (Hordeum vulgare; nine CBLs and 14 CIPKs), soybean (Glycine max; seven CBLs and 13 CIPKs), the gymnosperm Pinus sp. (two CBLs and seven CIPKs), and the moss Physcomitrella patens (four CBLs and three CIPKs). The accession numbers of the identified ESTs are displayed in the Supplemental Table SIb. The occurrence of multiple isoforms of CBLs and CIPKs in all analyzed plant species indicates that these proteins generally form a complex signaling network. We did not identify EST or genomic sequences encoding CBLs or CIPKs outside the plant kingdom, suggesting that the function of these signaling components is confined to plants but arose early in plant evolution.
We next investigated the occurrence of CBLs and CIPKs in the extensively sequenced genome of the monocotyledonous plant rice. To this end, we separately analyzed the available sequence data for the two rice subspecies O. sativa subsp. japonica and O. sativa subsp. indica (for details see "Materials and Methods"). In both genomes, we identified the same set of 10 calcium sensor proteins and 30 interacting kinases. The general features of the rice CBLs and CIPKs are summarized in Tables III and IV. The available sequences of both rice species do not cover their complete genomes and are only partially assembled. Nevertheless, the consistency of identified CBL and CIPK genes suggests that we uncovered the full complement of both gene families in rice. The rice genome is not only significantly larger than the Arabidopsis genome but also predicted to encode a greater number of genes (32,00055,000 as compared with 27,300 in Arabidopsis; Goff et al., 2002
Comparison of the exon-intron composition of Arabidopsis and rice genes revealed a rather high degree of structural conservation (data not shown). The rice genes OsCBL1 to 7 all harbor seven introns, which are absolutely invariant in phase and position when compared with the respective conserved seven introns present in six of the 10 AtCBLs. This situation suggests that the single intron losses observed in AtCBL3, 5, 6, and 7 occurred after the divergence of both evolutionary lineages. These seven introns are conserved in OsCBL9 and OsCBL10. Interestingly, these two genes share the additional first intron within their N-terminal extension with their closely related counterpart AtCBL10. As observed for the Arabidopsis CIPKs, the members of the rice CIPK gene family can also be divided into intron-less (eight OsCIPKs) and intron-harboring (22 OsCIPKs) representatives (Table IV). The latter exhibit a high degree of intron conservation with their relative Arabidopsis gene family members and harbor 11 to 13 introns, which are all conserved in phase and position. Gene structures with 11 or 12 introns remaining can clearly be attributed to single or double intron loss events. For example, OsCIPK1 and OsCIPK21 have lost just one of these conserved introns (homologous to intron 8 in AtCIPK8). In our analysis, we identified the two genes OsCIPK14 and OsCIPK15 as located on different chromosomes in the both rice subspecies. Both predicted proteins are almost identical and differ only by a single amino acid substitution and in their C-terminal nine amino acids. Therefore, this CIPK pair might represent an example for a very recent duplication event in this species. Our genomic analyses of rice CBL and CIPK genes did not detect any tandemly duplicated OsCBL genes. Of the 30 OsCIPKs, three gene pairs appeared to be organized in tandem orientation (OsCIPK5/13, OsCIPK12/30, and OsCIPK2/29). Moreover, these gene pairs are only relatively distantly related, suggesting a rather ancient origin of the duplications (see below). Therefore, despite the larger genome size of rice as compared with Arabidopsis, microscale recombination events do not seem to have significantly affected the evolution of both rice gene families. To further elucidate the relationships of CBLs and CIPKs, we performed phylogenetic analyses to infer clustering patterns reflecting sequence similarity and evolutionary ancestry. Plant CaM as well as animal CNB and NCS were included as outgroups in the analysis of the calcium sensor proteins. The resulting unrooted phylogenetic tree is depicted in Figure 4. (To maintain clarity, this tree does not present bootstrap values. A cladogram displaying all bootstrap values is presented in Supplemental Fig. S4.) Plant CBLs clearly form a monophyletic group separated from CaMs and calcium sensor proteins from animals. Moreover, Arabidopsis CBL pairs (AtCBL1/AtCBL9 and AtCBL2/AtCBL3) resulting from segmental duplications of chromosomal regions appear to be most closely related, again pointing to a relatively recent formation of these gene pairs. In contrast, the CBL pair AtCBL3/AtCBL7, which evolved by tandem duplication, is far less related.
The evolutionary comparison of rice and Arabidopsis CBL genes clearly points to the occurrence of multiple species-specific duplication events. Of course, these duplications could also be diagnostic for larger evolutionarily related groups, like monocotyledonous and dicotyledonous species. However, the lack of additional plant genome sequences currently prevents a further investigation of this issue. This situation will most likely impede a precise prediction of common CBL gene functions based solely on sequence information from rice and Arabidopsis. For example, the two closely related calcium sensor proteins pairs AtCBL2/3 and OsCBL2/3 result from duplication events that occurred independently in both species analyzed. OsCBL9 and 10 are clearly related to AtCBL10 by both sequence similarity and intron composition. However, it is currently impossible to address whether the rice-specific duplication leading to OsCBL9 and 10 could result in functional overlaps or, alternatively, reflects a functional specialization. Likewise, the single OsCBL1 gene is most closely related to the duplicated AtCBL1 and AtCBL9 genes, which probably fulfill different functions in Arabidopsis (see below). Nevertheless, in combination with additional information, such as CBL-CIPK interaction specificity and gene expression data, knowledge of these evolutionary relations will help to design reverse genetic strategies to unravel CBL gene functions. Interestingly, the distribution of myristoylated Arabidopsis and rice CBL proteins coincides with two neighboring branches in the dendrogram. The only predicted non-myristoylatable calcium sensor in these branches (AtCBL8) still harbors a recognizable degenerated myristoylation consensus sequence. This might suggest a single early event during the evolution of the CBL genes leading to the loss of this lipid modification in the remaining CBL lineages. Such a diversification would have enabled the separate evolution of membrane-associated and membrane-independent calcium-signaling pathways. Plant SnRK1 and SnRK2 kinases were included as outgroups in the phylogenetic analysis of the Arabidopsis and rice CIPKs. The resulting unrooted phylogenetic tree is depicted in Figure 5. (To maintain clarity this tree does not present bootstrap values. A cladogram displaying all bootstrap values is presented in Supplemental Fig. S5.) This dendrogram clearly indicates a monophyletic origin of the complex family of plant CIPKs. Interestingly, members of both the rice and the Arabidopsis CIPKs form two distinct subgroups based on their sequence similarity. This clustering of CIPKs strictly coincides with the presence (or absence) of introns in the genes of these subgroups, indicating that the ancestors of both groups have been present in the genome before the evolutionary separation of the mono- and dicotyledonous lineages. Moreover, several groups of closely related Arabidopsis and rice CIPKs form well-supported separate branches. This might suggest a relatively early diversification of the CBL and CIPK gene families during plant evolution.
The resolution of evolutionary relationships between individual CIPK proteins within these branches varies significantly. For example, the rice and Arabidopsis representatives of CIPK8, 9, 23, and 24 are clearly recognizable as distinct orthologous gene pairs. This might allow for prediction of similar functions of their gene products in both species. In contrast, in both the CIPK4/7 and the CIPK1/17/21 branches, our analysis indicates parallel species-specific duplication events. This prevents an inter-specific gene-to-gene assignment of distinct CIPKs. However, it appears noteworthy that in rice and Arabidopsis, the evolution of these branches followed the same pattern resulting in the closely related gene groups CIPK4/7 and CIPK1/17/21 in both species. The slightly increased number of CIPKs in rice compared with Arabidopsis can be largely attributed to rice-specific duplications in the evolutionary branch encompassing AtCIPK2, 6, 15, and 20 and the respective rice homologs. Although this branch just represents four CIPKs from Arabidopsis, it encompasses 12 members of the rice kinase family. It needs to be analyzed if the amplification of this rice CIPK subgroup just results in functional overlaps or instead points to an evolutionary specialization. Finally, our data indicate that the complexity of this signaling network is not restricted to Arabidopsis and therefore is likely to reflect functional diversification.
The CBL protein family appears to be rather conserved in size and structure. Most of the genes encode predicted polypeptides ranging from 23.5 to 26 kD in Arabidopsis and 23.9 to 25.9 kD in rice (Tables I and III). The only noted exceptions are represented by AtCBL10 and the recently duplicated OsCBL9 and OsCBL10, which, due to an N-terminal extension, have predicted molecular masses of 31.7, 32.9 and 29.9 kD, respectively. Although the N termini of AtCBL10 and OsCBL9/10 do not harbor any discernible sequence or signal motif, it appears conceivable that this domain is of functional relevance. The amino acid sequence identity of different CBLs ranges from 29% to 92% (33%93% similarity) in Arabidopsis and from 40% to 92% (44%94% similarity) in rice (Supplemental Tables SII and SIII). In Arabidopsis, the most closely related proteins AtCBL1 and AtCBL9 as well as AtCBL2 and AtCBL3 share 89% and 92% identical (91% and 93% similar) amino acid residues, respectively. Even the most divergent protein pair AtCBL5 and AtCBL10 exhibits 29% sequence identity (33% similarity). The rather conserved structure of these calcium sensor proteins might point to a very similar mode of action and likely reflects their conserved interaction with the CIPKs as their common target proteins.
Almost all known animal and yeast calcium sensor proteins of the NCS and calcineurin type are modified by the 14-carbon saturated fatty acid myristate. Covalent linkage of myristate via an amide bond to the Gly in the N-terminal sequence MGXXXS/T is catalyzed by N-myristoyl transferase and occurs cotranslationally after removal of the initiator Met by N-aminopeptidase. In many systems, N-myristoylation promotes protein-protein or protein-membrane interactions (Resh, 1999
Calcium-binding proteins contain EF hand motifs as structural basis for calcium binding. Each EF hand consists of a loop of 12 amino acids flanked by two Our comparative analyses of the EF hand composition in the Arabidopsis CBL protein family revealed extensive similarities to the NCS-type calcium sensors. All 10 AtCBL proteins harbor four EF hand motifs (Fig. 6A). Interestingly, the size of the linker regions between the EF hand loops is absolutely conserved in all proteins and appears to be unique to this family of calcium sensor proteins. EF1 and EF2 are 22 amino acids apart, whereas 25 amino acids separate EF2 from EF3, and 32 amino acids are inserted between EF3 and EF4. The number of EF hands as well as their spacing is also absolutely conserved in all predicted OsCBL proteins (data not shown). Size variation of CBL proteins is therefore exclusively caused by extension or reduction of the N- and C-terminal regions (Fig. 6A). This might suggest a conserved three-dimensional structure of all plant CBLs. Also, like in NCS proteins, the first EF hand, although conserved in all CBLs, appears to be unable to bind Ca2+ ions. This is caused by either single-amino acid substitutions at critical calcium-binding positions (either positions X, Y, and Z or -Y, -X, and -Z; for the Arabidopsis representatives, see Fig. 6B) or larger structural changes (Fig. 6B). In most cases, the oxygen-containing calcium-binding residues at the Y and/or X positions in this EF loop are altered to Ala, Val, Ile, or Cys (as observed in EF1 of GCAP proteins). In addition, in AtCBL5, the essential amino acid at the -Z position is replaced by the positively charged residue Lys. Both AtCBL6 and AtCBL7 display structural changes in the first EF hand. In EF1, four amino acids of the canonical loop in AtCBL6 appear to be deleted. In AtCBL7, EF1 harbors five additional amino acids between the fourth and fifth amino acids (Z position) of the loop, thereby dividing the EF hand motif. Furthermore, several amino acids have been deleted from the entering helix, which could lead to a protein with a different structural conformation in the region of EF1.
AtCBL6, AtCBL7, and AtCBL8 also contain additional mutations in other EF hands (Fig. 6B). In the EF3 loop of AtCBL8, the normally conserved Y residue is changed to the non-oxygen-containing amino acid His. In AtCBL6, the conserved Glu at the -Z position of EF2 is altered to Ala, and at the same position in the EF3 loop of AtCBL7, the calcium-binding residue is changed to Gly. Both mutations could lead to a change in the loop conformation and therefore to a markedly reduced affinity for calcium (Lewit-Bentley and Réty, 2000 Interestingly, CBL proteins resulting from gene duplications (AtCBL1 and AtCBL9) harbor an absolutely identical amino acid composition in the EF hand loops 1 and 2. Because EF hands in animal NCSes and CNBs form functional pairs, it appears possible that the N-terminal parts of AtCBL1 and 9 fulfill identical functions (e.g. protein-protein interaction via EF1). Contrarily, the two EF hands at the C-terminal ends of theses proteins have diverged during evolution, resulting in EF motifs with potentially distinct calcium-binding affinities. Therefore, the differences in the last two EF hand loops could lead to a different affinity toward calcium ions and consequently could convey different functions of the highly related calcium sensors AtCBL1 and AtCBL9.
Like the CBLs, the 25 AtCIPK and the predicted 30 OsCIPK proteins display a high degree of conservation. In Arabidopsis, the proteins exhibit an overall amino acid sequence identity ranging from 33% to 78%. The sizes of the predicted proteins vary from 46 to 59 kD (Table II; Supplemental Table SIV). Pair-wise amino acid alignments of the highly conserved N-terminal kinase domain revealed 51% to 90% sequence identity (63%96% similarity). In contrast, the C-terminal non-catalytic region appears to be much less conserved, exhibiting an amino acid identity of 24% to 58% (36%69% similarity). The only exception is represented by the 24-amino acid NAF domain (58%86% identity; 66%94% similarity). A similar degree of conservation in size and structure is also typical for the rice CIPKs (Table IV; Supplemental Table). All CIPK proteins harbor the 11 conserved subdomains typical for Ser/Thr kinases. Biochemical analyses of AtCIPKs expressed in bacteria revealed a rather low enzymatic activity of the recombinant protein (Shi et al., 1999 To investigate the potential localization and modification of the CIPKs, we performed bioinformatics analyses using the PSORT and the MOTIF algorithms. None of the CIPKs appears to harbor any conserved lipid modification (e.g. myristoylation, palmitoylation, and farnesylation) motif or displays any detectable unambiguous cellular sorting sequence. Therefore, the determination of the sub-cellular targeting of the CIPKs will require experimental confirmation for each of these kinases. Nevertheless, a combination of CBL localization data together with information about the specificity of CBL-CIPK complex formation should allow experimentally testable predictions about the sub-cellular targeting of specific CIPKs.
Our previous yeast two hybrid analyses on a subset of Arabidopsis CBLs and CIPKs have established that specific CBL-CIPK complex formation contributes to generating specificity in this signaling network and that this differential calcium sensor/kinase affinity is also reflected in in vitro experiments with purified proteins (Albrecht et al., 2001
Interestingly, the observed preferential complex formation does not appear to coincide with the phylogenetic relationships of the AtCIPKs, because, for example, members of the evolutionary separated branches of intron-harboring (AtCIPK1, AtCIPK8, AtCIPK17, and AtCIPK24) as well as intron-free CIPKs (AtCIPK7 and AtCIPK18) interact with AtCBL1. In addition, of the closely related duplicated CIPK pairs, only AtCIPK1/AtCIPK17 exhibited a similar affinity toward AtCBL1, whereas, for example, the pairs AtCIPK4/AtCIPK7 and AtCIPK13/AtCIPK18 displayed a rather different interaction profile. These data indicate, that sequence similarity and evolutionary history are not sufficient to predict CBL-CIPK interactions. Therefore, the exact structural features determining the specificity of complex formation remain to be uncovered. In addition, these findings might suggest that even a high conservation of certain CBL or CIPK family members does not necessarily indicate functional redundancy. In our analyses, AtCBL9 displayed strong interaction with six different CIPKs and some degree of interaction with additional six CIPKs (Fig. 7). Of the strongly interacting kinases, only a subset of four CIPKs (AtCIPK1, 8, 18, and 24) exhibited a similar affinity toward both CBL proteins. In contrast, AtCIPK7 and AtCIPK17 preferentially interact with AtCBL1, whereas AtCIPK21 and AtCIPK23 appear to interact more efficiently with AtCBL9. Although the interaction observed in yeast does not necessarily mean that these proteins also interact in planta, these data nevertheless indicate that even if several kinases are expressed simultaneously in a single cell, differential interaction affinity can facilitate an efficient signal channeling. Moreover, despite the high similarity of AtCBL1 and AtCBL9, our data suggest that both calcium sensor proteins could target specific kinases with different efficiency. Kinases that are expressed in the same cells and interact equally strongly with both CBL proteins may potentially represent signaling nodes for different stimuli received by the two CBL proteins.
To further address this aspect, we comparatively analyzed the expression patterns of the closely related AtCBL1 and AtCBL9 genes in response to different environmental cues. Two-week-old seedlings were challenged with cold, drought, or salt stress, and RNA was extracted from samples harvested at the indicated time points (Fig. 8). Semiquantitative RT-PCR analyses were performed with gene-specific primers to determine the expression levels of AtCBL1, AtCBL9, and, as a constitutively expressed control, actin2. The expression of the stress-induced RD29A gene was analyzed as a control for the effectiveness of the performed treatments. These analyses revealed that whereas AtCBL1 gene expression was strongly induced under these conditions, AtCBL9 expression did not respond to these stimuli, providing additional evidence for different functions of the two proteins. This assumption is further corroborated by the recent finding that loss of AtCBL1 gene function impairs the ability of the affected mutant plants to respond to drought, cold, and salt stress (Albrecht et al., 2003
Taken together, our data suggest that functional redundancy might only occur to a limited extent within the CBL-CIPK signaling network. Therefore, its components should be amenable to functional analyses by reverse genetic approaches using T-DNA or transposon-induced knock-out lines. Our genomic and bioinformatics analyses of CBL and CIPK genes and proteins presented in this work can provide an important foundation for the further functional dissection of these important plant-specific signaling pathways.
Database Searches and Computational Analyses of CBL and CIPK Sequences and Genomic Loci
The complement CBL and CIPK sequences encoded in the Arabidopsis genome were identified by sequence comparisons of previously identified AtCBLs and AtCIPKs with entries in the GenBank database (http://www.ncbi.nlm.nih.gov) and the Arabidopsis MIPS database (http://mips.gsf.de/proj/thal/db/index.html) using BLASTP and TBLASTN and the "motif" algorithms (Altschul et al., 1990
Deduced protein sequences were aligned with ClustalX (Thompson et al., 1997
To identify conserved protein modification motifs and targeting sequences, the CBL and CIPK proteins were analyzed with the PROSITE (Falquet at al., 2002
The identification and cloning of AtCBL1 to 6 and AtCIPK1 to 15 has been described previously (Albrecht et al., 2001
The position of the EF hands within the CBL proteins was determined by comparing the complete sequence of all AtCBL proteins with the EF consensus sequence, available at PROSITE (PDOC00018). Comparisons of AtCBL EF hands 1, 2, 3, and 4 were performed based on the calculated consensus sequence. These consensus sequence was calculated with the ClustalW algorithm (Thompson et al., 1994
In all stress experiments described, tissue samples of Arabidopsis cv Wassilewskija were taken after 0, 1, 3, 6, 12, and 24 h, immediately frozen in liquid nitrogen, and stored at -80°C until use. Cold treatments were performed with 14-d-old soil-grown seedlings, which were transferred from 21°C to 2°C and kept under these conditions for 24 h in the dark. For drought assays, plants were grown hydroponically in Magenta boxes for 14 d on one-half-strength Murashige and Skoog medium with 0.5% (w/v) Suc. Magenta boxes were transferred into a flow chamber, and plants were exposed to a stream of dry air in a flow bench for 20 min (resulting in approximately 10% fresh weight loss) and then returned into Magenta boxes. Afterward the Magenta boxes were closed and samples were taken at the described time points. For salt stress response analyses, the hydroponic cultivation medium was supplemented with 100 mM NaCl, and samples were taken at the respective time points. RNA for quantitative RT-PCR analyses was isolated from 150-mg tissue samples using TriFast solution (Peqlab, Erlangen, Germany). cDNA syntheses were performed using 3 µg of random hexamer primer and 200 units of Moloney murine leukemia virus RT (Promega, Madison, WI) for 1 h at 42°C. Two hundred nanograms of cDNA was used as template in all PCR reactions with gene-specific primers. In preliminary experiments, we first determined for each gene to be analyzed the cycle number that was within the linear range of PCR product amplification. Subsequently, PCR amplifications (1 min, 94°C; 1 min 30 s, 58°C; 1 min, 72°C; and volume, 50 µL) were performed for 23 cycles. The six transcripts (corresponding to each time point) were assayed simultaneously in separate tubes using identical cDNA aliquots and the respective gene-specific primers. Aliquots (10 µL) of each amplification reaction were separated by agarose gel electrophoresis and transferred onto nylon membranes. All RT-PCR reactions were performed in triplicate. Hybridizations were carried out with an equimolar mixture of DNA fragments specific for the AtCBL1, AtCBL9, Atactin2, and RD29A genes and radioactively labeled with a Ready Prime kit (Amersham Biosciences, Uppsala). For signal detection and quantification, membranes were either exposed to x-ray films or evaluated by phosphorimaging.
We thank Dr. Veronica Albrecht for her support during the initial yeast two-hybrid analyses, Dr. Ralph Bock for stimulating discussions and critical reading of the manuscript, and the Arabidopsis stock centers for providing various EST clones and cDNA libraries. We also thank Drs. Russel Jones and Yong-Sic Hwang for sharing unpublished results and for helpful discussions of the rice CBL nomenclature. Received September 8, 2003; returned for revision October 8, 2003; accepted October 20, 2003.
www.plantphysiol.org/cgi/doi/10.1104/pp.103.033068.
1 This work was supported by the Deutsche Forschungsgemeinschaft as part of the Arabidopsis Functional Genomics Network (grant no. KU 931/41 to J.K.).
[w] The online version of this article contains Web-only data.
2 These authors contributed equally to this work. * Corresponding author; e-mail jkudla{at}uni_muenster.de; fax 492518323823.
Albrecht V, Ritz O, Linder S, Harter K, Kudla J (2001) The NAF domain defines a novel protein-protein interaction module conserved in Ca2+-regulated kinases. EMBO J 20: 1051-1063[CrossRef][ISI][Medline] Albrecht V, Weinl S, Balzevic D, D'Angelo C, Batistic O, Kolukisaouglu Ü, Bock R, Schulz B, Harter K, Kudla J (2003) The calcium sensor CBL1 integrates plant responses to abiotic stresses. Plant J 36: 457-470[CrossRef][ISI][Medline] Allen GJ, Schroeder JI (2001) Combining genetics and cell biology to crack the code of plant cell calcium signaling. Sci Signal Transduction Knowledge Environment 2001: RE13 Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403-410[CrossRef][ISI][Medline]
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389-3402 Ames JB, Ishima R, Tanaka T, Gordon JI, Stryer L, Ikura M (1997) Molecular mechanics of calcium-myristoyl switches. Nature 389: 198-202[CrossRef][Medline]
Baxter I, Tchieu J, Sussman MR, Boutry M, Palmgren MG, Gribskov M, Harper JF, Axelsen KB (2003) Genomic comparison of P-type ATPase ion pumps in Arabidopsis and rice. Plant Physiol 132: 618-628
Blanc G, Barakat A, Guyot R, Cooke R, Delseny M (2000) Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 12: 1093-1101 Bowers JE, Chapman BA, Rong J, Paterson AH (2003) Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422: 433-438[CrossRef][Medline] Burgoyne RD, Weiss JL (2001) The neuronal calcium sensor family of Ca2+-binding proteins. Biochem J 353: 1-12[CrossRef][ISI][Medline] Cheong YH, Kim KN, Pandey GK, Gupta R, Grant JJ, Luan S (2003) CBL1, a calcium sensor that differentially regulates salt, drought, and cold responses in Arabidopsis. Plant Cell 15: 1833-1845 |