Skip to main content

Main menu

  • For Authors
    • Submit a Manuscript
    • Instructions for Authors
  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
    • Focus Collections
    • Classics Collection
    • Upcoming Focus Issues
  • Advertisers
  • About
    • About the Journal
    • Editorial Board and Staff
  • Subscribers
  • Librarians
  • More
    • Alerts
    • Contact Us
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Plant Cell Teaching Tools
    • ASPB
    • Plantae

User menu

  • My alerts
  • Log in
  • Log out

Search

  • Advanced search
Plant Physiology
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Plant Cell Teaching Tools
    • ASPB
    • Plantae
  • My alerts
  • Log in
  • Log out
Plant Physiology

Advanced Search

  • For Authors
    • Submit a Manuscript
    • Instructions for Authors
  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
    • Focus Collections
    • Classics Collection
    • Upcoming Focus Issues
  • Advertisers
  • About
    • About the Journal
    • Editorial Board and Staff
  • Subscribers
  • Librarians
  • More
    • Alerts
    • Contact Us
  • Follow plantphysiol on Twitter
  • Visit plantphysiol on Facebook
  • Visit Plantae
LetterSCIENTIFIC CORRESPONDENCE
You have accessRestricted Access

A Large Family of Genes That Share Homology withCLAVATA3

J. Mark Cock, Sheila McCormick
J. Mark Cock
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sheila McCormick
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Published July 2001. DOI: https://doi.org/10.1104/pp.126.3.939

  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading
  • Copyright © 2001 American Society of Plant Physiologists

The receptor kinase superfamily in plants includes a large number of genes, but very little is known about the ligands that bind to these receptors. In the Arabidopsis genome, for example, 340 receptor kinase genes have been identified (The Arabidopsis Genome Initiative, 2000). The proteins encoded by this superfamily share a common overall structure: They are all integral membrane proteins with a single membrane-spanning region and a cytosolic kinase domain. The superfamily can be divided into at least 20 different families based on the structure of the predicted extracellular domains, which differ markedly between families (The Arabidopsis Genome Initiative, 2000; Torii and Clark, 2000). Even within families the extracellular domain can exhibit a high level of variability. For example, within the family of receptor kinases with Leu-rich repeats (LRRs) in their extracellular domains, the number of LRRs present varies from three to more than 20 and the LRRs may be associated with other protein domains.

Several members of the plant receptor kinase gene family have been shown to play key regulatory roles in various aspects of plant development (Torii and Clark, 2000) and defense (Song et al., 1995). In contrast, their mode of action is poorly understood and, until recently, the members of this family have been described as “receptor-like kinases” because none of them had been shown to interact with a ligand. In a recent report, however, Trotochaud et al. (2000) showed that the receptor kinase CLAVATA1 interacts physically with a small secreted peptide encoded by the CLAVATA3(CLV3) locus. This is supported by genetic evidence that also indicates that the products of CLV1 and CLV3function in close association (Clark et al., 1995; Fletcher et al., 1999). A second candidate for a receptor kinase ligand gene isSCR, which has been shown to encode the male component of the self-incompatibility response in Brassica oleracea(Schopfer et al., 1999). Based on several lines of data, including the highly polymorphic nature of the SCR gene, its expression pattern, and its location at the S locus, it is very likely that the SCR peptide binds to the S-locus receptor kinase (SRK) as a ligand.

The extracellular domain of CLV1 contains 21 LRRs, whereas SRK possesses an S domain with 12 conserved cysteines and a PTDT box (for review, see Torii and Clark, 2000). Both CLV1 andSRK are members of large gene families and it was initially hoped that identification of CLV3 and SCR would facilitate the identification of ligands for other receptor kinases of these gene families. However, paralogues of these genes have been more difficult to identify than was expected. For example, initial searches with the sequences of several SCR alleles did not identify any significant homologies in the databases (Schopfer et al., 1999). A more recent study, however, has identified a large family ofSCR-like genes in Arabidopsis (Vanoosthuyse et al., 2001). This family of genes exhibits an extreme level of diversity with only a small number of residues being conserved throughout the family. The high level of sequence diversity is the main reason these genes were not detected in initial database searches. In the study described here, we have searched for genes with similarity to CLV3, employing a similar approach to that used to identify theSCR-related genes.

Initial database searches with the CLV3 sequence yielded a poor match with the maize (Zea mays) embryo-surrounding region (ESR) proteins (high E values, i.e. low probability scores, in the range of 0.15–0.45). ESR genes are expressed in a specific zone of the developing endosperm and, like CLV3, encode small, secreted polypeptides (Opsahl-Ferstad et al., 1997). CLV3 and the ESR sequences share a short, conserved region of 14 amino acids but are otherwise unrelated at the sequence level. Two independently isolated, mutant alleles of CLV3 (clv3-1 and clv3-5) contain a point mutation within this conserved region (encoding clv3G75A) indicating that it is important for gene function (Fletcher et al., 1999). Based on these observations, which indicate that the conserved region is an important functional element of CLV3, and presumably of the ESR proteins, we carried out a database search for related sequences with the same conserved region. This was done, initially, by searching with tBlastn (Zhang and Madden, 1997) and either the entire CLV3 and ESR2 sequences, or their conserved regions, as queries. The newly retrieved sequences then were used to repeat the search in an iterative manner. The search was terminated when no novel sequences were retrieved. A total of 42 related sequences were identified including 28 genomic sequences and 13 sequences that were represented only by expressed sequence tags (ESTs). We have named these genes CLE (for CLAVATA3/ESR-related). The availability of ESTs indicates that the sequences identified correspond to expressed genes. The EST sequences were derived from seven different plant species, including both monocots and dicots (TableI). No homologs were identified from outside the plant kingdom. Based on the ESTs that were identified, the members of this family appear to be expressed in a wide range of tissues (Table I).

View this table:
  • View inline
  • View popup
Table I.

ESTs that share sequence similarity with CLAVATA3

In total, 24 CLE genes were found in the Arabidopsis genome. They are scattered throughout the genome, in most cases as individual genes although a head-to-tail cluster of four genes was found within a 16.5-kbp region on chromosome 2 (CLE4 to CLE7). Of these genes, CLE5, CLE6, and CLE7 are more closely related to each other than to the other members of the family, suggesting that they have arisen by local gene duplication events.CLE4, on the other hand, is more similar to CLE3on chromosome 1 than it is to the three other members of the cluster. The similarity between CLE4 and CLE3 suggests that there has been a recent transfer of one of these genes from one of these loci to the other. A similar phenomenon has been observed for resistance gene clusters in tomato (Parniske and Jones, 1999).

Apart from CLV3, where the reading frame is interrupted by two introns (Fletcher et al., 1999), all of the sequences identified here consisted of one exon with a single open reading frame. Of the EST sequences identified, 15 (including four that correspond to Arabidopsis genomic sequences) included the 3′ end of the open reading frame and analysis of these sequences showed that they all contained a stop codon between zero and 13 residues downstream of the conserved region. This supports the gene structures that we propose. In particular, it indicates that additional coding exons do not occur downstream of the sequences identified, at least for the transcript represented by the ESTs.

Twenty-seven of the sequences had complete open reading frames that were supported by good quality sequence data (Fig.1). All 27 genes contain short open reading frames that are predicted to encode small polypeptides (average molecular mass is 7,674 ± 1,840 D) with hydrophobic regions at the N-terminal end. In all but four cases the hydrophobic N termini are predicted to act as signal peptides (as predicted with SignalP; Nielsen et al., 1997), indicating that the gene products of most of the members of this family are secreted. The four exceptions, CLE16, CLE25, CLE26, and CLE27, are predicted to possess signal anchors. Note, however, that these predictions need to be tested experimentally and it is possible that these four proteins possess non-standard signal peptides.

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

Alignment of the deduced polypeptides of the CLE gene family (CLE1 to CLE27) with CLV3, ESR2, and ESR3. The multiple alignment was constructed using Lasergene sequence analysis software (DNASTAR, London) and optimized manually. The conserved region is indicated by a bar. The CLV3 sequence is based on the gene sequence submitted by Flecher et al. (1999; accession no. AF126009) that differs by five substitutions and two indels from the sequence submitted by the Arabidopsis Genome Initiative (Lin et al., 1999; accession no.AC006233). The two deduced CLV3 polypeptides differ only at residue seven (Val or Leu). At, Arabidopsis; Gm, soybean; Os, rice (Oryza sativa); Zm, maize.

The majority of the predicted mature polypeptides are highly basic (average pI 9.49 ± 1.57) and hydrophilic throughout their length. Somewhat different hydrophilicity profiles were observed for CLE19 and CLE15, where a short hydrophobic region occurs within the polypeptide, and for CLE25 and CLE26, where predominantly hydrophobic C termini follow the conserved region.

The conserved region was located at or near the C-terminal end of the majority of the sequences we identified (see Fig. 1). CLV3 and the ESR proteins are among the exceptions that possess additional amino acid residues C terminal to the conserved region. The conserved region contains a number of highly conserved amino acids, including between two and three Pro residues. Secondary structure predictions indicate that it adopts a turn structure and is exposed on the surface of the protein. This is consistent with this region being involved in interactions with other proteins. The Gly75 codon that is mutated in clv3-1 and clv3-5 is highly conserved in the CLE genes. The only exceptions are CLE24 andCLE27, where a Cys codon occurs at this position.

Although the CLE proteins share an overall resemblance when parameters such as length, charge, and hydrophilicity are considered, at the amino acid sequence level they are highly divergent (Fig. 1). This sequence diversity could be the result of either: (a) the conserved domain being associated with diverse, unrelated (non-homologous) protein domains, or (b) extensive sequence diversification having occurred within the CLE family (or to a combination of these two phenomena). The second possibility would not be unprecedented because a very high level of sequence diversity was also observed for the SCR-like (SCRL) gene family in Arabidopsis despite the fact that there is good reason to believe that the SCRL genes are homologous throughout their length (Vanoosthuyse et al., 2001).

There is evidence that processing of secreted signaling polypeptides occurs in plants and it has been pointed out that CLV3 contains a potential dibasic processing site that could be recognized by subtilases (Schaller and Ryan, 1994; Berger and Altmann, 2000). More recent evidence indicates that CLV3 is not processed, however. Anti-CLV3 antibodies detect a protein of approximately the size expected for the secreted, “unprocessed” polypeptide (8.4 kD) in Arabidopsis extracts (Trotochaud et al., 2000). It would, nonetheless, be interesting to determine whether other members of the CLE family are processed. If the more divergent regions of the proteins do not constitute part of the final mature gene product, this might explain, at least in part, the high level of sequence divergence in these regions.

The ESR proteins may represent ligands for LRR receptor kinases that are expressed in the embryo or endosperm, and it is tempting to speculate that the CLE proteins also represent ligands for LRR receptor kinases. Functional analysis of these genes will be required to confirm this. An additional problem will be assigning ligands to specific members of the receptor kinase superfamily. The data currently available indicate that it will not be straightforward to do this, based simply on sequence data. For example, putative ligands have recently been identified for the pollen-specific receptor kinases, which have five to six LRRs (Muschietti et al., 1998). These putative ligands are small Cys-rich proteins, unrelated to CLV3 and the CLE proteins, and are more like SCR, which is also rich in Cys (although there is no obvious homology at the sequence level; Tang et al., 2000). The ligand for another LRR receptor kinase, BRI1, which has 27 LRRs, is thought to be a brassinosteroid (He et al., 2000). Note, however, that BRI1 also has a 70-amino acid “island” in the extracellular domain that is not found in other LRR receptor kinases (Li and Chory, 1997). Therefore, it seems likely that, even within the LRR family of receptor kinases, the ligands will turn out to be structurally diverse. We suggest that a reasonable starting point for assigning candidate ligands to receptors in this family may be to group the receptors according to the number of LRRs in their extracellular domains.

Small genes, like those of the CLE family, are often overlooked by automated annotation programs (Ride et al., 1999; Vanoosthuyse et al., 2001). Of the 28 CLE sequences identified in genome sequence (24 from Arabidopsis and four from rice), only eight were annotated as genes and, of these, only five of the annotations corresponded to those proposed here. The approach we used to identify the CLE genes consisted of identifying a conserved motif, based on the alignment of a limited number of members of the gene family, and using this information to manually select additional gene family members from a pool of potential paralogues identified in a sensitive homology search using tBlastn. The SPH (Ride et al., 1999), SCRL and LCR (Vanoosthuyse et al., 2001) gene families were identified in a similar manner, indicating that this approach may be generally applicable to the identification of families of genes that encode small proteins.

Footnotes

  • ↵* Corresponding author; e-mailMark.Cock{at}ens-lyon.fr; fax 334–7272–8600.

  • Received February 5, 2001.
  • Revision received February 22, 2001.
  • Accepted March 9, 2001.

LITERATURE CITED

  1. ↵
    1. Berger D,
    2. Altmann T
    (2000) Genes Dev 14:1119–1131.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    1. Clark SE,
    2. Running MP,
    3. Meyerowitz EM
    (1995) Development 121:2057–2067.
    OpenUrlAbstract
  3. ↵
    1. Fletcher JC,
    2. Brand U,
    3. Running MP,
    4. Simon R,
    5. Meyerowitz EM
    (1999) Science 283:1911–1914.
    OpenUrlAbstract/FREE Full Text
  4. ↵
    1. He Z,
    2. Wang ZY,
    3. Li J,
    4. Zhu Q,
    5. Lamb C,
    6. Ronald P,
    7. Chory J
    (2000) Science 288:2360–2363.
    OpenUrlAbstract/FREE Full Text
  5. ↵
    1. Li J,
    2. Chory J
    (1997) Cell 90:929–938.
    OpenUrlCrossRefPubMed
  6. ↵
    1. Muschietti JP,
    2. Eyal Y,
    3. McCormick S
    (1998) Plant Cell 10:319–330.
    OpenUrlAbstract/FREE Full Text
  7. ↵
    1. Nielsen H,
    2. Engelbrecht J,
    3. Brunak S,
    4. von Heijne G
    (1997) Protein Eng 10:1–6.
    OpenUrlAbstract/FREE Full Text
  8. ↵
    1. Opsahl-Ferstad HG,
    2. Le Deunff E,
    3. Dumas C,
    4. Rogowsky PM
    (1997) Plant J 12:235–246.
    OpenUrlCrossRefPubMed
  9. ↵
    1. Parniske M,
    2. Jones JD
    (1999) Proc Natl Acad Sci USA 96:5850–5855.
    OpenUrlAbstract/FREE Full Text
  10. ↵
    1. Ride JP,
    2. Davies EM,
    3. Franklin FCH,
    4. Marshall DF
    (1999) Plant Mol Biol 39:927–932.
    OpenUrlCrossRefPubMed
  11. ↵
    1. Schaller A,
    2. Ryan CA
    (1994) Proc Natl Acad Sci USA 91:11802–11806.
    OpenUrlAbstract/FREE Full Text
  12. ↵
    1. Schopfer CR,
    2. Nasrallah ME,
    3. Nasrallah JB
    (1999) Science 286:1697–1700.
    OpenUrlAbstract/FREE Full Text
  13. ↵
    1. Song WY,
    2. Wang GL,
    3. Chen LL,
    4. Kim HS,
    5. Pi LY,
    6. Holsten T,
    7. Gardner J,
    8. Wang B,
    9. Zhai WX,
    10. Zhu L-H
    (1995) Science 270:1804–1806.
    OpenUrlAbstract/FREE Full Text
  14. ↵
    1. Tang W,
    2. Ezcurra I,
    3. Cotter R,
    4. Muschietti J,
    5. McCormick S
    (2000) ASCB meeting, abstract no. L68. San Francisco.
  15. ↵
    1. The Arabidopsis Genome Initiative
    (2000) Nature 408:796–815.
    OpenUrlCrossRefPubMed
  16. ↵
    1. Kreis M,
    2. Walker JC
    1. Torii KU,
    2. Clark SE
    (2000) in Advances in Botanical Research, Thematic Volume: Plant Protein Kinases, eds Kreis M, Walker JC (Academic Press, London), 32:270–298.
  17. ↵
    1. Trotochaud AE,
    2. Jeong S,
    3. Clark SE
    (2000) Science 289:613–617.
    OpenUrlAbstract/FREE Full Text
  18. ↵
    1. Vanoosthuyse V,
    2. Miege C,
    3. Dumas C,
    4. Cock JM
    (2001) Plant Mol Biol 46:17–34.
    OpenUrlCrossRefPubMed
  19. ↵
    1. Zhang J,
    2. Madden TL
    (1997) Genome Res 7:649–656.
    OpenUrlAbstract/FREE Full Text
PreviousNext
Back to top

Table of Contents

Print
Download PDF
Email Article

Thank you for your interest in spreading the word on Plant Physiology.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
A Large Family of Genes That Share Homology withCLAVATA3
(Your Name) has sent you a message from Plant Physiology
(Your Name) thought you would like to see the Plant Physiology web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
A Large Family of Genes That Share Homology withCLAVATA3
J. Mark Cock, Sheila McCormick
Plant Physiology Jul 2001, 126 (3) 939-942; DOI: 10.1104/pp.126.3.939

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
A Large Family of Genes That Share Homology withCLAVATA3
J. Mark Cock, Sheila McCormick
Plant Physiology Jul 2001, 126 (3) 939-942; DOI: 10.1104/pp.126.3.939
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Footnotes
    • LITERATURE CITED
  • Figures & Data
  • Info & Metrics
  • PDF

In this issue

Plant Physiology: 126 (3)
Plant Physiology
Vol. 126, Issue 3
Jul 2001
  • Table of Contents
  • About the Cover
  • Index by author
View this article with LENS

More in this TOC Section

  • A GPI Signal Peptide-Anchored Split-Ubiquitin (GPS) System for Detecting Soluble Bait Protein Interactions at the Membrane
  • ABA Accumulation in Dehydrating Leaves Is Associated with Decline in Cell Volume, Not Turgor Pressure
  • Seedlings Lacking the PTM Protein Do Not Show a genomes uncoupled (gun) Mutant Phenotype
Show more SCIENTIFIC CORRESPONDENCE

Similar Articles

Our Content

  • Home
  • Current Issue
  • Plant Physiology Preview
  • Archive
  • Focus Collections
  • Classic Collections
  • The Plant Cell
  • Plant Direct
  • Plantae
  • ASPB

For Authors

  • Instructions
  • Submit a Manuscript
  • Editorial Board and Staff
  • Policies
  • Recognizing our Authors

For Reviewers

  • Instructions
  • Journal Miles
  • Policies

Other Services

  • Permissions
  • Librarian resources
  • Advertise in our journals
  • Alerts
  • RSS Feeds

Copyright © 2021 by The American Society of Plant Biologists

Powered by HighWire