|
|
||||||||
|
Plant Physiology 140:1142-1150 (2006) © 2006 American Society of Plant Biologists MEKHLA, a Novel Domain with Similarity to PAS Domains, Is Fused to Plant Homeodomain-Leucine Zipper III Proteins1,[W]Department of Biosciences and Nutrition and Center for Genomics and Bioinformatics, Karolinska Institutet, SE141 57 Huddinge, Sweden; and School of Life Sciences, Södertörns Högskola, SE141 89 Huddinge, Sweden
Homeodomain (HD) proteins play important roles in the development of plants, fungi, and animals. Here we identify a novel domain, MEKHLA, in the C terminus of HD-Leu zipper (HD-ZIP) III plant HD proteins that shares similarity with a group of bacterial proteins and a protein from the green alga Chlamydomonas reinhardtii. The group of bacterial MEKHLA proteins is found in cyanobacteria and other bacteria often found associated with plants. Phylogenetic analysis suggests that a MEKHLA protein transferred, possibly from a cyanobacterium or an early chloroplast, into the nuclear genome of an early plant in a first step, and attached itself to the C terminus of an HD-ZIP IV homeobox gene in a second step. Further position-specific iterated-BLAST searches with the bacterial MEKHLA proteins revealed a subregion within the MEKHLA domain that shares significant similarity with the PAS domain. The PAS domain is a sensory module found in many proteins through all kingdoms of life. It is involved in light, oxygen, and redox potential sensation. The fact that HD-ZIP III proteins are transcription factors that have this sensory domain attached to their C terminus uncovers a potential new signaling pathway in plants.
Homeodomain (HD) proteins are transcription factors that play important roles in the development of plants, fungi, and animals (Bürglin, 2005
In this article, we identify a new domain at the C terminus of the HD-ZIP III proteins. This domain has similarity to a group of bacterial proteins with no known function. However, further searches with the bacterial proteins as query revealed additional sequence similarity with proteins containing PAS domains. Proteins containing PAS domains have been found through all kingdoms of life. The PAS domain was originally discovered in the Drosophila period (Per) protein, the vertebrate Arnt proteins, and the Drosophia single-minded (Sim) protein (Hoffman et al., 1991 -barrel that, together with flanking -helixes, forms a pocket that may contain various prosthetic groups. For example, bacterial FixL proteins contain both PAS and kinase domains, and their PAS domain senses oxygen via the bound heme molecule and the signal is relayed via the His kinase (Miyatake et al., 2000
The C Terminus of HD-ZIP III Proteins Contains a Conserved Domain, Termed MEKHLA
In the course of analyzing Arabidopsis homeobox genes (K. Mukherjee and T.R. Bürglin, unpublished data), we noted a difference between the HD-ZIP III and HD-ZIP IV homeobox genes in the C-terminal region. Full-length sequence alignments of HD-ZIP III and HD-ZIP IV sequences reveals that extensive sequence conservation extends throughout the length of the HD-ZIP IV proteins (Fig. 1A; Supplemental Fig. 1). In addition to the HD, the Leu zipper, and the START domain, there is a region composed of four conserved blocks that follows the START domain, which has also been noted by others (Sessa et al., 1998
HD-ZIP IV proteins are consistently somewhat shorter than HD-ZIP III proteins and the full-length alignment (Supplemental Fig. 1) reveals that HD-ZIP III have a C-terminal extension of about 150 amino acids. This region is also well conserved between the HD-ZIP III proteins from both monocots (Oryza sativa) and dicots (Arabidopsis), and we find that there is about 45% sequence identity between the moss Physcomitrella patens protein (Sakakibara et al., 2001
The MEKHLA Domain Is Found in Bacterial Proteins
Using the MEKHLA domain, we performed BLASTP searches and, apart from HD-ZIP III proteins, we found matches to bacterial proteins that had approximately 26% sequence identity and expected probabilities of 3e-04. A conservative cutoff value for a significant sequence similarity is considered to be 1e-03 (Webber and Ponting, 2004
Recently, BLAST searches of the Chlamydomonas reinhardtii genome, which is not yet incorporated into the databases at the National Center for Biotechnology Information (NCBI), using one of the bacterial MEKHLA proteins as query revealed a protein, CC-1690, with high similarity to the bacterial MEKHLA proteins (Supplemental Fig. 3A). The protein is about 330 amino acids long, but the only region containing any sequence similarity is the MEKHLA domain in the center (Fig. 2). The C. reinhardtii CC-1690 protein is very similar to the bacterial MEKHLA proteins. For example, it is 40% identical to the bacterial MEKHLA protein NE0286 (Supplemental Fig. 3B) over 150 residues. We note that CC-1690 is located in the nuclear genome of C. reinhardtii because it has nine introns; it is known to be expressed because several expressed sequence tags are present in the Chlamydomonas database. The fact that a MEKHLA domain protein occurs in this green alga and has been evolutionarily conserved suggests that it plays an important function in this simple plant.
We performed further PSI-BLAST searches using bacterial MEKHLA proteins as query sequences for iterative searches. After the second iteration, using Bp_BPSL1625 as query, we retrieved all bacterial MEKHLA proteins, and the first plant HD-ZIP III proteins (matches in the MEKHLA domain) appeared in the inclusion list, the best match with an expected probability of 2e-06. However, in addition, interspersed with the plant MEKHLA domain hits, we found new, previously undetected matches to bacterial proteins. The best new hit at this iteration was with a transduction His kinase protein from Ralstonia eutropha (26% identity over 126 residues) with an expected probability of 5e-05, which is a significant value; many of the HD-ZIP III proteins found in the hit list were further below at expected probabilities up to 0.1, although we have already shown above that the HD-ZIP III proteins have significant similarity. We also used the C. reinhardtii CC-1690 as query in PSI-BLAST searches and detected the best-matching bacterial PAS domain containing proteins in the second iteration with an expected low probability of 4e-07 (Supplemental Fig. 3C), lower than the best HD-ZIP III matches. Further iterations with the different query sequences retrieved many more two-component sensor kinases, sensory box His kinases, and PAS domain proteins, in addition to the MEKHLA domain. The PSI-BLAST iterations were ended after several rounds because the number of sequences retrieved became too large; as we have shown earlier, more than 3,000 PAS-containing proteins are now in the databases.
We examined the protein region in the new bacterial kinases, where the MEKHLA domain matched. Invariably, the location of the match was within a domain identified as PAS with the Conserved Domain Database (CDD) and SMART domain detection tools. We extracted the PAS domains of the best-scoring 20 proteins; they are shown in the multiple sequence alignment in Figure 2. Whereas significant scores were obtained using PSI-BLAST, the sequence similarity between the PAS domain and the plant MEKHLA domain is rather low. The best-matching sequence is C. reinhardtii CC-1690 with 30% identity over the central 89 residues (Supplemental Fig. 3D). The HD-ZIP III sequences show lower similarity; for example, P. patens hb10 is 13% identical over 110 residues with Nostoc alr4836. However, this low sequence similarity is not surprising, given that, in general, PAS domains are poorly conserved at the primary sequence level and average only 12% identity within the broader class of PAS domains (Gilles-Gonzalez and Gonzalez, 2004
The PAS domains with the best scores to the MEKHLA domain are bacterial sensor His kinases (Fig. 2). The best-studied proteins in this group are the FixL proteins of Rhizobia, which play an important role in nitrogen fixation. The structure of FixL PAS domains has been determined, revealing how these PAS domains sense oxygen via a bound heme molecule (Miyatake et al., 2000
We performed phylogenetic analyses of the aligned MEKHLA and the closest related PAS domain sequences (as shown in Fig. 2). These analyses show that the plant MEKHLA domain and the bacterial MEKHLA proteins form a distinct clade that is supported by a high bootstrap value of >86% (Fig. 3) and that the other PAS domain-containing proteins fall into a separate clade. This indicates that the bacterial MEKHLA proteins are clearly related to the MEKHLA domain in HD-ZIP III proteins. The fact that the MEKHLA domain is longer at the amino terminus compared to the PAS domain, and has characteristic conserved residues found both in plant and bacterial MEKLHA domains (Supplemental Figs. 4 and 5), supports the notion that MEKHLA proteins are more related to each other than to other PAS domain proteins and constitute their own group. The phylogenetic position of C. reinhardtii MEKHLA protein CC-1690 presents an interesting case; it is firmly rooted within the bacterial MEKHLA proteins (Fig. 2), suggesting it is closely related to them. Within the bacterial MEKHLA proteins, CC-1690 cannot be placed within any particular subclade. To understand the evolution of MEKHLA domain proteins in plants, we also have to consider several additional pieces of evidence. (1) Our searches have shown that, in the currently available eukaryotic genomes, the MEKHLA domain is only found in HD-ZIP III homeobox genes of higher plants and in C. reinhardtii. (2) In C. reinhardtii, we have presently not detected any HD-ZIP homeobox genes. (3) The bacterial MEKHLA proteins are found only in a select subset of bacterial species, most of which are either cyanobacteria or proteobacteria (Table I), and many of which are found associated with plants. The most likely interpretation of these data is that, in a first step, a complete bacterial MEKHLA gene transferred to the nuclear genome of an early plant because it is present in the green alga C. reinhardtii. This could have happened as part of the large-scale gene transfer from the early chloroplast to the nucleus (Martin et al., 2002
Recent studies of HD-ZIP III genes have shown that they are involved in patterning lateral organs and shoot apical meristem (AM) formation as well as lateral root development (Hawker and Bowman, 2004
Our bioinformatics analyses show that the MEKHLA domain originated as a bacterial protein and that the C-terminal three-fourths of the MEKHLA domain share significant similarity with PAS domains. The PAS domain, despite its high sequence variability, has been shown to be an internal sensor of oxygen, redox potential, and light in many different proteins, both in bacteria and in animals (Taylor and Zhulin, 1999
Our discovery of the MEKHLA domain, which shares similarity with the PAS domain, suggests the existence of a novel signaling pathway that might relay the AM signal. Like in fungal PAS transcription factors, a signal could be converted directly into a transcriptional response. Alternatively, the MEKHLA domain may not necessarily relay an AM signal. Instead, another explanation for the function of the START and PAS domains in the HD-ZIP III proteins might be that these domains are part of converging pathways that are involved in sensing the nutritional and energy state of a cell and influence transcriptional activity by determining whether sufficient resources exist for shoot, root, and lateral organ development.
The NCBI nonredundant database of protein sequences (http://www.ncbi.nlm.nih.gov/BLAST) was searched using the default parameters of the BLASTP and PSI-BLAST programs (Altschul et al., 1997
Multiple sequence alignments were constructed using ClustalX 1.83 (Thompson et al., 1997 www-jpred; Cuff et al., 1998
For phylogenetic analysis, neighbor joining as built into ClustalX 1.83 was used. For bootstrapping, 1,000 trials were run. Protein logos were generated using LogoBar (Pérez-Bercoff et al., 2006
We thank Kay Schneitz for helpful discussions and comments. Received November 7, 2005; returned for revision January 16, 2006; accepted February 6, 2006.
1 This work was supported by the Swedish Foundation for Strategic Research and the Karolinska Institutet. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Thomas R. Bürglin (thomas.burglin{at}biosci.ki.se).
[W] The online version of this article contains Web-only data. www.plantphysiol.org/cgi/doi/10.1104/pp.105.073833. * Corresponding author; e-mail thomas.burglin{at}biosci.ki.se; fax 4686084510.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 33893402 Aso K, Kato M, Banks JA, Hasebe M (1999) Characterization of homeodomain-leucine zipper genes in the fern Ceratopteris richardii and the evolution of the homeodomain-leucine zipper gene family in vascular plants. Mol Biol Evol 16: 544552[Abstract] Bao N, Lye KW, Barton MK (2004) MicroRNA binding sites in Arabidopsis class III HD-ZIP mRNAs are required for methylation of the template chromosome. Dev Cell 7: 653662[CrossRef][Web of Science][Medline] Borgstahl GE, Williams DR, Getzoff ED (1995) 1.4 A structure of photoactive yellow protein, a cytosolic photoreceptor: unusual fold, active site, and chromophore. Biochemistry 34: 62786287[CrossRef][Medline] Bowman JL (2004) Class III HD-Zip gene regulation, the golden fleece of ARGONAUTE activity? Bioessays 26: 938942[CrossRef][Web of Science][Medline] Bürglin TR (1995) The evolution of homeobox genes. In R Arai, M Kato, Y Doi, eds, Biodiversity and Evolution. The National Science Museum Foundation, Tokyo, pp 291336 Bürglin TR (2005) Homeodomain proteins. In RA Meyers, ed, Encyclopedia of Molecular Cell Biology and Molecular Medicine, Ed 2, Vol 6. Wiley-VCH Verlag GmbH & Co., Weinheim, Germany, pp 179222 Card PB, Erbel PJ, Gardner KH (2005) Structural basis of ARNT PAS-B dimerization: use of a common beta-sheet interface for hetero- and homodimerization. J Mol Biol 353: 664677[CrossRef][Medline] Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD (2003) Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res 31: 34973500 Crosson S, Moffat K (2001) Structure of a flavin-binding plant photoreceptor domain: insights into light-mediated signal transduction. Proc Natl Acad Sci USA 98: 29953000 Cuff JA, Clamp ME, Siddiqui AS, Finlay M, Barton GJ (1998) JPred: a consensus secondary structure prediction server. Bioinformatics 14: 892893 Duboule D, editor (1994) Guidebook to the Homeobox Genes. Oxford University Press, Oxford Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113[CrossRef][Medline] Emery JF, Floyd SK, Alvarez J, Eshed Y, Hawker NP, Izhaki A, Baum SF, Bowman JL (2003) Radial patterning of Arabidopsis shoots by class III HD-ZIP and KANADI genes. Curr Biol 13: 17681774[CrossRef][Web of Science][Medline] Engstrom EM, Izhaki A, Bowman JL (2004) Promoter bashing, microRNAs, and Knox genes: new insights, regulators, and targets-of-regulation in the establishment of lateral organ polarity in Arabidopsis. Plant Physiol 135: 685694 Gehring WJ, Affolter M, Bürglin TR (1994) Homeodomain proteins. Annu Rev Biochem 63: 487526[CrossRef][Web of Science][Medline] Getzoff ED, Gutwin KN, Genick UK (2003) Anticipatory active-site motions and chromophore distortion prime photoreceptor PYP for light activation. Nat Struct Biol 10: 663668[CrossRef][Web of Science][Medline] Gilles-Gonzalez MA, Gonzalez G (2004) Signal transduction by heme-containing PAS-domain proteins. J Appl Physiol 96: 774783 Gilles-Gonzalez MA, Gonzalez G (2005) Heme-based sensors: defining characteristics, recent developments, and regulatory hypotheses. J Inorg Biochem 99: 122[CrossRef][Web of Science][Medline] Gong W, Hao B, Mansy SS, Gonzalez G, Gilles-Gonzalez MA, Chan MK (1998) Structure of a biological oxygen sensor: a new mechanism for heme-driven signal transduction. Proc Natl Acad Sci USA 95: 1517715182 Green KA, Prigge MJ, Katzman RB, Clark SE (2005) CORONA, a member of the class III homeodomain leucine zipper gene family in Arabidopsis regulates stem cell specification and organogenesis. Plant Cell 17: 691704 Hao B, Isaza C, Arndt J, Soltis M, Chan MK (2002) Structure-based mechanism of O2 sensing and ligand discrimination by the FixL heme domain of Bradyrhizobium japonicum. Biochemistry 41: 1295212958[CrossRef][Medline] Hawker NP, Bowman JL (2004) Roles for class III HD-Zip and KANADI genes in Arabidopsis root development. Plant Physiol 135: 22612270 Hoffman EC, Reyes H, Chu FF, Sander F, Conley LH, Brooks BA, Hankinson O (1991) Cloning of a factor required for activity of the Ah (dioxin) receptor. Science 252: 954958 Kappen C, Schughart K, Ruddle FH (1993) Early evolutionary origin of major homeodomain sequence classes. Genomics 18: 5470[CrossRef][Web of Science][Medline] Letunic I, Copley RR, Schmidt S, Ciccarelli FD, Doerks T, Schultz J, Ponting CP, Bork P (2004) SMART 4.0: towards genomic data integration. Nucleic Acids Res 32: D142D144 Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, Gwadz M, He S, Hurwitz DI, Jackson JD, Ke Z, et al (2005) CDD: a conserved domain database for protein classification. Nucleic Acids Res 33: D192D196 Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa M, Penny D (2002) Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci USA 99: 1224612251 McConnell JR, Barton MK (1998) Leaf polarity and meristem formation in Arabidopsis. Development 125: 29352942[Abstract] McConnell JR, Emery J, Eshed Y, Bao N, Bowman J, Barton MK (2001) Role of PHABULOSA and PHAVOLUTA in determining radial patterning in shoots. Nature 411: 709713[CrossRef][Medline] Miyatake H, Mukai M, Park SY, Adachi S, Tamura K, Nakamura H, Nakamura K, Tsuchiya T, Iizuka T, Shiro Y (2000) Sensory mechanism of oxygen sensor FixL from Rhizobium meliloti: crystallographic, mutagenesis and resonance Raman spectroscopic studies. J Mol Biol 301: 415431[CrossRef][Web of Science][Medline] Nambu JR, Lewis JO, Wharton KA Jr, Crews ST (1991) The Drosophila single-minded gene encodes a helix-loop-helix protein that acts as a master regulator of CNS midline development. Cell 67: 11571167[CrossRef][Web of Science][Medline] Ohashi-Ito K, Demura T, Fukuda H (2002) Promotion of transcript accumulation of novel Zinnia immature xylem-specific HD-Zip III homeobox genes by brassinosteroids. Plant Cell Physiol 43: 11461153 Ohashi-Ito K, Fukuda H (2003) HD-zip III homeobox genes that include a novel member, ZeHB-13 (Zinnia)/ATHB-15 (Arabidopsis), are involved in procambium and xylem cell differentiation. Plant Cell Physiol 44: 13501358 Ohashi-Ito K, Kubo M, Demura T, Fukuda H (2005) Class III homeodomain leucine-zipper proteins regulate xylem cell differentiation. Plant Cell Physiol 46: 16461656 Pellequer JL, Brudler R, Getzoff ED (1999) Biological sensors: more than one way to sense oxygen. Curr Biol 9: R416R418[CrossRef][Web of Science][Medline] Pérez-Bercoff Å, Koch J, Bürglin TR (2006) LogoBar: bar graph visualization of protein logos with gaps. Bioinformatics 22: 112114 Ponting CP, Aravind L (1997) PAS: a multifunctional domain family comes to light. Curr Biol 7: R674R677[Web of Science][Medline] Ponting CP, Aravind L (1999) START: a lipid-binding domain in StAR, HD-ZIP and signalling proteins. Trends Biochem Sci 24: 130132[CrossRef][Web of Science][Medline] Prigge MJ, Otsuga D, Alonso JM, Ecker JR, Drews GN, Clark SE (2005) Class III homeodomain-leucine zipper gene family members have overlapping, antagonistic, and distinct roles in Arabidopsis development. Plant Cell 17: 6176 Sakakibara K, Nishiyama T, Kato M, Hasebe M (2001) Isolation of homeodomain-leucine zipper genes from the moss Physcomitrella patens and the evolution of homeodomain-leucine zipper genes in land plants. Mol Biol Evol 18: 491502 Schrick K, Nguyen D, Karlowski WM, Mayer KF (2004) START lipid/sterol-binding domains are amplified in plants and are predominantly associated with homeodomain transcription factors. Genome Biol 5: R41[CrossRef][Medline] Sessa G, Carabelli M, Ruberti I, Lucchetti S, Baima S, Morelli G (1994) Identification of distinct families of HD-Zip proteins in Arabidopsis thaliana. In G Coruzzi, P Puigdomenech, eds, Molecular-Genetic Analysis of Plant Metabolism and Development, NATO ASI Series, Vol H81. Springer-Verlag, Berlin, pp 411426 Sessa G, Steindler C, Morelli G, Ruberti I (1998) The Arabidopsis Athb-8, -9 and -14 genes are members of a small gene family coding for highly related HD-ZIP proteins. Plant Mol Biol 38: 609622[CrossRef][Web of Science][Medline] Simossis VA, Heringa J (2005) PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information. Nucleic Acids Res 33: W289W294 Sussex IM (1951) Experiments on the cause of dorsiventrality in leaves. Nature 167: 651652[CrossRef][Medline] Taylor BL, Zhulin IB (1999) PAS domains: internal sensors of oxygen, redox potential, and light. Microbiol Mol Biol Rev 63: 479506 Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 48764882 Webber C, Ponting CP (2004) Genes and homology. Curr Biol 14: R332R333[Medline] Zhulin IB, Taylor BL, Dixon R (1997) PAS domain S-boxes in Archaea, bacteria and sensors for oxygen and redox. Trends Biochem Sci 22: 331333[CrossRef][Web of Science][Medline] This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|