|
|
||||||||
|
Plant Physiology 134:1718-1732 (2004) © 2004 American Society of Plant Biologists The GATA Family of Transcription Factors in Arabidopsis and Rice1Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas, Universidad de Seville, Américo Vespucio s/n, E41092 Seville, Spain
GATA transcription factors are a group of DNA binding proteins broadly distributed in eukaryotes. The GATA factors DNA binding domain is a class IV zinc finger motif in the form CX2CX1720CX2C followed by a basic region. In plants, GATA DNA motifs have been implicated in light-dependent and nitrate-dependent control of transcription. Herein, we show that the Arabidopsis and the rice (Oryza sativa) genomes present 29 and 28 loci, respectively, that encode for putative GATA factors. A phylogenetic analysis of the 57 GATA factors encoding genes, as well as the study of their intron-exon structure, indicates the existence of seven subfamilies of GATA genes. Some of these subfamilies are represented in both species but others are exclusive for one of them. In addition to the GATA zinc finger motif, polypeptides of the different subfamilies are characterized by the presence of additional domains such as an acidic domain, a CCT (CONSTANS, CO-like, and TOC1) domain, or a transposase-like domain also found in FAR1 and FHY3. Subfamily VI comprises genes that encode putative bi-zinc finger polypeptides, also found in metazoan and fungi, and a tri-zinc finger protein which has not been previously reported in eukaryotes. The phylogeny of the GATA zinc finger motif, excluding flanking regions, evidenced the existence of four classes of GATA zinc fingers, three of them containing 18 residues in the zinc finger loop and one containing a 20-residue loop. Our results support multiple models of evolution of the GATA gene family in plants including gene duplication and exon shuffling.
GATA factors are a class of transcriptional regulators present in fungi, metazoans, and plants that normally recognize the consensus sequence WGATAR (W = T or A; R = G or A; Lowry and Atchley, 2000 -sheets followed by an -helix and a nonstructured basic tail. Side chains of the zinc finger make hydrophobic contacts in the major groove of the DNA whereas the carboxy-terminal basic tail contacts the phosphate backbone in the minor groove in the case of the cGATA1 structure and in the major groove in the case of the A. nidulans structure.
Evidence of the existence of GATA factors in plants came first with the identification of GATA motifs in the regulatory regions of light and circadian clock responsive genes (for review, see Terzaghi and Cashmore, 1995
The GATA Gene Family in Arabidopsis
BLAST searches in available Arabidopsis databases using the Arabidopsis GATA1, the cGATA1, and the A. nidulans AreA full-length protein sequences showed the existence of 29 different Arabidopsis loci encoding proteins containing GATA-like zinc fingers (Table I). All the deduced protein sequences present only one zinc finger domain. Twenty-six of the amino acid sequences contain zinc finger motifs with 18 residues in the zinc finger loop. The other three GATA proteins (encoded by At4g24470, At3g21175, and At1g51600) exhibit zinc fingers with 20-residue loops. To determine the relationships between the different members of the GATA family in Arabidopsis we performed an alignment of the 29 full-length GATA proteins. Phylogenetic trees were generated with the Neighbor-Joining method (Saitou and Nei, 1987
Subfamily III is formed by three genes (At1g51600, At4g24470, and At3g21175) that encode GATA factors with 20 residues in the zinc finger loop (CX2CX20CX2C). These three genes are constituted by seven exons where the zinc finger is encoded by the fifth exon (Fig. 1). In addition, the three proteins of this subfamily present another conserved domain in the middle region of the protein. BLAST searches of the plant databases demonstrated that this domain is also present in the flowering time controller protein CONSTANS (CO), in other CO-like proteins (Robson et al., 2001
Finally, subfamily IV is formed by two closely related genes with a nonhomogeneous intron-exon composition and is characterized by the presence of a CX2CX18CX2C zinc finger domain at the amino-terminal end of the protein. No other known domains were found in the rest of the protein (Fig. 1). Analysis of the Arabidopsis expressed sequence tag (EST) databases indicated that partial or complete cDNA sequences have been reported for 22 of the 29 GATA genes (Table I). The total number of ESTs found for a given cDNA provides an indication of the expression level of the corresponding gene. Interestingly, the analysis of 127 ESTs evidenced that GATA genes of the subfamily I are (with some exceptions) much more represented than genes from subfamily II. Thus, 69% of the ESTs correspond to cDNAs of subfamily I genes (14 genes) while 15% and 15.5% of the ESTs correspond to cDNAs of subfamily II genes (10 genes) and III (3 genes), respectively. Finally, 3 ESTs were found for one of the genes of subfamily IV, At4g17570, but not for At5g47140.
The high number of members of the Arabidopsis GATA factors family contrasts with the relatively small size of the gene family in metazoan and fungi (611 members), raising the question of how the expansion of this family occurred in the plant lineage. The topology of the phylogenetic tree shown in Figure 1 suggests in some cases a clear paralogous pattern of gene divergence, i.e. evolution by gene duplication. To further investigate this question we analyzed the location of GATA genes in the Arabidopsis chromosomes. Twenty-five of the 29 GATA genes are found in previously identified chromosomal duplications (Simillion et al., 2002
The GATA Gene Family in Oryza sativa BLAST searches in several rice databases using Arabidopsis full-length GATA protein sequences from the different subfamilies, as well as sequences from the cGATA1 and the A. nidulans AreA proteins, identified 28 different rice loci encoding proteins containing GATA-like zinc fingers (Table II). A first inspection of the amino acid sequences suggested a higher complexity and variety of GATA genes in rice in comparison to Arabidopsis. Twenty-five of the protein sequences contain only one zinc finger. Two of the sequences (OsGATA25 and OsGATA26) present two GATA-type zinc fingers, and one deduced sequence (OsGATA24) presents three GATA-type zinc fingers. In addition, OsGATA24 also contains one-half of a fourth GATA related zinc finger. While two-zinc finger GATA factors are well known in animals and some fungi, there is no previous evidence of the existence of two-GATA zinc finger proteins in plants. Furthermore, to our knowledge, proteins containing three or four GATA-type zinc fingers have never been reported in eukaryotes.
To determine the relationships among the different members of the GATA family in rice, we performed a phylogenetic analysis of the 28 full-length GATA protein sequences. After alignment, phylogenetic trees were also generated with the Neighbor-Joining method using the 5138.t00015 (TIGR locus accession) sequence as outgroup. This rice locus encodes a CX2CX17CX2C zinc finger protein distantly related to the GATA type zinc finger. Figure 4 shows the phylogenetic tree of polypeptide sequences together with the domain and the intron-exon organization of the corresponding deduced proteins and genes, respectively. The relationship between the Arabidopsis and the rice GATA proteins was investigated by generating an alignment of the 57 identified GATA protein sequences followed by the construction of a Neighbor-Joining phylogenetic tree (data not shown). The phylogeny of the rice sequences evidenced the existence of several subfamilies of GATA factors. The combined phylogeny demonstrated that subfamilies I, II, and III from Arabidopsis are also present in rice. Sequences with similar features to that of Arabidopsis subfamily IV are absent in rice. However, new subfamilies were found exclusively in rice.
The rice subfamily I is constituted by 7 genes with two or three exons where the 3'last exon encodes the complete zinc finger motif and the carboxy-terminal basic region (Fig. 4). As in Arabidopsis, the proteins encoded by these genes present an 18-residue zinc finger loop and an amino-terminal acidic domain. The combined phylogeny between the Arabidopsis and the rice sequences allowed us to propose putative ortholog groups of genes. For example At5g25830, At4g32890, At2g45050, and At3g60530 (subfamily I, Fig. 1) are closely related genes in Arabidopsis. The OsGATA1 and OsGATA6 genes appear more related to these genes than to other members of the rice subfamily I, suggesting that ancient plants that existed before the monocot/dicot divergence already presented two or more GATA factors of subfamily I. Subfamily II is constituted by 9 genes. Gene structure is also conserved between Arabidopsis and rice in this family. Thus, these genes present two or three exons (except OsGATA16 that has four exons) and in all cases the DNA sequence encoding the zinc finger has been split between two exons. All these proteins also present 18 residues in the zinc finger loop.
In rice, subfamily III is constituted by 6 genes that have between 6 and 9 exons. As its Arabidopsis counterparts, the zinc finger, but not the carboxy-terminal basic region, is encoded in the fifth exon. This is a small exon of around 100 bp which encodes the 28 amino acids of the zinc finger motif almost exactly. Similar to Arabidopsis, proteins encoded by subfamily III genes in rice are characterized by the presence of 20 residues in the zinc finger loop (CX2CX20CX2C) and a CCT domain (see alignment in Fig. 2). Subfamily V is constituted by a group of sequences found only in rice. The two genes that form this subfamily (OsGATA22 and OsGATA23) encode large proteins with a CX2CX20CX2C zinc finger in the amino terminal part of the protein. Searches of additional domains using the SMART (Letunic et al., 2002
Finally, the phylogeny shown in Figure 4 groups two sequences (OsGATA27 and OsGATA28) in a well-supported branch. Deduced sequences from both genes present a single zinc finger in the form CX2CX18CX2C. In contrast to genes of other subfamilies, OsGATA27 and OsGATA28 have only one exon. In the combined phylogeny between the Arabidopsis and the rice sequences, OsGATA27 and OsGATA28 appear as an independent clade not related to any sequence from Arabidopsis (not shown). We have grouped these two genes in the subfamily VII.
Next, we wanted to examine in detail the relationships between the zinc fingers of the GATA factors from different subfamilies. Structural studies have demonstrated that the chicken GATA1 (cGATA1) DNA binding domain makes specific contacts with DNA in a region of about 55 residues (from amino acid 2 to residue +53 with respect to the first Cys; Omichinski et al., 1993
Figure 6
shows the phylogenetic tree constructed using the amino acid sequence alignment shown in Figure 5. Five well-supported clusters of sequences can be observed. In addition, some other branches, originating very deeply in the tree, are only distantly related to the four major groups. These clades correspond again to domains from multi-zinc finger proteins. One of the clusters contains all the CX2CX17CX2C motifs (none of them from plants). Another group comprises 27 sequences, including zinc fingers from Arabidopsis and rice GATA factors classified in subfamilies I, VI, and VII. We have named this clade Class A of plant zinc finger domains. These zinc fingers contain an 18-residue loop and are characterized by the presence of Gln and Thr in the seventh and seventeenth positions of the zinc finger loop (Gln-17 and Thr-27 in alignment of Fig. 5) in most of the sequences. All the members of this class also present high sequence conservation in the
Next we wanted to examine the relationship between the plant GATA factors and the other eukaryotic GATA factors. To this end, we carried out BLAST searches of the general GenBank databases using sequences of the different subfamilies of the plant GATA factors. The first interesting result was that sequence similarity between plant GATA factors and other eukaryotic GATA factors is exclusively restricted to the DNA binding domain (zinc finger and the contiguous basic region). Then, we performed an alignment of all the Arabidopsis and rice GATA zinc finger amino acid sequences (excluding the very divergent OsGATA25-N, OsGATA26-N, OsGATA24-3, and OsGATA24-4) together with other 24 GATA zinc finger sequences from different eukaryotic origins. We have included sequences containing 17, 18, 19, or 20 amino acids in the zinc finger loop. The outcoming alignment was used to construct a phylogenetic tree (Fig. 7 ). The topology of the tree indicates that all the fungal and metazoan CX2CX17CX2C fingers form a well-supported clade. In addition, high bootstrapping values also support the plant Class A, B, C, and D zinc finger lineages. All these lineages originate very deeply in the tree, suggesting that ancient plants may have had members of these groups. In agreement with this, we have found sequences encoding type A, B, and C zinc fingers in the genomes and EST collections of several other angiosperms. The relationship between the fungal and the plant GATA zinc fingers is uncertain given the low support of the clades that comprise fungal and plant sequences. Interestingly, with the exception of ASH1, all the plant and fungi GATA zinc fingers with 18-, 19-, and 20-residue loops contain the conserved residues Gly-21 and Pro-22, which are not present in the CX2CX17CX2C fingers. Furthermore, all 18-residue loop zinc fingers seem to be derived from 17-residue loops by insertion of one amino acid around position 15 of the loop. Nineteen- and 20-residue loop zinc fingers also present this insertion and, therefore, seem to derivate from 18-residue zinc fingers. These data support the common origin of the fungal and plant zinc fingers and suggest a monophyletic origin for all the GATA zinc finger domains with more than 17-residue loops.
Evolution and Divergence of Genes Encoding GATA Factors
Previous studies on the evolution and diversity of the GATA family of transcription factors in eukaryotes maintained that plant GATA factors possess only one zinc finger in the form CX2CX18CX2C (Teakle and Gilmartin, 1998 Despite the evolutionary history of the plant GATA gene family, an interesting aspect is the high number of GATA factors encoded by plant genomes in contrast to the relatively low number of these transcription factors found in other eukaryotes. For example, 6 GATA encoding genes are found in humans, 8 in Drosophila melanogaster, 10 in C. elegans, 11 in S. cerevisiae, and 4 in Schizosaccharomyces pombe. The reason for the expansion of this gene family in plants remains obscure but contrasts with the small number of demonstrated functions of these transcription factors in Arabidopsis. On the other hand, the high number of members of the family suggests a high functional redundancy, which may explain the low success of classical genetic strategies in the elucidation of the function of GATA factors in plants.
Most of the animal GATA factors present two zinc fingers. While the C-finger is involved in DNA binding, several different functions have been attributed to the N-finger (see introduction). Two genes encoding GATA factors with two zinc fingers are found in the rice genome. Interestingly, in both cases the C-finger belongs to the Class A while the N-finger shows variations with respect to the consensus. This suggests that, as in animals, the N-finger may be involved in other functions different to DNA binding, whereas the C-finger is probably responsible for the DNA binding activity. Plant bi-zinc finger GATA factors do not seem to be closely related to animal or fungi bi-zinc finger GATA factors. This is supported by the fact that both zinc fingers of animals and fungus GATA factors present 17 residues in the zinc finger loop while rice bi-finger proteins have 18 residues in the zinc finger loops (except the OsGATA25-N that have 16). Thus, it is more likely that rice bi-finger proteins appeared by the tandem duplication of a CX2CX18CX2C zinc finger, followed by the divergence of the N-finger to adapt to their specific new function. Lowry and Atchley have suggested that the GAF2, SREP, SREA, URBS1, and SRE fungi bi-zinc finger GATA factors could have raised independently of the metazoan bi-zinc finger GATA factors (Lowry and Atchley, 2000
The OsGATA24 gene encodes a putative protein with three complete zinc fingers and a region that clearly resembles a half zinc finger. To our knowledge, this is the first time that the existence of a protein with three GATA-like zinc fingers is reported in eukaryotes (Lowry and Atchley, 2000
Most of the sequence conservation within each class of plant GATA zinc fingers was found in the regions that correspond to the
GATA DNA motifs have been mostly implicated in light-dependent gene regulation in plants. I-boxes, originally defined as GATAA sequences, and other GATA-related motifs have been found in many light-regulated genes such as the RBCS, CAB (chlorophyll A/B binding protein), and GAP (glyceraldydyde-3-phosphate dehydrogenase) genes (Castresana et al., 1987
The fact that some GATA factors present domains also found in light signal transduction proteins could be related to its putative roles in light signaling. For example, GATA proteins encoded by genes of the subfamily III present a CCT domain. This domain is found in two proteins involved in light signaling: TOC1 and CO. TOC1 seems to be an important part of the circadian oscillator, controlling positively the level of LHY/CCA1 (for review, see Hayama and Coupland, 2003
Rice GATA factors encoded by subfamily V genes contain two domains also found in FAR1 and FHY3 proteins (Fig. 4). far1 and fhy3 mutants display a phenotype of reduced inhibition of hypocotyl elongation in far-red light, suggesting that they are involved in the phytochrome A signaling pathway. It has been recently reported that both proteins are related to transposases of type II MuDR family transposons (Hudson et al., 2003 Our genomic analysis shows the complexity and the potential interest of the GATA family of transcription factors in Arabidopsis and rice. Since direct genetics has not been successful in elucidating the role of these proteins in plant transcriptional regulation, reverse genetic approaches will probably be required. For those studies it will be potentially interesting to know the paralogous and ortologous relationships established in our study. It would also be interesting to establish the DNA binding specificities displayed by the different classes of plant GATA zinc fingers as well as to investigate the role of accompanying domains in light signaling and transcription.
Sequence Selection, Gene Structure, and Localization on Chromosomes To collect all Arabidopsis proteins containing GATA-like zinc fingers BLAST searches of the Arabidopsis genome were conducted at two different addresses: the National Center for Biological Information (NCBI; http://www.ncbi.nlm.nih.gov/BLAST/Genome/ara.html) and the Arabidopsis Information Resources (TAIR; http://www.arabidopsis.org/wublast/index2.html). BLAST searches were carried out using the amino acid sequence of several GATA factors from different origins (chicken GATA1, Aspergillus nidulans AreA, and Arabidopsis AtGATA1 and Neurospora crassa WC1 proteins). All sequences with an E-value below 4 x 104 were selected for further analysis. Arabidopsis nucleotide and proteins sequences as well as information regarding the gene structure was obtained from the Munich Information Center for Protein Sequences Database (MIPS, MATDB; http://mips.gsf.de/proj/thal/db). Arabidopsis EST sequences were searched in the TIGR Gene Indices at TIGR (http://tigrblast.tigr.org/tgi/), as well as in the GenBank EST collection at the TAIR BLAST 2.0 page (http://www.arabidopsis.org/Blast/), using the deduced nucleotide sequence of each Arabidopsis GATA gene. Deduced amino acid and cDNA sequences were compared, when possible, with those of the corresponding EST records.
Arabidopsis gene positions on chromosomes were determined using SeqViewer (http://arabidopsis.org/servlets/sv). Gene duplications and their presence on duplicated segments were investigated using the MIPS Redundancy Viewer (http://mips.gsf.de/proj/thal/db/gv/rv/) and the Simillion database (Simillion et al., 2002 To identify GATA transcription factor sequences in rice (Oryza sativa ssp. japonica and Oryza sativa ssp. indica), we searched four different databases using the BLAST program and derivatives: (1) sequences for japonica were obtained from the Rice Annotated Protein Database at The Institute for Genome Research (http://tigrblast.tigr.org/euk-blast/index.cgi?project=osa1); (2) genomic sequences for japonica and indica were also obtained from the rice BLAST page at the NCBI (http://www.ncbi.nlm.nih.gov/BLAST/Genome/PlantBlast.shtml?7); (3) rice EST sequences were searched in the TIGR Gene Indices at TIGR (http://tigrblast.tigr.org/tgi/); and (4) in the Knowledge-based Oryza Molecular biological Encyclopedia (http://cdna01.dna.affrc.go.jp/cDNA/Wblast.html) at the National Institute of Agrobiological Sciences. All sequences with an E-value below 4 x 104 were selected for further analysis. Nucleotide, amino acid sequences, gene structure, and chromosomal positions were obtained from the same databases mentioned before. Amino acid and cDNA sequences were corrected using the EST sequence information when discrepancy was found.
Conserved structural or functional domains of all amino acid sequences were annotated according to SMART (Letunic et al., 2002
Multiple alignments of amino acid sequences were performed using ClustalW (Thompson et al., 1994
Analysis of the updated databases has revealed the existence of an additional rice GATA factor encoding gene not previously included in our work. We have named the gene OsGATA20 (EST GenBank accession AK070729). Inclusion of the deduced amino acid sequence in our previously generated alignments and phylogenetic trees indicates that OsGATA29 is the only rice GATA gene of subfamily IV. This implies that subfamily IV is present both in Arabidopsis and rice, suggesting that this subfamily appears also before the divergence between monocot and dicot, as previously commented for subfamilies I, II, and III.
We thank Marika Lindahl, Gabriel Gutierrez, and Jose L. Crespo for critical reading of the manuscript. Received December 16, 2003; returned for revision January 29, 2004; accepted January 31, 2004.
1 This work was supported by Ministerio de Ciencia y Tecnología (grant nos. BMC200203198 and BMC20012635) and by Junta de Andalucía (group CV1802). Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.103.037788. * Corresponding author; e-mail: jcreyes{at}cica.es; fax 34954460065.
Argüello-Astorga G, Herrera-Estrella L (1998) Evolution of light-regulated plant promoters. Annu Rev Plant Physiol Plant Mol Biol 49: 525555[CrossRef][ISI]
Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer EL (2002) The Pfam protein families database. Nucleic Acids Res 30: 276280 Borello U, Ceccarelli E, Giuliano G (1993) Constitutive, light-responsive and circadian clock-responsive factors compete for the different l box elements in plant light-regulated promoters. Plant J 4: 611619[CrossRef][ISI][Medline] Caddick MX, Arst HN, Jr. (1990) Nitrogen regulation in Aspergillus: are two fingers better than one? Gene 95: 123127[CrossRef][Medline] Castresana C, Staneloni RJ, Malik VS, Cashmore AR (1987) Molecular characterization of two clusters of genes encoding the Type I CAB polypeptides of PSII in Nicotiana plumbaginifolia. Plant Mol Biol 10: 117126 Daniel-Vedele F, Caboche M (1993) A tobacco cDNA clone encoding a GATA-1 zinc finger protein homologous to regulators of nitrogen metabolism in fungi. Mol Gen Genet 240: 365373[ISI][Medline]
Fox AH, Kowalski K, King GF, Mackay JP, Crossley M (1998) Key residues characteristic of GATA N-fingers are recognized by FOG. J Biol Chem 273: 3359533603
Froehlich AC, Liu Y, Loros JJ, Dunlap JC (2002) White Collar-1, a circadian blue light photoreceptor, binding to the frequency promoter. Science 297: 815819
Gilmartin PM, Sarokin L, Memelink J, Chua NH (1990) Molecular light switches for plant genes. Plant Cell 2: 369378
Giuliano G, Pichersky E, Malik VS, Timko MP, Scolnik PA, Cashmore AR (1988) An evolutionarily conserved protein binding sequence upstream of a plant light-regulated gene. Proc Natl Acad Sci USA 85: 70897093
Griffiths S, Dunford RP, Coupland G, Laurie DA (2003) The evolution of CONSTANS-like gene families in barley, rice, and Arabidopsis. Plant Physiol 131: 18551867 Hayama R, Coupland G (2003) Shedding light on the circadian clock and the photoperiodic control of flowering. Curr Opin Plant Biol 6: 1319[CrossRef][ISI][Medline] Hudson ME, Lisch DR, Quail PH (2003) The FHY3 and FAR1 genes encode transposase-related proteins involved in regulation of gene expression by the phytochrome A-signaling pathway. Plant J 34: 453471[CrossRef][ISI][Medline] Jeong MJ, Shih MC (2003) Interaction of a GATA factor with cis-acting elements involved in light regulation of nuclear genes encoding chloroplast glyceraldehyde-3-phosphate dehydrogenase in Arabidopsis. Biochem Biophys Res Commun 300: 555562[CrossRef][ISI][Medline]
Lam E, Chua NH (1989) ASF-2: a factor that binds to the cauliflower mosaic virus 35S promoter and a conserved GATA motif in Cab promoters. Plant Cell 1: 11471156
Letunic I, Goodstadt L, Dickens NJ, Doerks T, Schultz J, Mott R, Ciccarelli F, Copley RR, Ponting CP, Bork P (2002) Recent improvements to the SMART domain-based sequence annotation resource. Nucleic Acids Res 30: 242244
Lisch DR, Freeling M, Langham RJ, Choy MY (2001) Mutator transposase is widespread in the grasses. Plant Physiol 125: 12931303 Lowry JA, Atchley WR (2000) Molecular evolution of the GATA family of transcription factors: conservation within the DNA-binding Domain. J Mol Evol 50: 103115[ISI][Medline]
Maxon ME, Herskowitz I (2001) Ash1p is a site-specific DNA-binding protein that actively represses transcription. Proc Natl Acad Sci USA 98: 14951500 Mouradov A, Cremer F, Coupland G (2002) Control of flowering time: interacting pathways as a basis for diversity. Plant Cell Suppl 14: S111S130 Nishii A, Takemura M, Fujita H, Shikata M, Yokota A, Kohchi T (2000) Characterization of a novel gene encoding a putative single zinc-finger protein, ZIM, expressed during the reproductive phase in Arabidopsis thaliana. Biosci Biotechnol Biochem 64: 14021409[CrossRef][Medline]
Newton A, Mackay J, Crossley M (2001) The N-terminal zinc finger of the erythroid transcription factor GATA-1 binds GATC motifs in DNA. J Biol Chem 276: 3579435801
Omichinski JG, Clore GM, Schaad O, Felsenfeld G, Trainor C, Appella E, Stahl SJ, Gronenborn AM (1993) NMR structure of a specific DNA complex of Zn-containing DNA binding domain of GATA-1. Science 261: 438446 Patient RK, McGhee JD (2002) The GATA family (vertebrates and invertebrates). Curr Opin Genet Dev 12: 416422[CrossRef][ISI][Medline] Pedone PV, Omichinski JG, Nony P, Trainor C, Gronenborn AM, Clore GM, Felsenfeld G (1997) The N-terminal fingers of chicken GATA-2 and GATA-3 are independent sequence-specific DNA binding domains. EMBO J 16: 28742882[CrossRef][ISI][Medline] Puente P, Wei N, Deng XW (1996) Combinatorial interplay of promoter elements constitutes the minimal determinants for light and developmental control of gene expression in Arabidopsis. EMBO J 15: 37323743[ISI][Medline] Rastogi R, Bate NJ, Sivasankar S, Rothstein SJ (1997) Footprinting of the spinach nitrite reductase gene promoter reveals the preservation of nitrate regulatory elements between fungi and higher plants. Plant Mol Biol 34: 465476[CrossRef][Medline] Ravagnani A, Gorfinkiel L, Langdon T, Diallinas G, Adjadj E, Demais S, Gorton D, Arst HN Jr, Scazzocchio C (1997) Subtle hydrophobic interactions between the seventh residue of the zinc finger loop and the first base of an HGATAR sequence determine promoter-specific recognition by the Aspergillus nidulans GATA factor AreA. EMBO J 16: 39743986[CrossRef][ISI][Medline] Robson F, Costa MM, Hepworth SR, Vizir I, Pineiro M, Reeves PH, Putterill J, Coupland G (2001) Functional importance of conserved domains in the flowering-time gene CONSTANS demonstrated by analysis of mutant alleles and transgenic plants. Plant J 28: 619631[CrossRef][ISI][Medline] Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4: 406425[Abstract] Scazzocchio C (2000) The fungal GATA factors. Curr Opin Microbiol 3: 126131[CrossRef][ISI][Medline] Schindler U, Cashmore AR (1990) Photoregulated gene expression may involve ubiquitous DNA binding proteins. EMBO J 9: 34153427[ISI][Medline] Schwechheimer C, Smith C, Bevan MW (1998) The activities of acidic and glutamine-rich transcriptional activation domains in plant cells: design of modular transcription factors for high-level expression. Plant Mol Biol 36: 195204[CrossRef][ISI][Medline]
Shoichet SA, Malik TH, Rothman JH, Shivdasani RA (2000) Action of the Caenorhabditis elegans GATA factor END-1 in Xenopus suggests that similar mechanisms initiate endoderm development in ecdysozoa and vertebrates. Proc Natl Acad Sci USA 97: 40764081
Simillion C, Vandepoele K, Van Montagu MC, Zabeau M, Van de Peer Y (2002) The hidden duplication past of Arabidopsis thaliana. Proc Natl Acad Sci USA 99: 1362713632 Starich MR, Wikstrom M, Arst HN Jr, Clore GM, Gronenborn AM (1998) The solution structure of a fungal AREA protein-DNA complex: an alternative binding mode for the basic carboxyl tail of GATA factors. J Mol Biol 277: 605620[CrossRef][ISI][Medline] Starich MR, Wikstrom M, Schumacher S, Arst HN Jr, Gronenborn AM, Clore GM (1998) The solution structure of the Leu22-->Val mutant AREA DNA binding domain complexed with a TGATAG core element defines a role for hydrophobic packing in the determination of specificity. J Mol Biol 277: 621634[CrossRef][ISI][Medline]
Strayer C, Oyama T, Schultz TF, Raman R, Somers DE, Mas P, Panda S, Kreps JA, Kay SA (2000) Cloning of the Arabidopsis clock gene TOC1, an autoregulatory response regulator homolog. Science 289: 768771 Swofford DL (1998) PAUP Phylogenetic Analysis Using Parsimony (and Other Methods), Version 4. Sinauer Associates, Sunderland, MA Teakle GR, Gilmartin PM (1998) Two forms of type IV zinc-finger motif and their kingdom-specific distribution between the flora, fauna and fungi. Trends Biochem Sci 23: 100102[ISI][Medline] Teakle GR, Manfield IW, Graham JF, Gilmartin PM (2002) Arabidopsis thaliana GATA factors: organisation, expression and DNA-binding characteristics. Plant Mol Biol 50: 4357[CrossRef][ISI][Medline] Terzaghi WB, Cashmore AR (1995) Light-regulated transcription. Annu Rev Plant Physiol Plant Mol Biol 46: 445474[CrossRef][ISI]
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 48764882
Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 46734680
Trainor CD, Ghirlando R, Simpson MA (2000) GATA zinc finger interactions modulate DNA binding and transactivation. J Biol Chem 275: 2815728166 Tsang AP, Visvader JE, Turner CA, Fujiwara Y, Yu C, Weiss MJ, Crossley M, Orkin SH (1997) FOG, a multitype zinc finger protein, acts as a cofactor for transcription factor GATA-1 in erythroid and megakaryocytic differentiation. Cell 90: 109119[CrossRef][ISI][Medline] This article has been cited by other articles:
|