|
|
||||||||
|
Plant Physiology 141:1167-1184 (2006) © 2006 American Society of Plant Biologists Genome-Wide Analysis of Basic/Helix-Loop-Helix Transcription Factor Family in Rice and Arabidopsis1,[W]Shanghai Jiao Tong University-Shanghai Institutes for Biological Sciences-Pennsylvania State University Joint Center for Life Sciences, Key Laboratory of Microbial Metabolism, Ministry of Education, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, People's Republic of China, 200240 (X.L., H.J., Z.Y., W.L., H.M., D.Z.); School of Life Science, Shanghai University, Shanghai, People's Republic of China, 200444 (X.D., Y.T., J.G., J.Y., J.W.); School of Life Science, Xiamen University, Xiamen, People's Republic of China, 361005 (X.L., L.C.); Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China, 200032 (H.M., D.Z.); and Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802 (Y.S., H.M.)
The basic/helix-loop-helix (bHLH) transcription factors and their homologs form a large family in plant and animal genomes. They are known to play important roles in the specification of tissue types in animals. On the other hand, few plant bHLH proteins have been studied functionally. Recent completion of whole genome sequences of model plants Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) allows genome-wide analysis and comparison of the bHLH family in flowering plants. We have identified 167 bHLH genes in the rice genome, and their phylogenetic analysis indicates that they form well-supported clades, which are defined as subfamilies. In addition, sequence analysis of potential DNA-binding activity, the sequence motifs outside the bHLH domain, and the conservation of intron/exon structural patterns further support the evolutionary relationships among these proteins. The genome distribution of rice bHLH genes strongly supports the hypothesis that genome-wide and tandem duplication contributed to the expansion of the bHLH gene family, consistent with the birth-and-death theory of gene family evolution. Bioinformatics analysis suggests that rice bHLH proteins can potentially participate in a variety of combinatorial interactions, endowing them with the capacity to regulate a multitude of transcriptional programs. In addition, similar expression patterns suggest functional conservation between some rice bHLH genes and their close Arabidopsis homologs.
Since the discovery of the basic/helix-loop-helix (bHLH) motif with DNA-binding and dimerization capabilities (Murre et al., 1989 helices with a linking loop of variable lengths; the amphipathic helices of two bHLH proteins can interact, allowing the formation of homodimers or heterodimers (Murre et al., 1989
According to their phylogenetic relationships, DNA-binding motifs, and functional properties, known bHLH proteins from animals have been divided into six main groups (named as group A to F; Atchley and Fitch, 1997
Compared to animals, only a small number of plant bHLH proteins have been characterized functionally. In Arabidopsis (Arabidopsis thaliana), a model for flowering plants (particularly eudicots), 162 bHLH-encoding genes have been identified from the analysis of genome sequences (Bailey et al., 2003
Rice is one of the most important food crops in the world and it has been used as a major model species in plant (especially monocot) functional genomics research because of its relatively small genome size (approximately 390 Mb) and synteny with other cereal genomes (Gale and Devos, 1998
Identification of 167 OsbHLH Genes
To obtain sequences of bHLH genes in the rice genome, we used the criteria developed by Atchley et al. (1999)
Initially, we used the bHLH domain (64 amino acids) encoded by a putative rice bHLH gene (GeneBank number XM_463907) as a BLAST query to identify a large number of candidate bHLH sequences in The Institute for Genomic Research (TIGR) database, because this sequence fit the bHLH motif best among the known rice bHLH proteins. Because of the sequence variation among known bHLH domains, to detect additional possible bHLH domain sequences we used position-specific iterated BLAST to search the database of TIGR (version 4, 2006). Subsequently, TBLASTN was used to remove redundant sequences of candidate bHLH genes according to their corresponding sequencing bacterial artificial chromosome clone serial numbers and their chromosome locations, resulting in 167 OsbHLH genes (Table I
; Supplemental Table I). The number designation of the OsbHLH genes was based on the order of the multiple sequence alignment (Supplemental Fig. 1) and the synonymy between the names of these 167 OsbHLH genes and the previously reported 131 rice genes by Buck and Atchley (2003)
To verify the reliability of our criteria, we performed simple modular architecture research tool (SMART) analysis of the 167 putative OsbHLH protein sequences and found that 164 proteins had a typical bHLH domain and three, OsbHLH157, OsbHLH160, and OsbHLH161, contained a predicted HLH domain with low confidence values. In addition, the OsbHLH026 protein unexpectedly had two HLH domains predicted by SMART, and their E values were 8.46E-13 and 1.56E-02, respectively. The amino acid sequences of these two bHLH domains were 76% similar, with 14 identical amino acids among the 17 amino acids of the basic region and a predicated binding activity to the G box. To date, five Caenorhabditis elegans proteins have been reported to have two bHLH domains (Ledent et al., 2002
To examine sequence features of these rice bHLH domains, we performed multiple sequence alignment of the 167 rice bHLH amino acid sequences (Supplemental Fig. 1). On average, the basic regions (the N-terminal 17 positions; Supplemental Fig. 1) of OsbHLH domains have 5.7 basic residues, even though 26 of these proteins did not have the basic region. Within subsets of OsbHLH domains, there is further conservation of nonbasic residues in the basic region, as well as in the two helices and in a C-terminal region of the second helix (Supplemental Fig. 1). In contrast, the loop was the most divergent region in terms of both length (ranging from 318 amino acids) and amino acid composition. From the alignment, we identified 19 residues that are identical in at least 50% of the 167 rice bHLH domains (Supplemental Fig. 1, indicated at the bottom of the alignment). Figure 1
shows the distribution of amino acid residues at the 19 positions of the consensus motif of the bHLH domain, including the results from two previous reports (Atchley et al., 1999
The basic region of the bHLH domain has the ability to bind to DNA and is critical for function (Massari and Murre, 2000
Phylogenetic Analysis of the OsbHLH Genes
To obtain clues about the evolutionary history of the OsbHLH genes, a neighbor-joining (NJ) phylogenetic tree was generated using the multiple sequence alignments of the OsbHLH protein sequences with bootstrap analysis (1,000 replicates). The position of the bHLH domain and any conserved sequence motifs outside of the bHLH domain are shown in Figure 3. We subdivided the 167 members of the OsbHLH family into 22 subfamilies, designated A to V, according to clades with at least 50% bootstrap support. In addition, we noted that most of the members in the same subfamilies shared one or more motifs outside the bHLH domain, further supporting the subfamily definition. A total of 40 motifs outside of the bHLH domain were discovered (Supplemental Table III). However, most of these motifs have not been characterized except Leu-ZIP shared by the members of subfamily R; Leu-ZIP is known as a motif involved in protein dimerization (Tong et al., 1997
The pattern of intron positions can also provide important evidence to support phylogenetic relationships in a gene family. Among 167 rice bHLH genes, the number of introns ranged from zero to four, with 87.4% of these 167 genes having intron(s) in the bHLH domain; these genes can be grouped into 10 patterns of intron presence and positions (Fig. 4A
, IIII, VX, and XII). Among these 10 patterns, the most common ones had one or more introns at three highly conserved positions (indicated by white inverted triangles), accounting for 82.0% of the 167 genes (Fig. 4A, IIII, V and VI). The remaining patterns had introns at varying positions (patterns VIIX, and XII) and were observed in only 5.4% of the 167 genes. Furthermore, we investigated intron phases with respect to codons. An intron was designated as occurring in one of three phases: in phase 1, splicing occurred after the first nucleotide of the codon; in phase 2, splicing occurred after the second nucleotide; and in phase 0, splicing occurred after the third nucleotide of the codon (Sharp, 1981
Exons with the same splicing phase at both 5' and 3' ends are called symmetric exons. According to the intron-early theory (Gilbert, 1987
To determine the genomic distribution of the OsbHLH genes, the DNA sequence of each OsbHLH gene was used to search the rice genome database using BLASTN. Although each of the 12 rice chromosomes contains some OsbHLH genes, the distribution seems to be uneven (Fig. 5 ). Relatively high densities of bHLH genes were observed in some chromosomal regions, including the top and bottom of chromosomes 1, 2, and 3, and the bottom of chromosomes 4, 8, and 9. In particular, 17 OsbHLH members are located on the long arm of chromosome 4. In contrast, several large chromosomal regions lacked bHLH genes, such as the top half of chromosomes 4 and 9 and the central sections of chromosomes 7, 8, 11, and 12. Fourteen OsbHLH gene clusters were identified by members with high levels of sequence similarity (Fig. 5); for instance, the entire protein sequences of OsbHLH081 and OsbHLH082 share 75% similarity, and OsbHLH013 and OsbHLH015 are 68% similar (Fig. 5, linked with red line).
Genome duplication events are thought to have occurred throughout the process of plant evolution (Kent et al., 2003 Additional evidence for a common origin of closely related bHLHs came from the intron position patterns in the bHLH domain. As shown in Figure 5, the genes related by putative duplications shared conserved intron position pattern. Only a few pairs of the probable duplicated genes, e.g. OsbHLH104 and OsbHLH152, in the same subfamily had different intron distribution patterns, which can be explained by a loss (or gain) of intron following the duplication event.
Using the alignment of the bHLH domain amino acid sequences of OsbHLHs and AtbHLHs (Supplemental Fig. 3), a phylogenetic tree was constructed (Fig. 6
). Because of the large number of taxa and relatively small number of characters, the bootstrap values of internal nodes were low; nevertheless, the outer nodes had more credible bootstrap values, allowing for clustering of the bHLH genes of rice and Arabidopsis into 25 subfamilies (AY). In addition, our analysis of the OsbHLH gene family (Fig. 3, AV) and the AtbHLH result (121) of Toledo-Ortiz et al. (2003) Moreover, intron position patterns of the OsbHLHs were also consistent with the phylogenetic subfamilies defined in Figure 3. For instance, the members in subfamily A had the same intron distribution pattern, and so did members of the subfamilies G, I, K, L, N, O, P, Q, S, U, and V (Fig. 4B). Members of other subfamilies had the same intron distribution pattern with one or two exceptions. In addition, the intron/exon position pattern shown in Figure 4A agreed with the evolutionary relationship between OsbHLHs and AtbHLHs (see below; Fig. 6). There were 12 different groups of intron position patterns among OsbHLH and AtbHLH domains. Nine of the patterns are shared by the genes from both rice and Arabidopsis, although patterns IV and XI are found only among Arabidopsis genes, and pattern VII was only present in one rice gene. The nonconserved patterns shared by AtbHLH and OsbHLH showed that most of the intron patterns existed in the ancestor of monocots and eudicots. The percentages of each pattern in AtbHLHs and OsbHLHs were quite close; e.g. pattern I was found in 32.3% of OsbHLH members and 28.6% of AtbHLH members. The Arabidopsis bHLH introns also had identical splicing phase to those of subfamily members in OsbHLH domains. The gene structures in terms of intron position and length were also displayed in Figure 6 to provide further clues about the evolutionary relationships among OsbHLHs and AtbHLHs. Most members in the same subfamilies had similar intron/exon structure. For example, members of subfamily P had only one intron with similar lengths. The fact that they not only had similar coding sequences but also very similar intron/exon structure supports their close evolutionary relationship and membership in the same subfamily. We also examined the sizes of introns and found that most members of the same bHLH subfamily had similar intron patterns, while the sizes of their introns of some members were similar too, i.e. the members of subfamily K have the single intron of 99 bp (OsbHLH149), 123 bp (OsbHLH150), and 130 bp (OsbHLH151), respectively (Fig. 6). Approximately 73% of the introns existing within the domains of OsbHLHs were shorter than 300 bp, and other introns in this bHLH domain of these family members were more than 300 bp. Although rice and Arabidopsis bHLH genes share many similarities, there were a few differences of intron patterns between the OsbHLH and AtbHLH domains. The length of one intron within the bHLH domain of two OsbHLHs (OsbHLH064 and OsbHLH076) was more than 5 kb (Fig. 6), but no AtbHLH gene had such a long intron. This might have resulted from insertion(s) into introns in OsbHLH family members after the divergence of monocots and eudicots. A similar explanation would be reasonable for the fact that a protein with two bHLH domains was observed in the rice genome, but not in Arabidopsis. Overall, combining the bHLH domain intron patterns and the bHLH subfamilies, we can recognize two major categories of intron patterns that correspond to two major groups of subfamilies. The first one includes subfamilies A to K and W, mainly with members that have intron pattern I, which had the introns 1, 2, and 3 (Fig. 4B). Other intron patterns for members of these families can be explained by loss of specific introns in different lineages, starting from pattern I. For example, in the subfamily B, intron pattern II with intron 1 and 2, III with intron 1 and 3, IV with intron 2 and 3, and V with intron 1 might be obtained by losing one or two introns. The other category, including subfamilies L to V and X and Y mainly consisted of members with intron pattern VI, which had only the intron 2 (Fig. 4B). Members of some subfamilies without an intron in the bHLH, such as subfamilies A, W, H, I, and N, might have lost all introns. The intron position patterns in the bHLH family strongly support the hypothesis that introns have lost independently multiple times.
We have also observed that some members of different bHLH subfamilies were located within the same small chromosomal region, whereas some members of the same subfamily were distributed in different chromosomal regions, suggesting that bHLH genes were distributed widely in the genome of the common ancestor of monocots and eudicots. The phylogenetic tree of Arabidopsis and rice bHLH genes (Fig. 6) provides a way to estimate the number of bHLH genes in the MRCA. There were 45 branches with bootstrap values of 50 or greater that included both Arabidopsis and rice bHLH members, 11 branches had only Arabidopsis members, and 10 branches had only rice members. This result suggests that there were at least 66 bHLH genes in the MRCA of monocots and eudicots. Furthermore, the phylogenetic analysis provides evidence for birth-and-death evolution (Nei et al., 1997
Because the expression pattern of a gene is often correlated with its function, we examined the expression information of OsbHLHs and AtbHLHs using reverse transcription (RT)-PCR analysis, microarray experiments, expressed sequence tag (EST) data of the National Center for Biotechnology Information (NCBI), and massively parallel signature sequencing (MPSS) data. We first analyzed the expression of the OsbHLHs using RT-PCR with RNA from rice root, leaf, stem, flower, and seed (Fig. 6). The RT-PCR products of a number of OsbHLHs have been confirmed by sequencing, providing a measure of the reliability of the RT-PCR results of OsbHLHs expression. In addition, we searched for information on OsbHLHs through the EST data from NCBI and the expression data of MPSS. Even though the EST information was incomplete, we found EST data for 61 OsbHLHs (May, 2005). Forty seven of these OsbHLHs with EST data had positive RT-PCR results (Fig. 6), whereas a few OsbHLHs had EST data but were not detected by RT-PCR (Fig. 6, represented by the boxes with italic lines). Expression information from the MPSS database demonstrated that 93 OsbHLH genes are expressed (Fig. 6), but 38 bHLH genes with positive RT-PCR signals were not detected by MPSS. In total, after integrating these data together, 33 OsbHLHs, such as OsbHLH012 and OsbHLH028, were not detectably expressed according to RT-PCR, EST, and MPSS data (Fig. 6, boxes with X). These genes might be pseudogenes, or expressed at specific developmental stages or under special conditions. Furthermore, we summarize the expression of AtbHLHs from RT-PCR analysis (Heim et al., 2003
From Figure 6, 72 bHLH genes were expressed in all four tissues tested, suggesting that many bHLHs play regulatory roles at multiple development stages in rice and Arabidopsis. For example, both rice (OsbHLH031, OsbHLH032, and OsbHLH033) and Arabidopsis (AtbHLH046 and AtbHLH102) members of subfamily L are expressed in all four organs. It is possible that some members have preferential expression that is specific tissues or cells within these organs. Some bHLH genes show preferential expression, including 10 rice and Arabidopsis bHLH genes with expression in the root, one in the stem, nine in the leaf, and 30 in the flower and seed. This result indicates that members of this large family might take part in different biological processes in rice. It might be a common character of large transcription factor families, such as MYB family (Martin and Paz-Ares, 1997
Eleven rice bHLH genes have been characterized. For example, LAX (OsbHLH122) regulates shoot branching (Komatsu et al., 2003
Although the function of most rice bHLH genes is unknown, the phylogenetic and expression analyses provide a solid foundation for future functional studies in both rice and Arabidopsis. Identification of putative orthologs in different species will benefit the study of gene function, such as AtMYC2 (Abe et al., 2003
We have performed extensive analyses of the rice bHLH genes and compared them with Arabidopsis bHLH genes. We found that the rice and Arabidopsis bHLH genes form 25 subfamilies that are supported by phylogeny, additional protein motifs, and intron/exon structures. This phylogenetic analysis is in good agreement with previous results; at the same time, it presents new members in some existing subfamilies and defines new subfamilies by including additional bHLH genes from the completed rice genome sequence. The fact that the majority of subfamilies contain members from both rice and Arabidopsis suggests that the functions of most of bHLH genes are possibly conserved during angiosperm evolution. In addition, we estimate that the MRCA of monocots and eudicots had at least 66 bHLH genes. Phylogenetic analysis also suggests that there have been numerous gene birth events in this gene family during the evolution of flowering plants, in part due to putative genome duplications, compared with relatively few gene death events. The analysis of intron/exon structures revealed that most introns have conserved positions and phases, providing the evidence for the intron-early theory, and that multiple independent intron loss events likely have occurred during the evolution of flowering plants. Extensive expression data and available functional data support the hypothesis that bHLH genes in plants perform a variety of functions in different tissues at multiple developmental stages, also this summarized expression data of bHLH genes can be easily referred by readers. In short, our studies indicate that the ancient bHLH gene family has likely expanded considerably during flowering plant evolution to include many relatively young members, allowing both the conservation and divergence of gene function. Our results have established a solid foundation for future studies using molecular genetic, biochemical, physiological, and developmental approaches, which will likely reveal the functional significance of this dynamic and fascinating gene family.
Database Search for Rice bHLH Genes
To find the assembly of ESTs as candidate bHLH genes, one method of the BLAST (Altschul et al., 1990 Based on the results of BLASTN searches in the rice genome database of NCBI using the predicted cDNA sequences of rice bHLH genes, we obtained the information of the chromosome locations of these genes. Also, we obtained the information of intron distribution pattern and intron/exon boundaries of these bHLH genes from both the results of BLASTN and the TIGR annotation database. The data sets that were retrieved from the TBLASTN search and the annotation database were combined as our rice data set.
To further confirm the obtained cDNA sequences, the nucleotide sequences were translated into amino acid sequences, which were then examined for the bHLH domain using the hidden Markov model of SMART tool (http://smart.embl-heidelberg.de/; Schultz et al., 1998
Multiple sequence alignments using the Multalin tool (http://prodes.toulouse.inra.fr/multalin/; Corpet, 1988
To compare the evolutionary relationship of rice and Arabidopsis (Arabidopsis thaliana) bHLHs, we also performed Multalin tool using our obtained OsbHLH domains and 147 AtbHLH domains predicted by Toledo-Ortiz et al. (2003)
A phylogenetic tree was constructed with the aligned rice bHLH protein sequences using MEGA (version 3.0; http://www.megasoftware.net/index.html; Kumar et al., 2004 The phylogenetic tree of the AtbHLH and OsbHLH domains was developed by using PHYLIP, and the resulting clades were assessed by bootstrap of 1,000 replicates. The Dayhoff PAM matrix in the protein distance algorithm and NJ method were employed to construct the unrooted tree. The constructed tree file was visualized by TreeView1.6.6 (http://taxonomy.zoology.gla.ac.uk/rod/rod.html).
Following verification of the location of OsbHLH genes, the distribution of OsbHLH family members throughout the rice genome was drawn by the software MapInspect (http://www.dpw.wau.nl/pv/pub/MapComp/). For the detection of large segmental duplications, we consulted the duplicated blocks map provided by Xiong et al. (2005)
We used RT-PCR to detect the expression patterns of the OsbHLHs. The PCR primers were designed to avoid the conserved region and to amplify products of 150 to 400 bp long. Primer sequences were shown in detail in Supplemental Table II. RNA of roots, leaves, stems, and flowers of rice japonica cultivar Nipponbare was isolated from the plants with 8 to 10 cm inflorescences. The plants grew in the greenhouse under long-day conditions. Total RNA was isolated as described (Chomczynski and Sacchi, 1987
We thank Haisheng Liu and Huayong Xu for useful suggestions at the beginning of this work, Mingjiao Chen for supply of the rice material, and Professor Mingsheng Chen for helpful suggestions on phylogenetic analysis. Received March 16, 2006; returned for revision May 16, 2006; accepted May 20, 2006.
1 This work was supported by the funds from the National Key Basic Research Developments Program of the Ministry of Science and Technology, People's Republic of China (2001CB109002 and 2005CB120802), National 863 High-Tech Project (2005AA2710330), Shanghai Municipal Committee of Science and Technology (03JC14061), the Program for New Century Excellent Talents in University (NCET040403), the Shuguang Scholarship (04SG15), the Shanghai Institutes of Biological Sciences (Reproductive Development Project), and the U.S. Department of Energy (DEFG0202ER15332).
2 These authors contributed equally to the paper. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Dabing Zhang (zhangdb{at}sjtu.edu.cn).
[W] The online version of this article contains Web-only data. www.plantphysiol.org/cgi/doi/10.1104/pp.106.080580. * Corresponding author; e-mail zhangdb{at}sjtu.edu.cn; fax 862134204869.
Abe H, Urao T, Ito T, Seki M, Shinozaki K, Yamaguchi-Shinozaki K (2003) Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) function as transcriptional activators in abscisic acid signaling. Plant Cell 15: 6378 Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403410[CrossRef][ISI][Medline] Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 33893402 Atchley WR, Fitch WM (1997) A natural classification of the basic helix-loop-helix class of transcription factors. Proc Natl Acad Sci USA 94: 51725176 Atchley WR, Terhalle W, Dress A (1999) Positional dependence, cliques, and predictive motifs in the bHLH protein domain. J Mol Evol 48: 501516[CrossRef][ISI][Medline] Bailey PC, Martin C, Toledo-Ortiz G, Quail PH, Huq E, Heim MA, Jakoby M, Werber M, Weisshaar B (2003) Update on the basic helix-loop-helix transcription factor gene family in Arabidopsis thaliana. Plant Cell 15: 24972502 Bailey TL, Elkan C (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 2: 2836[Medline] Brown DD, Wensink PC, Jordan E (1972) A comparison of the ribosomal DNA's of Xenopus laevis and Xenopus mulleri: the evolution of tandem genes. J Mol Biol 63: 5773[CrossRef][ISI][Medline] Brownlie P, Ceska T, Lamers M, Romier C, Stier G, Teo H, Suck D (1997) The crystal structure of an intact human Max-DNA complex: new insights into mechanisms of transcriptional control. Structure 5: 509520[Medline] Buck MJ, Atchley WR (2003) Phylogenetic analysis of plant basic helix-loop-helix proteins. J Mol Evol 56: 742750[CrossRef][ISI][Medline] Cannon SB, Mitra A, Baumgarten A, Young ND, May G (2004) The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol 4: 10[CrossRef][Medline] Chomczynski P, Sacchi N (1987) Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem 162: 156159[ISI][Medline] Corpet F (1988) Multiple sequence alignment with hierarchical clustering. Nucleic Acids Res 16: 1088110890 Crozatier M, Valle D, Dubois L, Ibnsouda S, Vincent A (1996) Collier, a novel regulator of Drosophila head development, is expressed in a single mitotic domain. Curr Biol 6: 707718[CrossRef][ISI][Medline] Ellenberger T, Fass D, Arnaud M, Harrison SC (1994) Crystal structure of transcription factor E47: E-box recognition by a basic region helix-loop-helix dimer. Genes Dev 8: 970980 Eulgem T, Rushton PJ, Robatzek S, Somssich IE (2000) The WRKY superfamily of plant transcription factors. Trends Plant Sci 5: 199206[CrossRef][ISI][Medline] Facchini LM, Penn LZ (1998) The molecular role of Myc in growth and transformation: recent discoveries lead to new insights. FASEB J 12: 633651 Fairman R, Beran-Steed RK, Anthony-Cahill SJ, Lear JD, Stafford WF III, DeGrado WF, Benfield PA, Brenner SL (1993) Multiple oligomeric states regulate the DNA binding of helix-loop-helix peptides. Proc Natl Acad Sci USA 90: 1042910433 Felsenstein J (1989) PHYLIP: Phylogeny Inference Package. Cladistics 5: 164166 Fisher A, Caudy M (1998) The function of hairy-related bHLH repressor proteins in cell fate decisions. Bioessays 20: 298306[CrossRef][ISI][Medline] Gale MD, Devos KM (1998) Comparative genetics in the grasses. Proc Natl Acad Sci USA 95: 19711974 Gilbert W (1987) The exon theory of genes. Cold Spring Harb Symp Quant Biol 52: 901905[ISI][Medline] Goding CR (2000) Mitf from neural crest to melanoma: signal transduction and transcription in the melanocyte lineage. Genes Dev 14: 17121728 Guyot R, Keller B (2004) Ancestral genome duplication in rice. Genome 47: 610614[Medline] Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser 41: 9598 Heim MA, Jakoby M, Werber M, Martin C, Weisshaar B, Bailey PC (2003) The basic helix-loop-helix transcription factor family in plants: a genome-wide study of protein structure and functional diversity. Mol Biol Evol 20: 735747 Henriksson M, Luscher B (1996) Proteins of the Myc network: essential regulators of cell growth and differentiation. Adv Cancer Res 68: 109182[ISI][Medline] Hu J, Anderson B, Wessler SR (1996) Isolation and characterization of rice R genes: evidence for distinct evolutionary paths in rice and maize. Genetics 142: 10211031[Abstract] Hua X, Yokoyama C, Wu J, Briggs MR, Brown MS, Goldstein JL, Wang X (1993) SREBP-2, a second basic-helix-loop-helix-leucine zipper protein that stimulates transcription by binding to a sterol regulatory element. Proc Natl Acad Sci USA 90: 1160311607 |