|
|
||||||||
|
First published online November 23, 2005; 10.1104/pp.105.070722 Plant Physiology 139:1870-1880 (2005) © 2005 American Society of Plant Biologists OPEN ACCESS ARTICLE
Expression Profile of Two Storage-Protein Gene Families in Hexaploid Wheat Revealed by Large-Scale Analysis of Expressed Sequence Tags1,[W],[OA]Laboratory of Genetic Engineering, Kyoto Prefectural University and Kyoto Prefectural Institute of Agricultural Biotechnology, Shimogamo, Kyoto 6068522, Japan (K.K., Y.O.); and Nagahama Institute of Bio-Science and Technology, Shiga 5260829, Japan (K.M.)
To discern expression patterns of individual storage-protein genes in hexaploid wheat (Triticum aestivum cv Chinese Spring), we analyzed comprehensive expressed sequence tags (ESTs) of common wheat using a bioinformatics technique. The gene families for / -gliadins and low molecular-weight glutenin subunit were selected from the EST database. The alignment of these genes enabled us to trace the single nucleotide polymorphism sites among both genes. The combinations of single nucleotide polymorphisms allowed us to assign haplotypes into their homoeologous chromosomes by allele-specific PCR. Phylogenetic analysis of these genes showed that both storage-protein gene families rapidly diverged after differentiation of the three genomes (A, B, and D). Expression patterns of these genes were estimated based on the frequencies of ESTs. The storage-protein genes were expressed only during seed development stages. The / -gliadin genes exhibited two distinct expression patterns during the course of seed maturation: early expression and late expression. Although the early expression genes among the / -gliadin and low molecular-weight glutenin subunit genes showed similar expression patterns, and both genes from the D genome were preferentially expressed rather than those from the A or B genome, substantial expression of two early expression genes from the A genome was observed. The phylogenetic relationships of the genes and their expression patterns were not correlated. These lines of evidence suggest that expression of the two storage-protein genes is independently regulated, and that the / -gliadin genes possess novel regulation systems in addition to the prolamin box.
Comprehensive analyses of expressed sequence tags (ESTs) have been carried out among various plant species to provide powerful tools for functional genomics, such as DNA microarray, gene chips, databases for comparative genomics, and single nucleotide polymorphism (SNP) analysis (Ewing et al.,1999
The gliadins and the glutenins are major components of the storage proteins in wheat endosperm. Wheat gluten is composed of a protein complex of monomeric gliadins and polymeric glutenins, and this complex also plays a substantial role in determining processed food quality (for review, see Shewry et al., 2003
Estimated copy numbers of storage-protein genes in the hexaploid wheat genome differ among the cultivars from >100 (Okita et al., 1985
Here, we carefully traced the expression patterns of individual genes encoding the
Assembling Multigenes for / -Gliadin and LMW-GS
Through computer analysis, 361,180 ESTs derived from 32 cDNA libraries that confer more than 10,000 ESTs (Supplemental Table I) were collected and grouped into 53,976 contigs by the phrap method (University of Washington Genome Center; http://www.genome.washington.edu/UWGC). The phrap parameters were sufficiently strict to classify ESTs into each multigene from three homoeologous genomes (Mochida et al., 2003
On the other hand, the number of genes for LMW-GS in hexaploid wheat was previously estimated to be 22 to 39 (Sabelli and Shewry, 1991
The number of EST members comprising each contig varied from two to 114 for
To assess the phylogenetic relationships among the individual genes for both storage proteins, we aligned the coding regions of respective contigs with previously reported counterpart genes (Table I). Dendrograms showing the phylogenetic relationships of both genes were constructed using Clustal W (Thompson et al., 1994
/ -Gliadin GenesBecause the / -gliadin genes comprise a multigene family and are highly variable (Gu et al., 2004 / -gliadin genes were used for dendrogram construction. The resultant dendrogram is shown in Figure 1. The expressed contigs were classified into five major groups. Each major group included contigs identified in this study and their counterparts registered in DNA databases, except for group C, which only contained singleton ESTs from the DNA databases. Novel classes of genes consisting of independent branches within the major classes were obtained in this study (Fig. 1), indicating the effectiveness of comprehensive EST analysis for accumulation of expressed genes. The chromosome locations of all contigs were determined by allele-specific PCR (Table II) in combination with aneuploids of CS wheat. Typical agarose gel electrophoresis patterns are depicted in Figure 2. All contigs were located on the short arm of chromosome group 6, indicating that the contigs are expressed from the Gli-2 loci (Payne, 1987
LMW-GS Genes LMW-GS genes also comprise a multigene family and are highly variable (Ikeda et al., 2002
The Two Storage-Protein Genes Are Specifically Expressed during Seed Development
By counting the constituents of each contig in various tissues, relative expression patterns of each contig (gene) can be monitored during the wheat life cycle (Ogihara et al., 2003
Expression Profiles of the Two Storage-Protein Genes during Seed Maturation
The two storage-protein genes were abundantly expressed during specific seed maturation stages. As presented in Figure 5A, expression of the two storage-protein genes was substantially induced at 10 DPA, and gradually decreased as seeds matured. In fact, 4.2%, 4.2%, and 0.6% of all ESTs at 10 DPA, 20 DPA, and 30 DPA corresponded to
/ -Gliadin Genes Show Two Distinct Expression Patterns during Seed MaturationEach contig is distinguishable from its locus among the three homoeologous chromosomes and the expression patterns of individual genes from the three homoeoloci are shown in Figure 5B. Genes from the 6DS were preferentially expressed in comparison to other genes. Expression of genes from 6BS appeared to be suppressed (Table III); the number of contigs was similar among the three homoeoloci (11 loci for 6AS, 13 for 6BS, and 12 for 6DS). However, the number of ESTs varied among the three homoeoloci (327 ESTs for 6AS, 216 for 6BS, and 465 for 6DS). Although the expression of genes from 6BS and 6DS were mostly observed at 10 DPA and gradually decreased during maturation, gene expression from 6AS increased at 20 DPA and decreased dramatically at 30 DPA. Subsequently, the expression patterns of each contig were carefully traced (Figs. 4 and 5C). It is remarkable that two expression patterns for the / -gliadin genes can be distinguished. Expression of some genes peaked at 10 DPA, after which expression levels decreased with maturation, while other genes peaked at 20 DPA. The former were designated as early expression genes, and the latter as late expression genes. The expression patterns of the early expression and late expression genes were simultaneously detected in three homoeologous genes (Fig. 5C). However, the expression levels of the two late expression genes from 6AS were extremely high at 20 DPA: contig35171 (group A) and contig34890 (group B). These expression levels are attributed to the high expression of / -gliadin genes from genome A at 20 DPA. While the late expression / -gliadin genes from genome A were preferentially expressed, and those from genome D were expressed to some extent (Table III), those from genome D were apparently suppressed (33% relative to genome A).
LMW-GS Genes Reveal Uniform Expression Pattern in Seed Maturation The LMW-GS genes from 1DS were preferentially expressed, as depicted in Figure 5B. Nine contigs located on 1DS were detected (Table III), while only three contigs from 1AS and 1BS were identified. About 60 ESTs were involved in each contig of 1AS and 1BS, while 350 ESTs were included in the contigs of 1DS. Genes were markedly induced at 10 DPA, after which expression levels decreased rapidly. All contigs for LMW-GS revealed similar expression patterns during seed maturation, irrespective of expression levels (Fig. 5C).
Early Expression
Phylogenetic Relationships between Expressed / -Gliadin and LMW-GS Genes in CS Wheat
ESTs of CS wheat homologous to
We have presented evidence that the
It is well known that plant storage-protein genes exist as multigene families and evolved rapidly, even among related plant species (Shewry et al., 2003 Because a large number of wheat ESTs has been pooled in the public domain (National Center for Biotechnology Information), bioinformatics studies enable us to monitor wheat expression patterns during the wheat life cycle and in response to environmental stresses. Profiling of expression patterns of storage-protein genes using a comprehensive EST database provides a powerful tool to analyze the structure and expression of individual multigenes and to manipulate their functions. Wheat EST databases are now indispensable to the study of functional wheat genomics.
Collection of EST Sequences Encoding Two Wheat Storage Proteins
A total of 361,180 EST sequences were collected from the 32 cDNA libraries constructed from various tissues obtained throughout the life cycle and from abiotically stressed tissues of common wheat (Triticum aestivum cv CS, T. aestivum cv Kitakei 1354, and T. aestivum cv Valuevskaya; Ogihara et al., 2003
The nucleotide sequences of EST contigs for
Total DNA was isolated from seedlings of common wheat (T. aestivum cv CS) and from nullisomic-tetrasomic as well as ditelosomic lines of CS (Sears, 1965
Because the sequenced cDNAs were more or less changed in their libraries, the number of ESTs was normalized among the 32 libraries. The expression pattern of each gene in certain tissues and/or after certain treatments was monitored by counting the constituents involved in each contig. The contigs were clustered using Pearson's correlation coefficient (Eisen et al., 1998 Sequence data from this article can be found in the GenBank/EMBL data libraries under accession numbers BJ207047 to BJ323305.
We thank Dr. Yukiko Yamazaki of the National Institute of Genetics, Japan, for her maintenance of the EST database of wheat, "KOMUGI" (http://shigen.lab.nig.ac.jp/wheat/komugi/ests/tissueBrowse.jsp). Received August 29, 2005; returned for revision October 11, 2005; accepted October 12, 2005.
1 This work was supported by Grants-in-Aid for Scientific Research on Priority Areas ("Molecular mechanisms of species differentiation"; no. 14087204) and National Bioresource Project from the Ministry of Education, Culture, Sports, Science and Technology of Japan. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Yasunari Ogihara (yogihara{at}kpu.ac.jp).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.105.070722. * Corresponding author; e-mail yogihara{at}kpu.ac.jp; fax 81774933261.
Altschul S, Gish W, Miller W, Myers E, Lipman D (1990) Basic local alignment search tool. J Mol Biol 215: 403410[CrossRef][Web of Science][Medline] Anderson O (1991) Characterization of members of a pseudogene subfamily of the wheat alpha-gliadin storage protein genes. Plant Mol Biol 16: 335337[Medline] Anderson O, Greene F (1997) The Anderson O, Litts J, Greene F (1997) The Arentz-Hansen EH, McAdam SN, Molberg O, Kristiansen C, Sollid LM (2000) Production of a panel of recombinant gliadins for the characterisation of T cell reactivity in coeliac disease. Gut 46: 4651 Asamizu E, Nakamura Y, Sato S, Tabata S (2000) A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries. DNA Res 7: 175180[Abstract] Asamizu E, Nakamura Y, Sato S, Tabata S (2004) Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis. Plant Mol Biol 54: 405414[CrossRef][Web of Science][Medline] Cassidy BG, Dvorak J, Anderson OD (1998) The wheat low-molecular-weight glutenin genes: characterization of six new genes and progress in understanding gene family structure. Theor Appl Genet 96: 743750[CrossRef] Clarke BC, Mukai Y, Appels R (1996) The Sec-1 locus on the short arm of chromosome 1R of rye (Secale cereale). Chromosoma 105: 269275[Medline] D'Ovidio R, Marchitelli C, Ercoli Cardelli L, Porceddu E (1999) Sequence similarity between allelic Glu-B3 genes related to quality properties of durum wheat. Theor Appl Genet 98: 455461[CrossRef] Duan H, Schuler M (2005) Differential expression and evolution of the Arabidopsis CYP86A subfamily. Plant Physiol 137: 10671081 Eisen MB, Spellman PT, Brown PO, Botstein D (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95: 1486314868 Ewing R, Ben Kahla A, Poirot O, Lopez F, Audic S, Claverie J (1999) Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression. Genome Res 9: 950959 Garcia-Maroto F, Marana C, Garcia-Olmedo F, Carbonero P (1990) Nucleotide sequence of a cDNA encoding an alpha/beta-type gliadin from hexaploid wheat (Triticum aestivum). Plant Mol Biol 14: 867868[CrossRef][Medline] Gu YQ, Crossman C, Kong X, Luo M, You FM, Coleman-Derr D, Dubcovsky J, Anderson OD (2004) Genomic organization of the complex alpha-gliadin gene loci in wheat. Theor Appl Genet 109: 648657[Medline] Gupta RB, Shepherd KW (1990) Two-step one-dimensional SDS-PAGE analysis of LMW subunits of glutelin. 1. Variation and genetic control of the subunits in hexaploid wheats. Theor Appl Genet 80: 6574 Hammond-Kosack MCR, Holdsworth MJ, Bevan MW (1993) In vivo footprinting of a low molecular weight glutenin gene (LMWG-1D1) in wheat endosperm. EMBO J 12: 545554[Web of Science][Medline] Ikeda T, Nagamine T, Fukuoka H, Yano H (2002) Identification of new low-molecular-weight glutenin subunit genes in wheat. Theor Appl Genet 104: 680687[CrossRef][Medline] Jackson EA, Holt LM, Payne IP (1983) Characterisation of high molecular weight gliadin and low-molecular-weight glutenin subunits of wheat endosperm by two-dimensional electrophoresis and the chromosomal localisation of their controlling genes. Theor Appl Genet 66: 2937 Johal J, Gianibeli MC, Rahman S, Morell MK, Gale KR (2004) Characterization of low-molecular-weight glutenin genes in Aegilops tauschii. Theor Appl Genet 109: 10281040[CrossRef][Medline] Kasarda D, Okita T, Bernardin J, Baecker P, Nimmo C, Lew E, Dietler M, Greene F (1984) Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum). Proc Natl Acad Sci USA 81: 47124716 Kirch H, Schlingensiepen S, Kotchoni S, Sunkar R, Bartels D (2005) Detailed expression analysis of selected genes of the aldehyde dehydrogenase (ALDH) gene superfamily in Arabidopsis thaliana. Plant Mol Biol 57: 315332[CrossRef][Web of Science][Medline] Maruyama N, Ichise K, Katsube T, Kishimoto T, Kawase S, Matsumura Y, Takeuchi Y, Sawada T, Utsumi S (1998) Identification of major wheat allergens by means of the Escherichia coli expression system. Eur J Biochem 255: 739745[Medline] Masci S, D'Ovidio R, Lafiandra D, Kasarda D (1998) Characterization of a low-molecular-weight glutenin subunit gene from bread wheat and the corresponding protein that represents a major subunit of the glutenin polymer. Plant Physiol 118: 11471158 Mochida K, Kawaura K, Shimosak E, Shin-I T, Kohara Y, Yamazaki Y, Ogihara Y (2005) Tissue expression map of comprehensive expressed sequence tags and its application to in silico screening of stress response genes in common wheat. Mol Genet Genomics (in press) Mochida K, Yamazaki Y, Ogihara Y (2003) Discrimination of homoeologous gene expression in hexaploid wheat by SNP analysis of contigs grouped from a large number of expressed sequence tags. Mol Genet Genomics 270: 371377[CrossRef][Web of Science][Medline] Moczulski M, Salmanowicz BP (2003) Multiplex PCR identification of wheat HMW glutenin subunit genes by allele-specific markers. J Appl Genet 44: 459471[Medline] Mueller M, Knudsen S (1993) The nitrogen response of a barley C-hordein promoter is controlled by positive and negative regulation of the GCN4 and endosperm box. Plant J 4: 343355[CrossRef][Web of Science][Medline] Ogihara Y, Mochida K, Nemoto Y, Murai K, Yamazaki Y, Shin-I T, Kohara Y (2003) Correlated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tags. Plant J 33: 10011011[CrossRef][Web of Science][Medline] Ogihara Y, Shimizu H, Hasegawa K, Tsujimoto H, Sasakuma T (1994) Chromosome assignment of four photosynthesis-related genes and their variability in wheat species. Theor Appl Genet 88: 383394 Okita T, Cheesbrough V, Reeves C (1985) Evolution and heterogeneity of the alpha-/beta-type and gamma-type gliadin DNA sequences. J Biol Chem 260: 82038213 Pavy N, Laroche J, Bousquet J, Mackay J (2005) Large-scale statistical analysis of secondary xylem ESTs in pine. Plant Mol Biol 57: 203224[CrossRef][Web of Science][Medline] Payne IP (1987) Genetics of wheat storage proteins and the effect of allelic variation on bread-making quality. Annu Rev Plant Physiol 38: 141153[CrossRef][Web of Science] Payne IP, Holt LM, Worland AJ, Law CN (1982) Structural and genetical studies on the high-molecular-weight subunits of wheat glutenin. Part 3. Telocentric mapping of subunit genes on the long arms of the homoeologous group 1 chromosomes. Theor Appl Genet 63: 129138[CrossRef] Payne IP, Jackson EA, Holt LM, Law CN (1984) Genetic linkage between endosperm storage protein genes on each of the short arms of chromosomes 1A and 1B in wheat. Theor Appl Genet 67: 235243[CrossRef] Rozen S, Skaletsky HJ (2000) Primer3 on the WWW for general users and for biologist programmers. In S Krawetz, S Misener, eds, Methods in Molecular Biology. Humana Press, Totowa, NJ, pp 365386 Sabelli P, Shewry PR (1991) Characterization and organisation of gene families at the Gli-1 loci of bread and durum wheats by restriction fragment analysis. Theor Appl Genet 83: 209216[Web of Science] Sears E (1965) Nullisomic-tetrasomic combinations in hexaploid wheat. In R Riley, K Lewis, eds, Chromosome Manipulation and Plant Genetics. Oliver and Boyd, Edinburgh, pp 2958 Shewry PR, Halford NG, Lafiandra D (2003) Genetics of wheat gluten proteins. Adv Genet 49: 111184[CrossRef][Web of Science][Medline] Song R, Llaca V, Linton E, Messing J (2001) Sequence, regulation and evolution of the maize 22-kD alpha zein gene family. Genome Res 11: 18171825 Sterky F, Bhalerao R, Unneberg P, Segerman B, Nilsson P, Brunner A, Charbonnel-Campaa L, Lindvall J, Tandre K, Strauss S, et al (2004) A Populus EST resource for plant functional genomics. Proc Natl Acad Sci USA 101: 1395113956 Sumner-Smith M, Rafalski J, Sugiyama T, Stoll M, Soll D (1985) Conservation and variability of wheat alpha/beta-gliadin genes. Nucleic Acids Res 13: 39053916 Thompson RD, Bartels D, Harberd NP, Flavell RB (1983) Characterization of the multigene family coding for HMW glutenin subunits in wheat using cDNA clones. Theor Appl Genet 67: 8796[CrossRef] Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 46734680 Van Campenhout S, Vander Stappen J, Sagi L, Volckaert G (1995) Locus-specific primers for LMW glutenin genes on each of the group 1 chromosomes of hexaploid wheat. Theor Appl Genet 91: 313319 Vettore A, da Silva F, Kemper E, Souza G, Da Siva A, Ferro M, Henrique-Silva F, Giglioti LM, Coutinho EA, Nobrega LL, et al (2003) Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane. Genome Res 13: 27252735 Wicker T, Yahiaui N, Guyot R, Schlagenhauf E, Liu ZD, Dubkovsky J, Keller B (2003) Rapid genome divergence at orthologous low molecular weight glutenin loci of the A and Am genomes of wheat. Plant Cell 15: 11861197 Zhang H, Sreenivasulu N, Weschke W, Stein N, Rudd S, Radchuk V, Potokina E, Scholz U, Schweizer P, Zierold U, et al (2004) Large-scale analysis of the barley transcriptome based on expressed sequence tags. Plant J 40: 276290[CrossRef][Web of Science][Medline] Zhang W, Gianibelli MC, Ma W, Rampling L, Gale KR (2003) Identification of SNPs and development of allele-specific PCR markers for This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|