Plant Physiology 139:574-597 (2005)
© 2005 American Society of Plant Biologists
GENOME ANALYSIS
Characterizing the Grape Transcriptome. Analysis of Expressed Sequence Tags from Multiple Vitis Species and Development of a Compendium of Gene Expression during Berry Development1,[w]
Francisco Goes da Silva2,
Alberto Iandolino2,3,
Fadi Al-Kayal2,
Marlene C. Bohlmann,
Mary Ann Cushman,
Hyunju Lim,
Ali Ergul,
Rubi Figueroa,
Elif K. Kabuloglu,
Craig Osborne,
Joan Rowe,
Elizabeth Tattersall,
Anna Leslie,
Jane Xu,
JongMin Baek,
Grant R. Cramer,
John C. Cushman and
Douglas R. Cook*
Department of Plant Pathology (F.G.d.S., H.L., J.X., D.R.C.), Department of Viticulture and Enology (A.I.), and College of Agricultural and Environmental Sciences Genomics Facility (A.I., A.L., J.B., D.R.C.), University of California, Davis, California 95616; Department of Biochemistry, University of Nevada, Reno, Nevada 895570014 (F.A-.K., M.C.B., M.A.C., R.F., E.K.K., C.O., J.R., E.T., G.R.C., J.C.C.); and Biotechnology Institute, Ankara University, 06500 Besevler-Ankara, Turkey (A.E.)
 |
ABSTRACT
|
|---|
We report the analysis and annotation of 146,075 expressed sequence tags from Vitis species. The majority of these sequences were derived from different cultivars of Vitis vinifera, comprising an estimated 25,746 unique contig and singleton sequences that survey transcription in various tissues and developmental stages and during biotic and abiotic stress. Putatively homologous proteins were identified for over 17,752 of the transcripts, with 1,962 transcripts further subdivided into one or more Gene Ontology categories. A simple structured vocabulary, with modules for plant genotype, plant development, and stress, was developed to describe the relationship between individual expressed sequence tags and cDNA libraries; the resulting vocabulary provides query terms to facilitate data mining within the context of a relational database. As a measure of the extent to which characterized metabolic pathways were encompassed by the data set, we searched for homologs of the enzymes leading from glycolysis, through the oxidative/nonoxidative pentose phosphate pathway, and into the general phenylpropanoid pathway. Homologs were identified for 65 of these 77 enzymes, with 86% of enzymatic steps represented by paralogous genes. Differentially expressed transcripts were identified by means of a stringent believability index cutoff of 98.4%. Correlation analysis and two-dimensional hierarchical clustering grouped these transcripts according to similarity of expression. In the broadest analysis, 665 differentially expressed transcripts were identified across 29 cDNA libraries, representing a range of developmental and stress conditions. The groupings revealed expected associations between plant developmental stages and tissue types, with the notable exception of abiotic stress treatments. A more focused analysis of flower and berry development identified 87 differentially expressed transcripts and provides the basis for a compendium that relates gene expression and annotation to previously characterized aspects of berry development and physiology. Comparison with published results for select genes, as well as correlation analysis between independent data sets, suggests that the inferred in silico patterns of expression are likely to be an accurate representation of transcript abundance for the conditions surveyed. Thus, the combined data set reveals the in silico expression patterns for hundreds of genes in V. vinifera, the majority of which have not been previously studied within this species.
On a worldwide basis, grapes (Vitis species) are both the most widely cultivated and economically important fruit crop, encompassing approximately 8 million hectares of arable land (Vivier and Pretorius, 2002 ). Although the grape berry is used for multiple purposes, including juice, fresh and dried fruit, and distilled liquor, wine produced from cultivars of Vitis vinifera has the highest economic value (Mullins et al., 1992 ). Worldwide wine production in the year 2000 was 271,212,000 hectoliters (http://www.wineinstitute.org). In addition to its economic impact, there is increasing interest in wine as a source of health-promoting secondary metabolites (e.g. Bagchi et al., 2003 ; Spiller et al., 2003 ), such as antioxidant polyphenols (Ferguson, 2001 ; Aldini et al., 2003 ). For example, the polyphenol resveratrol has gained particular attention due to its ability to increase longevity in yeast through the stimulation of anti-aging enzymes known as sirtuins (Howitz et al., 2003 ).
In contrast to many other crop species where genotypic variation is a tool for crop improvement, in wine grapes, constancy of the genotype or variety is often the desired goal. Varietal integrity is maintained through vegetative propagation. As a consequence, intensive crop management practices (i.e. viticulture) are more important to maintaining quality characteristics than are traditional breeding methodologies, which have been limited in their application in grapes relative to other major crop species. Genomics approaches are likely to have particular value for grape improvement because they have the potential to identify transcriptional, biochemical, and genetic pathways that contribute to agronomic properties. Examples include revealing transcriptional pathways that are correlated with berry quality (e.g. metabolism of sugars, organic acids, and flavonoids) and disease resistance (e.g. specific resistance genes and downstream transcriptional pathways) and determining how viticultural practices impact these molecular phenotypes. The application of such knowledge to grape improvement is likely to take the form of improved viticultural practices and precise molecular breeding. Approaches such as marker-assisted selection and transgenesis will facilitate transfer of genes for desirable traits into elite or classic cultivars of V. vinifera, with the goal of improving agronomic performance while preserving traditional quality traits (Bisson et al., 2002 ; Vivier and Pretorius, 2002 ).
The strategies outlined above will be aided by genomics efforts to characterize the gene content of grapes, such as those involving expressed sequence tag (EST) screens. ESTs currently represent the most abundant nucleotide commodity from plant genomes, providing a resource that can be exploited for gene discovery, genome annotation, and comparative genomics (Rudd, 2003 ). When EST sequencing is combined with a methodical cDNA library construction effort, in silico analysis of EST frequency can be used to identify genes whose expression is correlated with particular points in development or induced in response to abiotic or biotic factors. Moreover, the identification of coregulated genes provides the basis for analyzing presumptive gene networks and facilitates use of "guilt by association" logic to infer the function of unknown or imprecisely annotated genes (Journet et al., 2002 ).
Although EST collections are currently available for several crop and model plant species, until recently, most ESTs from Vitis species were not publicly available. Reports of nonpublic data include the analysis of 5,000 ESTs generated from V. vinifera cv Chardonnay leaf and berry tissue (Ablett et al., 2000 ), cv Shiraz berries at various stages of development (Terrier et al., 2001 ), and the analysis of over 4,000 ESTs from dormant buds of V. vinifera cv Purple Cornichon (Pacey-Miller et al., 2003 ). However, an international effort from several research groups worldwide has dramatically increased the availability of EST data from grapes. In 2001, there were just over 400 sequences deposited in GenBank. As of September 30, 2003, 146,075 sequences were deposited to the National Center for Biotechnology Information (NCBI) for several Vitis species. Here, we describe the analysis of this transcript data set, with emphasis on organization and annotation of the unigene set and analysis of differentially expressed genes.
 |
RESULTS
|
|---|
The Vitis subsp. Data Sets
In total, 146,075 Vitis sequences were deposited into GenBank (NCBI) as of September 30, 2003. Eighty percent of these sequences were generated by the authors, with the majority of the remaining sequences deposited into public data repositories by five different research groups (Supplemental Table I). The most economically important species of Vitis, V. vinifera, was the source of the majority of the sequences, with 1,471 sequences corresponding to transcript and genomic DNA sequences and 135,541 ESTs from 58 cDNA libraries representing seven cultivars. The depth of sequencing of V. vinifera cDNA libraries ranged from eight to 24,400 ESTs (Supplemental Table I), and libraries represented numerous cultivars, organs, plant developmental stages, and stress treatments as shown in Table I. The remaining Vitis species (Supplemental Table I) were represented by 8,957 ESTs and 106 genomic or expressed transcripts.
To facilitate analysis across the EST data set, we developed a controlled and structured vocabulary to classify cDNA libraries according to three main categories: namely, genotype, plant development, and stress. Each category was further subdivided according to accepted hierarchical relationships using vocabulary in common use for viticulture, as shown by example in Figure 1 (for complete detail, see Supplemental Table II). The vocabulary was used to generate an online query tool (http://cgf.ucdavis.edu/) to facilitate identification of genes that may be differentially or uniquely expressed under specified conditions. One can identify genes whose expression is enhanced in a particular Vitis species or cultivar, under biotic or abiotic stress, in a particular organ, or at a specific plant developmental stage. For example, several genes expressed only in grapes infected with the bacterial pathogen Xylella fastidiosa have been identified using this tool (F. Goes da Silva and D. Cook, unpublished data), demonstrating the utility of this structured vocabulary.

View larger version (26K):
[in this window]
[in a new window]
|
Figure 1. A controlled vocabulary for description of the Vitis species cDNA libraries. Libraries are organized into three main categories, (A) genotype, (B) development, and (C) stress, which can be further subdivided as shown. Terms for the Cabernet Sauvignon Leaf (CA12LI) are shown by way of example, with detailed information for all libraries available in Supplemental Tables I and II.
|
|
Generation of Vitis Unigene Sets
For purposes of generating a unigene set, ESTs and expressed transcripts obtained en masse from NCBI were organized into contigs (also called tentative consensus sequences [TCs]) and singleton sequences by means of MegaBLAST and CAP3 (Liang et al., 2000 ). Clustering was performed separately for species represented by more than 250 sequences (i.e. V. vinifera, V. aestivalis, and V. arizonica x V. rupestris). Genomic DNA sequences as well as sequences from species represented by <250 sequences were used to produce a separate singleton data set. In total, 25,696 unigenes were predicted for V. vinifera, 1,977 for a Vitis rupestris x V. arizonica hybrid, and 1,314 for V. aestivalis (Table II).
View this table:
[in this window]
[in a new window]
|
Table II. Summary of the Vitis spp. unigene sets
Clustering was performed individually for data sets from species represented by at least 250 sequences available at the NCBI database.
|
|
The occurrence in a single TC of paired-end (i.e. 5' and 3' direction) reads from the same cDNA clone was taken as experimental evidence in support of computationally determined TC structure. Approximately 72% of V. vinifera TCs (9,599 TCs, with an average size of 1,020 bp) were composed of paired-end clones, of which 54% (5,146 TCs) of the paired-end cDNA sequences started within 40 bp of the 5' most nucleotide. These results demonstrate that the majority of unigenes were captured within a single cDNA clone.
Annotation of the Unigene Sets
To identify Vitis unigenes that potentially encode homologs of known proteins, we conducted BLASTX (Altschul et al., 1997 ) against GenBank's nonredundant protein database. Sixty-nine percent (18,259) of the V. vinifera unigenes showed significant similarity to proteins in the database based on an E value cutoff of 1e5, and 5,064 of the protein homologs were annotated as unknown protein, hypothetical protein, or expressed protein. This result is not surprising, considering that a great number of sequences in the NCBI database represent uncharacterized genes and proteins. BLASTX analysis for the remaining Vitis species revealed similar trends.
More detailed functional annotation was provided by mapping unigenes onto the Gene Ontology Consortium structure using The Institute for Genomic Research (TIGR) Vitis vinifera Gene Index (http://www.tigr.org/tdb/tgi/vvgi/) version 3.1 (Quackenbush et al., 2001 ). Gene Ontology provides a structured and controlled vocabulary to describe gene products according to three ontologies: molecular function, biological process, and cellular component (Gene Ontology Consortium, 2000 ). In total, 1,962 unigenes could be mapped to one or more ontology, with multiple assignments possible for a given protein within a single ontology. Thus, 2,362 assignments were made to the molecular function ontology, with 68% of these in the catalytic activity and binding categories (Fig. 2A) and annotations such as RNA helicases, kinases, ubiquitin, and nucleotide binding proteins. By way of example, the branch child terms for transporter activity and transcription regulator activity revealed several genes implicated in carbohydrate transport (e.g. CTG1030456 and CTG1030972) as well as predicted transcription factors with putative roles in berry ripening (e.g. myb-related regulatory genes CTG1028317 and CTG1028387 and MADS box genes CTG1028624 and CTG1030218). Under the biological process ontology, the vast majority of the 2,056 assignments were to the physiological process category, with frequent subclassification into the response to stress, response to external stimulus, and cell growth and maintenance categories (Fig. 2B). The abundance of stress- and growth-related annotations is not surprising, considering that a significant portion of the ESTs analyzed were derived from Vitis tissues responding to biotic or abiotic stress or from tissues in early stages of development. Finally, for cellular components, the vast majority of the assignments (96%) were to the cell category (Fig. 2C).

View larger version (24K):
[in this window]
[in a new window]
|
Figure 2. Distribution of V. vinifera unigenes with putative functions assigned through annotation using gene ontology. Molecular function (A), biological process (B), and cellular component (C). Assignments are based on the data available at the TIGR Vitis vinifera Gene Index (http://www.tigr.org/tdb/tgi/vvgi/) version 3.1 (November, 2003).
|
|
Characterizing the Functional Gene Space in Vitis sp.
To determine the extent to which the functional gene space had been sampled, we searched the unigene set for homologs of structural enzymes involved in primary, intermediate, and secondary metabolic pathways. The targets of our analysis were biochemical pathways of known topology involved in the biosynthesis of flavonoids and their precursors, including the Embden-Meyerhoff-Parnas pathway (EMP pathway or glycolysis), the Krebs cycle (citric acid cycle, tricarboxylic acid cycle cycle), the oxidative/nonoxidative pentose phosphate pathway, the shikimic acid or aromatic amino acid pathway, and the general phenylpropanoid and flavonoid pathways. The selected metabolic network converts the simple six-carbon structure of hexoses into the more complex 6:3:6 basic carbon structures of flavonoids. Flavonoids are of particular interest in grape biology because they represent an important metabolic sink for carbon, particularly in organs and cells actively accumulating phenylpropanoids, such as cells in the exocarp and outer mesocarp of ripening grapevine berries (Coombe, 1987 ; Hardie et al., 1996 ).
Table III (and Supplemental Table III) shows TCs and singletons from V. vinifera and other non-vinifera Vitis subsp. predicted to code for structural enzymes of the selected pathways. Most pathways are fully represented by Vitis ESTs, and most enzymes are supported by a significant degree of redundancy, both in terms of the number of ESTs (expression) and the number of TCs and singletons (putative paralogous genes). This redundancy suggests numerous moderately sized gene families in the grapevine genome.
View this table:
[in this window]
[in a new window]
|
Table III. Putative genes in the Vitis spp. data sets coding for structural enzymes of key pathways of primary, intermediate, and secondary metabolism
|
|
In Silico Characterization of Gene Expression in V. vinifera
For the purpose of inferring patterns of gene expression, we constructed a matrix containing nonredundant EST frequency across each cDNA library for all TCs. No TCs were observed with sequences present in all 58 cDNA libraries, which is not surprising given the diversity of tissue types sampled and the limited depth to which some cDNA libraries were sequenced. The most widely expressed TC encodes a putative Rubisco small subunit protein, with presence in 38 separate cDNA libraries. In total, 14 TCs were expressed in at least 30 cDNA libraries (Table IV). All of the broadly expressed TCs matched annotated proteins in the NCBI nonredundant database, with the exception of CTG1027353. Despite their presence in multiple cDNA libraries, certain of these broadly expressed TCs (e.g. CTG1026987 and CTG1027410, both annotated as a putative abscisic acid, stress, and ripening-induced protein implicated in sugar transport) were also predicted to be differentially expressed, as described below.
To identify putative differentially expressed genes, we used the R statistic methodology described by Stekel et al. (2000) . This analysis was restricted to libraries with over 1,000 ESTs (a total of 29 libraries), as libraries represented by relatively few ESTs are likely to confound the prediction of differentially expressed genes (Ewing et al., 1999 ). A total of 665 TCs with an R value 25 and false positive rate 1 (believability [B] 98.4%) were identified (Supplemental Table IV). Two-dimensional hierarchical clustering was performed using a correlation matrix constructed from EST frequencies for the 665 differentially expressed TCs. cDNA libraries clustered into two major clades: one relatively homogeneous clade representing all leaf libraries and a second more heterogeneous clade containing all other libraries (Fig. 3A). In general, libraries tended to cluster according to the stage of development and tissue sampled (Fig. 3A). An exception to this pattern is libraries constructed from preveraison berries, which exhibited very different positions within the cluster. It is noteworthy that several of these preveraison berry libraries were constructed from fruit that was carefully sampled to represent distinct stages of berry development, all prior to the onset of ripening (veraison). This differential clustering of preveraison libraries is suggestive of major shifts in gene expression during preveraison berry development (see below). The two root cDNA libraries included in the analysis also exhibited differential clustering, with abiotic stressed roots clustering separately from nonstressed roots. In total, the abiotic stressed Chardonnay root, leaf, and berry libraries contained 79, 26, and six library-specific TCs, respectively, with B > 98.0%. These putative stress-specific TCs were typically highly expressed (EST counts ranging from 8215 ESTs) and in 72% of cases had either no similarity to sequences in the NCBI database nonredundant based on BLASTX or matched an unknown or hypothetical protein.

View larger version (27K):
[in this window]
[in a new window]
|
Figure 3. Hierarchical cluster of grape cDNA libraries and TCs based on EST distribution. A, One-dimensional clustering of 29 grape cDNA libraries. B, Two-dimensional hierarchical clustering of 665 differentially expressed transcripts versus the 29 cDNA libraries shown in A. All 665 differentially expressed transcripts in B have a true positive rate of >98.4% (Stekel et al., 2000 ). Two-dimensional clustering is based on a Pearson's correlation coefficient matrix. Band intensity designates relative transcript abundance in a given library, as inferred from EST frequency within each TC. Black and white designate least and most abundant transcripts, respectively. ND, Not determined; NA, not available.
|
|
We further compared the patterns of expression predicted from the current in silico analysis with published results obtained by more traditional northern-blot analysis. Davies and Robinson (2000) reported 17 grape ripening-induced (GRIP) genes, all of which encode either putative cell wall or stress response proteins that are differentially expressed in ripening berries (Davies and Robinson, 2000 ). We identified the corresponding TCs for all 17 of these genes, 14 of which were included among the 665 differentially expressed TCs used for hierarchical clustering (Table V). As shown in Figure 4, in silico analysis confirms that expression of the GRIP genes was largely restricted to the veraison and postveraison libraries; moreover, this work extends the previous analysis of Davies and Robinson (2000) by demonstrating that expression of most GRIP genes was also absent from roots, buds, flowers, stems, petioles, and leaves, with the primary exception of GRIP 21 and VvTL2, which were expressed in 1-year-old root tissues of V. vinifera cv Pinot Noir. Examination of the hierarchical cluster populated by GRIPs 3, 13, 15, 21, and 61 identified four putatively coregulated genes (Table V). The annotations for several of these TCs suggest functional relatedness to the GRIPs, including a putative stress response thaumatin-like protein SCUTL2 from V. vinifera (CTG1028245) and a putative cell wall expansin (CTG1027667) with homology to a Glycine max gene. The fact that the patterns of expression predicted by in silico analysis are consistent with those obtained by alternative methods (Davies and Robinson, 2000 ) suggests that broader analysis of the 665 unigenes shown in Figure 3 is likely to be an accurate depiction of the expression for a significant fraction of the grape transcriptome.
View this table:
[in this window]
[in a new window]
|
Table V. GRIP genes (Davies and Robinson, 2000 ) and coregulated TCs (with putative annotation) identified in the V. vinifera nonredundant data set (TCs and singletons)
TC numbers in bold represent those identified within the 665 differentially expressed transcripts based on R-statistics analysis (Stekel et al., 2000 ; Fig. 4). Contigs with a postscript letter were split from chimeric contigs by manual curation, as indicated in Supplemental Table IV. GB, GenBank accession number.
|
|

View larger version (26K):
[in this window]
[in a new window]
|
Figure 4. Expression of GRIP genes based on normalized EST frequency. A, Putative cell-wall-associated GRIP genes. B, Putative stress-response-associated GRIP genes. Gene expression is represented as percentage of distribution of ESTs in each of the corresponding libraries. ND, Developmental stage not determined. NCBI accession and contig numbers are given in Table V.
|
|
Towards a Compendium of Gene Expression during Flower-Berry Development
Grapevine berries are nonclimateric fruits (Giovannoni, 2001 ) with a characteristic double sigmoid growth curve (Fig. 5). The initial phase of exponential berry growth (stage I) is followed by a lag phase (stage II), with growth resumed after the onset of ripening or veraison (stage III). Berry development is characterized by changes in numerous biological processes, including cell division and enlargement, primary and secondary metabolism, and resistance/susceptibility to abiotic/biotic stresses. In particular, the transition to ripening, which distinguishes stage II from stage III, includes major shifts in the accumulation of anthocyanins, organic acids, and sugars and alterations in the relative susceptibility to pathogens, such as powdery mildew (Oidium tuckerii) and bunch rot (Botrytis cinerea), which affect berries primarily during stages I and III, respectively (Ollat et al., 2002 ). Transcriptional profiling should reveal the transcriptional correlates of this dynamic developmental system.

View larger version (29K):
[in this window]
[in a new window]
|
Figure 5. Overview of berry developmental and stages sampled by EST sequencing in the UC Davis flower-berry developmental series. SI, stage I; SII, stage II; SIII, stage III. 1, Flower prebloom; 2, flower bloom; 3, berry stage I; 4, berry stage II green hard; 5, berry stage II green soft; 6, berry stage III, 19 brix. Brix is a measure of soluble solids and a standard measure of ripening.
|
|
To examine transcriptional responses during berry development, six cDNA libraries were prepared at UC Davis from a well-characterized developmental series that encompasses prebloom flowers through stage III ripe fruit (Fig. 5; Supplemental Table V). Each cDNA library was generated using an identical protocol and survey sequenced to an equivalent depth using a paired-end sequencing strategy that increases the utility of the resource for in silico analysis of gene expression. Based on R-statistics analysis, 87 differentially expressed genes were identified with a false positive rate 1 (B 98.8%; Supplemental Table V). As shown in Figure 6, two-dimensional hierarchical clustering reveals that each stage of flower and berry development is characterized by distinct patterns of gene expression, including numerous stage-specific transcripts (Supplemental Table V).

View larger version (38K):
[in this window]
[in a new window]
|
Figure 6. Hierarchical cluster of differentially expressed berry transcripts. EST frequencies used in this analysis corresponds to those derived from sequencing of the UC Davis flower-berry developmental series (see Fig. 5). Contig (CTG) numbers on the vertical axis correspond to the 87 differentially expressed transcripts described in the compendium of berry gene expression. Color designates relative transcript abundance in a given library, with red and blue being the most and least abundant, respectively. Individual transcripts span the vertical axis. cDNA libraries are organized in a temporal series, horizontally from left to right, as follows: FpB, flower prebloom; FB, flower bloom; SI, stage 1 berries; SIIgh, stage II green hard berries; SIIgs, stage II green soft berries; SIII, stage III berries (postveraison).
|
|
Prior to detailed analysis of specific transcripts, we sought to validate the use of our EST frequency data as a measure of gene expression, specifically for this data set. Towards this end, canonical correlation analysis (CCA) was used to determine the extent of correlation between the UC Davis data set and a similar but independent data set compiled from data deposited at the NCBI EST database. Four developmental stages (flower, preveraison, veraison, and ripe berries) could be circumscribed from the nonredundant NCBI data, which served as a pseudoreplicate of the UC Davis developmental series. Only differentially expressed genes with B 98.8% were used for CCA. CCA variables were developmental stages, while EST frequencies in a given developmental stage served as the individual observations. The two data sets were related through the common unigene set described above.
Taken together, the first and second canonical variates account for 89.52% of the variability in the EST data sets. Moreover, as shown in Table VI, CCA reveals a significant correlation between the UC Davis data set and the NCBI EST database pseudoreplicate, with correlations of 0.66 between the first canonical variates and 0.54 between the second canonical variates. The first canonical variates for both data sets are most strongly influenced by stage I (preveraison) and stage III (postveraison) libraries, while the second canonical variates in both data sets are most strongly influenced by stage III (veraison and ripe berry) libraries. Principal component analysis (PCA) of the combined data sets supports these conclusions, with a high level of congruence between libraries that sample similar stages of development (Fig. 7). In agreement with the results of two-dimensional hierarchical clustering, PCA also supports the distinct nature of gene expression during different stages of flower and berry development. Most striking among these differences is the opposite trend in EST frequencies observed between flowers and stage II berries, with a clear contrast along the second principal component. Unigenes having a major influence on PCA are indicated in the legend to Figure 7.
View this table:
[in this window]
[in a new window]
|
Table VI. Correlations, standardized canonical coefficients, and canonical variates among the UC Davis College of Agricultural and Environmental Sciences libraries and other publicly available libraries, and cross-correlation analysis
The canonical correlations between the first and second canonical variables were 0.66 and 0.54, respectively. The first and second canonical variables accounted for 58.8% and 30.7% of the variability (cumulative = 89.5%). The approximate F test indicated that these two components were significant (P < 0.0001 and 0.001, respectively). CA&ES, UC Davis College of Agricultural and Environmental Sciences. Last two columns, cross-correlation analysis.
|
|

View larger version (13K):
[in this window]
[in a new window]
|
Figure 7. PCA of 10 independent flower and berry EST data sets. F, Flower; FpB, flower prebloom; FB, flower bloom; SI, stage I berries; Pv, preveraison berries (approximately equivalent to stage I); SIIgh, stage II green hard berries; SIIgs, stage II green soft berries; SIII, stage III berries (postveraison); V, veraison; R, ripe (postveraison). Note that libraries that sampling similar developmental stages have highly similar principal components. Symbols are as follows: white diamond, chitinases; white square, nsLTPs; gray circle, allergens; white triangle, metallothioneins; *, putative transcription factors; white circle, Hsp; -, hidroxyproline-rich cell wall protein; gray diamond, photosynthesis-related; black inverted triangle, ripening induced proteins; gray square, plasma-membrane-associated proteins; black circle, ACC oxidase; black circle, caffeic acid OMT; black square, PPO; black triangle, chalcone isomerase; gray triangle, myo-inositol 1-P synthase; gray inverted triangle, aquaporin; and black diamond, no hit.
|
|
Taken together, the results of CCA and PCA indicate a strong statistical correlation between EST frequencies in the UC Davis and NCBI pseudoreplicate data sets, lending credence to the inference of gene expression patterns described below.
Functional Classification of Predicted Differentially Expressed Genes
For purposes of relating gene expression during grapevine flower and berry development to known changes in berry phenotypes, we assigned the 87 differentially expressed genes to eight functional categories based on the function of homologous proteins in Arabidopsis (Arabidopsis thaliana) or previously published work in grapes, as shown in Supplemental Table V. Categories included (1) pathogenesis-related (PR) proteins, (2) abiotic stress/cellular redox balance/detoxification-xenobiotic transport proteins, (3) primary metabolism, including CO2 assimilation, carbohydrate metabolism, lipid metabolism, and amino acid biosynthesis, (4) secondary metabolism, (5) berry growth, expansion, and water relations, (6) ethylene metabolism, (7) allergenic peptides and seed-specific proteins, and (8) other proteins of interest.
PR Proteins
Several PR proteins are predicted to be strongly up-regulated during stage II of berry development and during postveraison stages. Among these, chitinases and nonspecific lipid transfer and thaumatin-like proteins are the most abundant. From stage II of berry development through maturity, the protein content of the mesocarp cell walls increases >50% (Nunan et al., 1998 ). This increase in protein parallels the increase in soluble solid content of postveraison berries (Salzman et al., 1998 ). Approximately 90% of these proteins are PR proteins, especially thaumatin-like (osmotins) and chitinases (Salzman et al., 1998 ; Pocock et al., 2000 ). These proteins appear to be expressed under stress (Salzman et al., 1998 ; Pocock et al., 2000 ) and as part of normal berry development (Tattersall et al., 1997 ).
Chitinases constitute a large family of enzymes with hydrolytic activity against a linear polymer of -1,4-N-acetylglucosamine, or chitin, which is a major component in the cell walls of most pathogenic fungi and in the exoskeleton of insect pests. In addition to their presumed importance in disease resistance, chitinases can be major allergens in grapes and wines (Pastorello et al., 2003 ), and they contribute to proteinaceous hazes that are detrimental to wine quality (Waters et al., 1998 ).
In silico analysis of EST frequencies (Fig. 6; Supplemental Table V) reveals the strong induction of an endochitinase (CTG1027444) during stage II pre- and post-berry softening and a class IV chitinase (CTG1027246) during postveraison development. These expression patterns are consistent with published measurements of enzyme activity and protein content in berries. Thus, chitinase activity has been observed to increase in proportion to the accumulation of soluble solids during ripening (Derckel et al., 1998 ), ultimately representing a substantial fraction of the total protein content of berries at industrial maturity (Derckel et al., 1998 ; Waters et al., 1998 ; Sarry and Gunata, 2004 ), although actual values depend on growth conditions and pathogen challenge during berry development (Monteiro et al., 2003 ).
Gene expression and enzyme activity of class IV chitinases have been observed in berry mesocarp at the onset of veraison, as well as in grapevine flowers (Robinson et al., 1997 ). Consistent with the former observation, these EST data reveal induction of transcript for a class IV chitinase in postveraison berries; however, homologous transcripts were not observed in EST libraries constructed from developing flowers (Supplemental Table V). Class I and class III chitinases have not been reported in developing berries, except upon pathogen challenge (Coviella et al., 2002 ), although they are well documented in cell culture and intact leaves following treatment with chemical elicitors of systemic acquired resistance (Busam et al., 1997 ; Aziz et al., 2003 ) and in response to fungal elicitors (Busam et al., 1997 ). The apparent absence of class I and class III chitinases during flower and berry development is also supported by EST frequency data analysis.
Nonspecific lipid transfer proteins (nsLTPs) are small apoplastic basic Cys-rich proteins that possess in vitro phospholipid transfer activity. nsLTPs belong to the PR-14 protein family (van Loon and van Strien, 1999 ), and they have been implicated in disease resistance as antimicrobial proteins (Ge et al., 2002 ) as well as in a proposed inducible defense mechanism (Kader, 1996 ; Broekaert et al., 1997 ; Jung et al., 2003 ). In addition to their induction in response to pathogen attack, nsLTPs accumulate in different plant organs during development (e.g. embryogenesis and in symbiotic root nodules of legumes) and in response to environmental cues (Thomas et al., 1994 ; Kader, 1996 ; Jung et al., 2003 ) and biotic/abiotic stresses (Jung et al., 2003 ). Specifically in grapes, nsLTPs were correlated with enhanced resistance to the causal agent of grape anthracnose Elsinoe ampelina (Jayasankar et al., 2003 ). In addition to their proposed function in disease resistance, nsLTPs may affect enological and edible properties of grape berries as they are implicated in detrimental proteinaceous haze formation in wines (Waters et al., 1998 ) and as plant-derived food allergens (Mills et al., 2002 ; Shewry and Halford, 2002 ). Together, the four contigs annotated as lipid transfer proteins in Table I were observed in all grape berry libraries but in neither flower library. Despite their apparent ubiquity during berry development, the level and tissue specificity of expression varied significantly between the different nsLTP homologs. Thus, CTG1027108 exhibited the second highest expression of any differentially expressed gene in stage II libraries but had low or undetectable expression in other tissues, similar to the expression pattern reported above for endochitinase (CTG1027444). Salzman et al. (1998) determined that a 9-kD nsLTP increases in grape berries at the onset of sugar accumulation, consistent with the high expression of CTG1027108 in stage II berry libraries. Interestingly, CTG1027108 is most closely related to a castor bean (Ricinus communis) nsLTP that has been observed in both peripheral tissues and seeds throughout castor bean development (Osafune et al., 1996 ). Lower levels and contrasting patterns of expression were observed for CTG1027465 and CTG1028016, for which transcripts were most abundant in stage I/preveraison and stage III berry libraries.
-Thionins or defensins are small basic Cys-rich proteins that are rapidly evolving and exhibit antimicrobial activity against a wide range of bacterial and fungal pathogens (Zhang and Lewis, 1997 ) and insect pests (Shade et al., 1994 ). Although defensin proteins accumulate in response to pathogens and their elicitors (Epple et al., 1997 ; Thomma et al., 2002 ), they can also be developmentally regulated and are frequently observed in plant reproductive tissues (Epple et al., 1997 ; Aluru et al., 1999 ) and are abundant in the nodules of Medicago truncatula (Graham et al., 2004 ). CTG1027862 (Supplemental Table V) encodes a putative -thionin that, in Cabernet Sauvignon, is expressed exclusively in ripening (stage III) berries. By contrast, CTG1027862 is expressed in preveraison, veraison, and ripening berries of cultivar Chardonnay (NCBI EST database), suggesting distinct patterns of gene expression between the two genotypes. Defensin proteins have not been previously reported from grapes.
A putative -1,3-glucanase protein (CTG1029513) is predicted to be differentially expressed during stage I of berry development. This contrasts with the reports of Jacobs et al. (1999) and Robinson et al. (1997) in which -1,3-glucanase activity was low or undetectable in preveraison berries. Although it is possible that transcript levels do not correlate with enzyme activity, this discrepancy could also be explained if developmental regulation of -1,3-glucanase expression is genotype dependent, as we suggest above in the case of defensin transcripts. Alternatively, unrecognized biotic or abiotic factors may contribute to the difference between these two studies. Enhanced transcription of -1,3-glucanases is well documented in grape berries in response to pathogens and their elicitors (Renault et al., 1996 ; Jacobs et al., 1999 ), and -1,3-glucanase activity was strongly induced in preveraison berries by exogenous application of the ethylene precursor ethephon (Jacobs et al., 1999 ).
Lipoxygenases (LOXs; E.C.1.13.11.12) are a class of iron-containing dioxygenases that catalyze the hydroperoxidation of lipids containing a cis,cis-1,4-pentadiene structure. CTG1027316, which is predicted to encode a LOX protein, is relatively highly expressed in both prebloom and bloom flowers and at low or undetectable levels in all stages of berry development. Among Arabidopsis LOX proteins, the CTG1027316 protein product is most closely related to AtLOX2, which is implicated in the production of oxylipin volatiles, including the plant hormone jasmonic acid. In addition to its induction in response to herbivory (Van Poecke et al., 2001 ), AtLOX2 is highly expressed in anthers of Arabidopsis (Mandaokar et al., 2003 ), where jasmonic acid plays an essential role in development (McConn and Browse, 1996 ). Based on the flower-enhanced expression of CTG1027316 and its close homology to AtLOX2, we speculate that CTG1027316 functions in grape anther development and as a potential mediator of volatile signal production in response to wounding by insects.
Stress-Induced/Cellular Redox Balance/Detoxification or Xenobiotic Transport Proteins
The most abundant group of abiotic stress-induced proteins predicted to be differentially expressed in flowers and berries were heat shock proteins (Hsp). Eight out of the 10 Hsp reported in Supplemental Table V were observed almost exclusively at the onset of veraison, with 82 of 84 occurrences (98%) in stage II green soft berries. By contrast, CTG1027432 was strongly expressed in flowers at bloom, while CTG1029041 was most strongly expressed in stage II green hard and ripening stage III berries. Most Hsp function as molecular chaperones that aid adaptation to a range of internal and external stresses (Sabehat et al., 1998 ; Iba, 2002 ; Wang et al., 2003 ). Although classically defined based on their induction in response to temperature stress (Sabehat et al., 1998 ; Iba, 2002 ), many Hsp are subject to developmental regulation. In tomato (Lycopersicon esculentum) and strawberry (Fragaria spp.), a plastid-localized Hsp (pTOM111) increased severalfold in ripening fruit and in response to heat stress. pTOM111 has been implicated in the reorganization of thylakoid membranes during the transition from chloroplasts to carotenoid-accumulating chromoplasts, an event that coincides with the onset of ripening (Lawrence et al., 1997 ). The small tomato Hsp vis1 (AAM96946) has been implicated in cell wall pectin depolymerization at the onset of tomato ripening (Ramakrishna et al., 2003 ). In particular, VIS1 is hypothesized to stabilize pectinases that might otherwise be denatured by elevated daytime temperatures observed under field conditions (Ramakrishna et al., 2003 ). The V. vinifera homolog of vis1 (CTG1028022) is up-regulated at the onset of veraison (berry softening). The observation of the up-regulation of numerous Hsp specifically at the onset of ripening is consistent with the large redirection of development and metabolism that occurs at this stage and the likely need to stabilize preexisting and newly synthesized proteins in a changing physiochemical environment.
Metallothioneins are small, Cys-rich proteins found in numerous organisms, including plants, fungi, and animals, where they are implicated in the detoxification of metal ions and reactive oxygen species as well as in the control of cellular redox potential. Conditions that promote oxidative stress are known to enhance metallothionein transcript accumulation (Navabpour et al., 2003 ), and the antioxidant properties of metallothioneins have been demonstrated recently (Akashi et al., 2004 ). Three putative metallothioneins were identified as differentially expressed based on EST frequency. The two most abundant contigs, CTG1027166 and CTG1027085, share 98% identity and may represent alleles of the same gene or close paralogs. Both genes are expressed throughout all stages of berry development, with transcripts most abundant in stage II berries and absent from the two flower libraries. Grip 24 (AJ237990), a probable allele of CTG1027166/CTG1027085 from cultivar Shiraz, encodes a ripening induced metallothionein that was shown by northern-blot analysis to be strongly up-regulated at the onset of ripening in grapes, although transcripts were also detected in earlier stages of berry development (Davies and Robinson, 2000 ; see also Table V). By contrast to CTG1027166/CTG1027085, the third metallothionein-encoding transcript (CTG1027501) is most highly expressed in flowers and stage I berries.
Thioredoxins constitute a group of small proteins involved in the regulation of the redox status of the cell (Gelhaye et al., 2004 ). They are involved in seed reserve breakdown during germination and cellular protection against oxidative stress. In addition, they act as electron donors for enzymes involved in protection against oxidative stress, such as peroxiredoxin and glutathione reductase. Thioredoxins possess a broad range of potential targets in plants, including many of the enzymes identified as differentially expressed during flower and berry development (e.g. endochitinases, nsLTPs, globulins, Hsp, aldolase, and enolase). CTG1027078 encodes a thioredoxin h homolog expressed throughout all stages of berry development and in post bloom flowers, with the highest levels of transcript observed in stage II green soft and stage III ripe berries. The fact that expression of CTG1027078 overlaps substantially with many of its potential target proteins is consistent with the possibility that CTG1027078 plays a central role in protein activation during berry development.
Plant peroxiredoxins are abundant and ubiquitous low-efficiency peroxidases (Dietz, 2003 ). As antioxidants, they protect against the reactive byproducts of photosynthesis and respiration, they modulate redox signaling during development, and they protect against oxidative stress (Dietz, 2003 ). A Vitis peroxiredoxin homolog (CTG1027597) was observed throughout all stages of berry development, with the highest occurrence in stage I berries. Peroxiredoxins are coupled to electron donors such as thioredoxin or glutaredoxin (Rouhier et al., 2004 ). The similarity of expression patterns between the Vitis homolog for thioredoxin h (CTG1027078) and peroxiredoxin (CTG1027597) supports the possibility that products from both transcripts are involved in similar processes. The closest homolog to the putative Vitis peroxiredoxin CTG1027597 was cloned from a poplar (Populus trichocarpa) xylem/phloem cDNA library (Rouhier, 2004 ). This thioredoxin- and glutaredoxin-dependent peroxidase is highly expressed in poplar plastids of the sieve tubes, where it is predicted to play a role in the regulation of the redox state of phloem cells (Rouhier, 2004 ).
CTG1027377 encodes a putative dehydrin. Dehydrin proteins have been reported in maturating seeds during desiccation (Nylander et al., 2001 ), in vegetative tissues treated with abscisic acid (Nylander et al., 2001 ; Parmentier-Line et al., 2002 ), and in plants exposed to environmental stress factors that result in cellular dehydration, such as water stress (Caruso et al., 2004 ) or cold stress (Caruso et al., 2004 ; Hara et al., 2004 ). In addition to seeds, dehydrins are also found among the major mesocarp proteins of ripe grapevine berries (Sarry and Gunata, 2004 ). The expression of CTG1027377, which is particularly elevated during stages II and III of berry development, may reflect coincident changes in water status, such as seed dehydration and osmotic stress in mesocarp tissue that results from hexose accumulation.
Polyphenol oxidases (PPO; E.C.1.14.18.1) catalyze the O2-dependent oxidation of monophenols and o-diphenols to o-quinones. These later compounds are highly reactive and mediate oxidative browning observed during plant organ senescence and in response to pathogen infection and wounding (Mayer and Harel, 1991 ). Preventing oxidative browning is a major concern in winemaking, particularly for white wine production (Boulton et al., 1998 ). CTG1026875 encodes a PPO that is predominantly expressed in prebloom flowers with low levels of transcript observed in other stages of flower and berry development. CTG1026875 is identical to Z27411 reported by Dry and Robinson (1994) , a predicted chloroplast-localized PPO that is primarily expressed in immature tissues. In chloroplasts, PPO is associated with the chloroplast lumen, where it may participate in the Mehler reaction, photoreduction of oxygen by photosystem I, and metabolism of reactive oxygen species in the plastids (Thipyapong et al., 2004a , 2004b ).
Transcripts Involved in Primary Metabolism
Several transcripts involved in primary metabolic pathways, such as those leading to CO2 assimilation, carbohydrate metabolism, and lipid and amino acid metabolism, are predicted to be differentially expressed. Taken together, the data described below highlight the complexity of metabolic changes taking place in distinct tissues and organs of flowers and berries at different developmental stages in grape.
Light and Dark Reaction of PhotosynthesisCO2 Assimilation
Rubisco was continually expressed throughout flower and berry development, although expression was significantly higher in floral tissues compared to berries. During early stages of development, Rubisco may be involved in light-mediated CO2 assimilation and refixation of CO2 released by respiration or other metabolic processes (Schwender et al., 2004 ), while after the onset of ripening, refixation of respiratory CO2 is likely to be the primary function (Blanke and Lenz, 1989 ; Aschan and Pfanz, 2003 ). Expression patterns derived from EST frequencies are consistent with the abundance of Rubisco in the berry mesocarp up to the onset of ripening, with trace levels after this point (Famiani et al., 2000 ). Other transcripts with predicted roles in photosynthesis are also highly expressed in flowers compared to berries. The expression patterns of CTG1027507, a putative photosystem II type I chlorophyll a/b-binding protein, is consistent with stages in flower-berry development where chlorophyll is abundant and flowers and berries fix CO2 (Blanke and Lenz, 1989 ). Similar patterns of expression were observed for an early light-induced-like protein (CTG1027439).
Carbohydrate Metabolism
Cytoplasmic Fru-bisphosphate aldolase (FPBA; E.C.4.1.2.13) is involved in the reversible glycolitic reaction leading to the formation of dihydroxyacetone and D-glyceraldehyde-3-P from Fru-1,6-biphosphate. D-Glyceraldehyde-3-P can be channeled to the remainder of the glycolitic pathway to generate, among other compounds, phosphoenolpyruvate, which can provide carbon skeletons to the phenylpropanoid-flavonoid pathway. The up-regulation of FBPA during stages I and III of berry development, when flavonoid biosynthesis is most active (Downey et al., 2003 , 2004 ), is consistent with this relationship. The expression patterns of FBPA transcript deduced from EST frequency data agree well with that reported by Famiani et al. (2000) , where FBPA protein was localized in the berry mesocarp using an immunohistochemical approach. In spite of the predicted high expression levels during ripening for CTG1037427, FBPA was not identified among the abundant proteins of the mesocarp of ripe berries (Sarry and Gunata, 2004 ) or in seeds after the onset of veraison (Famiani et al., 2000 ).
Enolase (E.C.4.2.1.11) participates in the conversion of the cytosolic pool of 3-phosphoglycerate to phosphoenolpryruvate (PEP). PEP can be channeled to aromatic amino acid biosynthesis via the shikimic acid pathway in the choloroplast stroma through a PEP:Pi transporter (Streatfield et al., 1999 ). Aromatic amino acids, in particular Phe, are the primary substrate for phenylpropanoid metabolism in the cytoplasm. The predicted up-regulation of an enolase encoding transcript during stages II and III of berry development may reflect an increased demand for PEP for different phenylpropanoids and flavonoids (e.g. proanthocyanidins and anthocyanins) that accumulate during these stages of berry development. Consistent with the expression pattern of CTG1027341, Sarry and Gunata (2004) used a proteomic approach to determine that several enolase isoforms are among the major proteins in the mesocarp of ripe grape berries.
The onset of ripening in grape berries is characterized by profound changes in the physiochemical properties of mesocarp cell walls. The most dramatic changes include the decrease in Gal/galactan content from pectins and the increased solubility of pectic polysaccharides (Nunan et al., 1998 ). These transformations are catalyzed by different cell wall modifying enzymes (Nunan et al., 2001 ; Ishimaru and Kobayashi, 2002 ). Among them, pectin methylesterase (PME) catalyzes the demethylesterification of homogalacturonan components of pectins in primary plant cell walls (Vorwerk et al., 2004 ). PME activity is higher during stages I and II of berry development and lower during ripening (Nunan et al., 2001 ). Despite contradictory reports (Ishimaru and Kobayashi, 2002 ), transcripts encoding a putative V. vinifera PME (VvPME1) were up-regulated soon after the onset of veraison (Nunan et al., 2001 ). In this study, VvPME1 transcript (CTG1028299) was detected in both stage II and stage III berry cDNA libraries but not classified as differentially expressed based on the R statistic (data not shown).
Inhibitor proteins often regulate the activity of enzymes involved in carbohydrate metabolism that are secreted out of the cytoplasmic compartment into the apoplast or to the vacuole (Juge et al., 2004 ). In addition, these proteinaceous inhibitors can also act against pathogen secreted cell-wall-degrading enzymes (Bellincampi et al., 2004 ). Examples of these proteinaceous inhibitors are invertase and PME inhibitors (Bellincampi et al., 2004 ). Plant PME and invertase inhibitors (PMEI) are members of a large family of proteins named PMEI-related proteins (PMEI-RPs; Hothorn et al., 2004 ). The molecular bases of target specificity and inhibitory mechanism are starting to be revealed (Hothorn et al., 2004 ). At the biochemical level, PMEI-RPs are likely to regulate the activity of cell wall invertases and PMEs; however, the precise biological role of PMEI-RPs during fruit ripening remains unknown. Moreover, PME activity can also be regulated by the expression of different isoforms and posttranslational mechanisms, such as changes in apoplastic pH. Consistent with a role for PMEI-RPs in PME regulation, in Actinidia chinensis fruit, the activity of PME opposes that of PMEI (Giovane et al., 1995 ). Interestingly, A. chinensis PMEI was only active against endogenous PME and not that secreted by fungal or bacterial pathogens. A putative invertase/PME inhibitor protein (CTG1029451) is highly represented in preveraison stage I berries but not detected during later stages of berry development. Inhibition of PME activity by PMEI may be essential to maintain high degrees of methylation in cell wall pectins, which are implicated in preformed resistance of the organ to fungal and bacterial pathogens (Le Cam et al., 1994 ). Inhibition of PME activity is also presumably important to prevent fruit softening during early development. Decreased PME and polygalacturonase activity induced by antisense repression was correlated with reduced pectin degradation but did not prevent fruit from softening (Tieman et al., 1992 ). However, fruit-specific expansins induced during ripening may also promote fruit softening independent of PME activity (Giovannoni, 2001 ). Interestingly, the predicted decrease in PMEI transcription contrasts with the up-regulation of a putative expansin (CTG1027667) during later stages of berry development.
Another potential target for PMEI is invertase, which functions in the degradation of apoplastic or vacuolar Suc into the monosaccharides Glc and Fru. Vacuolar invertase activity peaks around veraison, remaining at similar levels throughout stage III (Davies and Robinson, 1996 ; Dreier et al., 1998 ). The vacuolar invertase protein is found at all stages of development both in seeds and mesocarp cells (Famiani et al., 2000 ). In this study, two vacuolar invertase homologs (CTG1028100 and CTG1029156) were observed in developing berries but not scored as differentially expressed. Invertase leakage from vacuoles after the onset of ripening is believed to promote increased Suc flow and hexose accumulation in the berry mesocarp cells (Dreier et al., 1998 ). The activity of a specific invertase inhibitor (e.g. CTG1029451), operating in conjunction with cellular compartmentation, could explain the lack of invertase activity on Suc in early stages of berry development.
Two closely related contigs (CTG1027410 and CTG1026987) encode the putative transcription factor VvMSA, which is implicated in regulation of the hexose transporter VvHT1 (AF281656; Atanassova et al., 2003 ). CTG1027410 and CTG1026987 are highly represented in all berry libraries, in agreement with the report of Atanassova et al. (2003) , with the most frequent occurrence in ripening (stage III) berries. By contrast, only low levels of expression were observed in flowers. Consistent with this pattern of expression, VvHT1 is one of the most abundant proteins in the mesocarp of ripe berries (Sarry and Gunata, 2004 ).
D-Myo-inositol-3-P synthase (MIPS; EC.5.5.1.4) catalyzes the first step in the synthesis of myo-inositol by converting D-Glc-6-P to D-myo-inositol-3-P (Loewus and Murthy, 2000 ). Different myo-inositol phosphates are involved in several aspects of cell and plant biology, acting as signal transduction molecules, osmoprotectants, cell wall constituents, mineral nutrient storage molecules, and in auxin metabolism (Loewus and Murthy, 2000 ; Hegeman et al., 2001 ). Contig CTG1027626 encodes a putative MIPS that is predicted to be differentially expressed during flower and stage I of berry development. Although the role of MIPS in flowers is not well understood, myo-inositol content is high in flowers of several species (Ichimura et al., 1999 , 2000 ). Application of myo-inositol inhibited bloom in roses (Rosa hybrida; Ichimura et al., 1999 , 2000 ), where it is proposed to function as an osmolite. In addition, both alkaline phytase and myo-inositol 1-P synthase occur in pollen grains (Loewus et al., 1984 ; Barrientos et al., 1994 ), and myo-inositol in particular is proposed as a nutritional substrate during pollen tube germination and growth (Kroh et al., 1970 ).
Lipid Metabolism
Grapevines accumulate considerable amounts of waxes after the onset of veraison (Rogiers et al., 2004 ). The accumulation of epicuticular waxes may enhance resistance to fungal pathogens such as B. cinerea (Comménil et al., 1997 ; Gabler et al., 2003 ) while simultaneously decreasing water loss (Rogiers et al., 2004 ) and mitigating the detrimental effects of solar UV radiation (Engeseth et al., 1996 ). Biosynthesis of waxes begins with the formation of fatty acids in the plastids and continues with fatty acid elongation to very long chains (C24-C34) that are ultimately processed into alkanes, secondary alcohols, ketones, primary alcohols, and wax esters (Engeseth et al., 1996 ). Very long chain fatty acid wax precursors and their derivatives are highly hydrophobic. Two protein families have been implicated in controlling the cytoplasmic pool of very long chain fatty acids: fatty acid binding proteins and acyl-CoA binding proteins. Cytosolic acyl-CoA binding proteins bind long-chain acyl-CoAs (Engeseth et al., 1996 ) and are believed to play a role in the intracellular transport and formation of cytoplasmic acyl-CoA pools (Knudsen et al., 1999 ). Contig CTG1027236, which encodes a putative acyl-CoA binding protein, is up-regulated at the onset of veraison (stage IIgs) and in ripening berries. Because this pattern of transcript accumulation correlates with stages of active wax accumulation during ripening, CTG1027236 is a candidate gene for this process.
Grape seed oil is rich in polyunsaturated fatty acids (PUFAs), accounting for up to 17% of the seed fresh weight. The major PUFA (72%76%) is linoleic acid ( 6, C18:2; Cao and Ito, 2003 ). An additional 10% to 13% of PUFA content is the monounsaturated oleic acid (C18:1), with the remainder being saturated fatty acids, such as palmitic acid (C16:0, approximately 3.5% to 5%) and stearic acid (C18:0, approximately 1.5% to 2.3%). -6-Fatty acid desaturase catalyzes the addition of a double bond to position 12 of oleic acid, yielding linoleic acid (Jin et al., 2001 ). Contig CTG1028129, which encodes a putative -6-fatty acid desaturase, is first observed at the onset of veraison (stage IIgs), with the highest transcript levels attained in ripening berries (stage III). This expression pattern is in contrast to that of a grapevine seed-specific acyl-CoA carboxylase described by Walker et al. (1999) .
Amino Acid Biosynthesis
Cytoplasmic Asp aminotransferase (AAT; E.C. 2.6.1.1.), also known as Glu-oxaloacetate transaminase, catalyzes the reversible interconversion of Glu into Asp by transamination of either oxaloacetate or -ketoglutarate. AAT protein content in the mesocarp and seeds of grape berries was shown to increase significantly during stages I and II of berry development, with protein remaining unchanged throughout the remaining stages of development (Famiani et al., 2000 ). CTG1027149 encodes a putative AAT that is highly expressed during stages I and II of berry development but absent from later-stage cDNA libraries. Thus, there is good agreement between the increase in AAT protein (Famiani et al., 2000 ) and CTG1027149 transcript, while the absence of CTG1027149 transcript later in development may indicate reduced turnover rates for ATT protein in ripening berries or the presence of a paralogous gene expressed late in berry development.
Transcripts Involved in Secondary Metabolism
Grapevine berries accumulate a wide range of secondary metabolites throughout development, often in a tissue-specific manner. Among the most abundant of these natural products are various phenylpropanoids, including hydroxycinnamic acids (e.g. caffeic acid), flavonoids, and stilbenoids. In the case of flavonoids, proanthocyanidins (also known as condensed tannins), anthocyanins, and flavonols are among the most abundant and biologically important groups. Different branches of the flavonoid pathway are temporally and spatially activated throughout berry development (Boss et al., 1996 ). For example, while proanthocyanidins in the berry skin are synthesized preveraison and postveraison (Downey et al., 2003 , 2004 ), anthocyanin pigment biosynthesis and accumulation occur soon after the onset of fruit ripening. This EST frequency analysis identifies several transcripts encoding structural enzymes of the general phenylpropanoid and flavonoid pathways that are differentially expressed at various stages of flower and berry development.
Chalcone-flavanone isomerase (CHI; E.C.5.5.1.6) catalyzes the reversible isomerization of naringenin chalcone to naringenin flavanone, a key step in the biosynthesis of flavonoids (Dixon and Steele, 1999 ). CTG1026968 encodes a putative CHI that is expressed strongly in flowers and stage I berries as well as at the onset of veraison in stage II green soft berries. This bimodal pattern of expression agrees well with that reported by Boss et al. (1996) both for anthocyanin accumulation and gene expression.
Cytochrome b5 proteins are components of electron transport systems found in animals, plants, and yeast. The cytochrome b5 homolog, DIF-F, has been implicated as an alternative electron donor for the cytochrome P450 monoxygenase flavonoid 3',5'hydroxylase (De Vetten et al., 1999 ), a key structural enzyme of the flavonoid pathway. The activity of flavonoid 3',5'hydroxylase is implicated in the formation of dihydromyricetin, a trihydroxylated dihydroflavonol, and derived anthocyanic pigments such as delphinidin, petunidin, and malvidin. In petunia (Petunia hybrida) flowers, altered expression of the cytochrome b5 DIF-F resulted in novel flower pigmentation (De Vetten et al., 1999![]() |