|
|
||||||||
|
First published online March 16, 2006; 10.1104/pp.105.075390 Plant Physiology 141:32-46 (2006) © 2006 American Society of Plant Biologists Proteomic Analysis of Seed Filling in Brassica napus. Developmental Characterization of Metabolic Isozymes Using High-Resolution Two-Dimensional Gel Electrophoresis1,[W]Department of Biochemistry, Life Sciences Center (M.H., J.E.C., K.E.H., G.K.A., J.J.T.) and Computer Science Department (Z.S.), University of Missouri, Columbia, Missouri 65211
Brassica napus (cultivar Reston) seed proteins were analyzed at 2, 3, 4, 5, and 6 weeks after flowering in biological quadruplicate using two-dimensional gel electrophoresis. Developmental expression profiles for 794 protein spot groups were established and hierarchical cluster analysis revealed 12 different expression trends. Tryptic peptides from each spot group were analyzed in duplicate using matrix-assisted laser desorption ionization time-of-flight mass spectrometry and liquid chromatography-tandem mass spectrometry. The identity of 517 spot groups was determined, representing 289 nonredundant proteins. These proteins were classified into 14 functional categories based upon the Arabidopsis (Arabidopsis thaliana) genome classification scheme. Energy and metabolism related proteins were highly represented in developing seed, accounting for 24.3% and 16.8% of the total proteins, respectively. Analysis of subclasses within the metabolism group revealed coordinated expression during seed filling. The influence of prominently expressed seed storage proteins on relative quantification data is discussed and an in silico subtraction method is presented. The preponderance of energy and metabolic proteins detected in this study provides an in-depth proteomic view on carbon assimilation in B. napus seed. These data suggest that sugar mobilization from glucose to coenzyme A and its acyl derivative is a collaboration between the cytosol and plastids and that temporal control of enzymes and pathways extends beyond transcription. This study provides a systematic analysis of metabolic processes operating in developing B. napus seed from the perspective of protein expression. Data generated from this study have been deposited into a web database (http://oilseedproteomics.missouri.edu) that is accessible to the public domain.
Brassica napus (also known as rape and oilseed rape) is the third largest oilseed crop in the world, providing approximately 13% of the world's supply of vegetable oil. B. napus seeds produce oil and protein as the main storage compounds out of the three principal storage reserves (proteins, oil [triacylglycerols], and carbohydrates [starch]) found in plant seeds (Norton and Harris, 1975
Biochemical and molecular studies are beginning to define the biosynthetic pathways responsible for accumulation of these storage components in B. napus seed (Rawsthorne, 2002
Impressive achievements in genome and cDNA sequencing have yielded a wealth of information for many model organisms, including the flowering plants Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa; Arabidopsis Genome Initiative, 2000
In recent years, proteomics has been applied to investigate seed germination and seed development using a proteomics workflow including Medicago truncatula (Gallardo et al., 2003 Here we have conducted an in-depth analysis of B. napus (cv Reston) during five sequential stages of seed filling using 2-DE coupled with MS (MALDI-TOF and LC-MS/MS), with the main objective to characterize metabolic networks. We present high-resolution reference maps of pH 3 to 10 and pH 4 to 7, along with expression profiles of 794 protein spots that reveal 12 principal expression trends during seed development. Using both MALDI-TOF and LC-MS/MS, the identity of 517 protein spots were obtained representing 289 nonredundant proteins. One of the surprising findings of this proteomic study is the preponderance of proteins related to metabolism and energy production. We present and discuss the regulation of these metabolic networks at the protein level, by mapping the protein components and their expression profiles on to the pathways of carbon assimilation. Data generated from this study have been deposited into a database (http://oilseedproteomics.missouri.edu) that is accessible to the public domain.
Characterization of Developing B. napus Seed
Developing B. napus (var. Reston) seeds were staged precisely at 2, 3, 4, 5, and 6 weeks after flowering (WAF), the period when major metabolic changes occur within the embryo. At each developmental stage, seed fresh weight and protein content were measured (Fig. 1
). At 2 WAF the liquid endosperm dominates while the embryo accounts for a small portion of the seed. The embryo begins to take significance within the seed at 3 WAF. This can be estimated by the pale green appearance of seeds at 2 and 3 WAF with increasing darker portion representing the developing embryo. At 4, 5, and 6 WAF, the embryo accounts for most of the seed mass and the seed coat begins to senesce, as visualized by the change in color, when approaching the desiccation stage (Bewley and Black, 1994
Seed fresh weight increased from 2 until 5 WAF followed by a decrease in weight at 6 WAF, indicating that developing seeds enter into the desiccation phase after 5 WAF. This is in agreement with previous studies on B. napus seed development where the authors concluded that the aqueous soluble fraction increased to a maximum at 5 WAF and then declined (Norton and Harris, 1975 To further characterize developing B. napus seed, seed fatty acids (FAs) were quantified by gas chromatography (GC; Table I ). Accumulation of total FAs increased significantly during seed development (Fig. 1) and by 6 WAF accounted for over 20% of the seed dry mass. Compositional analysis revealed major fluctuations in the individual FA species (Table I). Twelve different FAs accounted for nearly all of the acyl chains present in developing B. napus seed. Interestingly, the accumulation of oleic acid (18:1) and eicosenoic acid (20:1) showed maximum abundance at 5 WAF, while erucic acid (22:1) peaked later at 6 WAF. This is not surprising, given that 18:1 is the precursor for elongation to 22:1, the prominent FA in B. napus Reston.
Medium-Range Isoelectric Focusing Is Necessary for Reduction of Spot Overlap in Two-Dimensional Gels of Developing B. napus Seed
Initial 2-DE analysis of proteins from B. napus seed was performed using an isoelectric focusing (IEF) range of pH 3 to 10. As seen in Figure 2A
, the region between pH 4 to 7 is protein rich and overlapping spots are prevalent. Use of medium pH gradients is one effective way to reduce spot overlap (Campostrini et al., 2005
Protein Quantification Established Developmental Expression Profiles for 794 Protein Spots For accurate determination of protein expression, the five developmental stages of B. napus seed development were analyzed in biological quadruplicate (Fig. 3A ). Four sample harvesting events were followed by four independent protein extractions. This naturally resulted in both biological and technical variation in the quantification results that is noted by the SDs presented in Supplemental Tables I and II. It was noticed that low abundance spots, as well as those spots present in only two biological replicates, typically exhibited the highest level of variation. To select only high-quality protein spots for expression profiling, the following threshold criteria were applied. Each protein spot was present in at least three biological replicate gels and detected in at least two developmental stages. Using this approach, a total of 794 protein spot groups were matched, manually validated, and quantified (Supplemental Table I). Moreover, coefficient of variation (CV) value was calculated to allow direct comparison of significance in acquired quantification data (Supplemental Table I).
Seed Storage Proteins Can Be Subtracted in Silico to Eliminate the Bias of Highly Expressed Proteins Seed storage proteins (SSPs) are highly abundant at 4, 5, and 6 WAF. As these proteins accumulate, the relative volume of other spots concomitantly decrease, which may bias composite and individual expression profiles. To test this hypothesis, all spots corresponding to SSPs were subtracted in silico and expression of the remaining spots were renormalized (Supplemental Table II). To view possible influence of in silico SSP subtraction, the composite expression profiles (discussed below) were created using both original and SSP-subtracted datasets (Fig. 6). Although the shape of expression trends was not greatly affected, slight changes were observed. For instance, maximum accumulation of photosynthetic proteins occurred at 3 WAF. After SSP removal, the peak moved to 4 WAF.
Combination of MALDI-TOF MS and LC-MS/MS Resulted in 517 High-Confidence Protein Assignments In theory, B. napus is a nonideal organism for proteomic analyses, due to the lack of comprehensive sequence databases. The National Center for Biotechnology Information (NCBI) nonredundant protein database contained only 2,207 entries for Brassica species, as of March, 2005. The largest database for B. napus is available at The Institute for Genomic Research (TIGR), containing 5,568 tentative consensus entries. Due to these limitations, we decided to acquire MS data in duplicate using two different mass spectrometers and to perform comprehensive data mining using both NCBI and TIGR databases. Figure 3B gives an overview of data acquisition and data mining strategy. Altogether 794 spots were excised from 2-DE gels of pH 4 to 7 and pH 3 to 10 (pH 710 region only), digested with trypsin, and analyzed by MALDI-TOF MS and LC-MS/MS in parallel. Peptide mass fingerprint (PMF) analysis using MALDI-TOF MS resulted in 175 protein assignments, while LC-MS/MS resulted in 479 assignments. Protein assignments by MALDI-TOF and LC-MS/MS were integrated to reveal 137 identical and 46 nonidentical assignments. In total, 517 assignments were made (65.1% identification efficiency), of which 289 were nonredundant proteins (Supplemental Table III).
It was previously reported that MALDI-TOF and LC-MS/MS are equally efficient at protein identification when comprehensive databases are available (Lim et al., 2003
In general, database annotations infrequently provide information about intracellular localization. Since enzymes may have different functional roles depending upon subcellular location, proteins that may potentially have multiple intracellular isoforms were analyzed by three different algorithms to predict subcellular targeting. For example, in this study, six protein spots were identified simply as malate dehydrogenase (MDH). In this case, MDH can be located either in the cytosol where it is a part of gluconeogenesis, in the mitochondria where it is involved in the TCA cycle, or in the plastids where it is involved in redox shuttling or the photosynthetic C4 cycle. Applying subcellular localization algorithms, it was suggested that four spots (884, 5,218, 5,222, and 5,225) are cytosolic MDH, and the remaining two spots (5,185 and 5,292) are mitochondrial MDH isoforms. However, it should be noted that subcellular assignments based upon localization algorithms are at best predictive, experimental evidence is required for confirmation. A large percentage of proteins identified in this study were annotated as unknown. At an early stage of this investigation, unknown proteins accounted for as much as 38% of all the identified proteins (data not shown). Although some databases provide current annotation information about sequence homology/similarity in the identifier tag, many do not. Therefore, all assignments lacking this information were subjected to homology search using BLASTP against the NCBI nonredundant database. This manual validation strategy reduced the percentage of unknown proteins from 38% to 2.1%.
All identified proteins were classified into functional classes as originally established by Bevan et al. (1998)
Hierarchical Clustering of 794 Quantified Spot Groups Resulted in 12 Cluster Groups Hierarchical clustering reduced 794 expression profiles into 12 expression cluster groups (Fig. 5 ). Visual inspection of these expression groups suggests diverse and complex patterns of regulation. The two most abundant groups, 3 and 11, had negative slopes, indicating decreasing abundance with seed age. These groups were largely composed of proteins involved in energy and protein destination and storage, representing 21.2% and 20.1% of proteins, respectively (Fig. 5). The third most abundant group, cluster 10, representing 13.7% of proteins, showed a positive slope during seed filling; the majority of these proteins were involved in protein destination and storage.
Figure 5 also revealed that the distribution of plant functional classes in protein expression clusters is not homogenous. For instance, proteins involved in protein destination and storage were mainly grouped in clusters 10 and 11 with negative and positive expression slopes, respectively. This is due to the heterogeneous composition of this functional class that contains SSPs, which increase during seed fill, but also proteins involved in folding and stability that decreased with seed age (Supplemental Table III).
To characterize cellular activities during seed filling, composite expression profiles were established for individual subclasses. For statistical reasons, only those functional subclasses containing 10 or more proteins were considered for analysis. Based upon these criteria, 384 of 517 identified proteins (74.7%) were grouped into 13 functional subclasses (Fig. 6 ). The most prevalent subclass of proteins in terms of relative spot volume, storage proteins (71 proteins), gradually increased in abundance beginning at 3 WAF. The second most abundant subclass, amino acid metabolism (42 proteins), decreased in abundance from 2 WAF until midpoint of seed filling and remained at a constant level thereafter. Relative abundance of detoxification proteins, the third most abundant group with 36 proteins, rapidly increased beginning at 5 WAF. Proteins related to photosynthesis are the fourth abundant cluster and revealed a gradual increase in relative abundance until 3 WAF, followed by a slight decrease. Collectively, these functional subclasses exhibited three different expression trends. The first group included proteins expressed mainly at early stages of seed filling. These proteins are involved in glycolysis, respiration, metabolism of sugars, signal transduction, metabolism of amino acids, proteolysis, and defense (Fig. 6). Proteins of the second group, involved in photosynthesis and lipid metabolism, exhibited highest expression at midpoint of seed filling. Finally, detoxification, seed maturation, and SSPs were highly abundant at later stages of development (Fig. 6). Although perhaps overly simplistic, the expression trends agree with previous observations that seed filling in B. napus begins with sugar mobilization, and is followed by sequential surges in amino acid, lipid, and storage protein synthesis (Fig. 6).
Within the larger goal of profiling protein expression globally in developing B. napus seeds, the aim of this study was to characterize the metabolic pathways operating during seed filling. The major economic value of B. napus lies in its oil, and any insight into the regulation of oil accumulation during seed filling would be useful. Therefore, we focus our discussion on the pathways leading to de novo FA synthesis. However, it is important to mention that this study has also found protein components for a number of other pathways (Supplemental Table III). For instance, 8% and 5% of the total identified proteins are involved in amino acid metabolism and proteolysis, respectively. Interestingly, the majority of proteins identified here were represented by multiple isoelectric forms, suggestive of posttranslational modification. Thus, as genome resources of Brassica crops improve, the high-resolution 2-DE maps reported here can be used as a predictive tool to search for unexpected isoelectric species to begin uncovering posttranslation regulation.
As demonstrated in this study, the main advantage of 2-DE is the detection of different isoelectric species of various proteins. For example, 17 isoelectric species were detected for the glycolytic enzyme Fru bisphosphate aldolase (FBA). This is an unusually high number of isoelectric species, which is strongly suggestive of posttranslational modification(s). This type of information cannot be acquired using transcriptomic or metabolomic approaches. Although adept at resolving isoelectric species, the technique of 2-DE is somewhat restricted at quantifying low abundance proteins. For instance, Glc 6-P dehydrogenase, transketolase, transaldolase, ribulose 5-P epimerase, and Rib 5-P isomerase were analyzed in a recent study of enzyme activities of oxidative pentose phosphate pathway in developing B. napus embryos (Hutchings et al., 2005
Eleven Rubisco large subunits were detected and can be divided into two groups, based on their peak of accumulation. The first group showed maximum abundance at early stages of seed filling (2 or 3 WAF) and includes seven protein spots (511, 520, 536, 4,885, 4,931, 4,937, and 5,009). The second group (spots 4,919, 4,922, and 4,924) with higher protein abundance at midpoint of seed filling (4 WAF), showed bell-shape expression profiles, very similar to the data acquired for pyruvate dehydrogenase (PDH; spots 5,148 and 5,811; Fig. 7
). Interestingly, expression profiles of the second group (spots 4,919, 4,922, and 4,924) and PDH (spots 5,148 and 5,811) are also very similar to the composite expression profile of enzymes involved in lipid metabolism (Fig. 5). A similar increase in the Rubisco small subunit was also reported in Arabidopsis by microarray analysis (Ruuska et al., 2002
The high abundance of Rubisco subunits is in contrast with low abundance of other enzymes of the Calvin cycle, many of which were below the detection limit of this proteomics study. A possible explanation for this disparity in protein abundance may be found in a recent stable isotope labeling study of B. napus embryos (Schwender et al., 2004a
An important component of carbon assimilation in developing seeds is glycolysis. Although this ubiquitous pathway was first elucidated in the 1940's (Meyerhof and Junowicz-Kocholaty, 1943 This study localized several protein spots corresponding to numerous different glycolytic enzymes both in the cytosol and plastids. Based upon quantification data acquired during seed development, it is possible to examine the apparent redundancy of glycolytic pathways between the cytosol and plastids. Suc synthase (SuSy) catalyzes the initial release of sugar for glycolysis by converting Suc into UDP-Glc and Fru. A total of three SuSy spots (202, 204, and 4,628) were identified (Fig. 7). The overall abundance of all three detected protein spots is similar although their expression profiles differ. While two SuSy spots (spots 202 and 204) shared almost identical expression profile with maximum abundance at 3 WAF, the abundance of a third form (spot 4,628) reached a maximum at 5 WAF followed by a dramatic decrease thereafter. This suggests the presence of two types of SuSy that are perhaps active during early (type I) and late (type II) phases of seed filling.
UDP-Glc pyrophoshorylase (UGP) catalyzes the reversible production of Glc-1-P from UDP-Glc. In Arabidopsis there are two homologous UGP genes located on two different chromosomes (Kleczkowski et al., 2004
Phosphoglucomutase (PGM) catalyzes the interconversion of Glc-1-P and Glc-6-P. The Arabidopsis genome contains two cytosolic and one plastidial form of PGM (Caspar et al., 1985
In Arabidopsis two isozymes of phosphoglucose isomerase (PGI) exist, one in the plastids and the other in the cytosol (Caspar et al., 1985 FBA catalyzes the aldol cleavage of Fru 1,6-bisP to glyceraldehyde 3-P (GAP) and dihydroxyacetone phosphate (DHAP). Surprisingly, a relatively large number of cytosolic and plastidial FBA spots were identified (Fig. 7). Most of the nine cytosolic spots and eight plastidial spots shared similar expression profiles, high abundance at early stages followed by rapid decrease between 3 and 4 WAF. However, two differences between cytosolic and plastidial FBA can be noted; the cytosolic forms were generally more abundant and only two cytosolic FBA (spots 5,157 and 5,189) peaked in expression at 4 WAF. The high number of protein spots suggests these activities may be posttranslationally modified. Triose-P isomerase (TPI) catalyzes the interconversion of GAP and DHAP. This reaction is reversible although the equilibrium favors DHAP. Seven cytosolic spots of TPI (spots 1,156, 5,515, 5,520, 5,524, 5,525, 5,528, and 5,530) were identified, but no plastidial TPI spot could be detected (Fig. 7). Interestingly, one protein spot (spot 5,515) showed maximum abundance at 6 WAF (Fig. 7). Glyceraldehyde 3-P dehydrogenase (GAPDH) reversibly catalyzes the conversion of GAP into 1,3-bis PGA. Five cytosolic spots (spots 821, 1,427, 5,184, 5,186, and 5,206) and one plastidial (spot 820) GAPDH were identified. The cytosolic and plastidial GAPDH were almost equally abundant during seed filling and shared very similar expression profiles (Fig. 7). Two spots of cytosolic phosphoglycerate kinase (PGK; spots 5,144 and 5,145) and two plastidial PGK (5,054 and 5,055) were identified. Like TPI, only cytosolic forms of 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (iPGAM) and enolase were identified (Fig. 7). Expression profile of iPGAM (spot 4,747) was present in abundance at 2 WAF. Expression profiles of enolases (spots 4,898 and 4,903) were also high at 2 WAF, but they accumulated strongly until 4 WAF followed by a rapid decrease in abundance. The detection of multiple isoelectric species for cytosolic and plastidial glycolytic enzymes and strong similarities in their expression profiles suggest possible posttranslational modifications as well as coordination between cytosolic and plastidial glycolysis during seed filling.
The current model of metabolite flux between cytosol and plastids has established that either phosphoenolpyruvate (PEP) or pyruvate is transported into plastids for further processing into acetyl-CoA (Weber, 2004
In this study, we identified almost all enzymes involved in cytosolic and many for plastid glycolysis. The notable exceptions are cytosolic pyruvate kinase, plastid iPGAM, plastid enolase, and plastid TPI (Fig. 7). One possible explanation is low expression levels, which is supported by the observation that plastidial iPGAM and enolase were previously determined to have low specific activities (Eastmond and Rawsthorne, 2000
Conversion of acetyl-CoA into malonyl-CoA is catalyzed by acetyl-CoA carboxylase. This plastid complex is comprised of four subunits, the biotin carboxylase, biotin carboxyl carrier protein, and carboxyltransferase subunits ( In summary, this investigation represents a systematic proteomics study of whole-seed proteins expressed during seed filling in B. napus. Multiple categories of proteins were observed, although protein storage, energy, and metabolism associated proteins were most abundant. The preponderance of metabolic proteins presented a unique opportunity to map activities (and isoelectric species therein) for carbon assimilation. Surprisingly, carbon flow from Suc to acetyl-CoA could be entirely predicted based upon the representation of proteins for each enzymatic step. The expression levels of cytosolic pyruvate kinase, plastid enolase, and most of the enzymes of the Calvin cycle were below the detection limit of this proteomics study, except Rubisco and phosphoribulokinase that were both highly expressed. Thus, carbon flow from Suc appears to primarily follow a cytosolic glycolytic track until PEP, at which point carbon is likely imported into plastids and converted into pyruvate and acetyl-CoA for de novo FA synthesis.
Plant Material and Growth Conditions
B. napus (cv Reston) was grown in a growth chamber (16-h light/8-h dark cycle, 23°C day/20°C night, 50% humidity and light intensity of 8,000 LUX). Flowers were tagged upon opening and the developing seeds were collected at precisely 2, 3, 4, 5, and 6 WAF, in the middle of a light cycle. The dry weight and total protein content were measured at each developmental stage. Total protein was quantified using the dye-binding Coomassie protein assay using chicken
Developing seeds of B. napus at 2, 3, 4, 5, and 6 WAF were divided to three test glass tubes per stage (510 seeds per tube) and dried at 80°C overnight. After dry weight determination, 1 mL of 14% boron trifluoride was added to each tube along with 17:0 FA standard in toluene (0.5% of dry mass exactly). Total volume of toluene was brought to 150 µL and samples were incubated at 95°C for 90 min, with vortexing every 10 min. After incubation, samples were cooled to room temperature. To each tube, 1 mL of water and 3 mL of hexane were added. Tubes were vortexed and centrifuged at 3,000 rpm for 5 min. Top phase was removed and transferred to a new conical glass tube. Samples were reextracted with additional 3 mL of hexane, dried under nitrogen stream, and resuspended in 400 µL of hexane before analysis by GC. Analysis of FA was carried on Agilent Technologies model 689N Network GC system gas chromatograph with a DB-23 column (30 m x 0.25 mm; film thickness 0.25 µm; Agilent 1222,332). The GC conditions were: injector temperature and flame ionization detector temperature, 250°C; running temperature program, 150°C for 1 min, then increasing at 2°C/min to 200°C followed by a 5 min hold at 200°C.
Total protein was isolated from developing seed and subjected to 2-DE as described previously (Hajduch et al., 2005
Coomassie G-250 (colloidal) stained gels were imaged by scanning densitometry. Digitized 2-DE images (300 dpi, 16-bit grayscale pixel depth) of five developmental stages in biological quadruplicate were analyzed using ImageMaster 2-D Platinum software (version 5.0, GE Healthcare) as described previously (Hajduch et al., 2005
is the average of relative volumes (x) of spots in biological quadruplicate analysis and n is the sample size (four in case of biological quadruplicate). To subtract SSP in silico, spot volumes were calculated for each of 794 protein spots using ImageMaster 2-D Platinum software. In total, 71 identified SSP were removed from the dataset and two-step normalization approach was applied as described above.
For cluster analysis of expression profiles, hierarchical clustering was performed using SAS statistical software (SAS Institute). The procedure contained two steps. First, the program established the number of classes that is best for a present dataset. The CLUSTER keyword was used with options STANDARD METHOD=AVERAGE CCC PSEUDO as the command for step 1, in which STANDARD means to normalize the variables; AVERAGE means a certain clustering method in contrast to the other 10 methods that are included in SAS IDE; CCC and PSEUDO are both options for calculating some statistical variables that are used to determine the class number. Second, the program clustered expression profiles into each of the established classes. Expression profile data were normalized in two steps. In the first step, any zero between two nonzero points was replaced with the average of two neighbor values. In the second step, a linear transformation was used to normalize expression profiles of different spots to uniform scale. The SAS program used the procedure FASTCLUS for the real clustering and the maximum number of clusters established earlier. For each spot, the variable distance parameter was generated by SAS.
Arraying of 2-DE gel spot, in-gel digestion, and C18 microbed chromatograph were each performed as described previously (Hajduch et al., 2005
Searches against the NCBI (ftp://ftp.ncbi.nih.gov/blast/) nonredundant database (as of March, 2005) and TIGR tentative consensus database for B. napus (http://www.tigr.org/tigr-scripts/tgi/T_index.cgi?species=oilseed_rape) were independently performed using a two-step approach to mine maximum information from MALDI-TOF MS and LC-MS/MS. PMF-based protein identification was performed on local copy of version 3.2.1 of the MS-Fit program of Protein Prospector (http://prospector.ucsf.edu; Clauser et al., 1999 This approach resulted in four independent sets of protein identification data: (1) MALDI-TOF, TIGR search; (2) MALDI-TOF, NCBI search; (3) LC-MS/MS, TIGR search; and (4) LC-MS/MS, NCBI search. To reach consensus in protein assignments, a two-step data reduction strategy was employed. The first step combined the search results from TIGR and NCBI databases for each MS method. If two different protein assignments for one protein spot were noted, the one with the highest number of matching peptides was taken. If number of matching peptides was the same, the assignment with the highest coverage was taken. The second step combined the integrated protein assignments assigned by MALDI-TOF and MS/MS (from step 1). If two different protein identifications were assigned, preference was given to MS/MS-based protein assignment. Assignments annotated as unknown and without specific homology/similarity descriptions in identifier tag were BLASTP searched against the NCBI nonredundant database (as of March, 2005) to further query their homology. This study uses terminology as follows: homology for results, where E-value of BLAST search was 0.0; in all other cases similarity is used.
Subcellular localizations of assigned proteins were predicted using three independent programs: TargetP (http://www.cbs.dtu.dk/services/TargetP/; Emanuelsson et al., 2000
All data from this investigation are available from the oilseed proteomics server (http://oilseedproteomics.missouri.edu). Programming for the web database was performed, as described previously (Hajduch et al., 2005 Received December 10, 2005; returned for revision February 23, 2006; accepted February 25, 2006.
1 This work was supported by the National Science Foundation-Plant Genome Research Program Young Investigator Award (grant no. DBI0332418).
2 Present address: Institute of Plant Genetics and Biotechnology, Slovak Academy of Sciences, 95007 Nitra, Slovak Republic. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Jay J. Thelen (thelenj@missouri.edu).
[W] The online version of this article contains Web-only data. Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.105.075390. * Corresponding author; e-mail thelenj{at}missouri.edu; fax 5738849676.
Aebersold R, Mann M (2003) Mass spectrometry-based proteomics. Nature 422: 198207[CrossRef][Medline] Agrawal GK, Yonekura M, Iwahashi Y, Iwahashi H, Rakwal R (2005) System, trends and perspectives of proteomics in dicot plants Part I: technologies in proteome establishment. J Chromatogr B 815: 109123 Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796815[CrossRef][Medline] Bassham JA, Benson AA, Kay LD, Harris AZ, Wilson AT, Calvin M (1954) The path of carbon in photosynthesis XXI: the cyclic regeneration of carbon dioxide acceptor. J Am Chem Soc 76: 17601770[CrossRef][ISI] Bevan M, Bancroft I, Bent E, Love K, Goodman H, Dean C, Bergkamp R, Dirkse W, Van Staveren M, Stiekema W, et al (1998) Analysis of 19 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana. Nature 391: 485488[CrossRef][Medline] Bewley JD, Black M (1994) Seeds: Physiology of Development and Germination. Plenum Press, New York Campostrini N, Areces LB, Rappsilber J, Pietrogrande MC, Dondi F, Patorino F, Ponzoni M, Righetti PG (2005) Spot overlapping in two-dimensional maps: a serious problem ignored for much too long. Proteomics 5: 23852395[CrossRef][Medline] Caspar T, Huber SC, Sommerville C (1985) Alterations in growth photosynthesis and respiration in a starchless mutant of Arabidopsis thaliana (L) deficient in chloroplast phosphoglucomutase activity. Plant Physiol 79: 1117 Chia TY, Pike MJ, Rawsthorne S (2005) Storage oil breakdown during embryo development of Brassica napus (L). J Exp Bot 56: 12851296 Clauser KR, Baker PR, Burlingame AL (1999) Role of accurate mass measurement (+/ 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. Anal Chem 71: 28712882[Medline] Eastmond PJ, Rawsthorne S (2000) Coordinate changes in carbon partitioning and plastidial metabolism during the development of oilseed rape embryo. Plant Physiol 122: 767774 Elborough KM, Winz R, Deka RK, Markham JE, White AJ, Rawsthorne S, Slabas AR (1996) Biotin carboxyl carrier protein and carboxyltransferase subunits of the multi-subunit form of acetyl-CoA carboxylase from Brassica napus: cloning and analysis of expression during oilseed rape embryogenesis. Biochem J 315: 103112[Medline] Emanuelsson O, Nielsen H, Brunak S, von Heijne G (2000) Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol 300: 10051016[CrossRef][ISI][Medline] Eng J, McCormack AL, Yates JR III (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J Am Soc Mass Spectrom 5: 976989[CrossRef][ISI] Fernie AR, Carrari F, Sweetlove LJ (2004) Respiratory metabolism: glycolysis, the TCA cycle and mitochondrial electron transport. Curr Opin Plant Biol 7: 254261[CrossRef][ISI][Medline] Gallardo K, Job C, Groot SP, Puype M, Demol H, Vandekerckhove J, Job D (2001) Proteomic analysis of Arabidopsis seed germination and priming. Plant Physiol 126: 835848 Gallardo K, Job C, Groot SP, Puype M, Demol H, Vandekerckhove J, Job D (2002a) Importance of methionine biosynthesis for Arabidopsis seed germination and seedling growth. Physiol Plant 116: 238247[CrossRef][Medline] Gallardo K, Job C, Groot SP, Puype M, Demol H, Vandekerckhove J, Job D (2002b) Proteomics of Arabidopsis seed germination: a comparative study of wild-type and gibberellin-deficient seeds. Plant Physiol 129: 823837 Gallardo K, Le Signor C, Vandekerckhove J, Thompson RD, Burstin J (2003) Proteomics of Medicago truncatula seed development establishes the time frame of diverse metabolic processes related to reserve accumulation. Plant Physiol 133: 664682 Gibeaut DM (2000) Nucleotide sugars and glucosyltransferases for synthesis of cell wall matrix polysaccharides. Plant Physiol Biochem 38: 6980[CrossRef][ISI] Girke T, Todd J, Ruuska S, White J, Benning C, Ohlrogge J (2000) Microarray analysis of developing Arabidopsis seeds. Plant Physiol 124: 15701581 Goff SA, Ricke D, Lan T-H, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296: 92100 Goffman FD, Alonso AP, Schwender J, Shachar-Hill Y, Ohlrogge JB (2005) Light enables a very high efficiency of carbon storage in developing embryos of rapeseed. Plant Physiol 138: 22692279 Goffman FD, Ruckle M, Ohlrogge J, Shachar-Hill Y (2004) Carbon dioxide concentrations are very high in developing oilseeds. Plant Physiol Biochem 42: 703708[CrossRef][Medline] Gorg A, Weiss W, Dunn MJ (2003) Current two-dimensional electrophoresis technology for proteomics. Proteomics 4: 36653685 Gunstone DF, Harwood JL, Padley FB (1995). The Lipid Handbook. Chapman & Hall, London Hajduch M, Ganapathy A, Stein JW, Thelen JJ (2005) A systematic proteomic study of seed filling in soybean: establishment of high-resolution two-dimensional reference maps, expression profiles, and an interactive proteome database. Plant Physiol 137: 13971419 Harrison CJ, Mould R, Leech MJ, Johnson S, Turner L, Schreck SL, Baird K, Jack P, Rawsthorne S, Hedley CL, et al (2000) The rug3 locus of pea encodes plastidial phosphoglucomutase. Plant Physiol 122: 11871192 Herman EM, Helm RM, Jung R, Kinney AJ (2003) Genetic modification removes an immunodominant allergen from soybean. Plant Physiol 132: 3643 Hill LM, Morley-Smith ER, Rawsthorne S (2003) Metabolism of sugars in the endosperm of developing seeds of oilseed rape. Plant Physiol 131: 228236 Huber LA (2003) Is proteomics heading in the wrong direction? Nat Rev Mol Cell Biol 4: 7480[CrossRef][ISI][Medline] Huber SC (1986) Fructose-26-bisphosphate as a regulatory metabolite in plants. Annu Rev Plant Physiol Plant Mol Biol 37: 233246 |