|
|
||||||||
|
Plant Physiology 141:15-25 (2006) © 2006 American Society of Plant Biologists Development of a Controlled Vocabulary and Software Application to Analyze Fruit Shape Variation in Tomato and Other Plant Species1,[W]Department of Horticulture and Crop Science, The Ohio State University/Ohio Agricultural Research and Development Center, Wooster, Ohio 44691 (M.T.B., E.v.d.K.); Department of Statistics (L.L.) and Department of Computer Science and Engineering (K.F.), The Ohio State University, Columbus, Ohio 43210; and Department of Mathematics and Computer Science, The College of Wooster, Wooster, Ohio 44691 (N.D., S.G.)
The domestication and improvement of fruit-bearing crops resulted in a large diversity of fruit form. To facilitate consistent terminology pertaining to shape, a controlled vocabulary focusing specifically on fruit shape traits was developed. Mathematical equations were established for the attributes so that objective, quantitative measurements of fruit shape could be conducted. The controlled vocabulary and equations were integrated into a newly developed software application, Tomato Analyzer, which conducts semiautomatic phenotypic measurements. To demonstrate the utility of Tomato Analyzer in the detection of shape variation, fruit from two F2 populations of tomato (Solanum spp.) were analyzed. Principal components analysis was used to identify the traits that best described shape variation within as well as between the two populations. The three principal components were analyzed as traits, and several significant quantitative trait loci (QTL) were identified in both populations. The usefulness and flexibility of the software was further demonstrated by analyzing the distal fruit end angle of fruit at various user-defined settings. Results of the QTL analyses indicated that significance levels of detected QTL were greatly improved by selecting the setting that maximized phenotypic variation in a given population. Tomato Analyzer was also applied to conduct phenotypic analyses of fruit from several other species, demonstrating that many of the algorithms developed for tomato could be readily applied to other plants. The controlled vocabulary, algorithms, and software application presented herein will provide plant scientists with novel tools to consistently, accurately, and efficiently describe two-dimensional fruit shapes.
Domestication of plant species was accompanied by profound changes in overall plant and organ morphology (Smith, 1997
Tomato fruit is classified according to 10 shape categories such as rounded, high rounded, ellipsoid, or pyriform (International Plant Genetic Resources Institute, 1996 To date, most phenotypic analyses consist of time-consuming manual measurements and subjective scoring of traits that limit the detection of underlying genes. Software-aided measurements of attributes such as height, width, area, and perimeter are most commonly conducted with ImageJ, a public domain program developed at the U.S. National Institutes of Health (program created by W.S. Rasband; http://rsb.info.nih.gov/ij/). ImageJ is a versatile program that allows the user to make minor adjustments to the image. However, objective measurements such as height and width are neither automated nor exported efficiently, and extensive and detailed phenotypic analyses that describe subtle differences in shape such as degree of circular shape and the slope along the boundary of the fruit require development of novel algorithms for the attributes. In addition, many of the descriptors necessary to characterize shape cannot be scored objectively without proper software tools. For example, the degree of distal end indentation would be difficult to rate on a scale, and the scoring of this trait would be inconsistent between years, plots, and persons. Moreover, precise phenotypic measurements are necessary to sufficiently characterize loci and the underlying genes that contribute to shape variation. Thus, an accurate and objective method for conducting phenotypic analyses combined with a concise and detailed set of descriptors and terms for fruit shape attributes is necessary.
In previous work, algorithms for some of the shape attributes were defined and shown to have a genetic basis (Van der Knaap and Tanksley, 2003
Trait Ontology Terms and Mathematical Descriptors
An accurate description of fruit shape requires the development of a common vocabulary that encompasses a range of plant species. The structure of the terms needs to follow the True Path Rule that states that the pathway from a child term all the way up to the parent term must be accurate (Bruskiewich et al., 2002
The development of a definition of each term and an associated mathematical descriptor would permit objective measurements of fruit shape attributes. Several terms and equations were adopted from previous published work on tomato fruit shape. These included fruit shape index, blockiness, and fruit shape triangle (Van der Knaap and Tanksley, 2003
Fruit Shape Index
Fruit Shape Triangle
Fruit Shape Eccentric
Horizontal asymmetry and vertical asymmetry describe how asymmetric a fruit is when divided along a horizontal or vertical axis, respectively. The horizontal or vertical axes that divide the fruit are termed n and m, respectively (Fig. 2, D and E). The position of the horizontal axis, n, is determined by finding the topmost and bottommost points of the fruit and dividing by two to find the center. Likewise, the position of the vertical axis, m, is determined by finding the leftmost and rightmost points of the fruit and dividing in half to find the center. To compute horizontal asymmetry, each column of pixels, termed Li, is determined, and the midpoint of the column, ni, is found (Fig. 2D). Next, the difference between ni and n is calculated and recorded. Once every column is examined, the sum of the differences is determined and divided by the number columns. Thus, the general formula for horizontal asymmetry is (
Distal Fruit End Shape
Proximal Fruit End Shape
Fruit Shapes Circular, Ellipsoid, Heart, and Rectangular
Fruit Size
The Tomato Analyzer application requires digital images of cut fruit saved in jpeg format. When loaded into the application, the entire image appears in the left viewing window (Fig. 3 ). The software is designed to recognize objects of a certain size and image resolution, measured in dots (pixels) per inch (dpi). Generally, the smaller the object, the higher the resolution required to provide accurate analysis. The implementation of the equations with the software relies on obtaining the x and y coordinates of a pixel in a jpeg image of the fruit objects. The software automatically determines the boundaries of fruit in a scanned image. The object boundary is extracted through contour tracing, which results in a list of adjacent points describing the border of an object in an image. All phenotypic measurements are calculated based on the boundaries.
Prior to phenotypic analysis, if fruit are positioned at an angle or if an attached object distorts the boundary, the position of individual fruit can be manually adjusted using the software. Occasionally, the distal and proximal ends of the fruit are not correctly identified, resulting in aberrant angle and indentation values. Therefore, Tomato Analyzer also contains a function to manually adjust the distal and proximal end points of the fruit. In addition, objects can be deselected or selected. The units used for the attribute values can be selected as pixels, centimeters, millimeters, or inches. As most of the measurements are ratios, selecting the appropriate dpi setting consistent with the resolution of original jpeg image is only required for the size measurements including perimeter, area, height, and width. The measurements saved setting allows the user to select which attribute values to compute for display and export. Individual attributes or an entire measurement cluster can be selected or deselected. Shape attributes are divided into several clusters in Tomato Analyzer application and largely follow the grouping listed in Figure 1. Basic Measurements group comprises all the fruit size traits; Fruit Shape Index comprises fruit shape index I and II; Homogeneity comprises circular, ellipsoid, and rectangular; Distal Fruit End Shape comprises macro and micro angle, indentation, and protrusion; Proximal Fruit End Shape comprises angle, shoulder height, and indentation area; Eccentricity comprises all eccentricity attributes in addition to fruit shape heart; and Blockiness comprises distal and proximal fruit end shape blockiness and fruit shape triangle. By selecting the corresponding group tab in the lower right corner of Tomato Analyzer, results for the selected group are displayed (Fig. 3). Some attributes allow the user to select settings that maximize phenotypic diversity. User-defined settings are offered for upper and lower blockiness positions. The upper position is used for calculating proximal end blockiness. The lower position is used to calculate distal end blockiness. In addition, these new values will affect fruit shape triangle. The values for the blockiness positions equal the percentage of the height from the top of the fruit. These values can be changed to any number as long as they are both between 0 and 1 and lower position is greater than upper position. There are two settings to calculate distal end angles, referred to as macro and micro. The setting for macro level determines the percentage of the perimeter from the bottom where the angle will be measured, ranging from 5% to 50%. The micro level setting determines where the proximal angle is measured, ranging from 2% to 10% from the tip of the fruit. The save function allows the user to save the manual adjustments and analyzed fruit shape attributes. Subsequently, when a user opens the original image file, the saved file will be opened and will display all of the adjustments. There are two methods to export data obtained by Tomato Analyzer. In the first, one image is analyzed and the data for the shape attributes of individual fruit is exported to a .csv file suitable for loading into a spreadsheet or statistical analysis package. The second method is called batch mode and allows more than one image to be loaded and analyzed. In this scenario, the data for each attribute is exported to a .csv file as an average of all fruit in a single image. This method of export is most useful when conducting QTL analyses.
To validate the accuracy and demonstrate the utility of the Tomato Analyzer software, phenotypic and genetic analyses were conducted on two F2 populations derived from crosses between one of the extreme-shaped S. lycopersicum cultivars, Howard German or Banana Legs, and a small, round, wild relative, Solanum pimpinellifolium LA1589 (Fig. 4, AC ). Phenotypic data were collected for fruit from both populations using Tomato Analyzer. Principal components analysis (PCA) was conducted to determine the major sources of variation among the morphological traits within and between these populations (Fig. 4D). The variation within each of the two F2 populations could be explained by the first three principal components (PC), which combined represented 77.4% of the total variability (Fig. 4D). In addition, PC 3 demonstrated significant differences between the Howard German and Banana Legs F2 populations, whereas PC 1 and PC 2 were not significantly different (Fig. 4D; Supplemental Table I). PC 1, representing 35.4% of the variation, was predominantly affected by attributes that evaluate the tapered shape of fruit, including the traits fruit shape triangle, proximal end blockiness, and horizontal asymmetry ovoid (loading values for contributing traits are listed in Supplemental Table II). PC 2, representing 24.9% of the variation, was primarily controlled by traits that contribute to fruit elongation such as fruit shape index, proximal end shape features, and distal end angle, as well as the fruit shape uniformity and homogeneity features circular, rectangular, and heart shape (Supplemental Table II). PC 3, representing 17.1% of the variation, was influenced by fruit size, homogeneity, and eccentricity characteristics, as well as distal end blockiness and indentation area (Supplemental Table II).
To identify regions of the genome responsible for the observed fruit morphology variation, QTL analysis was performed using the first three PCs as traits in the Banana Legs and Howard German F2 populations. A genetic map was constructed with molecular markers using MAPMAKER (Lander et al., 1987
The following analysis was conducted with the attribute distal fruit end shape angle to demonstrate the flexibility of Tomato Analyzer. The user can select the location of the slope for the distal end angle measurement. Mapping of distal end angle at various distances from the tip of the fruit, from 2% to 20%, showed that the most significant QTL is associated with the angle measurement at 20% above the tip and that this QTL decreases in significance with angle measurements taken at positions closer to the tip of the fruit (Fig. 5 ). A similar trend was noted in the Howard German population at the same chromosomal location (data not shown). The results of the distal end angle measurements demonstrate that user-defined settings of Tomato Analyzer application permit phenotypic analyses that are optimized for each population or tailored to specific research questions. In addition, adjustment of settings is efficiently applied to all fruit in a population, while such a change would be labor intensive if manual measurements were taken.
To demonstrate a wider utility of Tomato Analyzer in shape analysis, we evaluated its performance on fruit of different species (Fig. 6 ). The software could accurately determine the boundaries of fruit from as small as grape (Vitis vinifera) to as large as butternut squash (Cucurbita moschata) and of fruit of different colors. The output values provided by Tomato Analyzer were consistent with visual observations and manual measurements (Table II ). For example, the software accurately measured distal end angle values from extremely pointed fruit, like the peppers (Capsicum annuum), to very rounded fruit, like the grape and Bartlett pear (Pyrus communis). In addition, the butternut and yellow squash (Cucurbita pepo) and pear had obovoid values, indicating that the largest width of the fruit was well below the midpoint in those fruit. Lastly, triangle shape at the 10% setting indicated that the peppers were the most triangular in shape, while butternut squash and Bartlett pear were the least triangular in shape. Thus, the Tomato Analyzer application is not limited to tomato fruit but can be applied to fruit morphology analyses of other plants.
The development of structured, controlled vocabularies arranged in ontologies would provide great benefit to plant scientists (Bruskiewich et al., 2002
Software programs and computational methods have been developed to categorize and classify plant organ shapes such as fruit (Morimoto et al., 2000 In all, Tomato Analyzer is a flexible and comprehensive application that provides intuitive descriptors and output that facilitate the analysis of fruit morphology. Furthermore, our efforts to combine controlled vocabulary with mathematical descriptors into one software application make this a very useful tool for several applications. Tomato Analyzer allows for accurate and objective measurements of fruit shape attributes in a high-throughput manner and of traits that are nearly impossible to quantify manually. The application is specifically developed to analyze fruit shape QTL in tomato but could readily be applied to fruit of other species and other plant organs such as seed and leaves.
Software Implementation The software application has been implemented in C++ using Visual Studio 6.0 and runs on the Windows operating system. The image processing library Computer Vision and Image Processing (CVIP) 3.7c is used for image I/O. Modifications to the software were done using Visual Studio 2003 with source code control provided by SourceSafe. The source code cross indexer LXR was used to create an online indexable and searchable version of the software code. The program is free for academic purposes and can be downloaded from our laboratory Web site: http://www.oardc.ohio-state.edu/vanderknaap/.
Two F2 populations were constructed from crosses between one of two Solanum lycopersicum cultivars (Banana Legs or Howard German) and a wild species, Solanum pimpinellifolium accession LA1589. The Banana Legs population consisted of 99 plants, whereas the Howard German population contained 130 plants. Both populations were grown simultaneously in the same greenhouse in the summer of 2003. Up to eight fruits were harvested from each plant. Fruit were weighed, cut longitudinally, and scanned at 300 dpi resolution. The images were saved as jpeg image files for phenotypic analyses. Each image contained fruit from only one plant. If eight fruit per plant were harvested at once, these fruits were scanned and saved as one image. If fewer than eight fruit per plant were harvested at once and additional fruits were harvested later, both images were combined using Adobe Photoshop 7.0 (Adobe Systems) prior to analysis with Tomato Analyzer.
Tomato Analyzer was used to conduct the phenotypic analyses. Manual adjustments, including modification of fruit boundaries and rotation of individual fruits, were made to the images, if necessary. If boundaries were distorted, for example by an attached seed, they would be modified using the software. If fruit were scanned at an angle, manual rotation was required to properly align them. Adjusted images and associated results were saved as a separate file with the extension .tmt. Batch analyses allowed analysis and exportation of selected shape attribute data for at least 100 images at once. The values for each attribute were averaged for all fruit per image and exported into .csv file.
Total genomic DNA was isolated from young leaves as described by Bernatzky and Tanksley (1986)
PCA and analysis of variance were conducted with SAS V8 (SAS Institute). QTL analysis was performed by composite interval mapping (Zeng, 1993
Drs. R. Pratt and D.M. Francis are acknowledged for their critical reading of the manuscript, and Dr. P. Jaiswal for insightful comments and suggestions on the trait ontology terms. Also, we acknowledge J. Moyseenko for excellent plant care and assistance in genotyping, and B. Stricker and R. Drushal for their contributions on Tomato Analyzer. Received January 25, 2006; returned for revision March 14, 2006; accepted March 15, 2006.
1 This work was supported by the National Science Foundation (grant no. DBI 0227541). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Esther van der Knaap (vanderknaap.1{at}osu.edu).
[W] The online version of this article contains Web-only data. www.plantphysiol.org/cgi/doi/10.1104/pp.106.077867. * Corresponding author; e-mail vanderknaap.1{at}osu.edu; fax 3302633887.
Alba R, Payton P, Fei Z, McQuinn R, Debbie P, Martin GB, Tanksley SD, Giovannoni JJ (2005) Transcriptome and selected metabolite analyses reveal multiple points of ethylene control during tomato fruit development. Plant Cell 17: 29542965 Bernatzky R, Tanksley SD (1986) Toward a saturated linkage map in tomato based on isozymes and random cDNA sequences. Genetics 112: 887898 Beyer M, Hahn R, Peschel S, Harz M, Knoche M (2002) Analyzing fruit shape in cherry (Prunus avium L.). Sci Hortic (Amsterdam) 96: 139150[CrossRef] Bruskiewich R, Coe EH, Jaiswal P, McCouch S, Polacco M, Stein L, Vincent L, Ware D (2002) The Plant Ontology Consortium and plant ontologies. Comp Funct Genomics 3: 137142[CrossRef] Doebley J (2004) The genetics of maize evolution. Annu Rev Genet 38: 3759[CrossRef][ISI][Medline] Frary A, Doganlar S (2003) Comparative genetics of crop plant domestication and evolution. Turk J Agric For 27: 5969 Fulton TM, Chunwongse J, Tanksley SD (1995) Microprep protocol for extraction of DNA from tomato and other herbaceous plants. Plant Mol Biol Rep 13: 207209[ISI] Grandillo S, Ku HM, Tanksley SD (1999) Identifying the loci responsible for natural variation in fruit size and shape in tomato. Theor Appl Genet 99: 978987[CrossRef][ISI] International Plant Genetic Resources Institute (1996) Descriptors for Tomato (Lycopersicon spp.). International Plant Genetic Resources Institute, Rome International Union for the Protection of New Varieties and Plants (2001) Guidelines for the Conduct of Tests for Distinctness, Homogeneity and Stability (Tomato). International Union for the Protection of New Varieties and Plants, Geneva Kosambi DD (1944) The estimation of map distances from recombination values. Ann Eugen 12: 172175 Lander ES, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L (1987) MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics 1: 174181[CrossRef][Medline] Langlade NB, Feng X, Dransfield T, Copsey L, Hanna A, Thebaud C, Bangham A, Hudson A, Coen E (2005) Evolution through genetically controlled allometry space. Proc Natl Acad Sci USA 102: 1022110226 Lemaire-Chamley M, Petit J, Garcia V, Just D, Baldet P, Germain V, Fagard M, Mouassite M, Cheniclet C, Rothan C (2005) Changes in transcriptional profiles are associated with early fruit tissue specialization in tomato. Plant Physiol 139: 750769 Lippman Z, Tanksley SD (2001) Dissecting the genetic pathway to extreme fruit size in tomato using a cross between the small-fruited wild species Lycopersicon pimpinellifolium and L. esculentum var. Giant Heirloom. Genetics 158: 413422 Morimoto T, Takeuchi T, Miyata H, Hashimoto Y (2000) Pattern recognition of fruit shape based on the concept of chaos and neural networks. Comp Electron Agric 26: 171186 Mueller LA, Solow TH, Taylor N, Skwarecki B, Buels R, Binns J, Lin C, Wright MH, Ahrens R, Wang Y, et al (2005) The SOL genomics network: a comparative resource for Solanaceae biology and beyond. Plant Physiol 138: 13101317 Paris HS, Yonash N, Portnoy V, Mozes-Daube N, Tzuri G, Katzir N (2003) Assessment of genetic relationships in Cucurbita pepo (Cucurbitaceae) using DNA markers. Theor Appl Genet 106: 971978[ISI][Medline] Sako Y, McDonald MB, Fujimura K, Evans AF, Bennett MA (2001a) A system for automated seed vigour assessment. Seed Sci Technol 29: 625636 Sako Y, Regnier EE, Daoust T, Fujimura K, Harrison SK, McDonald MB (2001b) Computer image analysis and classification of giant ragweed seeds. Weed Sci 49: 738745 Smith BD (1997) The initial domestication of Cucurbita pepo in the Americas 10,000 years ago. Science 276: 932934[CrossRef][ISI] Van der Knaap E, Sanyal A, Jackson S, Tanksley SD (2004) High-resolution fine mapping and fluorescence in situ hybridization analysis of sun, a locus controlling tomato fruit shape, reveals a region of the tomato genome prone to DNA rearrangements. Genetics 168: 21272140 Van der Knaap E, Tanksley SD (2001) Identification and characterization of a novel locus controlling early fruit development in tomato. Theor Appl Genet 103: 353358[CrossRef][ISI] Van der Knaap E, Tanksley SD (2003) The making of a bell pepper-shaped tomato fruit: identification of loci controlling fruit morphology in Yellow Stuffer tomato. Theor Appl Genet 107: 139147[ISI][Medline] Vincent PLD, Coe EH, Polacco ML (2003) Zea mays ontology: a database of international terms. Trends Plant Sci 8: 517520[CrossRef][ISI][Medline] Yamazaki Y, Jaiswal P (2005) Biological ontologies in rice databases: an introduction to the activities in Gramene and Oryzabase. Plant Cell Physiol 46: 6368 Zeng ZB (1993) Theoretical basis for separation of multiple linked gene effects in mapping quantitative trait loci. Proc Natl Acad Sci USA 90: 1097210976 Zeng ZB (1994) Precision mapping of quantitative trait loci. Genetics 136: 14571468[Abstract] This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY | ||
|---|---|---|---|