|
|
||||||||
|
First published online June 15, 2007; 10.1104/pp.107.100891 Plant Physiology 144:1813-1826 (2007) © 2007 American Society of Plant Biologists OPEN ACCESS ARTICLE
Between-Species Analysis of Short-Repeat Modules in Cell Wall and Sex-Related Hydroxyproline-Rich Glycoproteins of Chlamydomonas1,[W],[OA]Department of Biology, Washington University, St. Louis, Missouri 63130 (J.-H.L, L.S., U.G.); and Institute für Biochemie, University of Köln, Cologne, Germany 4750674 (S.W.)
Protein diversification is commonly driven by single amino acid changes at random positions followed by selection, but, in some cases, the structure of the gene itself favors the occurrence of particular kinds of mutations. Genes encoding hydroxyproline-rich glycoproteins (HRGPs) in green organisms, key protein constituents of the cell wall, carry short-repeat modules that are posited to specify proline hydroxylation and/or glycosylation events. We show here, in a comparison of two closely related Chlamydomonas species—Chlamydomonas reinhardtii (CC-621) and Chlamydomonas incerta (CC-1870/3871)—that these modules are prone to misalignment and hence to both insertion/deletion and endoduplication events, and that the dynamics of the rearrangements are constrained by purifying selection on the repeat patterns themselves, considered either as helical or as longitudinal face modules. We suggest that such dynamics may contribute to evolutionary diversification in cell wall architecture and physiology. Two of the HRGP genes analyzed (SAG1 and SAD1) encode the mating-type plus and minus sexual agglutinins, displayed only by gametes, and we document that these have undergone far more extensive divergence than two HRGP genes (GP1 and VSP3) that encode cell wall components—an example of the rapid evolution that characterizes sex-related proteins in numerous lineages. Strikingly, the central regions of the agglutinins of both mating types have diverged completely, by selective endoduplication of repeated motifs, since the two species last shared a common ancestor, suggesting that these events may have participated in the speciation process.
Hyp-rich glycoproteins (HRGPs) represent a family of proteins that self-assemble to form vital scaffolding in the cell walls of plants, where the relationship of this scaffolding to the abundant structural polysaccharides—cellulose, hemicellulose, and pectin—found in most types of cell wall is not well understood (Cassab, 1998
In the Chlamydomonas lineage, HRGPs not only self-assemble as cell walls, but also participate in sexual recognition between mating-type plus and minus gametes. Enormous HRGPs (>1,000 kD; approximately 230–240 nm in length; Adair et al., 1983
All known HRGPs in Chlamydomonas are chimeric (Kieliszewski and Lamport, 1994
The PII helix is a secondary structure, adopted by peptide domains with substantial (hydroxy) Pro content, in which 3.34 amino acids generate 1 nm of shaft length (van Holst and Varner, 1984
A recurrent theme within the HRGPs (as well as the collagens) is significant posttranslational modification. Encoded Pro residues are commonly hydroxylated in the endoplasmic reticulum (Harwood et al., 1974
It has been proposed (Kieliszewski and Lamport, 1994
The iterative nature of proteins with short repeats also renders the genes vulnerable to rearrangements: They are prone to undergo slipped-strand misalignment during replication and recombination, generating insertions and deletions (indels; Smith, 1976
To study the evolution of HRGPs, we compared the shaft sequences of the sex-related Sag1 and Sad1 agglutinins and the sex-unrelated cell wall proteins Vsp3 and Gp1 in two species, C. reinhardtii (CC-621) and Chlamydomonas incerta (CC-1870/3871), estimated to have last shared a common ancestor <10 million years ago (Coleman and Mai, 1997
Longitudinal Face versus Helical Modules
The iterative nature of the shaft sequences and the large size of the agglutinin shafts prompted us to devise a means to visualize shafts at the protein sequence level in the context of the PII helix. Such diagrams make it easier to compare a given shaft from two different species and they serve to emphasize the three longitudinal faces of a shaft sequence previously noted in Ferris et al. (2005)
It is generally assumed (Kieliszewski and Lamport, 1994 To illustrate the three-dimensional topology of the longitudinal faces, the N-terminal region of the Sag1 shaft from C. reinhardtii (Fig. 1C, top) is modeled in Figure 1, E to G. Particular residues display a polarized distribution: One face contains a contiguous Pro stretch, the second contains a mix of Ser and Pro, and the third exposes two positively charged Args. Evidence that selection may in some cases act to preserve such longitudinal face modules is presented in a later section.
Figure 1, A to D, depicts the shafts of Gp1, Vsp3, Sag1, and Sad1, the eight HRGP proteins under study, for each of the two Chlamydomonas species. Pros and Sers, candidates for hydroxylation and/or glycosylation, are shown in blue and red, illustrating the probable space-filling pattern of sugars along the main axis. Charged amino acids (light blue) may mediate ionic interactions within or between shafts; notably, the first subdomains of Gp1 are largely devoid of charged amino acids (Fig. 1A), whereas the third subdomains of Sag1 and Sad1 display numerous charges along the axis (Fig. 1, C and D). Yellow represents guest amino acids—two or more contiguous non-Pro residues in a Pro-rich domain that have the potential to destabilize the locally driven PII configuration and promote bending (Creamer, 1998 Whether such faces prove to be recognized by posttranslational modification enzymes or interacting molecules awaits experimental analysis. Meanwhile, the diagrams of Figure 1 serve to summarize and emphasize the overall organization of the proteins under study. In particular, they illustrate the conservation of the subdomain organization of each shaft, suggesting that this organization is functionally relevant to both wall assembly and sexual recognition.
Between-species alignments of short-repeat sequences are reliable only if carried out at the nucleotide level and it proved necessary to develop novel strategies to analyze the HRGP repeats wherein the most parsimonious alignment is identified by the highest Ser codon matches (see "Materials and Methods"). The resultant alignments reveal complex histories for each of the four pairs of C. reinhardtii/C. incerta shaft domains.
Gp1 In contrast to the conservation of subdomain structure, alignment of the Gp1 shaft-encoding sequences from C. reinhardtii and C. incerta (supported by 67/75 Ser codon matches) reveals numerous codon substitutions, indels, and two extensive endoduplication events (Fig. 2 ), as detailed below.
Of the 16 indels in the Gp1 alignment, 14 add 54 residues to the C. incerta shaft and two add four residues to the C. reinhardtii shaft. As a consequence, the C. incerta shaft is predicted to be longer (386 residues or 115 nm) than the C. reinhardtii shaft (336 residues or 100 nm; Fig. 1A). In C. reinhardtii, 9/15 (60%) of the PPX units have been created by indels that truncate PPSPX units; in C. incerta, 4/10 (40%) of the PPX units have been created by such indels. The three-residue PPX unit has the effect of bringing the longitudinal faces back into frame after one iteration because the addition of a PPX unit between PPSPX units creates PPPPXS longitudinal modules on two faces from otherwise PPPXS modules as noted earlier. Therefore, the PPSPX/PPX helical repeat structure, and the (Pro)3 to 4 XS longitudinal faces, are largely maintained despite numerous indels/substitutions. Yet another measure of repeat-sequence conservation pertains to conservation of Pro and Ser positions (indel events are not included in the following calculations): Of the 209 Pro residues in the C. incerta Gp1 sequence, 123 (59%) are specified by the same codon in the C. reinhardtii sequence and 83 (49%) by a synonymous Pro codon; only three (1%) are changed to a different amino acid. Of the 70 Ser residues in the C. incerta sequence, 51 (73%) are specified by the same codon in C. reinhardtii, 14 (20%) by a synonymous Ser codon, and 5 (7%) are changed. By contrast, of the 59 X positions, only 20 are identical, 16 are synonymous, and 23 (40%) are changed to a different amino acid. Despite this large X variation, there is a bias in the amino acids found in the X positions: Of the 112 X amino acids in the two shafts (excluding kink and neck and including indels), 66 are Ala, 15 Pro, 13 Val, nine Ser, three Glu, two Thr, and one each of Gly, Ile, Lys, and Leu. That is, the X positions are not drifting freely: They are restricted in composition. Taken together, what seems to be of primary importance to the Gp1 shaft sequence is the maintenance of particular (hydroxy) Pro residues relative to Ser and a subset of spacer (X) residues.
Vsp3 The longitudinal face patterns and the two distinctive subdomains of the Vsp3 shafts are conserved between C. reinhardtii and C. incerta (Fig. 1B) despite the rearrangements illustrated in the Figure 3A alignments (supported by 56/58 Ser codon matches). The subdomains containing the core (PS)x repeats are organized into nine units (Fig. 3B). Each unit shows size variations in its PS content, the shortest containing two PS repeats and the longest containing 33, with indels creating tracts of different lengths. Each (PS)x unit terminates in KX (where X is usually Ala; Fig. 3B). The KX motifs introduce charged amino acids and, as guest sequences, represent putative loci for PII helix interruptions (Fig. 1B, yellow). Given the variable length of (PS)x in each unit, the displays of KX on the two Vsp3 shafts are quite distinctive.
There are two exceptional cases: (1) an endoduplication of 20 codons (highlight) is found in the C. incerta sequence; and (2) the sequence CCUUCU (which encodes PS) is endoduplicated eight times in the C. reinhardtii sequence (highlight). Underlined and italicized are sequences flanking this second event that have the interesting feature of going out of and then back into frame, as detailed in Figure 3C. Despite such events, the shafts of the two species preserve the basic (PS)x repeat structure and remain similar in predicted length (62 versus 60 nm).
Agglutinin Shafts: Common Features
Here we report the sequences of the Sag1 (predicted length 323 nm) and Sad1 (289 nm) shafts of the sister species C. incerta. The proteins retain the same 2A to 2E topology as their C. reinhardtii counterparts (compare Fig. 4
with figure 7 in Ferris et al., 2005
The four proteins share two additional common features. (1) Conservation of the PPSP tetrameric unit is observed throughout the 2B to 2D subdomains. Of the 551 PPSPX motifs in the four shafts, 90% preserve the PPSP sequence. In the 53 single-change variants, 3% are changed or deleted at the first Pro, 33% at the second Pro, 53% at the Ser, and 11% at the third Pro; of the 27 double-change variants, only one is changed or deleted at the first Pro, 89% at the second Pro, 93% at the Ser, and 15% at the third Pro. These gradients may indicate the relative importance of each position to proper hydroxylation/glycosylation/intermolecular recognition (Ferris et al., 2005
Agglutinin Shafts: Homology I and II Regions The aligned homology I and II regions offer the opportunity to identify features of the agglutinin shaft sequences that have been conserved not only since the recent C. reinhardtii/C. incerta divergence, but also since the ancient divergence of the SAG1 and SAD1 genes themselves. These features are likely to be important for posttranslational modification and/or for function. The analyses below focus on the 2A and 2E subdomains.
The 2A subdomains (Supplemental Fig. S1, A and C) are dominated by PPX motifs (Fig. 5A
), where the X residue is often Arg or Lys. However, there is little conservation of X positions, even within plus or minus agglutinins, suggesting that selection is acting primarily on the retention of spaced PP dyads. Given that a PII sequence can be read as helical modules or as [n, n + 3] longitudinal face modules (Fig. 1), the PPX modules may be important for their helically displayed information and/or for generating the long (
Each 2A subdomain also carries a block-interruption sequence—a string of non-Pro or guest amino acids (highlighted in Fig. 5). The block interruptions are located in comparable positions in all four proteins and are of comparable length (13–15 amino acids). At the amino acid sequence level, the Sag1 and Sad1 block interruptions are completely different from one another. Within orthologous pairs, only five of the positions are conserved between the two Sag1 sequences, including two Cys positions, whereas 11 positions are conserved between the two Sad1 sequences. The 2A block interruption is posited to generate a bend in the distal end of the shaft (Ferris et al., 2005
The boundaries between the 2E subdomains and their adjacent 2D subdomains are fuzzy and variable (Fig. 5B, first rows), after which the 2E subdomains resemble 2A in displaying numerous PPX modules and guest sequences (Fig. 5B) and in generating long (
The four 2A subdomains and the four 2E subdomains of the agglutinin shafts share two common properties: (1) their sequences generate long ( Perfectly repeating helical PPX modules would by definition generate two longitudinal P faces. However, as illustrated in Figure 5, A and B, the PPX modules in the 2A and 2E subdomains are by no means perfectly repeated. Instead, the Pro residues that participate in generating longitudinal P faces (boldfaced) are recruited from diverse helical module contexts. Moreover, they persist despite the occurrence of numerous indels and substitutions within mating type, suggesting that the positions participating in longitudinal module generation may be under selection independently of any selection that may be operant to maintain helical modules. The obvious objection to this proposal is that long P faces might simply be the random outcome of sequences having a high Pro content. To address this possibility, random sequences were generated with the same Pro content and length as the eight 2A and 2E subdomains. As detailed in Figure 5C and its legend, these proved to be significantly less likely to generate long P faces than seven of the actual sequences, the one exception being 2E Sad1 of C. incerta, which falls within the 95% expected line.
The homology I and II regions flank the middle portions of the shafts that we are unable to align, embedded in which are the repetitive 2C subdomains that are unique to each of the four proteins. Figure 6A
presents a detailed analysis of each set of 2C internal repeats. Within each set, some repeats are found to be identical at the codon level, whereas others are increasingly divergent, allowing the construction of evolutionary trees for each subdomain. Evolutionary distances were estimated using the Tajima-Nei model (Tajima and Nei, 1984
These trees are plotted in Figure 6B, along with calculated between-species distances for the homology I and II regions and for the shafts of Gp1 and Vsp3. The between-species distances are greater for the two agglutinins than for the cell wall proteins, conforming to the well-documented tendency for sex-related genes to be more rapidly evolving than non-sex-related genes, as noted earlier. The within-species distances indicate that both sets of C. reinhardtii endoduplications initiated at a similar time, and that both sets of C. incerta endoduplications initiated at a similar, but more recent, time. These patterns are consistent with the possibility that the 2C endoduplications initiated at about the time that C. reinhardtii and C. incerta became genetically isolated. A notable feature of the 2C subdomains is that the repeats generated in C. reinhardtii Sad1 (minus) and in C. incerta Sag1 (plus) both entail reiterations of the PPSPE PPSPA PPSPP motif, with the units being more degenerate in the older C. reinhardtii Sad1 sequence. Because we do not as yet know what role, if any, the 2C sequences play in sexual agglutination, we do not know whether this sequence convergence is biologically meaningful (e.g. whether it is necessary to adhesion that one shaft, from whichever mating type, carry a PPSPE PPSPA PPSPP repeat with its high density of negative charge) or whether it has occurred fortuitously.
Comparing the Evolution of Cell Wall and Sexual Agglutinin HRGP Shafts
Several mate recognition genes have been found to be endowed with repetitive modules (Gao and Garbers, 1998
The HRGP shafts are also repetitive, and we document that they have been subject to major misalignment events since C. reinhardtii and C. incerta last shared a common ancestor <10 million years ago. Nevertheless, and unlike VERL, the repeated motifs that characterize particular shaft subdomains are strongly conserved in the surviving genes. We interpret such constraints to support the concept that the short (two- to six-residue) repeat units in these proteins generate glycomodules (Shpak et al., 1999
We have compared sex-related (agglutinin) and sex-unrelated (cell wall) shafts and find that, in several respects, they display similar evolutionary profiles: (1) Indels and nucleotide changes are tolerated only insofar as the overall pattern of repetitive motifs is not disrupted; (2) high rates of identical codons or synonymous substitutions are observed in Pro and Ser positions, suggesting that the placement of these amino acids is critical; and (3) only certain amino acids occupy the X positions of PPX and PPSPX units, presumably because these most comfortably accommodate the formation of the PII helix and the hydroxylation/glycosylation process (see also Ferris et al., 2005 A striking difference between the evolution of cell wall shafts and agglutinin shafts is that, whereas the cell wall sequences can be aligned without difficulty despite numerous codon changes, it is not possible to align approximately 50% of either of the plus-to-plus or the minus-to-minus agglutinin sequences. Not only do the agglutinins display unique central 2C endoduplications (see below), the sequences flanking one or both ends of these endoduplicated subdomains are also unique to each shaft even as they retain the signature motifs of their subdomains, suggesting that the events that generated the central 2C diversification have extended into, or originated from, the flanking regions. By contrast, the sequences at the proximal and distal ends of the shafts, while highly divergent, retain alignability. Each 2C subdomain reiterates a particular sequence of repeat motifs, reminiscent of the 28 longer reiterations in VERL. Unlike the VERL reiterations, however, where a given module from one species may differ at a few amino acid positions from a second species, the plus 2C sequences from the two Chlamydomonas species are entirely different from one another (four reiterations of PPSPAPPA/LPPSPEPPSPAPPSPEPPSPAPPSPAPPSPAPPSPA versus 22 reiterations of PPSPEPPSPAPPSPP) and the minus 2C sequences are entirely different from one another (seven reiterations of PPSPAPPSPEPPSPTPPSPQPPSPAPALPTPPSPVPPSPAPPSPEPPSPF versus 24 reiterations of PPSPEPPSPAPPSPP).
Analysis of genetic distance among the repeats of the 2C subdomains, detailed in Figure 6, indicates that the 2C repeats diverged as endoduplication events, but their mode of origination is not easily explained. Not only is it the case that each 2C subdomain is a unique sequence, but also it is the case that its forerunner is not evident in the sequence of the other species. For example, if one posits that the ancestral Sad1 gene common to C. reinhardtii and C. incerta had a 2C sequence similar to the modern C. reinhardtii sequence, then the data indicate that, during C. incerta evolution, the 2C sequence was first eliminated entirely and then replaced by a second sequence that went on to undergo endoreduplication. The origin (as opposed to the propagation and diversification) of protein repeat motifs is itself obscure (Andrade et al., 2001
The fact that the repeated modules of HRGP shafts generate information that guides posttranslational modification is widely accepted and supported by experimental studies (Shpak et al., 1999
Comparisons between closely related species can help guide this question because one can ask whether particular longitudinal module patterns are conserved between species even as helical modules diversify. Our analysis of the eight 2A and 2E subdomains of the agglutinin shafts indicates that maintenance of Such proposals, it should be emphasized, in no way rule out roles for helical modules in HRGP biology. Indeed, we find most attractive the hypothesis that both modes of information will prove to be operant, either singly or collectively, in particular instances of HRGP hydroxylation/glycosylation, self-assembly, and interaction with other proteins.
Misalignment of repetitive HRGPs has generated shafts of varying lengths. Thus, the plus agglutinin shafts of C. reinhardtii and C. incerta are predicted to be 275 versus 323 nm, their minus shafts 258 versus 289 nm, and their Gp1 shafts 100 versus 115 nm. By contrast, their Vsp3 shafts are the same lengths (62 versus 60 nm) despite the occurrence of 12 indels adding/subtracting 42 amino acids to the approximately 205 amino acid sequences, suggesting that there may, in this case, be selection for length maintenance. The Vsp3 globular domains are far more strongly conserved in sequence than the heads of Gp1 and the agglutinins (J.-H. Lee, S. Waffenschmidt, and U.W. Goodenough, unpublished data) and, as noted below, share sequence homology with a head domain from Volvox carteri, suggesting a more stringent system for Vsp3 overall. In cell wall assemblies, shaft length variation would be expected to produce matrices with varying porosity and fiber density, and these would presumably be selectable traits.
The Vsp3 protein has had an interesting evolutionary history (Woessner et al., 1994
Evidence for domain swapping has been reported as well for other HRGPs. (1) The VMP family of cell wall metalloproteinases in V. carteri (Hallmann et al., 2001
Domain swapping is, of course, an important evolutionary dynamic, in general, but long Pro-rich repeats may well facilitate this process by enabling intra- and interchromosomal exchange. If chimeric HRGPs prove to be prone to such events, this would allow the generation of novel cell wall ideas that would promote matrix diversification (Baumberger et al., 2003
This study represents, to our knowledge, the first comparison of HRGP shaft sequences between closely related species. We have shown that the divergence between C. reinhardtii and C. incerta can be explained by the occurrence of misalignments and, in the case of agglutinins, by a position-specific repeat generation mechanism that can replace its antecedent. These events occur in the context of purifying selection for particular modules, some of which may be recognized by their occurrence on the longitudinal faces of PII helices, and overall amino acid composition. Preserved misalignment events are more radical for the agglutinins than for the cell wall proteins, but the conservation of overall motifs is similarly stringent. Additional diversity may be generated by the occasional occurrence of domain swapping between heads and shafts.
Green algae assemble numerous kinds of cell walls (Hallmann, 2003
Identification and Sequencing of Chlamydomonas incerta Genes
Orthologous Chlamydomonas incerta genes were identified by heterologous hybridization screening of the genomic library of C. incerta (CC-1870) generated as in Ferris et al. (1997)
Nucleotide sequences for SAG1 and SAD1 pairs of agglutinin shaft domains from C. reinhardtii and C. incerta were initially aligned using ClustalW, version 1.7 (Thompson et al., 1994
Tajima-Nei distances and SEs were calculated by MEGA 3.1 software (Kumar et al., 2004 Sequence data from this article can be found in the GenBank/EMBL data libraries under the following accession numbers. GP1 from C. reinhardtii and C. incerta: AF309494 and EF057410, VSP3 from C. reinhardtii and C. incerta: L29029 and AY795084, SAG1 from C. reinhardtii and C. incerta: AY450930 and AY937239, SAD1 from C. reinhardtii and C. incerta: AY450929 and AY858986.
The following material is available in the online version of this article.
We thank Dr. Patrick Ferris for his important intellectual and technical contributions to this project. Received April 25, 2007; accepted May 31, 2007; published June 15, 2007.
1 This work was supported by the National Institutes of Health (grant no. GM–26150), the National Science Foundation (grant no. MCB 0326829), and the Deutsche Forschungsgemeinschaft (grant no. Wa 659/8–1 to S.W.). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Ursula Goodenough (ursula{at}biology.wustl.edu).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.107.100891 * Corresponding author; e-mail ursula{at}biology.wustl.edu; fax 314–935–5125.
Acosta-Garcia G, Vielle-Calzada J (2004) A classical arabinogalactan protein is essential for the initiation of female gametogenesis in Arabidopsis. Plant Cell 16: 2614–2628 Adair WS, Hwang C, Goodenough UW (1983) Identification and visualization of the sexual agglutinin from mating-type plus flagellar membranes of Chlamydomonas. Cell 33: 183–193[CrossRef][ISI][Medline] Andrade MA, Perez-Iratxeta C, Ponting CP (2001) Protein repeats: structures, functions, and evolution. J Struct Biol 134: 117–131[CrossRef][ISI][Medline] Annunen P, Autio-Harmainen H, Kivirikko KI (1998) The novel type II prolyl 4-hydroxylase is the main enzyme form in chondrocytes and capillary endothelial cells, whereas the type I enzyme predominates in most cells. J Biol Chem 273: 5989–5992 Annunen P, Helaakoski T, Myllyharju J, Veijola J, Pihlajaniemi T, Kivirikko KI (1997) Cloning of the human prolyl 4-hydroxylase Baumberger N, Doesseger B, Guyot R, Diet A, Parsons RL, Clark MA, Smmons MP, Bedinger P, Goff SA, Ringli C, et al (2003) Whole-genome comparison of leucine-rich repeat extensins in Arabidopsis and rice: a conserved family of cell wall proteins form a vegetative and a reproductive clade. Plant Physiol 131: 1313–1326 Bolwell GP, Robbins MP, Dixon RA (1985) Elicitor-induced prolyl hydroxylase from French bean (Phaseolus vulgaris). Biochem J 229: 693–699[ISI][Medline] Cassab GI (1998) Plant cell wall proteins. Ann Rev Plant Physiol Plant Mol Biol 49: 281–309[CrossRef][ISI][Medline] Civetta A, Singh RS (1998) Sex-related genes, directional sexual selection, and speciation. Mol Biol Evol 15: 901–909[Abstract] Clark NL, Aagaard JE, Swanson WJ (2006) Evolution of reproductive proteins from animals and plants. Reproduction 131: 11–22 Coleman AW, Mai JC (1997) Ribosomal DNA ITS-1 and ITS-2 sequence comparisons as a tool for predicting genetic relatedness. J Mol Evol 45: 168–177[CrossRef][ISI][Medline] Colley KJ (1997) Golgi localization of glycosyltransferases: more questions than answers. Glycobiology 7: 1–13[Medline] Creamer TP (1998) Left-handed polyproline II helix formation is (very) locally driven. Proteins 33: 218–226[CrossRef][ISI][Medline] Egelund J, Obel N, Ulvskov P, Geshi N, Pauly M, Bacic A, Petersen BL (2007) Molecular characterization of two Arabidopsis thaliana glycosyltransferase mutants, rra1 and rra2, which have a reduced residual arabinose content in a polymer tightly associated with the cellulosic wall residue. Plant Mol Biol 64: 439–451[CrossRef][Medline] Elder JF, Turner BJ (1995) Concerted evolution of repetitive DNA sequences in eukaryotes. Q Rev Biol 70: 297–320[CrossRef][Medline] Estévez JM, Kieliszewski MJ, Khitrov N, Somerville C (2006) Characterization of synthetic hydroxyproline-rich proteoglycans with arabinogalactan protein and extensin motifs in Arabidopsis. Plant Physiol 142: 458–470 Ferris PJ, Pavlovic C, Fabry S, Goodenough UW (1997) Rapid evolution of sex-related genes in Chlamydomonas. Proc Natl Acad Sci USA 94: 8634–8639 Ferris PJ, Waffenschmidt S, Umen JG, Lin H, Lee JH, Ishida K, Kubo T, Lau J, Goodenough UW (2005) Plus and minus sexual agglutinins from Chlamydomonas reinhardtii. Plant Cell 17: 597–615 Ferris PJ, Woessner JP, Waffenschmidt S, Kilz S, Drees J, Goodenough UW (2001) Glycosylated polyproline II rods with kinks as a structural motif in plant hydroxyproline-rich glycoproteins. Biochemistry 40: 2978–2987[CrossRef][Medline] Galindo BE, Vacquier VD, Swanson WJ (2003) Positive selection in the egg receptor for abalone sperm lysin. Proc Natl Acad Sci USA 100: 4639–4643 Gao Z, Garbers DL (1998) Species diversity in the structure of zonadhesin, a sperm-specific membrane protein containing multiple cell adhesion molecule-like domains. J Biol Chem 273: 3415–3421 Goodenough U, Lin H, Lee J-H (2007) Sex determination in Chlamydomonas. Semin Cell Dev Biol (in press) Goodenough UW (1991) Chlamydomonas mating interactions. In M Dworkin, ed, Microbial Cell-Cell Interactions. American Society for Microbiology, Washington, DC, pp 71–112 Goodenough UW, Adair WS, Collin-Osdoby P, Heuser JE (1985) Structure of Chlamydomonas agglutinin and related flagellar surface proteins in situ and in vitro. J Cell Biol 101: 924–942 Goodenough UW, Armbrust EV, Campbell AM, Ferris PJ (1995) Molecular genetics of sexuality in Chlamydomonas. Plant Mol Biol 46: 21–44[Medline] Goodenough UW, Gebhart B, Mecham R, Heuser JE (1986) Crystals of the Chlamydomonas reinhardtii cell wall: polymerization, depolymerization, and purification of glycoprotein monomers. J Cell Biol 103: 408–417 Goodenough UW, Heuser JE (1985) The Chlamydomonas cell wall and its constituent glycoproteins analyzed by the quick-freeze deep-etch technique. J Cell Biol 101: 1550–1568 Goodenough UW, Heuser JE (1988a) Molecular organization of cell-wall crystals from Chlamydomonas reinhardtii and Volvox carteri. J Cell Sci 90: 717–733 Goodenough UW, Heuser JE (1988b) Molecular organization of the cell wall and cell-wall crystals from Chlamydomonas eugametos. J Cell Sci 90: 735–750 Goodenough UW, Heuser JE (1999) Deep-etch analysis of adhesion complexes between gametic flagellar membranes of Chlamydomonas reinhardtii (Chlorophyceae). J Phycol 35: 756–767[CrossRef][ISI] Hall Q, Cannon MC (2002) The cell-wall hydroxyproline-rich glycoprotein RSH is essential for normal embryo development in Arabidopsis. Plant Cell 14: 1161–1172 Hallmann A (2003) Extracellular matrix and sex-inducing pheromone in Volvox. Int Rev Cytol 227: 131–182[ISI][Medline] Hallmann A (2006) The pherophorins: common, versatile building blocks in the evolution of extracellular matrix architecture in Volvocales. Plant J 45: 292–307[CrossRef][ISI][Medline] Hallmann A, Amon P, Godl K, Heitzer M, Sumper M (2001) Transcriptional activation by the sexual pheromone and wounding: a new gene family from Volvox encoding the modular proteins with (hydroxy)proline-rich and metalloproteinase homology domains. Plant J 26: 583–593[CrossRef][ISI][Medline] Harwood R, Grant ME, Jackson DS (1974) Collagen biosynthesis: characterization of subcellular fractions from embyonic chick fibroblasts and the intracellular localization of protocollagen prolyl and protocollagen lysyl hydroxylases. Biochem J 144: 123–130[ISI][Medline] Hieta R, Myllyharju J (2002) Cloning and characterization of a low molecular weight prolyl 4-hydroxylase from Arabidopsis thaliana: effective hydroxylation of proline-rich, collagen-like, and hypoxia-inducible transcription factor alpha-like peptides. J Biol Chem 277: 23965–23971 Kamei N, Glabe CG (2003) The species-specific egg receptor for sea urchin sperm adhesion is EBR1, a novel ADAMTS protein. Genes Dev 17: 2502–2507 Keskiaho K, Hieta R, Sormunen R, Myllyharju J (2007) Chlamydomonas reinhardtii has multiple prolyl 4-hydroxylases, one of which is essential for proper cell wall assembly. Plant Cell 19: | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||