|
|
||||||||
|
Plant Physiol, February 2001, Vol. 125, pp. 1012-1022 Evidence That Intragenic Recombination Contributes to Allelic Diversity of the S-RNase Gene at the Self-Incompatibility (S) Locus in Petunia inflata1Department of Biochemistry and Molecular Biology (X.W., T.-H.K.) and Intercollege Graduate Degree Program in Plant Physiology (T.-H.K.), Pennsylvania State University, University Park, Pennsylvania 16802; Department of Biological Sciences, University of South Carolina, Columbia, South Carolina 29208 (A.L.H.); and Faculty of Horticulture, Chiba University, 648 Matsudo, Chiba 271-8510, Japan (T.T., T.A.)
For Solanaceae type self-incompatibility, discrimination between self and nonself pollen by the pistil is controlled by the highly polymorphic S-RNase gene. To date, the mechanism generating the allelic diversity of this gene is largely unknown. Natural populations offer a good opportunity to address this question because they likely contain different alleles that share recent common progenitors. We identified 19 S haplotypes from a natural population of Petunia inflata in Argentina, used reverse transcriptase-polymerase chain reaction to obtain cDNAs for 15 alleles of the S-RNase gene, and sequenced all the cDNAs. Phylogenetic studies revealed that five of these alleles and two previously identified alleles form a major clade, and that the 5' region of S19 allele was derived from an ancestor allele closely related to S2, whereas its 3' region was derived from an ancestor allele closely related to S8. A similar evolutionary relationship was found among S3, S12, and S15 alleles. These findings suggest that intragenic recombination contributed to the generation of the allelic diversity of the S-RNase gene. Two additional findings emerged from the sequence comparisons. First, the nucleotide sequence of the S1 allele identified in this work is completely identical to that of the previously identified S1 allele of a different origin. Second, in the two hypervariable regions HVa and HVb, thought to be involved in determining S allele specificity, S6 and S9 alleles differ only by four nucleotides, all in HVb, resulting in two amino acid differences. The implications of these findings are discussed.
Self-incompatibility (SI) in
flowering plants is a prezygotic reproductive barrier that enables the
pistil of a flower to reject self-pollen, but to accept nonself pollen
for fertilization (de Nettancourt, 1977 GSI has been extensively studied in the Solanaceae, which includes
Petunia, potato, tobacco, and tomato. The S-RNase
gene, a highly polymorphic pistil-specific gene at the S
locus, has been shown to encode the S haplotype specificity
determinant of the pistil (Lee et al., 1994 The molecular mechanisms responsible for the generation of haplotype
diversity at the S locus have been studied in Brassicaceae and Solanaceae SI systems. The central question is whether point mutation alone or point mutation and recombination/gene conversion has
contributed to the generation of allelic diversity. Results from
domain-swapping experiments suggest that point mutations in the S-RNase
gene can generate new allelic specificities (Matton et al., 1997 In fact, the following findings have even been interpreted to mean that
recombination does not contribute at all to the generation of
S haplotype diversity in the Solanaceae SI system. First,
the S locus contains the S-RNase gene and the pollen
S gene, so that recombination would destroy the tight
linkage between these two genes, and thus cause breakdown of SI by
generating different haplotype specificities for the pistil and pollen.
Second, the flanking regions of different alleles of the S-RNase gene
share virtually no sequence similarity and contain abundant repetitive sequences (Kaufmann et al., 1991 However, these aforementioned results do not completely rule out the
involvement of recombination. First, even though no recombination at
the S locus has ever been observed in genetic studies, it
does remain possible that recombinants are generated, but at a very low
frequency, and that those that become self-compatible have been
eliminated from the population. Second, the high degree of sequence
diversity in the flanking regions of the S genes might prevent intergenic, but not intragenic recombination. Moreover, the
S-RNases available to Clark and Kao (1991) In another more recent study of the Brassicaceae SI system, Kusaba et
al. (1997) These findings of possible intragenic recombination at the Brassicaceae S locus prompted us to reexamine this possibility for the S locus of the Solanaceae. In this study we used plants derived from seeds collected from a natural population of Petunia inflata in Argentina for the cloning of S-RNase sequences. Plants from the same population are more likely to contain alleles that are closely related and share recent progenitor alleles. We identified 19 different S haplotypes in this population and used reverse transcription (RT)-PCR to obtain cDNAs corresponding with 15 alleles of the S-RNase gene. Phylogenetic analyses using the deduced amino acid sequences of these alleles and a number of previously identified alleles showed that five of the alleles identified from this natural population and two previously identified alleles of P. inflata form a major clade. These closely related alleles allowed us to more precisely assess the occurrence of intragenic recombination between different alleles of the S-RNase gene and also to identify some other interesting features about the Solanaceae SI system.
Identification of S Haplotypes and Cloning of cDNAs for S-RNases of P. inflata from a Natural Population in Argentina One hundred SI plants were raised from seeds randomly collected
from a natural population of P. inflata found in Argentina, and 36 of them were used in this study. Nineteen different S
haplotypes were identified based on the results of genetic crosses
described in the "Materials and Methods." One of these 19 S haplotypes was designated S1
because its SI behavior was identical to that of one of the three
previously identified S haplotypes of P. inflata, S1, S2, and
S3 (Ai et al., 1990
Plants homozygous for all except S14 and
S18 haplotypes were used for cloning cDNAs
of S-RNases. We designed a degenerate primer based on the amino acid
sequences of the conserved region C2 of solanaceous S-RNases (Ioerger
et al., 1991 Genomic DNA-blot analysis was carried out to further ascertain whether the cDNA obtained by RT-PCR for each S haplotype encoded an S-RNase. If a cDNA encodes an S-RNase, it should reveal an S haplotype specific restriction fragment-length polymorphism (RFLP) and hybridize most strongly to the genomic fragment(s) from the same S haplotype. This is because different alleles of the S-RNase gene are divergent in sequence not only in the coding region, but also in the flanking regions. Blots containing EcoRI digests of genomic DNA isolated from plants homozygous for different S haplotypes were hybridized separately with the cDNAs isolated from the 17 S haplotypes. All except the cDNAs for S4 and S5 haplotypes revealed a hybridization pattern expected of cDNAs encoding S-RNases. The results of genomic DNA blotting using the cDNA for S8 haplotype as a probe are shown in Figure 1. The cDNA hybridized to an EcoRI fragment of different size in each of the S8S8, S12S12, S13S13, and S20S20 homozygotes, and did not hybridize to the genomic DNA digests of the other eight S homozygotes on the blot. Moreover, the intensity of the hybridization to the DNA fragment of S8S8 homozygote was the strongest.
The cDNAs for all the 17 S haplotypes were completely
sequenced, and all except the cDNAs for S4
and S5 haplotypes showed sequence
similarity with the sequences of known solanaceous S-RNases. Thus,
cDNAs for a total of 15 alleles of the S-RNase gene identified from a
natural population of P. inflata were cloned. Because the upstream primer used in RT-PCR corresponded to the C2 region of S-RNases, the cDNAs obtained encoded approximately 167 amino acids of
S-RNases, with approximately 35 amino acids at the N-terminal end not
included. It is interesting that the nucleotide sequence (both coding
and the 3'-non-coding) of the S1-RNase gene
identified in this work is completely identical to that of the
previously identified S1-RNase gene. Moreover,
when the genomic DNA of these two independent sources of
S1 haplotype was digested with several restriction enzymes and hybridized with the full-length cDNA for S1-RNase, no RFLP was observed (results not
shown), indicating that there is little, if any, sequence difference in
the immediate flanking regions of the S-RNase gene. However, when the
same genomic DNA blot was hybridized with cDNAs for two
pollen-expressed genes that are linked to, but located at as yet
undetermined distances from the S-RNase gene (McCubbin et al., 2000 Sequence Comparison among S-RNases Deduced amino acid sequences of the 15 S-RNases of P. inflata identified from this natural population, as well as those
of S2- and S3-RNases of
P. inflata were aligned by CLUSTAL X (Thompson et al.,
1997
Molecular Phylogeny of S-RNases A phylogenetic tree of the 15 S-RNases of P. inflata
whose sequences were determined in this work and 19 previously reported solanaceous S-RNases was constructed by the neighbor-joining method based on the proportion of amino acid differences (Fig.
3). The tree was rooted with a fungal
RNase (RNase T2) and two S-like RNases (le and lx) from tomato. The
tree shows that five of the 15 P. inflata S-RNases from this
natural population (S8,
S12, S13,
S15, and S19) and the
previously identified S2- and
S3-RNases form a major clade, and the other 10 are grouped with one another in the same cluster with S-RNase(s) from
other species of the Petunia genus, or with S-RNase(s) from
another genus of the Solanaceae. This pattern of trans-species
polymorphism, first observed by Ioerger et al. (1990)
Different Evolutionary Histories of the 5' and 3' Regions of the S-RNase Gene To ascertain the possibility that intragenic recombination could have generated closely related alleles of the S-RNase gene we compared the seven closely related alleles mentioned above to see whether different regions of the S-RNase gene have taken the same evolutionary pathway. If allelic diversity of the S-RNase gene is entirely due to the accumulation of point mutations over evolutionary time, different regions of the gene would be expected to have consistent genealogies because the entire gene would have evolved as a unit. On the other hand, if recombination among different alleles of the S-RNase gene has occurred, it would shuffle polymorphic sequences among alleles, causing different regions of the S-RNase gene to have different evolutionary histories. As an exploratory tool we plotted ps (the proportion of synonymous differences per synonymous site) and pn (the proportion of non-synonymous differences per non-synonymous site) in a sliding window of 30 codons for all possible pairwise comparisons among the aforementioned seven alleles of the S-RNase gene. The plots for the pair of the S2-RNase gene and the S19-RNase gene (Fig. 4A) show that they are more similar to each other at synonymous and non-synonymous sites in the 5' region than in the 3' region of the gene. The plots for the pair of the S8-RNase gene and the S19-RNase gene (Fig. 4B) show that they are more similar to each other at synonymous and non-synonymous sites in the 3' region than in the 5' region of the gene. The maximum chi-square method gave statistically significant evidence (P = 0.007) of a recombination event giving rise to different evolutionary histories in the 5' region (codons nos. 1-68; refer to Fig. 2A for the numbering of amino acids) and the 3' region (codons nos. 69-168) of the S-RNase gene.
However, a significant result of the maximum chi-square method alone is
not definitive evidence of recombination because several alternative
hypotheses might explain the observation of greater similarity between
alleles in one region of a gene than in another region. To decide
between the hypothesis of recombination and alternative
explanations we constructed separate phylogenetic trees for the 5'
(codons nos. 1-68) and 3' (codons nos. 69-168) regions of the
S2-, S8-,
S13-, and S19-RNase genes.
For each region we constructed two separate trees on the basis of
ps and pn.
Recombination is expected to produce similarity at synonymous and
non-synonymous sites, whereas certain other factors can yield
similarity at synonymous sites alone or at non-synonymous sites alone.
For example, a shared G + C content bias can produce an unexpectedly
high similarity at synonymous sites, whereas a shared functional
constraint can produce an unexpectedly high similarity at
non-synonymous sites (Hughes, 1991
Another group of alleles in which we detected the existence of recombination are S3, S12, and S15. The maximum chi-square test supported the hypothesis of recombination (P = 0.048) between the 5' region (codon nos. 1-62) and the 3' region (codons nos. 63-168) of these alleles. Phylogenetic analysis based on ps and pn showed that the 5' region of the S3-RNase gene is closely related to that of the S12-RNase gene, whereas the 3' region is closely related to that of the S15-RNase gene (Fig. 5, C and D). For both sets of alleles, because the regions examined are large (encoding at least 62 amino acids) and the bootstrap values are high, the observed topological differences are unlikely to be due to stochastic error. Thus, our results suggest that intragenic recombination has played a role in the generation of new allelic specificities of the S-RNase gene.
Use of RT-PCR in Cloning cDNAs for S-RNases from Natural Populations Although cDNAs for a large number of S-RNases have been cloned and
sequenced from a number of solanaceous species, only a few studies so
far have used plants derived from natural populations as the source of
S haplotypes. Richman et al. (1995 In our study of a natural population of P. inflata identified in Argentina we first carried out genetic crosses to identify the S genotype of each of the plants used and then generated progeny plants homozygous for each of the S haplotypes identified. Using S homozygotes for RT-PCR has allowed unambiguous assignment of the resulting cDNAs to a particular S allele of the S-RNase gene. To obtain longer cDNAs for sequence comparison we chose a pair of PCR primers based on the sequence of the C2 region and the poly(A)+ tail; in this way, the cDNAs obtained lack only approximately 100 bp of the sequence upstream from the C2 region, and the deduced amino acid sequences downstream from the C2 region account for approximately 80% of the full-size S-RNases. We also used genomic-blotting analysis to examine the cDNAs obtained by RT-PCR to see whether each revealed an S haplotype specific RFLP. Subsequent sequencing of all the cDNAs showed that those cDNAs that revealed this pattern encode S-RNases and those that did not encode proteins unrelated to S-RNases. Genetically Identical S Haplotypes Identified from Different Natural Populations When studying natural populations of SI species, questions can be asked as to whether genetically identical S haplotypes are contained in different populations and if so, whether the deduced amino acid sequences of their corresponding S locus genes are identical. If differences are found, those amino acids that are different can be eliminated from consideration as playing a role in the determinant of S allelic specificity. Prior to this study only three S haplotypes,
S1, S2, and
S3, had been identified from P. inflata. Although the geographical origin of the plants from which
these haplotypes were identified is not known, it is unlikely to be the
same as the natural population studied here. It is thus interesting to
find that one of the haplotypes, S1, was
also present in this natural population. In fact, no sequence differences were found between these two
S1 alleles of the S-RNase gene in the
coding region or the 3'-non-coding region. Preliminary results suggest
that the sequence conservation in the S locus of these two
S1 haplotypes likely extends beyond the
S-RNase gene, because no RFLP was observed when the genomic DNA digests
from these two S1 haplotypes were
hybridized with the cDNA for S1-RNase. However,
the conservation appears to break down in other parts of the
S locus because RFLP was revealed when cDNAs for two
S-linked pollen-expressed genes were used as hybridization
probes. These findings suggest that these two
S1 haplotypes may have evolved independently for some time and that the S-RNase gene may be under strict functional constraint to maintain allelic identity. Perfect amino acid sequence conservation has also been found between pistil S
proteins of two genetically identical S haplotypes
identified in a British and a Spanish population of Papaver
rhoeas, a species that possesses a different type of GSI from
solanaceous species (Walker et al., 1996 Role of Two Hypervariable Regions in S Specificity Among all the solanaceous S-RNases that have been reported so far,
the pair of S6- and
S9-RNases identified in this work are most
similar to each other in the two hypervariable regions; they have an
identical sequence in HVa and only differ by two amino acids in HVb.
These two regions have recently been suggested to be the sole
determinant of the S allele specificity of S-RNases (Matton
et al., 1997 Intragenic Recombination As a Mechanism of Generation of Allelic Diversity The closely related alleles of the S-RNase gene that we identified in a natural population of P. inflata have allowed us to obtain the first evidence that apart from accumulation of point mutations, allelic diversity of the S-RNase gene has also been generated by intragenic recombination. This finding is quite interesting, considering that recombination has previously been thought to be absent at the S locus to maintain tight linkage between the S-RNase gene and the pollen S gene. One can envision that point mutations would first produce highly
divergent polymorphic sequence motifs, and then intragenic recombination would shuffle these polymorphic sequence motifs into
different allelic combinations. If accumulation of point mutations were
solely responsible for allelic diversity of the S-RNase gene, favorable
mutations would have to occur consecutively in the same allelic
lineage. However, the additional involvement of intragenic
recombination could bring together mutations that have occurred in
different allelic lineages into new allelic forms and thereby speed up
the process of allelic diversity generation. This evolutionary process
is very similar to the diversification of alleles of the major
histocompatibility (MHC) loci, which are also involved in self/nonself
recognition (Hughes et al., 1993 If recombination and/or point mutations result in a new S
allele specificity of the S-RNase gene, there must be complementary changes in the pollen S gene for an SI plant to remain SI
while acquiring a new haplotype specificity. How the pistil and pollen genes "communicate" with each other during the mutational process is one of the most perplexing questions in the study of SI. A solution
has recently been proposed based on the finding of a hybrid S-RNase
that possesses dual S allele specificity (Matton et al.,
1999
Plant Material The natural population of Petunia inflata R. E. Fr. (Petunia integrifolia [Hook.] Schinz & Thell. subsp. inflata [R. E. Fr.] Wijsman) selected for this study was designated A9. It was located 1.6 km east of the junction of Ruta Nacional 4 and Ruta Provincial 103, Municipal Candelaria, Provincial Misiones, Argentina (27 23'13"S, 55 34'00"W). The population was comprised of approximately 100 individuals and was established at a gentle load-side slope poorly covered with vegetation. Seed capsules were randomly collected on November 28, 1990, and the seeds were mixed together and stored in a refrigerator. Thirty-six plants were raised from the seeds, and they were coded from A9-1 to A9-36. All of them were SI as judged by their failure to set seed upon selfing and by the arrest of pollen tube growth in the upper one-third segment of the style. Determination of S Genotypes Pollen from plants homozygous for the previously identified
S1,
S2, and
S3 haplotypes of P.
inflata (Ai et al., 1990 Isolation of Total RNA from Pistils Pistils (without ovaries) were collected 1 or 2 d before
anthesis from plants homozygous for a particular S
haplotype, frozen in liquid nitrogen, and were ground to fine powder.
Total RNA was extracted from the powder using TRIzol reagent (Life
Technologies, Rockville, MD) according to the protocol recommended by
the manufacturer. RNA was recovered by precipitation with isopropyl
alcohol. After washing with 75% (v/v) ethanol, the RNA pellet was
dissolved in water and stored at Isolation of Genomic DNA For each S genotype, 0.2 to 0.3 g of freshly collected young leaves were pulverized in liquid nitrogen, and genomic DNA was extracted from the homogenate with Plant DNAzol Reagent (Life Technologies). Chloroform was then used to remove polysaccharides. Following chloroform extraction, plant debris was removed by centrifugation and DNA was precipitated from the supernatant with ethanol. The resulting DNA pellet was washed with ethanol and dissolved in water. RT-PCR, Cloning, and Sequencing of cDNAs Encoding S-RNases To amplify cDNA for each S-RNase, first-strand cDNA was synthesized at 42°C for 60 min in a 20-µL reaction buffer (50 mM Tris-HCl, pH 8.3, 75 mM KCl, 3 mM MgCl2, and 10 mM dithiothreitol) that contained 1 to 5 µg of total RNA, 500 ng of oligo (dT)17, 0.5 mM dNTPs, and 200 units of SuperScript II reverse transcriptase (Life Technologies). After the reaction, a 0.5-µL aliquot of the single-stranded cDNA was used as a template for PCR amplification in a 20-µL reaction buffer (20 mM Tris-HCl, pH 8.4, 2 mM MgCl2, and 50 mM KCl) containing 2.5 µM upstream primer, 2.5 µM oligo (dT)17, 0.1 mM dNTPs, and 2 units of Display Taq DNA polymerase (PGC Scientific, Gaithersburg, MD). The upstream primer was a degenerate primer, 5'-ATTCAYGGGCTTTGGCCC-3' (Y indicating C or T), which was designed based on the sequence of the conserved region C2 of S-RNases. The reaction mixture was denatured at 94°C for 2 min and then cycled as follows: 94°C for 30 s, 40°C for 2 min, and extension at 72°C for 30 s. After the first five cycles the annealing condition was changed to 45°C for 90 s. The reaction was continued for 25 more cycles with an additional 5-min extension after the final cycle. Aliquots of all PCR reaction mixtures were analyzed on 1% (w/v) agarose gels. The remainder of each PCR reaction mixture was purified, ligated into pGEM-T vector (Promega, Madison, WI), and transformed into Escherichia coli. Plasmid DNA was prepared from multiple colonies of each transformation using a NucleoSpin Plasmid Miniprep Kit (CLONTECH, Palo Alto, CA). Sequencing was carried out on an ABI 60 sequencer at the Pennsylvania State University Life Sciences Consortium's Nucleic Acid facility. To eliminate potential errors introduced by PCR, two or more clones obtained from independent runs of RT-PCR were sequenced for each S haplotype. DNA-Blot Analysis Plasmid DNA (100 ng) was digested with EcoRI (1 unit) and genomic DNA (5 µg) was digested with EcoRI (20 units); the digests were separately electrophoresed on 0.7% (w/v) agarose gels at 25 V overnight. The gels were blotted to Biodyne B nylon membranes (Life Technologies). The DNA fragments were fixed on the membranes by baking at 80°C for 2 h. The membranes were pre-hybridized at 65°C for 1 h in a solution containing 10% (w/v) Dextran sulfate, 1 M NaCl, and 1% (w/v) SDS. Radiolabeled DNA probes were prepared by using an RTS RadPrime DNA Labeling kit (Life Technologies). Hybridization was performed at 65°C overnight in the pre-hybridization solution plus radiolabeled probe. The membranes were washed twice in 0.1× SSC, 0.1% (w/v) SDS for 30 min each (high stringency) or in 2× SSC, 0.1% (w/v) SDS for 30 min each (low stringency), and exposed to phosphoscreen for 8 h. The screen was scanned by a phosphorimager (Molecular Dynamics, Sunnyvale, CA). Statistical Methods Nucleotide and deduced amino acid sequences of cDNAs for
S-RNases were aligned using CLUSTAL X (Thompson et al., 1997
We thank Andrew G. McCubbin for helpful discussions and critical reading of the manuscript and Jiong Wang for helping to maintain plants in the greenhouse and collecting tissues.
Received September 8, 2000; returned for revision October 11, 2000; accepted November 11, 2000. 1 This work was supported by the National Science Foundation (grant nos. IBN-9603993 and IBN-9982659 to T.-H.K.) and by the National Institutes of Health (grant no. GM34940 to A.L.H.).
* Corresponding author; e-mail: txk3{at}psu.edu; fax 814-863- 9416.
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|