|
|
||||||||
|
First published online January 21, 2005; 10.1104/pp.104.052829 Plant Physiology 137:500-513 (2005) © 2005 American Society of Plant Biologists Comparative Genomics of the Pennate Diatom Phaeodactylum tricornutum1,[w]Laboratory of Cell Signalling, Stazione Zoologica Anton Dohrn, I80121 Naples, Italy (A.M., C.B.); Signalisation et Morphogenèse des Diatomées, CNRS FRE2910, Ecole Normale Supérieure, 75230 Paris, France (A.M., K.J., C.B.); and Avestha Gengraine Technologies Pvt. Ltd., International Technology Park, Bangalore 560066, India (U.M.)
Diatoms are one of the most important constituents of phytoplankton communities in aquatic environments, but in spite of this, only recently have large-scale diatom-sequencing projects been undertaken. With the genome of the centric species Thalassiosira pseudonana available since mid-2004, accumulating sequence information for a pennate model species appears a natural subsequent aim. We have generated over 12,000 expressed sequence tags (ESTs) from the pennate diatom Phaeodactylum tricornutum, and upon assembly into a nonredundant set, 5,108 sequences were obtained. Significant similarity (E < 1E-04) to entries in the GenBank nonredundant protein database, the COG profile database, and the Pfam protein domains database were detected, respectively, in 45.0%, 21.5%, and 37.1% of the nonredundant collection of sequences. This information was employed to functionally annotate the P. tricornutum nonredundant set and to create an internet-accessible queryable diatom EST database. The nonredundant collection was then compared to the putative complete proteomes of the green alga Chlamydomonas reinhardtii, the red alga Cyanidioschyzon merolae, and the centric diatom T. pseudonana. A number of intriguing differences were identified between the pennate and the centric diatoms concerning activities of relevance for general cell metabolism, e.g. genes involved in carbon-concentrating mechanisms, cytosolic acetyl-Coenzyme A production, and fructose-1,6-bisphosphate metabolism. Finally, codon usage and utilization of C and G relative to gene expression (as measured by EST redundance) were studied, and preferences for utilization of C and CpG doublets were noted among the P. tricornutum EST coding sequences.
The unicellular brown algal class Bacillariophyceae (diatoms) is among the most successful and diversified groups of photosynthetic eukaryotes, with probably over 100,000 extant species (Round et al., 1990
One of the most distinctive features of diatoms is their ability to precipitate soluble silicic acid into a finely patterned silica cell wall. Diatom silicon metabolism is an important regulator of biogeochemical cycling of silicon in the oceans and is also crucial to diatom survival and life histories. Nonetheless, these particular metabolic pathways remain largely unexplored and constitute a clear potential source for discovery of novel protein functions. Indeed, the first identified components of diatom cell wall synthesis, such as silaffins and silicon transporters (Hildebrand et al., 1998
More generally, diatoms have a peculiar genetic makeup in that, unlike the green and red lineages of photosynthetic organisms, they are likely to have emerged from a secondary endosymbiosis between a photosynthetic eukaryote, most probably red algal-like, and a heterotrophic eukaryote (Falkowski et al., 2004
The complete nuclear, mitochondrial, and plastid genome sequences of the centric diatom Thalassiosira pseudonana have recently become available (Armbrust et al., 2004
T. pseudonana has a small genome (34 Mb) and the genus Thalassiosira is of significant ecological importance, with some species being major components of the total phytoplankton biomass in their ecosystems. P. tricornutum is instead thought to be of little ecological relevance and, unlike other diatoms, it does not have an obligate requirement for silicon and can undergo morphological transitions between three possible morphotypes (Borowitzka and Volcani, 1978
Besides T. pseudonana, the green alga Chlamydomonas reinhardtii and the red alga Cyanidioschyzon merolae are currently the only eukaryotic microalgae whose genomes are publicly available (Matsuzaki et al., 2004
We report here the generation and analysis of 12,136 expressed sequence tags (ESTs) from the pennate diatom P. tricornutum and their subsequent comparison with the whole-genome predicted proteomes of T. pseudonana, C. reinhardtii, and C. merolae. This analysis greatly extends an initial study of 997 P. tricornutum ESTs (Scala et al., 2002
Assembly and Functional Characterization of 12,136 Phaeodactylum tricornutum ESTs
Over 12,000 P. tricornutum cDNAs from a previously described cDNA library (Scala et al., 2002
We subsequently proceeded to functionally annotate the P. tricornutum NRS using two local alignment search programs. First, individual ESTs and contig consensi were searched for similarity to proteins in GenBank by means of the BLASTX algorithm (Altschul et al., 1997 An identical analysis was performed with 15,171 ESTs from T. pseudonana, kindly provided by Mark Hildebrand (Scripps Institution of Oceanography, San Diego) and Diego Martinez (Department of Energy Joint Genome Institute, Walnut Creek, CA). We estimated that this collection contains around 5,500 nonredundant sequences (data not shown).
A queryable relational database, The Diatom EST Database (http://avesthagen.sznbowler.com), was constructed with all the sequence information and functional analyses described above, organized into PtDB (for P. tricornutum) and TpDB (for T. pseudonana; Maheswari et al., 2005
The genomes of the green alga C. reinhardtii, the red alga C. merolae, and the centric diatom T. pseudonana, estimated to be 90%, 99.98%, and 98% complete, respectively, have recently become available (Armbrust et al., 2004
A P. tricornutum NRS sequence was considered to have a similar representative in a genome if it showed similarity to at least one of its putative encoded proteins with E < 1E-04. A nonstringent E-value threshold was used to ensure that proteins sorted as being absent in a certain genome were really absent in it (no hits with E < 1E-04), because it was our ultimate goal to identify P. tricornutum nonredundant sequences that are absent in T. pseudonana (see below). From these analyses, it was apparent that the category with the most sequences was "no similarities in any of the three genomes" (2,318 sequences, 45% of the NRS), followed by "similarities in all three" (1,233 sequences, 24% of the NRS; Fig. 1A). This was perhaps not surprising given the probable abundance of untranslated transcript regions among our ESTs and that conserved genes are always abundant when comparing virtually any two photosynthetic eukaryotes. A total of 820 P. tricornutum NRS sequences (16%) were found to have similarities in T. pseudonana, but not in the other two genomes, while for C. merolae and C. reinhardtii, these values were just 25 and 70, respectively. Similar results were obtained by Armbrust et al. (2004)
The fact that the two diatoms have more genes in common with C. reinhardtii than with C. merolae (367 versus 251; Fig. 1A) might prompt one to conclude that diatoms are more closely related to the green lineage than they are to the red branch. This would contradict the commonly accepted theory that diatoms arose by endocytobiosis of a red algal ancestor by a heterotrophic eukaryote (Falkowski et al., 2004
Pennate diatoms are believed to have arisen from the polar centric diatoms (Medlin et al., 2000
We detected 70 P. tricornutum NRS sequences with similarities in the green algal genome but not in T. pseudonana or C. merolae (Fig. 1A). Among these, 13 have similarities in the SwissProt database (E < 1E-10) and only 1 is represented more than 3 times in our EST collection (Table II). This redundant transcript, PTMM01656, encodes a
To understand whether these putative CAs may have been derived from the secondary endosymbiont or were present in the nuclear or mitochondrial genomes of the ancestral heterotrophic host, we aligned the two P. tricornutum ESTs with the most similar CA-like genes in photosynthetic and heterotrophic eukaryotes and prokaryotes, and derived a neighbor-joining tree (Fig. 3). The two diatom sequences clustered with
CAs catalyze the reversible dehydration of bicarbonate into CO2 and in photosynthetic organisms are fundamental players in carbon-concentrating mechanisms (CCM), which ensure a high concentration of CO2 in the vicinity of the Rubisco enzyme (Kaplan and Reinhold, 1999 , , and , all of which require a zinc atom as ligand for catalytic activity. Studies of carbon utilization in the centric diatom Thalassiosira weissflogii revealed the existence of two novel CAs, a zinc-dependent enzyme encoded by TWCA1 (Morel et al., 1994 -type CAs but does contain putative - and -type CA genes, along with a probable TWCA1 ortholog (Armbrust et al., 2004
None of the putative CAs encoded in the T. pseudonana genome appears to contain a plastid-targeting sequence, which could support the provocative hypothesis of Morel and colleagues (Reinfelder et al., 2000
On the other hand, PTMM00034 in Table II is highly similar (41% ID) to the N-terminal portion of a mammalian guanine deaminase (GDA) that has been characterized biochemically (Yuan et al., 1999 It is worth noting that our starting datasets and sorting procedures may have caused one sequence, PTMM09702, to be improperly classified in Table II. A highly similar region of the T. pseudonana genome was detected by tBLASTX (1E-16; Table II), but the gene-finding software failed to predict any coding sequence in this region. Furthermore, the sequence presents poor similarity to a C. reinhardtii model (1E-05; Table II), just below the threshold established, for which this gene may be present in T. pseudonana and absent in C. reinhardtii rather than vice versa. As discussed above, such errors occur as an intrinsic limitation of bioinformatic approaches. In curated annotations or individual case studies, additional manual inspections should therefore be conducted.
Among the 25 translated P. tricornutum NRS sequences showing similarity to red algal proteins but not to T. pseudonana or C. reinhardtii proteins (Fig. 1A), only 3 had significant similarity to SwissProt accesions (E < 1E-10), and none of these are highly redundant in the P. tricornutum EST collection (Table III). The only sequence with a high degree of similarity to a SwissProt entry (PTSS00077, over 60% ID to a C-terminal region of animal and fungal UDP-protein glucosyltransferases) covers only about 100 amino acids.
Twenty-four NRS sequences were sorted as being present in both red and green algal genomes but not in T. pseudonana (Fig. 1A). Nine of these could be annotated by BLASTX analysis against SwissProt (E < 1E-10; Table IV), only three of which have one or two redundant sequences in the raw collection. Like PTMM00034 in Table II, PTMM09692 was similar to mammalian guanine deaminases but to a lesser extent (24%30% ID) and to a more C-terminal region. It appears in Table IV rather than Table II because this more downstream region was weakly similar to a C. merolae predicted protein (E = 8E-09; Table IV) that is most similar to bacterial chlorohydrolases, which are close relatives of guanine deaminases (most are classified as "cytosine deaminases and related hydrolases" in the COG database). The sequence PTMM09692 is more similar to the C. reinhardtii putative GDA gene than to the C. merolae chlorohydrolase-like sequence, and it is possible that PTMM00034 and PTMM09692 are nonoverlapping ESTs from a unique cDNA.
On the other hand, Table IV includes a sequence, PTMM05217, which shows over 60% ID to the C terminus of mammalian and insect cytosolic ATP-citrate lyases (ACLs). We identified another NRS sequence, PTMM07214, which was similar to a more upstream region of animal ACLs by tBLASTN searching the P. tricornutum EST collection with a complete human ACL. This additional EST does not appear in Tables II to IV
ACL (EC 2.3.3.8) is a multimeric enzyme that catalyzes the generation of acetyl-CoA and oxaloacetate from citrate and CoA and constitutes a major source of cytoplasmic acetyl-CoA in mammals, plants, and oleaginous yeasts (Rangasamy and Ratledge, 2000
No
Table IV also includes singleton PTMM07433, which likely encodes a eukaryotic type I Fru-1,6-bisphosphate aldolase (FBA) and implies a further complication in the already complex phylogeny of FBAs. FBAs are amphibolic enzymes involved in glycolysis, gluconeogenesis, and Calvin cycle reactions. Two types of evolutionarily unrelated FBAs are known, and both can catalyze the cleavage of FBA to glyceraldehyde-3-P plus dihydroxyacetone-P in catabolic functions and the reverse condensation in anabolic pathways. FBA-encoding genes have undergone numerous lateral transfers, gene duplications, and gene product recompartmentalizations throughout their phylogenetic history (Rogers and Keeling, 2004
In their recent summary, Patron et al. (2004)
On the other hand, the typically prokaryotic type I sequence PTMM00206, a probable case of lateral transfer (Patron et al., 2004
We next attempted to identify P. tricornutum NRS sequences with no related sequences in T. pseudonana or the other 2 algal genomes but that can be recognized based on their similarity to proteins in the SwissProt database. Following initial BLASTX analyses against the predicted T. pseudonana proteome, 2,437 NRS ESTs (47.7%) were found to have no matches (E > 1E-04), and 550 of these were represented by more than 1 EST (data not shown). We filtered this set further by a tBLASTX search against the T. pseudonana genome sequence to double check for possible T. pseudonana genes that may have been missed by the gene-finding software used for T. pseudonana genome annotation (Armbrust et al., 2004
Among the 820 sequences common in the two diatoms but absent in the green and red algae, genes that are usually absent in primary endosymbiosis-derived photosynthetic organisms but are known in eukaryotic heterotrophs or prokaryotes can be identified, along with known brown lineage-specific genes. As many as 119 of the sequences in this subset (14.5%) could be assigned a tentative SwissProt annotation (E < 1E-10; Supplemental Table II). Among the best 60 SwissProt annotations in this table (supported by E < 1E-20), the abundance of the phaeophyte-specific FCP proteins (five sequences) and of sequences related to nitrogen metabolism (11 sequences) or to utilization of citrate (four sequences) is particularly striking. The detection of several components of a urea cycle, typically a metazoan feature, was one of the major surprises from the recent T. pseudonana genome analysis (Armbrust et al., 2004 In an independent analysis, we identified putative novel diatom-specific sequences by selecting those that did not present similarity to any protein in SwissProt (E < 1E-04) and then searching the T. pseudonana genome using BLASTX (E < 1E-20). A more stringent E-value threshold was set for the T. pseudonana BLASTX analysis in this case to decrease the number of false positives (regions of weak similarity that are not likely to represent related genes). Up to 3,949 translated NRS sequences did not resemble any sequence in SwissProt, and 1,028 of these had potential orthologs encoded in the T. pseudonana genome (Supplemental Table III). This suggests that multiple novel gene families are bound to be identified among diatoms, a possibility that will probably be confirmed with the forthcoming P. tricornutum genome and other diatom cDNA sequencing projects. The function of such sequences being unknown, we can only note that 36 of them are rather highly expressed (more than 10 ESTs) and constitute an interesting, manageable set for benchwork aimed at discovering novel gene functions.
The cDNA library employed to generate the ESTs discussed here was created from a culture grown in a single condition (standard exponential growth conditions), so many genes that are expressed only under particular environmental conditions will not be represented in our sequence dataset. In addition, the cDNA library was not normalized or subtracted for any sequences. These manipulations are known to improve the rate of gene discovery, as they reduce the amount of nontarget cDNAs (e.g. highly redundant sequences; Bonaldo et al., 1996
The 20 most redundant P. tricornutum NRS sequences are represented by between 52 and 315 ESTs (Table VII). Among these highly represented transcripts, we found known sequences encoding P. tricornutum FCPs (Bhaya and Grossman, 1993
We next aimed to derive codon usage tables for both species, which required the selection of a subset of sequences with a known protein coding frame. We chose to assign a sound frame when a translated EST sequence showed similarity to known proteins (GenBank protein database BLASTX) with more than 50% coverage and more than 35% identity. We thus obtained subsets of 859 P. tricornutum and 465 T. pseudonana NRS sequences, with which codon usage was tabulated (Supplemental Table IV; also available at http://avesthagen.sznbowler.com). Inspection of P. tricornutum codon usage tables revealed that C is the preferred third-codon-position nucleotide in 13 out of 16 triplets. This is not the case for T. pseudonana, in which C is used in only 9 out of 16 cases. Interestingly, Arg is preferentially encoded by CGN codons in P. tricornutum, whereas T. pseudonana avoids this quartet in favor of the duet AG(A/G). Such differential use of synonymous codons in the two diatoms may be related to tRNA abundance, as previously suggested for Escherichia coli and yeast (Akashi, 2001
Because the latter observations could not be explained by a marked difference in GC content, we analyzed the content of CpG dinucleotides in the two diatoms. The C of such doublets is a widespread target for DNA methylation, and because mC is converted to T upon deamination, these sites are particularly susceptible to mutation (Jabbari et al., 1997 Upon analysis of the subsets of ESTs with an assigned frame, the occurrence of CpG doublets was found to be significantly higher (P = 0.0001; unpaired t test) in P. tricornutum than in T. pseudonana (Table VIII). The two species contain these doublets more often between codons in which a mutation of the C is more likely to lead to a synonymous change than in positions 1,2 or 2,3. It is also in position 3,1 that the difference between CpG usage in the two diatoms is most pronounced. These data may therefore suggest that T. pseudonana is more susceptible to the CpG to TpG transition than P. tricornutum. This could be due to differences in genome methylation levels or in robustness of DNA repair mechanisms between the two diatom species. Other factors related to coding constraints and, consequently, to base compositional correlations along coding sequences could also contribute to these differences in dinucleotide composition.
Because the majority of preferred codons in P. tricornutum are C-ending (13 out of 16), we plotted C3% NRS sequences against the number of ESTs in its cluster. The 859 NRS sequences used to derive the P. tricornutum codon usage table were sorted into three C3% categories (containing equal numbers of sequences) and their average redundancy was calculated (Fig. 5). NRS sequences with a low C3% were represented on average by less than two redundant sequences, while those with a high C3% were typically represented by more than five redundant ESTs. The differences observed between the three C3% categories are statistically significant (P = 0.035; Kruskal-Wallis nonparametric test). This is in agreement with the preference for C at the third codon position noted above, because abundant codons have been shown to be recognized and translated more quickly, and with fewer errors, than their less frequent counterparts (Andersson and Kurland, 1990
The EST project reported here may have identified an important number of the total P. tricornutum genes. Comparative analyses of these sequences with other eukaryotic algae, and in particular with T. pseudonana, has brought to light a number of interesting features and has allowed a preliminary comparison of gene repertoires between pennate and centric diatoms. The P. tricornutum genome is currently being sequenced by the Joint Genome Institute (Walnut Creek, CA). Once sequencing and assembly have been completed, the genes will be modeled with the help of an additional 60,000 ESTs generated from a range of cDNA libraries by Genoscope (Evry, France). The availability of such large amounts of sequence information will open new horizons in our studies of diatom phylogeny, ecology, physiology, and molecular biology. This study should serve as a useful platform for these new initiatives and as a basis for hypothesis-driven research aimed at dissecting the novel features of diatom biology and the molecular differences between the two major classes of diatoms. The putative genes encoding CAs, ACLs, and FBAs identified here are interesting cases in point.
Generation of 12,136 ESTs
Over 12,000 bacterial clones were picked at random from a previously described cDNA library (Scala et al., 2002
The initial EST collection was assembled into an NRS by means of the CAP3 algorithm (Huang and Madan, 1999
All EST sequences and contigs, together with alignments of redundant clusters and functional annotations, were organized into the twin databases PtDB and TpDB for P. tricornutum and Thalassiosira pseudonana, respectively, together forming The Diatom EST Database (http://avesthagen.sznbowler.com). The T. pseudonana clone CCMP1335 ESTs were a kind gift from Mark Hildebrand (Scripps Institution of Oceanography, San Diego) and Diego Martinez (Department of Energy Joint Genome Institute, Walnut Creek, CA). The programming is the work of Uma Maheswari and the database is maintained at the Avestha Gengraine facilities under contract. Technical details on the construction of this database have been published elsewhere (Maheswari et al., 2005
The P. tricornutum NRS was compared to the predicted proteins of Chlamydomonas reinhardtii (http://www.biology.duke.edu/chlamy_genome), Cyanidioschyzon merolae (http://merolae.biol.s.u-tokyo.ac.jp), and T. pseudonana (http://spider.jgi-psf.org/JGI_microbial/html) using BLASTX. A pennate EST was considered to be related to a protein in a different genome if it displayed similarity to it in any of its translated frames with E < 1E-04. After sorting the P. tricornutum sequences as being present in one, two, or all three genomes, tentative annotations were performed by using the BLASTX algorithm against SwissProt, as described in "Results and Discussion."
Neighbor-joining trees were derived in selected cases to evaluate the degree of relatedness of P. tricornutum sequences to sequences from other organisms. In each case, all sequences to be included in a tree were aligned by means of the ClustalX program (Thompson et al., 1997
Translated NRS sequences that displayed similarity to a GenBank protein entry with an ID level above 35% and a coverage of the subject protein of at least 50% were assigned a sound frame (i.e. the protein sequence and function can be predicted with high degrees of confidence). Codon usage tables for P. tricornutum and T. pseudonana were derived from these subsets, amounting to 859 P. tricornutum sequences and 465 T. pseudonana sequences. Base usage at the third codon position and its correlation with redundancy of the sequence in the P. tricornutum EST collection were studied, along with the abundance of the methylation target dinucleotide CpG, as more specifically described in "Results and Discussion."
We are grateful to the sequencing and bioinformatics teams of Avestha Gengraine Technologies, to Nicola Patron and Patrick Keeling for their comments and kind contributions to the analysis of the novel P. tricornutum FBAs, and to Margherita Groeben, Andrew Allen, Angela Falciatore, and Assaf Vardi for their help and suggestions. The T. pseudonana ESTs were a kind gift from Mark Hildebrand and Diego Martinez. Received September 3, 2004; returned for revision November 24, 2004; accepted November 25, 2004.
1 This work was supported in part by the European Union-funded Margenes (grant no. QLRT200101226 to C.B.) and Marine Genomics (grant no. 505403 to C.B.) projects.
[w] The online version of this article contains Web-only data. Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.104.052829. * Corresponding author; e-mail cbowler{at}biologie.ens.fr; fax 33144323935.
Akashi H (2001) Gene expression and molecular evolution. Curr Opin Genet Dev 11: 660666[CrossRef][Web of Science][Medline]
Altschul SF, Madden TL, Schaffer AA, Zhang JH, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 33893402
Andersson SGE, Kurland CG (1990) Codon preferences in free-living microorganisms. Microbiol Rev 54: 198210 Apt KE, Kroth-Pancic PG, Grossman AR (1996) Stable nuclear transformation of the diatom Phaeodactylum tricornutum. Mol Gen Genet 252: 572579[Web of Science][Medline]
Apt KE, Zaslavkaia L, Lippmeier JC, Lang M, Kilian O, Wetherbee R, Grossman AR, Kroth PG (2002) In vivo characterization of diatom multipartite plastid targeting signals. J Cell Sci 115: 40614069
Armbrust EV, Berges JB, Bowler C, Green BR, Martinez D, Putnam NH, Zhou S, Allen AE, Apt KE, Bechner M, et al (2004) The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism. Science 306: 7986
Baldauf SL (2003) The deep roots of eukaryotes. Science 300: 17031706
Bhaya D, Grossman AR (1993) Characterization of gene clusters encoding the fucoxanthin chlorophyll proteins of the diatom Phaeodactylum tricornutum. Nucleic Acids Res 21: 44584466
Bonaldo MF, Lennon G, Soares MB (1996) Normalization and subtraction: two approaches to facilitate gene discovery. Genome Res 6: 791806 Borowitzka MA, Volcani BE (1978) The polymorphic diatom Phaeodactylum tricornutum: ultrastructure of its morphotypes. J Phycol 14: 1021[CrossRef][Web of Science]
Domergue F, Spiekermann P, Lerchl J, Beckmann C, Kilian O, Kroth PG, Boland W, Zähringer U, Heinz E (2003) New insight into Phaeodactylum tricornutum fatty acid metabolism. Cloning and functional characterization of plastidial and microsomal delta12-fatty acid desaturases. Plant Physiol 131: 16481660 Dong HJ, Nilsson L, Kurland CG (1996) Co-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates. J Mol Biol 260: 649663[CrossRef][Web of Science][Medline] Falciatore A, Bowler C (2002) Revealing the molecular secrets of marine diatoms. Annu Rev Plant Biol 53: 109130[CrossRef][Medline] Falciatore A, Casotti R, Leblanc C, Abrescia C, Bowler C (1999) Transformation of nonselectable reporter genes in marine diatoms. Mar Biotechnol (NY) 1: 239251
Falkowski PG, Katz ME, Knoll AH, Quigg A, Raven JA, Schofield O, Taylor FJR (2004) The evolution of modern eukaryotic phytoplankton. Science 305: 354360 Fatland B, Anderson M, Nikolau BJ, Wurtele ES (2000) Molecular biology of cytosolic acetyl-CoA generation. Biochem Soc Trans 28: 593595[Medline]
Fatland BL, Ke JS, Anderson MD, Mentzen WI, Cui LW, Allred CC, Johnston JL, Nikolau BJ, Wurtele ES (2002) Molecular characterization of a heteromeric ATP-citrate lyase that generates cytosolic acetyl-coenzyme A in Arabidopsis. Plant Physiol 130: 740756 Hildebrand M, Dahlin K, Volcani BE (1998) Characterization of a silicon transporter gene family in Cylindrotheca fusiformis: sequences, expression analysis, and identification of homologs in other diatoms. Mol Gen Genet 260: 480486[CrossRef][Web of Science][Medline]
Huang X, Madan A (1999) CAP3: a DNA sequence assembly program. Genome Res 9: 868877 Jabbari K, Bernardi G (2004) Cytosine methylation and CpG, TpG (CpA) and TpA frequencies. Gene 333: 143149[CrossRef][Web of Science][Medline] Jabbari K, Caccio S, de Barros JPP, Desgres J, Bernardi G (1997) Evolutionary changes in CpG and methylation levels in the genome of vertebrates. Gene 205: 109118[CrossRef][Web of Science][Medline] Kanaya S, Yamada Y, Kudo Y, Ikemura T (1999) Studies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis. Gene 238: 143155[CrossRef][Web of Science][Medline] Kaplan A, Reinhold L (1999) CO2 concentrating mechanisms in photosynthetic microorganisms. Annu Rev Plant Physiol Plant Mol Biol 50: 539570[CrossRef][Web of Science] Kim KS, Park SW, Moon YA, Kim YS (1994) Organization of the 5' region of the rat ATP citrate lyase gene. Biochem J 302: 759764 Kooistra WHCF, DeStefano M, Mann DG, Medlin LK (2003) The phylogeny of the diatoms. In WEG Müller, ed, Progress in Molecular and Subcellular Biology, Vol 33. Springer-Verlag, Berlin, pp 6397
Kroger N, Deutzmann R, Sumper M (1999) Polycationic peptides from diatom biosilica that direct silica nanosphere formation. Science 286: 11291132
Lane TW, Morel FMM (2000) A biological function for cadmium in marine diatoms. Proc Natl Acad Sci USA 97: 46274631
Liang F, Holt I, Pertea G, Karamycheva S, Salzberg SL, Quackenbush J (2000) An optimized protocol for analysis of EST sequences. Nucleic Acids Res 28: 36573665
Liaud MF, Lichtle C, Apt K, Martin W, Cerff R (2000) Compartment-specific isoforms of TPI and GAPDH are imported into diatom mitochondria as a fusion protein: evidence in favor of a mitochondrial origin of the eukaryotic glycolytic pathway. Mol Biol Evol 17: 213223
Maheswari U, Montsant A, Goll J, Krishnasamy S, Rajyashri KR, Patell VM, Bowler C (2005) The Diatom EST Database. Nucleic Acids Res 33: D344D347 Marchler-Bauer A, Anderson JB, DeWeese-Scott C, Fedorova ND, Geer LY, He SQ, Hurwitz DI, Jackson JD, Jacobs AR, Lanczycki CJ, et al (2003) CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res 31: D383D387 Matsuzaki M, Misumi O, Shin-I T, Maruyama S, Takahara M, Miyagishima SY, Mori T, Nishida K, Yagisawa F, Yoshida Y, et al (2004) Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature 428: 653657[CrossRef][Medline] Medlin LK, Kooistra W, Gersonde R, Wellbrock U (1996) Evolution of the diatoms (Bacillariophyta). 2. Nuclear-encoded small-subunit rRNA sequence comparisons confirm a paraphyletic origin for the centric diatoms. Mol Biol Evol 13: 6775[Abstract] Medlin LK, Kooistra WCHF, Sam M (2000) A review of the evolution of the diatoms: a total approach using molecules, morphology and geology. In A Witkowski, J Sieminska, eds, The Origin and Early Evolution of the Diatoms: Fossil, Molecular and Biogeographical Approaches. W. Szafer Institute of Botany, Polish Academy of Sciences, Kracow, Poland, pp 1334 Morel FMM, Reinfelder JR, Roberts SB, Chamberlain CP, Lee JG, Yee D (1994) Zinc and carbon co-limitation of marine phytoplankton. Nature 369: 740742[CrossRef] Nelson DM, Treguer P, Brzezinski MA, Leynaert A, Queguiner B (1995) Production and dissolution of biogenic silica in the ocean: revised global estimates, comparison with regional data and relationship to biogenic sedimentation. Global Biogeochem Cycles 9: 359372 Nowrousian M, Kuck U, Loser K, Weltring KM (2000) The fungal acl1 and acl2 genes encode two polypeptides with homology to the N- and C-terminal parts of the animal ATP citrate lyase polypeptide. Curr Genet 37: 189193[CrossRef][Web of Science][Medline]
Patron NJ, Rogers MB, Keeling PJ (2004) Gene replacement of fructose-1,6-bisphosphate aldolase supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot Cell 3: 11691175
Poulsen N, Sumper M, Kroger N (2003) Biosilica formation in diatoms: characterization of native silaffin-2 and its role in silica morphogenesis. Proc Natl Acad Sci USA 100: 1207512080
Rangasamy D, Ratledge C (2000) Compartmentation of ATP: citrate lyase in plants. Plant Physiol 122: 12251230 Ratledge C (2002) Regulation of lipid accumulation in oleaginous micro-organisms. Biochem Soc Trans 30: 10471050[CrossRef][Medline] Raven JA, Waite AM (2004) The evolution of silicification in diatoms: inescapable sinking and sinking as escape? New Phytol 162: 4561[CrossRef] Reinfelder JR, Kraepiel AML, Morel FMM (2000) Unicellular C-4 photosynthesis in a marine diatom. Nature 407: 996999[CrossRef][Medline] Rogers M, Keeling PJ (2004) Lateral transfer and recompartmentalization of Calvin cycle enzymes of plants and algae. J Mol Evol 58: 367375[CrossRef][Web of Science][Medline] Round FE, Crawford RM, Mann DG (1990) The Diatoms: Biology and Morphology of the Genera. Cambridge University Press, London
Satoh D, Hiraoka Y, Colman B, Matsuda Y (2001) Physiological and molecular biological characterization of intracellular carbonic anhydrase from the marine diatom Phaeodactylum tricornutum. Plant Physiol 126: 14591470
Scala S, Carels N, Falciatore A, Chiusano ML, Bowler C (2002) Genome properties of the diatom Phaeodactylum tricornutum. Plant Physiol 129: 9931002
Shrager J, Hauser C, Chang CW, Harris EH, Davies J, McDermott J, Tamse R, Zhang Z, Grossman AR (2003) Chlamydomonas reinhardtii genome project. A guide to the generation and use of the cDNA information. Plant Physiol 131: 401408
Smith KS, Jakubzick C, Whittam TS, Ferry JG (1999) Carbonic anhydrase is an ancient enzyme widespread in prokaryotes. Proc Natl Acad Sci USA 96: 1518415189
Tatusov RL, Galperin MY, Natale DA, Koonin EV (2000) The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28: 3336
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 48764882
Van de Peer Y, Dewachter R (1994) Treecon for Windows: a software package for the construction and drawing of evolutionary trees for the Microsoft Windows environment. Comput Appl Biosci 10: 569570 Veldhuis MJW, Cucci TL, Sieracki ME (1997) Cellular DNA content of marine phytoplankton using two new fluorochromes: taxonomic and ecological implications. J Phycol 33: 527541[CrossRef][Web of Science]
Yoon HS, Hackett JD, Pinto G, Bhattacharya D (2002) The single, ancient origin of chromist plastids. Proc Natl Acad Sci USA 99: 1550715512
Yuan G, Bin JC, McKay DJ, Snyder FF (1999) Cloning and characterization of human guanine deaminase: purification and partial amino acid sequence of the mouse protein. J Biol Chem 274: 81758180 This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY® | THE PLANT CELL | |
|---|---|---|---|