|
|
||||||||
|
Plant Physiology 134:927-939 (2004) © 2004 American Society of Plant Biologists Genome-Wide Identification of Arabidopsis Coiled-Coil Proteins and Establishment of the ARABI-COIL Database1Department of Plant Biology and Plant Biotechnology Center, Ohio State University, 1060 Carmack Road, Columbus, Ohio 43210 (A.R., I.M.); and Ohio Supercomputer Center, 1224 Kinnear Road, Columbus, Ohio 43212 (S.M., S.J.S., M.A.M., E.A.S.)
Increasing evidence demonstrates the importance of long coiled-coil proteins for the spatial organization of cellular processes. Although several protein classes with long coiled-coil domains have been studied in animals and yeast, our knowledge about plant long coiled-coil proteins is very limited. The repeat nature of the coiled-coil sequence motif often prevents the simple identification of homologs of animal coiled-coil proteins by generic sequence similarity searches. As a consequence, counterparts of many animal proteins with long coiled-coil domains, like lamins, golgins, or microtubule organization center components, have not been identified yet in plants. Here, all Arabidopsis proteins predicted to contain long stretches of coiled-coil domains were identified by applying the algorithm MultiCoil to a genome-wide screen. A searchable protein database, ARABI-COIL (http://www.coiled-coil.org/arabidopsis), was established that integrates information on number, size, and position of predicted coiled-coil domains with subcellular localization signals, transmembrane domains, and available functional annotations. ARABI-COIL serves as a tool to sort and browse Arabidopsis long coiled-coil proteins to facilitate the identification and selection of candidate proteins of potential interest for specific research areas. Using the database, candidate proteins were identified for Arabidopsis membrane-bound, nuclear, and organellar long coiled-coil proteins.
The coiled-coil protein oligomerization motif consists of two or more amphipathic alpha helices that twist around each other in a supercoil (Burkhard et al., 2001
It has been estimated that approximately 10% of all proteins of an organism contain a coiled-coil motif (Liu and Rost, 2001
Some large coiled-coil proteins oligomerize into filaments or networks and have themselves a structural role. One of the three main classes of cytoskeletal proteins, the intermediate filament proteins, represents a well-characterized group of coiled-coil proteins (Strelkov et al., 2003
In the past few years, the number of investigated long coiled-coil proteins from animals and yeast has rapidly grown. They include proteins involved in nuclear organization, such as lamins (Goldman et al., 2002
Long coiled-coil proteins play a role in microtubule nucleation and spindle organization during cell division. For example, coiled-coil proteins are involved in the architecture of the spindle pole body, the nuclear envelope-embedded microtubule organization center in yeast (Saccharomyces cerevisiae). They are required for insertion of the spindle pole body into the nuclear envelope (Schramm et al., 2000
In the cytoplasm, long coiled-coil proteins are involved in the organization of and targeting to membrane systems. The golgin family comprises a group of coiled-coil peripheral or integral membrane proteins associated with the Golgi apparatus. They have been shown to function in a variety of membrane-membrane and membrane-cytoskeleton tethering events at the Golgi and are regulated by small GTPases of the Rab and Arl families (Barr and Short, 2003
These examples serve to illustrate the emerging function of long coiled-coil proteins as anchors for the regulation of protein positioning in the cell, thus both separating and coordinating signaling pathways in a temporal and spatial manner and organizing cellular processes like cell division. In contrast to animals and yeast, only a handful of long coiled-coil proteins have been studied in plants. Besides the large families of myosins and kinesins (Reddy and Day, 2001a In BLAST searches of the whole Arabidopsis genome for all animal and yeast proteins discussed above, significant homologies can only be found for the protein families of the SMC proteins and myosins, with E values typically below e-100, kinesins with E values in the e-50 to e-100 range, and for the nuclear pore complex protein Tpr (5e-78). In all other cases, the best hits for functionally very different proteins are the same three proteins from the Arabidopsis genome, indicating the difficulty in using sequence similarity algorithms to identify functional homologs of long coiled-coil proteins. The multiple heptad repeats in long coiled-coil domains cause a low and promiscuous sequence similarity between long coiled-coil proteins, which leads to meaningless results. This clearly demonstrates the need to use other methods than sequence comparison for the identification of plant long coiled-coil proteins potentially involved in the diverse cellular functions discussed above.
Although the heptad repeat pattern causes false hits in sequence similarity searches, it can be easily exploited by computational methods to predict coiled-coil domains in amino acid sequences (Parry, 1982 We report here the identification of all long coiled-coil proteins from Arabidopsis and the establishment of a novel searchable database, ARABI-COIL (http://www.coiled-coil.org/arabidopsis). In the future, as more fully annotated plant genomes such as rice (Oryza sativa) and C. reinhardtii become available, our analysis pipeline will be applied to these species as well, and the data will be added to the database.
Genome-Wide Screen for Coiled-Coil Proteins
Arabidopsis long coiled-coil proteins were identified using the algorithm MultiCoil (Wolf et al., 1997
To focus on proteins potentially involved in structural aspects of the cells and to exclude shorter coiled-coil domains like Leucine zippers, the output from the original MultiCoil run was further processed and filtered. A software package (ExtractProp Suite, see "Materials and Methods") was developed to automate the processing of data and selection of sequences. In this process, small gaps shorter than 25 amino acids between predicted coiled-coil domains were ignored and the domains treated as a single, larger coiled-coil (Fig. 1D). The relative consistency of the prediction between Arabidopsis and animal sequences was tested by comparing family members of the conserved SMC proteins. SMC proteins typically contain two clusters of coiled-coil domains separated by a central linker domain. Figure 1E shows that this domain distribution was observed for human SMC2 and its two Arabidopsis homologs.
Because a high-stringency algorithm like MultiCoil often predicts long stretches of coiled-coil domains with significant intradomain gaps (as shown in Fig. 1), a filter was introduced to include only proteins with at least one coiled-coil domain of at least 70 amino acids, two domains and a minimal domain length of 50 amino acids, or three domains and a minimal domain length of 30 amino acids in the final data set. This strategy isolated 286 sequences with long or multiple coiled-coil domains while excluding 97% of the known Arabidopsis bZIP proteins (Jakoby et al., 2002
The coiled-coil property information presented and searchable in ARABI-COIL is summarized for a single protein example in Table III. It includes the predicted number of coiled-coil domains, length of the largest coiled-coil domain, percentages of the total sequence and the N-terminal, middle, and C-terminal one-third of the sequence predicted to be in a coiled-coil, and the highest prediction score over the whole sequence. The ARABI-COIL database search form allows for searches limited to a certain length of protein and/or coiled-coil domain and percentage coverage over the whole length and/or the N-terminal, middle, and C-terminal one-third of the sequence. A second output table summarizes the detailed positions of all predicted coiled-coil domains and the length of the longest intradomain gap for each given domain (Table IV). A graphical representation of the predicted coiled-coil structures was included (Fig. 1D). Links to National Center for Biotechnology Information (NCBI) GenBank sequence entries are provided in ARABI-COIL to retrieve the underlying sequence information for each database entry.
Only 10% of the 286 proteins in ARABI-COIL have been characterized so far by experimental data, with about one-half of these falling into the categories kinesin or myosin motors or SMC proteins. For a preliminary estimate of protein functions, annotations were assigned manually. They are based on available publications (refs. linked to PubMed are available in ARABI-COIL), annotations in NCBI RefSeq (http://www.ncbi.nlm.nih.gov/RefSeq/), The Arabidopsis Information Resource (http://www.arabidopsis.org/), The Institute for Genomic Research (http://www.tigr.org/tdb/e2k1/ath1/), and the Munich Information Center for Protein Sequences (http://mips.gsf.de/proj/thal/db/), and conserved domains outside of the coiled-coil domain. The ARABI-COIL database can be searched for keywords within these annotations. Figure 2 summarizes the functional annotations of the proteins in ARABI-COIL and shows that two main fractions of the annotated proteins are involved in either cytoskeletal or nuclear functions. The putative function of 66% of the sequences in ARABI-COIL remains unknown. The percent of uncharacterized ORFs increases with the percentage coverage from 60% unknown proteins with less than 50% coiled-coil to 86% unknown proteins with 50% or more coiled-coil coverage. Seventy-five percent of the proteins with unknown function matched known expressed sequence tags and were annotated as "expressed proteins." The remaining proteins without expressed sequence tag data were annotated as "hypothetical proteins." Table V lists all Arabidopsis proteins of at least 500 amino acids in length with a predicted coiled-coil coverage of more than 25% for which published data are presently available.
The Arabidopsis genome appears to encode only one protein with a continuous coiled-coil domain of more than 1,000 amino acids. This protein, CIP1, has been characterized as a component of the cytoskeleton and functions as a binding site for the photomorphogenesis regulator COP1 (Matsui et al., 1995
In addition to coiled-coil domain prediction, transmembrane domain prediction data from several programs (see Table VI) were incorporated in the database, including the number of predicted transmembrane domains in the ARAMEMNON database (http://aramemnon.botanik.uni-koeln.de; Schwacke et al., 2003
The ARABI-COIL sequence set was further analyzed using a battery of programs to predict putative subcellular targeting of the proteins (Table VI). Two (NLSs) or three (N-terminal targeting signals) prediction scores were included in the database for each targeting signal. The ARABI-COIL search options allow limiting searches to coiled-coil proteins with a certain predicted localization in addition to transmembrane prediction and selected coiled-coil features. The results returned include all proteins with at least one program resulting in a prediction for that location above a probability cutoff of 0.5. The reliability of the prediction scores is color-coded for easier reference on the online result details page by using yellow for lower probability (0.50-0.74) and red for higher probability (0.75-1.00). Table VII shows an example for the detailed prediction output, which also illustrates how predicting the localization of individual proteins can be ambiguous.
To summarize the predicted targeting for all proteins, the cross-program average of the scores for each type of targeting signal were computed and probability values of 0.5 and higher counted as positive. Figure 4 shows the computationally predicted distribution of the ARABI-COIL proteins in the cell using this method. Only proteins with an entry in ARAMEMNON were counted as transmembrane proteins for this analysis. The result shows that proteins with high coiled-coil coverage are predicted to be present in all compartments of the plant cell for which targeting signals were predicted computationally.
About 10% of the annotations in ARABI-COIL suggest a nuclear function, and Figure 4 illustrates that 16% of the proteins in ARABI-COIL are predicted to be nuclear. The ARABI-COIL search functions were used to single out putative nuclear proteins of more than 500 amino acids in length with coiled-coil coverages above 25%. The resulting group of 37 proteins was manually checked for consistency of the predictions as described for Figure 4 to exclude proteins with only weak nuclear prediction or with ambiguous predictions ("unclear" in Fig. 4). The domain structures of the remaining 19 putative nuclear long coiled-coil proteins are summarized in Figure 5. The proteins with the highest predicted coiled-coil coverage are functionally uncharacterized so far. Three of the four Arabidopsis homologs of the carrot nuclear matrix protein NMCP1 (Masuda et al., 1997
Searching ARABI-COIL for proteins with N-terminal targeting signals such as mitochondrial or plastid targeting or secretory signal peptides, 52 proteins matching the criteria used for Table V and Figure 3 were identified. Twenty-seven were predicted by at least one method to target to the chloroplasts, 23 to the mitochondria, and two to the secretory pathway. Disregarding proteins with cross-program average scores below the cutoff or strong ambiguous predictions ("unclear" in Fig. 4), the remaining proteins with clear targeting predictions are summarized in Figure 6. Of the eight proteins predicted to target to plastids, only the localization of AtMFP1 has been characterized experimentally (Jeong et al., 2003
Of the proteins longer than 500 amino acids with at least 25% coiled-coil coverage, 29 fall into the group defined as cytoplasmic in Figure 4. These proteins are summarized in Figure 7. The cytoskeletal protein CIP1 (Matsui et al., 1995
Increasing experimental evidence demonstrates the importance of long coiled-coil proteins for the spatial organization of cellular processes. Although several protein classes with long coiled-coil domains have been studied in animals and yeast, our knowledge about plant long coiled-coil proteins is very limited. The repeat nature of the coiled-coil sequence motif makes it almost impossible to identify homologs of animal coiled-coil proteins without highly conserved non-coiled-coil domains. As a consequence, counterparts of many animal proteins with long coiled-coil domains, like lamins, golgins, or microtubule organization center components, have not been identified yet in plants. The ARABI-COIL database was created to provide the research community with a tool to sort and browse Arabidopsis long coiled-coil proteins to facilitate the identification and selection of candidate proteins of potential interest for specific research areas.
To predict coiled-coil structures based on amino acid sequence, several programs with differing performance rates are available. COILS and NEWCOILS (Lupas et al., 1991
In a genome-wide screen using the MultiCoil program, 5.6% of all Arabidopsis sequences (about 1,500) were identified as coiled-coil proteins. This number is lower than those found for other eukaryotic genomes (about 10%; Liu and Rost, 2001
The goal of the ARABI-COIL database creation was to provide a searchable selection of proteins with high coiled-coil coverage and long coiled-coil domains putatively involved in structural functions. Many long coiled-coil domains, for example that of AtMFP1 (Fig. 1), contain small gaps and disruptions in the overall coiled-coil structure predicted by MultiCoil. To identify the complete length of the long but discontinuous coiled-coil domains of such proteins, a feature was included to ignore small gaps of less than 25 amino acids between predicted coiled-coil structures, thus fusing the predictions for these domains to a single larger coiled-coil as exemplified in Figure 1D. Subsequently, a subset of proteins containing long coiled-coil regions was selected while trying to exclude shorter coiled-coil motifs such as Leucine zippers. The criteria chosen succeeded in excluding 97% of the known Arabidopsis bZIPs (Jakoby et al., 2002
The search features provided to browse the database allow users to select for proteins of a certain coiled-coil length and coverage. By providing coiled-coil percentages predicted for the N-terminal, middle, and C-terminal domains of the protein, the database allows for a crude search for coiled-coil domain configurations. This facilitates the identification of proteins with similar coiled-coil domain structures without detectable sequence homology. The incorporation of transmembrane and targeting signal prediction data allows the user to specify searches for putative chloroplast, mitochondria, secretory pathway, nuclear, and transmembrane proteins. This helps to identify subsets of coiled-coil proteins predicted to localize to a certain cell compartment that are of enhanced interest for further functional studies.
However, the comparison of localization prediction results from different programs and with available experimental data shows that computationally retrieved targeting predictions are ambiguous (Table VII; also see Emanuelsson and von Heijne, 2001
Future enhancements of the ARABI-COIL database and Web site will include the incorporation of additional prediction data and adding the capability of BLAST searches against the sequences populating the database. As more fully annotated plant genomes become available, the ARABI-COIL database will serve as a template for the addition of other genomes, enabling comparative analyses between different plant species. Flexibility and expandability were fundamental criteria for the underlying MySQL database and schema. The ability to add results from additional programs and sources is key to the successful viability of the database over the long term. Essentially, ARABI-COIL is a warehouse of annotated and computed information, with relatively few update transactions relative to the number of queries. For increased availability to the scientific community, the ARABI-COIL data will be made accessible through existing data mining and distribution tools, such as for, example, The Arabidopsis Information Resource (Rhee et al., 2003
The ARABI-COIL database was used to select groups of candidate proteins of at least 500 amino acids in length and more than 25% coiled-coil coverage in combination with other features that could be of potential interest for future research. The length cutoff for this analysis was chosen based on the lengths of animal and yeast coiled-coil proteins with known structural functions in the cell that range from about 600 amino acids (for example, lamin A/C, golgin-67) to more than 3,000 (for example, giantin).
Several long coiled-coil proteins of unknown function with transmembrane domains at the C terminus were identified (Fig. 3). This domain structure is characteristic of a subgroup of animal golgins including golgin-84, golgin-67, giantin, and CASP (Bascom et al., 1999
Another group of potentially interesting proteins is comprised of nuclear long coiled-coil proteins of unknown function (Fig. 5). In animal cells, intermediate filament proteins such as the lamins and NuMA play an important role in the structural organization of the nuclear matrix and the lamina underlying the inner surface of the nuclear envelope. Early immunocyto-logical evidence pointed at the possible existence of similar proteins in plant cells (McNulty and Saunders, 1992
Sequence Sources The Arabidopsis proteome sequence set (all nonredundant SWISS-PROT and TrEMBL entries) was downloaded from the European Bioinformatics Institute proteome analysis database (http://www.ebi.ac.uk/proteome/ARATH/). The initial set of 26,945 sequences at the time of download (June 2002) was updated to reflect the NCBI RefSeq database (http://www.ncbi.nlm.nih.gov/RefSeq/) sequences.
The MultiCoil program version suitable to run on Silicon Graphics systems was downloaded from http://theory.lcs.mit.edu/multicoil and installed on a 32-processor SGI Origin 2000 system. Sequence files in FASTA format were transferred to the SGI system and processed through the locally installed MultiCoil program using the default settings of the program (cutoff score of 0.5, window size 28). A Java-based program suite, ExtractProp, was developed to post-process and extract relevant computed properties from the aggregate computed MultiCoil program output. (The ExtractProp Suite continues to be enhanced and is available upon request.) Gaps of less than 25 amino acids between predicted coiled-coil domains were ignored and the domains fused. The minimum domain length was defined as 20 amino acids, and predicted coiled-coils shorter than 20 residues were disregarded. Proteins having domain numbers and maximum domain length values of at least one of 70, two of 50, or three of 30 were selected to populate the ARABI-COIL database. The sequences for these selected proteins were extracted and summarized in FASTA format for further analyses. XML was selected as the medium for representing the extracted data. Coiled-coil information for inclusion in the database was extracted from this output, such as lengths and positions of coiled-coil regions, and percentages of amino acids were predicted to form a coiled-coil for the complete sequences and the N-terminal, middle, and C-terminal thirds of the sequence.
Sequences were analyzed using a battery of structural and subcellular targeting signal prediction programs (see Table VI). Predotar, MitoProt, and HMMTOP were installed and integrated into the existing basic bioinformatics research environment. A Sun Grid Engine Portal was used to provide Web-based submission of the analysis tasks for these programs with the ExtractProp suite employed to recover the desired properties from the computed output. The remaining programs were applied through their respective Web sites, and the data were compiled into delimited text tables and subsequently processed by the ExtractProp suite for conversion to XML and incorporation in the underlying MySQL database. Hits in the Predict-NLS database were given a score of 1, and no hits were counted as 0. ChloroP scores (0.4-0.6 range in raw output) were normalized to a 0 to 1 scale to match the range for the remaining prediction scores.
MySQL was selected as a database engine to support the Web site. For maximum flexibility and expandability, a denormalized table definition was adopted. The computed output previously translated to XML was converted subsequently to SQL and used to populate the MySQL database. The population of the database is staged, enabling updates, additions, deletions, and minor edits to be done with a high level of automation. The database and its Web interface are hosted on servers maintained by the Ohio Supercomputer Center.
We thank the Ohio Supercomputer Center for providing computer usage time for this analysis and Heather Wang and Tszyeung Ching for collection of PSORT data for input in the database. Received November 5, 2003; returned for revision December 7, 2003; accepted December 19, 2003.
http://www.plantphysiol.org/cgi/doi/10.1104/pp.103.035626.
1 This work was supported by the National Science Foundation 2010 Project (grant no. NSF 0209339 to I.M.). * Corresponding author; e-mail meier.56{at}osu.edu; fax 614-292-5379.
Barr FA, Short B (2003) Golgins in the structure and dynamics of the golgi apparatus. Curr Opin Cell Biol 15: 405-413[CrossRef][ISI][Medline]
Bascom RA, Srinivasan S, Nussbaum RL (1999) Identification and characterization of golgin-84, a novel golgi integral membrane protein with a cytoplasmic coiled-coil domain. J Biol Chem 274: 2953-2962 Berger B, Singh M (1997) An iterative method for improved protein structural motif recognition. J Comput Biol 4: 261-273[Medline]
Berger B, Wilson DB, Wolf E, Tonchev T, Milla M, Kim PS (1995) Predicting coiled coils by use of pairwise residue correlations. Proc Natl Acad Sci USA 92: 8259-8263 Burkhard P, Stetefeld J, Strelkov SV (2001) Coiled coils: a highly versatile protein folding motif. Trends Cell Biol 11: 82-88[CrossRef][ISI][Medline]
Chen XP, Yin H, Huffaker TC (1998) The yeast spindle pole body component Spc72p interacts with Stu1p and is required for proper microtubule assembly. J Cell Biol 141: 1169-1179 Claros MG, Vincens P (1996) Computational method to predict mitochondrially imported proteins and their targeting sequences. Eur J Biochem 241: 779-786[ISI][Medline]
Cleary SP, Tan FC, Nakrieko KA, Thompson SJ, Mullineaux PM, Creissen GP, von Stedingk E, Glaser E, Smith AG, Robinson C (2002) Isolated plant mitochondria import chloroplast precursor proteins in vitro with the same efficiency as chloroplasts. J Biol Chem 277: 5562-5569 Cokol M, Nair R, Rost B (2000) Finding nuclear localization signals. EMBO Rep 1: 411-415[CrossRef][ISI][Medline]
Compton DA, Szilak I, Cleveland DW (1992) Primary structure of NuMA, an intranuclear protein that defines a novel pathway for segregation of proteins at mitosis. J Cell Biol 116: 1395-1408 Crick FH (1952) Is alpha-keratin a coiled coil? Nature 170: 882-883[Medline] Diviani D, Scott JD (2001) AKAP signaling complexes at the cytoskeleton. J Cell Sci 114: 1431-1437[Abstract] Emanuelsson O, Nielsen H, Brunak S, von Heijne G (2000) Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol 300: 1005-1016[CrossRef][ISI][Medline] Emanuelsson O, Nielsen H, von Heijne G (1999) ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites. Protein Sci 8: 978-984[Abstract] Emanuelsson O, von Heijne G (2001) Prediction of organellar targeting signals. Biochim Biophys Acta 1541: 114-119[Medline] Esteban MR, Giovinazzo G, de la Hera A, Goday C (1998) PUMA1: a novel protein that associates with the centrosomes, spindle and centromeres in the nematode Parascaris. J Cell Sci 111: 723-735[Abstract]
Fava F, Raynaud-Messina B, Leung-Tack J, Mazzolini L, Li M, Guillemot JC, Cachot D, Tollon Y, Ferrara P, Wright M (1999) Human 76p: a new member of the
Flory MR, Moser MJ, Monnat RJ Jr, Davis TN (2000) Identification of a human centrosomal calmodulin-binding protein that shares homology with pericentrin. Proc Natl Acad Sci USA 97: 5919-5923 Fukagawa T, Mikami Y, Nishihashi A, Regnier V, Haraguchi T, Hiraoka Y, Sugata N, Todokoro K, Brown W, Ikemura T (2001) CENP-H, a constitutive centromere component, is required for centromere targeting of CENP-C in vertebrate cells. EMBO J 20: 4603-4617[CrossRef][ISI][Medline]
Gillingham AK, Pfeifer AC, Munro S (2002) CASP, the alternatively spliced product of the gene encoding the CCAAT-displacement protein transcription factor, is a golgi membrane protein related to giantin. Mol Biol Cell 13: 3761-3774 Gindullis F, Rose A, Patel S, Meier I (2002) Four signature motifs define the first class of structurally related large coiled-coil proteins in plants. BMC Genomics 3: 9[CrossRef][Medline]
Goggin DE, Lipscombe R, Fedorova E, Millar AH, Mann A, Atkins CA, Smith PMC (2003) Dual intracelluar localization and targeting of aminoimidazole ribonucleotide synthetase in cowpea. Plant Physiol 131: 1033-1041
Goldman RD, Gruenbaum Y, Moir RD, Shumaker DK, Spann TP (2002) Nuclear lamins: building blocks of nuclear architecture. Genes Dev 16: 533-547 Hanin M, Mengiste T, Bogucki A, Paszkowski J (2000) Elevated levels of intrachromosomal homologous recombination in Arabidopsis overexpressing the MIM gene. Plant J 24: 183-189[CrossRef][ISI][Medline]
Harder P, Silverstein R, Meier I (2000) Conservation of matrix attachment region-binding filament-like protein 1 among higher plants. Plant Physiol 122: 225-234
Heim MA, Jacoby M, Werber M, Martin C, Weisshaar B, Bailey PC (2003) The basic helix-loop-helix transcription factor family in plants: a genome wide study of protein structure and functional diversity. Mol Biol Evol 20: 735-747
Hirano T (2002) The ABCs of SMC proteins: two-armed ATPases for chromosome condensation, cohesion, and repair. Genes Dev 16: 399-414 Holaska JM, Wilson KL, Mansharamani M (2002) The nuclear envelope, lamins and nuclear assembly. Curr Opin Cell Biol 14: 257-364[CrossRef]
Huang J, Hack E, Thornburg RW, Meyers AM (1990) A yeast mitochondrial leader peptide functions in vivo as a dual targeting signal for both chloroplast and mitochondria. Plant Cell 2: 1249-1260 Jakoby M, Weisshaar B, Droge-Laser W, Vincente-Carbajosa J, Tiedemann J, Kroj T, Parcy F (2002) bZIP transcription factors in Arabidopsis. Trends Plant Sci 7: 106-111[CrossRef][ISI][Medline]
Jakymiw A, Raharjo E, Rattner JB, Eystathioy T, Chan EKL, Fujita DJ (2000) Identification and characterization of a novel golgi protein, golgin-67. J Biol Chem 275: 4137-4144
Jeong SY, Rose A, Meier I (2003) MFP1 is a thylakoid-targeted, nucleoid-binding protein with a coiled-coil structure. Nucleic Acids Res 31: 5175-5185 Jessberger R (2002) The many functions of SMC proteins in chromosome dynamics. Nat Rev Mol Cell Biol 3: 767-778[CrossRef][ISI][Medline]
Kilmartin JV, Dyos SL, Kershaw D, Finch JT (1993) A spacer protein in the Saccharomyces cerevisiae spindle pole body whose transcript is cell cycle-regulated. J Cell Biol 123: 1175-1184 Krogh A, Larsson B, von Heijne G, Sonnhammer ELL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305: 567-580[CrossRef][ISI][Medline] Le Masson I, Saveanu C, Chevalier A, Namane A, Gobin R, Fromont-Racine M, Jacquier A, Mann C (2002) Spc24 interacts with Mps2 and is required for chromosome segregation, but is not implicated in spindle pole body duplication. Mol Microbiol 43: 1431-1443[CrossRef][Medline] Li Q, Hansen D, Killilea A, Joshi HC, Palazzo RE, Balczon R (2000) Kendrin/pericentrin-B, a centrosome protein with homology to pericentrin that complexes with PCM-1. J Cell Sci 114: 797-809
Liao H, Winkfein RJ, Mack G, Rattner JB, Yen TJ (1995) CENP-F is a protein of the nuclear matrix that assembles onto kinetochores at late G2 and is rapidly degraded after mitosis. J Cell Biol 130: 507-518 Litowski JR, Hodges RS (2001) Designing heterodimeric two-stranded alpha-helical coiled-coils: the effect of chain length on protein folding, stability and specificity. J Pept Res 58: 477-492[CrossRef][Medline] Liu CM, McElver J, Tzafrir R, Joosen R, Wittich P, Patton D, van Lammeren AA, Meinke D (2002) Condensin and cohesin knockouts in Arabidopsis exhibit a titan seed phenotype. Plant J 29: 405-415[CrossRef][ISI][Medline]
Liu J, Rost B (2001) Comparing function and structure between entire genomes. Protein Sci 10: 1970-1979
Lorson MA, Horvitz HR, von den Heuvel S (2000) LIN-5 is a novel component of the spindle apparatus required for chromosome segregation and cleavage plane specification in Caenorhabditis elegans. J Cell Biol 148: 73-86 Lumb KJ, Carr CM, Kim PS (1994) Subdomain folding of the coiled coil leucine zipper from the bZIP transcriptional activator GCN4. Biochemistry 33: 7361-7367[CrossRef][Medline] Lupas A (1997) Predicting coiled-coil regions in proteins. Curr Opin Struct Biol 7: 388-393[CrossRef][ISI][Medline] Lupas A, Van Dyke M, Stock J (1991) Predicting coiled coils from protein sequences. Science 252: 1162-1164[CrossRef][ISI][Medline] Marcus AI, Ambrose JC, Blickley L, Hancock WO, Cyr RJ (2002) Arabidopsis thaliana protein, ATK1, is a minus-end directed kinesin that exhibits non-processive movement. Cell Motil Cytoskelet 52: 144-150[CrossRef][ISI][Medline]
Marcus AI, Li W, Ma H, Cyr RJ (2003) A kinesin mutant with an atypical bipolar spindle undergoes normal mitosis. Mol Biol Cell 14: 1717-1726 Masuda K, Xu ZJ, Takahashi S, Ito A, Ono M, Nomura K, Inoue M (1997) Peripheral framework of carrot cell nucleus contains a novel protein predicted to exhibit a long alpha-helical domain. Exp Cell Res 232: 173-181[CrossRef][ISI][Medline]
Matsui M, Stoop CD, von Arnim AG, Wei N, Deng XW (1995) Arabidopsis COP1 protein specifically interacts in vitro with a cytoskeleton-associated protein, CIP1. Proc Natl Acad Sci USA 92: 4239-4243 McNulty AK, Saunders MJ (1992) Purification and immunological detection of pea nuclear intermediate filaments: evidence for plant nuclear lamins. J Cell Sci 103: 407-414[Abstract] Mengiste T, Revenkova E, Bechtold N, Paszkowski J (1999) An SMC-like protein is required for efficient homologous recombination in Arabidopsis. EMBO J 18: 4505-4512[CrossRef] |