|
|
||||||||
|
Plant Physiology 140:805-817 (2006) © 2006 American Society of Plant Biologists TOPAAS, a Tomato and Potato Assembly Assistance System for Selection and Finishing of Bacterial Artificial Chromosomes1,[W]Centre for Biosystems Genomics, 6700 AB Wageningen, The Netherlands (S.A.P., J.C.v.H., T.H., M.J.v.S.); Department of Bioscience, Cluster Greenomics, Plant Research International, 6708 PB Wageningen, The Netherlands (S.A.P., J.C.v.H., T.H., M.J.v.S., M.H.C.A.-H., R.M.K.-L.); and Keygene N.V., 6700 AE Wageningen, The Netherlands (T.P.J., D.W., K.J.)
We have developed the software package Tomato and Potato Assembly Assistance System (TOPAAS), which automates the assembly and scaffolding of contig sequences for low-coverage sequencing projects. The order of contigs predicted by TOPAAS is based on read pair information; alignments between genomic, expressed sequence tags, and bacterial artificial chromosome (BAC) end sequences; and annotated genes. The contig scaffold is used by TOPAAS for automated design of nonredundant sequence gap-flanking PCR primers. We show that TOPAAS builds reliable scaffolds for tomato (Solanum lycopersicum) and potato (Solanum tuberosum) BAC contigs that were assembled from shotgun sequences covering the target at 6- to 8-fold coverage. More than 90% of the gaps are closed by sequence PCR, based on the predicted ordering information. TOPAAS also assists the selection of large genomic insert clones from BAC libraries for walking. For this, tomato BACs are screened by automated BLAST analysis and in parallel, high-density nonselective amplified fragment length polymorphism fingerprinting is used for constructing a high-resolution BAC physical map. BLAST and amplified fragment length polymorphism analysis are then used together to determine the precise overlap. Assembly onto the seed BAC consensus confirms the BACs are properly selected for having an extremely short overlap and largest extending insert. This method will be particularly applicable where related or syntenic genomes are sequenced, as shown here for the Solanaceae, and potentially useful for the monocots Brassicaceae and Leguminosea.
1 This work was supported by the research program of the Centre of BioSystems Genomics, which is part of the Netherlands Genomics Initiative/Netherlands Organization for Scientific Research. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Sander A. Peters (sander.peters{at}wur.nl). [W] The online version of this article contains Web-only data. www.plantphysiol.org/cgi/doi/10.1104/pp.105.071464. * Corresponding author; e-mail sander.peters{at}wur.nl; fax 31317418094. Received September 13, 2005; returned for revision December 16, 2005; accepted January 6, 2006. This article has been cited by other articles:
|
|||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY | THE PLANT CELL | |
|---|---|---|---|