Table I. Massively parallel signature sequencing allows large-scale assembly of transcripts in both C. spinosa and C. gynandra after comparison with the TAIR 8 Arabidopsis database

One GS FLX sequencing run allowed significant generation of sequence for both species, and the vast majority of these could be used to assemble contigs and then matched to Arabidopsis genes.

DataC. spinosaC. gynandra
Raw reads313,807402,674
Raw nucleotides70,564,59291,851,136
Raw mean length225228
Clean reads284,318368,333
Clean nucleotides65,525,13985,681,233
Clean mean length230232
Contigs17,65518,992
Total length (nucleotides)7,746,8949,062,043
Total reads245,324319,732
Percent assembled86.386.8