Assembly V4.0: The A. mexicanum transcriptome has been deep sequenced (15,000,000 Roche 454 short sequence reads and 50K Sanger reads) and assembled. The resulting V4.0 assembly can now be BLAST searched here. Approximately 89.9% of the 14 million high-quality sequences were assembled into contigs with a total length of 542 Mb.
These contigs were organized into 921,990 isogroups, representing a total of 1,057,173 isotigs. (Click here to learn more about isotigs.) We have complete (~7K) or incomplete (~10K) protein-coding sequence models for ~17K human RefSeq proteins. We have 3K additional significant BLAST hits to non-human protein-coding models. For the tissues we have sampled (brain, limb, blood, etc.), we believe we have significant hits to >95% of the transcriptome.
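For readers who want to sanity-check the V4.0 figures quoted above, the following is a minimal Python sketch of the arithmetic. The variable and function names are illustrative assumptions, not part of any site tooling, and the inputs are the rounded values given in the text, so the results are approximate.

```python
# Rounded V4.0 figures quoted in the text above (assumed as inputs).
HIGH_QUALITY_READS = 14_000_000   # "14 million high-quality sequences"
ASSEMBLED_PER_MILLE = 899         # 89.9% expressed as parts per thousand
ISOGROUPS = 921_990
ISOTIGS = 1_057_173
COMPLETE_MODELS = 7_000           # "~7K" complete models
INCOMPLETE_MODELS = 10_000        # "~10K" incomplete models


def assembled_reads() -> int:
    """Approximate number of reads placed into contigs."""
    # Integer arithmetic avoids floating-point rounding surprises.
    return HIGH_QUALITY_READS * ASSEMBLED_PER_MILLE // 1000


def isotigs_per_isogroup() -> float:
    """Average number of isotigs in each isogroup."""
    return ISOTIGS / ISOGROUPS


def human_refseq_models() -> int:
    """Complete plus incomplete protein-coding sequence models."""
    return COMPLETE_MODELS + INCOMPLETE_MODELS


if __name__ == "__main__":
    print(f"Assembled reads:       {assembled_reads():,}")
    print(f"Isotigs per isogroup:  {isotigs_per_isogroup():.2f}")
    print(f"Human RefSeq models:   {human_refseq_models():,}")
```

On these inputs, roughly 12.6 million reads were assembled, each isogroup holds a little over one isotig on average, and the complete and incomplete models sum to the ~17K human RefSeq proteins stated.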
Search for Contigs in the Latest V4.0 assembly:
Approximately 74% of the 3.3 million high-quality sequences were assembled into contigs with a total length of 71 Mb. The final unique set of contigs and singletons from the assembly comprised 915,442 sequences. We have complete (~3K) or incomplete (~12K) protein-coding sequence models for ~15K human RefSeq proteins.