Bastien Chevreux; Thomas Pfisterer; Bernd Drescher; Albert J. Driesel; Werner E.G. Müller; Thomas Wetter; Sándor Suhai

Using the miraEST Assembler for Reliable and Automated mRNA Transcript Assembly and SNP Detection in Sequenced ESTs

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 1

Example of a misassembled transcript when SNPs are disregarded. Assembly of three input sequences are shown at left; the resulting transcripts of this assembly are shown at right. The three sequences s₁, s_1^*, and s₂ contain different homologous parts, represented by the different shades of gray, and exactly one SNP position. A normal assembly algorithm will assemble first s₁, then s₂ (because of the long overlapping alignment in the white part), and then might try to align s_1^*, but fail because of the large mismatch. The SNP position with G in sequence s₁ and A in s₂ is treated as typical noise in the alignment algorithms and ignored. The resulting transcript sequences are therefore wrong, as they do not represent the sequences found in vivo: t₁ is a mix of two transcripts and does not code a true protein.

This Article

Published in Advance May 12, 2004, doi: 10.1101/gr.1917404 Genome Res. 2004. 14: 1147-1159

Using the miraEST Assembler for Reliable and Automated mRNA Transcript Assembly and SNP Detection in Sequenced ESTs

This Article

Preprint Server

Current Issue

In This Issue