Comparing low coverage random shotgun sequence data from Brassica oleracea and Oryza sativa genome sequence for their ability to add to the annotation of Arabidopsis thaliana

Figure 1.

BLASTN, TBLASTX, and BLASTX were used to align 595,321 Brassica oleracea reads against databases of Arabidopsis thaliana chloroplast and mitochondrial DNA, known repeats and transposable elements, and finally the Arabidopsis thaliana genome and protein sequences. The top BLAST hit was used to classify each Brassica oleracea read. A read had to match with an e-value < 1e-10 to be considered a significant match. The protein database consists of translations of all predicted protein-coding genes in the genome annotation. Therefore, the 7% of the Brassica reads that match only the genome represent either noncoding RNA or genes that have not yet been annotated.
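
The classification logic described in the caption can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: it assumes that each read's top-hit e-value per database has already been parsed from BLAST output, and that the databases are consulted in the order listed in the caption. The function names, database keys, and category labels are hypothetical.

```python
from typing import Dict, Optional

# Significance threshold stated in the caption (strictly less than 1e-10).
E_VALUE_CUTOFF = 1e-10

# Databases screened first, in the order the caption lists them.
# The key names are hypothetical labels chosen for this sketch.
SCREEN_ORDER = ["organelle", "repeat"]


def is_significant(e_value: Optional[float]) -> bool:
    """Return True if a top hit exists and passes the e-value cutoff."""
    return e_value is not None and e_value < E_VALUE_CUTOFF


def classify_read(top_hits: Dict[str, Optional[float]]) -> str:
    """Classify one read from the e-values of its top BLAST hits.

    top_hits maps a database label to the e-value of the read's top hit
    in that database; a missing key or None means no hit was found.
    """
    # Organelle and repeat matches are assigned before genome/protein matches.
    for db in SCREEN_ORDER:
        if is_significant(top_hits.get(db)):
            return db
    # A protein match implies an annotated protein-coding gene.
    if is_significant(top_hits.get("protein")):
        return "annotated_gene"
    # Genome-only matches: candidate noncoding RNA or unannotated genes.
    if is_significant(top_hits.get("genome")):
        return "genome_only"
    return "no_match"
```

For example, a read whose only significant hit is to the genome would be labeled "genome_only" here, which corresponds to the 7% class discussed in the caption.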

This Article

  1. Genome Res. 15: 496-504
