Capturing Whole-Genome Characteristics in Short Sequences Using a Naïve Bayesian Classifier

Figure 5.

Classification of closely related microorganisms: classification accuracy between different strains of the same species. The y-axis shows the classification accuracy in percent, calculated as the mean ratio of correct predictions to the total number of predictions for each genome and test run. The x-axis shows the sequence length in base pairs (35, 60, 100, 200, 400, and 1000). We sampled 100 genomic sequences for each genome and sequence length.
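
The accuracy measure plotted on the y-axis can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the strain labels, and the choice to average the per-run ratios with equal weight are assumptions made for illustration only.

```python
from statistics import mean


def accuracy_percent(predicted, actual):
    """Percentage of predictions that match the true genome label."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("no predictions to score")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


def mean_accuracy_percent(runs):
    """Mean of the per-run accuracies (assumption: each run weighted equally)."""
    return mean(accuracy_percent(p, a) for p, a in runs)


# Hypothetical labels for two test runs of four sampled sequences each.
runs = [
    # 3 of 4 predictions correct
    (["strain_A", "strain_A", "strain_B", "strain_A"],
     ["strain_A", "strain_A", "strain_B", "strain_B"]),
    # 4 of 4 predictions correct
    (["strain_B", "strain_B", "strain_A", "strain_A"],
     ["strain_B", "strain_B", "strain_A", "strain_A"]),
]
print(mean_accuracy_percent(runs))  # prints 87.5
```

In the figure, each point would correspond to such a mean computed over 100 sampled sequences per genome at one sequence length.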

This Article

  1. Genome Res. 11: 1404-1409
