Douglas R. Smith; Peter Richterich; Marc Rubenfield; Philip W. Rice; Carol Butler; Hong-Mei Lee; Susan Kirst; Kristin Gundersen; Kari Abendschan; Qinxue Xu; Maria Chung; Craig Deloughery; Tyler Aldredge; James Maher; Ronald Lundstrom; Craig Tulig; Kathleen Falls; Joan Imrich; Dana Torrey; Marcy Engelstein; Gary Breton; Deepika Madan; Raymond Nietupski; Bruce Seitz; Steven Connelly; Steven McDougall; Hershel Safer; Rene Gibson; Lynn Doucette-Stamm; Karin Eiglmeier; Staffan Bergh; Stewart T. Cole; Keith Robison; Laura Richterich; Jason Johnson; George M. Church; Jen-i Mao

Figure 6.

Summary of alignments from similarity searches between 1157 M. leprae proteins (including all of the gene products from this study) and 1564 M. tuberculosis proteins from GenPept. Each of the M. leprae proteins was searched against the set of M. tuberculosis proteins using an implementation of the Smith–Waterman algorithm with default parameters on a Biocellerator (Compugen) in conjunction with the GCG Wisconsin Package. For each M. leprae protein, the % Similarity and % Identity values from the best alignment were multiplied by the fraction of query amino acids represented in that alignment (no. of query residues in alignment/total query length). This was done to provide a better indication of the overall similarity of each M. leprae protein to its best M. tuberculosis homolog. The resulting values were termed Normalized Similarity and Normalized Identity. The pairs were sorted in descending order of Normalized Identity, and the normalized values were plotted together with the raw % Identity values for comparison.
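The normalization described in the legend can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the authors' pipeline: the `BestHit` record, its field names, and the functions `normalize` and `rank_hits` are hypothetical, and the sketch assumes the best alignment for each query has already been obtained from the Smith–Waterman search.

```python
from dataclasses import dataclass


@dataclass
class BestHit:
    """Best alignment for one M. leprae query (hypothetical record layout)."""
    query_id: str
    query_length: int            # total residues in the query protein
    aligned_query_residues: int  # query residues represented in the alignment
    pct_identity: float          # raw % Identity reported for the alignment
    pct_similarity: float        # raw % Similarity reported for the alignment


def normalize(hit):
    """Scale raw percentages by the fraction of the query that was aligned."""
    coverage = hit.aligned_query_residues / hit.query_length
    return hit.pct_identity * coverage, hit.pct_similarity * coverage


def rank_hits(hits):
    """Return (id, raw identity, normalized identity, normalized similarity)
    tuples sorted by normalized identity, highest first."""
    rows = []
    for hit in hits:
        norm_identity, norm_similarity = normalize(hit)
        rows.append((hit.query_id, hit.pct_identity, norm_identity, norm_similarity))
    rows.sort(key=lambda row: row[2], reverse=True)
    return rows
```

With made-up numbers, a query of 200 residues whose best alignment covers 100 of them at 80% identity gets a Normalized Identity of 40, so it ranks below a fully aligned query at 60% identity. This is the effect the legend describes: short, high-scoring local alignments no longer overstate the overall similarity of the pair.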

Multiplex Sequencing of 1.5 Mb of the Mycobacterium leprae Genome
