Methods

Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human

    • 1 Department of Computer Science, Stanford University, Stanford, California 94305, USA
    • 2 Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
    • 3 U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
    • 4 Softberry Inc., Mount Kisco, New York 10549, USA
    • 5 Department of Genetics, Stanford University, Stanford, California 94305-5324, USA
    • 6 Department of Pathology, Stanford University, Stanford, California 94305-5324, USA
Published April 1, 2004. Vol 14 Issue 4, pp. 685-692. https://doi.org/10.1101/gr.2067704
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

We have built a whole-genome multiple alignment of the three currently available mammalian genomes using a fully automated pipeline that combines the local/global approach of the Berkeley Genome Pipeline and the LAGAN program. The strategy is based on progressive alignment and consists of two main steps: (1) alignment of the mouse and rat genomes, and (2) alignment of human to either the mouse-rat alignments from step 1, or the remaining unaligned mouse and rat sequences. The resulting alignments demonstrate high sensitivity, with 87% of all human gene-coding areas aligned in both mouse and rat. The specificity is also high: <7% of the rat contigs are aligned to multiple places in human, and 97% of all alignments with human sequence >100 kb agree with a three-way synteny map built independently, using predicted exons in the three genomes. At the nucleotide level <1% of the rat nucleotides are mapped to multiple places in the human sequence in the alignment, and 96.5% of human nucleotides within all alignments agree with the synteny map. The alignments are publicly available online, with visualization through the novel Multi-VISTA browser that we also present.

Loading
Loading
Back to top