Method

Comparative Annotation Toolkit (CAT)—simultaneous clade and personal genome annotation

    • 1Genomics Institute, University of California Santa Cruz and Howard Hughes Medical Institute, Santa Cruz, California 95064, USA;
    • 210x Genomics, Pleasanton, California 94566, USA;
    • 3Institute of Mathematics and Computer Science, University of Greifswald, 17489 Greifswald, Germany;
    • 4Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA;
    • 5Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA;
    • 6Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA;
    • 7European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
    • 8 These authors contributed equally to this work.
Published June 8, 2018. Vol 28 Issue 7, pp. 1029-1038. https://doi.org/10.1101/gr.233460.117
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assemblies. To compare these genomes, we need robust methods for genome annotation. We describe the fully open source Comparative Annotation Toolkit (CAT), which provides a flexible way to simultaneously annotate entire clades and identify orthology relationships. We show that CAT can be used to improve annotations on the rat genome, annotate the great apes, annotate a diverse set of mammals, and annotate personal, diploid human genomes. We demonstrate the resulting discovery of novel genes, isoforms, and structural variants—even in genomes as well studied as rat and the great apes—and how these annotations improve cross-species RNA expression experiments.

Loading
Loading
Back to top