TY - JOUR A1 - Dear, Simon A1 - Durbin, Richard A1 - Hillier, LaDeana A1 - Marth, Gabor A1 - Thierry-Mieg, Jean A1 - Mott, Richard T1 - Sequence Assembly with CAFTOOLS Y1 - 1998/03/01 JF - Genome Research JO - Genome Research SP - 260 EP - 267 DO - 10.1101/gr.8.3.260 VL - 8 IS - 3 UR - http://genome.cshlp.org/content/8/3/260.abstract N2 - Large-scale genomic sequencing requires a software infrastructure to support and integrate applications that are not directly compatible. We describe a suite of software tools built around the Common Assembly Format (CAF), a comprehensive representation of a sequence assembly as a text file. These tools form the backbone of sequencing informatics at the Sanger Centre and the Genome Sequencing Center. The CAF format is intentionally flexible, and our Perl and C libraries, which parse and manipulate it, provide powerful tools for creating new applications as well as wrappers to incorporate other software. The tools are available free by anonymous FTP from ftp://ftp.sanger.ac.uk/pub/badger/. ER -