TY - JOUR A1 - Yu, Wei A1 - Andersson, Björn A1 - Worley, Kim C. A1 - Muzny, Donna M. A1 - Ding, Yan A1 - Liu, Wen A1 - Ricafrente, Jennifer Y. A1 - Wentland, Meredith A. A1 - Lennon, Greg A1 - Gibbs, Richard A. T1 - Large-Scale Concatenation cDNA Sequencing Y1 - 1997/04/01 JF - Genome Research JO - Genome Research SP - 353 EP - 358 DO - 10.1101/gr.7.4.353 VL - 7 IS - 4 UR - http://genome.cshlp.org/content/7/4/353.abstract N2 - A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching.[All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] ER -