RT Journal A1 Sakate, Ryuichi A1 Osada, Naoki A1 Hida, Munetomo A1 Sugano, Sumio A1 Hayasaka, Ikuo A1 Shimohira, Naoko A1 Yanagi, Shinsuke A1 Suto, Yumiko A1 Hashimoto, Katsuyuki A1 Hirai, Momoki T1 Analysis of 5′-End Sequences of Chimpanzee cDNAs JF Genome Research JO Genome Research YR 2003 FD May 01 VO 13 IS 5 SP 1022 OP 1026 DO 10.1101/gr.783103 UL http://genome.cshlp.org/content/13/5/1022.abstract AB We constructed full-length enriched cDNA libraries from chimpanzee brain, skin, and liver tissues by the oligo-capping method to establish a database of sequences of chimpanzee genes. Randomly selected clones from the libraries were subjected to one-pass sequencing from their 5′-ends. As a result, we collected 6813 chimpanzee cDNA sequences longer than 400 bp. Homology search against human mRNA sequences (RefSeq mRNAs) revealed that our collection included sequences of 1652 putative chimpanzee genes. In order to calculate the sequence identity between human and chimpanzee homologs, we constructed 5′-end consensus sequences of 226 chimpanzee genes by aligning at least three sequences for individual genes. Sequence identity was estimated by comparing these consensus sequences and the corresponding sequences of their human homologs. The average sequence identity of the 5′-end cDNAs was 99.30%. Those of the 5′-UTRs and CDSs were 98.79% and 99.42%, respectively. The results confirmed that human and chimpanzee genes are highly conserved at the nucleotide level. As for amino acids, the average sequence identity was 99.44%. The average synonymous (KS ) and nonsynonymous (KA ) divergences were estimated to be 1.33% and 0.28%, respectively.[Supplemental material is available online atwww.genome.org. All of the 1947 sequences used for constructing the consensus sequences of 226 chimpanzee genes have been submitted to DDBJ under accession nos. AU296732–AU298678. Two hundred twenty-six consensus sequences and their detailed annotation descriptions are available at our Web site http://www.prigen.org/.]