RT Journal
A1 Sakate, Ryuichi
A1 Osada, Naoki
A1 Hida, Munetomo
A1 Sugano, Sumio
A1 Hayasaka, Ikuo
A1 Shimohira, Naoko
A1 Yanagi, Shinsuke
A1 Suto, Yumiko
A1 Hashimoto, Katsuyuki
A1 Hirai, Momoki
T1 Analysis of 5′-End Sequences of Chimpanzee cDNAs
JF Genome Research 
JO Genome Research 
YR 2003 
FD May 01 
VO 13 
IS 5 
SP 1022 
OP 1026 
DO 10.1101/gr.783103 
UL http://genome.cshlp.org/content/13/5/1022.abstract 
AB We constructed full-length enriched cDNA libraries from chimpanzee brain, skin, and liver tissues by the oligo-capping method to establish a database of sequences of chimpanzee genes. Randomly selected clones from the libraries were subjected to one-pass sequencing from their 5′-ends. As a result, we collected 6813 chimpanzee cDNA sequences longer than 400 bp. Homology search against human mRNA sequences (RefSeq mRNAs) revealed that our collection included sequences of 1652 putative chimpanzee genes. In order to calculate the sequence identity between human and chimpanzee homologs, we constructed 5′-end consensus sequences of 226 chimpanzee genes by aligning at least three sequences for individual genes. Sequence identity was estimated by comparing these consensus sequences and the corresponding sequences of their human homologs. The average sequence identity of the 5′-end cDNAs was 99.30%. Those of the 5′-UTRs and CDSs were 98.79% and 99.42%, respectively. The results confirmed that human and chimpanzee genes are highly conserved at the nucleotide level. As for amino acids, the average sequence identity was 99.44%. The average synonymous (KS) and nonsynonymous (KA) divergences were estimated to be 1.33% and 0.28%, respectively. [Supplemental material is available online at www.genome.org. All of the 1947 sequences used for constructing the consensus sequences of 226 chimpanzee genes have been submitted to DDBJ under accession nos. AU296732–AU298678. Two hundred twenty-six consensus sequences and their detailed annotation descriptions are available at our Web site http://www.prigen.org/.]