Large-Scale Sequencing of Two Regions in Human Chromosome 7q22: Analysis of 650 kb of Genomic Sequence around the EPO and CUTL1 Loci Reveals 17 Genes

Table 3.

Genes Found in the Analyzed Regions

Gene Description Accession/ homolog No. of exons on contig Minimum length (bp) of mRNA Minimum length (bp) on genome ORF length (aa) mRNA covered by ESTs (%) Detection method/comment
A. EPOcontig
ZAN zonadhesin U83191partial human mRNA 34 7233 >48,500 2177 0 alignment to S. scrofa zan U40024
EPO erythropoietin M11329human mRNA 6 783 2,700 193 10 genomic structure and mRNA sequence already known
CDS1 unknown EST aa 158469 2 461 980 168 100 Alignment to aa 158469
CDS2 unknown U90567 G. gallus 19 3065 7,250 817 30 one EST, exon prediction programs
GNB2 G-nucleotidebinding factor M16514 human mRNA 10 1438 2,950 340 100 mRNA sequence already known
ACT16 actin-like protein D32140 C. merolae 13 1541 10,100 475 77 overlapping ESTs, cDNA sequencing
TFR2 transferrin receptor X01060-related human mRNA 18 2531 20,500 786 95 overlapping ESTs, cDNA sequencing, 60% homology toX01060
CDS3A unknown C34D10 C. elegans 6 1034 3,200 182–235 100 overlapping ESTs, cDNA sequencing; only homology to C. elegans ORF;splicing variants lead to different N amino termini
CDS3B 6 1013 2,800 100
CDS3C 4 842 2,330 100
POLCE procollagen C-proteinase enhancer L33799mRNA 9 1480 5,800 449 100 mRNA sequence already known
CDS4 unknown EST aa 251566 8 1530 10,750 318 25 gene prediction programs
LRN leucine-rich neuronal protein X79682 Felis catus 19 2407 13,000 612 70 overlapping ESTs; gene prediction programs
IRS3L insulin receptor substrate 3-like protein U93880 Rattus norvegicus 5 1228 3,600 256 0 gene prodiction programs
HRBL nucleoporin-like protein D14689-related human mRNA 9 1321 17,000 327 85 overlapping ESTs; gene prediction programs
B. CUTL1contig
CDS5 unknown ESTT11673 7 1499 7,400 235 80 overlapping ESTs; prolin rich protein
PMSL12 mismatch repair gene U14658-related human mRNA 6 1454 18,300 219 ? Alignment with pms2
APS adaptor protein AB000520 human mRNA 9 2111 36,500 632 30 mRNA sequence already known
CUTL1 (CDP) human displacement protein M74099 human mRNA 21 5376 >285,000 1505 15 mRNA sequence already known
(CASP) alternatively spliced CDP L12579 human mRNA 22 2855 >320,000 678 60 mRNA sequence already known

This Article

  1. Genome Res. 8: 1060-1073

Preprint Server