Genomic Sequence and Transcriptional Profile of the Boundary Between Pericentromeric Satellites and Genes on Human Chromosome Arm 10p

Table 1.

ESTs and Genes Within the 10p11 Sequence

Region of identity (contig1), contig2 GenBank acc. no/unigene cluster (No.) % match Features (repeat/ gene name ) Genome origin (if not 10q11)
(30269-30688) AL045218 99
(65078-69657) AI797613/Hs135840 (6) 98 4q24 (ap001860)
(70352-70426) AI740992/Hs133165 (3) 97 4q24 (ap001860)
(184338-183720) AW974557 98 2q11 (al445993)
48769-49073 AI284091 98 L1 1q42.11 (al365438)
63525-72216 AI971943 97 1p36.33 (ac004908)
85121-84555 BE145230 97 AluJo Multiple, Subtelomeric.
133098-132535 AW499533 100
140359-140839 AA195187 98 AluSp 4q26 (ac022702)
173552-151404 AW854054/Hs187579 (75) 98 HSD17B7Ψ
178360-177773 AI799915/Hs15248 (164) 99 1q23.3 (AC069037)
179950-180507 AI927669/Hs42392 (39) 99 1q23.3 (AL441926)
277316-277660 AA593504/Hs162587 (1) 97 1q43 (al360271)
294267-294626 AW015485/Hs341696 (1) 97 10p11.2 (al390956)
349457-349181 AW856442 100 HAL1
351871-365253 AA927631/Hs340030 (1) 99
352513-352875 AW072278 100 L1PA8
379608-379900 AI637955/Hs224979 (4) 99 L2
382137-382486 AA680406/Hs126913 (4) 99
385502-385916 AW301129/Hs318978 (1) 100
435627-406440 X69115/Hs54488 (10) 100 ZNF37A
519524-469848 X68687/Hs70617 100 ZNF11/33A
553075-580039 AK056452 100 ZNF25
672343-700930 AJ492196 100 ZNF248
709987-710637 BE387652/Hs57553 (181) 98 TLK2Ψ
700012-700440 HS562273 100
754102-754795 BE378519 99
785464-785056 AI248257/Hs149302 (1) 100
785930-785515 AI991440 100
771258-770961 AA923150/Hs148281 (1) 100 L1MC4
1062408-1062674 AA744917 100
1073813-1074078 BE064203 100
1075595-1075848 BE063630 99
1076792-1076582 AI393188 100
1099274-1099737 AA921809/Hs132449 (3) 99
1100666-1101084 BE063346 99
1101637-1101938 AA725605/Hs293102 (2) 100 AluJ
1111674-1111962 AI902319 100
1126559-1126868 BE008091 100
1132040-1132647 AI905942 100 MER5A + 5B
1133889-1134201 BE069326 100
1135128-1135707 BE061064 100 MIR
1138066-1139146 BE072345 100 L2
1139874-1140174 AI906585 100
1145135-1145244 BE072230 100 MLT1H1
1147376-1147687 AA632089 100
1149924-1150010 AW175720 100 MIR
  • ESTs were identified by using BLAST to query the Swissprot, TREMBL, Unigene, and dbEST databases. ESTs were defined as genes if they coincided with either ab initio gene predictions or protein similarity and an intact ORF. ESTs were defined as pseudogenes if they coincided with protein similarity and a disrupted ORF relative to the known protein. Details of gene fragments (similarity to part of known protein, no ESTs) can be found at http://www.ncl.ac.uk/ihg/10p11.htm . No consistent ab initio gene predictions were obtained in the absence of ESTs. The position of each feature within contig 1 (parenthesis) and 2 are shown, together with accession number and unigene cluster information. The % identity of the ESTs to 10p11 are also shown (% match over >80% of EST length). AA927631 (351871-365252) is the only anonymous EST which is spliced.

  • Only two ESTs within the TLK2 cluster (BE367652,AV706880) are derived from the 10p11 pseudogene and only four ESTs from the HSD17B7 cluster (BG199997, BG189163, AI351558, BG182213) are derived from the 10p11 pseudogene.

This Article

  1. Genome Res. 13: 159-172

Preprint Server