Assembly, Annotation, and Integration of UNIGENE Clusters into the Human Genome Draft

Table 4.

The Human Transcript Map (Example)

Chromo Clone From (bp) To (bp) GB4 (cR) Evidence HINT ID Description
16 AC002040 31778068 31778186 192.52 (+0+0) AI128170_2 ESTs
16 AC002040 31831164 31831640 192.52 (+0+0) AI285970_4 ESTs
16p12.2 AC025273 31960092 32037384 194.27 (++0+) X87159_30 Sodium channel, nonvoltage-gated 1, beta (Liddle syndrome)
16 AC025273 32055445 32055838 195.29 (+0+0) AI635217_4 RETINOPATHY PROTEIN (Score: 140)
16p12 AC025273 32087870 32088023 196.52 (++0+) AB029003_296 KIAA1080 protein
16 AC025273 32096920 32097148 195.29 (+0+0) AI022922_3 NEURONAL THREAD PROTEIN AD7C-NTP (Score: 243)
16 AC025273 32109671 32109902 195.29 (+0+0) AW207413_1 ESTs
16p12 AC002400 32159709 32203727 196.52 (++0+) AB029003_296 KIAA1080 protein
16 AC002400 32194539 32194738 195.26 (+0+0) AI609753_1 CG14939 PROTEIN (Score: 581)
16 AC002400 32215385 32215802 193.96 (++0+) AA833716_22 ESTs
16 AC002400 32216451 32217395 193.96 (++0+) AW361820_26 ESTs
16 AC002400 32217644 32237865 195.26 (+0+0) AC002400_5 GLUTAMYL-TRNA SYNTHETASE (Score: 329)
16 AC002400 32251204 32264728 195.26 (+0+0) AC002400_34 Ubiquitin-binding protein homolog human (Score: 2041)
16 AC002400 32265227 32267320 196.77 (++0+) W22792_117 ESTs
16 AC002400 32275528 32275628 195.26 (+00+) AC002400_120 NADH dehydrogenase (ubiquinone) 1, alpha/beta subcomplex
16 AC002400 32277889 32278513 196.17 (++0+) H66286_10 ESTs
16 AC009043 32364421 32368017 196.77 (++0+) W22792_117 ESTs
16 AC008870 32425556 32540868 196.77 (++0+) AA811478_5 DYNACTIN SUBUNIT P25 (Score: 167)
16 AC009043 32434020 32437557 196.77 (++0+) AI697625_20 ESTs
16 AC008870 32437193 32447003 196.77 (++0+) AI697625_20 ESTs
16 AC012185 32441029 32441121 193.96 (+0+0) AW445082_3 RETINOBLASTOMA BINDING PROTEIN 2 HOMOLOG 1 (Score: 111)
16 AC012185 32474457 32474527 193.96 (+0+0) AA714835_9 ESTs
16 AC012185 32477696 32507184 193.96 (+0+0) U01038_83 Polo (Drosophila)-like kinase
16 AC012185 32493020 32493133 193.96 (+0+0) AA436947_7 ESTs
16p11.2 AC012185 32496241 32497643 198.28 (++0+) X07109_116 Protein kinase C, beta 1
16 AC012185 32497057 32498921 193.96 (+0+0) AC002302_4 SID470P (Score: 160)
16 AC012185 32499777 32500136 193.96 (++00) AA527435_16 ESTs
16 AC012185 32503878 32505108 193.96 (+0+0) AW294825_1 ESTs
16 AC012185 32507584 32508921 193.96 (+00+) AI732416_20 IRE1, Saccharomyces cerevisiae, homolog of
16 AC012185 32517480 32519058 193.96 (+0+0) AI676060_3 IRE1 (Score: 222)
  • Only a portion of human 16p is presented. A more complete human transcript map is available as supplemental information from our Web site at http://pandora.med.ohio-state.edu/HINT (Supplementary Table 14). Column 1, chromosome or cytoband information; Column 2, GenBank accession numbers for the genomic clones assembled in the GoldenPath map (July version); Columns 3 and 4, the start and end positions in base pairs for each transcript consensus based on the GoldenPath chromosomal contigs (July version); Column 5, the GB4 positions for the genomic clones in cR, available from the Genemap'99 or the e-PCR map; Column 6, supporting or conflicting evidence (+ = supportive, − = conflicting, 0 = no data available), ordered by genomic radiation hybrid, transcript radiation hybrid, fingerprint, and UNIGENE maps; Column 7, the consensus ID (HINT ID), defined by the GenBank accession number of the longest transcript followed by the number of transcripts in the same UNIGENE cluster; Column 8, gene description derived from the original UNIGENE dataset and supplemented with additional protein homology information for the previously anonymous ESTs. The newly annotated information can be found in the rows ending with BLAST scores.
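The column layout described in the footnote can be made concrete with a small parsing sketch. This is only an illustration, not the authors' software: the names `TranscriptRow`, `parse_row`, and `EVIDENCE_SOURCES` are hypothetical, and the code assumes that rows are whitespace-delimited as they appear in this rendering of the table and that the four evidence symbols follow the order given in the footnote (genomic radiation hybrid, transcript radiation hybrid, fingerprint, UNIGENE).

```python
import re
from dataclasses import dataclass
from typing import Dict, Optional

# Assumed order of the four evidence symbols, taken from the table footnote.
EVIDENCE_SOURCES = ("genomic_rh", "transcript_rh", "fingerprint", "unigene")

# "\u2212" is the typographic minus sign used in the footnote.
EVIDENCE_MEANING = {
    "+": "supportive",
    "-": "conflicting",
    "\u2212": "conflicting",
    "0": "no data",
}


@dataclass
class TranscriptRow:
    """One row of the transcript map (hypothetical container)."""
    location: str               # chromosome or cytoband, e.g. "16p12.2"
    clone: str                  # GenBank accession of the genomic clone
    start: int                  # start position (bp)
    end: int                    # end position (bp)
    gb4_cr: float               # GB4 position (cR)
    evidence: Dict[str, str]    # evidence source -> meaning
    accession: str              # accession part of the HINT ID
    n_transcripts: int          # numeric suffix of the HINT ID
    description: str            # free-text gene description
    blast_score: Optional[int]  # present only for newly annotated rows


def parse_row(line: str) -> TranscriptRow:
    """Parse one whitespace-delimited table row into a TranscriptRow."""
    # The description may contain spaces, so split off only the first 7 fields.
    fields = line.split(maxsplit=7)
    if len(fields) != 8:
        raise ValueError(f"expected 8 columns, got {len(fields)}: {line!r}")
    location, clone, start, end, gb4, ev, hint_id, description = fields

    symbols = ev.strip("()")
    if len(symbols) != len(EVIDENCE_SOURCES):
        raise ValueError(f"unexpected evidence string: {ev!r}")
    evidence = {src: EVIDENCE_MEANING[s]
                for src, s in zip(EVIDENCE_SOURCES, symbols)}

    # HINT ID = accession of the longest transcript + "_" + transcript count.
    accession, _, count = hint_id.rpartition("_")

    # Newly annotated rows end with "(Score: N)".
    match = re.search(r"\(Score:\s*(\d+)\)\s*$", description)
    blast_score = int(match.group(1)) if match else None

    return TranscriptRow(location, clone, int(start), int(end), float(gb4),
                         evidence, accession, int(count), description.strip(),
                         blast_score)


if __name__ == "__main__":
    row = parse_row("16 AC002400 32217644 32237865 195.26 (+0+0) "
                    "AC002400_5 GLUTAMYL-TRNA SYNTHETASE (Score: 329)")
    print(row.accession, row.n_transcripts, row.blast_score, row.evidence)
```

Splitting on the first seven whitespace runs keeps descriptions such as "Sodium channel, nonvoltage-gated 1, beta (Liddle syndrome)" intact, and rows without a trailing BLAST score simply get `blast_score = None`.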

This Article

  1. Genome Res. 11: 904-918
