Assembly, Annotation, and Integration of UNIGENE Clusters into the Human Genome Draft

Table 2.

The Integration of HINT Consensus and Ewing and Green's Assemblies into the Human Genome Draft

Comparison of Mapped Transcripts
HINT Ewing and Green (EG)
Transcripts mapped 43,484 29,582
 HSP length (kb) 34,226 12,723
 Average BLAST score to genome 472 490
 Overall sequence identity with genome 93.77% 95.43%
Unique transcripts 28,026 11,002
 HSP length (kb) 24,568 3,452
 Average BLAST score to genome 471 490
 Overall sequence identity with genome 93.53% 95.18%
Comparison of Exons in Transcripts
HINT vs. EG EG vs. HINT
Transcripts with an exact exon overlap 5,363 5,675
 HSP length (kb) 1,731 1,731
 Average BLAST score to genome 475 490
 Overall sequence identity with genome 93.86% 95.25%
Transcripts with an exon extended 7,580 4,759
 HSP length (kb) 4,682 2,129
 Average BLAST score to genome 472 490
 Overall sequence identity with genome 93.88% 95.69%
Transcripts with an exon truncated 5,142 2,012
 HSP length (kb) 1,724 2,989
 Average BLAST score to genome 471 490
 Overall sequence identity with genome 93.62% 96.02%
Transcripts with an exon overhanging 229 850
 HSP length (kb) 253 656
 Average BLAST score to genome 463 483
 Overall sequence identity with genome 91.41% 93.32%
  • (HSP) High-scoring segment pair, representing exons. AverageBLAST score was obtained by dividing the totalBLAST score by the total length of the HSPs involved, represented as per 100 base pair HSP. Overall sequence identity was similarly averaged by the total length of HSP involved.

  • (Overlap) HSPs overlap by ± 2 base pairs on either side between the HSPs of HINT and EG.

  • (Extended) HSPs from one index were longer than the other one.

  • (Truncated) HSPs from one index were shorter than the other one. In both cases, the positions of two HSPs agreed on either 5′ or 3′ side of their HSPs within ± 2 base pairs.

  • (Overhanging) both the 5′ and 3′ positions of the two HSPs disagreed (> ± 2 base pairs).

  • (kb) Kilobase pairs.

This Article

  1. Genome Res. 11: 904-918

Preprint Server