Combining DNA and protein alignments to improve genome annotation with LiftOn

Table 1.

LiftOn statistics at the gene and transcript levels from mapping the RefSeq release 220 annotation from the GRCh38 human genome to T2T-CHM13.

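| Level | Genome | Total feature count | Protein-coding: single copy | Protein-coding: extra copy | Protein-coding: extra copy count | Protein-coding: total | Noncoding: single copy | Noncoding: extra copy | Noncoding: extra copy count | Noncoding: total |
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Reference (GRCh38) | 37,986 | | | | 19,927 | | | | 18,059 |
| Gene | Target (CHM13) | 38,916 | 19,738 | 86 | 320 | 20,144 | 17,715 | 289 | 768 | 18,772 |
| Transcript | Reference (GRCh38) | 160,561 | | | | 130,528 | | | | 30,033 |
| Transcript | Target (CHM13) | 161,701 | 129,967 | 239 | 573 | 130,779 | 29,488 | 410 | 1,024 | 30,922 |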
  • Extra copy refers to the number of genes with extra copies; extra copy count refers to the total number of additional copies for those extra copy genes.
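The target (CHM13) figures in Table 1 are consistent with reading each category total as the sum of single copy, extra copy, and extra copy count, and each overall total as the sum of the protein-coding and noncoding totals. The source does not state this as a formula; it is an observation from the numbers. The short Python sketch below checks it, with all values copied from the table and all names (`target_rows`, `check_table`, `net_difference`) chosen here for illustration only.

```python
# Target (CHM13) rows from Table 1:
# (single copy, extra copy, extra copy count, total).
target_rows = {
    ("gene", "protein-coding"): (19_738, 86, 320, 20_144),
    ("gene", "noncoding"): (17_715, 289, 768, 18_772),
    ("transcript", "protein-coding"): (129_967, 239, 573, 130_779),
    ("transcript", "noncoding"): (29_488, 410, 1_024, 30_922),
}

# Overall feature counts from the "Total feature count" column.
target_totals = {"gene": 38_916, "transcript": 161_701}
reference_totals = {"gene": 37_986, "transcript": 160_561}


def check_table(rows, totals):
    """Return a dict of consistency checks, keyed by (level, category)."""
    results = {}
    # Assumed reading: total = single copy + extra copy + extra copy count.
    for key, (single, extra, extra_count, total) in rows.items():
        results[key] = single + extra + extra_count == total
    # Overall total = protein-coding total + noncoding total.
    for level, overall in totals.items():
        category_sum = sum(
            row[3] for (lvl, _), row in rows.items() if lvl == level
        )
        results[(level, "overall")] = category_sum == overall
    return results


def net_difference(level):
    """Target count minus reference count for one feature level."""
    return target_totals[level] - reference_totals[level]


if __name__ == "__main__":
    for key, ok in check_table(target_rows, target_totals).items():
        print(key, "consistent" if ok else "inconsistent")
    for level in ("gene", "transcript"):
        print(level, "net difference:", net_difference(level))
```

Under this reading, every row passes, and the target annotation holds 930 more genes and 1,140 more transcripts than the reference.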

This Article

  1. Genome Res. 35: 311-325
