Table 1.

Long-read HiFi genome sequencing SNV/indel (<50 bp) accuracy

Genome-wideRefSeq CDS
Low complexityLow mappabilitySegmental duplicationsAll difficult regionsNot in any difficult regionAllNot in any difficult regionAll
SNVsCount160,865190,416120,916584,7432,718,6043,303,34614,05920,593
Recall99.05%97.84%96.63%98.93%99.91%99.74%99.86%99.56%
Precision99.54%99.81%99.67%99.80%99.99%99.95%99.98%99.96%
Insertions (1–5 bp)Count128,23742414596140,93469,118210,63946109
Recall98.57%98.50%98.62%98.64%99.90%99.05%99.71%99.44%
Precision98.42%99.14%98.98%98.53%99.94%98.99%100.00%99.35%
Insertions (6–15 bp)Count14,43534248615,460649622,004542
Recall98.67%98.39%98.77%98.70%99.73%99.00%100.00%99.58%
Precision98.98%98.52%98.87%99.02%99.94%99.29%100.00%99.25%
Insertions (≥16 bp)Count2415122136274720164776213
Recall98.76%96.91%97.86%98.80%99.80%99.22%85.71%97.52%
Precision99.16%97.23%98.15%99.17%99.86%99.46%100.00%96.63%
Deletions (1–5 bp)Count136,02446004507148,95369,677218,12959142
Recall98.78%98.45%98.26%98.82%99.87%99.15%99.48%99.00%
Precision98.89%99.18%98.91%98.95%99.93%99.25%99.72%99.49%
Deletions (6–15 bp)Count16,66446654417,758664724,1901050
Recall98.68%98.11%98.29%98.68%99.57%98.93%100.00%99.74%
Precision98.88%98.64%98.40%98.90%99.86%99.15%100.00%99.74%
Deletions (≥16 bp)Count3424152116368116215170212
Recall99.33%98.53%98.15%99.32%99.63%99.42%85.71%100.00%
Precision99.49%98.93%98.15%99.49%99.88%99.61%100.00%100.00%
All indelsCount285,386984010,200313,715155,551469,067125362
Recall98.78%98.43%98.42%98.82%99.86%99.16%99.65%99.26%
Precision98.70%99.09%98.89%98.78%99.93%99.14%99.88%99.38%
SNVs and indelsCount462,064200,340131,301914,2752,874,1793,788,25514,18420,961
Recall98.81%97.87%96.78%98.85%99.91%99.65%99.86%99.55%
Precision98.99%99.77%99.61%99.42%99.99%99.85%99.98%99.95%

[i] (indels) insertions/deletions, (RefSeq CDS) NCBI Reference Sequence gene coding sequence, (SNVs) single nucleotide variants.