Table 2.

Long-read HiFi genome sequencing SNV/indel (<50 bp) reproducibility (GIAB high confidence)

Genome-wideRefSeq CDS
Low complexityLow mappabilitySegmental duplicationsAll difficult regionsNot in any difficult regionAllNot in any difficult regionAll
SNVsCount170,332181,786113,753584,4872,721,3313,305,81814,02320,467
Concordance99.10%99.51%99.30%99.54%99.95%99.88%99.91%99.85%
Insertions (1–5 bp)Count139,06940854495151,45669,083221,14648115
Concordance97.41%99.38%99.26%97.61%99.94%98.35%99.66%99.17%
Insertions (6–15 bp)Count15,33533549016,341648722,882543
Concordance98.13%98.99%99.31%98.23%99.96%98.74%100.00%100.00%
Insertions (≥16 bp)Count2527118119280518474664212
Concordance98.22%97.70%98.01%98.31%99.84%98.93%100.00%95.22%
Deletions (1–5 bp)Count150,50444594313163,22969,693232,40758141
Concordance98.13%99.35%99.21%98.25%99.97%98.77%99.72%99.41%
Deletions (6–15 bp)Count17,42645755218,499665224,9401047
Concordance98.38%99.17%98.96%98.46%99.96%98.86%100.00%100.00%
Deletions (≥16 bp)Count3525160101377115835220316
Concordance99.21%98.92%99.01%99.26%99.96%99.50%100.00%100.00%
All indelsCount311,65295189877339,360155,318494,492125370
Concordance97.90%99.32%99.21%98.05%99.95%98.64%99.73%99.37%
SNVs and indelsCount498,720191,398123,824940,5872,876,6753,817,07714,14820,841
Concordance98.27%99.50%99.30%98.96%99.95%99.71%99.91%99.84%

[i] (indels) insertions/deletions, (RefSeq CDS) NCBI Reference Sequence gene coding sequence, (SNVs) single nucleotide variants.