Table 6.

MHC haplotype accuracy

HaplotypePipeline errors (bp)Coverage errors (bp)Sequence complexity errors (bp)Bionano DLE-1 discrepancies (6 bp)HLA sequence errors (bp)Microarray errors (bp)Total errors (bp)Assembly accuracy % (identity)
PGF13367185NANANA56599.98
COX1783763116NANANA405799.89
EAS-P-17838114NANANA23099.99
EAS-P-21798151NANANA26699.99
EUR-P-121405604NANANA103099.97
EUR-P-218609546NANANA117399.97
AFA-1NANANA110299.99
AFA-2NANANA090999.98

[i] Pipeline errors: Errors in proband assembly caused by phasing, read correction, or assembly mistakes. Coverage errors: Errors caused by low depth of coverage in either the proband or paternal capture. Sequence complexity errors: Homopolymer or simple repeat length discrepancies between the proband and paternal assemblies. Bionano discrepancies: Number of missing DLE-1 sites in MHC assembly. HLA sequence errors: Number of differences compared to Illumina HLA typing data. Microarray errors: Number of positions discordant with microarray typing after manual review. Total errors: Total errors across all sources. Assembly accuracy: For PGF, COX, and proband samples, the total accuracy of MHC assembly. For AFA, the accuracy across positions was queried by Bionano, HLA typing, and microarray genotyping.