Optical genome mapping identifies rare structural variants in neural tube defects

  1. Roger E. Stevenson1,3
  1. 1Greenwood Genetic Center, Greenwood, South Carolina 29646, USA;
  2. 2Bionano Genomics, San Diego, California 92121, USA;
  3. 3Equanimitas, Greenwood, South Carolina 29646, USA
  • Corresponding author: 1974rest{at}gmail.com
  • Abstract

    Neural tube defects (NTDs) are the most common birth defects of the central nervous system and occur as either isolated malformations or accompanied by anomalies of other systems. The genetic basis of NTDs remains poorly understood using karyotyping, chromosomal microarray, and short-read sequencing, with only a limited number of pathogenic variants identified. Collectively, these technologies may fail to detect rare structural variants (SVs) in the genome, which may cause these birth defects. Therefore, optical genome mapping (OGM) was applied to investigate 104 NTD cases, of which 74 were isolated NTDs and 30 were NTDs with other malformations. A stepwise approach was undertaken to ascertain candidate variants using population and internal databases and performing parental studies when possible. This analysis identifies diagnostic findings in 8% of cases (8/104) and candidate findings in an additional 22% of cases (23/104). Of the candidate findings, 9% of cases (9/104) have SVs impacting genes associated with NTDs in mouse, and 13% of cases (14/104) have SVs impacting genes implicated in the neural tube development pathways. This study identifies RMND5A, HNRNPC, FOXD4, and RBBP4 as strong candidate genes associated with NTDs, and expands the phenotypic spectrum of AMER1 and TGIF1 to include NTDs. This study constitutes the first systematic investigation of SVs using OGM to elucidate the genetic determinants of NTDs. The data provide key insights into the pathogenesis of NTDs and demonstrate the contribution of SVs in the genome to these birth defects.

    Neural tube defects (NTDs) are the most common birth defects of the central nervous system, with wide variation in prevalence from <1 per 1000 births in the Americas and Europe to >10 per 1000 births in Africa and China (Zaganjor et al. 2016; Dixon et al. 2019). Globally, 300,000 births are affected with NTDs each year and account for ∼88,000 deaths per year (Zaganjor et al. 2016). The embryological basis of NTDs is an abnormal neurulation process, wherein the neural plate fails to complete its transformation into the neural tube (Sadler 2005). Neurulation begins in the third week of gestation, with the formation of the neural plate, its longitudinal extension, and concomitant formation of the neural folds and neural grove, followed by fusion of the neural folds. Fusion begins in the cervical region and proceeds both cranially and caudally, with completion of the neural tube closure process by day 28. Failure of the neural tube to close in the cranial region, results in anencephaly or encephalocele, while failure anywhere from the cervical region caudally results in spina bifida. Most NTDs occur as isolated malformations, while ∼15%–25% may occur with additional unrelated malformations (Kälién et al. 1998; Stoll et al. 2011; Dean et al. 2020).

    Epidemiological studies suggest a complex etiology underlying NTDs, with both environmental and genetic contributions. Increased dietary intake of folic acid has resulted in 50%–70% reduction in NTDs (MRC Vitamin Study Research Group 1991; Czeizel and Dudás 1992; Berry et al. 1999; Dean et al. 2020). However, among NTDs that remain folate-resistant, only ∼20% have been explained by either genetic or environmental factors. Environmental influences during pregnancy that may predispose to NTDs include maternal diabetes mellitus, maternal obesity, valproate and hyperthermia exposures, and maternal MTHFR C677T carrier state (Pulliam 2023). And while the risk for NTD is predicted to have a genetic basis in ∼60%–70% of cases, genetic causes have only been identified in <10% of cases. Genetic causes include numerical chromosomal aberrations in ∼5%, microdeletions or microduplications in <1%, and single-gene disorders in <2% of cases (Dean et al. 2020). Recently, exome (ES) and genome sequencing (GS) have been applied to investigate NTDs, but their diagnostic yield and ability to identify new genes have been limited (Lemay et al. 2019; Yang et al. 2022). Overall, these studies indicate that there is a higher frequency of structural changes in the genome compared to sequence alterations.

    These observations highlight the need for comprehensive cytogenomic analysis beyond the detection of numerical or large copy number changes. Optical genome mapping (OGM) is a cytogenomic technique that has the ability to detect several classes of structural variants (SVs) including triploidy, aneuploidy, copy number changes (deletions and duplications), balanced and unbalanced events (insertions, inversions, and translocations), and repeat expansions and contractions, with a higher sensitivity and resolution compared to conventional methods (Broeckel et al. 2024). The strengths and limitations of OGM were previously discussed (Sahajpal et al. 2021, 2023a; Iqbal et al. 2023; Broeckel et al. 2024). In this study, OGM was applied to investigate a cohort of 104 NTD cases, with the hypothesis that the identification of rare SVs would shed additional light on the genetic basis of these birth defects.

    Results

    Optical genome mapping variant calls

    OGM identified 6334 ± 218 (average ± SD) variants per sample. In each case, the variants were filtered against >300 control samples, resulting in 85 ± 67 variants per sample (8827 variants in total). These included 5847 deletions, 2397 insertions, 449 duplications, 75 inversions, 24 rearrangements, one translocation, 30 regions of the absence of heterozygosity (AOH), and four aneuploidies. Of these, a total of 175 variants overlapped a mouse NTD-associated gene, which included 73 deletions, 52 insertions, 17 duplications, five inversions, and 28 regions of AOH. Overall, 8827 variants were manually reviewed for potential relevance to the NTD phenotype. The relevance of SVs/genes was investigated using population databases (gnomAD and database of genomic variants), Online Mendelian Inheritance in Man, Human Gene Mutation Database, and an internal database of >20,000 chromosomal microarray analyses performed on samples submitted for clinical testing.

    This analysis identified diagnostic findings in 8% of cases (8/104) and candidate findings in an additional 22% of cases (23/104). Of the candidate findings, 9% of cases (9/104) had SVs impacting genes associated with NTDs in mouse, and 13% of cases (14/104) had SVs impacting genes implicated in the neural tube development pathways (Fig. 1).

    Figure 1.

    Summary of findings.

    Diagnostic findings

    OGM identified diagnostic findings in 8% of cases (8/104). These included four cases with an aneuploidy (trisomy 18), one case with a deletion involving 15q24.1q24.2, one with an unbalanced translocation involving Chromosomes 10 and 13 that resulted in a partial gain of Chromosome 10p and partial loss of the terminal region of Chromosome 13q, one with a deletion of the promoter region of the VANGL2 gene, and one with a deletion of exon 2 of the AMER1 gene (Fig. 2A; Table 1).

    Figure 2.

    Optical genome maps show deletions in two cases. (A) Partial heterozygous deletion of the AMER1 gene in a female fetus with osteopathia striata with cranial sclerosis (OS-CS) including anencephaly and cervical rachischisis (case 8). (B) Heterozygous deletion of the FOXD4 gene in a fetus with spina bifida (case 9).

    Table 1.

    List of diagnostic findings

    SVs overlapping genes associated with NTDs in mouse

    OGM identified a total of 175 variants overlapping genes associated with NTDs in mouse, of which 11 variants in nine cases were deemed relevant after manual review (Table 2). These included six deletions, three duplications, and two insertions (Figs. 3B, 4). Of these nine cases, two cases harbored multiple SVs, overlapping multiple genes associated with NTDs in mouse or rare SVs in genes of uncertain significance. Additionally, parental samples were available in three cases, of which a 9p24.3 deletion that included the FOXD4 gene was confirmed as nonmaternal.

    Figure 3.

    Optical genome maps showing deletions in two cases. (A) Heterozygous deletion in the 3′ UTR of the RBBP4 in a fetus with spina bifida (case 10). (B) Paternally inherited heterozygous deletion of TGIF1 in a fetus with anencephaly (case 11).

    Figure 4.

    Optical genome maps showing a balanced 311 kb inversion disrupting intron 2 of the RMND5A gene (detected in cases 18 and 19).

    Table 2.

    List of rare SVs impacting genes associated with NTDs in mouse

    SVs overlapping genes implicated in NTD pathways

    On manual review of rare variants, 15 variants in 14 cases were found to be candidates relevant to the NTD phenotype (Table 3). Of these 15 variants, there were six deletions, six duplications, two inversions, and one complex rearrangement. Of note, two cases carried an inversion disrupting the RMND5A gene (Fig. 4). Two cases (one with OEIS and the other with anencephaly) carried a duplication of the HNRNPC gene, the upregulation of which has been previously associated with NTDs (Niggl et al. 2023). Parental samples were available in one case, and the duplication was found to be de novo (Fig. 5A). In one case with anencephaly, deletion of exons 2 and 3 of the PFKFB4 gene, which has been shown to be involved in neural tube development (Pegoraro et al. 2015), was found to be a de novo change. In another case with anencephaly, an Xp22.2 duplication that included the HCCS and AMELX genes was identified to be maternally inherited. Loss-of-function (LoF) variants in the HCCS gene are associated with XLD linear skin defects with multiple congenital anomalies; 1 and LoF variants in the AMELX gene are associated with XLD amelogenesis imperfecta, type 1E. For this case, X-Chromosome inactivation (XCI) was randomly skewed in the mother, while the proband was highly skewed, presumably expressing the X carrying the duplication (Supplemental Fig. S1).

    Figure 5.

    Optical genome maps show duplication in two cases. (A) de novo 5′-UTR region duplication of HNRNPC in a fetus with anencephaly (case 20). (B) A heterozygous duplication of HNRNPC in a fetus with spina bifida (OEIS; case 21).

    Table 3.

    List of rare SVs impacting genes implicated in NTD pathways

    Discussion

    SVs contribute substantially to the genomic variation found in individuals and have been implicated in evolutionary processes, human health, and disease (Eichler 2019; Levy-Sakin et al. 2019). However, this contribution is underestimated, primarily due to the limitation of the available technologies used to assess SVs in the human population and rare disorders, including NTDs. To address this gap, OGM was applied to investigate this cohort of NTD cases, which led to a diagnostic yield comparable to combined conventional methods, expansion of the phenotypic spectrum associated with two genes, and identification of five strong candidate genes associated with NTDs.

    The spectrum of SVs constituting the diagnostic findings in this study indicates that OGM is a comprehensive cytogenomic method with the ability to detect the different classes of SVs in a single method and analysis (Sahajpal et al. 2021, 2023a; Iqbal et al. 2023; Broeckel et al. 2024). The frequency of numerical chromosomal aberrations (∼4%) and microdeletions/duplications (∼2%) was consistent with the frequency reported in the literature (Dean et al. 2020). Trisomy 18 has been reported to be the most common aneuploidy found in NTDs, and is the only trisomy found in this study (cases 1–4), implicating potential overexpression of gene(s) on Chromosome 18 with NTDs. Of the two cases with microdeletions/duplications, anencephaly in the fetus with unbalanced t(10;13) might be attributed to the distal 13q monosomy (case 5). Distal 13q monosomy has long been associated with NTDs, especially cranial NTDs relative to spinal defects, with 13q33-34 considered to be the critical region sufficient to cause NTDs (Lou et al. 2000). The region deleted in this case extended from 13q22.2 to q34, which is associated with a more penetrant NTD phenotype. The proximal 13q22 region is hypothesized to harbor regulatory elements that regulate the NTD-associated genes in the 13q33-q34 region (Lurie et al. 2016). The 15q24.1q24.2 deletion case of spina bifida, appears to be the second deletion case with a spinal NTD (case 6). The region includes ARID3B, CSK, and PTPN9, which are constrained genes and are implicated in neural tube development/NTDs in mouse (Lee and Gleeson 2020). Previously, a 5-month-old female affected with myelomeningocele, hydrocephalus, tetralogy of Fallot, clubfeet, short stature, and facial dysmorphism was reported to carry the 15q24 deletion (72.130–76.080 Mb, hg18) (El-Hattab et al. 2009). Deletion of this region is associated with 15q24 microdeletion syndrome, which is characterized by developmental delay, intellectual disability, and characteristic facial features. While congenital malformations are rarely reported, these include structural brain anomalies, cardiovascular malformations, congenital diaphragmatic hernia, intestinal atresia, and imperforate anus (Magoulas and El-Hattab 2012). Together, these two cases demonstrate the variability of 15q24 microdeletion syndrome and suggest the phenotypic spectrum should be expanded to include spinal NTDs.

    The maternally inherited deletion of VANGL2 in an anencephalic fetus (case 7), demonstrates the incomplete penetrance associated with this gene. VANGL2, associated with autosomal dominant NTDs (MIM # 182940) has an estimated penetrance of 75% (Fineman et al. 1982). The incomplete penetrance suggests the role of other modifiers, genetic, or environmental. However, none was identified in this case. Additional comprehensive genomic/epigenomic analysis in a cohort of cases with pathogenic VANGL2 variants would be needed to identify candidate modifiers, if any.

    The anencephalic fetus with cervical rachischisis and multiple abnormalities (bilateral cleft lip and palate and intestinal malrotation) (case 8) carried a partial deletion of the AMER1 (Supplemental File 1), which is associated with X-linked dominant (XLD) osteopathia striata congenita with cranial sclerosis (OS-CS) (MIM# 300373). To date, one case with spina bifida occulta affecting the third through the fifth lumbar vertebrae and the first sacral vertebra was reported in one female in a kindred with four females in three generations affected with OS-CS (Winter et al. 1980). A single case of holoprosencephaly was reported in a family with five affected females in three generations with OS-CS (Savarirayan et al. 1997). However, neither of these cases had a molecular diagnosis; thus, this represents the first reported case of anencephaly and cervical rachischisis in a female with OS-CS associated with an AMER1 deletion. Collectively, these findings expand the phenotypic spectrum of AMER1 to include NTDs.

    SVs overlapping genes associated with NTDs in mouse

    Over 400 genes have been observed to cause NTDs in mouse models (Harris and Juriloff 2010; Lee and Gleeson 2020); however, only a limited number of these genes have subsequently been associated with NTDs in humans. In this cohort, nine human homologs of genes associated with NTDs in mouse have been identified in association with NTDs. Six of these genes are being proposed as human NTD-associated genes for the first time, with strong or moderate evidence for pathogenicity.

    Four cases carried deletions of genes associated with NTDs in mice (FOXD4, RBBP4, TGIF1, and FOXH1). FOXD4 (case 9, Fig. 2B; Table 1), is a forkhead transcription factor, that regulates the transition from pluripotent embryonic stem cells (ESCs) to neuroectodermal stem cell, and is necessary for neuronal differentiation (Neilson et al. 2012). McMohan et al. (2022) observed craniofacial malformations and neural tube closure defects in mouse models with loss of FOXD4 function. FOXD4 was found to be essential in the anterior mesendoderm and later in the anterior neuroectoderm for rostral neural tube closure and neural crest specification during head development (McMohan et al. 2022). Loss of RBBP4 (case 10, Fig. 3A) results in a defective inner cell mass, severe apoptosis, hyperacetylated histones, and ultimately lethality in mice. Experiments in zebrafish show that RBBP4 is required for neural crest development and persistent neurogenesis in the midbrain (Miao et al. 2020). RBBP4 is highly conserved (pLI of 1) and the functional evidence suggests that LoF variants in this gene might result in severe birth defects, which could be lethal in humans. Loss of TGIF1 (case 11, Fig. 3B) causes exencephaly in the mouse and has been associated with holoprosencephaly type 4 in humans (MIM #142946). Odent et al. 1998 demonstrated autosomal dominant inheritance with incomplete penetrance was the mode of inheritance associated with this gene. Although human NTDs have not been previously associated with TGIF1, the observed phenotype in mouse and anencephaly in case 13, expands the phenotypic spectrum associated with TGIF1 to include NTDs.

    A combinatorial effect of IFT88 and ZMYND12 variants was suspected in case 12. Loss of IFT88 is known to cause exencephaly in mice (Pazour et al. 2000; Ohazama et al. 2009; Tian et al. 2017), and has been implicated as a risk factor for myelomeningocele in humans (Au et al. 2021); while ZMYND12 has been proposed as a candidate gene for ciliary transition zone defects (Wang et al. 2022). The craniorachischisis fetus (case 13), carried FOXH1 and 16p12.2 microdeletion. The regulatory region of FOXH1 was assessed using the ENCODE data set, suggesting the deleted region included cis-regulatory elements (cCREs), also supported by H3K27 acetylation marks. Additionally, one anencephaly case in our internal database carried a heterozygous deletion of FOXH1. Functional analysis of the human NODAL-signaling pathway has linked FOXH1 with congenital heart defects and holoprosencephaly (Roessler et al. 2008), missense variants in zebrafish cause tetralogy of Fallot and holoprosencephaly, while loss of FOXH1 in mice results in anencephaly. Although the human phenotype associated with FOXH1 remains to be elucidated, the mouse phenotype and the two cases in this series suggest an association with cranial NTDs.

    One anencephalic fetus (case 14), with insertion in CNOT10 was also deemed novel. Although the functional consequence of the insertion was not determined, an additional case with a heterozygous deletion of CNOT10 was identified in a case of fetal demise. CNOT10 is a core subunit of the evolutionarily conserved CCR4-NOT complex, which functions in the cytoplasm as the major mRNA deadenylase in both constitutive mRNA turnover and regulated mRNA decay pathways. CNOT10 is highly conserved (pLI of 1) and given the association of other CNOT genes with severe recognizable syndromes (CNOT1 MIM# 618500, 619033; CNOT2 MIM# 618608; CNOT3 MIM# 618672), the two cases in this cohort suggest that LoF variants in CNOT10 could be lethal. However, additional cases are required to further delineate the phenotype associated with CNOT10 variants. Three additional cases carried rare duplications of TWIST1 (case 15), TRIM28 (case 16), and MIB2 (case 17). Although these were rare SVs, whether gain-of-function (GoF) would result in NTDs is questionable as mouse associations have been for LoF variants. Nevertheless, the rare duplication of these genes associated with NTDs in mouse warrants further analysis and is reported for the purpose of future discovery and research.

    SVs overlapping genes implicated in NTD pathways

    The unbiased genome-wide SV analysis led to the identification of 15 additional rare variants in 14 cases. Most notable finding was in two cranial NTD cases (cases 18 and 19), that carried a balanced inversion disrupting RMND5A. The inversion was mediated by sequence homology at both breakpoints, which might explain the occurrence of the same SV in two unrelated cases. RMND5A (pLI of 1) is highly conserved and encodes a ubiquitin ligase, which is highly expressed in the neural ectoderm, and suppression of its function during embryonic development in model organisms results in brain malformations, specifically in dorsal regions of the pros- and mesencephalon, the eyes, and ciliated cells of the skin (Pfirrmann et al. 2015). An 80.65 kb gain of Chromosome 2p11.2 that partially overlapped the RMND5A gene (exons 3–9) has been previously reported in a male child born at 33 weeks gestation with a giant occipitoparietal meningoencephalocele (Vogel et al. 2012). The breakpoint of the duplication within the RMND5A gene overlaps with the inversion breakpoint detected in the two cases in this study. Although the orientation of the duplication was not investigated by the authors, an inverted duplication cannot be ruled out given the sequence homology-mediated inversion detected in this cohort. Nevertheless, the conserved role of the RMND5A gene, and these three cases collectively highlight RMND5A as a new gene associated with cranial NTDs.

    Also, notable were cases 20 and 21 that carried a duplication of HNRNPC (Fig. 5A,B). Dual 5′-UTR variants were constructed using synthetic biology techniques and showed increased transcription and translation of the gene (Balzer Le et al. 2020); thus, the functional consequence of both variants, the 5′-UTR duplication and whole-gene duplication, is expected to be GoF. HNRNPC belongs to the ubiquitous HNRNP family that plays various roles in mRNA processing and is a key regulator of gene expression. Seven HNRNP family members, including HNRNPC, have been associated with neurodevelopmental disorders. LoF and splicing variants in the HNRNPC gene (pLI of 1) were recently associated with neurodevelopmental disorders. Although no GoF variants were reported, functional studies demonstrated that either reduced or enhanced levels of HNRNPC affected neuronal migration (Niggl et al. 2023). HNRNPC along with two additional RNA-binding proteins were found to be overexpressed in folic acid deficient (FAD) H9 human ESCs into neural crest cell (H9-NCCs) models in vitro and in the FAD mouse model in vivo. An upregulation of these RNA-binding proteins was suggested to contribute to folate deficiency-induced neural crest cell dysfunction (Liu et al. 2020). Whether the NTDs in these cases resulted from HNRNPC duplication alone or gene–environment interaction is unknown; however, the evidence implicates HNRNPC in the development of NTDs.

    Another gene identified as a candidate is PFKFB4, which is a glycolysis regulator and has been shown to regulate dorsal ectoderm global patterning in gastrulating frog embryos. PFKFB4 is required for dorsal ectoderm progenitors to proceed toward neural and nonneural ectoderm, neural crest, or placodes (Pegoraro et al. 2015). The identification of a de novo deletion in an NTD fetus, and the early embryonic function warrants further analysis of this gene to be associated with NTDs.

    Four cases harbored deletions of ZNF266, PDE8A, C1QTNF3, and BNIP1 genes, and although these represent rare variants, additional cases are needed to determine the impact of these variants in the human phenotype. The last five cases are all fetuses with anencephaly that carry a duplication of a gene that is associated with a syndrome with either LoF or missense variants, but not duplication variants. These include duplication of HCCS and AMELX in a female fetus with anencephaly (case 27), which are associated with XLD linear skin defects with multiple congenital anomalies and XLD amelogenesis imperfecta, type 1E, respectively; rearrangement on Chromosome 1 that includes SDHB, which is associated with AD paraganglioma (MIM# 606864) (case 28); duplication of Chromosome 8p including the NKX2-6 gene, associated with conotruncal heart malformations (MIM# 217095) (case 29); duplication of KCNMA1, associated with Liang-Wang syndrome, AD (MIM# 618729) (with LoF) (case 30); and a partial duplication of Chromosome1q21.1 region (case 31) that is associated with dysmorphic features, developmental delay, and brain anomalies. However, additional cases and studies are needed to confirm pathogenicity and association with the NTD phenotype. This study constitutes the first systematic assessment of SVs using OGM to identify the genomic determinants in NTDs. This study identified RMND5A, HNRNPC, FOXD4, and RBBP4 to be strong candidates associated with NTDs, and expanded the phenotypic spectrum of AMER1 and TGIF1 to include NTDs. The data provide key insights into the pathogenesis of NTDs and demonstrate the contribution of SVs in the genome to these birth defects.

    The results need to be validated in future studies with larger sample size. A statistical SV burden analysis was not performed for this cohort because of the lack of a comparable control data set. A statistical analysis would be performed in the future once a comparable control data set is established on the OGM platform. Sequence variants, small SVs (<500 bp; limit of resolution of OGM), short and long noncoding RNA, polygenic, and epigenetic modifications were not investigated in this study. Additionally, parental samples were not available for several cases, and all SVs were not confirmed by orthogonal methods. Expression analysis and functional studies were beyond the scope of this study.

    Methods

    Cohort

    Samples for this study were obtained through the South Carolina anatomical gift act, and the protocol was reviewed and approved by the Self Regional Healthcare Institutional Review Board (IRB #Pro00057750). A total of 104 NTD cases were included in the study, of which 45 were spina bifida, 43 anencephaly, six encephalocele, and 10 craniorachischisis. Of these 104 cases, 74 presented as isolated defects and 30 with additional malformations. The families/mothers were interviewed under the South Carolina Neural Tube Defects Program, and clinical history was documented under the surveillance and prevention activities conducted by the Greenwood Genetic Center since 1992. The demographic and clinical parameters of females carrying fetuses with NTDs included in this study are listed in Table 4.

    Table 4.

    Demographic and clinicopathological findings in NTD cases and mothers

    Optical genome mapping

    OGM was performed following the manufacturer's protocol (Bionano Genomics, San Diego, CA, USA) as described previously (Sahajpal et al. 2023b). Briefly, ultra-high-molecular-weight DNA was isolated using a nanodisc-based isolation method, which was then labeled with direct labeling (DL)-green fluorophores at a specific six-base sequence motif (CTTAAG) using an enzymatic reaction. The DNA backbone was stained blue and was loaded onto a specialized chip wherein the DNA molecules were linearized with the help of electrophoresis and the ultra-long labeled and stained DNA molecules were imaged at a single molecule resolution on the Saphyr instrument. Analytical quality control (QC) targets were set to achieve >70× effective coverage of the genome, >60% mapping rate, 13–17 label density (labels per 100 kb), and >220 kb N50 (of molecules >150 kb).

    OGM variant calling and data analysis

    The images of labeled and stained single DNA molecules were converted into digital barcodes from which a de novo genome assembly was constructed using the de novo genome assembly pipeline (Bionano Access (v.1.7)/Solve (v.3.7)) for all samples. The assembled de novo genomes were compared to the reference human genome (GRCh38) for variant calling (deletions, duplications, insertions, inversions, translocations, repeat expansions, and contractions) based on the differences in the alignment of labels between the sample and the reference assembly. Additionally, a coverage-based algorithm enabled the detection of large CNVs and aneuploidies. For data analysis, variants were filtered using the following criteria: (1) The manufacturer's recommended confidence scores were applied: insertion: 0, deletion: 0, inversion: 0.01, duplication: − 1, translocation: 0, and copy number: 0.99 (low stringency, filter set to 0); (2) the GRCh38 SV mask filter that hides any SVs in difficult-to-map regions was turned on for analysis; (3) to narrow the number of variants to be analyzed, polymorphic variants that were present in >1% of an internal OGM control database (n > 300) were removed; (4) The relevance of SVs was ascertained using a stepwise approach utilizing population databases and an internal CMA database (>20,000 chromosomal microarray analyses performed on samples submitted for clinical testing, primarily consisting of individuals affected with rare disorders). Briefly, an unbiased genome-wide SV analysis was performed, and relevance of each variant was ascertained manually, primarily based on frequency in population database (Database of Genomic Variants) and evidence for association/involvement in the NTD/development process based on literature reports. This approach segregated variants into (a) diagnostic findings; (b) SVs overlapping genes associated with NTDs in mouse; and (c) SVs overlapping genes involved in neural tube development pathways. The internal CMA database was reviewed to check the frequency of CNVs in these shortlisted regions/genes, and the phenotypes were compared to rule in/out the SVs for potential association with NTDs.

    Confirmation of candidate SVs using orthogonal methods

    Quantitative PCR

    Quantitative PCR (qPCR) performed on genomic DNA (gDNA) was used to confirm candidate copy number variants. Primer sequences were designed using NCBI and Primer3 software. For each candidate and four controls, PCR amplicons were generated in triplicate using the 7500 real-time thermocycler (Thermo Fisher Scientific, San Francisco, USA). Relative quantitation of genomic dosage was performed using the ΔΔCt method to calculate the copy number of each candidate normalized to the qPCR product of a reference gene in comparison to the control DNA (Livak and Schmittgen 2001).

    Chromosomal microarray

    gDNA was analyzed using the CytoScan HD assay following the manufacturer's protocol (Thermo Fisher Scientific) (Uddin et al. 2015). Briefly, gDNA was digested by the Nsp1 restriction enzyme and amplified to produce amplicons in the 200–1100 bp range, which were then purified and digested with DNase I to produce 25–125 bp fragments. The fragments were end-labeled with a modified biotinylated base and hybridized overnight (16–18 h) to the array. After incubation, the array was washed and stained with a streptavidin-coupled dye and a biotinylated anti-streptavidin antibody. Finally, the array was scanned with the GeneChip Scanner, to generate cell files that included the intensity probe signals. The data were analyzed using the Chromosome Analysis Suite (ChAs 3.0) software, where the signal for the sample was compared with a reference. Differences in signal between the sample and reference were expressed as a log2 ratio representing relative intensity for each marker. A discrete copy number value was determined from the relative intensity data (Uddin et al. 2015).

    Karyotyping and fluorescent in situ hybridization

    Karyotype analysis was performed by standard G-banded analysis with 20 metaphases evaluated per case. Fluorescent in situ hybridization was performed using targeted probes with 200 interphase cells analyzed per probe (Cui et al. 2016).

    XCI assay

    XCI assay was performed using the androgen receptor (AR) locus (Caylor 2023). Briefly, the XCI pattern was determined by PCR analysis of a polymorphic trinucleotide repeat in the first exon of the AR gene with or without digestion with the methylation-sensitive enzyme HpaII (Pegoraro et al. 1994). Samples with a skewing ratio below 80% were classified as “Randomly inactivated,” 80%–90% as “Moderately skewed,” and 91%–100% as “Highly skewed.” Cutoffs were set based on the XCI skewing distribution in healthy individuals (Amos-Landgraf et al. 2006; Shvetsova et al. 2019).

    Data access

    The raw OGM data generated in this study has been submitted to BioStudies (https://www.ebi.ac.uk/biostudies/studies/S-BSST1664) under accession number S-BSST1664.

    Competing interest statement

    R.E.S. is a clinical consultant of Bionano Genomics (San Diego, CA, USA). A.C. and A.H. are salaried employees at Bionano Genomics (San Diego, CA, USA).

    Acknowledgments

    Partial funding for this research was provided by Bionano Genomics (San Diego, CA, USA), and the South Carolina Department of Disabilities and Special Needs (SC, USA). We would like to thank the patients and their families for their participation in this study.

    Author contributions: R.E.S. designed the study. R.E.S. and J.D. selected samples for the study. N.S.S. performed data analysis. N.S.S., B.H., and R.E.S. wrote the manuscript. N.S.S., J.D., B.H., T.F., C.S., A.H., B.R.D., A.C., M.J.F., and R.E.S. reviewed the article.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.279318.124.

    • Freely available online through the Genome Research Open Access option.

    • Received March 15, 2024.
    • Accepted February 6, 2025.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    References

    Articles citing this article

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server