Searching journal content for articles similar to Marchet et al..

Displaying results 1-10 of 93
For checked items
  1. ...), using the shallower and equally biased PacBio CLR data set from Kim et al. (2014), had to manually fill the gaps in Y-linked genes by integrating CDS sequences from FlyBase (Larkin et al. 2021).The bias primarily affects read initiation but also impairs read extension and basecalling, with read...
  2. ...Bruijn graphs are one of the most fundamental data structures in computational genomics. Colored compacted de Bruijn graphs are a variant built on a collection of sequences and associate to each k-mer the sequences in which it appears. We present GGCAT, a tool for constructing both types of graphs, based...
  3. .... 2024) and (2) full-text approaches that create a full-text index to query with substrings of any length or perform alignments to determine the likely origin of a read (Altschul et al. 1990; Matsen et al. 2010; Kim et al. 2016; Menzel et al. 2016; Ahmed et al. 2023b; Song and Langmead 2024).A k-mer-based...
  4. ...to share any sensitive information (Fig. 1). The input data set for each party includes phased haplotype sequences from individuals within that party's cohort. We consider the parties to be honest-but-curious, meaning that they follow our analysis protocol faithfully but might try to infer information...
  5. ...by counting k-mers/motifs (with the motif length k determined by the tandem repeat finder annotation of the locus) in the repeat sequence according to the forward strand of the reference , splitting sequences based on the known repeat motif length but unbiased to which motifs can be found. Each k-mer...
  6. ...resources. In the past, we used RSE to show proof-of-concept that targeting and sequencing the MHC of a homozygous sample (PGF) with short-read Illumina technology was feasible. However, the short reads of the Illumina platform and the limited set of oligos designed to capture the MHC of the homozygous PGF...
  7. ...uses a seed-and-extend approach between a query and reference sequence, and has been used previously in metagenomic read classification (Mäklin et al. 2021; Alanko et al. 2023). We implemented graph psuedoalignment using Bifrost (Holley and Melsted 2020), which builds colored compacted DBGs, whereby k-mers...
  8. ...). Nonbranching paths of k-mers are merged into unitigs (from here referred to as “nodes”). These nodes are “colored” based on the input s they are found in, enabling calculation of the population frequency of all sequences greater than k bases in length.View larger version: In this window In a new window Figure...
  9. ...described in the literature. For most of these, we are reporting for the first time the exact genomic location, at base-pair resolution, of the breakpoints of each rearrangement and the structure at the junction of the breakpoints: strict, microduplications, or inserted sequence (Fig. 4B).View larger...
  10. ...technology has revolutionized biological, biomedical, and clinical research. Hundreds of sequencing-based methods exist today to query gene expression (RNA-seq) (Mortazavi et al. 2008), chromatin state (chromatin immunoprecipitation [ChIP]-seq and ATAC-seq) (Barski et al. 2007; Buenrostro et al. 2013...
For checked items

Preprint Server