A privacy-preserving solution for compressed storage and selective retrieval of genomic data

Figure 4.

(A) Storage and (B) runtime on high-coverage clinical data. The data come from a public cell line (NA12878), based on a gene panel that includes the CFTR gene, which is queried for diagnosing cystic fibrosis. The data set has an average coverage of 1035× and contains more than 4 million reads, each around 300 bases long. ARHMH2013 (Ayday et al. 2013) is a privacy-preserving solution for BAM files that does not address the compression requirement. We observe consistent compression performance of SECRAM on high-coverage clinical sequence data. Moreover, querying for the CFTR gene on SECRAM takes less than twice as long as on unencrypted CRAM.

This Article

  1. Genome Res. 26: 1687-1696
