Sequence determinants of improved CRISPR sgRNA design
- Han Xu1,
- Tengfei Xiao1,
- Chen-Hao Chen2,
- Wei Li1,
- Cliff Meyer1,
- Qiu Wu1,
- Di Wu3,
- Le Cong4,
- Feng Zhang4,
- Jun S. Liu3,
- Myles Brown1 and
- Shirley X. Liu1,5
- 1 Dana-Farber Cancer Institute;
- 2 Harvard Medical School;
- 3 Department of Statistics, Harvard University;
- 4 Broad Institute of MIT and Harvard
- ↵* Corresponding author; email: xsliu{at}jimmy.harvard.edu
Abstract
The CRISPR/CAS9 system has revolutionized mammalian somatic cell genetics. Genome-wide functional screens employing CRISPR/Cas9-mediated knockout or dCas9 fusion-mediated inhibition/activation (CRISPRi/a) are powerful techniques for discovering phenotype-associated gene function. We systematically assessed the DNA sequence features that contribute to single guide RNA (sgRNA) efficiency in CRISPR-based screens. Leveraging the information from multiple designs, we derived a new sequence model for predicting sgRNA efficiency in CRISPR/Cas9 knockout experiments. Our model confirmed known features, and suggested new features including a preference for cytosine at the cleavage site. The model was experimentally validated for sgRNA-mediated mutation rate and protein knockout efficiency. Tested on independent datasets, the model achieved significant results in both positive and negative selection conditions, and outperformed existing models. We also found that the sequence preference for CRISPRi/a is substantially different from that for CRISPR/Cas9 knockout and propose a new model for predicting sgRNA efficiency in CRISPRi/a experiments. These results facilitate the genome-wide design of improved sgRNA for both knockout and CRISPRi/a studies.
- Received February 23, 2015.
- Accepted June 10, 2015.
- Published by Cold Spring Harbor Laboratory Press
This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.











