Skip to main content

Low incidence of SNVs and indels in trio genomes of Cas9-mediated multiplex edited sheep

Abstract

Background

The simplicity of the CRISPR/Cas9 system has enabled its widespread applications in generating animal models, functional genomic screening and in treating genetic and infectious diseases. However, unintended mutations produced by off-target CRISPR/Cas9 nuclease activity may lead to negative consequences. Especially, a very recent study found that gene editing can introduce hundreds of unintended mutations into the genome, and have attracted wide attention.

Results

To address the off-target concerns, urgent characterization of the CRISPR/Cas9-mediated off-target mutagenesis is highly anticipated. Here we took advantage of our previously generated gene-edited sheep and performed family trio-based whole genome sequencing which is capable of discriminating variants in the edited progenies that are inherited, naturally generated, or induced by genetic modification. Three family trios were re-sequenced at a high average depth of genomic coverage (~ 25.8×). After developing a pipeline to comprehensively analyze the sequence data for de novo single nucleotide variants, indels and structural variations from the genome; we only found a single unintended event in the form of a 2.4 kb inversion induced by site-specific double-strand breaks between two sgRNA targeting sites at the MSTN locus with a low incidence.

Conclusions

We provide the first report on the fidelity of CRISPR-based modification for sheep genomes targeted simultaneously for gene breaks at three coding sequence locations. The trio-based sequencing approach revealed almost negligible off-target modifications, providing timely evidences of the safe application of genome editing in vivo with CRISPR/Cas9.

Background

The type II bacterial clustered, regularly interspaced, palindromic repeats (CRISPR)-associated (Cas) system is a powerful and versatile genome editing tool, which has already been tested in the genomes of large animal models [1,2,3,4]. This system consists of a single polypeptide endonuclease Cas9 complexed with a single guide RNA (sgRNA) that allows complementation for up to 20 nucleotides of target DNA sequence. It is not uncommon for the relatively short target recognition requirement to allow unwanted off-target mutations, because of potential for non-unique matching and allowance of sequence mismatches distal from the PAM sequences at the 5′ end of sgRNAs.

Off-target effects of the CRISPR/Cas9 system have raised serious concerns about its ultimate applicability. Schaefer et al. previously reported that gene editing can introduce hundreds of unintended mutations into the genome, and have attracted wide attention. This conclusion resulted from sequencing two entire genomes of mice that had undergone CRISPR-mediated gene editing and an untreated control mouse genome [5]. However, since only one control animal, one guide RNA and a single CRISPR-vector for DNA injection were used [5, 6], the experiment design has very limited power to show that the magnitude of off-target effects is unusually high. However, this conclusion has been recently corrected by themselves when using additional mouse lines for further whole genome sequencing (WGS) analyses [7].

WGS provides a better dataset to fully characterize the mutation profiles at genome-wide scale [8, 9], compared with Sanger or short read deep sequencing of target and predicted off-target PCR amplicons. Furthermore, WGS data can be used to comprehensively look at sequence nucleotide variants (SNV), smaller (< 100 bp) insertion/deletions (indels), and structural variants (SVs) (e.g. inversions and translocations) [10]). In the present study, we used the genetically modified sheep generated through multiplex injection of six sgRNAs targeting three genes (MSTN, ASIP, and BCO2) and Cas9 mRNA in one-cell-stage embryos by previous studies [11, 12].

Take advantage of family trio-based WGS which is capable of discriminating variants in the edited progenies that are inherited, naturally generated, or induced by genetic modification, we went back and completed comparative mutational analyses of WGS from three trios (founder animals with edits and their parents). We found that the rate of de novo mutations in the edited animals was equivalent to the mutation rate in human studies, suggesting the de novo SNVs were rather resulted from normal spontaneous mutagenesis, and not induced by genetic modification.

We herein address the concerns about the unintended off-target mutations in CRISPR/Cas9-mediated edited animals, and provide evidence to show that the incidence of off-target mutations in large animals generated by CRISPR/Cas9 system may be sufficiently low enough to apply this technology for generating viable animals.

Results

Analyses of WGS family data

Cas9 mRNA and gRNAs for three functional gene targets (MSTN, ASIP, and BCO2) were multiplex-injected into one-cell-stage sheep embryos, to produce animals with gene disruptions affecting observable phenotypes for muscling, coat color and type [11]. We selected three family trios for founder animals #25, #28, and #A9 for WGS variant analyses. The edited genotypes of these founders are summarized in Fig. 1 as not all founders were successfully edited at all three target sites. Specifically, genetic modification occurred in all the three genes for #28, at ASIP and BCO2 loci for #25, and only at MSTN for #A9.

Fig. 1
figure1

Outline of identifying de novo variants that may be induced by genetic modification through WGS of three family trios in sheep. #25, # 28, and #A9 are gene-edited animals that generated by CRISPR/Cas9, colors in the founder animals indicated the information of targeted genes

WGS of three trios animal genomic DNAs yielded a total of 550 Gb of raw data, and produced between 460 and 566 million sequence reads per animal (Additional file 1: Table S1). Over 99.34% of the generated sequence reads were mapped, indicating that high quality sequences were obtained. Sequencing depth per animal ranged from 23.08–28.87× genome coverage after alignment to the sheep reference genome (Oar v3.1) [13], which resulted in identification of 21,086,636 SNVs.

Identification and validation of de novo SNVs

To detect mutations that may be derived from Cas9-mediated genetic modification, a series of stringent variant filtering procedures were completed (Fig. 2a). The completion of a series of stringent variant filtering procedures, combined with validation with Sanger sequencing using all trio members and the offspring of #28, we show that only ~ 20 de novo SNVs remained in each edited animal (Table 1). Specifically, 710, 1561, and 632 SNVs were identified after SNV calling, quality check, and filtering SNVs that existed in our sheep SNP database (294 genomes from 74 breeds, > 79 million SNPs, unpublished data). To verify the existence of these SNVs, we randomly selected 20 SNVs for each founder animal, and sequenced all trio members by Sanger sequencing. We could only validate 9–30% of the SNVs (4/13, 1/11, 2/12) in each founder (Table 1). We then conducted stringent filtering procedures including read depth, phred-scaled likelihood (PL) scores [14], and manual examination to show only 26, 20, and 16 SNVs remained as unique in each founder. Sanger sequencing confirmed that most of these SNVs existed and were not inherited from their parents (Additional file 1: Figure S1), suggesting the pipeline was robust for identification of de novo SNVs. In addition, we sequenced the edited sites and 6 randomly selected SNVs in the offspring (n = 3) of founder animal #28 (Additional file 1: Figure S2 and S3), to confirm that Cas9-mediated mutations and nature de novo mutations were both transmittable at the germline level [15].

Fig. 2
figure2

Identification of de novo SNVs through WGS of three family trios that include edited sheep. (a) The predicted off-target sites by CasOT and Cas-OFFinder at each target site. (b) The distances between 100,000 randomly selected sites (upper), and de novo SNVs (below) to predicted off-target sites. The off-target sites were defined as 1 mismatch at seed region, and up to 4 mismatch at non-seed region. The least distance to predicted off-target sites was chosen. The area between two dashlines represents the 95% confidence interval

Table 1 Summary of SNVs called in the founder animals of three trios by WGS

Tsai reported that less overlapped off-targets were obtained when using different computational tools [16]. We then predicted the potential off-target sites with two computational programs, CasOT [17] and Cas-OFFinder [18]. We found that, under the same filtering parameters (up to 5 mis-matches [19]), the Cas-OFFinder is able to identify much more off-targets than the CasOT program (Fig. 2b). We also found over half of the off-targets (53.6%) identified by CasOT can be found in the off-target sites predicted by Cas-OFFinder (Fig. 2a). More importantly, we simulated the distance between 100,000 randomly selected sites, and de novo SNVs to the potential off-target sites that predicted both with CasOT [17] and Cas-OFFinder [18] (Additional file 2: Table S2), no significant effects were observed between the random selected sites and de novo SNVs (Fig. 2b). These results highlighted that de novo SNVs were rather the result of normal spontaneous mutagenesis, and not induced by genetic modification.

Identification and validation of de novo indels and SVs

Next, we did a comparative analysis within each trio to search for Cas9-induced small indels and SVs, given the possible likely outcomes of Cas9 induced double-strand breaks (DSB) repaired via non-homologous end-joining (NHEJ). After stringent indel filtering procedures including read depth, PL value and manual examination, 25, 26, and 17 indels were determined as candidate de novo indels in each founder animal (Table 2). Besides the five edited indel sites confirmed by sequencing (Additional file 1: Figure S4), we also validated several indels for each trio (Additional file 2: Table S2).

Table 2 Summary of indels and SVs called in the founder animals of three trios by WGS

We then searched for large-scale genomic alterations induced by Cas9. We generated a short list of candidate SVs in each trio (Fig. 3a, Table 2, Additional file 2: Table S2), none of them were close or within the edited sites, except for a single inversion found at MSTN in edited animal #25. This 2.4 kb inversion resides between the two DSBs induced by Cas9 among 8 bp downstream and 2 bp upstream of the PAM sites, and perfectly fit with the gap between sg1 and sg2 sites (2438 bp in length) (Fig. 3b). We further verified the existence of this inversion in #25 by amplification of the upstream and downstream junction sequences (Fig. 3c), we assume that this inversion was induced by simultaneous site-specific DSB at the sg1 and sg2 sites. However, there was no visual or clinical evidence that genetic modification caused any abnormality for founder #25 (both ASIP and BCO2 disrupted) so far (2 years old), it seems that the inversion did not influence animal’s health. Nonetheless, we checked for this inversion occurred in other founders obtained through genome editing (n = 54) and found no other evidence of this event (Additional file 1: Figure S6).

Fig. 3
figure3

Validation of the 2.4 kb inversion at MSTN locus in founder #25. (a) Schematic distribution of SVs in the edited animals. (b) Diagram of the MSTN gene targeted by CRISPR/Cas9 with two sgRNAs. The sgRNA-targeting sequences are highlighted. The inversion region is marked with light blue background. The protospacer adjacent motif (PAM) sequences of NGG are highlighted in red. The positions of the forward and reverse PCR primers used for PCR amplification are indicated by arrows. (c) PCR results confirmed predicted inversion of targeted region at MSTN locus in founder #25

Taken together, our WGS analyses findings for de novo variants indicates that Cas9-induced genetic modifications did not induce significant off-target alterations at the whole-genome level for 3 of our sampled 54 founder animals. Furthermore, our results suggested that across sequence variant type all the de novo variants except one were randomly accumulated during parental gamete or embryonic development and were not the consequences of off-target activities that were induced by Cas9 modification.

Discussion

Recent studies have demonstrated that the type II CRISPR-Cas9 system of Streptococcus pyogenes holds great promise to modify gene variants that express valuable traits in a wide variety of crops and animals [20, 21]. We previously show that site-specific gene modification can be efficiently achieved by co-injection of Cas9 mRNA and sgRNAs that target multiple genes in sheep [11]. The integrity of the resultant modified genomes may have some heightened biosafety concerns from regulators and producers; however, our data unequivocally shows what would be predicted. The rate of natural de novo mutations for sequence variants is more than an order of magnitude higher than unintended off-target mutations.

This study represents the first comprehensive trio-based WGS effort to identify genome-wide de novo variants through a filtering pipeline which includes using the largest sheep sequence data so far. We show the power of WGS in the identification of structural genomic changes which cannot be recognized by PCR-base method (e.g. PCR-RFLP or Sanger sequencing). We were able to identify 26, 20, and 16 germline de novo SNVs in these three founder animals from each trio respectively, the estimated germline de novo SNV rate (1.36 × 10− 8) in the present study, which was equivalent to the germline de novo mutation rate in parent-offspring trios in humans (1.17 × 10− 8 in CEU; 0.97 × 10− 8 in YRI) [22,23,24] and a large cattle population (1.20 × 10− 8) [25]. These results indicated that the de novo SNVs in the founder animals are generated naturally rather than induced by genetic modification. Our study adds important new perspectives to the recently reported results by Schaefer et al. [5], in which the unintended mutations in two mutant animals are likely to have preexisted prior to injection or were caused by residual, overexpression of Cas9 from the injected vector.

Through deep sequencing which is able to fully characterize the mutational profiles [8, 9], combined with Sanger sequencing validation using all trio members and the offspring of the founder animal #28, we found that off-target mutations that could be attributable to the nucleases in the founders within these three trios are rare. We therefore provide primary support that Cas9-mediated gene modification induced little or no off-target mutations in the present study, consistent with the recent reports of low incidence of CRISPR/Cas9-mediated off-target mutations in the human cells [26, 27] or mice [28, 29]. Although a 2.4 kb inversion was identified in the founder animal #25, which presumably due to DB-induced cleavage at two sgRNA target sites, the proportion of this inversion is rare when performing multiplex genetic manipulation in sheep genome (1.85%, 1 out of 54 individuals).

On the other hand, Cas9-induced DSB provide an alternative approach to achieve precise inversions or duplications of DNA fragments using two or more sgRNAs in manipulating mammalian genomes [30, 31]. Because the occurrence of off-target effects is one of the crucial issues in the use of genome editing in agriculture [32], our results indicated we could generate animals with no detectable off-target mutations, and further confirmed the reliability of a multiplex-based CRISPR/Cas9 approach for the production of gene-edited animals.

Conclusions

In summary, we present here the first report of characterizing candidate de novo mutations through trio-based re-sequencing in sheep. Those identified de novo mutations are highly important to advance the understanding off-target modifications in genetic modified animals produced by zygote injection, and thereby providing primary results to evaluate the bio-safety of gene-edited animals.

Methods

Animals and pedigree information

The sheep used in the present study were maintained at the Ningxia Tianyuan Sheep Farm, Hongsibu, Ningxia Autonomous Region, China. Since we transferred pooled embryos which were derived from multiple ewes into surrogates previously [11], we did paternity test to identify family members for the selected three trios. The pedigree information of these three family trios was determined using nine microsatellite markers (BM4621, BM143, OarHH55, OARJMP8, BM6438, BM6506, BM1824, OarDB6, and ILSTS004) reported previously [33]. All protocols involving the use of animals were approved by the College of Animal Science and Technology, Northwest A&F University (Approval ID: 2014ZX08008002).

Whole-genome sequencing

Individuals from three family trios comprising three founders (#25, #28, #A9) were selected for WGS (Additional file 1: Table S1). Genomic DNA was extracted from blood samples (live animals) or ear tissues (stillbirth) using a Qiagen DNeasy Blood and Tissue Kit (Qiagen). For construction of the WGS library, 1 μg of gDNA was fragmented to around 300 bp by ultrasonication using a Covaris S2 system. Then, the sheared DNA fragments were used for library construction using an Illumina TrueSeq DNA library preparation kit. The final quality-ensured libraries were sequenced on an Illumina Hiseq2500 for 125-bp paired-end reads. All the reads that passed the quality control were converted into fastq files.

Identification of de novo variants

All of the reads for each sample were mapped to the sheep reference sequence assembly v3.1 [13] using BWA tools [34]. Local realignment and base quality re-calibration were performed using GATK (v.2.7–2) [35]. The raw sequencing reads were first filtered to remove low quality paired reads with the following criteria: (1) reads with > 10% N bases; (2) reads with > 50% bases with a sequencing quality of < 5; (3) reads with residual length of < 60 bases after the adaptor sequences were trimmed. Both SNVs and indels (2–50 bp) were called using GATK ‘IndelRealigner’ [35] and Samtools [36]. Additionally, SNVs and indels were separately re-calibrated as described in GATK Best Practices, and quality filters (quality score < 50.0, reads coverage> 300) were applied.

Putative de novo SNVs for each founder were extracted according to the following criteria: (1) SNVs that specific for each founder and were also identified by the samtools; (2) filtering SNVs that are existed in the sheep SNP databases (n = 294, > 79 million SNPs, unpublished data) only with alternative alleles; and filtering SNVs that did not exist in the ovine SNP databases; (3) filtering SNVs that were existed in the reads with alternative alleles (< 25%), and the read depth in founders < 3 within a trio [14]; (4) the normalized, PL scores for the genotypes (AA, AB and BB) of founders (A, reference allele; B, alternate allele), the PL scores for each genotype in founders should be > 20, 0 and > 0, the PL scores for each genotype in both parents should be 0, > 20 and > 20 [14]; (5) the candidate SNVs should be covered with both forward and reverse reads; (6) manual examination was performed to remove mis-aligned or miscalled SNVs. The filtering procedure of indels was same as the extracting of de novo SNVs. SVs were independently detected using BreakDancer [37], LUMPY [38], and CNVcaller [39]. The detailed filtering pipelines for de novo variants are summarized in Table 1.

We searched for putative off-target sites in the sheep genome that might be recognized these six sgRNAs for the MSTN, ASIP, and BCO2 genes using CasOT [17] and Cas-OFFinder [18]. The number of predicted off-target sites were summaries in Additional file 1: Table S2, and we defined the potential off-target sites as 1 mis-match at seed regions, and 4 mis-matches at non-seed regions, according to Boyle et al. [19].

Validation of indels and SNVs in edited animals

The offspring (#171001, #171004, and #171018) of edited animal #28 were generated by natural mating #28 with three wild-type ewes (#1500442, #1500444, #1500510). Sanger sequencing was conducted to verify the regions with genetic modifications, which were identified by WGS. Purification of PCR products, T7E1 cleavage assay, and Sanger sequencing were performed according to Wang et al. [11]. De novo SNVs were selected for validation by Sanger sequencing. Primers that encompassing these SNVs were designed (Additional file 2: Table S2), and were used for PCR amplification. The Integrative Genomics Viewer (IGV) browser was used to visualize the sequence reads and indels on target sites [40].

Abbreviations

ASIP:

Agouti signaling protein

BCO2:

beta-carotene oxygenase 2

BW:

body weight

CNV:

copy number variations

CRISPR:

clustered regulatory interspaced short palindromic repeats

DSB:

DNA double-strand breaks

GAPDH:

Glycerol phosphate dehydrogenase gene

H&E:

hematoxylin-eosin

indels:

insertions or deletions

MSTN:

myostatin

MUT:

mutation

NHEJ:

non-homologous end-joining

PAM:

Protospacer adjacent motif

PL:

phred-scaled likelihood

sgRNA:

single guide RNA

SNV:

single-nucleotide variants

SV:

structure variations

T7E1:

T7 endonuclease 1

TALENs:

transcription activator-like effector nucleases

TEM:

Transmission electron microscopy

WGS:

Whole-genome sequencing

WT:

wild type

ZFNs:

zinc finger nucleases

References

  1. 1.

    Hai T, Teng F, Guo R, Li W, Zhou Q. One-step generation of knockout pigs by zygote injection of CRISPR/Cas system. Cell Res. 2014;24:372–5.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  2. 2.

    Ni W, Qiao J, Hu S, Zhao X, Regouski M, Yang M, et al. Efficient gene knockout in goats using CRISPR/Cas9 system. PLoS One. 2014;9:e106718.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  3. 3.

    Wu M, Wei C, Lian Z, Liu R, Zhu C, Wang H, et al. Rosa26-targeted sheep gene knock-in via CRISPR-Cas9 system. Sci Rep. 2016a;6:24360.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  4. 4.

    Gao Y, Wu H, Wang Y, Liu X, Chen L, Li Q, et al. Single Cas9 nickase induced generation of NRAMP1 knockin cattle with reduced off-target effects. Genome Biol. 2017;18:13.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  5. 5.

    Schaefer KA, Wu W-H, Colgan DF, Tsang SH, Bassuk AG, Mahajan VB. Unexpected mutations after CRISPR-Cas9 editing in vivo. Nat Methods. 2017;14:547–8.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  6. 6.

    Wu W-H, Tsai Y-T, Justus S, Lee T-T, Zhang L, Lin C-S, et al. CRISPR repair reveals causative mutation in a preclinical model of retinitis Pigmentosa. Mol Ther. 2016b;24:1388–94.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  7. 7.

    Schaefer KA, Darbro BW, Colgan DF, Tsang SH, Bassuk AG, Mahajan VB. Corrigendum and follow-up: Whole genome sequencing of multiple CRISPR-edited mouse lines suggests no excess mutations. bioRxiv. 2018:154450.

  8. 8.

    Hendel A, Fine EJ, Bao G, Porteus MH. Quantifying on- and off-target genome editing. Trends Biotechnol. 2015;33:132–40.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  9. 9.

    Huang S, Weigel D, Beachy RN, Li J. A proposed regulatory framework for genome-edited crops. Nat Genet. 2016;48:109–11.

    Article  PubMed  CAS  Google Scholar 

  10. 10.

    Canver MC, Bauer DE, Dass A, Yien YY, Chung J, Masuda T, et al. Characterization of genomic deletion efficiency mediated by clustered regularly interspaced palindromic repeats (CRISPR)/Cas9 nuclease system in mammalian cells. J Biol Chem. 2014;289:21312–24.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. 11.

    Wang X, Niu Y, Zhou J, Yu H, Kou Q, Lei A, et al. Multiplex gene editing via CRISPR/Cas9 exhibits desirable muscle hypertrophy without detectable off-target effects in sheep. Sci Rep. 2016;6:32271.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. 12.

    Niu Y, Jin M, Li Y, Li P, Zhou J, Wang X, et al. Biallelic β-carotene oxygenase 2 knockout results in yellow fat in sheep via CRISPR/Cas9. Anim Genet. 2017;48:242–4.

    Article  PubMed  CAS  Google Scholar 

  13. 13.

    Jiang Y, Xie M, Chen W, Talbot R, Maddox JF, Faraut T, et al. The sheep genome illuminates biology of the rumen and lipid metabolism. Science. 2014;344:1168–73.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. 14.

    Epi4K Consortium, Epilepsy Phenome/Genome Project. De novo mutations in epileptic encephalopathies. Nature. 2013;501:217–21.

    Article  CAS  Google Scholar 

  15. 15.

    Chen Y, Cui Y, Shen B, Niu Y, Zhao X, Wang L, et al. Germline acquisition of Cas9/RNA-mediated gene modifications in monkeys. Cell Res. 2015;25:262–5.

    Article  PubMed  CAS  Google Scholar 

  16. 16.

    Tsai SQ, Zheng Z, Nguyen NT, Liebers M, Topkar VV, Thapar V, et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat Biotechnol. 2015;33:187–97.

    Article  PubMed  CAS  Google Scholar 

  17. 17.

    Xiao A, Cheng Z, Kong L, Zhu Z, Lin S, Gao G, et al. CasOT: a genome-wide Cas9/gRNA off-target searching tool. Bioinformatics. 2014;30:1180–2.

    Article  PubMed  CAS  Google Scholar 

  18. 18.

    Bae S, Park J, Kim J-S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics. 2014;30:1473–5.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. 19.

    Boyle EA, Andreasson JOL, Chircus LM, Sternberg SH, Wu MJ, Guegler CK, et al. High-throughput biochemical profiling reveals sequence determinants of dCas9 off-target binding and unbinding. PNAS. 2017;114:5461–6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. 20.

    Xie K, Zhang J, Yang Y. Genome-wide prediction of highly specific guide RNA spacers for CRISPR-Cas9-mediated genome editing in model plants and major crops. Mol Plant. 2014;7:923–6.

    Article  PubMed  CAS  Google Scholar 

  21. 21.

    Lawrenson T, Shorinola O, Stacey N, Li C, Østergaard L, Patron N, et al. Induction of targeted, heritable mutations in barley and Brassica oleracea using RNA-guided Cas9 nuclease. Genome Biol. 2015;16:258.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. 22.

    Conrad DF, Keebler JEM, DePristo MA, Lindsay SJ, Zhang Y, Cassals F, et al. Variation in genome-wide mutation rates within and between human families. Nat Genet. 2011;43:712–4.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. 23.

    Besenbacher S, Liu S, Izarzugaza JMG, Grove J, Belling K, Bork-Jensen J, et al. Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios. Nature Commun. 2015;6:ncomms6969.

  24. 24.

    Francioli LC, Polak PP, Koren A, Menelaou A, Chun S, Renkens I, et al. Genome-wide patterns and properties of de novo mutations in humans. Nat Genet. 2015;47:822–6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  25. 25.

    Harland C, Charlier C, Karim L, et al. Frequency of mosaicism points towards mutation-prone early cleavage cell divisions. bioRxiv. 2017:079863.

  26. 26.

    Veres A, Gosis BS, Ding Q, Collins R, Ragavendran A, Brand H, et al. Low incidence of off-target mutations in individual CRISPR-Cas9 and TALEN targeted human stem cell clones detected by whole-genome sequencing. Cell Stem Cell. 2014;15:27–30.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  27. 27.

    Yang L, Grishin D, Wang G, Aach J, Zhang C-Z, Chari R, et al. Targeted and genome-wide sequencing reveal single nucleotide variations impacting specificity of Cas9 in human stem cells. Nat Commun. 2014;5:5507.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. 28.

    Iyer V, Shen B, Zhang W, Hodgkins A, Keane T, Huang X, et al. Off-target mutations are rare in Cas9-modified mice. Nat Meth. 2015;12:479.

    Article  CAS  Google Scholar 

  29. 29.

    Shen B, Zhang W, Zhang J, Zhou J, Wang J, Chen L, et al. Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects. Nat Meth. 2014;11:399–402.

    Article  CAS  Google Scholar 

  30. 30.

    Li J, Shou J, Guo Y, Tang Y, Wu Y, Jia Z, et al. Efficient inversions and duplications of mammalian regulatory DNA elements and gene clusters by CRISPR/Cas9. J Mol Cell Biol. 2015;7:284–98.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. 31.

    Park C-Y, Kim DH, Son JS, Sung JJ, Lee J, Bae S, et al. Functional correction of large factor VIII gene chromosomal inversions in hemophilia a patient-derived iPSCs using CRISPR-Cas9. Cell Stem Cell. 2015;17:213–20.

    Article  PubMed  CAS  Google Scholar 

  32. 32.

    Araki M, Ishii T. Towards social acceptance of plant breeding by genome editing. Trends Plant Sci. 2015;20:145–9.

    Article  PubMed  CAS  Google Scholar 

  33. 33.

    Liu L, Jia C, Nuerli A, Zhang W, Zhao Z. Microsatellite DNA markers for paternity test of sheep. Jiangsu J Agr Sci. 2014;30:135–9.

    Google Scholar 

  34. 34.

    Li H, Durbin R. Fast and accurate long-read alignment with burrows–wheeler transform. Bioinformatics. 2010;26:589–95.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  35. 35.

    McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  36. 36.

    Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  37. 37.

    Chen K, Wallis JW, McLellan MD, Larson DE, Kalicki JM, Pohl CS, et al. BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Meth. 2009;6:677–81.

    Article  CAS  Google Scholar 

  38. 38.

    Layer RM, Chiang C, Quinlan AR, Hall IM. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15:R84.

    Article  PubMed  PubMed Central  Google Scholar 

  39. 39.

    Wang X, Zheng Z, Cai Y, et al. CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations. Gigascience. 2017;12:1–12.

    Google Scholar 

  40. 40.

    Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinformatics. 2013;14:178–92.

    Article  PubMed  CAS  Google Scholar 

Download references

Funding

This work is supported by grants from National Natural Science Foundation of China (31402038, 31572369, 31572381, 31772571), and the Tan Sheep Breeding Project from Ningxia (NXTS201601), as well as by China Agriculture Research System (CARS-39) and the National Thousand Youth Talents Plan. The funders of the study had no role in study design, data collection, data analysis and interpretation of data and in writing the manuscript.

Availability of data and materials

The raw sequencing data of eight animals involved in this study are available at the NCBI Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) under accession no. SRX2635108 to no. SRX2635115.

Author information

Affiliations

Authors

Contributions

XW, YJ, TS, BM, XH, and YC conceived and designed the experiments. YN, YL, CL, and JL performed the experiments. SZ and QK Collected the data. XW, JL, BP, and TS wrote the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Xingxu Huang or Yu Jiang or Yulin Chen.

Ethics declarations

Ethics approval and consent to participate

All protocols involving the use of animals were approved by the College of Animal Science and Technology, Northwest A&F University (Approval ID: 2014ZX08008002).

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Supporting figures and tables. Figure S1. Sanger sequencing confirms the SNVs identified by WGS in three trios. Example of SNVs sequenced in trio members. Figure S2. Sanger sequencing validation of the genetic modification SNVs in the offspring of #28 (#171001, #171004, and #171018). Target sequences complementary to sgRNAs of targeted genes are in red text, while the PAM sequences are marked in green. The mutations are marked in blue, dashlines indicate deletions, and lowercases indicate insertions or replacements. Deletions (−) and mutations (m) are shown to the right of each allele. The genotypes are shown to the right with the rates of total clones for TA-sequencing. Figure S3. Sanger sequencing validation of the genotypes of six SNVs in the offspring of #28 (#171001, #171004, and #171018). Figure S4. Sanger sequencing confirms the edited sites (indels) in each gene identified by WGS in three trios. The upper windows were generated by using Integrative Genomics Viewer (IGV) browser (http://software.broadinstitute.org/software/igv/). Target sequences complementary to sgRNAs of targeted genes are in red text, while the PAM sequences are marked in green. The mutations are marked in blue, dashlines indicate deletions, and lowercases indicate insertions or replacements. Figure S5. Genome-wide distribution of putative off-target sites in the three trios used for WGS. Putative off-target sites were identified by aligning the sgRNA sequences to the sheep reference genome (Oar v3.1) allowing for a maximum of five mismatches. Potential off-target sites predicted by both Cas-OT and Cas-OFFinder are displayed using the OmiCircos tool (http://bioconductor.org/packages/release/bioc/html/OmicCircos.html). De novo indels in the founder animals #25 (A), #28 (B), and #A9 (C). Putative off-target sites in ASIP (a), BCO2 (b), and MSTN (c). The position of three targeted genes was highlighted with black dots. Figure S6. Validation of the 2.4 kb inversion in 54 animals. The founder animal #25 was marked with red color. Table S1 Detailed information of the three trios for whole-genome sequencing. (DOCX 1633 kb)

Additional file 2:

Table S2. Summary of predicted of off-targets and de novo variants. (XLSX 587 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, X., Liu, J., Niu, Y. et al. Low incidence of SNVs and indels in trio genomes of Cas9-mediated multiplex edited sheep. BMC Genomics 19, 397 (2018). https://doi.org/10.1186/s12864-018-4712-z

Download citation

Keywords

  • Genome editing
  • CRISPR/Cas9
  • Whole genome sequencing
  • Off-target; mutation rate
  • Sheep