Improving the efficiency of genomic loci capture using oligonucleotide arrays for high throughput resequencing
© Lee et al; licensee BioMed Central Ltd. 2009
Received: 18 February 2009
Accepted: 31 December 2009
Published: 31 December 2009
The emergence of next-generation sequencing technology presents tremendous opportunities to accelerate the discovery of rare variants or mutations that underlie human genetic disorders. Although the complete sequencing of the affected individuals' genomes would be the most powerful approach to finding such variants, the cost of such efforts make it impractical for routine use in disease gene research. In cases where candidate genes or loci can be defined by linkage, association, or phenotypic studies, the practical sequencing target can be made much smaller than the whole genome, and it becomes critical to have capture methods that can be used to purify the desired portion of the genome for shotgun short-read sequencing without biasing allelic representation or coverage. One major approach is array-based capture which relies on the ability to create a custom in-situ synthesized oligonucleotide microarray for use as a collection of hybridization capture probes. This approach is being used by our group and others routinely and we are continuing to improve its performance.
Here, we provide a complete protocol optimized for large aggregate sequence intervals and demonstrate its utility with the capture of all predicted amino acid coding sequence from 3,038 human genes using 241,700 60-mer oligonucleotides. Further, we demonstrate two techniques by which the efficiency of the capture can be increased: by introducing a step to block cross hybridization mediated by common adapter sequences used in sequencing library construction, and by repeating the hybridization capture step. These improvements can boost the targeting efficiency to the point where over 85% of the mapped sequence reads fall within 100 bases of the targeted regions.
The complete protocol introduced in this paper enables researchers to perform practical capture experiments, and includes two novel methods for increasing the targeting efficiency. Coupled with the new massively parallel sequencing technologies, this provides a powerful approach to identifying disease-causing genetic variants that can be localized within the genome by traditional methods.
Sequencing capacity has greatly advanced over the years and took a major leap with the commercialization of two new platforms for next-generation sequencers. Currently, three major platforms are actively being used including Applied Biosystems (ABI) SOLiD (Sequencing by Oligo Ligation and Detection), Illumina Genome Analyzer (GA) and Roche 454 Sequencing System [1–3]. Further, proofs of principle experiments with the Helicos system and Pacific Biosciences system have been published [4–6]. These sequencer technologies differ in their sequencing methods and hence they vary in number of reads sequenced, read length, error characteristics. However, all rely on the generation of shotgun libraries for sequencing. With these technologies, a single machine can generate in the range of 0.5-2 gigabases (Gb) of sequence reads per day. While advancement in these technologies is certain, the use of these technologies for sequencing targeted regions of the genome has been limited based on the efficiency of methods to enrich regions of the genome for analysis which are matched to the capacity. The rapid advancement in genotyping technology made possible by the advent of DNA microarrays has resulted in a flood of linkage and whole genome association studies for various disorders, and now the community is overwhelmed with genomic regions of interest for which additional targeted sequence analysis is key bottleneck. Most recently, several studies on exonic capture for broad based sequencing of the amino acid coding portion of the genome have shown successful identification of rare mutations/alleles involved in rare genetic disorders and yielded insights in applying the technique to searching for common variants as well as de novo cancer mutations [7–10].
Several groups have attempted to capture regions of interest by multiplex amplification [11–14]. For the target regions, primer pairs are systematically designed and, as the target regions are amplified by PCR, only the fragments with the right primer pairs are enriched. Reports have demonstrated successful amplification of hundreds of ~200 bp sized fragments but with substantial bias in the amplification between the different fragments. This method may work well for targeting tens of genes but beyond that scale, it requires more effort to design unique primers and optimize the PCR amplification process to ensure uniformity across all fragments. Also, the high cost of primer design and amplification will not compare favorably with that of sequencing as sequencing costs have been reduced significantly. To overcome the cost and effort of a primer design process, Porreca et al. have developed an assay that uses a microarray to design the oligonucleotides in parallel . Using a modification of MIP (molecular inversion probes) assay, 55,000 exons of sizes varying from 60 to 191 bp were targeted and although the specificity of the capture was very good, only ~11,000 exons were captured.
At the same time when Porreca et al. published, Albert et al. and Okou et al. described methods for using DNA arrays to capture large genomic loci directly [15–17]. In the following month, Hodges et al. demonstrated success at capturing 'all' exons using the same methods with better balance of coverage (uniformity) and specificity relative to any other previous capture assays, and it was the least labor intensive and the most cost-effective method . However, the performance of these assays varied across the sample types and array types used.
Albert et al. targeted a total ~5 Mb of sequences for 660 genes dispersed across the genome . They reported the specificity varied with the same array design from 38% to 76% depending on the samples captured. For their tiling arrays encompassing from 200 Kb to 5 Mb around the BRCA1 gene, the fraction of the reads mapped to the intended targets varied from 14% to 64%. The Nimblegen 385 K custom array was used for all capture protocols while the 454 GLX sequencer was used for sequencing. Since the 454 sequencers produce longer individual reads than those from ABI or Illumina sequencers, their sequences are easier to map to the correct genomic location and this should be factored in when comparing capture technologies.
Hodges et al. targeted 'all' human exons using seven Nimblegen arrays and sequenced the captured DNA with the Illumina sequencer . They first hybridized 500 bp genomic fragments to all seven arrays, and the fraction of the reads mapped to the targeted regions varied from 36% to 55%. When they extended the definition of the targeted region to include 300 bp upstream and downstream of each exon, the targeting efficiency was increased to 55-85%. Next, they used 100-200 bp fragments to hybridize to one of the arrays in an attempt to tighten the sequenced region around the targets. However, the specificity of the intended targets was reduced three-fold with the exon coverage rate up to 99%. For both studies, no detailed interpretation was described for the specificity variations across different array designs and sample types.
Here, we concentrate on improving the capture specificity using consistent sample and array design. Throughout the experiment, paired genomic DNA of both cancer and normal tissues were used from a cancer patient. We approached with two different methods to specifically block the adapters while generating the genomic library for hybridization and investigated two sequential rounds of hybridization. These changes resulted in improvements in our measured specificity of the targeted genomic DNA from the same sample and the same array design.
Initial data demonstrated a specificity of the intended capture intervals of 35% with exon hit rate of 99% using 100-200 bp genomic fragments and protocols similar to the published results and Agilent custom 244 K oligo arrays. These results were comparable with the Hodges et al. data using shorter fragments (29% specificity). With these initial data as baseline, we attempted to improve the capture efficiency by changing the hybridization protocol.
Modified capture protocol to block adapter-adapter hybridization
The first change we made to the Agilent hybridization protocol was to block the adapters ligated at the end of every genomic amplicon in the hybridization mix. We assumed that possible hybridization between different genomic targets based on adapter-adapter hybridization will lead to the inadvertent and non-specific enrichment of off-target fragments. This is due to the fact that all of the genomic fragments in the hybridization mix are flanked with the same Illumina adapters (52 nt and 34 nt), which have comparable length to the intended genome location specific target probes (45-60 nt). Thus, the melting temperature between the adapter hybridizations will be similar to that between the appropriate genomic fragment and its specific probe. Moreover, the effective concentration in the hybridization of the adapter sequence is approximately 107 fold higher than the genome specific sequences. Thus, these adapter mediated hybridization may dominate the hybridization process.
Summary of the results.
Single strand separation
Exons with reads*
Modified capture protocol with hybridization repeat
The other protocol modification tested was to repeat the hybridization step with the notion that each round of successive hybridization will further enrich for the target sequence as a substantially simplified amplicon is hybridized in the second round. In 2005, Bashiardes et al. reported in Nature Methods on capturing genomic loci using bacterial artificial chromosome (BAC) in solution . They performed two rounds of hybridization to enhance targeting and achieved 50% specificity. We incorporated this idea and repeated the hybridization step after the 2nd PCR. The targeted specificity was successfully increased up to 90% (Table 1).
This two step modified protocol was independently replicated externally. Total of 2 Mb was targeted for 7,475 non-overlapping exon intervals under 10 linkage regions across the genome. The capture was done by Agilent custom 244 K oligo arrays strictly following the presented protocols except for reducing the 2nd hybridization time to 24 hrs and re-using the same capture array for the 2nd hybridization. The 36 bp single end sequencing was performed using Illumina GA I in the authors' laboratory and the sequences were aligned to the whole genome using MAQ (Mapping and Assembly with Qualities). With the use of blockers, the specificity resulted in 44% and by repeating the hybridization with the blockers, the specificity increased to 84%, which are comparable to the data presented.
Analysis of the sequence data
Genomic intervals with regional aberration detected by SNP array were examined by capture array and the results were compared.
Gain (4 copies)
Gain (25 copies)
Genome is not fragmented after sonication.
Buffer condition is not adaptable for sonication.
Purify the DNA. We used QIAGEN PCR Purification Kit, eluted in EB to have it work.
Nothing is visible on the gel after 1 hr electrophoresis during library generation.
When the starting amount of DNA is small or there is significant DNA loss during the process for various reasons, it is possible that the DNA is smeared over a wide range after an hour of electrophoresis and not visible on the gel.
It is good to check the gel to see if the DNA is present after ~10 min run when the DNA is not smeared at a wide range.
Even though nothing is visible, it is highly possible that the DNA is still present. Proceed to the next step regardless and see if PCR amplifies anything.
Cannot collect ~400 ul after the stripping step.
Gasket slide was re-used.
The array slide was lifted up too quickly.
Different buffer was used for the 95C stripping process.
Do not re-use the gasket slide.
The solution can be flushed to a collection boat and collected.
When using multi-array slides like 2×105 K, 4×44 K or 8×15 K, it is still possible to run the capture protocol as indicated. After the stripping, the array slide should be slowly lifted up to prevent contamination. The solution tends to stay within the gaskets.
Not enough DNA amplified after the first stripping.
Stripping was not efficient.
Another stripping process can be done and checked if there were left over genomic fragments hybridized on the probes. Since it does not matter if the stripped solution contains contaminants as long as the contaminants do not have adapters ligated at the end, it is possible to thoroughly continue the stripping process until no products get amplified.
Two simple optimizations of the hybridization protocol have improved the capture performance significantly. First, by blocking the adapter sequences flanking each of the genomic fragments, we reduced the non-specific pull down through adapter-adapter hybridization. Blocking the nonspecific DNA is an old trick to reduce the background when microarray experiments are performed, with human cot-1 being the most commonly used reagent to block repetitive sequences . Recently, Hodges et al. has shown similar results with the same approach, validating our experimental protocol . Secondly, we repeated the hybridization step to further enrich the genomic fragment pool. While the specificity was enhanced up to 90%, this step introduced ~1% of variant loss and some degree of bias in the relative abundance of specific amplicons. For example, the fold difference observed for EGFR gene was weakened by 2.5-fold when the double hybridization capture protocol was applied, suggesting saturation of the hybridization step effectively normalizing the yield from each amplicon. The overall correlation coefficient between the single hybridization experiment and the double hybridization experiment after excluding the ~100 exons that were outlier was 0.82. This interferes some with the ability to reliable call copy number state of individual exons from the pull down sequence data. Two-round hybridization should be used with caution when copy number detection is critical. The array designed for our current experiments and those in previous reports were all masked for repeats. To test whether including the repeat regions would affect the capture, we have attempted to tile every 15 bp across a 4 Mb region of a single chromosome using Agilent 244 K custom array without vigorously masking for the repeats. The specificity was significantly reduced to 15~30% even with the addition of the primer blockers and increased human cot-1 DNA in the hybridization mix (data not shown). This phenomenon should be taken into account when it is unavoidable to target the repeat regions.
Throughout the experiments, the sequence reads generated were tightly mapped nearby the intended probe regions. For each probe, the local sequence coverage will extend out in relation to the length of genomic fragment library initially created. Without any major variations in the genomic fragments that could interrupt the hybridization to the probe, the sequence coverage will peak within the probe region and decrease with increased distance.
Capturing genomic regions for sequencing has a wide scope of application. Many genetic studies with linkage or association signals will benefit immensely as it becomes possible to reliably and inexpensively capture the region of interest and perform high throughput, shotgun sequencing. Additionally, improvements in exonic enrichment protocols will usher in an era of cost effective sequencing of all the amino acid coding bases of genomes. This will lead to more rapid identification of the causative genes in many disorders.
We chose to capture exonic sequence of ~3,000 cancer genes. Two cancer gene lists, 'cancer gene census list' and 'CGP (cancer genome project) planned studies list' were retrieved from Wellcome Trust COSMIC (catalogue of somatic mutations in cancer) database and combined. Boundaries for exons and UTRs (untranslated regions) were retrieved from RefSeq database and any intervals that overlapped were merged so that non-redundant contiguous intervals were generated. In total, 3,038 genes (31,678 exons) spanning 8.4 Mb were included in the list. Based on the preliminary capture experience data (not shown), we tiled the probes every 120 bp on average so that the distance between the start positions of the two consecutive probes is ~120 bp and the regions between the probes are covered by the two flanking probes resulting in the same coverage as the regions within the probes. Both forward and reverse strands of each probe region were spotted on the array and 3 of the genes (150 exons) of higher interest were spotted at 12× (6× for each strand). Instead of using Nimblegen arrays as other studies have, we used Agilent 244 K custom 60 mer arrays. Probe design was performed using Agilent e-array system http://www.Agilent.com with the repeat mask function on. This resulted in losing 155 exons completely and ~27% of the exons were partially covered (see Additional file 2).
We used a paired normal and tumor whole genomic DNA as the starting materials. The tumor DNA was extracted from the glioblastoma (GBM) specimen and the normal DNA was extracted from the blood sample of the same patient. The collections were approved by the UCLA IRB and the samples were processed at the Biological Samples Processing Core at UCLA using Autopure LS™ nucleic acid purification instrument from Gentra Systems. Both the normal and tumor samples were run on the Affymetrix GeneChip Human Mapping 250 K array for global genomic examination and comparison. Both chromosomal and regional copy number aberrations were detected in the tumor sample with the hallmarks of glioblastoma like EGFR amplification and chromosome 10 loss observed. The detailed description of the mutational landscape and the chromosomal abnormalities of this cancer sample is in preparation (Lee et al.) As we were sequencing with the Illumina Genome Analyzer, we followed the Illumina library generation protocol version 2.3. Five μg of high molecular weight whole genomic DNA from each sample were diluted in 150 μl water as the starting material. The DNA was sheared using a sonicator (Bioruptor, Diagenode) for 1 hr at high power level to generate short fragments. The size of the sheared product ranged between 150 bp to 400 bp with the median size around 200-250 bp. The sample was concentrated in 30 μl of elution buffer (EB) after purification using QIAquick PCR Purification Kit (Qiagen). To repair 3' or 5' overhangs and generate blunt ends for all the double stranded fragments, 5 μl of T4 DNA polymerase, 1 μl of Klenow DNA polymerase, 5 μl of T4 PNK, 4 μl of dNTP mix, 10 μl of T4 DNA ligase buffer with 10 mM ATP and 45 μl water were added to the DNA solution and incubated at 20°C for 30 min and purified to be eluted in 32 μl of EB. Single 'A' base was added to the fragments in a reaction with 3 μl of Klenow exo- (3' to 5' exo minus), 10 μl of dATP and 5 μl of Klenow buffer and incubated at 37°C for 30 min. After purification with MinElute PCR purification kit (QIAGEN) and elution in 10 μl of EB, the Illumina supplied adapters were ligated to the genomic fragments. Five μl of DNA ligase, 10 μl of adapter oligo mix, 25 μl of DNA ligase buffer were used and the sample was incubated for 15 min at room temperature. To select a size-range of templates (100-200 bp) with the correct adapter pairs ligated, the templates were purified and size fractionated on a 2% agarose gel for 1 hr at 120 V. A band from 150-250 bases (including the adaptors) was excised, gel extracted using Gel Extraction Kit (Qiagen) and eluted in 30 μl of EB. The template was PCR amplified with 1 μl of template DNA, 25 μl Phusion DNA polymerase, 1 μl of Illumina primer 1.1, 1 μl of Illumina primer 2.1 and 22 μl of water. Four replicates of PCR reactions were performed to generate sufficient genomic amplicon of ~4 ug for successful hybridization. Total of 18 cycles of PCR with the following cycle conditions was performed: 30 sec at 98°C, [10 sec at 98°C, 30 sec at 65°C, 30 sec at 72°C] × 18 cycles, 5 min at 72°C and hold at 4°C. Samples were purified, eluted in 50 μl of EB and quantified with spectrophotometer (Nanodrop).
PCR primer sequences used to design primers for blocking the adapters in the hybridization mix.
After hybridization, the arrays were washed according to the Agilent CGH wash procedure A protocol with the 2nd wash extended to 5 minutes for increased stringency. The chamber was disassembled and the array slide was separated from the gasket slide in a glass dish filled with Oligo aCGH wash buffer 1 (Agilent). The array slide was placed on a slide-rack in another glass dish filled with Oligo aCGH wash buffer 1 and washed for 5 min at room temperature with stirring on a magnetic stirring plate. The slide rack was carefully moved to a glass dish filled with Agilent Oligo aCGH wash buffer 2 that was pre-heated to 37°C in a water bath and was washed for 5 min. After washing, the arrays were transferred back to a glass dish with Agilent Oligo aCGH wash buffer 1 until the next step was ready.
μl of 1× Titanium Taq PCR Buffer (Clontech) pre-heated to 95°C was dispensed to a new gasket slide and covered with the array slide that was washed. After securely locking the arrays in the chamber, the array was incubated in the 95°C rotating oven for 10 min at 20 rpm. After this stripping process, the chamber was disassembled and the array slide was carefully lifted up from one side so that the solution converged on the gasket slide. Pipette was used to collect the solution and transfer to a 1.5 mL microtube. This step needed to be done promptly as the solution was heated to 95°C and it started evaporating quickly. Collected solution was approximately 400 μl. The sample was aliquoted into 4 tubes so that when added with the Illumina primer pair (final concentration 0.1 μM), enzyme (final concentration 0.5×), and dNTPs (final concentration 250 μM each), the final volume were 100 μl. The stripped DNA was amplified by 15 cycles of PCR as described previously. The samples were purified with QIAquick PCR purification kit, consolidating and eluting the sample in 50 μl of EB. The concentration of DNA was measured and the size was checked on a 2% agarose gel to confirm that the size matched the size extracted from the gel in the previous step. Hybridization, washing and stripping steps were repeated. The stripped DNA was amplified under the same condition as before. After checking the concentration and template size again, the sample was diluted in a final concentration of 10 nM, which is the working concentration for cluster generation.
The Illumina flowcell was strictly prepared according to the manufacturer's protocol and the clusters were sequenced on the Genome Analyzer using standard manufacturer's recommended protocols. The image data produced were converted to intensity files and the sequential image data were processed through the "Firecrest" and "Bustard" algorithms provided by Illumina were used to call the individual sequence reads.
Masks that define each index for sequence alignment.
We counted the sequences that were mapped back to the targeted region to calculate the specificity. From the WIG file for each chromosome, the base positions that mapped within the target intervals were filtered, and the sequence counts for each base position were summed. The target intervals were defined as extending 100 bp upstream and downstream from the final oligo intended to capture each exon interval. To compare the copy number of each chromosome between the normal and tumor sample from a patient, sequence counts were first tallied for each target interval and divided by total sequence counts within all target intervals for normalization. Mean of sequence counts per chromosome was calculated and compared between the normal and tumor sample. To compare the background noise between flanking region of EGFR and FOXP2 genes on chromosome 7, only the coverage in the non-targeted region (excluding targeted region +/- 100 bp) was considered. 200 Kb moving average of the coverage was calculated across each region.
We thank Jason Liu and Felipe Cervantes for technical assistance and the members of the Nelson Lab for guidance and input. Funding was provided from the Dani Saleh Cancer Fund, the Henry Singleton Brain Cancer Research Fund, and a grant to support the UCLA Site of the National Institutes of Health Neuroscience Microarray Consortium (NS U24NS052108) and the National Institute of Mental Health (R01 MH071852). The replication study was funded by grant D22657 and DE019567.
- Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, Wang MD, Zhang K, Mitra RD, Church GM: Accurate multiplex polony sequencing of an evolved bacterial genome. Science. 2005, 309 (5741): 1728-1732. 10.1126/science.1117389.View ArticlePubMedGoogle Scholar
- Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437 (7057): 376-380.PubMed CentralPubMedGoogle Scholar
- Bennett S: Solexa Ltd. Pharmacogenomics. 2004, 5 (4): 433-438. 10.1517/146224184.108.40.2063.View ArticlePubMedGoogle Scholar
- Eid J, Fehr A, Gray J, Luong K, Lyle J, Otto G, Peluso P, Rank D, Baybayan P, Bettman B, Bibillo A, Bjornson K, Chaudhuri B, Christians F, Cicero R, Clark S, Dalal R, Dewinter A, Dixon J, Foquet M, Gaertner A, Hardenbol P, Heiner C, Hester K, Holden D, Kearns G, Kong X, Kuse R, Lacroix Y, Lin S, Lundquist P, Ma C, Marks P, Maxham M, Murphy D, Park I, Pham T, Phillips M, Roy J, Sebra R, Shen G, Sorenson J, Tomaney A, Travers K, Trulson M, Vieceli J, Wegener J, Wu D, Yang A, Zaccarin D, Zhao P, Zhong F, Korlach J, Turner S: Real-Time DNA Sequencing from Single Polymerase Molecules. Science. 2008, 323 (5910): 133-8. 10.1126/science.1162986.View ArticlePubMedGoogle Scholar
- Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, Braslavsky I, Causey M, Colonell J, Dimeo J, Efcavitch JW, Giladi E, Gill J, Healy J, Jarosz M, Lapen D, Moulton K, Quake SR, Steinmann K, Thayer E, Tyurina A, Ward R, Weiss H, Xie Z: Single-molecule DNA sequencing of a viral genome. Science. 2008, 320 (5872): 106-109. 10.1126/science.1150427.View ArticlePubMedGoogle Scholar
- Pushkarev D, Neff NF, Quake SR: Single-molecule sequencing of an individual human genome. Nat Biotechnol. 2009, 27 (9): 847-52. 10.1038/nbt.1561.PubMed CentralView ArticlePubMedGoogle Scholar
- Choi M, Scholl UI, Ji W, Liu T, Tikhonova IR, Zumbo P, Nayir A, Bakkaloglu A, Ozen S, Sanjad S, Nelson-Williams C, Farhi A, Mane S, Lifton RP: Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc Natl Acad Sci USA. 2009, 106 (45): 19096-19101. 10.1073/pnas.0910672106.PubMed CentralView ArticlePubMedGoogle Scholar
- Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, Shendure J, Bamshad MJ: Exome sequencing identifies the cause of a mendelian disorder. Nat Genet. 2009,Google Scholar
- Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J: Targeted capture and massively parallel sequencing of 12 human exomes. Nature. 2009, 461 (7261): 272-276. 10.1038/nature08250.PubMed CentralView ArticlePubMedGoogle Scholar
- Reversade B, Escande-Beillard N, Dimopoulou A, Fischer B, Chng SC, Li Y, Shboul M, Tham PY, Kayserili H, Al-Gazali L, Shahwan M, Brancati F, Lee H, O'Connor BD, Schmidt-von Kegler M, Merriman B, Nelson SF, Masri A, Alkazaleh F, Guerra D, Ferrari P, Nanda A, Rajab A, Markie D, Gray M, Nelson J, Grix A, Sommer A, Savarirayan R, Janecke AR, Steichen E, Sillence D, Hausser I, Budde B, Nurnberg G, Nurnberg P, Seemann P, Kunkel D, Zambruno G, Dallapiccola B, Schuelke M, Robertson S, Hamamy H, Wollnik B, Van Maldergem L, Mundlos S, Kornak U: Mutations in PYCR1 cause cutis laxa with progeroid features. Nat Genet. 2009, 41 (9): 1016-1021. 10.1038/ng.413.View ArticlePubMedGoogle Scholar
- Dahl F, Gullberg M, Stenberg J, Landegren U, Nilsson M: Multiplex amplification enabled by selective circularization of large sets of genomic DNA fragments. Nucleic Acids Res. 2005, 33 (8): e71-10.1093/nar/gni070.PubMed CentralView ArticlePubMedGoogle Scholar
- Dahl F, Stenberg J, Fredriksson S, Welch K, Zhang M, Nilsson M, Bicknell D, Bodmer WF, Davis RW, Ji H: Multigene amplification and massively parallel sequencing for cancer mutation discovery. Proc Natl Acad Sci USA. 2007, 104 (22): 9387-9392. 10.1073/pnas.0702165104.PubMed CentralView ArticlePubMedGoogle Scholar
- Fredriksson S, Baner J, Dahl F, Chu A, Ji H, Welch K, Davis RW: Multiplex amplification of all coding sequences within 10 cancer genes by Gene-Collector. Nucleic Acids Res. 2007, 35 (7): e47-10.1093/nar/gkm078.PubMed CentralView ArticlePubMedGoogle Scholar
- Stenberg J, Dahl F, Landegren U, Nilsson M: PieceMaker: selection of DNA fragments for selector-guided multiplex amplification. Nucleic Acids Res. 2005, 33 (8): e72-10.1093/nar/gni071.PubMed CentralView ArticlePubMedGoogle Scholar
- Porreca GJ, Zhang K, Li JB, Xie B, Austin D, Vassallo SL, LeProust EM, Peck BJ, Emig CJ, Dahl F, Gao Y, Church GM, Shendure J: Multiplex amplification of large sets of human exons. Nat Methods. 2007, 4 (11): 931-936. 10.1038/nmeth1110.View ArticlePubMedGoogle Scholar
- Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, Weinstock GM, Gibbs RA: Direct selection of human genomic loci by microarray hybridization. Nat Methods. 2007, 4 (11): 903-905. 10.1038/nmeth1111.View ArticlePubMedGoogle Scholar
- Okou DT, Steinberg KM, Middle C, Cutler DJ, Albert TJ, Zwick ME: Microarray-based genomic selection for high-throughput resequencing. Nat Methods. 2007, 4 (11): 907-909. 10.1038/nmeth1109.View ArticlePubMedGoogle Scholar
- Hodges E, Xuan Z, Balija V, Kramer M, Molla MN, Smith SW, Middle CM, Rodesch MJ, Albert TJ, Hannon GJ, McCombie WR: Genome-wide in situ exon capture for selective resequencing. Nat Genet. 2007, 39 (12): 1522-1527. 10.1038/ng.2007.42.View ArticlePubMedGoogle Scholar
- Bashiardes S, Veile R, Helms C, Mardis ER, Bowcock AM, Lovett M: Direct genomic selection. Nat Methods. 2005, 2 (1): 63-69. 10.1038/nmeth0105-63.View ArticlePubMedGoogle Scholar
- Landegent JE, Jansen in de Wal N, Dirks RW, Baao F, Ploeg van der M: Use of whole cosmid cloned genomic sequences for chromosomal localization by non-radioactive in situ hybridization. Hum Genet. 1987, 77 (4): 366-370. 10.1007/BF00291428.View ArticlePubMedGoogle Scholar
- Hodges E, Rooks M, Xuan Z, Bhattacharjee A, Benjamin Gordon D, Brizuela L, Richard McCombie W, Hannon GJ: Hybrid selection of discrete genomic intervals on custom-designed microarrays for massively parallel sequencing. Nat Protoc. 2009, 4 (6): 960-974. 10.1038/nprot.2009.68.PubMed CentralView ArticlePubMedGoogle Scholar
- Forbes SA, Bhamra G, Bamford S, Dawson E, Kok C, Clements J, Menzies A, Teague JW, Futreal PA, Stratton MR: The Catalogue of Somatic Mutations in Cancer (COSMIC). Curr Protoc Hum Genet. 2008, Chapter 10 (Unit 10): 11-PubMedGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038/35057062.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.