Gene-resolution analysis of DNA copy number variation using oligonucleotide expression microarrays
© Auer et al; licensee BioMed Central Ltd. 2007
Received: 05 September 2006
Accepted: 30 April 2007
Published: 30 April 2007
Array-based comparative genomic hybridization (aCGH) is a high-throughput method for measuring genome-wide DNA copy number changes. Current aCGH methods have limited resolution, sensitivity and reproducibility. Microarrays for aCGH are available only for a few organisms and combination of aCGH data with expression data is cumbersome.
We present a novel method of using commercial oligonucleotide expression microarrays for aCGH, enabling DNA copy number measurements and expression profiles to be combined using the same platform. This method yields aCGH data from genomic DNA without complexity reduction at a median resolution of approximately 17,500 base pairs. Due to the well-defined nature of oligonucleotide probes, DNA amplification and deletion can be defined at the level of individual genes and can easily be combined with gene expression data.
A novel method of gene resolution analysis of copy number variation (graCNV) yields high-resolution maps of DNA copy number changes and is applicable to a broad range of organisms for which commercial oligonucleotide expression microarrays are available. Due to the standardization of oligonucleotide microarrays, graCNV results can reliably be compared between laboratories and can easily be combined with gene expression data using the same platform.
Array-based comparative genomic hybridization (aCGH) allows the identification of genome-wide DNA gains and losses in cancers and genetic diseases [1–3]. An ideal aCGH platform should possess the following features: 1) It should be available to study a broad range of organisms. Unfortunately, aCGH microarrays are commercially available for human and mouse studies only, leaving out other model organisms for DNA copy number studies. 2) The aCGH platform should be commercially available worldwide to make results from different laboratories easily comparable. In-house microarrays frequently show less reproducibility than commercial products [4, 5]. The Microarray Quality Control Consortium (MAQC) study highlighted once again that in-house microarrays generate a much higher coefficient of variation for expression signals compared to commercial products . Comparison of results generated at independent laboratories is frequently problematic when different probes are used at different laboratories. 3) Probes should span short regions to provide detailed information on regions of copy number variation (CNV); BAC clones, used as aCGH probes , due to their average probe length being several ten thousand nucleotides, inherently can not measure small amplified or deleted regions. 4) The platform should provide small spacing between probes to generate high density maps of CNVs; The only commercially available BAC aCGH array available measures at a median resolution of one megabase . 5) Individual measurements should provide reliable data to avoid necessity of averaging multiple measurements, resulting in decreased resolution. Long oligonucleotide arrays [8, 9] and SNP microarrays [10, 11] depend on averaging signals from multiple probes [9, 10] to eliminate false positive measurements, resulting in decreased resolution. 6) CNV measurements should be easily correlated with expression data when the same samples are studied on the genomic and transcriptomic level. BAC clones and probes designed for SNP measurement inherently are not specifically designed to interrogate transcribed genes. Therefore, combining DNA copy number and expression data needs strong bioinformatics support . 7) The analytical procedure should interrogate the entire genome; the DNA labeling protocol for SNP microarrays depends on complexity reduction, leaving out significant parts of the genome from analysis [12, 13].
Here we present gene resolution analysis of copy number variation (graCNV), a method utilizing the most frequently used expression microarray platform for aCGH. For human and mouse studies, this platform provides over 50,000 measurements across the genome (U133 Plus 2.0 and 430 2.0 GeneChips respectively, both Affymetrix). Furthermore, the same technology is available for more than a dozen other organisms with comparable genome coverage. The probes are short oligonucleotides and probe sets span on average short chromosomal regions. Without complexity reduction, genomic DNA is fragmented, labeled and hybridized to these microarrays. After re-annotation of probe sets for interrogation of genomic DNA, WPP, a data analysis algorithm originally developed for expression analysis, is utilized for calculation of DNA copy number variation. Since the vast majority of aCGH data available today has been generated using BAC microarrays , graCNV results have been compared to results from BAC microarrays with high genome coverage .
Properties of the U133 Plus 2.0 Expression Array as an aCGH tool
Properties of the U133 Plus 2.0 expression microarray and the 19K BAC microarray as aCGH tools
U133 Plus 2.0 microarray
Total probe sets
Median length of interrogated regions
Probe sets interrogating the human genome
Mean length of interrogated regions
Probe sets interrogating intergenic regions
Median length of regions not covered
Probe sets interrogating genes
Mean length of regions not covered
Interrogated gene regions
19K BAC microarray
19,116 (in duplicates)
Median length of interrogated regions
Probes interrogating the human genome
18,411 (in duplicates)
Mean length of interrogated regions
Median length of regions not covered
Mean length of regions not covered
Benefits of the WPP algorithm
Detection of copy number variations in large chromosomal regions
High resolution analysis of copy number variations
Detection of copy number variations between normal genomic DNAs at a single gene locus
Connection of CNV and expression data
Our study shows that the most frequently used commercial oligonucleotide expression microarray platform (Affymetrix) can be utilized for measurement of copy number variation. After re-annotation of probe sets, these expression arrays provide a high-resolution platform for CNV analysis.
Many aCGH platforms depend on averaging measurements from adjacent loci (moving average) to remove noise and avoid false positive reports of copy number variation [2, 9, 12, 26]. graCNV on the other hand, reports the bias-corrected median of measurements from eleven adjacent probes within a probe set. Therefore, graCNV shows low false discovery rates (see Additional file 1) and further averaging is not vital. Affymetrix expression arrays provide for many genes more than one probe set for measurement. To allow easy interpretation of results, we averaged results of multiple probe set measuring the same gene. An even higher resolution of CNV analysis could be accomplished when the newly released expression arrays for measurement of individual exons (GeneChip Human Exon 1.0 ST arrays, Affymetrix) would be utilized.
For sample labeling, we used standard chemistry for SNP analysis from the same provider, hybridization and washing protocols are also well established by many laboratories for SNP analysis. Therefore, it should be easy to adapt graCNV by other laboratories. Affymetrix currently provides expression microarrays for over 20 organisms and graCNV can be applied to all of these organisms. Additionally, graCNV provides copy number information comparable to data generated using one of the highest density BAC aCGH arrays available.
Hybridization of genomic DNA to expression arrays produces higher background cross-hybridization (indicated by relatively high signals for mismatch probes, data not shown) than hybridization of labeled mRNA transcripts. We speculate that the reason why the RMA algorithm failed to identify the close similarity of two cell lines derived from the same donor was because RMA did not correct for sequence-specific differences in background. WPP, the algorithm introduced here for CNV measurements, utilizes mismatch probe signals for sequence-specific background correction and has been used successfully for expression analysis by one of our laboratories for several years.
Due to the fact that graCNV uses a one chip per sample principle, the range of CNVs within normal genomic DNAs can be taken into consideration for measurement of disease-related CNVs and a free software for this calculation is provided on our website.
The present study describes a novel method of gene resolution analysis of copy number variation (graCNV) yielding high-resolution maps of DNA copy number changes and applicable to a broad range of organisms for which commercial oligonucleotide expression microarrays are available. Results are comparable to BAC aCGH with high genome coverage. Due to the standardized oligonucleotide microarrays, graCNV results can be compared between laboratories and can easily be combined with gene expression data using the same platform.
For analysis of cancer related genomic alterations, genomic DNA of neuroblastoma cell lines IMR-32, SK-N-AS and SK-N-SH (all provided by ATCC) was analyzed. SK-N-SH cells were propagated in two different laboratories for at least ten passages. The resulting cell lines were analyzed as SK-N-SH/G and SK-N-SH/L. As baseline of normal human individuals, genomic DNA from peripheral blood samples of four females (C2, C4, C5 and C6) was collected after informed consent. Genomic DNA was isolated from animals of a mouse strain (megabladder mouse), which resulted from mutagenesis during generating transgenic mice. Genetic characterization of the megabladder mouse using BAC clones containing the transgene revealed chromosome 16 at approximately 26.4 Mb to be the site of insertional mutation. FISH analysis of metaphase chromosomes further revealed this region of chromosome 16 to be translocated into chromosome 11. Therefore, wild type mice contain two copies of the genomic region surrounding 26.4 Mb on chromosome 16, heterozygous mutants contained three copies and homozygous mutants contained four copies. The megabladder mouse will be described in detail elsewhere.
Re-annotation of expression array
Probe sets were aligned to Build 35 version of the human genome assembly by applying standalone BLAT  to "concatemers" formed by concatenating the non-overlapping portions of individual 25-mer probe sequences of a probe set. If BLAT did not report any match for a concatemer of a certain probe set, the probe sets was eliminated from further annotation. Homology of each alignment was computed as the percentage of concatemer bases matched and the genomic location with the highest homology was used for further annotation. The refFlat.txt.gz file contains physical positions of gene locations according to the human genome assembly version Build 35 and has been used for identification of probe sets interrogating genes. When the genomic location with highest homology to a probe set overlapped with a gene in this database, the probe set was annotated to measure this particular gene. For multiple probe sets measuring the same gene, log2 copy number differences measured by individual probe sets were averaged.
Processing of genomic DNA for graCNV using expression arrays
20 μg genomic DNA was digested using EcoRI (New England Biolabs). Fragmentation and biotin labeling using terminal transferase were performed using GeneChip Mapping 10K Xba Assay Kit (Affymetrix). Human samples were hybridized to U133plus2.0 GeneChips (Affymetrix) and mouse samples were hybridized to custom GeneChips containing 4,400 probe sets preferentially measuring genes located on chromosomes 11 and 16 (Affymetrix). Hybridization and other conditions were slightly modified from those suggested for 10K Mapping Arrays (Affymetrix) and washing conditions were carried out as suggested for 100K Mapping Arrays. A detailed description of sample processing is available in Additional file 1.
Data analysis for expression arrays
CEL files were generated from scanned images (DAT files) using GCOS 1.4 software (Affymetrix). Probe set signals were either generated using the RMA algorithm in ArrayAssist 3.4 (Stratagene) or using the in-house developed WPP algorithm. WPP (Well behaved estimates of differential gene expression Plus probe-level p-values Plus extensible quantile scaling) software is an enhanced version of RMA . WPP provides the following advanced analysis procedures which significantly increase the reliability and interpretability of calculated differentials: 1) probe-level nonparametric p-values are used to assess the statistical significance of individual calculated differentials; 2) strictly monotonic quantile scaling is used to standardize PM and MM probe intensity distributions across arrays; 3) automatic exclusion of uninformative and misinformative probes is used to increase the accuracy and precision of calculated differentials. A detailed description of the WPP algorithm is available in Additional file 1. Measurements of the four normal human DNA samples were used as baseline for measurement of copy number variation in the cell lines. CNVs of cell lines were calculated relative to the median of signals from normal samples.
Principle components analysis and hierarchical cluster analysis
Hierarchical cluster analysis in SPSS software was applied to log2 transformed copy number estimates of probe sets using a Pearson correlation measure with furthest-neighbor distance. To exclude gender-specific differences, X-linked and Y-linked genes were excluded. Principle components analysis in SPSS software was applied with Varimax rotation to log2 transformed copy number estimates for 2,000 probe sets with the widest range of values for autosomal chromosomes.
Construction of the Human BAC CGH array
We prepared DNA spotting solutions from sequence connected RPCI-11 BAC by ligation-mediated PCR as described previously. The array contained ~19,000 BAC clones that were chosen by virtue of their STS content, end-sequence and association with cancer. Each clone was spotted in duplicate on amino-silanated glass slides (Schott Nexterion typeA+) using a MicroGrid ll TAS arrayer (Apogent Discoveries). The BAC DNA products have ~80 μm diameter spots with 150 μm center-to-center spacing creating an array of ~39,000 elements. The printed slides were dried overnight and thereafter UV-crosslinked (350 mJ) in a Stratalinker 2400 (Stratagene) immediately before hybridization. A complete list of the RPCI-11 BAC clones spotted on the 19k array is available online.
Labeling and Hybridization of DNA for BAC aCGH
One μg of reference and test sample genomic DNA (pooled genomic DNA of five individuals) were individually fluorescently labeled using the BioArray CGH Labeling System (Enzo Life Sciences). Initially, the DNA was denatured in the presence of the random primer at 99°C for 10 minutes in a thermalcycler, followed by a quick chill at 4°C. The tubes were transferred to ice and underwent labeling with the addition of dNTP-cyanine 3 mix (or dNTP-cyanine 5) and Klenow. Samples were incubated overnight at 37°C in a thermalcycler. The unincorporated nucleotides were removed using a QIAquick PCR purification column (Qiagen) and the labeled probe is eluted with 2 × 25 ul washes. Prior to hybridization, the test and reference probes were resuspended in 110 μl SlideHyb Buffer #3 (Ambion) containing 5 μl of 20 μg/μl Cot-1 and 5 μl of 100 μg/μl Yeast tRNA (Invitrogen), heated to 95°C for 5 minutes and placed on ice. Hybridization to the 19k CGH arrays were performed for 16 hours at 55°C using a GeneTAC hybridization station (Genomic Solutions, Inc.) as described. After hybridization, the slides are automatically washed in the GeneTAC station with reducing concentrations of SSC and SDS.
Digital Data Acquisition and Analysis for BAC aCGH
The hybridized aCGH slides were scanned using a GenePix 4200A Scanner (Molecular Devices) to generate high-resolution (5 μm) images for both Cy3 (test) and Cy5 (control) channels. Image analysis was performed using the ImaGene (version 6.0.1) software from BioDiscovery, Inc. A loess corrected log2 ratio of the background-subtracted test/control were calculated for each clone to compensate for non-linear raw aCGH profiles in each sample. Mapping information was added to the resulting log2 test/control values. The mapping data for each BAC is found by querying the human genome sequence and examined for regions of large scale variation (LSV) in the human genome[8, 26, 34, 35].
Comparison of copy number segmentation results from expression arrays and BAC arrays
Since BAC aCGH microarray and the graCNV microarray (U133 Plus 2.0 GeneChip) have been annotated according to the human genome assembly version 35, coordinates of copy number segments were compared directly. Copy number segmentation of log2 ratios was performed in R using the DNAcopy package v1.8.1 which applies CBS (Circular Binary Segmentation) [36, 37], one of the best available segmentation algorithms . The undo.splits option was set to "sdundo".
Microarray expression analysis
For expression profiling, 25 ng total RNA per sample was processed using isothermal amplification SPIA Biotin System (NuGEN Technologies, Inc.) and 2.2 μg of cDNA was hybridized per microarray. Microarrays utilized were Custom GeneChips (Affymetrix), containing probe sets to measure transcripts from mouse chromosomes 11 and 16. After 16 hours of hybridization at 45°C, washing and staining of microarrays was performed using a Fluidics Station 450 (Affymetrix); GeneChips were scanned in a GeneChip Scanner 3000 (Affymetrix). CEL files were generated from DAT files using GCOS software (Affymetrix). All steps of sample and microarray processing were performed according to manufacturer's recommendations. For calculation of differential gene expression, log2 differential expression of multiple probe sets per gene were averaged when more than one probe set was available per gene.
Tail samples (<1 cm) were snipped from every animal in the megabladder mouse colony. Tails were digested and DNA was isolated using Spin Doctor Genomic DNA Isolation kit (Gerard Biotech) according to the manufacturer's protocol. The DNA was resuspended in resuspension buffer included in the kit. The concentrations of the samples were determined by Nanodrop ND1000 spectrophotometer (Nanodrop), and the optical density 260/280 nm ratios were evaluated. Genomic DNA was stored at 4°C until further use. Mutant mice contain an artificial transgene in addition to the additional copies of the specified region of chromosome 16. Genotyping of mice by quantitative PCR was performed using transgene specific primers (5'-CAACCGACTCTGCATTCATCTC-3' (forward) and 5'-CTCCAGTACAGCCCTCATGTTTG-3' (reverse) and probe 5'-6FAM AAGCTTGATATCGAATTC MGBNFQ-3'. The Glucagon gene was used as internal control with primers 5'-CACAACATCTCGTGCCAGTCA-3' (forward) and 5'-ATCTGCATGCAAAGCAATATAGCT-3' (reverse), and the probe was 5'-VICT GGGATGTACAATTTCAA MGBNFQ-3'. Working concentrations of primers and probes were 18 μM and 5 μM, respectively. The multiplex PCR reactions were set up with 20 ng DNA and TaqMan Universal PCR Master Mix, No AmpErase UNG (Applied Biosystems). Reactions were performed in triplicate using the ABI series 7500 Sequence Detection System (Applied Biosystems). The initial denaturation was carried out at 50°C for 2 min, followed by 95°C for 10 min (denaturation) followed by 40 cycles of PCR reactions at 95°C for 15 sec and 60°C for 1 min. The amplification data were further analyzed using ABI 7500 System Sequence Detection Software Version 1.2.3 (Applied Biosystems). The genotype was determined by the presence of 0 versus 1 versus 2 copies of the transgene in wild type, heterozygous and homozygous mice respectively. Copy numbers of endogenous genes were determined using SYBR Green or TaqMan chemistry (both from Applied Biosystems). 10 ng of genomic DNA were used per reaction and amplification conditions for SYBR Green assays were as follows: 50°C for 2 min, 95°C for 10 min followed by 40 PCR cycles at 95°C for 15 sec, 54°C for 30 sec and 72°C for 35 sec. The data was collected at 72°C for 35 sec. TaqMan data for glucagon were used for normalization. Primers were generated for the following sequences: 2310061A09Rik (5' GCCATCTGCATATTCTTTGCTAGCA 3' forward and 5'ACATGGTTTAATGGTAGACTGGGCA 3' reverse); Cldn1 (5'CTCAACCTCCCAACTGTTAAGATGA 3' forward and 5'AACCTCTCCTATAACTGTCAGCTTC 3' reverse); Ostn (5'GAGTGTTTGCTTCAACTGTGTCAGA 3' forward and 5'AACAAGCCAGGCAGTAACTTCTTTT 3', reverse); Uts2d (5'GAGTGTTTGCTTCAACTGTGTCAGA 3' forward and 5' TAGGCTGGTAGAAGTAAACAAGCCA 3' reverse), 2610529H08Rik (5'TGGCGTCTAGGGAACTGAGTTTCTT 3' forward and 5'TGAGGAAACAGCAGTACACGATAAC 3' reverse), D16Bwg1543e (5'GCTGGCTGCAGGGAACAATCTATTT 3' forward and 5'GATGTAGACATATGAGTGGTAGTGA 3' reverse), B230343J05Rik (5'TGTGATTCATCATCGCTACAGGGAA 3' forward and 5'AACCTTCTCAAAAGCAAGGCCTTGT 3' reverse). Amplification conditions for TaqMan assays were as described above for genotyping. Commercial "Primer probe mixes" (Applied Biosystems) were used for Il1rap (ILRAP5-K1), Fgf12 (FGF12-1-A2) and Cldn16 (CLDN-I55S4).
Microarray data are deposited as GEO accession # GSE7364.
We thank Albert de la Chapelle for valuable discussions and Long-Sheng Chang for providing SK-N-SH/L and SK-N-AS DNA. Part of this study was supported by grants RO1 AR050078 from NIAMS and PO1 DK55546 from NIDDK (to CYY and YY), and by grant RO1 DK00907 from NIH (to KMM).
- Snijders AM, Nowak N, Segraves R, Blackwood S, Brown N, Conroy J, Hamilton G, Hindle AK, Huey B, Kimura K, Law S, Myambo K, Palmer J, Ylstra B, Yue JP, Gray JW, Jain AN, Pinkel D, Albertson DG: Assembly of microarrays for genome-wide measurement of DNA copy number. Nat Genet. 2001, 29 (3): 263-264.PubMedView ArticleGoogle Scholar
- Pollack JR, Perou CM, Alizadeh AA, Eisen MB, Pergamenschikov A, Williams CF, Jeffrey SS, Botstein D, Brown PO: Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nat Genet. 1999, 23 (1): 41-46.PubMedView ArticleGoogle Scholar
- Hodgson G, Hager JH, Volik S, Hariono S, Wernick M, Moore D, Nowak N, Albertson DG, Pinkel D, Collins C, Hanahan D, Gray JW: Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas. Nat Genet. 2001, 29 (4): 459-464.PubMedView ArticleGoogle Scholar
- Kuo WP, Liu F, Trimarchi J, Punzo C, Lombardi M, Sarang J, Whipple ME, Maysuria M, Serikawa K, Lee SY, McCrann D, Kang J, Shearstone JR, Burke J, Park DJ, Wang X, Rector TL, Ricciardi-Castagnoli P, Perrin S, Choi S, Bumgarner R, Kim JH, Short GF, Freeman MW, Seed B, Jensen R, Church GM, Hovig E, Cepko CL, Park P, Ohno-Machado L, Jenssen TK: A sequence-oriented comparison of gene expression measurements across different hybridization-based technologies. Nat Biotechnol. 2006, 24 (7): 832-840.PubMedView ArticleGoogle Scholar
- Larkin JE, Frank BC, Gavras H, Sultana R, Quackenbush J: Independence and reproducibility across microarray platforms. Nat Methods. 2005, 2 (5): 337-344.PubMedView ArticleGoogle Scholar
- Shi L, Reid LH, Jones WD, Shippy R, Warrington JA, Baker SC, Collins PJ, de Longueville F, Kawasaki ES, Lee KY, Luo Y, Sun YA, Willey JM, Setterquist RA, Fischer GM, Tong W, Dragan YP, Dix DJ, Frueh FW, Goodsaid FM, Herman D, Jensen RV, Johnson CD, Lobenhofer EK, Puri RK, Schrf U, Thierry-Mieg J, Wang C, Wilson M, Wolber PK, Zhang L, Slikker W, Shi L, Reid LH: The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat Biotechnol. 2006, 24 (9): 1151-1161.PubMedView ArticleGoogle Scholar
- Ishkanian AS, Malloff CA, Watson SK, DeLeeuw RJ, Chi B, Coe BP, Snijders A, Albertson DG, Pinkel D, Marra MA, Ling V, MacAulay C, Lam WL: A tiling resolution DNA microarray with complete coverage of the human genome. Nat Genet. 2004, 36 (3): 299-303.PubMedView ArticleGoogle Scholar
- Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C: Detection of large-scale variation in the human genome. Nat Genet. 2004, 36 (9): 949-951.PubMedView ArticleGoogle Scholar
- van den Ijssel P, Tijssen M, Chin SF, Eijk P, Carvalho B, Hopmans E, Holstege H, Bangarusamy DK, Jonkers J, Meijer GA, Caldas C, Ylstra B: Human and mouse oligonucleotide-based array CGH. Nucleic Acids Res. 2005, 33 (22): e192-PubMed CentralPubMedView ArticleGoogle Scholar
- Barrett MT, Scheffer A, Ben-Dor A, Sampas N, Lipson D, Kincaid R, Tsang P, Curry B, Baird K, Meltzer PS, Yakhini Z, Bruhn L, Laderman S: Comparative genomic hybridization using oligonucleotide microarrays and total genomic DNA. Proc Natl Acad Sci U S A. 2004, 101 (51): 17765-17770.PubMed CentralPubMedView ArticleGoogle Scholar
- Baldus CD, Liyanarachchi S, Mrozek K, Auer H, Tanner SM, Guimond M, Ruppert AS, Mohamed N, Davuluri RV, Caligiuri MA, Bloomfield CD, de la Chapelle A: Acute myeloid leukemia with complex karyotypes and abnormal chromosome 21: Amplification discloses overexpression of APP, ETS2, and ERG genes. Proc Natl Acad Sci U S A. 2004, 101 (11): 3915-3920.PubMed CentralPubMedView ArticleGoogle Scholar
- Calhoun ES, Hucl T, Gallmeier E, West KM, Arking DE, Maitra A, Iacobuzio-Donahue CA, Chakravarti A, Hruban RH, Kern SE: Identifying Allelic Loss and Homozygous Deletions in Pancreatic Cancer without Matched Normals Using High-Density Single-Nucleotide Polymorphism Arrays. Cancer Res. 2006, 66 (16): 7920-7928.PubMedView ArticleGoogle Scholar
- Zhao X, Li C, Paez JG, Chin K, Janne PA, Chen TH, Girard L, Minna J, Christiani D, Leo C, Gray JW, Sellers WR, Meyerson M: An integrated view of copy number and allelic alterations in the cancer genome using single nucleotide polymorphism arrays. Cancer Res. 2004, 64 (9): 3060-3071.PubMedView ArticleGoogle Scholar
- Ylstra B, van den Ijssel P, Carvalho B, Brakenhoff RH, Meijer GA: BAC to the future! or oligonucleotides: a perspective for micro array comparative genomic hybridization (array CGH). Nucleic Acids Res. 2006, 34 (2): 445-450.PubMed CentralPubMedView ArticleGoogle Scholar
- Nowak NJ, Gaile D, Conroy JM, McQuaid D, Cowell J, Carter R, Goggins MG, Hruban RH, Maitra A: Genome-wide aberrations in pancreatic adenocarcinoma. Cancer Genet Cytogenet. 2005, 161 (1): 36-50.PubMedView ArticleGoogle Scholar
- Finishing the euchromatic sequence of the human genome. Nature. 2004, 431 (7011): 931-945.Google Scholar
- Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31 (4): e15-PubMed CentralPubMedView ArticleGoogle Scholar
- Brinkschmidt C, Christiansen H, Terpe HJ, Simon R, Boecker W, Lampert F, Stoerkel S: Comparative genomic hybridization (CGH) analysis of neuroblastomas--an important methodological approach in paediatric tumour pathology. J Pathol. 1997, 181 (4): 394-400.PubMedView ArticleGoogle Scholar
- Wang Q, Diskin S, Rappaport E, Attiyeh E, Mosse Y, Shue D, Seiser E, Jagannathan J, Shusterman S, Bansal M, Khazi D, Winter C, Okawa E, Grant G, Cnaan A, Zhao H, Cheung NK, Gerald W, London W, Matthay KK, Brodeur GM, Maris JM: Integrative genomics identifies distinct molecular classes of neuroblastoma and shows that multiple genes are targeted by regional alterations in DNA copy number. Cancer Res. 2006, 66 (12): 6050-6062.PubMedView ArticleGoogle Scholar
- Brodeur GM: Neuroblastoma: biological insights into a clinical enigma. Nat Rev Cancer. 2003, 3 (3): 203-216.PubMedView ArticleGoogle Scholar
- Nguyen DQ, Webber C, Ponting CP: Bias of Selection on Human Copy-Number Variants. PLoS Genet. 2006, 2 (2): e20-PubMed CentralPubMedView ArticleGoogle Scholar
- Database of Genomic Variants. [http://projects.tcag.ca/variation/]
- De Preter K, Pattyn F, Berx G, Strumane K, Menten B, Van Roy F, De Paepe A, Speleman F, Vandesompele J: Combined subtractive cDNA cloning and array CGH: an efficient approach for identification of overexpressed genes in DNA amplicons. BMC Genomics. 2004, 5 (1): 11-PubMed CentralPubMedView ArticleGoogle Scholar
- Eichler EE: Widening the spectrum of human genetic variation. Nat Genet. 2006, 38 (1): 9-11.PubMedView ArticleGoogle Scholar
- Chung EK, Yang Y, Rupert KL, Jones KN, Rennebohm RM, Blanchong CA, Yu CY: Determining the one, two, three, or four long and short loci of human complement C4 in a major histocompatibility complex haplotype encoding C4A or C4B proteins. Am J Hum Genet. 2002, 71 (4): 810-822.PubMed CentralPubMedView ArticleGoogle Scholar
- Sebat J, Lakshmi B, Troge J, Alexander J, Young J, Lundin P, Maner S, Massa H, Walker M, Chi M, Navin N, Lucito R, Healy J, Hicks J, Ye K, Reiner A, Gilliam TC, Trask B, Patterson N, Zetterberg A, Wigler M: Large-scale copy number polymorphism in the human genome. Science. 2004, 305 (5683): 525-528.PubMedView ArticleGoogle Scholar
- Functional Genomics Core. [http://www.dnaarrays.org/CNP.php]
- Kent WJ: BLAT--the BLAST-like alignment tool. Genome Res. 2002, 12 (4): 656-664.PubMed CentralPubMedView ArticleGoogle Scholar
- Index of /goldenPath/hg17/database. [http://hgdownload.cse.ucsc.edu/goldenPath/hg17/database/]
- Nowak NJ SA Conroy JM, and Albertson D: The BAC Resource: Tools for Array CGH and FISH. Current Protocols in Human Genetics. 2005, 1-34.Google Scholar
- Roswell Park Cancer Institute. [http://microarrays.roswellpark.org]
- Cowell JK, Wang YD, Head K, Conroy J, McQuaid D, Nowak NJ: Identification and characterisation of constitutional chromosome abnormalities using arrays of bacterial artificial chromosomes. Br J Cancer. 2004, 90 (4): 860-865.PubMed CentralPubMedView ArticleGoogle Scholar
- UCSC Genome Browser Home. [http://genome.ucsc.edu]
- Sharp AJ, Locke DP, McGrath SD, Cheng Z, Bailey JA, Vallente RU, Pertz LM, Clark RA, Schwartz S, Segraves R, Oseroff VV, Albertson DG, Pinkel D, Eichler EE: Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005, 77 (1): 78-88.PubMed CentralPubMedView ArticleGoogle Scholar
- Tuzun E, Sharp AJ, Bailey JA, Kaul R, Morrison VA, Pertz LM, Haugen E, Hayden H, Albertson D, Pinkel D, Olson MV, Eichler EE: Fine-scale structural variation of the human genome. Nat Genet. 2005, 37 (7): 727-732.PubMedView ArticleGoogle Scholar
- Venkatraman ES, Olshen AB: A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics. 2007Google Scholar
- Olshen AB, Venkatraman ES, Lucito R, Wigler M: Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004, 5 (4): 557-572.PubMedView ArticleGoogle Scholar
- Lai WR, Johnson MD, Kucherlapati R, Park PJ: Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics. 2005, 21 (19): 3763-3770.PubMed CentralPubMedView ArticleGoogle Scholar
- Gene Expression Omnibus (GEO) main page. [http://www.ncbi.nlm.nih.gov/projects/geo]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.