Detection of copy number variations in rice using array-based comparative genomic hybridization

Yu, Ping; Wang, Caihong; Xu, Qun; Feng, Yue; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Tang, Shengxiang; Wei, Xinghua

doi:10.1186/1471-2164-12-372

Research article
Open access
Published: 20 July 2011

Detection of copy number variations in rice using array-based comparative genomic hybridization

Ping Yu¹,
Caihong Wang¹,
Qun Xu¹,
Yue Feng¹,
Xiaoping Yuan¹,
Hanyong Yu¹,
Yiping Wang¹,
Shengxiang Tang¹ &
…
Xinghua Wei¹

BMC Genomics volume 12, Article number: 372 (2011) Cite this article

6366 Accesses
54 Citations
Metrics details

Abstract

Background

Copy number variations (CNVs) can create new genes, change gene dosage, reshape gene structures, and modify elements regulating gene expression. As with all types of genetic variation, CNVs may influence phenotypic variation and gene expression. CNVs are thus considered major sources of genetic variation. Little is known, however, about their contribution to genetic variation in rice.

Results

To detect CNVs, we used a set of NimbleGen whole-genome comparative genomic hybridization arrays containing 718,256 oligonucleotide probes with a median probe spacing of 500 bp. We compiled a high-resolution map of CNVs in the rice genome, showing 641 CNVs between the genomes of the rice cultivars 'Nipponbare' (from O. sativa ssp. japonica) and 'Guang-lu-ai 4' (from O. sativa ssp. indica). The CNVs identified vary in size from 1.1 kb to 180.7 kb, and encompass approximately 7.6 Mb of the rice genome. The largest regions showing copy gain and loss are of 37.4 kb on chromosome 4, and 180.7 kb on chromosome 8. In addition, 85 DNA segments were identified, including some genic sequences. Contracted genes greatly outnumbered duplicated ones. Many of the contracted genes corresponded to either the same genes or genes involved in the same biological processes; this was also the case for genes involved in disease and defense.

Conclusion

We detected CNVs in rice by array-based comparative genomic hybridization. These CNVs contain known genes. Further discussion of CNVs is important, as they are linked to variation among rice varieties, and are likely to contribute to subspecific characteristics.

Background

Copy number variations (CNVs), or copy number polymorphisms (CNPs), are forms of structural variation (SV) that are alterations in DNA resulting in the cell having an abnormal number of copies of one or more segments of DNA. A CNV is a DNA segment ranging from 1 kb to 3 Mb that has been deleted, inserted, or duplicated, on certain chromosomes [1, 2]. In particular, segmental duplications (SDs) were demonstrated to be one of the major catalysts and hotspots for CNV formation [3–5]. A CNV was described as early as 1936, with the duplication of the Bar gene in Drosophila melanogaster[6]. Recently, many studies have discovered CNVs in humans [7–9], chimpanzee [10], dog [11], cattle [12], rat [13], mice [14], Drosophila[15], yeast [16], E. coli[17], and maize [18, 19]. CNVs can be detected using cytogenetic techniques such as fluorescent in situ hybridization, array-based comparative genomic hybridization, and SNP genotyping arrays. Recent advances in DNA sequencing technologies have further enabled the identification of CNVs by next-generation sequencing [20–22].

CNVs can create new genes, change gene dosage, reshape gene structures, and modify elements regulating gene expression [23, 24]. Thus, CNVs are considered likely major sources of genetic variation, and may influence phenotypic variation and gene expression. Some human CNVs have been linked with susceptibility or resistance to disease. A higher CCL3L1 copy number, for example, can reduce risk of HIV/AIDS infection [25], and a lower FCGR3 copy number appears to contribute to increased susceptibility to glomerulonephritis [26]. CNVs also have an impact on fitness and gene expression. CNVs detected among 15 female isolines of Drosophila have been subjected to purifying selection [15]. In addition, a dramatic fruit size change due to a CNV with an insertion of 6-8 kb that affected gene regulation, was described during tomato breeding [27]. It was recently demonstrated that most CNVs in humans are in linkage disequilibrium (LD) with single nucleotide polymorphisms (SNPs); and that LD decay of the two happens at similar rates [8]. CNVs were confirmed to capture about 18% of the variation in gene expression, with little overlap with the variation captured by SNPs [28]. Thus, CNVs can be developed as a type of molecular marker for molecular identification.

Rice (Oryza sativa L.), comprises two subspecies, indica and japonica. It is one of the most important food crops in the world, and a model plant for genomic studies of monocots. Rice genomes exhibit relatively high levels of SNPs and indels [29]. Sequence comparisons between the Nipponbare (japonica) and 9311 (indica) genomes have shown high levels of polymorphisms ranging from one SNP/300 bp to one indel/kp [30, 31]. These can potentially be exploited as molecular markers between these divergent subspecies. However, there are few studies of structural variation within the rice genome. Recent study of many subclones within chromosome 4 of the BAC libraries of Nipponbare and Guang-lu-ai 4 (indica), has documented that many genes vary in copy number [32]. With the completion of rice genome sequencing projects and advances in microarray technologies, comprehensive oligonucleotide microarrays are now being used to discover genetic polymorphisms. Array-based comparative genomic hybridization (aCGH) has the advantages of high resolution and high-throughput genome-wide screening of genomic imbalances, and has been used in rice to detect single-feature polymorphisms [33], and structural variations created by mutagenesis [34].

We used high-density oligonucleotide aCGH (containing 718,256 oligonucleotide probes) to investigate the number of CNVs between Nipponbare and Guang-lu-ai 4 genomes. We found high levels of CNVs, some representing large inserted/deleted regions. In addition, several DNA segments, often including genic sequences, were identified as present in the Nipponbare genome but absent from the Guang-lu-ai 4 genome. Ours is the first comprehensive map of CNVs in the rice genome; providing an important resource for understanding the nature of variation among different rice varieties.

Results

CNV detection using aCGH

To investigate the reproducibility of CNV detection using aCGH, we performed aCGH on three independent samples of Nipponbare and Guang-lu-ai 4 (Figure 1). In comparing hybridization results we decided that most detected CNVs may be accurate, even though some were not present in all replications. Using less stringent criteria, in which the log₂ of the signal ratio between the two genomes was ± 0.5, we detected a total of 1,109, 1,100 and 1,074 CNVs respectively in three replications of Nipponbare and Guang-lu-ai 4; of which 857 (~78.3%) were detected in all three replications. However, using stringent criteria in which the log₂ (Guang-lu-ai 4/Nipponbare) was ± 1.0, three comparisons of two samples revealed 856, 858 and 784 CNVs respectively; of which 641 (~77.0%) were detected in all three replications (Figure 2). Encouraged by this result, we surveyed hybridization signals which had high confidence levels and identified 641 CNVs.

These 641 CNVs comprised ~1.8% (~7.6 Mb) of the rice genome, similar to the proportion of CNVs in a population of Drosophila melanogaster (~2.0%) [15], and were distributed along all 12 rice chromosomes (Figure 3). We found no significant correlation between the frequencies of CNV occurrence and chromosome length (Additional file 1, Figure S1). The highest frequency (94) was found on chromosome 11, and the lowest frequency (30) on chromosome 5. This is consistent with a previous study of heterogeneous distribution of CNVs [35]. CNV sizes ranged from 1.1 kb to 180.7 kb, averaging 11.8 kb. Most CNVs (67.4%) were found to be small variants (< 10 kb), while some (2.5%) were larger variants (>50 kb) (Figure 4). The largest regions showing copy gain and loss were 37.4 kb on chromosome 4 and 180.7 kb on chromosome 8 (Additional file 2, Table S1). Analysis of the aCGH data also revealed a bias towards stronger hybridization signals from the Nipponbare genomic DNA than from the Guang-lu-ai 4 genomic DNA. This was found in CNVs determined by stringent criteria as well as those determined by less stringent criteria. This reflects the fact that the probes were designed from Nipponbare sequences.

PCR analysis

We used 134 PCRs to further analyze 85 putative CNVs detected by aCGH. All PCRs confirmed the existence of insertion/deletion polymorphisms in these regions (Additional file 3, Table S2). More than 90% showed presence/absence variations between Nipponbare and Guang-lu-ai 4. All the validated CNV regions were defined by a few probes. In a CNV located on chromosome 12, for example, four amplicons spanning those probes of the putative deletions did not amplify from Guang-lu-ai 4 (Table 1), indicating that the DNA segment was absent from Guang-lu-ai 4 (Figure 5A). In addition, we also identified the allelic versions in 20 varieties of the two subspecies, and obtained similar results; more amplification products were present in japonica than in indica (Figure 5B). Indica and japonica are derived from independent domestication events of an ancestral rice that had already differentiated into two gene pools [36–38]. It seems unlikely that our observed pattern could be generated randomly, but our low number of samples prevents us from confirming strong evidence of subspecific variation in our CNV analysis.

Table 1 Primers used in PCR validation of a CNV located on chromosome 12 in Nipponbare, Guang-lu-ai 4, and some other varieties of indica and japonica.

Full size table

Annotation of CNVs

Different hybridization signal intensity of a gene across aCGH would indicate a gain or loss of a gene copy number during rice evolution. Using a stringent selection criterion (a Guang-lu-ai 4 to Nipponbare signal ratio of 1 : 2), we identified 500 protein-coding genes that were contracted in Guang-lu-ai 4, and only 19 genes that were duplicated (signal ratio > 2.0) (Additional file 4, Table S3). The dominance of gene contraction over duplication was obvious when the aCGH selection ratio was relaxed (data not shown). Contracted genes thus greatly outnumbered duplicated ones. The majority of contracted genes are hypothetical proteins, indicating duplication of preexisting genes to augment gene function. Among the 19 duplicated genes, three encode different enzymes: transposase, reverse transcriptase and terpenoid cyclase. One gene is involved in gibberellin synthesis, i.e. ent-kaurene synthase like-2. Xa1 is a known bacterial blight resistance gene. Duplication also occurred in genes relating to metabolism, such as the GTP-binding signal recognition particle SRP54, and the 2-oxoglutarate dehydrogenase E2 subunit. As well, two genes were involved in transcription, the RNA polymerase III RPC4 family protein and the C2H2-type zinc finger domain-containing protein. Many of the contracted genes corresponded to genes that were either the same genes or genes involved in the same biological processes. This was similar for genes involved in disease and defense, such as most of them encode proteins with conserved nucleotide-binding sites (NBS) and leucine-rich repeats (LRRs). In addition, Cytochrome P450 and concanavalin A-like lectin/glucanase play crucial roles in defending plants from disease.

Discussion

Using aCGH, we have generated the first map of CNVs in the rice genome. After very stringent filtering, 641 CNV events were identified between the two rice subspecies cultivars Nipponbare and Guang-lu-ai 4. This is likely to represent a very conservative estimate of the true number of CNV events in the rice genome. Focusing only on the unique sequences in our microarray will have potentially led to an underestimation of the number of CNV events. This is due to the selective omission or reduction of probe density in some CNVs enriched regions that contain segmental duplications and diverse repetitive sequences. In addition, our stringent CNV calling criteria restrained the detection of putative true CNVs. Differing probe densities, algorithms and statistical criteria used in the literature, complicate comparisons of rates of CNVs among different organisms [2, 9–11, 13, 39]. Our data suggest that smaller CNVs (< 10 kb) are much more frequent than larger ones; this is supported by other studies [8, 19]. However, using next-generation sequencing techniques would offer advantages over aCGH as DNA variations and recombination breakpoints would be directly detected [21, 40–44].

CNV number differs between species. In mammals, the mean number of CNVs per individual has been found to range from 14 in macaques [45] to 70 in humans [9]. In maize, around 400 CNVs have been detected between two cultivars (Mo 17 and B 73) [18, 19]. We observed many more CNVs between indica and japonica, the main reason for this was that we used subspecific samples. Indica and japonica diverged from their O. rufipogon ancestor between 200,000 and 400,000 years ago [37, 46, 47], and have richly diversified during the processes of domestication and selection. Both phenotypic and molecular studies have confirmed a relatively high level of differentiation between these two subspecies [48], suggesting great variation. This is also indicated by the lower numbers of deleted gene regions (ranging from 2 to 359) between 14 mutants and their wild type IR 64 of indica[34]. More recently, tiling oligonucleotide microarrays with 42 million probes, showed that an average of 1,098 CNVs comprising 0.78% of the human genome were validated between two individuals [49]. This was also found in a previous study [35], indicating that increased density and improved probe design will help us to better understand the roles of CNVs in organisms.

Although the presence and phenotypic effects of CNVs in plants have been little investigated on the genomic level, the nature of CNVs detected in maize suggests that they may have considerable impact on plant phenotypes, including disease responses and heterosis. We detected at least 519 genes in our high confidence CNV regions (Additional file 3, Table S2). However, it is likely that more genes are affected. We found that genes in many CNVs were involved in resistance, and that most of these encode proteins with conserved nucleotide-binding sites (NBS) and leucine-rich repeats (LRRs). NBS-LRR genes in plants tend to cluster at the same loci within genomes [50, 51]. Similarly, both resistance genes and quantitative trait loci (QTL) are clustered in the rice genome [52, 53]. In addition to its functional and agronomic importance, the NBS-LRR gene family has a structural role within the genome [54].

Previous research showed strong evidence that natural selection may shape CNVs, both in their patterns of polymorphism and their distribution within the genome [9, 15]. Long-term purifying selection has changed quantitative traits, and it is possible that genomic variation in rice supplies source material for the generation of novel alleles. This implies that characterization of rice CNVs is far from perfect, and provides a comprehensive view of the polymorphic phase of CNVs.

Conclusion

We have demonstrated that CNVs are able to be detected in rice using array-based comparative genome hybridization. These are likely to be linked with subspecific characteristics and to provide an important resource for understanding variation among different rice varieties.

Methods

Source of DNA samples

The rice varieties for our aCGH survey, Nipponbare (japonica) and Guang-lu-ai 4 (indica), were provided by the China National Rice Research Institute, Hangzhou, Zhejiang Province. The 10 indica varieties for CNV validation were: Minbeiwanxian, Dianbaidashanwang, Sankecun, Aizizhan, Haohuangla, Chiliyubai, Nanjing 11, Zhechang 9, Liantangao and Zhuguang 23. The 10 japonica varieties were: Kendao 8, Guihuahuang, Xiushui 48, Baimaodao, Xingguo, Mingshuixiangdao, Maendalaqili, Weiguo, Zhongdan 2 and Shuiyuansanbaili.

Genomic DNA was extracted and purified from fresh young leaves using a Promega kit (Wizard^® Genomic DNA Purification Kit). Total DNA was quantified using a spectrophotometer and electrophoresed on an agarose gel for integrity checking. Following the NimbleGen quality control requirements, the genomic DNA was undegraded and had 1.8 ≤ A260/A280 ≤ 2.0 and 1.9 ≤ A260/A230 ≤ 2.0.

Array CGH

Custom NimbleGen 3 × 720 K microarrays http://www.nimblegen.com contain 718,256 oligonucleotide probes designed and fabricated on a single slide; resulting in a median probe spacing of 500 bp. These types of arrays utilize synthetic probes 45 to 75-mer in length with similar melting temperatures, and do not require sample amplification or reduced representation. Probes were designed from the NCBI rice genome build of October 2006. Roche NimbleGen's CGH probe design criteria was utilized. Uniqueness information was generated using the SSAHA program http://www.sanger.ac.uk/Software/analysis/SSAHA/. Standard genomic DNA labeling (Cy3 for samples and Cy5 for references), hybridizations, array scanning, data normalization, and segmentation were performed at CapitalBio Corporation as described previously [39, 55]. High confidence calls were made according to the criteria used by Graubert et al. (2007). NimbleGen has an information package that describes the technology and provides measures of reproducibility, accuracy, sensitivity, and specificity. In brief, we used the normalized qspline method from the Bioconductor package in R. CNVs were identified by the circular binary segmentation algorithm [56]. Candidate CNVs were identified by finding more than 5 probe segments with log₂ ratios greater than ± 1.0. We conducted further analysis and visualization using SignalMap software (NimbleGen). Raw aCGH data for this study have been deposited to GenBank GEO database under accession GSE30542http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE30542

Polymerase chain reaction (PCR)

For validation, sequences flanking the first and last probe set location of CNV regions were used to design primers. In addition, to reduce the possibility of interference from overlaps between probes and primer sequences, we designed two independent pairs of primers to confirm partial validated CNVs. PCR methods followed those recommended by the TaKaRa LA Taq manufacturer, optimizing conditions for each use. Products were run on a 1.5% agarose gel, stained with ethidium bromide, and visualized on a UV transilluminator.

Abbreviations

CNV:: Copy number variation
SV:: Structural variation
SD:: Segmental duplication
SNP:: Single nucleotide polymorphism
LD:: Linkage disequilibrium
aCGH:: Array-based comparative genomic hybridization
NBS:: Nucleotide-binding sites
LRR:: Leucine-rich repeats
QTL:: Quantitative trait loci.

References

Feuk L, Carson AR, Scherer SW: Structural variation in the human genome. Nat Rev Genet. 2006, 7: 85-97.
Article CAS PubMed Google Scholar
Scherer SW, Lee C, Birney E, Altshuler DM, Eichler EE, Carter NP, Hurles ME, Feuk L: Challenges and standards in integrating surveys of structural variation. Nat Genet. 2007, 39: S7-15. 10.1038/ng2093.
Article CAS PubMed PubMed Central Google Scholar
Sharp AJ, Locke DP, McGrath SD, Cheng Z, Bailey JA, Vallente RU, Pertz LM, Clark RA, Schwartz S, Segraves R, et al: Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005, 77: 78-88. 10.1086/431652.
Article CAS PubMed PubMed Central Google Scholar
Goidts V, Cooper DN, Armengol L, Conroy J, Estivill X, Nowak N, Hameister H, Kehrer-Sawatzki H: Complex patterns of copy number variation at sites of segmental duplications: An important category of structural variation in the human genome. Hum Genet. 2006, 120: 270-284. 10.1007/s00439-006-0217-y.
Article CAS PubMed Google Scholar
Marques-Bonet T, Girirajan S, Eichler EE: The origins and impact of primate segmental duplications. Trends Genet. 2009, 25: 443-454. 10.1016/j.tig.2009.08.002.
Article CAS PubMed PubMed Central Google Scholar
Bridges CB: The Bar gene: a duplication. Science. 1936, 83: 210-211. 10.1126/science.83.2148.210.
Article CAS PubMed Google Scholar
Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C: Detection of large-scale variation in the human genome. Nat Genet. 2004, 36: 949-951. 10.1038/ng1416.
Article CAS PubMed Google Scholar
McCarroll SA, Kuruvilla FG, Korn JM, Cawley S, Nemesh J, Wysoker A, Shapero MH, de Bakker PIW, Maller JB, Kirby A, et al: Integrated detection and population-genetic analysis of SNPs and copy number variation. Nat Genet. 2008, 40: 1166-1174. 10.1038/ng.238.
Article CAS PubMed Google Scholar
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W, et al: Global variation in copy number in the human genome. Nature. 2006, 444: 444-454. 10.1038/nature05329.
Article CAS PubMed PubMed Central Google Scholar
Perry GH, Yang F, Marques-Bonet T, Murphy C, Fitzgerald T, Lee AS, Hyland C, Stone AC, Hurles ME, Tyler-Smith C, et al: Copy number variation and evolution in humans and chimpanzees. Genome Res. 2008, 18: 1689-1710.
Article Google Scholar
Chen WK, Swartz Joshua, Rush Laura, Alvarez Carlos: Mapping DNA structural variation in dogs. Genome Res. 2009, 19: 500-509.
Article CAS PubMed PubMed Central Google Scholar
Liu GE, Hou Y, Zhu B, Cardone MF, Jiang L, Cellamare A, Mitra A, Alexander LJ, Coutinho LL, Dell'Aquila ME, et al: Analysis of copy number variations among diverse cattle breeds. Genome Res. 2010, 20: 693-703. 10.1101/gr.105403.110.
Article CAS PubMed PubMed Central Google Scholar
Guryev V, Saar K, Adamovic T, Verheul M, van Heesch SAAC, Cook S, Pravenec M, Aitman T, Jacob H, Shull JD, et al: Distribution and functional impact of DNA copy number variation in the rat. Nat Genet. 2008, 40: 538-545. 10.1038/ng.141.
Article CAS PubMed Google Scholar
She X, Cheng Z, Zo"llner S, Church DM, Eichler EE: Mouse segmental duplication and copy number variation. Nat Genet. 2008, 40: 909-914. 10.1038/ng.172.
Article CAS PubMed PubMed Central Google Scholar
Emerson JJ, Cardoso-Moreira M, Borevitz JO, Long M: Natural selection shapes genome-wide patterns of copy-number polymorphism in Drosophila melanogaster. Science. 2008, 320: 1629-1631. 10.1126/science.1158078.
Article CAS PubMed Google Scholar
Infante JJ, Dombek KM, Rebordinos L, Cantoral JM, Young ET: Genome-wide amplifications caused by chromosomal rearrangements play a major role in the adaptive evolution of natural yeast. Genetics. 2003, 165: 1745-1759.
CAS PubMed PubMed Central Google Scholar
Skvortsov D, Abdueva D, Stitzer ME, Finkel SE, Tavaré S: Using expression arrays for copy number detection: an example from E. coli. BMC Bioinformatics. 2007, 8: 203-10.1186/1471-2105-8-203.
Article PubMed PubMed Central Google Scholar
Nathan M, Springer , Ying K, Yan F, Tieming J, Yeh CT, Yi J, Wei W, Richmond T, Kitzman J, et al: Maize Inbreds Exhibit High Levels of Copy Number Variation (CNV) and Presence/Absence Variation (PAV) in Genome Content. PLos Genet. 2009, 5 (11): e1000734-10.1371/journal.pgen.1000734.
Article Google Scholar
André Beló , Beatty M, David H, Fengler K, Bailin L, Antoni Rafalski : Allelic genome structural variations in maize detected by array comparative genome hybridization. Theor Appl Genet. 2009, 120: 355-367.
Article PubMed Google Scholar
Korbel JO, Urban AE, Affourtit JP, Godwin B, Grubert F, Simons JF, Kim PM, Palejev D, Carriero NJ, Du L, et al: Paired-end mapping reveals extensive structural variation in the human genome. Science. 2007, 318: 420-426. 10.1126/science.1149504.
Article CAS PubMed PubMed Central Google Scholar
Mills RE, Walter K, Stewart C, Handsaker RE, Chen K, Alkan C, Abyzov A, Yoon SC, Ye K, Cheetham RK, et al: Mapping copy number variation by population-scale genome sequencing. Nature. 2011, 470 (7332): 59-65. 10.1038/nature09708.
Article CAS PubMed PubMed Central Google Scholar
Sudmant PH, Kitzman JO, Antonacci F, Alkan C, Malig M, Tsalenko A, Sampas N, Bruhn L, Shendure J, Eichler E: Diversity of human copy number variation and multicopy genes. Science. 2010, 330 (6004): 641-646. 10.1126/science.1197005.
Article CAS PubMed PubMed Central Google Scholar
Henrichsen CN, Chaignat E, Reymond A: Copy number variants, diseases and gene expression. Hum Mol Genet. 2009, 18: R1-R8. 10.1093/hmg/ddp011.
Article CAS PubMed Google Scholar
Zhang F, Gu W, Hurles ME, Lupski JR: Copy number variation in human health, disease, and evolution. Annu Rev Genomics Hum Genet. 2009, 10: 451-481. 10.1146/annurev.genom.9.081307.164217.
Article CAS PubMed PubMed Central Google Scholar
Gonzalez E, Kulkarni H, Bolivar H, Mangano A, Sanchez R, Catano G, Nibbs RJ, Freedman BI, Quinones MP, Bamshad MJ, et al: The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005, 307: 1434-1440. 10.1126/science.1101160.
Article CAS PubMed Google Scholar
Aitman TJ, Dong R, Vyse TJ, Norsworthy PJ, Johnson MD, Smith J, Mangion J, Roberton-Lowe C, Marshall AJ, Petretto E, et al: Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans. Nature. 2006, 439: 851-855. 10.1038/nature04489.
Article CAS PubMed Google Scholar
Cong B, Barrero LS, Tanksley SD: Regulatory change in YABBY-like transcription factor led to evolution of extreme fruit size during tomato domestication. Nat Genet. 2008, 40: 800-804. 10.1038/ng.144.
Article CAS PubMed Google Scholar
Stranger BE, Forrest MS, Dunning M, Ingle CE, Beazley C, Thorne N, Redon R, Bird CP, de Grassi A, Lee C, et al: Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science. 2007, 315: 848-853. 10.1126/science.1136678.
Article CAS PubMed PubMed Central Google Scholar
Tian D, Wang Q, Zhang P, Araki H, Yang S, Kreitman M, Nagylaki T, Hudson R, Bergelson J, Chen J: Single-nucleotide mutation rate increases close to insertions/deletions in eukaryotes. Nature. 2008, 455: 105-108. 10.1038/nature07175.
Article CAS PubMed Google Scholar
Feltus FA, Wan J, Schulze SR, Estill JC, Jiang N, Paterson AH: An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments. Genome Res. 2004, 14: 1812-1819. 10.1101/gr.2479404.
Article CAS PubMed PubMed Central Google Scholar
Shen YJ, Jiang H, Jin JP, Zhang ZB, Xi B, He YY, Wang G, Wang C, Qian L, Li X, et al: Development of genomewide DNA polymorphism database for map-based cloning of rice genes. Plant Physiol. 2004, 135: 1198-1205. 10.1104/pp.103.038463.
Article CAS PubMed PubMed Central Google Scholar
Hao Hu: Genomics and Comparative Genomics Hybridization (CGH) studies of rice chromosome 4; differentiation of a MITE system and its inference for a diphyletic origin of two subspecies of Asian cultivated rice and molecular mechanism of stress response of a transposon family. Chinese academy of sciences doctoral dissertation. 2005, Beijing, China
Google Scholar
Kumar R, Qiu J, Joshi T, Valliyodan B, Xu D, Nguyen HT: Single feature polymorphism discovery in rice. PLoS One. 2007, 2 (3): e284-10.1371/journal.pone.0000284.
Article PubMed PubMed Central Google Scholar
Bruce M, Hess A, Bai J, Mauleon R, Diaz MG, Sugiyama N, Bordeos A, Wang GL, Leung H, Leach J: Detection of genomic deletions in rice using oligonucleotide microarrays. BMC Genomics. 2009, 10: 129-139. 10.1186/1471-2164-10-129.
Article PubMed PubMed Central Google Scholar
Fadista J, Thomsen B, Holm LE, Bendixen C: Copy number variation in the bovine genome. BMC Genomics. 2010, 11: 284-10.1186/1471-2164-11-284.
Article PubMed PubMed Central Google Scholar
Cai HW, Morishima H: QTL clusters reflect character associations in wild and cultivated rice. Theor Appl Genet. 2002, 104: 1217-1228. 10.1007/s00122-001-0819-7.
Article CAS PubMed Google Scholar
Ma J, Bennetzen JL: Rapid recent growth and divergence of rice nuclear genomes. Proc Natl Acad Sci USA. 2004, 101: 12404-12410. 10.1073/pnas.0403715101.
Article CAS PubMed PubMed Central Google Scholar
Nakamura I, Watanabe KN, Sato YI: Identification of SNPs in the waxy gene among glutinous rice cultivars and their evolutionary significance during the domestication process of rice. Theor Appl Genet. 2004, 108: 1200-1204. 10.1007/s00122-003-1564-x.
Article PubMed Google Scholar
Graubert TA, Cahan P, Edwin D, Selzer RR, Richmond TA, Eis PS, Shannon WD, Li X, McLeod HL, Cheverud JM, et al: A high-resolution map of segmental DNA copy number variation in the mouse genome. PLoS Genet. 2007, 3 (1): e3-10.1371/journal.pgen.0030003.
Article PubMed PubMed Central Google Scholar
Chiang DY, Getz G, Jaffe DB, O'Kelly MJ, Zhao X, Carter SL, Russ C, Nusbaum C, Meyerson M, Lander ES: High-resolution mapping of copynumber alterations with massively parallel sequencing. Nat Methods. 2008, 6: 99-103.
Article PubMed PubMed Central Google Scholar
Alkan C, Kidd JM, Marques-Bonet T, Aksay G, Antonacci F, Hormozdiari F, Kitzman JO, Baker C, Malig M, Mutlu O, et al: Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet. 2009, 41: 1061-1067. 10.1038/ng.437.
Article CAS PubMed PubMed Central Google Scholar
Xie C, Tammi M: CNV-seq, a new method to detect copy number variation using high-throughput sequencing. BMC Bioinformatics. 2009, 10: 80-10.1186/1471-2105-10-80.
Article PubMed PubMed Central Google Scholar
Chen W, Kalscheuer V, Tzschach A, Menzel C, Ullmann R, Schulz MH, Erdogan F, Li N, Kijas Z, Arkesteijn G, Pajares IL, et al: Mapping translocation breakpoints by next-generation sequencing. Genome Res. 2008, 18: 1143-1149. 10.1101/gr.076166.108.
Article CAS PubMed PubMed Central Google Scholar
Campbell PJ, Stephens PJ, Pleasance ED, O'Meara S, Li H, Santarius T, Stebbings LA, Leroy C, Edkins S, Hardy C, et al: Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat Genet. 2008, 40: 722-729. 10.1038/ng.128.
Article CAS PubMed PubMed Central Google Scholar
Lee AS, Gutierrez-Arcelus M, Perry GH, Vallender EJ, Johnson WE, Miller GM, Korbel JO, Lee C: Analysis of copy number variation in the rhesus macaque genome identifies candidate loci for evolutionary and human disease studies. Hum Mol Genet. 2008, 17: 1127-1136. 10.1093/hmg/ddn002.
Article CAS PubMed Google Scholar
Vitte C, Ishii T, Lamy F, Brar D, Panaud O: Genomic paleontology provides evidence for two distinct origins of Asian rice (Oryza sativa L.). Molecular Genetics and Genomics. 2004, 272: 504-511. 10.1007/s00438-004-1069-6.
Article CAS PubMed Google Scholar
Zhu Q, Ge S: Phylogenetic relationships among A-genome species of the genus Oryza revealed by intron sequences of four nuclear genes. New Phytologist. 2005, 167: 249-265. 10.1111/j.1469-8137.2005.01406.x.
Article CAS PubMed Google Scholar
Morishima H: Conservation and genetic characterization of plant genetic resources. Pages 31-42 in MAFF International Workshop on Genetic Resources. National Institute of Agrobiological Resources, Tsukuba, Japan. 1998
Google Scholar
Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P, et al: Origins and functional impact of copy number variation in the human genome. Nature. 2009, 464: 704-712.
Article PubMed PubMed Central Google Scholar
Hulbert SH, Webb CA, Smith SM, Sun Q: Resistance gene complexes: evolution and utilization. Annu Rev Phytopathol. 2001, 3 (9): 285-312.
Article Google Scholar
McHale L, Tan X, Koehl P, Michelmore RW: Plant NBS-LRR proteins: adaptable guards. Genome Biol. 2006, 7: 212-
Article PubMed PubMed Central Google Scholar
Wisser RJ, Qi S, Hulbert SH, Kresovich S, Nelson RJ: Identification and characterization of regions of the rice genome associated with broad-spectrum, quantitative disease resistance. Genetics. 2005, 169: 2277-2293. 10.1534/genetics.104.036327.
Article CAS PubMed PubMed Central Google Scholar
Ballini E, Morel JB, Droc G, Price A, Courtois B, Notteghem JL, Tharreau D: A genome-wide meta-analysis of rice blast resistance genes and quantitative trait loci provides new insights into partial and complete resistance. Mol Plant Microbe Interact. 2008, 21: 859-868. 10.1094/MPMI-21-7-0859.
Article CAS PubMed Google Scholar
Ameline-Torregrosa CB, Wang B, O'Bleness MS, Deshpande S, Zhu H, Roe B, Young ND, Cannon SB: Identification and characterization of nucleotide-binding site-leucine-rich repeat genes in the model plant Medicago truncatula. Plant Physiol. 2008, 146: 5-21.
Article CAS PubMed PubMed Central Google Scholar
Selzer RR, Richmond TA, Pofahl NJ, Green RD, Eis PS, Nair P, Brothman AR, Stallings RL: Analysis of chromosome breakpoints in neuroblastoma at sub-kilobase resolution using fine-tiling oligonucleotide array CGH. Genes Chromosomes Cancer. 2005, 44: 305-319. 10.1002/gcc.20243.
Article CAS PubMed Google Scholar
Olshen AB, Venkatraman ES, Lucito R, Wigler M: Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004, 5: 557-572. 10.1093/biostatistics/kxh008.
Article PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the Agricultural Wild Resources Protection Project of MOA, China, and the Basic Research Budget of China National Rice Research Institute (No. 2009RG001-3). We thank Y Ren for technical support and excellent discussions, and the associate editor and two anonymous reviewers for their valuable suggestions.

Author information

Authors and Affiliations

State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
Ping Yu, Caihong Wang, Qun Xu, Yue Feng, Xiaoping Yuan, Hanyong Yu, Yiping Wang, Shengxiang Tang & Xinghua Wei

Authors

Ping Yu
View author publications
You can also search for this author in PubMed Google Scholar
Caihong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qun Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yue Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Hanyong Yu
View author publications
You can also search for this author in PubMed Google Scholar
Yiping Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shengxiang Tang
View author publications
You can also search for this author in PubMed Google Scholar
Xinghua Wei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinghua Wei.

Additional information

Authors' contributions

PY and XW conceived and designed the experiments. PY, CW, QX, and YF performed DNA preparations. HY and YW carried out microarray processing, and PY performed PCR analysis and contributed to interpretation of the data. PY, XW, and ST drafted the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1:Excel file includes Figure S1. Correlation between chromosome length and number of CNVs. (XLS 28 KB)

Additional file 2:Excel file includes Table S1. CNV regions detected and the sizes ranges of CNVs. (XLS 82 KB)

Additional file 3:Excel file includes Table S2. Primers used for PCR validation of CNV regions. (XLS 55 KB)

Additional file 4:Excel file includes Table S3. Genes included within CNV regions. (XLS 88 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Yu, P., Wang, C., Xu, Q. et al. Detection of copy number variations in rice using array-based comparative genomic hybridization. BMC Genomics 12, 372 (2011). https://doi.org/10.1186/1471-2164-12-372

Download citation

Received: 14 April 2011
Accepted: 20 July 2011
Published: 20 July 2011
DOI: https://doi.org/10.1186/1471-2164-12-372

Detection of copy number variations in rice using array-based comparative genomic hybridization

Abstract

Background

Results

Conclusion

Background

Results

CNV detection using aCGH

PCR analysis

Annotation of CNVs

Discussion

Conclusion

Methods

Source of DNA samples

Array CGH

Polymerase chain reaction (PCR)

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us