- Methodology article
- Open Access
Systematic genome sequence differences among leaf cells within individual trees
BMC Genomics volume 15, Article number: 142 (2014)
Even in the age of next-generation sequencing (NGS), it has been unclear whether or not cells within a single organism have systematically distinctive genomes. Resolving this question, one of the most basic biological problems associated with DNA mutation rates, can assist efforts to elucidate essential mechanisms of cancer.
Using genome profiling (GP), we detected considerable systematic variation in genome sequences among cells in individual woody plants. The degree of genome sequence difference (genomic distance) varied systematically from the bottom to the top of the plant, such that the greatest divergence was observed between leaf genomes from uppermost branches and the remainder of the tree. This systematic variation was observed within both Yoshino cherry and Japanese beech trees.
As measured by GP, the genomic distance between two cells within an individual organism was non-negligible, and was correlated with physical distance (i.e., branch-to-branch distance). This phenomenon was assumed to be the result of accumulation of mutations from each cell division, implying that the degree of divergence is proportional to the number of generations separating the two cells.
At the beginning of the 21st century, genome sequences of two closely related species, human and chimpanzee, were found to differ by approximately 4% based on conventional genome sequencing technology . With the advent of next-generation sequencing (NGS), it has been established that each person has a unique genome . Within a single organism, genome sequences may be epigenetically different between cells, and sporadic differences are sometimes present between cells from different organs . It is not clear, however, whether each cell within an individual organism possesses a systematically different genome sequence.
Various breakthroughs have been steadily reshaping our understanding of genomes. These advances include accumulating analyses of whole-genome sequences of individuals [4, 5], identification of various non-coding RNAs , discovery of the existence of highly repeated sequences , and recognition of frequent recombination of genome structures [8, 9]. Recently, an intensive study on the fate of cancerous cells by NGS revealed that lineages of such cells are vigorously mutating . Advanced papers on this topic have subsequently appeared [11, 12].
On the other hand, genome sequence differences have been examined by the copy number variation analysis between normal cells within a single organism [13–15], which informed us of frequent occurrence of mutation in the form of replication slippage at particular genomic loci. In a sense, this is a filtered (i.e., restricted to the tandem repeat sequences) observation of genome alterations. More wide observation of normal genomic DNA is just beginning as can be seen in the recent report [3, 16]. Our study is the first to detect systematic genome sequence differences among cells in single organisms, i.e., within individuals of two woody plant species (Figure 1E).
There has been a hypothesis (genetic mosaicism hypothesis) that long statured plants accumulate spontaneous mutations that expanded among modules (shoots, branches, leaf etc.) and become genetically mosaic as they grow . This hypothesis is explicitly based on the idea of finite spontaneous mutation rate. That is, DNA replication proceeds with limited accuracy, i.e., 10-6 to 10-9 errors/base/replication  and thus every replicated genome sequence (e.g., the 3 × 109-bp sequence of the human genome) naturally differs from its parental genome. In general, these differences were too small to be directly detected, as they were often below the detection limit of sequencing analysis. Consequently, mutation rate has been conventionally estimated indirectly based on phenotypic changes, such as variation in antibiotic resistance. This situation has been changed by the advent of the NGS (next generation sequencing), enabling the detection of low rate of mutations . However, its application is limited mainly due to high cost and difficulty in data processing .
Fortuitously, Genome Profiling (GP) (Figure 2), an easily operable and informative genome analysis method [21–28] is sufficiently competent to detect differences between closely related cells [27, 28]. Compared with conventional sequencing approaches, GP involves two unique procedures (Figure 2): i) collection of DNA fragments from genomic DNA by random PCR  and ii) acquisition of DNA sequence information using micro-temperature gradient gel electrophoresis (μTGGE) by separating DNA fragments and observing their melting profiles (Figure 2B) [30–32]. In this method, the property, spiddos (species identification dots), derived from the DNA sequence information  plays the pivotal role in identifying a genome and enables us to measure the genome distance (see Methods).
GP has been used as a tool for universal species identification [21, 24, 27, 28, 33] and as an accurate detector of mutation [34, 35]. In this study, we applied the GP method to a new challenge: detection of extremely small genomic differences between very closely related cells with the aim of examining within-organism sequence variation.
Results and discussion
We used Japanese beech (Fagus crenata) trees to examine whether GP was able to reveal if all leaves within a single tree had identical genome sequences (Figures 2 and 3). More specifically, we analyzed sets of sp ecies id entification do ts (spiddos), a pivotal GP parameter derived from genome sequences (Figure 2C), that were obtained from genome profiles, specified by both mobility and melting temperature, both of which are determined after calibration and normalization of band patterns by a computer using co-migrating internal references (see Methods). Although genome profiles (i.e., DNA patterns generated by μTGGE analysis) were not always reproducible because of experimental fluctuations (i.e., environmental temperature, instrumental drift and others), spiddos were highly reproducible as a result of a normalization process that compensated for experimental fluctuations (Figure 2). As shown in Figure 4A, all leaves on the same Japanese beech branch (e.g., A1-1, A1-2, and A1-3, where “A1-2” refers to tree A, branch 1, leaf 2) clustered together. This was also the case for the genome profiles of leaves on branches A2 and A3. Leaves from different branches were found to have different genome sequences. Spiddos of branch A1 and A2 leaves were more similar to one another than to spiddos of leaves on branch A3, located furthest from the ground (Figure 3). Differences were observed in spiddos between leaves belonging to the same branch, but these differences were the level of experimental errors and thus they cannot be said to be significant at this moment . These results reveal that within statistical significance, leaves from individual branches possessed identical genome sequences, but had distinctively different sequences from those of different branches, a finding not previously reported. This result was further confirmed by conducting a similar experiment using different Japanese beech individuals. We also analyzed another species, Yoshino cherry (Prunus × yedoensis), located ~800 km from the site of the Japanese beech trees for more generalized confirmation (Figure 1). Finally, to detect methodological differences, we sequenced a particular DNA band obtained from GP (see in Additional file 1: Figure S1 and for details see Additional file 2). Throughout these experiments, we consistently reached the same conclusion: genome sequences within organisms were not identical, but instead varied systematically.
Figure 4B reveals that very similar results were obtained from the two additional Japanese beech trees. Interestingly, the same relationship trend was observed among all three trees: spiddos of leaves from uppermost branches (A3, B3, and C3) were distinct from spiddos of other leaves (Figure 4B). The cluster dendrogram in Figure 4B was globally constructed based on the whole set of distances (d G ) obtained from all leaf spiddos (Additional file 3: Table S1); consequently, the resulting logically expected structure—leaves on the same branch grouped together and branches on the same tree clustered together—is most impressive and unexpected, demonstrating the effectiveness of this approach. It is therefore evident that genomes of leaves on a tree are neither completely identical to one another nor randomly different but, rather, systematically differ depending on branch location.
As shown in Figure 1, similar results were reproducibly obtained using the other species, Yoshino cherry. Results of cluster analyses of distances (d G ) obtained using spiddos data from three independent GP experiments using the same samples from five branches (Additional file 4: Table S2) are shown in Figure 1A-C; clustering results based on an average of the three trials are shown in Figure 1D. These results of individual experiments (Figure 1A,C) show basically the same pattern as those obtained from the statistically more reliable averages (Figure 1D), indicating that this experimental system has a rather low variance (in other words, a single experiment can provide a good prospect) with only a minor exception: positional exchange of branches 3 and 4 in Figure 1C. The situation observed in Figure 4 (Japanese beech) also held true for Yoshino cherry, i.e., genome profiles of leaves were not identical, but instead differed systematically. In addition, genomes of leaves from the uppermost branch (5–1, 5–2, and 5–3) were genetically distant from leaves of middle branches, indicating a correlation between genomic distance and branch location. The same phenomenon was thus observed in two different, widely separated species, namely, that leaves from the same tree have different genome sequences that can be distinguished using GP.
Our discovery was partially corroborated upon further investigation using direct sequencing. As shown in (see Additional file 1: Figure S1), leaves from the same branch tended to have more closely related sequences, as seen in pairs of leaves from the same Japanese beech branches (B2-2 and B2-3) and (B3-1 and B3-2) in (see Additional file 1: Figure S1B) and from closely located branches of Yoshino cherry (B2-1 and B3-1) and (B4-1 and B5-1) in (see Additional file 1: Figure S1A). Because of missing data caused by generation of artifacts during cloning and sequencing, these results are somewhat equivocal; nonetheless, these data are congruent with the conclusions drawn from the GP experiments. With respect to these direct sequencing results, the experimental procedures used, and sequencing in general, need to be taken into account. DNA fragments generated from the GP experiment were collected by excising their bands from polyacrylamide gels, the most reliable method for obtaining sequences common to both GP and conventional sequencing. Collected DNA was then subjected to cloning and sequencing, two procedures that can introduce mutations. Many spurious sequences were in fact obtained and discarded, including sequences having very low sequence similarity to the primary sequence generated from the DNA band, and sequences of non-plant origin. Although they were within an apparently acceptable range based on sequence consistency (i.e., high similarity), the results shown in (see Additional file 1: Figure S1) were thus subject to limitations inherent to the cloning and sequencing process. Nevertheless, this illustrates one difficulty encountered when using such a clone-isolation- and sequencing-based approach to identify mutation frequencies: the two mutation types—original mutations and sequencing operation-derived mutations (presumably introduced during template preparation, PCR-amplification, sequencing, and base-calling), cannot be distinguished in the final clonal sequencing results. To obtain statistically significant results using conventional high-precision sequencing, high-volume sequencing of the multiple-million base-pair level must be carried out to separate infrequently occurring mutations (e.g., < 10-6/mutations/base/replication) from background noise. In this regard, it should be noted that the ability of the GP method to overcome this difficulty has been experimentally demonstrated: GP has been used successfully for species identification and classification [24, 25, 27, 28, 36] and in high-sensitivity mutation assays [34, 35].
In this study, we have demonstrated that leaves from the same tree do not have exactly identical genome sequences. This conclusion is expected to be applicable to any multi-celled organism, as DNA is not perfectly replicated in any organism, and thus each genome replication cycle induces mutations that are usually too infrequent to be detected (10-6 to 10-9 mutations/base/replication) . In addition, epigenetic methylation of DNA, of which degree must be different from cell to cell and may have a potential to induce base-substitution during PCR, does not effect its PCR amplification , which was independently confirmed in our study (Table 1 and Figure 5). Based on the total number of base pairs in the DNA bands obtained by random PCR (i.e., roughly 10 bands, each 1000 bp), we tentatively estimate the GP method has a detection sensitivity of 10-4 mutations/base/replication. More specifically, the total number of mutations accumulating over g generations, μ(g), can be calculated using the formula:
where μ(i) and γ(i) represent replication-dependent and repair-dependent mutation rates, respectively. If we tentatively assume μ(i) = μ c (a constant) and μ(i) > > γ(i) for all i, then
This estimate indicates that the GP method cannot detect mutations occurring at a frequency lower than g · μ c (≤ 10-4/base). Consequently, leaf genomes must contain a significant number of mutations, equivalent to the sum of replication- and repair-caused mutations. This finding leads us to consider whether the large number of estimated mutations implies that mutation events during replication and repair (a type of 'somatic mutation’) have been unexpectedly frequent , or if instead there is a large cell generation difference between tree branches, as follows:
If we assume that μ c = 10-8 in the above context, then g, the number of generations, must be
Because longitudinally tandem consecutive cells expand to the length g' · a, where g' is the number of cell generations and a is the unit cell length, we can calculate the number of cell generations (g') separating two branches. If a = 20 μm and the branch-to-branch distance, B, is 2 m, then
and thus from Eq. 2,
Based on this tentative calculation, the apparent genomic distance observed using the GP method, which has a detection limit ≥ 10-4/base, is within a reasonable range. In other words, the accumulated point mutations are as a consequence of the large generational difference between cells. Obviously, this conclusion needs to be confirmed by other approaches. Our finding regarding this unexpectedly wide genome-to-genome distance will surely collect the interest in this theme which have been less payed with attention.
Except for cancer cells, cells within an individual organism have been previously believed to possess identical genomes. Two brief reports have recently appeared suggesting that cells from a single individual might have different genomes [3, 16], although no hard evidence exists nor has systematic research been performed to confirm those observations. Nevertheless, these reports are consistent with the findings of our study.
The study reported here provided with the first systematic analysis of genome sequence differences among cells in single individuals using the GP method. As a result leaf genome sequences within individual trees were found not to be identical, but varied systematically from the bottom to the top of the tree. Since this phenomenon was detected by the GP method that cannot detect the mutation of less than 10-5/base/replication, a large number of accumulated mutations must exist between distantly located cells in the tree.
This fact leads to a natural inference that two cells in an individual differ in their genome sequences in relation to their physical distances. In other words, no two cells have completely identical genome sequences. This finding and inference will surely have an influence on the interpretation of various phenomena including mutagens, cancer and others.
Leaves of Japanese beech (or Buna) (Fagus crenata) trees growing in Sapporo, Japan, and Yoshino cherry (Sakura) (Prunus × yedoensis) trees from Saitama, Japan, were used in this study. The notation A1-2 denotes leaf 2 on branch 1 of tree A. Branch numbers were assigned in the order in which they appeared, beginning from lower (ground) to upper (tree top) levels.
Genomic DNA preparation
After washing leaf samples in 10% sodium dodecyl sulfate (SDS), DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method . Briefly, 100–120-mg samples (wet weight) were homogenized with a mortar and pestle using liquid nitrogen. One milliliter of CTAB solution (200 mM Tris–HCl [pH 9.0], 2% [w/v] CTAB, 2% [w/v] polyvinylpyrrolidone, 0.1% [v/v] 2-mercaptoethanol, 1.4 M NaCl, and 20 mM ethylenediaminetetraacetic acid) was immediately added to the crushed cells, followed by incubation for 1 h at 65°C. After incubation, a 24:1 chloroform-isoamyl alcohol mixture was added; the solution was mixed gently and then centrifuged for 10 min at 12,000 × g (14,000 rpm). This step was repeated twice. An equal volume of propanol was then added to the supernatant, which was centrifuged for 5 min at 16,000 × g (15,000 rpm). In most cases, the pellet obtained was washed with 70% ethanol, centrifuged, and desiccated using an evaporator. Finally, 100 μl of phosphate-buffered saline was added to the precipitate to dissolve the pellet.
GP technology is sufficiently robust such that slight impurities of denatured proteins or polysaccharides will not interfere. Other plant cell components, such as alkaloids and secondary products, can be inhibitory to the PCR reaction, however; consequently, DNA samples were diluted prior to amplification.
Genome profiling (GP)
Genome profiling (GP) uses a set of DNA fragments sampled from genomic DNAs, and is composed of three fundamental steps: random PCR, micro-temperature gradient gel electrophoresis (μTGGE), and data normalization by computer processing [22, 32] (Figure 2). Random PCR can employ arbitrary primers for the PCR reaction because of the relaxed nature of primer binding to template DNA under sufficiently low temperatures. This attribute allows samples of unknown genomic sequence, for which specific primers cannot be designed, to be amplified. As a consequence, DNA fragments from any genomic DNA can be collected independently of the sequence of an oligonucleotide primer used [30, 31] (Note that a single primer is used for random PCR).
Random PCR was performed using primers HUNT (5′-dTGCTGCTGCTGC-3′) and Pfm12 (5′-dAGAACGCGCCTG-3′), which were Cy3-labeled at their 5′ ends. The reaction mixture (25 μl total volume) for random PCR contained 1 ng template DNA, 100 μM primer DNA, 200 μM dNTPs, 10 mM Tris–HCl (pH 9.0), 50 mM KCl, 2.5 mM MgCl2, and 0.02 unit μl-1Taq DNA polymerase (Takara Bio Inc., Shiga, Japan). During random PCR, contamination by other organisms should be carefully avoided. To inactivate any contaminating DNAs that could act as a template, the entire random PCR solution, without the template DNA, was therefore UV-irradiated prior to the reaction. Random PCR was carried out using 30 cycles of denaturation (94°C, 30 s), annealing (26°C, 1 min), and extension (47°C, 1 min) on a C1000 thermal cycler (Bio-Rad, Hercules, CA, USA). The second random PCR mixture (50 μl volume) contained 1 μl of the first PCR product as template and the same concentrations of constituents used in the original reaction. The reaction was performed using 10 cycles of denaturation (94°C, 30 s), annealing (60°C, 1 min), and extension (74°C, 1 min). Only 10 cycles were used to ensure that the reaction was terminated before all primer molecules were consumed; this was necessary to guarantee that the major PCR products were in a double-stranded state and thus suitable for TGGE analysis (i.e., so that the melting transition of double-stranded DNA to a single-stranded form can be detected).
For μTGGE, we used a tiny slab gel (24 × 16 × 1 mm3) set on a μ-TG temperature-gradient generator (Taitec, Iruma, Japan) for electrophoresis . Two internal reference DNAs with known melting patterns were co-migrated during each electrophoretic run to calibrate each genome profile, giving highly reproducible results : a 200-bp Ref1 (a 191-bp fragment from the bacteriophage fd gene VIII, sites 1350–1540, attached to a 9-bp sequence, CTACGTCTC, at the 3′ end; Tm = 60°C) and a 900-bp Ref2 taken from a 4361-bp pBR322 fragment (Tm = 61.4°C). Fluorescently-labeled primers MA1 (5′-cy3-dTGCTACGTCTCTTCCGATGCTGTCTTTCGCT-3′) and MA2 (5′-dCCTTGAATTCTATCGGTTTATCA-3′), Ref6F (5′-cy3-dGCCGGCATCACCGGCGCCACAGGTGCGGTTG-3′), and Ref6R (5′-dTAGCGAGGTGCCGCCGGCTTCCATTCAGGTC-3′) were used to generate internal references 1 and 2, respectively. The gel used was 6% polyacrylamide (19:1 acrylamide:bis) containing 500 mM Tris–HCl, 485 mM boric acid, 20 mM EDTA (pH 8.0), and 8 M urea. Approximately 2 μg of DNA was loaded onto the gel and subjected to electrophoresis with a linear temperature gradient of 15 to 65°C for 12 minutes at 100 V cm-1. After electrophoresis, DNA bands were detected using an FX Molecular Imager fluorescence imager (Bio-Rad).
Computer-aided data analysis
Genome profiles obtained by the GP method are highly informative, but difficult to interpret because of their complexity. To overcome this problem, feature points called spiddos can be introduced . Spiddos correspond to points where DNA structural transitions occur, such as from double-stranded to single-stranded DNA . The coordinates of spiddos are established to be reproducibly obtained by an internal reference-mediated normalization (i.e., the coordinates of the two reference points contained in each GP profile (ref 1 and ref 2, Figure 2C) are used to calibrate the coordinates of the featuring points for same DNAs) which is sequence- and size-dependent.
Using these normalized coordinates, a pattern similarity score (PaSS) between two genomes can be measured as follows:
where and correspond to the normalized positional vectors (composed of two elements: mobility μ and temperature θ) for spiddos and collected from two genome profiles, respectively, and i denotes the spiddo serial number. In general, 0 ≤ PaSS ≤ 1, and thus, 0 ≤ d G ≤ 1. PaSS is equal to one when two spiddo sets match perfectly.
Genomic distance (d G ), a more practical form, is derived from PaSS as follows:
If d G is sufficiently small (<< 1), the two genomes of interest belong to the same species.
Cluster analysis of GP data
DNA bands of interest were extracted from TGGE microgels and used as PCR templates in reaction mixtures containing 320 μM dNTPs, 100 μM primer pfm12 (5′-dAGAACGCGCCTG-3′), 10 mM Tris–HCl (pH 9.0), 50 mM KCl, 2.5 mM MgCl2, and 0.02 unit μl-1Taq DNA polymerase (Takara). Reaction conditions consisted of 30 cycles of denaturation at 94°C for 30 s, annealing at 60°C for 60 s, and extension at 74°C for 60 s. The resulting random PCR products (DNA) were ligated to pGEM-T Easy vectors (Promega, Madison, WI, USA) at 4°C overnight. Competent cells of E. coli DH5α (Toyobo Co. Ltd., Osaka, Japan) were transformed with the ligation product. Transformed cells were cultivated on LB agar plates (1% tryptone, 0.5% yeast extract, 1% NaCl [pH 7.0], and 1.5% agar) supplemented with ampicillin (10 mg in 200 ml of LB media), 20 μl X-Gal (50 mg ml-1 in dimethyformamide), and 100 μl of 0.1 M IPTG (isopropylthio-β-galactoside). The agar plates were incubated at 37°C for 12–14 h. White colonies on the plates were selected with a sterile toothpick, transferred to LB broth (1% tryptone, 0.5% yeast extract, and 1% NaCl, pH 7.0; 10 mg ampicillin), and incubated at 37°C for 12–14 h with shaking at about 180 rpm. After confirmation of gene insertion, plasmid DNA was purified using a Wizard Plus SV Minipreps DNA purification system (Promega) and commercially sequenced (Operon Bio-technology, Tokyo, Japan).
Availability of supporting data
Data sets supporting the results of this study are included within the article and its additional files.
Micro-temperature gradient gel electrophoresis
Species identification dots
Pattern similarity score
Varki A, Nelson DL: Genomic Comparisons of Humans and Chimpanzees. Annu Rev Anthropol. 2007, 36: 191-209. 10.1146/annurev.anthro.36.081406.094339.
Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, McGuire A, He W, Chen Y, Makhijani V, Roth GT, Gomes X, Tartaro K, Niazi F, Turcotte CL, Irzyk GP, Lupski JR, Chinault C, Song XZ, Liu Y, Yuan Y, Nazareth L, Qin X, Muzny DM, Margulies M, Weinstock GM, Gibbs RA, Rothberg JM: The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008, 452: 872-876. 10.1038/nature06884.
Yong E: Tree’s leaves genetically different from its roots. Nat news. 2012, doi:10.1038/nature.2012.11156
Kidd JM, Gravel S, Byrnes J, Moreno-Estrada A, Musharoff S, Bryc K, Degenhardt JD, Brisbin A, Sheth V, Chen R, McLaughlin SF, Peckham HE, Omberg L, Chung CAB, Stanley S, Pearlstein K, Levandowsky E, Acevedo-Acevedo S, Auton A, Keinan A, Acuña-Alonzo V, Barquera-Lozano R, Canizales-Quinteros S, Eng C, Burchard EG, Russell A, Reynolds A, Clark AG, Reese MG, Lincoln SE, et al: Population Genetic Inference from Personal Genome Data: Impact of Ancestry and Admixture on Human Genomic Variation. Am J Hum Genet. 2012, 91: 660-671. 10.1016/j.ajhg.2012.08.025.
Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, Schraiber JG, Jay F, Prüfer K, de Filippo C, Sudmant PH, Alkan C, Fu Q, Do R, Rohland N, Tandon A, Siebauer M, Green RE, Bryc K, Briggs AW, Stenzel U, Dabney J, Shendure J, Kitzman J, Hammer MF, Shunkov MV, Derevianko AP, Patterson N, Andrés AM, Eichler EE, et al: A High-Coverage Genome Sequence from an Archaic Denisovan Individual. Science. 2012, 338: 222-226. 10.1126/science.1224344.
Guttman M, Amit I, Garber M, French C, Lin MF, Feldser D, Huarte M, Zuk O, Carey BW, Cassady JP, Cabili MN, Jaenisch R, Mikkelsen TS, Jacks T, Hacohen N, Bernstein BE, Kellis M, Regev A, Rinn JL, Lander ES: Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 2009, 458: 223-227. 10.1038/nature07672.
Jurka J, Kapitonov VV, Kohany O, Jurka MV: Repetitive Sequences in complex genomes: Structure and Evolution. Annu Rev Genomics Hum Genet. 2007, 8: 241-259. 10.1146/annurev.genom.8.080706.092416.
Saito A, Nishigaki K: Homogenization of Chromosomes Revealed by Oligonucleotide-Stickiness. J Comput Chem Jpn. 2004, 3: 145-152. 10.2477/jccj.3.145.
McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, Donnelly P: The Fine-Scale Structure of Recombination Rate Variation in the Human Genome. Science. 2004, 304: 581-584. 10.1126/science.1092500.
Navin N, Kendall J, Troge J, Andrews P, Rodgers L, McIndoo J, Cook K, Stepansky A, Levy D, Esposito D, Muthuswamy L, Krasnitz A, McCombie WR, Hicks J, Wigler M: Tumor evolution inferred by single-cell sequencing. Nature. 2011, 472: 90-94. 10.1038/nature09807.
Li GW, Xie XS: Central dogma at the single-molecule level in living cells. Nature. 2011, 475: 308-315. 10.1038/nature10315.
Blanpain C: Tracing the cellular origin of cancer. Nat Cell Biol. 2013, 15: 126-134. 10.1038/ncb2657.
Piotrowski A, Bruder CE, Andersson R, Diaz de Ståhl T, Menzel U, Sandgren J, Poplawski A, von Tell D, Crasto C, Bogdan A, Bartoszewski R, Bebok Z, Krzyzanowski M, Jankowski Z, Partridge EC, Komorowski J, Dumanski JP: Somatic mosaicism for copy number variation in differentiated human tissues. Hum Mutat. 2008, 29: 1118-1124. 10.1002/humu.20815.
Liang Q, Conte N, Skarnes WC, Bradley A: Extensive genomic copy number variation in embryonic stem cells. Proc Natl Acad Sci U S A. 2008, 105: 17453-17456. 10.1073/pnas.0805638105.
O'Huallachain M, Karczewski KJ, Weissman SM, Urban AE, Snyder MP: Extensive genetic variation in somatic human tissues. Proc Natl Acad Sci U S A. 2012, 109: 18018-18023. 10.1073/pnas.1213736109.
Grant B: DNA may differ between tissues. The Scientist. 2009,http://www.the-scientist.com/?articles.view/articleNo/27529/title/DNA-may-differ-between-tissues,
Gill DE, Chao L, Perkins SL, Wolf JB: Genetic mosaicism in plants and clonal animals. Annu Rev Ecol Syst. 1995, 26: 423-444. 10.1146/annurev.ecolsys.26.1.423.
Eckert KA, Kunkel TA: DNA polymerase Fidelity and the Polymerase Chain Reaction. Genome Res. 1991, 1: 17-24. 10.1101/gr.1.1.17.
Schmitt MW, Kennedy SR, Salk JJ, Fox EJ, Hiatt JB, Loeb LA: Detection of ultra-rare mutations by next-generation sequencing. Proc Natl Acad Sci U S A. 2012, 109: 14508-14513. 10.1073/pnas.1208715109.
Richter BG, Sexton DP: Managing and Analyzing Next-Generation Sequence Data. PLoS Comput Biol. 2009, 5: e1000369-10.1371/journal.pcbi.1000369.
Nishigaki K, Naimuddin M, Hamano K: Genome Profiling: a Realistic Solution for Genotype-Based Identification of Species1. J Biochem. 2000, 128: 107-112. 10.1093/oxfordjournals.jbchem.a022719.
Naimuddin M, Kurazono T, Zhang Y, Watanabe T, Yamaguchi M, Nishigaki K: Species-identification dots: a potent tool for developing genome microbiology. Gene. 2000, 261: 243-250. 10.1016/S0378-1119(00)00502-3.
Naimuddin M, Nishigaki K: Genome analysis technologies: Towards species identification by genotype. Brief Funct Genomic Proteomic. 2003, 1: 356-371. 10.1093/bfgp/1.4.356.
Kouduka M, Matuoka A, Nishigaki K: Acquisition of genome information from single-celled unculturable organisms (radiolaria) by exploiting genome profiling (GP). BMC Genomics. 2006, 7: 135-10.1186/1471-2164-7-135.
Kouduka M, Sato D, Komori M, Kikuchi M, Miyamoto K, Kosaku A, Naimuddin M, Matsuoka A, Nishigaki K: A Solution for Universal Classification of Species Based on Genomic DNA. Int J Plant Genomics. 2007, 2007: 27894-
Takasaka T, Sakurada K, Akutsu T, Nishigaki K, Ikegaya H: Trials of the detection of semen and vaginal fluid RNA using the genome profiling method. Leg Med (Tokyo). 2011, 13: 265-267. 10.1016/j.legalmed.2011.05.001.
Ahmed S, Komori M, Tsuji-Ueno S, Suzuki M, Kosaku A, Miyamoto K, Nishigaki K: Genome Profiling (GP) Method Based Classification of Insects: Congruence with That of Classical Phenotype-Based One. PLoS ONE. 2011, 6: e23963-10.1371/journal.pone.0023963.
Hamano K, Ueno-Tsuji S, Tanaka R, Suzuki M, Nishimura K, Nishigaki K: Genome profiling (GP) as an effective tool for monitoring culture collections: A case study with Trichosporon. J Microbiol Methods. 2012, 89: 119-128. 10.1016/j.mimet.2012.02.007.
Sakuma Y, Nishigaki K: Computer Prediction of General PCR Products Based on Dynamical Solution Structures of DNA1. J Biochem. 1994, 116: 736-741.
Nishigaki K, Miura T, Tsubota M, Sutoh A, Amano N, Husimi Y: Structural analysis of nucleic acids by precise denaturing gradient gel electrophoresis: II. Applications to the analysis of subtle and drastic mobility changes of oligo- and polynucleotides1. J Biochem. 1992, 111: 151-156.
Nishigaki K, Saito A, Takashi H, Naimuddin M: Whole genome sequence-enabled prediction of sequences performed for random PCR products of Escherichia coli. Nucleic Acids Res. 2000, 28: 1879-1884. 10.1093/nar/28.9.1879.
Biyani M, Nishigaki K: Hundredfold productivity of genome analysis by introduction of microtemperature-gradient gel electrophoresis. Electrophoresis. 2001, 22: 23-38. 10.1002/1522-2683(200101)22:1<23::AID-ELPS23>3.0.CO;2-Z.
Watanabe T, Saito A, Takeuchi Y, Naimuddin M, Nishigaki K: A database for the provisional identification of species using only genotypes: web-based genome profiling. Genome Biol. 2002, 3: research0010.1-research0010.8.
Futakami M, Salimullah M, Miura T, Tokita S, Nishigaki K: Novel Mutation Assay with High Sensitivity based on Direct Measurement of Genomic DNA Alterations: Comparable Results to the Ames Test. J Biochem. 2007, 141: 675-686. 10.1093/jb/mvm074.
Futakami M, Nishigaki K: Measurement of DNA Mutations Caused by Seconds-period UV-irradiation. Chem Lett. 2007, 36: 358-359. 10.1246/cl.2007.358.
Oda H, Hatakeyama Y, Iwano H: Phylogenetic relationships among Bacillus thuringiensis(Bacillaceae: Bacillales) strains based on a comparison of SSU rRNA sequences and genome profiling. Appl Entomol Zool. 2011, 46: 489-496. 10.1007/s13355-011-0068-4.
Flusberg BA, Webster DR, Lee JH, Travers KJ, Olivares EC, Clark TA, Korlach J, Turner SW: Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods. 2010, 7: 461-465. 10.1038/nmeth.1459.
Proffitt JH, Davie JR, Swinton D, Hattman S: 5-Methylcytosine Is Not Detectable in Saccharomyces cerevisiae DNA. Mol Cell Biol. 1984, 4: 985-988.
Klekowski EJ, Godfrey PJ: Ageing and mutation in plants. Nature. 1989, 340: 389-391. 10.1038/340389a0.
Ibrahim RIH: A modified CTAB protocol for DNA extraction from young flower petals of some medicinal plant species. Gene Conserve. 2011, 10: 165-182.
Nishgaki K, Tsubota M, Miura T, Chonan Y, Husimi Y: Structural analysis of nucleic acids by precise denaturing gradient gel electrophoresis: I. Methodology. J Biochem. 1992, 111: 144-150.
Lilley D, Dahlberg J: DNA Structures Part B: Chemical and Electrophoretic Analysis of DNA. Methods in Enzymology. Edited by: Abrams ES, Stanton VPJr. 1992, 71-104. Volume 212
Ward JH: Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963, 58: 236-244. 10.1080/01621459.1963.10500845.
Ahmed S, Nishigaki K: Error-Robust Nature of Genome Profiling Applied for Clustering of Species Demonstrated by Computer Simulation. Int J Biol Life Sci. 2007, 3: 82-88.
We are grateful to Dr. Hisao Honda (Hyogo University) for his scientific suggestions and encouragement. We thank Dr. Robert L. Fischer (University of California) for his valuable advice. This study was partly supported by a grant from the City Area Program (Saitama Metropolitan Area) from the Ministry of Education, Culture, Sports, and Technology (MEXT).
The authors declare that they have no competing interests.
SK carried out initial experiments to establish the overall experimental protocol, and collected and processed GP data. DD performed GP analyses and supplementary experiments, including methylation experiments, finalized the study, and wrote the manuscript. MS, NN, and TA analyzed the data; AS helped analyze the data and collected samples. KN designed and directed the study, analyzed data, and helped write and edit the manuscript. All authors read and approved the final manuscript.
Deepti Diwan, Shun Komazaki contributed equally to this work.
Electronic supplementary material
Additional file 1: Figure S1: Sequence-based clustering of leaves from (A) Yoshino cherry and (B) Japanese beech trees. Only sequence data that could be consistently assigned were used. Clustering was performed using Consensus Maker v2.0.0 (http://www.hiv.lanl.gov/content/sequence/CONSENSUS/consensus.html). Yoshino cherry tree leaf number designations are arbitrary. (TIFF 105 KB)
Additional file 2:DNA consensus sequence data of leaves used for analysis were derived using Consensus Maker v2.0.0 (http://www.hiv.lanl.gov/content/sequence/CONSENSUS/consensus.html), and then used to construct clustering tree. (These sequences are deposited in GenBank database: KJ411230-KJ411277). (DOCX 24 KB)
Additional file 3: Table S1: Genome distances (d G ) among genomes of Japanese beech tree leaves. Each value is the average of two independent experiments. (XLSX 15 KB)
Additional file 4: Table S2: Genome distances (d G ) among genomes of Yoshino Cherry tree leaves. Each value is the average of three independent experiments. (XLSX 12 KB)
About this article
Cite this article
Diwan, D., Komazaki, S., Suzuki, M. et al. Systematic genome sequence differences among leaf cells within individual trees. BMC Genomics 15, 142 (2014). https://doi.org/10.1186/1471-2164-15-142
- Genome sequence
- Genomic distance
- Mutation rate
- Japanese beech
- Yoshino cherry
- Leaf genomes
- Genome profiling (GP)
- Next-generation sequencing (NGS)