Reconstruction of putative DNA virus from endogenous rice tungro bacilliform virus-like sequences in the rice genome: implications for integration and evolution
- Motoyuki Kunii†1,
- Masanori Kanda†1,
- Hironori Nagano1,
- Ichiro Uyeda2,
- Yuji Kishima1Email author and
- Yoshio Sano1
© Kunii et al; licensee BioMed Central Ltd. 2004
Received: 11 June 2004
Accepted: 18 October 2004
Published: 18 October 2004
Plant genomes contain various kinds of repetitive sequences such as transposable elements, microsatellites, tandem repeats and virus-like sequences. Most of them, with the exception of virus-like sequences, do not allow us to trace their origins nor to follow the process of their integration into the host genome. Recent discoveries of virus-like sequences in plant genomes led us to set the objective of elucidating the origin of the repetitive sequences. Endogenous rice tungro bacilliform virus (RTBV)-like sequences (ERTBVs) have been found throughout the rice genome. Here, we reconstructed putative virus structures from RTBV-like sequences in the rice genome and characterized to understand evolutionary implication, integration manner and involvements of endogenous virus segments in the corresponding disease response.
We have collected ERTBVs from the rice genomes. They contain rearranged structures and no intact ORFs. The identified ERTBV segments were shown to be phylogenetically divided into three clusters. For each phylogenetic cluster, we were able to make a consensus alignment for a circular virus-like structure carrying two complete ORFs. Comparisons of DNA and amino acid sequences suggested the closely relationship between ERTBV and RTBV. The Oryza AA-genome species vary in the ERTBV copy number. The species carrying low-copy-number of ERTBV segments have been reported to be extremely susceptible to RTBV. The DNA methylation state of the ERTBV sequences was correlated with their copy number in the genome.
These ERTBV segments are unlikely to have functional potential as a virus. However, these sequences facilitate to establish putative virus that provided information underlying virus integration and evolutionary relationship with existing virus. Comparison of ERTBV among the Oryza AA-genome species allowed us to speculate a possible role of endogenous virus segments against its related disease.
The virus-like sequences that have been found in plant genomes are divided into two groups of plant viruses, single-stranded DNA geminivirus and double-stranded DNA pararetroviruses. The geminivirus segments, including the viral replication origin and the adjacent AL1 gene, have been found in the genomes of tobacco and its related species [1, 2]. Pararetrovirus-like sequences have been reported in the petunia [3, 4] banana [5–7] and tobacco genomes [8–10]. Compared to the intact virus sequences, most of the endogenous virus-like sequences were rearranged in the host genomes. Their rearranged structures suggested that illegitimate recombination may have occurred when putative virus progenitors integrated . The endogenous viruses for banana streak virus (BSV) , tobacco vein-clearing virus (TVCV)  and petunia vein clearing-virus (PVCV)  could be activated as episomal viruses under certain conditions in the host plant, and appeared to have pathogenic potential. The integrations of these viruses were shown to have been relatively recent events and the copy numbers of the endogenous virus sequences were found to be very low. On the other hand, for tobacco endogenous pararetroviruses (TEPRVs), it was estimated that there are about 1000 segments in the tobacco genome , but the intact virus has not been identified so far, suggesting that the integration of TEPRVs was not a recent event. The finding of such endogenous virus sequences raises questions concerning 1) the integration process giving rise to endogenous virus sequences, 2) possible differences in the evolutionary rate between the virus and endogenous virus and 3) resistance potential as a result of endogenous virus integration.
In the rice genome, pararetrovirus-like sequences that are similar to rice tungro bacilliform virus (RTBV) have also been found [12–14]. In South and Southeast Asia, rice tungro bacilliform virus, which is transmitted by green leafhoppers, causes one of the most serious diseases of rice with the assistance of rice tungro spherical virus (RTSV) . Kobayashi and Ikeda  reported that African rice species, Oryza glaberrima and O. barthii, showed much severer systemic necrosis compared to the other rice species present in South and Southeast Asia after inoculation of both RTBV and RTSV.
Here, we have characterized RTBV-like sequences in the Japonica (cv. Nipponbare) genome. These sequences, denoted endogenous RTBV-like sequences (ERTBVs), were highly rearranged and dispersed throughout the rice genome. Sequences of the putative viruses for ERTBV were reconstructed from the dispersed segment in the genome. Copy numbers of ERTBV segments are shown to vary among AA-genome Oryza species. Asian species have more ERTBV segments than the species originated from the other regions where RTBV is not distributed. The results obtained advance our understanding of the manner of integration of authentic pararetrovirus into the host genome, evolutionary implication of the integrated virus and possible involvement of endogenous virus segments in the corresponding disease resistance.
Identification of RTBV-like sequences in the rice genomes
Summary of the ERTBV structures in the rice genomes1.
ERTBV position (length)3
alignment of gene5
ORF y (MP/CP/PR)
ORF y (RT/RH)
Assembling ERTBV segments
No sequences exactly matching RTBV per se were found in the databases for Japonica cv. Nipponbare. We conducted Southern hybridization analysis to see whether RTBV-homologous sequences are present in 14 lines from Oryza AA-genome species (Table 1). The hybridization patterns probed with RTBV sequence resulted in faint and indistinct bands (data not shown).
Periods of integration of ERTBV segments into the rice genome
To investigate the periods of integration of ERTBV, we attempted to obtain ERTBV sequences from an Indica variety (cv. IR36) using the RTBV-like sequence near the waxy locus as a probe. We thereby isolated three clones carrying ERTBV-homologous sequences (Table 1). Indica clone, AB124591, was found to have the same ERTBV sequence as that found in Nipponbare accession AL606592 (Table 1), indicating that the ERTBV integration events occurred before the Japonica-Indica differentiation. The other two Indica clones do not correspond with any ERTBV segments from Nipponbare.
Except for the sequences examined here, we could not find any other RTBV-like sequences in database searches using RTBV nor each of ERTBVs sequences as queries. Therefore, a fourth cluster of the RTBV-like sequences is unlikely to be present. The integration of ERTBV or its derived segments after the differentiation of Japonica and Indica cultivars thus seems to have not occurred.
Distribution of ERTBVs in the Oryza AA-genome species
Methylation of ERTBV in Oryza AA-genome species
Completion of ERTBV via integration of the virus
Relationship between ERTBV and RTBV
Although the genomes of ERTBV and RTBV are structurally similar, particularly in the longest ORF, some regions are markedly different. Phylogenetically, both are apparently closely related viruses. One important question is whether or not ERTBV is a cognate of RTBV. ERTBVs existed in the Oryza AA-genome species before differentiation of Japonica and Indica. Considering this evolutionary period, two possible relationships of RTBV and ERTBV might be predicted: one is that the virus that was to become ERTBV was a direct progenitor of RTBV, and the other is that both were differentially branched from a common ancestral virus. Because the evolution of virus genomes is generally much faster than that of plant genomes, we cannot compare them equivalently. In fact, rapid evolution of RTBV was inferred from high level of genetic diversity of RTBV field isolates . Even in a single field in the Philippines, more than one RTBV isolate could be observed, and the genotype changes year by year. Comparison of the sequences of several different isolates revealed evidence for incidences of nucleotide substitution, insertion/deletion and recombination occurred during differentiation of RTBV isolates . Particularly, it was suggested that recombination had played a role in the evolution of RTBV . If ERTBV is a direct progenitor of RTBV, recombination events might have contributed as a driving force to establish the present RTBV form. The longest ORF (ORF 3) of RTBV is functionally essential  and is thought to have been conserved since ERTBVs were present as viruses, but the other less homologous segments might have gradually undergone substitution by recombination. If RTBV was derived from the virus that became ERTBV, estimation of the nucleotide substitution rate of the genes common to RTBV and ERTBV would allow us to compare the evolutionary rates of a plant genome and a plant virus. By such a comparison, we estimated the virus evolution ranged from 30 to140 times faster than that of the host genome. The faster evolution of a virus was thus substantiated for the first time in plants if the virus for ERTBV was in fact the progenitor of RTBV. The different evolutionary rates are dependent on the virus genes; that is, the slow evolution might reflect the functional conservation of the gene.
In the other case, in which RTBV and the virus for ERTBV are phylogenetically located on different branches, the progeny of the virus corresponding to ERTBV might have vanished or may be hidden in a small population. Even though RTBV is not in the case a direct progeny of the virus corresponding to ERTBV, their structural similarity and parallel distribution lead us to consider their common ancestral origin.
Correlation between rice tungro disease and ERTBV
Plant virus-like sequences have been found in several plant genomes such as banana, tobacco and petunia [11, 31]. So far, no association between virus disease and endogenous virus sequences has been reported . Kobayashi and Ikeda  reported that O. glaberrima and O. barthii showed severe systemic necrosis within 4 weeks after infection of RTBV and rice tungro spherical virus (RTSV). O. longistaminata, which possesses ERTBV fragments in its genome, showed disease symptoms like those of the Asian species, and some of the accessions were utilized for obtaining the resistance gene . Interestingly, this species originated from Africa where RTBV is not distributed. The fact led us to suppose that there is a correlation between presence or absence of ERTBV in the genome and the degree of RTBV susceptibility. The species, which have a low-copy-number of ERTBV tend to be vulnerable to the rice tungro disease caused by both RTBV and RTSV. The methylation of ERTBV appeared to be positively related in the copy number in the genome. Based on a study on the endogenous virus sequences (EPRV) in tobacco, Mette et al.  proposed a model in which methylation dependent on the copy number of endogenous virus sequences may induce episomal viral methylation through a homology-dependent process involving DNA-DNA or RNA-DNA interaction. The phenomenon observed here fits their model. The copy number of EPRV in the tobacco genome is 10 times as high as that of ERTBV in the rice genome. Our results demonstrated that about 50 copies of endogenous elements are sufficient to induce methylation in the genome. If we ever find the rice germ lines that have incorporated sequences more similar to those of RTBV, as well as ERTBV in germ lines, those would be exploited as valuable sources of stronger resistance against the rice tungro disease.
The rice genome contains more than 30 of RTBV-like sequence (ERTBVs) which were unlikely to have functional potential as a virus, while we were able to assemble putative virus forms from these sequences. The phylogenetic analysis showed that at least three times integrations of authentic ERTBVs occurred during Oryza speciation. ERTBV integrations likely occurred when the virus was in the replication process, and were preferentially targeted to AT-repeat sequences. The closely relationship between ERTBV and RTBV were proven by comparisons of the DNA and amino acid sequences. The Oryza AA-genome species originated from RTBV-distributed regions appeared to contain higher copy numbers of ERTBV segments. The methylation state of the ERTBV sequences was correlated with their copy number in the genome. The results obtained allowed us to speculate a possible relationship between RTBV disease resistance and the copy number and/or DNA methylation of ERTBV in the Oryza AA-genome species.
ERTBV sequences from Japonica
Sequences of ERTBV in Japonica (cv. Nipponbare) were mined with rice blast search queries  against the rice genome sequences that had been registered as of June 2003. Twenty-nine ERTBV sequences were found in the following Japonica genomic database (Table 1).
ERTBV sequences from Indica
We attempted to search for ERTBV sequences in the Indica (cv. 93–11) genome database, however, the homologous sequences found through the search had insufficient length for designation as ERTBV segments. Indica ERTBV sequences were isolated from the EMBL 3 genomic library constructed with IR36 strain (FL1041j), which was purchased from Clontech (Palo Alto, California). For screening, a 3.5-kb ERTBV fragment about 50-kb upstream of the waxy locus was used as probe . Three clones carrying an ERTBV-containing segment of more than 4 kb were selected. Nucleotide analysis was performed with using a d-Rhodamine Terminator Cycle Sequencing Ready Reaction-Sequencing Kit (Applied Biosystems) and an ABI377 Automated DNA Sequencer (Applied Biosystems).
Construction of phylogenetic trees
The nucleotide sequences and amino acids sequences inferred from the reverse transcriptase (RT) gene were aligned using the CLUSTAL W program  from the DNA Data Bank Japan (DDBJ). We calculated the pairwise nucleotide divergence (K) between 30 independent ERTBV sequences (including RTBV) based on Kimura's two-parameter method  without taking synonymous and nonsynonymous changes into account. We constructed a neighbor-joining (NJ) tree based on these estimates . The tree was drawn using PAUP*4.0 . The consensus maximum parsimony and NJ trees based on amino acid sequences for RT in 14 viruses including ERTBV-A, -B and -C were calculated using programs from the PHYLIP package . The minimum evolution tree was calculated by the implementation of PAUP software in the GCG package.
Plant materials used.
Cultivar or accession
Japonica from Japan
Near-isogenic line of Taichung 65 with wx from Kinoshitamochi (BC12)
Javanica type from Indonesia
Indica type from India
Annual type from India
Perennial type from India
Perennial type from China (through IRRI)
Perennial type from China (through IRRI)
The sequences containing Indica ERTBV have been deposited in DDBJ: accession nos. AB124591, AB124592 and AB124593. The reconstructed sequences for the authentic ERTBV viruses, ERTBV-A, -B and -C were deposited in DDBJ: accession nos. BR000029, BR000030 and BR000031, respectively.
We thank Dr. IL Ryong Choi (International Rice Research Institute, Philippines) Dr. Kaien Fujino (Hokkaido Univ.), and Ms. Kumi Saito (Hokkaido Univ.) for valuable comments. We are also grateful to Prof. Testo Mikami (Hokkaido Univ.) for providing laboratory facilities. This work was supported by Grants-in-aid for scientific research on priority area (A) Ministry of Education, Culture, Sports, Science and Technology.
- Bejarano ER, Khashoggi A, Witty M, Lichtenstein C: Integration of multiple repeats of geminiviral DNA into the nuclear genome of tobacco during evolution. Proc Natl Acad Sci U S A. 1996, 93: 759-764. 10.1073/pnas.93.2.759.PubMed CentralView ArticlePubMedGoogle Scholar
- Ashby MK, Warry A, Bejarano ER, Khashoggi A, Burrell M, Lichtenstein CP: Analysis of multiple copies of geminiviral DNA in the genome of four closely related Nicotiana species suggest a unique integration event. Plant Molecular Biology. 1997, 35: 313-321. 10.1023/A:1005885200550.View ArticlePubMedGoogle Scholar
- Richert-Poggeler KR, Shepherd RJ: Petunia vein-clearing virus: A plant pararetrovirus with the core sequences for an integrase function. Virology. 1997, 236: 137-146. 10.1006/viro.1997.8712.View ArticlePubMedGoogle Scholar
- Richert-Poggeler KR, Noreen F, Schwarzacher T, Harper G, Hohn T: Induction of infectious petunia vein clearing (pararetro) virus from endogenous provirus in petunia. Embo Journal. 2003, 22: 4836-4845. 10.1093/emboj/cdg443.PubMed CentralView ArticlePubMedGoogle Scholar
- Harper G, Hull R: Cloning and sequence analysis of banana streak virus DNA. Virus Genes. 1998, 17: 271-278. 10.1023/A:1008021921849.View ArticlePubMedGoogle Scholar
- Ndowora T, Dahal G, LaFleur D, Harper G, Hull R, Olszewski NE, Lockhart B: Evidence that badnavirus infection in Musa can originate from integrated pararetroviral sequences. Virology. 1999, 255: 214-220. 10.1006/viro.1998.9582.View ArticlePubMedGoogle Scholar
- Harper G, Osuji JO, Heslop-Harrison JS, Hull R: Integration of banana streak badnavirus into the Musa genome: Molecular and cytogenetic evidence. Virology. 1999, 255: 207-213. 10.1006/viro.1998.9581.View ArticlePubMedGoogle Scholar
- Jakowitsch J, Mette MF, van der Winden J, Matzke MA, Matzke AJM: Integrated pararetroviral sequences define a unique class of dispersed repetitive DNA in plants. P Natl Acad Sci U S A. 1999, 96: 13241-13246. 10.1073/pnas.96.23.13241.View ArticleGoogle Scholar
- Lockhart BE, Menke J, Dahal G, Olszewski NE: Characterization and genomic analysis of tobacco vein clearing virus, a plant pararetrovirus that is transmitted vertically and related to sequences integrated in the host genome. J Gen Virol. 2000, 81: 1579-1585.View ArticlePubMedGoogle Scholar
- Gregor W, Mette MF, Staginnus C, Matzke MA, Matzke AJ: A distinct endogenous pararetrovirus family in Nicotiana tomentosiformis, a diploid progenitor of polyploid tobacco. Plant Physiol. 2004, 134: 1191-1199. 10.1104/pp.103.031112.PubMed CentralView ArticlePubMedGoogle Scholar
- Harper G, Hull R, Lockhart B, Olszewski N: Viral sequences integrated into plant genomes. Annual Review of Phytopathology. 2002, 40: 119-136. 10.1146/annurev.phyto.40.120301.105642.View ArticlePubMedGoogle Scholar
- Nagano H, Kawasaki S, Kishima Y, Sano Y: Structural differences in the vicinity of the waxy locus among the Oryza species with the AA-genome: identification of variable regions. Theoretical and Applied Genetics. 2000, 100: 376-383. 10.1007/s001220050049.View ArticleGoogle Scholar
- Mao L, Wood TC, Yu YS, Budiman MA, Tomkins J, Woo SS, Sasinowski M, Presting G, Frisch D, Goff S, Dean RA, Wing RA: Rice transposable elements: A survey of 73,000 sequence-tagged-connectors. Genome Research. 2000, 10: 982-990. 10.1101/gr.10.7.982.PubMed CentralView ArticlePubMedGoogle Scholar
- Nagano H, Kunii M, Azuma T, Kishima Y, Sano Y: Characterization of the repetitive sequences in a 200-kb region around the rice waxy locus: diversity of transposable elements and presence of veiled repetitive sequences. Genes & Genetic Systems. 2002, 77: 69-79. 10.1266/ggs.77.69.View ArticleGoogle Scholar
- Hibino H,, Saleh N,, Roechan M,: Transmission of two kinds of rice tungro-associated viruses by insect vectors. Phytopathology. 1979, 69: 1266-1268.View ArticleGoogle Scholar
- Kobayashi N,, Ikeda R,: Necrosis caused by rice tungro viruses in Oryza glaberrima and O. barthii. Japanese Journal of Breeding. 1992, 42: 885-890.View ArticleGoogle Scholar
- Nagano H,, Oka A,, Kishima Y,, Sano Y,: DNA sequences homologous to rice tungro bacilliform virus (RTBV) present in the rice genome. Rice Genetic Newsletter. 2000, 17: 103-105.Google Scholar
- Hull R: Molecular biology of rice tungro viruses. Annual Review of Phytopathology. 1996, 34: 275-297. 10.1146/annurev.phyto.34.1.275.View ArticlePubMedGoogle Scholar
- Herzog E, Guerra-Peraza O, Hohn T: The Rice tungro bacilliform virus gene II product interacts with the coat protein domain of the viral gene III polyprotein. J Virol. 2000, 74: 2073-2083. 10.1128/JVI.74.5.2073-2083.2000.PubMed CentralView ArticlePubMedGoogle Scholar
- Felsenstein J,: Phylip Phylogenetic Inference Package. Edited by: University of Washington. 1995, SeattleGoogle Scholar
- Chu YE,, Oka HI,: Introgression across isolating barriers in wild and cultivated Oryza species. Evolution. 1970, 24: 344-355.View ArticleGoogle Scholar
- Kishima Y, Ogura K, Mizukami K, Mikami T, Adachi T: Chloroplast DNA Analysis in Buckwheat Species - Phylogenetic-Relationships, Origin of the Reproductive Systems and Extended Inverted Repeats. Plant Science. 1995, 108: 173-179. 10.1016/0168-9452(95)04130-M.View ArticleGoogle Scholar
- Sherker AH, Marion PL: Hepadnaviruses and Hepatocellular-Carcinoma. Annual Review of Microbiology. 1991, 45: 475-508.View ArticlePubMedGoogle Scholar
- Sawasaki T, Takahashi M, Goshima N, Morikawa H: Structures of transgene loci in transgenic Arabidopsis plants obtained by particle bombardment: Junction regions can bind to nuclear matrices. Gene. 1998, 218: 27-35. 10.1016/S0378-1119(98)00388-6.View ArticlePubMedGoogle Scholar
- Liebich I, Bode J, Reuter I, Wingender E: Evaluation of sequence motifs found in scaffold/matrix-attached regions ( S/MARs). Nucleic Acids Research. 2002, 30: 3433-3442. 10.1093/nar/gkf446.PubMed CentralView ArticlePubMedGoogle Scholar
- Tikhonov AP, Lavie L, Tatout C, Bennetzen JL, Avramova Z, Deragon JM: Target sites for SINE integration in Brassica genomes display nuclear matrix binding activity. Chromosome Research. 2001, 9: 325-337. 10.1023/A:1016650830798.View ArticlePubMedGoogle Scholar
- Arboleda M,, Azzam O,: Inter- and intra-site genetic diversity of natural field populations of rice tungro bacilliform virus in the Philippines. Archives of Virology. 2000, 145: 275-289. 10.1007/s007050050023.View ArticlePubMedGoogle Scholar
- Cabauatan PQ, Melcher U, Ishikawa K, Omura T, Hibino H, Koganezawa H, Azzam O: Sequence changes in six variants of rice tungro bacilliform virus and their phylogenetic relationships. J Gen Virol. 1999, 80 (pt 8): 2229-2237.View ArticlePubMedGoogle Scholar
- Cabunagan RC,, Angeles ER,, Villareal S,, co-authors 16: Multilocation evaluation of advanced breeding lines for resistance to rice tungro viruses. Rice Tungro Disease Management. Edited by: Chancellor T C B, Azzam O and Heong K L. 1999, Philippines, IRRI, 45-57.Google Scholar
- Laco GS, Beachy RN: Rice Tungro Bacilliform Virus Encodes Reverse-Transcriptase, DNA-Polymerase, and Ribonuclease-H Activities. Proc Natl Acad Sci U S A. 1994, 91: 2654-2658.PubMed CentralView ArticlePubMedGoogle Scholar
- Hull R, Harper G, Lockhart B: Viral sequences integrated into plant genomes. Trends in Plant Science. 2000, 5: 362-365. 10.1016/S1360-1385(00)01723-4.View ArticlePubMedGoogle Scholar
- Covey SN, Al-Kaff NS: Plant DNA viruses and gene silencing. Plant Molecular Biology. 2000, 43: 307-322. 10.1023/A:1006408101473.View ArticlePubMedGoogle Scholar
- Mette MF, Kanno T, Aufsatz W, Jakowitsch J, van der Winden J, Matzke MA, Matzke AJM: Endogenous viral sequences and their potential contribution to heritable virus resistance in plants. Embo Journal. 2002, 21: 461-469. 10.1093/emboj/21.3.461.PubMed CentralView ArticlePubMedGoogle Scholar
- RiceBLAST: Rice sequence database BLAST search. [http://riceblast.dna.affrc.go.jp/]
- Thompson JD, Higgins DG, Gibson TJ: Clustal-W - Improving the Sensitivity of Progressive Multiple Sequence Alignment through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice. Nucleic Acids Research. 1994, 22: 4673-4680.PubMed CentralView ArticlePubMedGoogle Scholar
- Kimura M,: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. Journal of Molecular Evolution. 1980, 16: 111-120.View ArticlePubMedGoogle Scholar
- Saitou N, Nei M: The Neighbor-Joining Method - a New Method for Reconstructing Phylogenetic Trees. Mol Biol Evol. 1987, 4: 406-425.PubMedGoogle Scholar
- Swofford DL,: PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Method). Edited by: Sinauer Association. 1998, Sunderland, Massachusetts, Version 4Google Scholar
This article is published under license to BioMed Central Ltd. This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.