- Research article
- Open Access
Phylogenetic and recombination analysis of the herpesvirus genus varicellovirus
BMC Genomicsvolume 18, Article number: 887 (2017)
The varicelloviruses comprise a genus within the alphaherpesvirus subfamily, and infect both humans and other mammals. Recently, next-generation sequencing has been used to generate genomic sequences of several members of the Varicellovirus genus. Here, currently available varicellovirus genomic sequences were used for phylogenetic, recombination, and genetic distance analysis.
A phylogenetic network including genomic sequences of individual species, was generated and suggested a potential restriction between the ungulate and non-ungulate viruses. Intraspecies genetic distances were higher in the ungulate viruses (pseudorabies virus (SuHV-1) 1.65%, bovine herpes virus type 1 (BHV-1) 0.81%, equine herpes virus type 1 (EHV-1) 0.79%, equine herpes virus type 4 (EHV-4) 0.16%) than non-ungulate viruses (feline herpes virus type 1 (FHV-1) 0.0089%, canine herpes virus type 1 (CHV-1) 0.005%, varicella-zoster virus (VZV) 0.136%). The G + C content of the ungulate viruses was also higher (SuHV-1 73.6%, BHV-1 72.6%, EHV-1 56.6%, EHV-4 50.5%) compared to the non-ungulate viruses (FHV-1 45.8%, CHV-1 31.6%, VZV 45.8%), which suggests a possible link between G + C content and intraspecies genetic diversity. Varicellovirus clade nomenclature is variable across different species, and we propose a standardization based on genomic genetic distance. A recent study reported no recombination between sequenced FHV-1 strains, however in the present study, both splitstree, bootscan, and PHI analysis indicated recombination. We also found that the recently sequenced Brazilian CHV-1 strain BTU-1 may contain a genetic signal in the UL50 gene from an unknown varicellovirus.
Together, the data contribute to a greater understanding of varicellovirus genomics, and we also suggest a new clade nomenclature scheme based on genetic distances.
The Varicellovirus genus is part of the larger alphaherpesvirus subfamily which includes herpes simplex viruses 1 and 2, as well as Marek’s disease virus. Like other alphaherpesviruses, varicelloviruses typically infect epithelial surfaces, and most appear to be neurotropic, establishing latency in neurons [1,2,3,4,5,6,7,8,9]. The first varicellovirus to be clinically described as a unique disease was varicella zoster virus (VZV), the causative agent of chickenpox and shingles in 1767 . Numerous other varicelloviruses have been identified, including pseudorabies virus/SuHV-1 (Aujeszky’s disease) in pigs, BHV-1 (bovine herpes virus type 1; infectious bovine rhinotracheitis; IBR), EHV-1 (equine herpes virus type 1; epidemic abortion and myeloencephalopathy in horses), EHV-4 (equine herpes virus type 4; equine rhinopneumonitis), FHV-1 (feline herpes virus type 1; feline rhinotracheitis), and CHV-1 (canine herpes virus type 1; fading puppy syndrome). These viruses have significant impact on livestock and companion animals. Due to high transmissibility and virulence, pseudorabies virus and EHV-1 are both classified by the United States Department of Agriculture (USDA) as reportable diseases . Vaccines have been developed against several varicelloviruses, including VZV, pseudorabies virus, BHV-1, EHV-1, EHV-4, FHV-1, CHV-1, which have been effective at reducing morbidity and mortality [12,13,14,15,16,17,18,19,20,21,22]. Pseudorabies vaccination and eradication efforts in the United States have been effective, with the country declared disease free in 2004. The USA and Canada have also enacted BHV-1 control programs, and several European countries have successfully eradicated the disease . Despite vaccination and control efforts, many of these diseases continue to negatively affect humans and animals worldwide.
The first complete sequence of a varicellovirus, (VZV) was reported in 1986 , followed by several others [25,26,27] . The advent of next-generation sequencing (NGS) has revolutionized genomics, and has allowed additional varicellovirus species and sub-strains to be sequenced. The first comprehensive computational multigene phylogenetic analysis of the three main herpes virus subfamilies was a major step forward in cementing the basic structure for Alphherpesvirinae, including varicelloviruses . As increasing numbers of viral strains have been sequenced, full genome phylogenetic and recombination analysis of VZV, SuHV-1, EHV-1, and EHV-4 have been performed [29,30,31,32].
The genetic code is degenerate, resulting in most amino acids being encoded by multiple codons. The usage of some codons and not others for an amino acid is often not random, and is called codon usage bias . Codon usage and mutational bias analysis has been examined in several viruses, including phages, canine parvovirus, Japanese encephalitis virus, rabies, Zika virus, herpesviruses, and other vertebrate DNA viruses [34,35,36,37,38,39,40]. Shackelton et al. , showed that codon usage bias is strongly linked with genomic G + C content. The previous analysis of codon usage in herpesviruses  found strong codon bias in the SuHV-1 and BHV-5 viruses, both high G + C viruses. While the earlier herpesvirus study  included several varicelloviruses, additional viruses have now been sequenced, and a more inclusive analysis is now possible.
The goal of the current study was to perform a genome based comprehensive phylogenetic, genetic distance, and recombination analysis of the varicellovirus subfamily. Unique findings reported here are a phylogenetic stricture between ungulate herpesviruses and the remaining species, a possible link between genomic G + C content and intraspecies distance, the identification of recombination amongst FHV-1 strains, and results suggesting that the Brazilian CHV-1 strain BTU-1 may be a recombinant between CHV-1 and an unknown varicellovirus. We also propose a Varicellovirus genus clade nomenclature standardization based on genetic distance.
Genomic multiple sequence alignments
For phylogenetic and distance analysis, currently available Varicellovirus genus genomic sequences were downloaded from NCBI, and are cataloged in Additional file 1: Table S1. The first generated alignment was of the Varicellovirus genus as a whole, using one strain from each of the viral species, as well as an outgroup, anatid herpes virus type 1 (AnHV-1). AnHV-1 was chosen as an outgroup because the AnHV-1 is an alphaherpesvirus, and the genome is annotated in a similar fashion to the varicelloviruses. The varicelloviruses and the AnHV-1 outgroup virus have similar genome annotation and gene synteny, with the exception of pseudorabies virus. In pseudorabies virus, the UL27 to UL44 genes are inverted. Prior to genome alignment, the UL27-UL44 genome segment of pseudorabies virus was inverted by reverse complemention in order to generate a gene order similar to the other varicellovirueses. MAFFT v2.66 [41, 42] was utilized to generate the alignment using the FFT-NS-1 method. The subsequent genomic multiple sequence alignment was manually inspected for quality. Areas of the alignment that appeared to be of poor quality were realigned in Mega 6  using ClustalW.
Additional intraspecies genomic alignments of BHV-1, CHV-1, EHV-1, EHV-4, FHV-1, SuHV-1, and VZV, were generated with and without outgroups using MAFFT v2.66. The outgroups for the intraspecies alignments were chosen based on low genetic distance, in other words, the closest known relative. Thus, for EHV-1 analysis, EHV-8 was chosen as the outgroup, with the remaining analyzed species/outgroup combinations being; BHV-1/outgroup BHV-5, EHV-4/outgroup EHV-1, SuHV-1/outgroup BHV-1, FHV-1/outgroup CHV-1, CHV-1/outgroup FHV-1, and VZV/outgroup CeHV-9. All of the genomic alignments generated in thus study are available for download at http://sites.ophth.wisc.edu/brandt/.
Genetic distance and genomic G + C content calculations
The mean maximum likelihood (ML) distances for each alignment were calculated using the Mega 6 package. For the genetic distance analysis, pairwise gap deletion rather than complete deletion of gaps was used, as complete deletion of alignment gaps may exclude valuable phylogenetic data, and could result in an underestimation of distance. To calculate overall genome G + C content, an online calculator found at http://www.endmemo.com/bio/gc.php was used.
Phylogenetic and recombination analysis
For phylogenetic maximum likelihood and network analysis, intraspecies genomic alignments of BHV-1, CHV-1, EHV-1, EHV-4, FHV-1, SuHV-1, VZV, and total sequenced varicelloviruses were generated using duck enteritis virus (AnHV-1) as the outgroup as described above. Maximum likelihood phylogenetic analysis was performed on the genomic alignments containing an outgroup using the RAxMLGUI package  with the GTRCAT + I model and 500 bootstrap replicates.
Phylogenetic networks for BHV-1, CHV-1, EHV-1, EHV-4, FHV-1, SuHV-1, VZV, and total sequenced varicelloviruses were generated with Splitstree 4  using multiple sequence alignments and AnHV-1 as the outgroup. Splitstree was also used to calculate the pairwise homoplasy index (PHI) statistical test  for recombination. Jmodeltest2  was used to identify the optimal substitution model settings for each individual phylogenetic network. RDP4  recombination analysis performed genomic multiple sequence alignments without outgroups using the Jin and Nei substitution model with the following parameters: 1500 bp window, 750 bp step size, and 200 bootstrap replicates.
Determining shared clade cut-off values between BHV-1, EHV-1, and SuHV-1
Shared clade Cut-off values between BHV-1, EHV-1, and SuHV-1 were determined by first generating a histogram of pairwise p-distances and corresponding frequencies for each virus species, similar to the study performed by Grau-Roma et el with porcine circovirus type 2 (PCV2) .The pairwise p-distances for each species were calculated using Mega 6, and multiple sequence alignments without outgroups. Histograms of frequency vs p-distance for BHV-1, EHV-1, and SuHV-1, and subsequent data were generated using R (version 3.4.2 using the ggplot2 package). An initial shared cut-off of 0.01 was established by examining the upper and lower bounds of the two main groups in the three histograms. This intial cut-off was further evaluated, using a variance analysis framework, where variance between and within groups was examined. For each potential cut-off value, we calculated the following quantities for the p-distances for each virus:
For each potential cut-off value, we calculate the following quantities:
SS between = ∑f ij ·(group mean j − overall mean),2
SS within = ∑f ij ·(p-distance i − overall),2
group mean j = mean p-distance in groupj, j = 1, 2,
overall mean = overall mean p-distance,
p-distance i = the i th p-distance,
f ij = the frequency of the i th p-distance in the j th group.
Finally, we calculate ratio
where N j is the total number of observations in group j, or in terms of frequencies, N j = ∑ i f ij . We next wanted to determine the cut-off which maximized the quantity across the groups, by first plotting the F values for each of the three graphs (Figure S1). We next restricted the cut-off values where the p-values for the different viruses was divided into two groups. For example, this means that values larger than 0.012 were discarded as no such distances were found in the BHV-1 virus group. The values within each graph were rescaled 0 to 1 in order to make each virus of equal value, and the sum of the curves maximized. To examine how the F measure corresponded to the frequency distributions, the value of rescaled F was overlayed. The point at which the sum of the rescaled F values attains it’s maximum, was chosen as the cut-off value.
Codon usage analysis
To investigate codon usage in the Varicellovirus genus as a whole, the effective number of codons (ENC) was calculated for the US1 (ICP22), UL30 DNA polymerase, and glycoprotein H genes from each varicellovirus species. These genes represent one member of the α (immediate-early), β (early), and γ (late) gene classes. The ENC value is a measure of how much the codon usage of a gene deviates from the equal usage of synonymous codons . ENC values range from 20 to 61, with 20 indicating maximum bias, with one codon used from each synonymous codon group, to 61 indicating no codon usage bias. The ENC values for the varicellovirus US1, UL30, and gH genes were calculated using DnaSP (v5) . In addition to the ENC values, GC3s values were calculated using DnaSP, while the GC1 and GC2 values were obtained using in the online calculator http://genomes.urv.es/CAIcal/. The GC12 values were calculated using Microsoft Excel. ENC-GC3s plots were generated using SigmaPlot v.11 . In the ENC-GC3s plots, if the plotted values are located on or near the standard curve, then codon usage is constrained only by G + C mutation bias. However, the greater the plotted values deviate from the standard curve, the more additional factors such as natural selection may influence the bias.
Neutral evolution plots (GC12s vs GC3s) were generated to examine the contribution of mutational pressure and natural selection. Sigmaplot v.11 was used to generate the plots, as well as for the linear regression statistical analysis.
Results and discussion
The nomenclature designation for varicellovirus species strains and intraspecies clades is somewhat variable, with some, such as bovine herpesvirus 1 strains given a BHV-1.1 or 1.2 designation , while VZV clades are given a simple numeral designations (1–6) . As such, a clade nomenclature standardization across varicelloviruses may be useful, and we are proposing a common nomenclature system based on genetic distances. The International Committee on Taxonomy of Viruses (ICTV) does not provide guidelines in defining taxonomic clades below species level , and it is within the purview of interested groups to do so. Genetic distances have been previously used as a basis for nomenclature systems in H5N1 avian influenza  and porcine circovirus type 2 [55, 56]. The genetic distance based nomenclature system we are proposing would preserve classic BHV-1 1.1 and 1.2 clade designations, as well the varicella zoster virus (VZV) numerical designations. Within each species, clade/group distances greater than 1% would be designated by a 1.1, 1.2, ... numbering as seen in BHV-1 . Between clade/group distances of less than 1% would result in a numerical format (i.e. 1, 2, 3…) as has been consistently used with VZV . Under this system, it would be possible to have two distant clades given a 1.x designation, with less distant subclades designated numerically.
The 1% cut-off was determined by calculating a shared value for the BHV-1, EHV-1, and SuHV-1 viruses. BHV-1, EHV-1, and SuHV-1 were chosen, as these three viruses have the highest levels of intraspecies genetic distance of the varicelloviruses (detailed in the sections below). An initial cut-off of 0.01 was chosen, based on a shared p-distance value that divides the observed p-distances into two groups simultaneously for all three viruses (Fig. 1a, b, and c). Additional evaluation of the initial cut-off was performed using a variance analysis framework, where variation between groups and within groups was examined. Figure 1e, f, and g show the individual rescaled F curves (gray dotted line) for each of the three viruses, as well as the sum of the curves in black. The F values were rescaled so as not to weight one virus more than the rest so the curves do not directly overlap. The sum of the rescaled F curves shows a peak at 0.01, which validates the initial cut-off. The values of the unscaled, and rescaled F values are shown in Additional file 1: Tables S2, and S3 respectively.
Phylogenetic network analysis of varicelloviruses
To investigate phylogenetic relationships between the sequenced varicelloviruses, the genomes of each species, along with the AnHV-1 outgroup genome were aligned. Both a maximum likelihood (ML) based tree (Fig. 2a) and phylogenetic network (Fig. 2b) were constructed. The ML whole genome based tree showed that CeHV-9 and VZV occupy a basal position in the genus, and the ungulate viruses share a node with bootstrap support of 100% (Fig. 2a). This result is fairly unremarkable and is similar to analysis performed using smaller sets of genes . To assess the phylogenetic dissonance in the dataset, a phylogenetic network was also generated, and shows a similar basic topology with the ML tree. The network however suggests a stricture between the ungulates and non-ungulates, and is denoted by a pink circle (Fig. 2b). The stricture could be the result of low amounts of recombination between the two sides of the network, however, it may represent a bottleneck, or may be simply due to divergent phylogenetic signals. To determine if there was recombination within the network, the PHI statistical test for recombination was performed. The PHI test indicated statistically significant signals amongst the ungulate virus portion of the network, as well the non-ungulate portion, however, analysis of the network as a whole (minus outgroup) was not significant (Table 1). This lack of a significant result is likely due to the high amount of genetic distance within the dataset. The genomic distances between virus species are also shown in Fig. 2b to aid in data interpretation.
Varicellovirus G + C content and interstrain genetic distance
The G + C content of each of the varicellovirus species was analyzed, with results shown in both Fig. 2b, and Table 2. All of the ungulate viruses had a G + C content above 50%, ranging from 50.5% in EHV-4 to 74.8% in BHV-5. The primate and carnivore viruses had G+C contents under 50%, and ranged from 31.6% in CHV-1 to 45.8% in both VZV and FHV-1. For each varicellovirus, where multiple strains have been sequenced, the overall intraspecies genetic distance for each species was calculated (Fig. 1b and Table 1). The results suggest a possible link between G + C content and intraspecies genetic distance with higher G + C content and genetic distance in the ungulate viruses (SuHV-1 = 1.65%, BHV-1 = 0.81%, EHV-1 = 0.79%, and EHV-4 = 0.16%), and lower values in the carnivore and VZV viruses (VZV = 0.136%, CHV-1 = 0.0056/0.020%, and FHV-1 = 0.0089%). It should be noted that for CHV-1, two distance values are given, and this is discussed below.
It is unclear if the higher genomic G + C content of the ungulate viruses is the result of genetic drift or evolutionary pressure. The observation that G + C content in varicelloviruses may be linked to intraspecies genetic distance may not be unprecedented, as G + C content appears to correlate with substitution rates in Arabadopsis . It is highly unlikely that G + C content is the main driver of varicellovirus intraspecies genetic distance, however, it may be possible that G + C content is able to influence distance. Additional factors may influence intraspecies genetic variability in varicelloviruses, such as the propensity of the host to form large herds, transmissibility, and the number reactivation events in the life of the host. We also cannot eliminate the possibility that the genomic G + C content and intraspecies distance link is an artifact due to small sample size.
Codon usage and mutational bias in the Varicellovirus genus
The observation of varying G + C content across the Varicellovirus genus lead us to investigate codon usage and mutational bias. Codon usage and mutational bias has been previously examined in other viruses such as canine parvovirus , Japanese encephalitis virus , and Zika virus . For the present analysis, the effective codon number values of three genes, US1 (α), UL30 polymerase (β), and UL22 (glycoprotein H; γ) were calculated for each of the varicellovirus species (Table 3). These three genes were chosen to be representative of each kinetic class; α (immediate-early), β (early), and γ (late). ENC values range from 20 to 61, with 61 indicating no bias and 20 indicating extreme bias. The values show greater bias in all three genes from viruses that are either A-T or G-C rich (Table 3), for example CHV-1, SuHV-1, BHV-1, BHV-5, and BuHV-1. Next, ENC values in the context of mutational pressure were assessed by plotting the ENC values against the G + C content in the synonymous third codon position (GC3s), found in Fig. 3a, b, and c. The ENC plots for US1, UL30, and UL22 show that these three Varicellovirus genus genes are largely constrained by G + C mutation bias, as most data points are located close to the standard curve. Some of the data points are located farther away from the plot, such as SuHV-1 US1 (Fig. 3a), and EHV-4 (Fig. 3b), which suggest additional pressures influencing bias, possibly including natural selection.
Neutrality plots (GC12 vs GC3s) for US1, UL30, and UL22 were generated to further examine mutation and natural selection biases (Fig. 3c, d, and e). The neutrality plots showed significant results for US1 (r 2 = 0.827; p = < 0.001), UL30 (r 2 = 0.949; p = < 0.001), and UL22 (r 2 = 0.937; p = < 0.001), which confirmed mutational bias. The slopes for all three of the neutrality plots were shallow (US1 = 0.1137, UL30 = 0.0933, and UL22 = 0.1092), which indicated that mutation bias influenced codon usage only 11.37%, 9.33%, and 10.92% for each of these genes, respectively. ENC and neutrality plots appear to result in somewhat different conclusions in investigating codon usage and mutational pressure. Care should be taken in interpretation, as the analysis is genus wide, and not in a single species. G+C constrained mutation bias in varicelloviruses was confirmed as has been previously shown in vertebrate DNA viruses , however additional factors such as natural selection are likely to play major role as well.
Bovine herpesvirus 1
BHV-1 has been traditionally divided into three subtypes; BHV-1.1, BHV-1.2a, and BHV-1.2b, with the 1.1 strains generally associated with IBR, and 1.2 with venereal disease phenotypes [59, 60]. It must be noted that strains of either type can cause respiratory and venereal disease phenotypes [61, 62], which suggests that genetic criteria may be a more reliable way to group BHV-1 strains than by clinical phenotype. To examine BHV-1 phylogeny, recombination, and genetic distance between clades, maximum likelihood trees and phylogenetic networks were generated, recombination bootscan analysis was conducted, and inter- as well as intra-clade distances were calculated. The ML tree and the phylogenetic network both recover two main clades (Fig. 4). Genetic distance analysis showed that the distance between the two groups was 1.12%, fulfilling the criteria for designating the clades 1.1 and 1.2. Thus, the BHV-1 clades retain the 1.x organization they had previously under our proposed nomenclature criteria. The genomic sequence analysis of BHV-1 recovered two main clades designated 1.1 and 1.2, however subclades within 1.2 were not detected. It is possible that as additional BHV-1 strains are sequenced, evidence of 1.2. subclades may become apparent. The overall genetic distance within BHV-1.1 was 0.60%, and within BHV-1.2 was 0.43% (Fig. 4b). Bootscan recombination analysis using BHV-1.2 strain B589 revealed extensive recombination from the remaining BHV-1.2 strains, but no significant recombinant signals from the BHV-1.1 viruses were detected (Fig. 4c). PHI recombination test analysis suggests that there is recombination in the dataset (p = <0.001; Table 1).
The phylogenetic structure of SuHV-1 strains was examined next. The genome based ML tree and phylogenetic network both recovered two basic clades, Chinese and European/American, which have been previously identified  (Fig. 5a and b). The phylogenetic network however, showed that the Italian domestic dog isolated strain ADV32751 was separated somewhat from the remaining European/American strains (Fig. 5b). Distance analysis showed that the genetic distance between the main Chinese and European/American clades was 2.76% (Fig. 5b). This comparatively large genetic distance between the two groups appears to be consistent with what is thought to be two independent domestication events, one in China , and another in modern day Turkey roughly 9000 years before present . Additional calculations showed that the distance between strain ADV32751 and the remaining European/American strains was 1.44% (Fig. 5b). Based on the results of the distance calculations, we suggest that pseudorabies virus be designated as SuHV-1.1 (Chinese), SuHV-1.2 (main European/American), and provisional SuHV-1.3 (strain ADV32751) (Fig. 5b). It is unclear if the ADV32751 strain contains mutations which could have enhanced transmission to a domestic dog. Additionally, given the distance value with respect to the SuHV-1.2 viruses, the chance of genetic contributions from a European wild boar strain should not be excluded. Within the European/American SuHV-1.2 clade, two additional groupings were detected, and designated 1 and 2 based on the genetic distance (0.37%; Fig. 5b). A bootscan using SuHV-1.2 strain NIA3 against the remaining strains showed little to no recombination signals from either SuHV-1.1 or 1.3 (Fig. 5c). The lack of recombination signals between the SuHV-1.2 and 1.1 subclades is not unexpected due to geographic distances, however a recent report showed that the Chinese pseudorabies virus strain SC contained genomic contributions from the vaccine strain Bartha . The PHI recombination test of all the SuHV-1 strains showed (Table 1) statistically significant recombination within the dataset (p = <0.001).
The ML tree (Fig. 6a) shows a split between the wild and domestic horse derived EHV-1. The phylogenetic network also confirms this split (Fig. 6b). Genetic distance calculations resulted in 2.92% distance between the wild and domestic EHV-1 clades, and we suggest designating these EHV-1.1 (wild equine) and EHV-1.2 (domestic horse). The distance within the wild horse EHV-1.1 clade was higher than EHV-1.2, at 0.61% vs. 0.18% respectively. An expansion of the EHV-1.2 clade is shown in Fig. 5c, and shows three provisional clades, with clades 1 and 2 being 0.135% distant, and clades 2 and 3 0.137% distant. Two strains, NY03 and 5586, are separated from the remaining EVH-1.2 viruses and may represent a separate clade, however additional strains need to be islolated before this can be determined. EHV-1.2 strains NMKT04 and V592 occupy a position between clades 1 and 2 may be interclade recombinants. It is notable that EHV-1.2 strains do not appear to correlate to geographic origin, and may reflect the cosmopolitan nature of common breeds such as the Thoroughbred. Bootstrap recombination analysis scanning EHV-1.2 group 3 strain Va02 against the remaining strains showed no recombination from EHV-1.1 stains (Fig. 6c). When the EHV-1 strains were examined using the PHI recombination test (Table 1), statistically significant recombination was detected (p = <0.001). Wild equine derived EHV-1 strains cause severe infections, often neurological in both equine and non-equine captive animals [66,67,68,69,70], and some domestic horse EHV-1.2 strains can also cause myeloencephalopathy [71, 72]. A SNP (D752) within the polymerase gene of domestic horse viruses has been shown to influence the neurological disease phenotype, and is shared among wild equine strains [73, 74]. The genomic phylogenetic analysis (Fig. 5) of the domestic horse (EHV-1.2) strains did not sort the strains based on disease phenotype, i.e. neuropathology vs abortion (Fig. 6c). This finding along with data showing that non-D752 strains  can cause encephalitis strongly suggests multiple genes contribute to EHV-1 disease phenotypes, as has been observed in HSV-1 [76,77,78,79].
Equine herpesvirus type 4 is an important equine disease, and causes rhinopneumonitis most commonly in foals , however it is not a reportable disease as is EHV-1. The phylogenetic ML tree (Fig. 7a) and phylogenetic network (Fig. 7b) of the available EHV-4 showed a split into two main groups (clade 1 and 2) as has been shown previously . The genetic distance between the two clades was 0.23%. Both the ML tree and phylogenetic network showed that the EHV-4 viruses, like EHV-1, do not sort according to geographic origin and this is likely the result of the modern movement of common breeds globally. Recombination bootscan analysis (Fig. 7c) scanning EHV-4 group A strain 12-I-203 against the rest showed extensive recombination from both groups 1 and 2. Additional PHI recombination test analysis (Table 1) detected significant amounts of recombination in EHV-4 (p = <0.001).
Feline herpes virus type 1 (FHV-1) is thought to be main cause of corneal ulceration  in cats, and can also contribute to upper respiratory disease . Recently, several FHV-1 genomes from Australia were sequenced and genomic analysis did not reportedly detect recombination . As part of our analysis of varicellovirus phylogenetic relationships, we analyzed the available FHV-1 genomic sequences. The ML tree and phylogenetic network (Fig. 8a and b) suggested some strain grouping, however, because the overall interstrain genetic distance is low (0.0089%), we did not designate any clades. Reticulations within the phylogenetic network implied the presence of recombination between the FHV-1 strains. Bootscan recombination analysis (Fig. 8c), as well the PHI recombination test (Table 1; p = <0.001) detected recombination signals. The difficulty in detecting recombination is likely due to the low interstrain genetic distance (0.0089%), and it is possible that as additional strains are sequenced, more recombination may be more readily identified. It would be surprising if recombination was not detected in FHV-1, as herpesviruses have been shown to be highly recombinagenic [29, 31, 84, 85].
Until very recently, only three CHV-1 strains, collected between 1985 and 2000 from the UK had been sequenced , however a short time ago, a CHV-1 strain from Brazil was deposited into GenBank. The overall genetic distance between the three UK strains was very low, at 0.005% (Table 1; Fig. 9a). The CHV-1 strain from Brazil (strain BTU-1) was 0.34% distant from the three UK viruses, with an overall interstrain distance of 0.20% for the four viruses (Table 2; Fig. 9a). We performed a similarity plot using the MSA (multiple sequence alignment) without an outgroup, and found a deep trough at approximately 9 kb from the left end of the genome (Fig. 9b). The similarity trough corresponded to the UL50 deoxyuridine triphosphatase gene. Distance analysis comparing the UL50 protein sequences from the four CHV-1 viruses showed that the Brazilian BTU-1 strain UL50 was 12.2% distant compared to the UK strains (Table 4). Further blast searches (data not shown) determined that even though the BTU-1 strain UL50 protein sequence was 12.2% distant, it appeared closest to the remaining CHV-1 strains, rather than FHV-1. Bootscan recombination analysis using the UK derived 0194 strain as a reference only detected recombination signals from the other UK viruses, and none from BTU-1 (Fig. 9c). Curiously, the PHI recombination test did not detect statistically significant recombination (Table 1; p = 0.3082), and may be due to the small size of the dataset. Based on the data, we hypothesize that the BTU-1 strain may be the result of a recombination event between canine herpesvirus 1, and an unknown varicellovirus. It would be unlikely that positive selection would only affect a single gene in the virus to such a large extent (12.2% distance), however the possibility cannot be eliminated. Because the UL50 sequence most closely resembles the remaining CHV-1 viruses, it is likely that the unknown virus originated from an animal of the Caniformia suborder, which includes Brazilian species such as the maned wolf, bush dog, pampas fox, tayra, striped hog-nosed skunk, and crab-eating racoon.
Varicella zoster virus
The Varicella-Zoster virus (VZV) causes chickenpox as well as shingles, and the phylogeny of VZV clades has been extensively studied . We treated our analysis of VZV as an update, as additional strains have been sequenced and uploaded into GenBank. A ML tree and phylogenetic network using CeHV-9 (simian varicella virus) as the outgroup were constructed and are found in Fig. 10. The phylogenetic network suggests six clades, which are denoted numerically as described previously . (Fig. 9b). PHI recombination analysis confirmed statistically significant recombination among the VZV strains (Table 1; p = <0.001%).
In conclusion, this is the first genome based phylogenetic study of the entire Varicellovirus genus. In this study, we present a number of unique findings including results suggesting that a phylogenetic stricture exists between the ungulate viruses and the primate and carnivore viruses, a possible link between genome G + C content and intraspecies strain genetic diversity, the detection of recombination in all of the varicellovirus species including FHV-1, and that the Brazilian CHV-1 strain BTU may contain a genetic signal from an unknown varicellovirus in the UL50 gene. We also propose a clade nomenclature standardization for varicelloviruses. This work helps to deepen the understanding of varicellovirus genomics and evolution.
Anatid herpes virus type 1
Bovine herpes virus type 1
Bovine herpes virus type 5
Bubaline herpes virus type 1
Cercopithecine herpes virus type 9
Canine herpes virus type 1
Equine herpes virus type 1
Equine herpes virus type 3
Equine herpes virus type 4
Equine herpes virus type 8
Equine herpes virus type 9
Effective number of codons
Feline herpesvirus type 1
International Committee on Taxonomy of Viruses
Multiple sequence alignment
Porcine circovirus type 2
Pairwise homoplasty index
United States Department of Agriculture
Varicella zoster virus
Cardoso TC, Ferreira HL, Okamura LH, Oliveira BR, Rosa AC, Gameiro R, Flores EF. Comparative analysis of the replication of bovine herpesvirus 1 (BHV1) and BHV5 in bovine-derived neuron-like cells. Arch Virol. 2015;160(11):2683–91.
Charlton KM, Mitchell D, Girard A, Corner AH. Meningoencephalomyelitis in horses associated with equine herpesvirus 1 infection. Vet Pathol. 1976;13(1):59–68.
Verheyen K, Newton JR, Wood JL, Birch-Machin I, Hannant D, Humberstone RW. Possible case of EHV-4 ataxia in warmblood mare. Vet Rec. 1998;143(16):456.
Card JP, Rinaman L, Schwaber JS, Miselis RR, Whealy ME, Robbins AK, Enquist LW. Neurotropic properties of pseudorabies virus: uptake and transneuronal passage in the rat central nervous system. J Neurosci. 1990;10(6):1974–94.
Steiner I, Kennedy PG, Pachner AR. The neurotropic herpes viruses: herpes simplex and varicella-zoster. Lancet Neurol. 2007;6(11):1015–28.
Fukushi H, Tomita T, Taniguchi A, Ochiai Y, Kirisawa R, Matsumura T, Yanai T, Masegi T, Yamaguchi T, Hirai K. Gazelle herpesvirus 1: a new neurotropic herpesvirus immunologically related to equine herpesvirus 1. Virology. 1997;227(1):34–44.
Ledbetter EC, Marfurt CF, Dubielzig RR. Metaherpetic corneal disease in a dog associated with partial limbal stem cell deficiency and neurotrophic keratitis. Vet Ophthalmol. 2013;16(4):282–8.
Hora AS, Tonietti PO, Guerra JM, Leme MC, Pena HF, Maiorka PC, Brandao PE. Felid herpesvirus 1 as a causative agent of severe nonsuppurative meningoencephalitis in a domestic cat. J Clin Microbiol. 2013;51(2):676–9.
Gray WL. Simian varicella in old world monkeys. Comp Med. 2008;58(1):22–30.
Heberden W. On the chicken-pox. Med Trans Coll Phys. 1767;1:428.
USDA: U.S. National List of Reportable Animal Diseases (NLRAD)- National Animal Health Reporting System (NAHRS) reportable disease list. 2017.
Takahashi M, Otsuka T, Okuno Y, Asano Y, Yazaki T. Live vaccine used to prevent the spread of varicella in children in hospital. Lancet. 1974;2(7892):1288–90.
Osterrieder N, Neubauer A, Brandmuller C, Kaaden OR, O'Callaghan DJ. The equine herpesvirus 1 IR6 protein influences virus growth at elevated temperature and is a major determinant of virulence. Virology. 1996;226(2):243–51.
Kit S, Kit M, Pirtle EC. Attenuated properties of thymidine kinase-negative deletion mutant of pseudorabies virus. Am J Vet Res. 1985;46(6):1359–67.
Kit S, Sheppard M, Ichimura H, Kit M. Second-generation pseudorabies virus vaccine with deletions in thymidine kinase and glycoprotein genes. Am J Vet Res. 1987;48(5):780–93.
Moormann RJ, de Rover T, Briaire J, Peeters BP, Gielkens AL, van Oirschot JT. Inactivation of the thymidine kinase gene of a gI deletion mutant of pseudorabies virus generates a safe but still highly immunogenic vaccine strain. J Gen Virol. 1990;71(Pt 7):1591–5.
Kaashoek MJ, Moerman A, Madic J, Rijsewijk FA, Quak J, Gielkens AL, van Oirschot JT. A conventionally attenuated glycoprotein E-negative strain of bovine herpesvirus type 1 is an efficacious and safe vaccine. Vaccine. 1994;12(5):439–44.
Galeota JA, Flores EF, Kit S, Kit M, Osorio FA. A quantitative study of the efficacy of a deletion mutant bovine herpesvirus-1 differential vaccine in reducing the establishment of latency by wildtype virus. Vaccine. 1997;15(2):123–8.
Heldens JG, Hannant D, Cullinane AA, Prendergast MJ, Mumford JA, Nelly M, Kydd JH, Weststrate MW, van den Hoven R. Clinical and virological evaluation of the efficacy of an inactivated EHV1 and EHV4 whole virus vaccine (Duvaxyn EHV1,4). Vaccination/challenge experiments in foals and pregnant mares. Vaccine. 2001;19(30):4307–17.
Jas D, Aeberle C, Lacombe V, Guiot AL, Poulet H. Onset of immunity in kittens after vaccination with a non-adjuvanted vaccine against feline panleucopenia, feline calicivirus and feline herpesvirus. Vet J. 2009;182(1):86–93.
Lappin MR, Sebring RW, Porter M, Radecki SJ, Veir J. Effects of a single dose of an intranasal feline herpesvirus 1, calicivirus, and panleukopenia vaccine on clinical signs and virus shedding after challenge with virulent feline herpesvirus 1. J Feline Med Surg. 2006;8(3):158–63.
Poulet H, Guigal PM, Soulier M, Leroy V, Fayet G, Minke J, Chappuis Merial G. Protection of puppies against canine herpesvirus by vaccination of the dams. Vet Rec. 2001;148(22):691–5.
Raaperi K, Orro T, Viltrop A. Epidemiology and control of bovine herpesvirus 1 infection in Europe. Vet J. 2014;201(3):249–56.
Davison AJ, Scott JE. The complete DNA sequence of varicella-zoster virus. J Gen Virol. 1986;67(Pt 9):1759–816.
Telford EA, Watson MS, McBride K, Davison AJ. The DNA sequence of equine herpesvirus-1. Virology. 1992;189(1):304–16.
Telford EA, Watson MS, Perry J, Cullinane AA, Davison AJ. The DNA sequence of equine herpesvirus-4. J Gen Virol. 1998;79(Pt 5):1197–203.
Delhon G, Moraes MP, Lu Z, Afonso CL, Flores EF, Weiblen R, Kutish GF, Rock DL. Genome of bovine herpesvirus 5. J Virol. 2003;77(19):10339–47.
McGeoch DJ, Dolan A, Ralph AC. Toward a comprehensive phylogeny for mammalian and avian herpesviruses. J Virol. 2000;74(22):10401–6.
Norberg P, Depledge DP, Kundu S, Atkinson C, Brown J, Haque T, Hussaini Y, MacMahon E, Molyneaux P, Papaevangelou V, et al. Recombination of globally circulating varicella-zoster virus. J Virol. 2015;89(14):7133–46.
Ye C, Guo JC, Gao JC, Wang TY, Zhao K, Chang XB, Wang Q, Peng JM, Tian ZJ, Cai XH, et al. Genomic analyses reveal that partial sequence of an earlier pseudorabies virus in China is originated from a Bartha-vaccine-like strain. Virology. 2016;491:56–63.
Vaz PK, Horsington J, Hartley CA, Browning GF, Ficorilli NP, Studdert MJ, Gilkerson JR, Devlin JM. Evidence of widespread natural recombination among field isolates of equine herpesvirus 4 but not among field isolates of equine herpesvirus 1. J Gen Virol. 2016;97(3):747–55.
Izume S, Kirisawa R, Ohya K, Ohnuma A, Kimura T, Omatsu T, Katayama Y, Mizutani T, Fukushi H. The full genome sequences of 8 equine herpesvirus type 4 isolates from horses in Japan. J Vet Med Sci. 2017;79(1):206–12.
Marin A, Bertranpetit J, Oliver JL, Medina JR. Variation in G + C-content and codon choice: differences among synonymous codon groups in vertebrate genes. Nucleic Acids Res. 1989;17(15):6181–9.
Sau K, Gupta SK, Sau S, Ghosh TC. Synonymous codon usage bias in 16 Staphylococcus aureus phages: implication in phage therapy. Virus Res. 2005;113(2):123–31.
Li G, Ji S, Zhai X, Zhang Y, Liu J, Zhu M, Zhou J, Su S. Evolutionary and genetic analysis of the VP2 gene of canine parvovirus. BMC Genomics. 2017;18(1):534.
Butt AM, Nasrullah I, Qamar R, Tong Y. Evolution of codon usage in Zika virus genomes is host and vector specific. Emerg Microbes Infect. 2016;5(10):e107.
Singh NK, Tyagi A, Kaur R, Verma R, Gupta PK. Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus. Virus Res. 2016;221:58–65.
Roychoudhury S, Mukherjee D. A detailed comparative analysis on the overall codon usage pattern in herpesviruses. Virus Res. 2010;148(1–2):31–43.
Shackelton LA, Parrish CR, Holmes EC. Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J Mol Evol. 2006;62(5):551–63.
He W, Zhang H, Zhang Y, Wang R, Lu S, Ji Y, Liu C, Yuan P, Su S. Codon usage bias in the N gene of rabies virus. Infect Genet Evol. 2017;54:458–65.
Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66.
Katoh K, Toh H. Parallelization of the MAFFT multiple sequence alignment program. Bioinformatics. 2010;26(15):1899–900.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.
Berger SA, Krompass D, Stamatakis A. Performance, accuracy, and web server for evolutionary placement of short sequence reads under maximum likelihood. Syst Biol. 2011;60(3):291–302.
Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23(2):254–67.
Bruen TC, Philippe H, Bryant D. A simple and robust statistical test for detecting the presence of recombination. Genetics. 2006;172(4):2665–81.
Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 9(8):772.
Martin DP, Lemey P, Lott M, Moulton V, Posada D, Lefeuvre P. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26(19):2462–3.
Grau-Roma L, Crisci E, Sibila M, Lopez-Soria S, Nofrarias M, Cortey M, Fraile L, Olvera A, Segales J. A proposal on porcine circovirus type 2 (PCV2) genotype definition and their relation with postweaning multisystemic wasting syndrome (PMWS) occurrence. Vet Microbiol. 2008;128(1–2):23–35.
Wright F. The 'effective number of codons' used in a gene. Gene. 1990;87(1):23–9.
Rozas J. DNA sequence polymorphism analysis using DnaSP. Methods Mol Biol. 2009;537:337–50.
Engels M, Giuliani C, Wild P, Beck TM, Loepfe E, Wyler R. The genome of bovine herpesvirus 1 (BHV-1) strains exhibiting a neuropathogenic potential compared to known BHV-1 strains by restriction site mapping and cross-hybridization. Virus Res. 1986;6(1):57–73.
ICTV Taxonomy [https://talk.ictvonline.org/taxonomy/w/ictv-taxonomy].
WHO. Updated unified nomenclature system for the highly pathogenic H5N1 avian influenza viruses. In: Global influenza surveillance and response system (GISRS): World Health Organization; 2011. http://www.who.int/influenza/gisrs_laboratory/h5n1_nomenclature/en/.
Xiao CT, Halbur PG, Opriessnig T. Global molecular genetic analysis of porcine circovirus type 2 (PCV2) sequences confirms the presence of four main PCV2 genotypes and reveals a rapid increase of PCV2d. J Gen Virol. 2015;96(Pt 7):1830–41.
Segales J, Olvera A, Grau-Roma L, Charreyre C, Nauwynck H, Larsen L, Dupont K, McCullough K, Ellis J, Krakowka S, et al. PCV-2 genotype definition and nomenclature. Vet Rec. 2008;162(26):867–8.
Grose C. Pangaea and the out-of-africa model of varicella-zoster virus evolution and phylogeography. J Virol. 2012;86(18):9558–65.
DeRose-Wilson LJ, Gaut BS. Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata. BMC Evol Biol. 2007;7:66.
Miller JM, Whetstone CA, Van der Maaten MJ. Abortifacient property of bovine herpesvirus type 1 isolates that represent three subtypes determined by restriction endonuclease analysis of viral DNA. Am J Vet Res. 1991;52(3):458–61.
D'Arce RC, Almeida RS, Silva TC, Franco AC, Spilki F, Roehe PM, Arns CW. Restriction endonuclease and monoclonal antibody analysis of Brazilian isolates of bovine herpesviruses types 1 and 5. Vet Microbiol. 2002;88(4):315–24.
Muylkens B, Thiry J, Kirten P, Schynts F, Thiry E. Bovine herpesvirus 1 infection and infectious bovine rhinotracheitis. Vet Res. 2007;38(2):181–209.
Rijsewijk FA, Kaashoek MJ, Langeveld JP, Meloen R, Judek J, Bienkowska-Szewczyk K, Maris-Veldhuis MA, van Oirschot JT. Epitopes on glycoprotein C of bovine herpesvirus-1 (BHV-1) that allow differentiation between BHV-1.1 and BHV-1.2 strains. J Gen Virol. 1999;80(Pt 6):1477–83.
Wang X, CX W, Song XR, Chen HC, Liu ZF. Comparison of pseudorabies virus China reference strain with emerging variants reveals independent virus evolution within specific geographic regions. Virology. 2017;506:92–8.
Cucchi HT-BA, Yuan J, Dobney K, et al. J Archeol Sci. 2011;38:11–22.
Ervynck A, Dobney K, Hongo H, Meadow R. Born free? New evidence for the status of Sus Scrofa at Neolithic Çayönü Tepesi (southeastern Anatolia, Turkey). Paléorient. 2001;27:47–73.
Abdelgawad A, Azab W, Damiani AM, Baumgartner K, Will H, Osterrieder N, Greenwood AD. Zebra-borne equine herpesvirus type 1 (EHV-1) infection in non-African captive mammals. Vet Microbiol. 2014;169(1–2):102–6.
Greenwood AD, Tsangaras K, Ho SY, Szentiks CA, Nikolin VM, Ma G, Damiani A, East ML, Lawrenz A, Hofer H, et al. A potentially fatal mix of herpes in zoos. Curr Biol. 2012;22(18):1727–31.
Kennedy MARE, Diderrich V, Richman L, Allen GP, Potgieter LND. Encephalitis associated with a variant of equine herpesvirus 1 in a Thomson’s gazelle (Gazella Thomsoni). J Zoo Wildl Med. 1996;27:533–8.
Montali RJ, Allen GP, Bryans JT, Phillips LG, Bush M. Equine herpesvirus type 1 abortion in an onager and suspected herpesvirus myelitis in a zebra. J Am Vet Med Assoc. 1985;187(11):1248–9.
Blunden AS, Smith KC, Whitwell KE, Dunn KA. Systemic infection by equid herpesvirus-1 in a Grevy's zebra stallion (Equus Grevyi) with particular reference to genital pathology. J Comp Pathol. 1998;119(4):485–93.
Goehring LS, van Winden SC, van Maanen C, Sloet van Oldruitenborgh-Ossterbaan MM: Equine herpesvirus type 1-associated myeloencephalopathy in The Netherlands: a four-year retrospective study (1999–2003). J Vet Intern Med 2006, 20(3):601-607.
Allen GP, Bolin DC, Bryant U, Carter CN, Giles RC, Harrison LR, Hong CB, Jackson CB, Poonacha K, Wharton R, et al. Prevalence of latent, neuropathogenic equine herpesvirus-1 in the thoroughbred broodmare population of central Kentucky. Equine Vet J. 2008;40(2):105–10.
Goodman LB, Loregian A, Perkins GA, Nugent J, Buckles EL, Mercorelli B, Kydd JH, Palu G, Smith KC, Osterrieder N, et al. A point mutation in a herpesvirus polymerase determines neuropathogenicity. PLoS Pathog. 2007;3(11):e160.
Guo X, Izume S, Okada A, Ohya K, Kimura T, Fukushi H. Full genome sequences of zebra-borne equine herpesvirus type 1 isolated from zebra, onager and Thomson's gazelle. J Vet Med Sci. 2014;76(9):1309–12.
Nugent J, Birch-Machin I, Smith KC, Mumford JA, Swann Z, Newton JR, Bowden RJ, Allen GP, Davis-Poynter N. Analysis of equid herpesvirus 1 strain variation reveals a point mutation of the DNA polymerase strongly associated with neuropathogenic versus nonneuropathogenic disease outbreaks. J Virol. 2006;80(8):4047–60.
Javier RT, Sedarati F, Stevens JG. Two avirulent herpes simplex viruses generate lethal recombinants in vivo. Science. 1986;234:746–8.
Kintner RL, Allan RW, Brandt CR. Recombinants are isolated at high frequency following in vivo mixed ocular infection with two avirulent herpes simplex virus type 1 strains. Arch Virol. 1995;140(2):231–44.
Kolb AW, Lee K, Larsen I, Craven M, Brandt CR. Quantitative trait locus based virulence determinant mapping of the HSV-1 genome in murine ocular infection: genes involved in viral regulatory and innate immune networks contribute to virulence. PLoS Pathog. 2016;12(3):e1005499.
Lee K, Kolb AW, Larsen I, Craven M, Brandt CR. Mapping murine corneal neovascularization and weight loss virulence determinants in the HSV-1 genome and the detection of an epistatic interaction between the UL and IRS/US regions. J Virol. 2016;90:8115–31.
Matsumura T, Sugiura T, Imagawa H, Fukunaga Y, Kamada M. Epizootiological aspects of type 1 and type 4 equine herpesvirus infections among horse populations. J Vet Med Sci. 1992;54(2):207–11.
Hartley C. Aetiology of corneal ulcers assume FHV-1 unless proven otherwise. J Feline Med Surg. 2010;12(1):24–35.
Gaskell R, Dawson S, Radford A, Thiry E. Feline herpesvirus. Vet Res. 2007;38(2):337–54.
Vaz PK, Job N, Horsington J, Ficorilli N, Studdert MJ, Hartley CA, Gilkerson JR, Browning GF, Devlin JM. Low genetic diversity among historical and contemporary clinical isolates of felid herpesvirus 1. BMC Genomics. 2016;17:704.
Lee K, Kolb AW, Sverchkov Y, Cuellar JA, Craven M, Brandt CR. Recombination analysis of herpes simplex virus type 1 reveals a bias towards GC content and the inverted repeat regions. J Virol. 2015;89:7214–23.
Schynts F, Meurens F, Detry B, Vanderplasschen A, Thiry E. Rise and survival of bovine herpesvirus 1 recombinants after primary infection and reactivation from latency. J Virol. 2003;77(23):12535–42.
Papageorgiou KV, Suarez NM, Wilkie GS, McDonald M, Graham EM, Davison AJ. Genome sequence of canine herpesvirus. PLoS One. 2016;11(5):e0156015.
We would like to acknowledge Dr. Ellison Bentley for her general support.
This study was supported by a Core Grant for Vision Research (EYP30016665), an unrestricted grant to the Department of Ophthalmology and Visual Sciences from Research to Prevent Blindness, Inc. and the Clinical and Translational Science Award (CTSA) program, through the NIH National Center for Advancing Translational Sciences (NCATS), grant UL1TR000427. The funding sources had no role in study design, data collection, data analysis, data interpretation, or the writing of the manuscript.
Availability of data and materials
The accession numbers for the sequences used in this study are located in Additional file 1: Table S1. The multiple sequence alignments generated for this study are available for download at http://sites.ophth.wisc.edu/brandt/.
AK, AL, RMT and CRB conceived and designed the experiments. AK, AL, and RMT performed the experiments. AK, AL, RMT and CRB analyzed the data. AK, AL, RMT, GM, and CRB contributed to the writing of the manuscript. The authors have read, and approved this manuscript.
Ethics approval and consent to participate
The experiments in this study did not directly involve humans, animals, or plants.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Accession numbers for the sequences used in the present study. The species, isolation host, strain designation, genome size, country of isolation, isolation source, collection date, and accession number for each sequence is provided. Table S2. Unscaled F values. The the only reported F values were those which included cut-offs that result in two groups for all viruses. Table S3. Rescaled F values. The rescaled values of F simply indicates the relative size of F. The total value is on a scale from 0 to 3, with 3 meaning that the separation is as good as possible for all three viruses. (ZIP 42 kb)