Construction of anti-codon table of the plant kingdom and evolution of tRNA selenocysteine (tRNA^Sec)

Mohanta, Tapan Kumar; Mishra, Awdhesh Kumar; Hashem, Abeer; Abd_Allah, Elsayed Fathi; Khan, Abdul Latif; Al-Harrasi, Ahmed

doi:10.1186/s12864-020-07216-3

Research article
Open access
Published: 19 November 2020

BMC Genomics volume 21, Article number: 804 (2020)

Abstract

Background

tRNAs act as a bridge between the coding mRNA and incoming amino acids during protein translation. The anti-codon of a tRNA recognizes the codon of the mRNA and delivers the amino acid to the growing polypeptide chain. However, the exact abundance of anti-codons in the genome, and whether their frequency remains the same across the plant lineage, was not known.

Results

Therefore, we analysed the tRNAnome of 128 plant species and report an anti-codon table of the plant kingdom. We found that the CAU anti-codon of tRNA^Met has the highest abundance (5.039%) whereas the GCG anti-codon of tRNA^Arg has the lowest (0.004%). However, when we compared the anti-codon frequencies according to tRNA isotype, we found that tRNA^Leu (7.808%) has the highest abundance, followed by tRNA^Ser (7.668%) and tRNA^Gly (7.523%). Similarly, suppressor tRNA (0.036%) has the lowest abundance, followed by tRNA^Sec (0.066%) and tRNA^His (2.109%). The genomes of Ipomoea nil, Papaver somniferum, and Zea mays encoded the highest number of anti-codons (isoacceptors) at 59 each, whereas the genome of Ostreococcus tauri was found to encode only 18 isoacceptors. The tRNA^Sec genes have undergone losses more frequently than duplications, and we found that tRNA^Sec showed an anti-codon switch during the course of evolution.

Conclusion

The anti-codon table of plant tRNAs will help us to understand the synonymous codon usage of the plant kingdom and to determine which codons are preferred over others during translation.

Background

The proteins present in cells are the product of the blueprint prescribed by the genes [1,2,3]. Collectively, all of the genes (coding and non-coding) present in a cell represent the genome of an organism [4, 5]. The construction of a protein from a gene is a complex procedure that requires the involvement of transfer RNA (tRNA), messenger RNA (mRNA), ribosomes, amino acids, and other molecules [6,7,8,9]. This process, commonly known as translation, is a fundamental process of living cells [6,7,8,9]. The functional apparatus involved in translation is highly conserved across the tree of life [10]. mRNA conveys the blueprint information as triplet codons composed of nucleotides, and tRNAs recognize their cognate codons [11, 12]. Although mRNA and ribosomes represent the two major parts of the machinery responsible for translation, tRNAs are the fundamental units of this machinery [13,14,15]. The anti-codon of a tRNA pairs with the codon of the mRNA and supplies the corresponding amino acid to the growing polypeptide chain [3, 8, 15, 16]. Two or more different tRNAs can bind the same amino acid and transfer it to the ribosome [17,18,19,20]. There are 22 different amino acids encoded by 63 codons (including the UGA and UAG codons for selenocysteine and pyrrolysine, respectively); several amino acids are encoded by more than one codon and hence are served by more than one anti-codon [21,22,23,24,25]. Therefore, more than one tRNA molecule, each with a different anti-codon, can transfer a particular amino acid [21, 26,27,28]. Although codon selection for a corresponding anti-codon is the primary unit of the translation machinery, mutational bias, selection, drift, and codon usage bias also shape translation [29,30,31,32]. 
Although there are critical steps for the efficient and proper functioning of the translation machinery, other synonymous codons can also serve as an alternative choice [32,33,34]. The differential use of codons also reflects their natural demand in the protein translation machinery [35, 36]. tRNAs are classified into various gene families based on their isoacceptor anti-codons [17, 19, 20]. The available tRNA pool is maintained at a level that can accommodate the transcript levels present in a cell, thus ensuring efficient and accurate translation. Highly-expressed genes, however, exhibit codon usage bias that reflects the copy number of the corresponding tRNA [37,38,39]. Translational selection acts to maintain the balance between codon usage and tRNA availability [40,41,42]. There is always selection pressure, however, to increase the production of the codons used in highly-expressed genes [32, 43, 44].

Over the course of evolution, the earth has undergone enormous changes and the plant kingdom has been subjected to numerous stresses [45,46,47]. All living organisms had to adapt to a changing environment, which resulted in the increased importance of some protein-coding genes while others became less important [48,49,50]. Accordingly, there was a need to alter the relative number and type of available tRNAs to fulfil the translational requirements of the new and/or modified protein-coding genes [51, 52]. Changes in the relative number and type of tRNA molecules are also associated with a change in the number and type of anti-codons [53, 54]. The role of selection pressure brought about by translational demand and its role in maintaining tRNA pools has not been adequately addressed. Furthermore, the selection pressure that determines the maintenance of low copy tRNA families and anti-codons also remains unclear. Whether translational selection pressure favours optimal codons in particular cases and keeps other codons as non-optimal, and hence in low supply, is unknown. It is also unknown if the amino acid requirements of proteins impact the need to provide specific tRNAs having the required anti-codons, as well as the genes that encode those tRNAs. In the present study, an attempt was made to determine the frequency of anti-codons in the tRNAnome of the Plant Kingdom to better understand the presence of codons and anti-codon frequency. Our objective was to provide information on the link between the presence of codons and their corresponding anti-codons, tRNAs, and the number of amino acids utilized in plant proteomes. Therefore, we analysed the frequency of anti-codons in the tRNA of plant genomes and constructed an anti-codon table of the Plant Kingdom.

Material and methods

Sequence retrieval

The annotated RNA sequence files of all 128 plant species were downloaded from the National Center for Biotechnology Information (NCBI) using the Ensembl genome browser. The downloaded sequence files were scanned for the presence of tRNAs using tRNAscan-SE software on a Linux-based platform. The resulting tRNAscan-SE output files were used for further analysis. After the individual files had been scanned, all output files were merged to obtain a complete plant tRNAnome file. The frequency of each individual anti-codon was obtained from the tRNAnome file and presented as a number and a percentage (%). In the course of the analysis, several tRNA^Sec genes were identified in different plant genomes and were kept separately for further study.
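
The counting step described above can be sketched as follows. This is a minimal illustration and not the authors' script: it assumes tab-separated tRNAscan-SE-style rows in which the anti-codon is the sixth column and is written in DNA letters, and the function name `anticodon_percentages` and the example rows are hypothetical.

```python
from collections import Counter

def anticodon_percentages(tabular_lines):
    """Count anti-codons in tRNAscan-SE-style tab-separated rows.

    Assumption: the anti-codon sits in the sixth column, and header rows
    (first field 'Sequence', 'Name', or a run of dashes) can be skipped.
    Returns {anticodon: (count, percent_of_all_tRNAs)}.
    """
    counts = Counter()
    for line in tabular_lines:
        fields = [f.strip() for f in line.rstrip("\n").split("\t")]
        if len(fields) < 6 or fields[0] in ("Sequence", "Name") or fields[0].startswith("-"):
            continue
        anticodon = fields[5].upper().replace("T", "U")  # report as RNA
        counts[anticodon] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {ac: (n, 100.0 * n / total) for ac, n in counts.items()}

# Hypothetical rows for illustration only; not data from the study.
example_rows = [
    "Sequence\t\ttRNA\tBounds\ttRNA\tAnti\tIntron Bounds\tInf",
    "Name\ttRNA #\tBegin\tEnd\tType\tCodon\tBegin\tEnd\tScore",
    "--------\t------\t-----\t------\t----\t-----\t-----\t----\t------",
    "chr1\t1\t100\t172\tMet\tCAT\t0\t0\t70.1",
    "chr1\t2\t900\t971\tMet\tCAT\t0\t0\t68.4",
    "chr2\t1\t50\t121\tGly\tGCC\t0\t0\t72.0",
    "chr3\t1\t10\t92\tLeu\tCAA\t0\t0\t65.3",
]
table = anticodon_percentages(example_rows)
for anticodon, (count, percent) in sorted(table.items()):
    print(anticodon, count, round(percent, 3))
```

Feeding the merged rows of all species to the same function would give kingdom-wide percentages of the kind reported in Table 1, provided the column layout assumed here matches the actual output files.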

Sequence alignment

Multiple sequence alignment of tRNA^Sec genes was conducted using MultAlin software with default parameters. To construct the phylogenetic tree, a multiple sequence alignment of tRNAs and tRNA^Sec was conducted using the MUSCLE program in MEGA7 software [55, 56]. The resulting alignment was saved in MEGA file format. The alignment file was subsequently used to construct a phylogenetic tree using MEGA7 software. Prior to the construction of the phylogenetic tree, a model selection was carried out using the following statistical parameters: statistical method (maximum likelihood), substitution type (nucleotides), and gaps/missing data treatment (complete deletion). Based on the lowest BIC score, a phylogenetic tree of tRNAs and tRNA^Sec was constructed. The statistical parameters used to construct the phylogenetic tree were: statistical method (maximum likelihood), test of phylogeny (bootstrap method), no. of bootstrap replicates (1000), substitution type (nucleotides), model/method (Kimura-2-parameter model), rates among sites (gamma distributed), no. of discrete gamma parameters (5), gaps/missing data treatment (partial deletion), site coverage cut-off (95%), ML heuristic method (nearest-neighbour-interchange), and branch swap filter (very strong). A separate phylogenetic tree was constructed using all of the tRNA^Sec sequences and the same statistical approaches as mentioned above to determine deletion and duplication events. The constructed phylogenetic tree of tRNA^Sec genes was exported in Newick file format. Subsequently, a species tree was constructed using all of the 128 species in the taxonomy browser of NCBI. To determine tRNA^Sec deletion and duplication events, the phylogenetic tree of tRNA^Sec was reconciled with the species tree using Notung software, version 2.9. The reconciled gene and species tree revealed deletion, duplication, and co-divergence events that occurred in tRNA^Sec genes. The resultant phylogenetic tree of tRNAs (with tRNA^Sec) and the phylogenetic tree of tRNA^Sec were analysed using IcyTree to identify recombination events.

Cluster based grouping of the anti-codons

Anti-codons were grouped based on their percentage frequency in the tRNAnome. To cluster them, the percent frequency of anti-codons was used against each anti-codon. A classical clustering approach was used to cluster the anti-codons using a paired group UPGMA algorithm and Euclidean similarity index with 1000 bootstrap replicates.
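
To make the grouping procedure concrete, the following sketch applies average-linkage (UPGMA) clustering to a handful of anti-codon percentages quoted in the Results. It is an illustration written in plain Python rather than the software used in the study; the function name `upgma_groups` is hypothetical, only eight anti-codons are included, and the bootstrap step is omitted.

```python
from itertools import combinations

def upgma_groups(values, n_groups):
    """Average-linkage (UPGMA) agglomerative clustering of 1-D values.

    values: dict mapping a name to its percent frequency.
    The distance between two clusters is the mean absolute difference
    over all cross-cluster pairs (Euclidean distance in one dimension).
    Merging stops once n_groups clusters remain.
    """
    clusters = [[name] for name in values]

    def dist(a, b):
        return sum(abs(values[x] - values[y]) for x in a for y in b) / (len(a) * len(b))

    while len(clusters) > n_groups:
        # Find the closest pair of clusters and merge them.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: dist(clusters[p[0]], clusters[p[1]]))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    # Order groups from most to least abundant, members alphabetically.
    return sorted((sorted(c) for c in clusters),
                  key=lambda c: -max(values[n] for n in c))

# Percent frequencies quoted in the Results for the four most and
# four least abundant anti-codons.
freq = {"CAU": 5.033, "GUC": 4.274, "GUU": 4.020, "GCC": 3.811,
        "GCG": 0.004, "GAG": 0.009, "CUA": 0.0111, "ACU": 0.019}
groups = upgma_groups(freq, 2)
print(groups)
```

With the full anti-codon table as input and `n_groups=5`, the same routine would produce a five-way partition, although the result need not match the published groups exactly because tie handling and bootstrap support are not reproduced here.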

Statistical analysis

A probability plot and a linear regression analysis of tRNA gene number per genome and anti-codon frequency were carried out, and a value of p < 0.05 was considered to be significant. To investigate anti-codon numbers in different lineages and their statistical significance, a t-test was conducted comparing anti-codon number in eudicot vs. monocot, eudicot vs. algae, and monocot vs. algae. Differences were deemed significant at p < 0.05. All of the statistical analyses were conducted using Past3 software.
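
The lineage comparison can be illustrated with a small two-sample t-test. This sketch makes several assumptions that the text does not confirm: it uses the unpaired Student form with pooled variance (a natural choice when the groups contain different numbers of species), the input values are hypothetical, and the helper name `two_sample_t` is not from the study.

```python
from math import sqrt
from statistics import mean, variance

def two_sample_t(a, b):
    """Student's two-sample t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Assumes independent samples with
    roughly equal variances; this is an illustrative choice.
    """
    n1, n2 = len(a), len(b)
    pooled = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    t = (mean(a) - mean(b)) / sqrt(pooled * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2

# Hypothetical anti-codon numbers per genome; not the study's data.
group_a = [48, 50, 52, 54, 56]
group_b = [49, 51, 53, 55, 57]
t_stat, dof = two_sample_t(group_a, group_b)

# Two-tailed critical value of Student's t for 8 degrees of freedom at p = 0.05.
critical = 2.306
significant = abs(t_stat) > critical
print(t_stat, dof, significant)
```

The decision rule is the one used in Table 3: the difference between group means is called significant only when the absolute t statistic exceeds the critical value for the relevant degrees of freedom.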

Results

Genome size is not proportional to the number of tRNA genes

A genome-wide analysis of fully-annotated whole genome sequences of 128 plant species was conducted to identify tRNA genes and to construct an anti-codon table of the plant kingdom (Table 1). The species included in the study varied in the size of their respective genomes (Table 2). A regression analysis was conducted to determine the correlation between genome size and the number of tRNA genes encoded per genome. Results indicated that plant genome size was not correlated (r = 0.5471, y = 0.17892x + 619.76) with the number of tRNA genes per genome (Fig. 1). Ipomoea nil, with a genome size of 735.23 Mb, possesses 6475 tRNA genes, which was the highest number of tRNA-encoding genes identified in the plant species that were analysed. Other species with a high number of tRNA genes in their genome were Cucurbita moschata (4062), Cucurbita pepo (3228), Cucurbita maxima (3036), Papaver somniferum (2571), Brassica napus (2180), and Ipomoea triloba (2180). Among the 128 analysed plant species, 22 (16.92%) species possessed more than 1000 tRNA genes in their genome. In contrast, Ostreococcus tauri and Phaeodactylum tricornutum each encoded only 41 tRNA genes in their genome, which was the lowest number of tRNA genes in the analysed genomes. Other species encoding a low number of tRNA genes were Raphidocelis subcapitata (43), Monoraphidium neglectum (48), and Bathycoccus prasinos (57). The genome sizes of O. tauri, P. tricornutum, R. subcapitata, and M. neglectum were 14.76, 27.4, 51.16, and 69.71 Mb, respectively. These genomes are smaller than those of most of the other plant species that were analysed.
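
As a rough worked example of what the reported fit implies, the sketch below evaluates the regression line for Ipomoea nil and squares the reported r. It assumes that x in the reported equation is genome size in Mb and y is the number of tRNA genes, which the text implies but does not state explicitly; the function name `predicted_trna_genes` is hypothetical.

```python
def predicted_trna_genes(genome_size_mb, slope=0.17892, intercept=619.76):
    """Evaluate the reported regression line y = 0.17892x + 619.76.

    Assumption: x is genome size in Mb and y is tRNA genes per genome.
    """
    return slope * genome_size_mb + intercept

r = 0.5471
r_squared = r ** 2  # share of variance in tRNA gene number explained by the fit

# Values for Ipomoea nil as quoted in the text.
ipomoea_nil_size_mb = 735.23
ipomoea_nil_observed = 6475

expected = predicted_trna_genes(ipomoea_nil_size_mb)
residual = ipomoea_nil_observed - expected
print(round(r_squared, 4), round(expected, 1), round(residual, 1))
```

Under these assumptions the fitted line accounts for roughly 30% of the variance in tRNA gene number, and the I. nil genome carries several thousand more tRNA genes than the line predicts for its size.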

Table 1 Anti-codon table of the plant kingdom with frequency of anti-codons

Table 2 Genomic details of plant anti-codons

CAU (Met) was the most abundant and GCG (Arg) the least abundant anti-codon in the plant kingdom

The occurrence of each anti-codon was analysed separately to determine the frequency of anti-codons in the genomes of the Plant Kingdom. Results indicated that CAU (Met) was the most abundant (5.033%) anti-codon in the Plant Kingdom, followed by GUC (Asp, 4.274%), GUU (Asn, 4.020%), and GCC (Gly, 3.811%) (Table 1, Supplementary File 1). In contrast, GCG (Arg) was identified as the least abundant (0.004%) anti-codon in the Plant Kingdom, followed by GAG (Leu, 0.009%), CUA (Sup, 0.0111%), and ACU (Ser, 0.019%) (Table 1, Supplementary File 1). The least abundant anti-codon (GCG) was only present in Ipomoea nil, Nicotiana attenuata, Papaver somniferum, and Ziziphus jujuba. When anti-codon frequency was summed by tRNA isotype, however, tRNA^Leu was found to be the most abundant (Table 1). Approximately 7.808% of all anti-codons in the Plant Kingdom belonged to tRNA^Leu (Table 1). tRNA^Leu was followed by tRNA^Ser (7.668%), tRNA^Gly (7.523%), and tRNA^Arg (7.284%) (Table 1). tRNA^Leu, tRNA^Ser, and tRNA^Arg each have six different isoacceptors, which might be the reason for their higher abundance in the plant genomes. Suppressor tRNA (0.036%) was found to be the least abundant tRNA isotype in the plant genomes, followed by tRNA^Sec (0.066%), tRNA^His (2.109%), and tRNA^Cys (2.547%) (Table 1). The suppressor tRNA (CUA) anti-codon was only found in Ectocarpus siliculosus, Nicotiana sylvestris, and Zea mays (Supplementary File 1).

Anti-codons can be classified into five groups based on their frequency of occurrence in plant genomes

A clustering analysis based on the frequency of abundance of the anti-codons in the Plant Kingdom was conducted using the paired group (UPGMA) algorithm and Euclidean similarity index with 1000 bootstrap replicates. The analysis revealed five distinct groups of anti-codons, which were named groups A, B, C, D, and E (Fig. 2). The anti-codons in the different groups were: Group A: CAU, GCC, GUU, and GUC; Group B: CUU, GAA, AAU, AGA, UCC, GCA, GCU, UCC, AAC, CCA, GUA, UUU, UGG, AGC, UUC, and CUC; Group C: UGA, UGU, UAG, UUG, UCU, CAC, AGU, GUG, AAG, AGG, UGC, CAA, and ACG; Group D: CCG, CGU, CGA, CGG, CAG, UAA, CGC, UAU, UCG, CCC, UAC, CCU, and CUG; and Group E: GGU, GGA, AUU, GAU, GAC, AUC, AUG, AAA, ACA, UCA, GGG, ACU, UUA, GGC, ACC, AUA, GAG, CUA, and GCG (Fig. 2). The anti-codon groupings are based on their abundance in plant genomes, from highest (Group A) to lowest (Group E).

Plant genomes encode 18 to 59 isoacceptors (anti-codons)

The genome-wide analysis of the Plant Kingdom revealed diversity in the number of anti-codons present in the genomes of individual species, which ranged from 18 to 59 (Table 2). Ostreococcus tauri was found to encode only 18 isoacceptors, while Micromonas commoda encodes only 26 (Table 2). Ipomoea nil, Papaver somniferum, and Zea mays encoded the highest number of anti-codons at 59 each. At least 51 (39.53%) species were found to encode 50 or more anti-codons in their genome. On average, plant genomes encode 48.25 anti-codons per genome. A paired two-tailed t-test was conducted to compare the number of anti-codons present in algae, eudicot, and monocot species. The comparison between eudicot and monocot species indicated that the number of tRNA anti-codons in these two groups was not significantly different at p < 0.05 (t = 1.2691, below the critical value of 1.984) (Table 3). In contrast, a significant difference was observed between eudicots and algae (10.3939 > 1.987), and between monocots and algae (6.2914 > 2.037) (Table 3). Notably, the variance in anti-codon number in the monocot lineage was much lower than in the eudicots and algae.

Table 3 (A) t-test (two tailed) between eudicot and monocot anti-codon numbers. The t-value is smaller than the critical value (1.2691 < 1.984), so the means are not significantly different (p < 0.05). (B) t-test (two tailed) between eudicot and algae anti-codon numbers. The t-value is greater than the critical value (10.3939 > 1.987), so the means are significantly different (p < 0.05). (C) t-test (two tailed) between monocot and algae anti-codon numbers. The t-value is greater than the critical value (6.2914 > 2.037), so the means are significantly different (p < 0.05)

Only a few species have lost tRNA genes

Our analysis revealed that a few species have lost specific tRNA genes (tRNA isotypes) from their genome. These species include Coccomyxa subellipsoidea (tRNA^Tyr), Corchorus capsularis (tRNA^Lys, tRNA^Tyr), Corchorus olitorius (tRNA^Tyr), Klebsormidium nitens (tRNA^Tyr, tRNA^Ser), Monoraphidium neglectum (tRNA^Thr), Ostreococcus tauri (tRNA^Phe, tRNA^Gln), Picea glauca (tRNA^Ser), Phaeodactylum tricornutum (tRNA^Cys), and Raphidocelis subcapitata (tRNA^Tyr) (Table 2). Understanding the loss of tRNA genes and its functional implications for protein translation is crucial.

Some plant species encode tRNA^Sec in their genomes

Several plant species were found to encode tRNA genes for the amino acid selenocysteine. More specifically, 22 (17.187%) species were found to encode a tRNA^Sec gene in their genome. These species were Aegilops tauschii, Beta vulgaris, Brassica rapa, Cucumis sativus, Cucurbita maxima, Cucurbita moschata, Cucurbita pepo, Ectocarpus siliculosus, Ipomoea nil, Ipomoea triloba, Lactuca sativa, Momordica charantia, Medicago truncatula, Monoraphidium neglectum, Nicotiana tabacum, Papaver somniferum, Picea glauca, Populus euphratica, Salvia splendens, Tarenaya hassleriana, Triticum urartu, and Zea mays (Table 2). The length of the tRNA^Sec-encoding genes ranged from 70 to 90 nucleotides, with an average length of 72.93 nucleotides per tRNA. A multiple sequence alignment of tRNA^Sec genes indicated the presence of a conserved G-x-C motif spanning the 30th to 32nd positions and a conserved U-C-A at the 34th, 35th, and 36th positions (Supplementary Figure 1). The pseudo-uridine loop was also found to contain a conserved G-U-U-x₂-A-x₂-C nucleotide consensus sequence (Supplementary Figure 1). The tRNA^Sec in C. maxima (NW_019272053.1), however, was found to encode a C-U-U nucleotide sequence instead of the conserved G-U-U consensus sequence in its pseudo-uridine loop (Supplementary Figure 1).
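
The conserved positions described above can be checked programmatically. The sketch below is illustrative only: it assumes the quoted positions can be applied directly to an ungapped sequence (in the study they refer to columns of the alignment in Supplementary Figure 1), and both the function name `has_sec_signature` and the test strings are hypothetical.

```python
def has_sec_signature(seq):
    """Check the conserved positions described for tRNA^Sec (1-based).

    Requires G at position 30, C at position 32, and U-C-A at
    positions 34-36. DNA 'T' is read as RNA 'U'.
    Assumption: positions are counted on the ungapped sequence.
    """
    rna = seq.upper().replace("T", "U")
    if len(rna) < 36:
        return False
    return rna[29] == "G" and rna[31] == "C" and rna[33:36] == "UCA"

# Hypothetical 40-nt strings built only to exercise the check;
# they are not real tRNA^Sec sequences.
padding = "A" * 29
with_motif = padding + "GACAUCA" + "AAAA"
without_motif = padding + "GACAUUA" + "AAAA"

print(has_sec_signature(with_motif), has_sec_signature(without_motif))
```

A check of this kind is only a quick filter; a sequence that passes still needs the structural evidence that a dedicated tRNA detection tool provides.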

Loss of tRNA^Sec occurred to a greater extent than duplication

A phylogenetic tree was constructed to investigate the evolution of tRNA^Sec genes by considering the nucleotide sequences of the tRNA genes of all 20 standard isotypes along with the tRNA^Sec genes. The phylogenetic tree revealed 28 major tRNA groups (Fig. 3). The tRNA^Sec genes were clustered in the middle of the phylogenetic tree, and tRNA^Sec was found to be present in at least six different clusters (Fig. 3). A few tRNA^Sec genes were grouped with tRNA^Lys (CUU), tRNA^Asn (GUU), tRNA^Arg (UCG, CCG), tRNA^Gly (UCC), and tRNA^Trp (CCA) (Fig. 3). The analysis indicates that tRNA^Sec is distributed in different clusters in the phylogenetic tree, which suggests a role for duplication events in the evolution of tRNA^Sec genes. Therefore, an analysis was conducted to investigate the deletion/duplication events related to tRNA^Sec genes. We found that tRNA^Sec deletion events occurred more frequently than duplication events. A total of 45 duplications, 119 deletions, and 9 co-divergence events were identified among the 68 tRNA^Sec genes found in 22 species (Supplementary Figure 2). The role of recombination in the evolution of tRNA^Sec was further analysed. Results indicated that tRNA^Sec genes had undergone recombination events, as did other tRNA genes (Fig. 4). Recombination and duplication of tRNA^Sec genes resulted in the sharing of sequence with other tRNA genes, which may explain why tRNA^Sec was present in different clusters within the phylogenetic tree. A recombination analysis of tRNA^Sec genes also indicated recombination events among the tRNA^Sec genes themselves (Fig. 5). A time tree analysis revealed that tRNA^Sec genes in plant species diverged at least 2466.30 million years ago (MYA) (Supplementary Figure 3), whereas the tRNA^Sec in P. somniferum diverged less than 1 MYA. The tRNA^Sec in P. somniferum was found to arise from a duplication event, and its recent divergence time indicates that this duplication occurred recently.

tRNA^Sec underwent a switch in anti-codons during evolution

tRNA genes undergo rapid changes during the course of their evolution to meet translational demand. Therefore, an attempt was made to better understand the evolution of tRNA^Sec genes in plants. It is well known that tRNA^Sec carries a UCA anti-codon, and this gene was found in different clusters in the phylogenetic tree of tRNAs. An anti-codon switch occurs more frequently with a nucleotide sequence of a tRNA gene with a different anti-codon than with a gene with a similar anti-codon [51]. Therefore, the possibility of an anti-codon switch in the tRNA^Sec gene was examined. tRNA^Sec grouped with tRNA^Lys (CUU), tRNA^Asn (GUU), tRNA^Arg (UCG, CCG), tRNA^Gly (UCC), and tRNA^Trp (CCA). The UCA anti-codon of tRNA^Sec was replaced by CUU in tRNA^Lys and by GUU in tRNA^Asn, where the 2nd and 3rd nucleotides (UU) of the two replacement anti-codons were constant. In tRNA^Arg and tRNA^Gly, the UCA anti-codon of tRNA^Sec was replaced by UCG and UCC, where the 1st and 2nd nucleotides (UC) remained constant and the 3rd nucleotide was variable. For the CCG anti-codon of tRNA^Arg and the CCA anti-codon of tRNA^Trp, the 1st nucleotide (U) of the UCA anti-codon of tRNA^Sec was replaced with a C nucleotide and the 3rd nucleotide remained variable.
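
The nucleotide changes involved in each proposed switch can be tabulated position by position. The sketch below simply compares the UCA anti-codon with each of the anti-codons it grouped with; the helper name `differing_positions` is hypothetical, and the comparison says nothing about the order or mechanism of the mutations.

```python
def differing_positions(reference, other):
    """Return the 1-based anti-codon positions at which two triplets differ."""
    return [i + 1 for i, (a, b) in enumerate(zip(reference, other)) if a != b]

sec_anticodon = "UCA"

# Anti-codons of the tRNAs that grouped with tRNA^Sec in the phylogenetic tree.
neighbours = {"CUU": "Lys", "GUU": "Asn", "UCG": "Arg",
              "CCG": "Arg", "UCC": "Gly", "CCA": "Trp"}

changes = {ac: differing_positions(sec_anticodon, ac) for ac in neighbours}
for ac, positions in changes.items():
    print(f"UCA -> {ac} ({neighbours[ac]}): positions changed {positions}")
```

The comparison shows that UCG, UCC, and CCA are each one substitution away from UCA, CCG is two substitutions away, and CUU and GUU differ from UCA at all three positions.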

Statistical analysis

The varied number and frequency of anti-codons led us to examine whether the dataset is approximately normally distributed. Therefore, we conducted a normal probability plot analysis of anti-codon numbers (Fig. 6). The normal probability plot correlation coefficient was 0.9632. The correlation coefficient and the approximately straight line indicate that a normal distribution fits the dataset well (Fig. 6). An ordinary least squares linear regression of anti-codon numbers was conducted to find the best fit for the data by minimizing the sum of squared residuals of points from the fitted line and to understand the behaviour of the dependent variable (Supplementary Figure 4). The method estimates the relationship by minimizing the sum of the squared differences between the observed and predicted values of the dependent variable, modelled as a straight line. At the 95% confidence level and with the intercept at zero, the slope was found to be 34.621 (Supplementary Figure 4). The statistical results of the ordinary least squares regression were: t = 10.728, standard error a = 3.227, and p (slope) = 6.161E-16. For the 95% bootstrap confidence interval (N = 1999): correlation r = 0.00916, r² = 8.3917E-05, t = 0.072713, p (uncorr) = 0.94226, and permutation p = 0.9404. The residual standard error of estimate was 147.
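
For a regression with the intercept fixed at zero, the least-squares slope has a closed form, b = Σxy / Σx². The sketch below shows that calculation on hypothetical values; the text does not specify which variables were placed on each axis, so the data, the variable names, and the helper functions are assumptions made purely for illustration.

```python
def slope_through_origin(x, y):
    """Least-squares slope for the model y = b * x (intercept fixed at zero)."""
    return sum(xi * yi for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)

def residual_sum_of_squares(x, y, b):
    """Sum of squared differences between observed y and predicted b * x."""
    return sum((yi - b * xi) ** 2 for xi, yi in zip(x, y))

# Hypothetical values for illustration; not the study's data.
x = [1.0, 2.0, 3.0, 4.0]
y = [30.0, 75.0, 100.0, 145.0]

b = slope_through_origin(x, y)
rss = residual_sum_of_squares(x, y, b)
print(round(b, 3), round(rss, 3))
```

Any other slope gives a larger residual sum of squares on the same data, which is the sense in which the fitted line is the best fit.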

Discussion

tRNA is an adaptor molecule that becomes charged when it binds an amino acid and subsequently donates it to an elongating peptide chain as determined by a codon-anti-codon recognition system. Each tRNA contains a characteristic anti-codon sequence which dictates the translation of an mRNA sequence into a protein. In some cases, the same codon can be decoded by different tRNA species, and the same tRNA species can decode different codons, due to wobble interactions (non-Watson-Crick base pairing) between the first position of the anti-codon and the third position of the codon [26,27,28]. In our analysis of 128 plant species, none were found to encode all 64 anti-codons, which suggests that wobble base pairing exists in all plant species. Wobble interactions arise from G:U (guanine-uracil) base pairing and from anti-codon modifications that change codon specificity [57,58,59]. Due to this redundancy, a plant genome does not need to encode all possible anti-codons and can utilize different tRNAs according to requirement. The presence of only 29 anti-codons in the genome of Klebsormidium nitens and 31 anti-codons in Bathycoccus prasinos, however, is noteworthy. K. nitens and B. prasinos are algae, and their genomes are much smaller than those of gymnosperms and angiosperms. The absence of a greater number of anti-codons in these species suggests that the rate of wobble base-pairing might be quite high in these species. Mohanta et al. (2020) reported that species of cyanobacteria possess 32 to 43 anti-codons per genome [20]. Cyanobacterial genomes are smaller than the genomes of algae and higher plants [60]. The absence of a greater number of anti-codons in species with smaller genomes may therefore be related to a higher frequency of wobble base-pairing. 
Ipomoea nil (59), Ipomoea triloba (58), Papaver somniferum (59), Cucurbita pepo (56), and Zea mays (59) possess a high number of anti-codons, and so the occurrence of wobble base pairing may be quite minimal in these species. It will be interesting to determine the factors responsible for the occurrence of high and low frequencies of wobble base-pairing. Zhang et al. reported that the presence of a high concentration of amino acids in the nutrient media led to a higher rate of mismatch incorporation of amino acids into the translating protein chain [61]. They also reported that the wobble codon position is less stringent with respect to base pair mismatch and that a base change in the 3rd position explained an additional 25% of misincorporation, either by a favourable G^mRNA/U^tRNA mismatch or by a wobble position mismatch [61]. The G/U mismatch was predominant during codon recognition and is commonly found in nucleic acid secondary structures as well [62,63,64].
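
The way wobble pairing lets one anti-codon serve several codons can be sketched with a simple lookup. This illustration uses only the classical unmodified-base rules (G at anti-codon position 1 reads C or U; U reads A or G) and ignores base modifications such as inosine, which the discussion above notes can change codon specificity; the function name `codons_read_by` is hypothetical.

```python
# Standard pairing for anti-codon positions 2 and 3.
PAIR = {"A": "U", "C": "G", "G": "C", "U": "A"}
# Relaxed (wobble) pairing allowed at anti-codon position 1.
WOBBLE = {"A": ["U"], "C": ["G"], "G": ["C", "U"], "U": ["A", "G"]}

def codons_read_by(anticodon):
    """List the codons (5'->3') an unmodified anti-codon (5'->3') could read.

    Anti-codon position 3 pairs with codon position 1, position 2 with
    position 2, and position 1 with codon position 3, where wobble applies.
    """
    first, second, third = anticodon.upper()
    return sorted(PAIR[third] + PAIR[second] + w for w in WOBBLE[first])

for ac in ("CAU", "GCC", "UCA"):
    print(ac, codons_read_by(ac))
```

Under these simplified rules the GCC anti-codon of tRNA^Gly covers both GGC and GGU, so a genome does not need a separate tRNA for each of the two codons, whereas CAU reads AUG only.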

The abundance of the CAU anti-codon of tRNA^Met was the greatest among all of the anti-codons (Supplementary File 1). Methionine is used to initiate a polypeptide chain, and as a result, almost all proteins require a methionine amino acid. This may explain why the abundance of the anti-codon for tRNA^Met was found to be the highest. Additionally, tRNA^Met (CAU) was found to have evolved earlier than other tRNAs during the course of evolution [18, 19]. If abundance is considered by isotype, tRNA^Leu, which has six isoacceptors (AAG, GAG, CAG, UAG, CAA, UAA), has the highest abundance (7.808% across the collective plant species). Similarly, tRNA^Ser and tRNA^Arg, both with six isoacceptors, have a high percentage of anti-codon abundance. This finding led us to conclude that the higher the number of isoacceptors for a tRNA isotype, the greater the level of anti-codon sharing in a genome. The study also reveals that plant genomes encode tRNA^Leu, tRNA^Ser, and tRNA^Arg more frequently than other tRNAs. A proteome-wide analysis by Mohanta et al. [19] reported a higher abundance of Leu amino acids in the proteomes of the Plant Kingdom [65]. This observation corroborates the view that the number and abundance of tRNA^Leu genes in the genome is proportional to the number of Leu amino acids in the proteome. In contrast, a few anti-codons of different tRNA isotypes, including GCG, GAG, GGG, GGC, ACU, ACC, and UCA (Sec) (group E), were found to have a low abundance (Fig. 2). Yona et al. reported that multiple copies of rare tRNAs are deleterious to a cell [51]. They also stated that the effective gene copy number of each tRNA anti-codon set can change during evolution, possibly owing to changes in the balance between demand and supply [51]. A single point mutation in an anti-codon can change one tRNA to another. The least abundant anti-codon, GCG of tRNA^Arg, may have undergone a point mutation resulting in tRNA^Arg with ACG, CCG, and UCG, which avoids the deleterious effect of the GCG anti-codon. Previous studies have also noted that rare tRNAs may be essential for co-translational folding, as low abundance could provide a pause in translation [44, 66].

When plants grow in a multitude of environmental conditions, environmental stress can induce the expression of genes needed for stress adaptation, which may affect codon usage by the transcriptome. This leads to a demand for a different pool of tRNAs to support the change in codon usage and avoid a translational imbalance [52, 67]. If the altered environmental conditions persist, the tRNAs have to undergo changes in their level of expression to meet and respond to the environmental stress-induced changes in gene expression. If the changes in supply-demand continue, it may lead to changes in the genetic pool of the tRNAs that are beneficial and favoured by selection pressures. These novel translational demands can be maintained by shifting nucleotides in the anti-codons rather than by the duplication of genes. The tRNA pool can evolve to maintain the translational requirement by adjusting the number and/or ratio of tRNA isotypes encoding the same amino acid. An anti-codon switch, however, can also dramatically change the ratios of tRNA isoacceptor within a tRNA pool. This can be done by increasing the copy number of one isoacceptor at the expense of others. The high sequence similarity of different anti-codons (anti-codon switch) can be the result of purifying selection that maintains sequence similarity. Sequence similarity, however, can result from concerted evolution that maintains sequence similarity through frequent recombination among members of the same gene family [68, 69]. The presence of a high level of recombination in tRNAs indicates that the evolution of plant tRNAs for anti-codon switch and sequence similarity may be due to concerted evolution. A single point mutation in an anti-codon can result in the encoding of a different tRNA family. It would be interesting to understand the evolutionary constraints that lead to the generation of more members while others have fewer members. 
It has been previously reported that tRNA^Leu encodes a higher number of tRNA genes in the genome, a feature that is directly related to the higher number of tRNA isoacceptors in tRNA^Leu [17,18,19,20]. The question remains if purifying selection plays a role in maintaining a low level of certain tRNAs, such as tRNA^Sec, tRNA^His, tRNA^Trp, and tRNA^Tyr. It is plausible that this purifying selection might be responsible for maintaining the anti-codons of these tRNAs at non-optimal levels. A previous study reported that increasing the copy number of a low copy tRNA gene family in a cell results in proteotoxic stress due to problems in protein folding [51]. In addressing the need for environmental adaptation, tRNA isotypes provide evolutionary plasticity to changes in translational demand due to their presence as a multi-member gene family. A few species have lost tRNA genes for particular tRNA isotypes and anti-codon switch/point mutations of anti-codons may be a factor that contributes to maintaining the function of a genome in the complete absence of a particular gene family.

Selenocysteine (a selenium-containing cysteine analog) is co-translationally inserted into a small fraction of proteins (selenoproteins), and its insertion is driven by tRNA^Sec. Although Sec is found in all three domains of life, it is not universal. Approximately 20% of prokaryotic genomes encode selenoproteins, while in eukaryotes selenoproteins are reported to be more concentrated in the metazoan lineage [70,71,72,73]. The absence of selenoproteins in fungi and land plants has also been reported previously [74] and has been attributed to a lack of a tRNA^Sec gene in their genomes. tRNA^Sec carries a UCA anti-codon that decodes UGA, a codon that otherwise functions as a stop codon. A highly sensitive and efficient method of tRNA identification is therefore needed to find tRNA^Sec. The lack of suitable identification techniques may be the main reason why tRNA^Sec genes were reported to be absent from fungal and plant genomes. Using current technology, however, we were able to identify tRNA^Sec genes in a few of the genomes of the analysed plant species.

Conclusion

The repertoire of tRNA has a significant impact on the fitness of an organism. The frequency (abundance) of anti-codons, which explains synonymous codon usage in coding genes, has however remained unexplored. Anti-codon frequency can be directly related to the frequency of synonymous codon usage, and an anti-codon table of the Plant Kingdom, along with the percent abundance of each anti-codon, can be very helpful for understanding the relationship between codon and anti-codon frequency in the genome. tRNA^Sec, which carries the 21st amino acid, selenocysteine, has undergone a duplication event along with an anti-codon switch. Understanding the mechanisms involved in the loss of tRNA genes in a few species may be crucial to deciphering the translation mechanism in these species. The anti-codons GCG (Arg), GAG (Leu), ACU (Ser), and GGG (Pro) were very low in abundance and appear to be the rarest anti-codons in the Plant Kingdom. Yona et al. [51] reported that multiple copies of rare tRNAs are deleterious to a cell, which suggests that large copy numbers of the GCG, GAG, ACU, and GGG anti-codons may be deleterious to plant cells. This may explain why very few copies of these anti-codons are encoded in plant genomes. A few species have completely lost the genes for specific tRNA isotypes from their genomes. Additionally, a previous study also reported the loss of tRNA genes in some plant genomes [75].
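The percent abundance referred to above is simply each anti-codon's share of all counted tRNA genes. A minimal sketch of that arithmetic follows, assuming the anti-codons are available as a flat list of strings; the function name `anticodon_percent` and the toy input are hypothetical and are not data from this study.

```python
from collections import Counter


def anticodon_percent(anticodons):
    """Map each anti-codon to its percent share of all tRNA genes supplied."""
    counts = Counter(anticodons)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {ac: 100.0 * n / total for ac, n in counts.items()}


# Hypothetical toy input for illustration only (not values from the paper).
toy = ["CAU", "CAU", "CAU", "GCG", "UCA", "CAU", "GAG", "CAU"]
table = anticodon_percent(toy)

# Print the anti-codons from most to least abundant.
for ac, pct in sorted(table.items(), key=lambda kv: -kv[1]):
    print(f"{ac}\t{pct:.1f}%")
```

Pooling the lists from many genomes before calling the function would give a kingdom-wide table, whereas calling it per genome would give species-level frequencies.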

Availability of data and materials

All data analysed in this study were taken from publicly available databases, and the data associated with the manuscript are provided in the supplementary file.

Abbreviations

tRNA: Transfer RNA
Ala: Alanine
Arg: Arginine
Asn: Asparagine
Asp: Aspartic acid
Cys: Cysteine
Gln: Glutamine
Glu: Glutamic acid
Gly: Glycine
His: Histidine
Ile: Isoleucine
Leu: Leucine
Lys: Lysine
Met: Methionine
Phe: Phenylalanine
Pro: Proline
Ser: Serine
Thr: Threonine
Trp: Tryptophan
Tyr: Tyrosine
Val: Valine
Sec: Selenocysteine
Pyl: Pyrrolysine
NCBI: National Center for Biotechnology Information
UPGMA: Unweighted pair group method with arithmetic mean
r: Correlation coefficient

References

Kleijn M, Scheper GC, Voorma HO, Thomas AAM. Regulation of translation initiation factors by signal transduction. Eur J Biochem. 1998;253:531–44.
Meinnel T, Mechulam Y, Blanquet S. Methionine as translation start signal: a review of the enzymes of the pathway in Escherichia coli. Biochimie. 1993;75:1061–75.
Kozak M. Initiation of translation in prokaryotes and eukaryotes. Gene. 1999;234:187–208.
Schaffer R, Landgraf J, Pérez-Amador M, Wisman E. Monitoring genome-wide expression in plants. Curr Opin Biotechnol. 2000;11:162–7.
Lonsdale DM. A review of the structure and organization of the mitochondrial genome of higher plants. Plant Mol Biol. 1984;3:201–6.
Noller HF. Ribosomal RNA and translation. Annu Rev Biochem. 1991;60:191–227.
Zamecnik P. From protein synthesis to genetic insertion. Annu Rev Biochem. 2005;74:1–28.
Gualerzi CO, Pon CL. Initiation of mRNA translation in prokaryotes. Biochemistry. 1990;29:5881–9.
Nakamoto T. Mechanisms of the initiation of protein synthesis: in reading frame binding of ribosomes to mRNA. Mol Biol Rep. 2011;38:847–55.
Tuller T, Carmi A, Vestsigian K, Navon S, Dorfan Y, Zaborske J, et al. An evolutionarily conserved mechanism for controlling the efficiency of protein translation. Cell. 2010;141:344–54.
Chevance FFV, Hughes KT. Case for the genetic code as a triplet of triplets. Proc Natl Acad Sci U S A. 2017;114:4745–50.
Clancy S, Brown W. Translation: DNA to mRNA to protein. Nat Educ. 2008;1:101.
Sharp SJ, Schaack J, Cooley L, Burke DJ, Söll D. Structure and transcription of eukaryotic tRNA gene. Crit Rev Biochem. 1985;19:107–44.
Crick FHC. The origin of the genetic code. J Mol Biol. 1968;38:367–79.
Green R, Noller HF. Ribosomes and translation. Annu Rev Biochem. 1997;66:679–716.
Baggett NE, Zhang Y, Gross CA. Global analysis of translation termination in E. coli. PLoS Genet. 2017;13:e1006676.
Mohanta TK, Bae H. Analyses of genomic tRNA reveal presence of novel tRNAs in Oryza sativa. Front Genet. 2017;8:90.
Mohanta T, Syed A, Ameen F, Bae H. Novel genomic and evolutionary perspective of Cyanobacterial tRNAs. Front Genet. 2017;8:200.
Mohanta TK, Khan AL, Hashem A, Allah EFA, Yadav D, Al-Harrasi A. Genomic and evolutionary aspects of chloroplast tRNA in monocot plants. BMC Plant Biol. 2019;19:39.
Mohanta TK, Yadav D, Khan A, Hashem A, Abd_Allah EF, Al-Harrasi A. Analysis of genomic tRNA revealed presence of novel genomic features in cyanobacterial tRNA. Saudi J Biol Sci. 2019;27:124–33.
Ambrogelly A, Palioura S, Söll D. Natural expansion of the genetic code. Nat Chem Biol. 2007;3:29–35.
Lobanov AV, Turanov AA, Hatfield DL, Gladyshev VN. Dual functions of codons in the genetic code. Crit Rev Biochem Mol Biol. 2010;45:257–65.
Polycarpo C, Ambrogelly A, Bérubé A, Winbush SM, McCloskey JA, Crain PF, et al. An aminoacyl-tRNA synthetase that specifically activates pyrrolysine. Proc Natl Acad Sci U S A. 2004;101:12450–4.
Mahapatra A, Srinivasan G, Richter KB, Meyer A, Lienard T, Zhang JK, et al. Class I and class II lysyl-tRNA synthetase mutants and the genetic encoding of pyrrolysine in Methanosarcina spp. Mol Microbiol. 2007;64:1306–18.
Yuan J, O’Donoghue P, Ambrogelly A, Gundllapalli S, Sherrer RL, Palioura S, et al. Distinct genetic code expansion strategies for selenocysteine and pyrrolysine are reflected in different aminoacyl-tRNA formation systems. FEBS Lett. 2010;584:342–9.
Crick F. Codon-anticodon pairing. J Mol Biol. 1966;19:548–55.
Agris PF, Vendeix FAP, Graham WD. tRNA’s wobble decoding of the genome: 40 years of modification. J Mol Biol. 2007;366:1–13.
Näsvall SJ, Chen P, Björk GR. The wobble hypothesis revisited: Uridine-5-oxyacetic acid is critical for reading of G-ending codons. RNA. 2007;13:2151–64.
Guo Y, Xiong L, Ishitani M, Zhu J-K. An Arabidopsis mutation in translation elongation factor 2 causes superinduction of transcription factor genes but blocks the induction of their downstream targets under low temperatures. Proc Natl Acad Sci. 2002;99:7786–91.
Schwartz DC, Parker R. Mutations in translation initiation factors lead to increased rates of deadenylation and decapping of mRNAs in Saccharomyces cerevisiae. Mol Cell Biol. 1999;19:5247–56.
Morton BR. The role of context-dependent mutations in generating compositional and codon usage Bias in grass chloroplast DNA. J Mol Evol. 2003;56:616–29.
Bulmer M. The selection-mutation-drift theory of synonymous codon usage. Genetics. 1991;129:897–907.
Yang Z, Nielsen R. Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage. Mol Biol Evol. 2008;25:568–79.
Sharp PM, Li W-H. An evolutionary perspective on synonymous codon usage in unicellular organisms. J Mol Evol. 1986;24:28–38.
Song KY, Choi HS, Hwang CK, Kim CS, Law P-Y, Wei L-N, et al. Differential use of an in-frame translation initiation codon regulates human mu opioid receptor (OPRM1). Cell Mol Life Sci. 2009;66:2933–42.
Saier MH. Differential codon usage: a safeguard against inappropriate expression of specialized genes? FEBS Lett. 1995;362:1–4.
Sharp PM, Tuohy TMF, Mosurski KR. Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Res. 1986;14:5125–43.
Sharp P, Li W-H. Codon usage in regulatory genes in Escherichia coli does not reflect selection for ‘rare’ codons. Nucleic Acids Res. 1986;14:7737–49.
Sharp PM, Devine KM. Codon usage and gene expression level in Dictyostelium discoideum: highly expressed genes do ‘prefer’ optimal codons. Nucleic Acids Res. 1989;17:5029–40.
Musto H, Cruveiller S, D’Onofrio G, Romero H, Bernardi G. Translational selection on codon usage in Xenopus laevis. Mol Biol Evol. 2001;18:1703–7.
Naya H, Romero H, Carels N, Zavala A, Musto H. Translational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii. FEBS Lett. 2001;501:127–30.
Romero H, Zavala A, Musto H, Bernardi G. The influence of translational selection on codon usage in fishes from the family Cyprinidae. Gene. 2003;317:141–7.
Bulmer M. Coevolution of codon usage and transfer RNA abundance. Nature. 1987;325:728–30.
Drummond DA, Wilke CO. Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution. Cell. 2008;134:341–52.
DiMichele WA. Wetland-Dryland Vegetational dynamics in the Pennsylvanian ice age tropics. Int J Plant Sci. 2014;175:123–64.
Mohanta TK, Occhipinti A, Atsbaha Zebelo S, Foti M, Fliegmann J, Bossi S, et al. Ginkgo biloba responds to herbivory by activating early signaling and direct defenses. PLoS One. 2012;7:e32822.
Mohanta T. Advances in Ginkgo biloba research: genomics and metabolomics perspectives. Afr J Biotechnol. 2012;11:15936–44.
Wu D-D, Irwin DM, Zhang Y-P. De Novo Origin of Human Protein-Coding Genes. PLoS Genet. 2011;7:e1002379.
Graur D. Amino acid composition and the evolutionary rates of protein-coding genes. J Mol Evol. 1985;22:53–62.
Larracuente AM, Sackton TB, Greenberg AJ, Wong A, Singh ND, Sturgill D, et al. Evolution of protein-coding genes in Drosophila. Trends Genet. 2008;24:114–23.
Yona AH, Bloom-Ackermann Z, Frumkin I, Hanson-Smith V, Charpak-Amikam Y, Feng Q, et al. tRNA genes rapidly change in evolution to meet novel translational demands. eLife. 2013;2013:1–17.
Frumkin I, Lajoie MJ, Gregg CJ, Hornung G, Church GM, Pilpel Y. Codon usage of highly expressed genes affects proteome-wide translation efficiency. Proc Natl Acad Sci. 2018;115:E4940–9.
Schultz DW, Yarus M. tRNA structure and Ribosomal function: II. Interaction between anticodon Helix and other tRNA mutations. J Mol Biol. 1994;235:1395–405.
Bloom-Ackermann Z, Navon S, Gingold H, Towers R, Pilpel Y, Dahan O. A Comprehensive tRNA Deletion Library Unravels the Genetic Architecture of the tRNA Pool. PLoS Genet. 2014;10:e1004084.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4.
Varani G, McClain WH. The G·U wobble base pair. EMBO Rep. 2000;1:18–23.
Limmer S, Reif B, Ott G, Arnold L, Sprinzl M. NMR evidence for helix geometry modifications by a G-U wobble base pair in the acceptor arm of E. coli tRNA^Ala. FEBS Lett. 1996;385:15–20.
Mueller U, Schübel H, Sprinzl M, Heinemann U. Crystal structure of acceptor stem of tRNA(Ala) from Escherichia coli shows unique G·U wobble base pair at 1.16 Å resolution. RNA. 1999;5:670–7.
Mohanta TK, Pudake RN, Bae H. Genome-wide identification of major protein families of cyanobacteria and genomic insight into the circadian rhythm. Eur J Phycol. 2017;52.
Zhang Z, Shah B, Bondarenko PV. G/U and certain wobble position mismatches as possible Main causes of amino acid Misincorporations. Biochemistry. 2013;52:8165–76.
Müller UR, Fitch WM. The biological significance of G-T/G-U mispairing in nucleic acid secondary structure. J Theor Biol. 1985;117:119–26.
Sugimoto N, Kierzek R, Freier SM, Turner DH. Energetics of internal GU mismatches in ribooligonucleotide helixes. Biochemistry. 1986;25:5755–9.
Limmer S. Mismatch base pairs in RNA. Prog Nucleic Acid Res Mol Biol. 1997;57:1–39.
Mohanta TK, Khan AL, Hashem A, Abd_Allah EF, Al-Harrasi A. The Molecular Mass and Isoelectric Point of Plant Proteomes. BMC Genomics. 2019;20:631.
Thanaraj TA, Argos P. Ribosome-mediated translational pause and protein domain organization. Protein Sci. 1996;5:1594–612.
Rocha EPC. Codon usage bias from tRNA’s point of view: redundancy, specialization, and efficient decoding for translation optimization. Genome Res. 2004;14:2279–86.
Munz P, Amstutz H, Kohli J, Leupold U. Recombination between dispersed serine tRNA genes in Schizosaccharomyces pombe. Nature. 1982;300:225–31.
Amstutz H, Munz P, Heyer W-D, Leupold U, Kohli J. Concerted evolution of tRNA genes: Intergenic conversion among three unlinked serine tRNA genes in S. pombe. Cell. 1985;40:879–86.
Zhang Y, Romero H, Salinas G, Gladyshev VN. Dynamic evolution of selenocysteine utilization in bacteria: a balance between selenoprotein loss and evolution of selenocysteine from redox active cysteine residues. Genome Biol. 2006;7:R94.
Zhang Y, Turanov AA, Hatfield DL, Gladyshev VN. In silico identification of genes involved in selenium metabolism: evidence for a third selenium utilization trait. BMC Genomics. 2008;9:251.
Mariotti M, Guigó R. Evolution of selenophosphate synthetases: emergence and relocation of function through independent duplications and recurrent subfunctionalization. Genome Res. 2015;25:1256–67.
Jiang L, Ni J, Liu Q. Evolution of selenoproteins in the metazoan. BMC Genomics. 2012;13:446.
Lobanov AV, Hatfield DL, Gladyshev VN. Eukaryotic selenoproteins and selenoproteomes. Biochim Biophys Acta. 2009;1790:1424–8.
Wald N, Margalit H. Auxiliary tRNAs: large-scale analysis of tRNA genes reveals patterns of tRNA repertoire dynamics. Nucleic Acids Res. 2014;42:6552–66.
Tamura K, Filipski A, Peterson D, Stecher G, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.

Acknowledgements

The authors would like to extend their sincere thanks to the Natural and Medical Sciences Research Center, University of Nizwa, for providing the facilities to conduct the research. The authors would also like to extend their sincere appreciation to the Deanship of Scientific Research at King Saud University for funding this research group No. (RGP-271).

Funding

Not applicable.

Author information

Tapan Kumar Mohanta and Awdhesh Kumar Mishra contributed equally to this work.

Authors and Affiliations

Natural and Medical Sciences Research Center, University of Nizwa, 616, Nizwa, Oman
Tapan Kumar Mohanta, Abdul Latif Khan & Ahmed Al-Harrasi
Department of Biotechnology, Yeungnam University, 38541, Gyeongsan, South Korea
Awdhesh Kumar Mishra
Botany and Microbiology Department, College of Science, King Saud University, P.O. Box. 2460, Riyadh, 11451, Saudi Arabia
Abeer Hashem
Mycology and Plant Disease Survey Department, Plant Pathology Research Institute, ARC, Giza, 12511, Egypt
Abeer Hashem
Plant Production Department, College of Food and Agricultural Sciences, King Saud University, P.O. Box. 2460, Riyadh, 11451, Saudi Arabia
Elsayed Fathi Abd_Allah

Authors

Tapan Kumar Mohanta
Awdhesh Kumar Mishra
Abeer Hashem
Elsayed Fathi Abd_Allah
Abdul Latif Khan
Ahmed Al-Harrasi

Contributions

TKM: conceived the idea, collected and annotated the genome sequences, analysed and interpreted the data, and drafted the manuscript; AKM: analysed the data; AH and EFA: drafted and revised the manuscript; ALK: revised the manuscript; AA: revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Tapan Kumar Mohanta or Ahmed Al-Harrasi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors agree and consent to publication.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Supplementary File 1. Percentage frequency of anti-codons of the plant kingdom.

Additional file 2:

Supplementary Figure 1. Multiple sequence alignment of plant tRNA^Sec genes. The alignment revealed the presence of conserved nucleotide sequences in the anti-codon loop and pseudo-uridine loop region (marked red). The multiple sequence alignment was conducted using Multalin software (http://multalin.toulouse.inra.fr/multalin/).

Additional file 3:

Supplementary Figure 2. Deletion, duplication, and codivergence events in tRNA^Sec in 128 analysed plant species. The gene tree of tRNA^Sec was reconciled with the species tree to identify deletion, duplication, and codivergence events in tRNA^Sec genes. Results of the analysis indicated that deletion events in tRNA^Sec were predominant over duplication and co-divergence events. Analysis was conducted using Notung software version 2.9.

Additional file 4:

Supplementary Figure 3. Evolutionary time tree of tRNA^Sec genes. The analysis revealed that tRNA genes in the Plant Kingdom arose at least 2466.30 million years ago. The reference time period was considered based on the evolutionary time scale of the species Chloropicon primus and Ectocarpus siliculosus as per the time tree database (http://www.timetree.org/). The time tree shown was generated using the RelTime method. Divergence times for all of the branching points in the topology were calculated using the Maximum Likelihood method based on the Kimura 2-parameter model. Bars around each node represent 95% confidence intervals which were computed using the method described in Tamura et al. (2013) [76]. The estimated log likelihood value of the topology shown is −1964.5432. A discrete Gamma distribution was used to model evolutionary rate differences among the sites [5 categories (+G, parameter = 2.8271)]. The tree is drawn to scale, with branch lengths representing the relative number of substitutions per site. The analysis utilized 68 nucleotide sequences. All positions with less than 95% site coverage were eliminated. Fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. Evolutionary analyses were conducted in MEGA7 [56].
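For readers unfamiliar with the Kimura 2-parameter model named in this caption, the sketch below computes its closed-form pairwise distance, d = −(1/2) ln[(1 − 2P − Q) √(1 − 2Q)], where P and Q are the proportions of transitional and transversional differences. This is only an illustration of the substitution model, not the Maximum Likelihood and RelTime procedure run in MEGA7; the function name `k2p_distance` and the handling of gaps are assumptions.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura 2-parameter distance between two aligned nucleotide sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    s1 = seq1.upper().replace("U", "T")
    s2 = seq2.upper().replace("U", "T")
    transitions = transversions = sites = 0
    for a, b in zip(s1, s2):
        # Assumption: skip gaps and ambiguous bases (pairwise deletion).
        if a not in "ACGT" or b not in "ACGT":
            continue
        sites += 1
        if a == b:
            continue
        # Purine<->purine or pyrimidine<->pyrimidine changes are transitions.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    # math.log/math.sqrt raise ValueError if the sequences are too divergent
    # for the correction to be defined.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# One transition (C->T) in ten sites.
print(round(k2p_distance("ACGTACGTAC", "ACGTACGTAT"), 4))
```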

Additional file 5:

Supplementary Figure 4. Ordinary least squares regression between anti-codons and their numbers in the plant kingdom. The ordinary least squares regression parameters (slope and intercept) and the statistical significance of each regression are indicated. The solid red line represents the linear least squares fit and the blue lines represent the 95% confidence interval.
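The slope, intercept, and correlation coefficient (r) mentioned in this caption follow from the standard least squares formulas. A minimal sketch is given below; the function name `ols_fit` and the toy values are hypothetical, and the software actually used for the published regressions is not assumed here.

```python
import math


def ols_fit(x, y):
    """Return (slope, intercept, r) for a simple least squares line y = a*x + b."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0:
        raise ValueError("x has zero variance; slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Pearson correlation coefficient; undefined when y is constant.
    r = sxy / math.sqrt(sxx * syy) if syy > 0 else float("nan")
    return slope, intercept, r


# Hypothetical toy values lying exactly on y = 2x + 1.
slope, intercept, r = ols_fit([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept, r)
```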

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Mohanta, T.K., Mishra, A.K., Hashem, A. et al. Construction of anti-codon table of the plant kingdom and evolution of tRNA selenocysteine (tRNA^Sec). BMC Genomics 21, 804 (2020). https://doi.org/10.1186/s12864-020-07216-3

Received: 15 July 2020
Accepted: 08 November 2020
Published: 19 November 2020
DOI: https://doi.org/10.1186/s12864-020-07216-3
