Complete chloroplast genome sequence of Barleria prionitis, comparative chloroplast genomics and phylogenetic relationships among Acanthoideae
BMC Genomics volume 21, Article number: 393 (2020)
The plastome of medicinal and endangered species in Kingdom of Saudi Arabia, Barleria prionitis was sequenced. The plastome was compared with that of seven Acanthoideae species in order to describe the plastome, spot the microsatellite, assess the dissimilarities within the sampled plastomes and to infer their phylogenetic relationships.
The plastome of B. prionitis was 152,217 bp in length with Guanine-Cytosine and Adenine-Thymine content of 38.3 and 61.7% respectively. It is circular and quadripartite in structure and constitute of a large single copy (LSC, 83, 772 bp), small single copy (SSC, 17, 803 bp) and a pair of inverted repeat (IRa and IRb 25, 321 bp each). 131 genes were identified in the plastome out of which 113 are unique and 18 were repeated in IR region. The genome consists of 4 rRNA, 30 tRNA and 80 protein-coding genes. The analysis of long repeat showed all types of repeats were present in the plastome and palindromic has the highest frequency. A total number of 98 SSR were also identified of which mostly were mononucleotide Adenine-Thymine and are located at the non coding regions. Comparative genomic analysis among the plastomes revealed that the pair of the inverted repeat is more conserved than the single copy region. In addition high variation is observed in the intergenic spacer region than the coding region. The genes, ycf1and ndhF and are located at the border junction of the small single copy region and IRb region of all the plastome. The analysis of sequence divergence in the protein coding genes indicates that the following genes undergo positive selection (atpF, petD, psbZ, rpl20, petB, rpl16, rps16, rpoC, rps7, rpl32 and ycf3). Phylogenetic analysis indicated sister relationship between Ruellieae and Justcieae. In addition, Barleria, Justicia and Ruellia are paraphyletic, suggesting that Justiceae, Ruellieae, Andrographideae and Barlerieae should be treated as tribes.
This study sequenced and assembled the first plastome of the taxon Barleria and reported the basics resources for evolutionary studies of B. prionitis and tools for phylogenetic relationship studies within the core Acanthaceae.
The Acanthaceae Juss. Ex Bercht.& J. Presl is among the largest family in the order Lamiales with ca. 3800 recognized species accommodated in ca. 200 genera , the members of the family are mainly diversified in the sub tropics and tropics, with few species in the temperate zones . The family is close to Bignoniaceae family in the Lamiales order . The main centres of distribution of the species in the family are Africa, Central America and Asian continent particularly Malaysia, Indonesia and Brazil . They are characterized by having decussate phyllotaxis, while some species have congest whorled phyllotaxis, the leaves are usually simple with toothed margin, opposite, existipulate and contained calcium oxalate crystals or hypodermal calcium carbonate cystolith [5, 6].
In an effort to resolve taxonomic issues of the family and its species, researchers for the past decades works extensively in delimiting the family [7,8,9,10], identifying major clades in the family [11,12,13,14]. Scotland and his colleagues carried out infrafamilial studies using floral parts [15,16,17], their findings gives more insight on the infra familial classification of the family and gives morphological synapomorphies of the major lineages. Recently, phylogenetic approach was used to reveal the relationships between the lineages [18,19,20]. Despite these researches, the classifications of the species within the Acanthoideae are still not clear.
The chloroplast organelle is one the most distinguishing featured that differentiates plant cell and other type of cells; therefore it is the most noticeable feature in plants. The organelle which is semi-autonomous is believed to have evolved decade of millions years ago from cynobacterium [21, 22]. The plastome of flowering plant is conserved than the other genomes (i.e mitochondrial and nuclear genomes), in addition the genome is small compared with the others and it is used frequently in phylogeny studies due to its low rate of nucleotide substitution . The chloroplast genome is typically quadripartite in structure, containing large single copy (LSC) and small single copy (SSC) separated by pair of inverted repeat (IR) . The genome organization, its content and gene structure are highly conserved . Due to its conserved nature, the cp genome contents are widely used by researchers as a tool to investigate phylogenetic relationship and in genomic studies . Single nucleotide polymorphisms as well as insertion/deletions which are among the evolutionary hotspot of the organelle are believed to be use as a tool to solve taxonomic issues among taxa that their phylogenetic relationships are unresolved. Phylogenetic relationship generated from single or combination of few genes are being replaced by the ones constructed from the whole genome as a result of new DNA sequencing methods such as next generation sequencing (NGS). The introduction of next generation sequencing has increased the availability of data for solving phylogenetic relationship issues. However, in spite of its importance, the approach is not fully and well utilize by researchers in plant systematic studies [27,28,29]. One of the most important benefits of next generation sequencing technique is that it generate very high amount of sequences compared with sanger sequencing technique. Additionally, the platform used in next generation sequencing like Illumina is very cheap process . This approach has been used to generate huge number of data for inferring phylogenetic relationship in different taxonomic levels inference [31,32,33,34].
With the advent of next generation sequencing, importance of plastome sequence in resolving phylogenetic relationships and the great number of genera in Acanthaceae, only plastome of few genera have been sequenced and no phylogenomic studies have been conducted for the family.
In this research, we sequenced and characterized the plastome of Barleria prionitis and compared the genome with cp genomes from Acanthoideae species. We used data from the whole chloroplast genome of 8 genera belonging to the Acanthoideae to reveal their tribal positions. This is as a result of incongruent of previous studies in placing the genera in their respective tribes . placed Barlerieae and Andrographideae as sub tribes under the tribe Justicieae, this classification has been reported by other student of Acanthaceae . classify the sub family Acanthoideae into two tribes, placing Ruelliinae, Justiciinae, Andrograpiinae and Barleriinae under the tribe Ruellieae. Findings of recent studies by McDade and her colleagues using molecular data contradict with previous classifications. Therefore, there is need to use complete chloroplast genome to address the correct placement of the genera into their respective tribes. The result of this study will be useful for developing makers, provide resources for evolutionary studies and authentication of B. prionitis and the inference of phylogenetic relationships within Acanthoideae.
Characteristics of B. prionitis chloroplast genome
The complete plastome sequence of B. prionitis was reported to be 152,217 bp in size and has a structural organization of quadripartite containing a large single copy (LSC, 83, 772 bp), a pair of inverted repeat (IRa and IRb 25, 321 bp each) and small single copy (SSC, 17, 803 bp) (Fig. 1 and Table 1). Composition of Adenine-Thymine and Guanine-Cytosine content in B. prionitis was 61.7 and 38.3%, respective whereas the IRA, IRB, SSC and LSC regions have, 67.4 and 32.6%, 56.5 and 43.5%, 56.4 and 43.6%, and 63.6% and 36.4, respectively (Table 1). The inverted repeat region have higher GC content of 49% compared with the SSC and LSC regions with 32.6 and 36.4% respectively (Table 1). In terms of the size of the coding and non coding region, the protein coding regions is 79, 950 pb in length whereas the non coding which includes the intergenic spacer and introns have 72, 267 bp.
The complete chloroplast genome of B. prionitis contained 113 different genes out of which 18 are duplicated in the IRA and IRB region, totaling 131 genes. The number of rRNA genes, tRNA genes and protein-coding genes in the genome are 4, 30 and 80, respectively (Fig. 1 and Table 2). Four rRNA, seven protein coding and tRNA genes are located in the pair of the inverted repeat region of the plastome whereas the large single copy region harbored 62 protein-coding sequence and 22 tRNA genes, the remaining one tRNA and 12 protein coding genes are located in the single copy region. Among the genes coding for protein, many of them started with the codon ATG while few starts with other codon such as ACG and GTG, this is also reported in other chloroplast genome of angiosperms.
The chloroplast genome of B. prionitis is found to have intron in some of the genes, like in other species in the Lamiales order [36, 37]. Out of the 113 different genes, 14 of them contain intron (Table 3), six tRNAs and eight protein-coding genes. Four of the genes with intron viz.: ndhB, trnA-UGC, trnI-GAU and rpl2 are situated in the inverted repeat region and the other 12 in the large single copy region. clpP and ycf3 are the only genes with two intron, while the other 12 genes have one intron, this is consistent with that of S. cusia . trnK-UUU is the gene with longest intron with 2460 bp because of the situation of matK in the gene.
The frequency of the codon usage present in the plastome of B. prionitis was computed using the nucleotide sequence of protein-coding genes and tRNA genes 100,319 bp, the result is presented in Table 4, the results showed the genes in the plastome are encoded by 33, 436 codons. The codons that codes for the amino acids Leucine appears more frequently in the genome 3286 (9.83%) (Fig. 2), comparable to that of Ailanthus altisssima and the ones coding for Trp have the lowest 622 (1.86%) in the plastid sequence. Guanine-Cytosine ending are more common than the Adenine-Thymine ending, this is incongruent with other cp genome sequence [38,39,40]. The result of the analysis show that there is low codon usage bias in the plastome sequence of B.prionitis (Table 4). 29 codons have RSCU values greater than 1 and all of them are characterized with Adenine-Thymine ending while for 30 codons, were less than 1 and are all of Guanine-Cytosine ending. The amino acids Tryptophan and Methionine have RSCU value of 1 hence they don’t have codon bias.
The prediction of RNA editing sites present in the plastome sequence of B. priniotis was done by means of PREP suite. The first codon of the first nucleotide was used in all the analysis. The results as shown in (Table 5) showed that most of the conversions in the codon positions are from Serine to Leucine. Generally, the editing sites observed in the plastome were 61 which are distributed between the 19 protein-coding genes. psaB is found to have the highest number of editing site (13 sites) followed by ndhB (9 sites), rpoB (6 site) and rpl20, accD, rps, atpI, rpl2, rpoA have the lowest number of editing site with 1 editing site each. Nine (9) RNA editing site in ndhB has been confirmed in the plastome of other species [41,42,43]. Conversions of proline to serine were observed, which involves the changing of the amino acids in the RNA editing site from apolar to polar group. Genes such as petD, ndhC, atpB, clpP, ndhE, petL, ndhG, petG and ccsA among others do not possess RNA editing site in their first codon of the nucleotide.
Repeat sequence in the chloroplast genome of B. prionitis were screen using REPuter programme with default settings, the programme revealed that only three types of repeats were present in the genome viz. Palindromic, forward and reverse, the complement repeat is not detected within the plastome (Table 6). The result revealed 18 palindromic repeats, 25 forward repeats and 6 reverse repeats (Table 6). Most of the repeats size are between 20 and 29 bp (78.6%), followed by 10–19 bp (10.20%) whereas 40-49 bp are the least (4.08%). In all, there are 49 number repeats in B. priniotis plastome. In the first location, 65.30% of the repeats are contained in the non coding region; this is comparable to the cp genome of Fagopyrum dibotrys . Eight repeats were located in the tRNA (16.32%), the other 9 repeats (18.36%) are situated in the protein coding genes in particular rpl2, ndhA, ycf1, ndhC, and ycf2. Among the protein coding genes ycf2 contained 2 forward palindromic and repeats.
The rate of repeats among eight Acanthoideae plastomes was compared, the results indicates that complement, palindromic, reverse and forward type of repeats occurred in the plastome of J. flava, A. paniculata, S. cusia, B. ciliaris and R. breedlovei, whereas no complement repeats detected in the cp genomes of B. prionitis, E. attenuatus and A. knappiae (Fig. 3). S. cusia, B. ciliaris and A. paniculata are found to have high frequency of palindromic repeats (23) and J. flava is found to have the least (16). R. breedlovei, S. cusia and A. paniculata have15 forward repeats in their plastome and the frequency of reverse repeats is identical in the plastome of A. paniculata, S. cusia and J. flava. Complement repeat is absent in B. prionitis, E. attenuates, A. knappiae and is the least repeat in the plastome of J. flava, A. paniculata, B. ciliaris, R. breedlovei and S. cusia.
Microsatellites (SSRs) are short repeat of nucleotide sequences (1-6 bp) that are distributed throughout genome. This short repeats are used as important makers for evolutionary studies of plants . In this research, a total number of 98 microsatellites were identified in the chloroplast genome of B. priniotis (Table 7). Most of the microsatellites in the plastome are mononucleotide (83.67%) and majority of them are polythymine 58.53% followed by poly A (polyadenine) 40.24%, only one Poly G (polyguanine (1.21%) is present where as no poly C detected in the genome. Among dinucleotide only 5 repeats were detected, TA repeated four times and AT only once. Considering sequence complimentary, two trinucleotide AAG/CTT and AAT/ATT, four tetra AAAC/GTTT, AAAG/CTTT, AAAT/ATTT, AATC/ATTG and only one penta AAATGG/ATTTCC were detected in the genome (Fig. 4a) whereas no hexanucleotide repeat detected. The majority of the microsatellites are found in the intergenic spacer region (Fig. 4b) (62.24%) and the coding region contained the least (33.67%). The majority of the repeats were located in the large single copy region (70.40%) and the single copy region contained the lowest frequency of repeat (9.18%) in the plastome.
The rate of occurrence of SSRs among the plastomes of the eight members of Acanthoideae was compared (Fig. 4c); the comparison indicate high frequency of mononucleotides across all the plastomes. E. attenuatus and A. paniculata had the highest number of mononucleotide with 107 and 104 respectively. Pentanucleotides were not found in the plastome of B. prionitis, E. attenuatus, A. knappiae, B. ciliaris and R. breedlovei while hexanucleotide were only present in B. prionitis, R. breedlovei and A. knappiae.
Comparative analysis of Justicia flava chloroplast to other Acanthaceae genomes
The plastome sequences of eight Acanthaceae species namely (B. prionitis, J. flava, B. ciliaris, A. paniculata, E. attenuatus, R. breedlovei, A. knappiae and S. cusia were compared. To check the level of nucleotide sequence variation between the sampled plastomes of Acanthoideae species, the programe mVISTA was used to aligned the sequences with the annotation of B. prionitis as reference. Result of the alignment indicates that the plastomes are extremely conserved, however some level of variations were detected. The pair of the iverted repeat is highly conserved than the small single copy region and large single copy region. Additionally, the protein-coding genes are highly conserved than the non coding region, mostly the integernenic spacer regions. The intergenic spacer regions with high level of variation within the gemone are trnL – trnA, trnH-GUG – psbA, trnC – petN, trnL – trnF, accD – psaI, rps12- trnV, rps15 – ycf1, rps16 – trnQ (Fig. 5). The protein coding genes that showed sequence divergence are ycf2, psbL, atpE, rbcL, petB, petA, and atpF.
The plastome sequence of flowering plant is reported to have generally been conserved , although there is a little variations in size and boundries of the single copy and inverted repeats as a results of the evolutionary happenings such as contraction and expansion in the plastome architecture [47, 48]. The comparison between the invterted repats and single copy regions boundries in the eight plastome of Acanthaceae (B. prionitis, B. ciliaris, A. paniculata, E. attenuatus, R. breedlovei, J. flava, S.cusia and A. knappiae are presented in (Fig. 6). There is a little variation in the boundaries of the IR-SSC and IR-LSC of the plastomes (Fig. 6),the rps19 is located in LSC region of B. prionitis, B. ciliaris, A. paniculata and A. knappiae. The following genes trnH, rps19, ycf1 and ndhF are located at the junction of IR-SSC and IR-LSC of J. flava and E. attenuatus plastomes slightly variation in number of nucleotides (Fig. 6). In the SSC/IRb border of the eight plastomes, ycf1 and ndhF genes are found. Positioning of ycf2 gene in the IRb/LCS border is observed only in the genome of R. breedlovei where as E. attenuatus plastome also has distinctive structural variation of having rpl22 in junction of IRb/LSC. The gene ndhF was found to have 36 bp, 109 bp, 40 bp and 41 in the IRb region in B. prionitis, E. attenuatus, A. paniculata and A. knappiae respectively where as trnH of E. attenuatus and J. flava is positioned at IRa/LSC border .
Divergence of protein coding gene sequence
The dN/dS ratio and rates of nonsynonymous (dN) substitution and synonymous (dS) were calculated using DNAsp among the plastome of eight species of Acanthoideae to detect the protein-coding genes that were under selective pressure. The results revealed that the dN/dS ratio is < 1 in most of the genes with the exception of atpF, petD, psbZ and rpl20 of B. prionitis vs E. attenuatus, petB, petD, rpl16, rpoC and rps16 of B. prionitis vs A. paniculata, petD, psbZ, rpl16, rps7of B. prionitis vs A. knappieae, psbZ of B. ciliaris, rpl32and ycf3 of B. prionitis vs J. flava having 1.16, 2.08, 2.76 and 1.72, 2.74, 2.30, 2.71, 1.65, 1.30 and 1.61, 2.70, 2.41, 2.76 and 1.61, 1.19, 1.45 and 1.32 respectively (Fig. 7). This shows that the majority of the genes undergo negative selection only few of them were under positive selection. The values of synonymous (dS) rate ranged from 0.01 to 0.38 in all the genes (Fig. 7). Some of the genes including psaJ, atpH, ndhC, psaI, psbE, rpl2, psbH, psbI, psbL and psbF showed no nonsynonymous changes.
To determine the phylogenetic relationship and tribal positions of the nine species of Acanthaceae, we used the plastome of the eight species to reconstruct phylogenetic tree. The phylogenetic analyses were performed using Maximum likelihood and Bayesian inference (BI) with Erythranthe lutea, Scrophularia dentate, Lysionatus pauciflorus and Tanaecium tetragonolobum as outgroup. The resulting tree from Bayesian inference (BI) and Maximum likelihood analyses were congruent with high support PP, 1.0 and MP, 100 in all relationships (Fig. 8). All the nine species clustered in one clade with strong support and are divided into two major sub clades. Sub clade 1 which is monophyletic includes A. knappiae and B. ciliaris (Acantheae) is sister to large clade 2 containing Ruellieae, Barlerieae, Justicieae, Andrographideae. Within the second clade Justicieae and Ruellieae are sister taxa as well as Barlerieae and Andrographideae.
In this study, we sequenced the plastome sequence of B. prionitis using Illumina sequencing technology. This is a new approach of obtaining cp genome without prior isolation of the cpDNA and it has been used in several studies. The analysis of the cp genome revealed that the genome has a quadripartite structure; with a pair of inverted repeats regions (IRa and IRb) separated by small single copy region (SSC) and large single copy region (LSC). The organization and structure of the B. prionitis cp genome is similar to other sequenced Acanthaceae cp genomes [49, 50]. Notably, there is high variation in terms of genome size and organization between B.prionitis and S. cusia, this is as a result of IR contraction. The size of the genome 152,217 bp is comparable to other sequenced cp genome of Acanthaceae species, longer than A. paniculata , R. breedlovei  and S. cusia  shorter than E. attenuatus . The size of the genome in all the studied species is relevant to variation in the LSC region. The cp genome of B. prionitis was found to posses 38.3% GC content, as in S.cusia . Additionally, rps12 was recognized as trans-spliced gene, this was reported in other species [52,53,54]. The arrangement and gene contents of the B. prionitis cp genome is similar to other sequence cp genome of Acanthaceae [50, 51] but is different with that of S. cusia which has trnH-GUG in the inverted repeat regions and ycf2 in the large single copy . Some of the genes in the cp genome of B. prionitis start with ACG, GTG and ATC codon, this phenomenon have been reported in angiosperm chloroplast genome [36, 37, 55].
Repeat elements present in cp genome are correlated with the genome recombination and rearrangements [56, 57]. The cp genome of B. prionits is found have low number of repeats compared to sequenced Acanthaceae plastome [47, 51, 52]. Acanthaceae plastomes contained low repeats compared with other angiosperm cp genome. Most of the repeats were located in the non coding region and ycf genes (ycf1 and ycf2), this has been commonly observed in plastome of angiosperms . Chloroplast microsatellites (cpSSRs) are short repeat in chloroplast genome inherited from a single parent, hence are often used as molecular makers in evolutionary studies such as genetic diversity, they also play role in identification of species [59,60,61]. cp microsatellites analysis, reveal total number of 98 SSRs in the cp genome of B. prionits of which most are mononucleotides, A and T. Poly A and T are reported to be the most abundant repeat in cp genome of plants [62,63,64]. Most of the cpSSRs are located in the non coding region whereas few are located in the protein coding genes region. The microsatellite detected in this study will be useful in evolutionary studies of the genus Barleria as well as identification and conservation of the genus.
Variation in size among cp genome is as a result of contraction and expansion of the inverted repeats (IRs) . Contraction and expansion in IRs region were observed in the cp genome B. prionitis and other sequenced Acanthaceae. The size of the inverted repeats ranges from 16, 328 bp in S. cusia to 25, 761 bp in E. attenuatus. Despite the similar lengths of the IR regions of B. prionitis and the other Acanthaceae species with the exception of S. cusia some level of expansion and contraction were observed. There are variation in the border of IR-SC region among the eight species compared, we identified six type of junctions based on the position of rps19, rpl2 and trnH, which occur as a result of contraction and expansion in the inverted repeat region. Type I occurs in three species B. prionitis, A. knappiae and A. paniculata, one of the duplicated rpl2 is located in the LSC region while the other is in the IRb region whereas only 1 rps19 is present in the LSC region. Type II was found in E. attenuatus, here the two rps19 are located in the inverted repeat regions (IRa and IRb) and the rpl22 gene is located in the LSC region. Type III pattern occurs in S. cusia and is characterized by having trnH-GUG duplicated in the inverted region. Type IV has no genes in the IRb/LSC border and was only found in R. breedlovei. In type V which is observed in the genome J. flava, some part of the rps19 gene are located in the inverted repeat region while some are located in single copy region, another remarkable observation is that the two rps19 are of unequal length. The last pattern, type VI occurs in B. ciliaris and is characterized by having rps19 in the LSC region and rpl2 in the IRb region. All the genomes have ndhF in the IRb/SSC border as well as ycf1 in the SSC/IRa border. It is observed that there is extension of inverted repeat into the single copy region in genome of S. cusia which made the LSC region to have length of 93, 666 bp. Despite the conserve nature of the cp genome, some variation could be detected . The positioning of ycf1 gene in IRb, is considered a pseudogene in many flowering plant plastomes. In addition, the stop codon is absent in the ycf1 gene sequence and this result to the differences in the distribution of genes in single copy and inverted repeat borders. The result of the comparative genome analysis using mVISTA revealed that the genome is relatively conserved with some degree of variation, which mostly occurs in the non coding region as a result of insertion and deletion. The results of the alignment showed no considerable structural rearrangements, like gene relocation or inversion were detected in the plastomes. The structural rearrangement was detected in the cp genome of S. cusia. DNA barcodes are sequences in the genome unique to particular taxa and are used as reliable tools for identification of plants and resolving phylogenetic relationship [65, 66]. The alignment of the eight cp genome reveals variable regions which includes trnH-GUG – psbA, rps16 – trnQ, trnC – petN, accD – psaI, clpP intron, trnL – trnF, rps15 – ycf1, rps12- trnV, trnL – trnA, atpE, atpF, rbcL. These regions will be used as makers for identification of the sampled Acanthaceae species as well as resolving phylogetic relationships in the family. Most of the variable regions are located in the single copy region particularly the large single copy, this is consistent in most angiosperms.
Synonymous (dS) and non synonymous (dN) substitution rate as well as dN/dS ration were calculated to evaluate sequence divergence and purifying selection in the protein coding genes. The result indicates low sequence divergence in most of the genes (dS < 0.1). The dN/dS analyses show that most of the protein coding genes were under negative selection, only few genes (atpF, petD, psbZZ, rpl20, petB, rpl16, rps16, rpoC, rps7, rpl32 and ycf3) were under positive selection (dN/dS > 1), comparable findings were reported for other plastomes [66,67,68].
Complete chloroplast genome is a good resource for inferring evolutionary and phylogenetic relationships [69,70,71]. Many researchers have used the plastome sequence to resolve phylogenetic relationships at various taxonomic levels [72, 73]. Until this study, the phylogenetic relationships and tribal classification of Acanthaceae was evaluated using only few genes and the tribal classification is still required to be clarified. In this study, we used the cp genome of nine species representing the four major tribes of the Acanthoideae and reconstructed phylogenetic relationships based on maximum parsimony and Bayesian inference methods. The resulting phylogenetic tree from the two methods showed the same topology with high resolution values at the clades. The result of this study based on nine Acanthaceae taxa confirm that Acanthoideae (the retinaculate clade) are monophyletic and also confirm the sister relationship between Acantheae (non cystolith clade) and the cystolith clade, this has been reported earlier [11,12,13, 19] . The phylogenetic tree showed Justicieae and Ruelliae are sister taxa as reported previously  therefore should be regarded as separate tribes not as Justicieae or Ruelliae because the species within these two taxa are paraphyletic. The sister relationship between Andrographideae and Barlerieae is also confirm. Andrographideae and Barlerieae were placed in the tribe Justiceae as sub tribes [35, 74]. Recently Scotland and Vollesen classified all species with cystolith under the tribe Ruellieae placing Andrographis, Barleria and Justicia under the sub tribes Andrographinae, Barleriinae and Justiciinae respectively. Our findings suggested that Andrographideae, Justicieae and Barlerieae should be treated as tribes not sub tribes.
In this study, we sequenced and reported the complete chloroplast genome of B. prionitis, providing valuable plastome genomic resources for the species. The plastome of B. prionitis has a typical gymonosperm cp genome structure and is comparable to other cp genome of Acanthaceae. Simple sequence repeats that will be used for evolutionary studies within Barleria were identified. The genome comparative analyses of 9 Acanthaceae reveal variable hotspot that could be used to develop DNA barcode for the identification of the species. These hotspots will also be useful in phylogenetic relationship studies of the family Acanthaceae. The study also reveals that only few genes were under positive selection. The findings of the confirmed the tribal position of major genera within Acanthoideae and suggested that Andrographideae, Justicieae and Barlerieae should be treated as tribes not sub tribes.
Plant material and DNA extraction
Plant material was collected from Makkah Taif road, Saudi Arabia (390 20′ 0.30″E, 210 45′ 33.68″N) and identified by the curator of King Abdulaziz University Herbarium, Dr. Dhafer A. Alzahrani, the voucher specimen was deposited in the herbarium of King Abdulaziz University, Jeddah, Saudi Arabia, with voucher specimen number KAU22534. Total genomic DNA was extracted from leaves using Qiagen DNA extraction Kit according to manufacturer’s protocol.
Library construction, sequencing and assembly
The genomic DNA was sequenced using Illumina Hiseq 2500 platform (Novogene Technologies, Inc. Beijing, China). Raw data reads were filtered by PRINSEQ lite Ver0.20.4  to get clean reads (5GB). The cp genome was assembled from the high quality clean reads using NOVOplasty2.7.2  with kmer 39 using the cp genome of Ruellia breedlovei (KP300014.1) as reference and ndhF from B. prionitis (U12653) as seed.
Dual Organellar GenoMe Annotator (DOGMA)  was used to annotate the genes in plastome followed by manual adjustment of the positions of start and stop codons. TrNAscan-SE2.0  was used to verify tRNA genes. Organellar Genome Draw (ORGDRAW)  was used to circular map of plastome. The complete chloroplast genome sequence of B. prionitis was submitted to GenBank (Accession number MK548575).
Relative synonymous codon usage values (RSCU), base composition and codon usage were analyzed using MEGA 6.0. PREP suite  with cutoff value of 8.0 was used to predict the RNA editing sites in the plastome.
Repeat analysis in B. prionitis chloroplast genome
MIcroSAtellite (MISA)  was used to identify the simple sequence repeats (SSRs) with the following parameters: eight for mononucleotides, five for dinucleotides, four trinucleotides and three for tetra, penta, hexanucleotides SSR motifs. Long repeats analysis was done using the program REPuter (https://bibiserv.cebitec.uni-bielefeld.de/reputer)  with default parameters.
Characterization of substitution rate
To detect the genes that were under selective pressure, DNAsp v5.10.01  was used to analyze the synonymous (dS), nonsynonymous (dN) and dN/dS value of all the protein coding genes in sampled Acanthoideae species.
For phylogenomic analysis, the cp genomes of Acanthoideae species deposited in the GenBank were recovered (Table 8). The plastome of four species of the order lamiales were also downloaded and set as out groups (Table 8). The downloaded sequences and cp genome of B. prionitis were aligned with MAFFT v.7  and analyzed using Maximum parsimony with (PAUP version 4.0b10)  and Bayesian Inference with MrBayes version 3.2.6 .. To select the suitable model for Bayesian analysis jModelTest 3.7  was used.
Availability of data and materials
All data generated or analysed during this study are included in this published article and the complete chloroplast genome sequence of Barleria prionitis is deposited in the genbank with I. D no: MK548575.
The accession numbers corresponding to the additional datasets used and analysed in this study can be found in Table 8. These were retrieved from National Center for Biotechnology Information database.
Conserved non coding sequence
Large single copy region
Next generation sequencing
Polymerase chain reaction
Relative synonymous codon usage
Small single copy region
Simple sequence repeats
Bergianska.< 2015.www.angio.bergianska.se/asterids/Plantaginales/Plantagin-ales.html>. Accessed Aug 2019.
Olmstead RA. Synoptical Classification of the Lamiales. Version 2.4. http://depts.washington.edu/phylo/Classification.pdf. 2012.
APG III. An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG III. Bot J Linn Soc. 2009;161:105–21.
Royle JF. Illustrations of the botany and other branches of the natural history of the Himalayan Mountains and of the flora of cashmere. New Delhi I: Today's and Tomorrow's Printers & Publishers; 1970. p. 296–8..
Solereder H. Systematic anatomy of the Dicotyledons: a handbook for Laboratories of Pure and Applied Botany, 2 v. Oxford: Clarendon Press; 1908.
Patil AM, Patil DA. Petiolar anatomy of some hitherto unstudies Acanthaceae. J Exp Sci. 2012;3:5–10.
Schonenberger J, Endress PK. Structure and development of the flowers in Mendoncia, Pseudocalyx, and Thunbergia (Acanthaceae) and their systematic implications. Int J Plant Sci. 1998;159:446–65.
Schonenberger J. Floral structure, development and diversity in Thunbergia (Acanthaceae). Bot J Linn Soc. 1999;130:1–36.
Schwarzbach AE, McDade LA. Phylogenetic relationships of the mangrove family Avicenniaceae based on chloroplast and nuclear ribosomal DNA sequences. Syst Bot. 2002;27:84–98.
Wortley AH, Harris DJ, Scotland RW. On the taxonomy and phylogenetic position of Thomandersia. Syst Bot. 2007;32:415–44. https://doi.org/10.1600/036364407781179716.
Hedren M, Chase MW, Olmstead RG. Relationships in the Acanthaceae and related families as suggested by cladistic analysis of rbcL nucleotides equences. Plant Syst Evol. 1995;194:93–109.
Scotland RW, Sweere JA, Reeves PA, Olmstead RG. Higher- level systematics of Acanthaceae determined by chloroplast DNA sequences. Am J Bot. 1995;82:266–75.
McDade LA, Moody ML. Phylogenetic relationships among Acanthaceae: evidence from non-coding trnL-trnF chloroplast DNA sequences. Am J Bot. 1999;86:70–80.
McDade LA, Masta SE, Moody ML, Waters E. Phylogenetic relationships among Acanthaceae: evidence from two genomes. Syst Bot. 2000;25:105–20.
Scotland RW, Endress PK, Lawrence TJ. Corolla ontogeny and aestivation in the Acanthaceae. Bot J Linn Soc. 1994;114:49–65. https://doi.org/10.1111/j.1095-8339.1994.tb01923.x.
Scotland RW. Pollen morphology of Contortae (Acanthaceae). Bot JLinn Soc. 1993;111:471–504 http://ora.ox.ac.uk/objects/uuid:cd07aee0-b24c-4ef3-a5b8-816c0ae9e06c.
Scotland RW, Vollesen K. Classification of Acanthaceae. Kew Bull. 2000;55:513–89.
McDade LA, Thomas FD, Carrie AK, Vollesen K. Phylogenetic relationships among Acantheae (Acanthaceae): major lineages present contrasting patterns of molecular evolution and morphological differentiation. Syst Bot. 2005;30:834–62.
McDade LA, Thomas FD, Carrie AK. Toward a comprehensive understanding of phylogenetic relationships among lineages of Acanthaceae s.l. (Lamiales). Am. J. Bot. 2008;95:1136–52.
McDade LA, Thomas FD, Carrie AK, Agneta JB. Phylogenetic placement, delimitation, and relationships among genera of the enigmatic Nelsonioideae (Lamiales: Acanthaceae). Am J Bot. 2012;61:637–51.
Timmis JN, Ayliffe MA, Huang CY, Martin W. Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes. Nat Rev Genet. 2004;5:123–35 pmid:14735123.
Price DC, Chan CX, Yoon HS, Yang EC, Qiu H, Weber AP, et al. Cyanophora paradoxagenome elucidates origin of photosynthesis in algae and plants. Science. 2012;335:843–7 pmid:22344442.
Wei W, Youliang Z, Li C, Yuming W, Zehong Y, Ruiwu Y. PCR-RFLP analysis of cpDNA and mtDNA in the genus Houttuynia in some areas of China. Hereditas. 2005;142:24–32. https://doi.org/10.1111/j.1601-5223.2005.01,704.x.
Bendich AJ. Circular chloroplast chromosomes: the grand illusion. Plant Cell. 2004;16:1661–6. https://doi.org/10.1105/tpc.160771.
Asaf S, Khan AL, Khan MA, Imran QM, Kang SM. Comparative analysis of complete plastid genomes from wild soybean (Glycine soja) and nine otherGlycinespecies. PLOS ONE. 2017;12:e0182281. https://doi.org/10.1371/journal.pone.0182281.
Cho KS, Yun BK, Yoon YH, Hong SY, Mekapogu M, Kim KH, et al. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum). PLOS ONE. 2015;10:e0125332. https://doi.org/10.1371/journal.pone.0125332.
Cronn R, Knaus BJ, Liston AP, Maughan J, Parks M, Syring J, et al. Targeted enrichment strategies for next-generation plant biology. Am J Bot. 2012;99:291–311. https://doi.org/10.3732/ajb.1100356.
Carstens BC, Pelletier TA, Reid NM, Satler JD. How to fail at species delimitation. Mol Ecol. 2013;22:4369–83. https://doi.org/10.1111/mec.12413.
Eaton D, Ree RH. Inferring phylogeny and introgression using RADseq data: an example from 347 flowering plants (Pedicularis: Orobanchaceae) Syst. Biol. 2013;62:689–706. https://doi.org/10.1093/sysbio/syt032.
Lemmon A, Emme S, Lemmon E. Anchored hybrid enrichment for massively high-throughput phylogenomics. Syst.Biol. 2012;61:727–44. https://doi.org/10.1093/sysbio/sys049.
Kamneva OK, Syring J, Liston A, Rosenberg NA. Evaluating allopolyploid origins in strawberries (Fragaria) using haplotypes generated from target capture sequencing. BMC Evol Biol. 2017;17. https://doi.org/10.1186/s12862-017-1019-7.
Sousa F, Bertrand YKJ, Nylinder S, Oxelman B, Eriksson JS, Pfeil BE, et al. Phylogenetic properties of 50 nuclear loci in Medicago (Leguminosae) generated using multiplexed sequence capture and next-generation sequencing. PLOS ONE. 2014;9:e109704. https://doi.org/10.1371/journal.pone.0109704.
Stephens JD, Rogers WL, Heyduk K, Cruse-Sanders JM, Determann RO, Glenn TC, et al. Resolving phylogenetic relationships of the recently radiated carnivorous plant genus Sarracenia using target enrichment. Mol Phylogenet Evol. 2015;85:76–87.
Weitemier K, Straub SCK, Cronn RC, Fishbein M, Schmickl R, McDonnell A, et al. Hyb-Seq: combining target enrichment and genome skimming for plant phylogenomics. Appl Plant Sci. 2014;2:1400042. https://doi.org/10.3732/apps.1400042.
Clarke CB. Acanthaceea. In: Hooker JD, editor. The flora of British India, vol. 4. London: L. Reeve and Co.; 1884. p. 387–558.
Raman G, Park S. The complete chloroplast genome sequence of Ampelopsis: gene organization, comparative analysis, and phylogenetic relationships to other angiosperms. Front Plant Sci. 2016;341.
Park I, Kim WJ, Yeo S-M, Choi G, Kang Y-M, Piao R, et al. The complete chloroplast genome sequences of Fritillaria ussuriensis maxim. In addition, Fritillaria cirrhosa D. Don, and comparative analysis with other Fritillaria species. Molecules. 2017;282..
Zhou J, Chen X, Cui Y, Sun W, Li Y, Wang Y. Molecular structure and phylogenetic analyses of complete chloroplast genomes of two Aristolochia medicinal species. Int J Mol Sci. 2017;1839.
Jiang D, Zhao Z, Zhang T, Zhong W, Liu C, Yuan Q, et al. The chloroplast genome sequence of Scutellaria baicalensis provides insight into intraspecific and interspecific chloroplast genome diversity in Scutellaria. Genes. 2017;227.
Zhou J, Cui Y, Chen X, Li Y, Xu Z, Duan B, et al. Complete chloroplast genomes of Papaver rhoeas and Papaver orientale: molecular structures, comparative analysis, and phylogenetic analysis. Molecules. 2018;437.
Wang W, Yu H, Wang J, Lei W, Gao J, Qiu X, et al. The complete chloroplast genome sequences of the medicinal plant Forsythia suspensa (oleaceae) Int. J. Mol. Sci. 2017;2288.
Kumbhar F, Nie X, Xing G, Zhao X, Lin Y, Wang S, et al. Identification and characterisation of rna editing sites in chloroplast transcripts of einkorn wheat (Triticum monococcum). Ann Appl Biol. 2018;172:197–207.
Park M, Park H, Lee H, Lee B-H, Lee J. The complete plastome sequence of an antarctic bryophyte Sanionia uncinata (Hedw.) loeske. Int J Mol Sci. 2018;709.
Xumei W, Tao Z, Guoqing B, Yuemei Z. Complete chloroplast genome sequence of Fagopyrum dibotrys: genome features, comparative analysis and phylogenetic relationships. Sci Rep. 2018;8.
Provan J, Powell W, Hollingsworth PM. Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol. 2001;16:142–7. https://doi.org/10.1016/S0169-5347(00)02097-8.
Philippe H, Delsuc F, Brinkmann H, Lartillot N. Phylogenomics. Ann Rev Ecol Evol Syst. 2005;36:541–62.
Raubeson LA, Peery R, Chumley TW, Dziubek C, Fourcade HM, Boorem JL, et al. Comparative chloroplast genomics: Analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus. BMC Genomics. 2007;8:174–201.
Wang RJ, Cheng CL, Chang CC, Wu CL, Su TM, Chaw SM, et al. Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol Biol. 2008;8:36–50.
Chunming G, Yunfei D, Jun W. The complete chloroplast genomes of Echinacanthus species (Acanthaceae): phylogenetic relationships, adaptive evolution, and screening of molecular markers. Front Plant Sci. 2019;1989.
Yongbin Z, Erin AT. The draft genome of Ruellia speciosa beautiful wild Petunia: Acanthaceae. DNA Res. 2017;24:179–92.
Ding P, Shao YH, Li Q, Gao JL, Zhang RJ, Lai XP, et al. The chloroplast genome sequence of the medicinal plant Andrographis paniculata. Mitochondr DNA. 2016;27:2347–8. https://doi.org/10.3109/19401736.2015.1025258.
Chen HM, Shao JJ, Zhang H, Jiang M, Huang LF, Zhang Z, et al. Sequencing and analysis of Strobilanthes cusia (Nees) Kuntze chloroplast genome revealed the rare simultaneous contraction and expansion of the inverted repeat region in angiosperm. Front Plant Sci. 2018;9. https://doi.org/10.3389/fpls.2018.00324.
Hildebrand M, Hallick RB, Passavant CW, Bourque DP. Trans-splicing in chloroplasts: the rps12 loci of Nicotiana tabacum. Proc Natl Acad Sci U S A. 1988;85:372–6. https://doi.org/10.1073/pnas.85.2.372.
Liu TJ, Zhang CY, Yan HF, Zhang L, Ge XJ, Hao G, et al. Complete plastid genome sequence of Primula sinensis (Primulaceae): struture comparison, sequence variation and evidence for accD tranfer to nucleus. PeerJ. 2016;4:e2101. https://doi.org/10.7717/peerj.2101.
Gichira AW, Li ZZ, Saina JK, Long ZC, Hu GW, Gituru RW, et al. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection. PeerJ. 2017;5:e2846. https://doi.org/10.7717/peerj.2846.
Weng ML, Blazier JC, Govindu M, Jansen RK. Reconstruction of the ancestral plastid genome in Geraniaceae reveals a correlation between genome rearrangements, repeats and nucleotide substitution rates. Mol Biol Evol. 2013;31:645–59. https://doi.org/10.1093/molbev/mst257.
Lu L, Xiam L, Zhaodong H, Liming Y, Jingbo Z, Ye P, et al. Phylogenetic studies and comparative chloroplast genome analyses elucidate the basal position of halophyte Nitraria sibirica (Nitrariaceae) in the Sapindales. Mitochondr DNA Part A. 2017:1–11. https://doi.org/10.1080/24701394.2017.1350954.
Curci PL, De Paola D, Danzi D, Vendramin GG, Sonnante G. Complete chloroplast genome of the multifunctional crop globe artichoke and comparisonwithotherAsteraceae. PLOS ONE. 2015;10:e0120589. https://doi.org/10.1371/journal.pone.0120589.
Bryan GJ, McNicol JW, Meyer RC, Ramsay G, De Jong WS. Polymorphic simple sequence repeat markers in chloroplast genomes of Solanaceous plants. Theor Appl Genet. 1999;99:859–67.
Provan J. Novel chloroplast microsatellites reveal cytoplasmic variation in Arabidopsis thaliana. Mol Ecol. 2000;9:2183–5.
Ebert D, Peakall R. Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol Ecol Resour. 2009;9:673–90.
Dong W, Xu C, Li W, Xie X, Lu Y, Liu Y, et al. Phylogenetic resolution in Juglans based on complete chloroplast genomes and nuclear DNA sequences. Front Plant Sci. 2017;8:1148. https://doi.org/10.3389/fpls.2017.01148.
Ye W-Q, Yap Z-Y, Li P, Comes HP, Qiu Y-X. Plastome organization, genome-based phylogeny and evolution of plastid genes in Podophylloideae (Berberidaceae). Mol Phylogenet Evol. 2018;127:978–87. https://doi.org/10.1016/j.ympev.2018.07.001.
Yang Y, Zhou T, Duan D, Yang J, Feng L, Zhao G. Comparative analysis of the complete chloroplast genomes of five Quercus species. Front Plant Sci. 2016;7:959. https://doi.org/10.3389/fpls.2016.00959.
Aldrich J, Cherney BW, Merlin E. The role of insertions/deletions in the evolution of the intergenic region between psbA and trnH in the chloroplast genome. Curr Genet. 1988;14:137–46. https://doi.org/10.1007/bf00569337.
Zhou T, Chen C, Wei Y, Chang Y, Bai G, Li Z, et al. Comparative transcriptome and chloroplast genome analyses of two related Dipteronia species. Front Plant Sci. 2016;7. https://doi.org/10.3389/fpls.2016.01512.
Rousseau-Gueutin M, Bellot S, Martin GE, Boutte J, Chelaifa H, Lima O, et al. The chloroplast genome of the hexaploid Spartina maritima (Poaceae, Chloridoideae): comparative analyses and molecular dating. Mol Phylogenet Evol. 2015;93:516. https://doi.org/10.1016/j.ympev.2015.06.013.
Xu J-H, Liu Q, Hu W, Want T, Xue Q, Messing J. Dynamics of chloroplast genome in green plants. Genomics. 2015;106:221–31. https://doi.org/10.1016/j.ygeno.2015.07.004.
Borsch T, Quandt D. Mutational dynamics and phylogenetic utility of noncoding chloroplast DNA. Plant Syst Evol. 2009;282:169–99.
Dong WP, Liu J, Yu J, Wang L, Zhou SL. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS One. 2012;7:e35071.
Tong W, Kim TS, Park YJ. Rice chloroplast genome variation architecture and phylogenetic dissection in diverse Oryza species assessed by whole-genome resequencing. Rice. 2016;9:57.
Dong WP, Liu H, Xu C, Zuo YJ, Chen ZJ, Zhou SL, et al. A chloroplast genomic strategy for designing taxon specific DNA mini-barcodes: a case study on ginsengs. BMC Genet. 2014;15:138.
Du YP, Bi Y, Yang FP, Zhang MF, Chen XQ, Xue J, Zhang XH, et al. Complete chloroplast genome sequences of Lilium: insights into evolutionary dynamics and phylogenetic analyses. Sci Rep. 2017;7:5751.
Bentham G. Acanthaceae. In: Bentham G, Hooker JD, editors. Genera Plantarum, vol. 2:1060 – 1122. Reeeve, London Co., Lond; 1876.
Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863–4.
Dierckxsens N, Mardulyn P, Smits G. NOVOPlasty: De novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 2016;45.
Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–5.
Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33:686–W689.
Lohse M, Drechsel O, Bock R. OrganellarGenomeDRAW (OGDRAW): a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr Genet. 2007;52:267–74.
Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R, et al. Reputer: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29:4633–42.
Thiel T, Michalek W, Varshney R, Graner A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet. 2003;106:411–22.
Mayor C, Brudno M, Schwartz JR, Poliakov A, Rubin EM, Frazer KA, et al. VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics. 2000;16:1046–7.
Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004;32:273–9.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool. 1978;27:401–10.
Fredrik R, Maxim T, Paul VM, Daniel LA, Aaron D, Sebastian H, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic. 2012;61:539–42.
Posada D. jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008;25:1253–9.
This project was funded by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, under grant No. (DF-293-130-1441). The authors, therefore, gratefully acknowledge DSR technical and financial support.
Ethics approval and consent to participate
Not applicable. The plant was collected in non protected area; no any legal authorization/license is required.
Consent for publication
The authors declare that they have no competing interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Alzahrani, D.A., Yaradua, S.S., Albokhari, E.J. et al. Complete chloroplast genome sequence of Barleria prionitis, comparative chloroplast genomics and phylogenetic relationships among Acanthoideae. BMC Genomics 21, 393 (2020). https://doi.org/10.1186/s12864-020-06798-2