- Research article
- Open Access
The complete mitochondrial genomes of sixteen ardeid birds revealing the evolutionary process of the gene rearrangements
BMC Genomics volume 15, Article number: 573 (2014)
The animal mitochondrial genome is generally considered to be under selection for both compactness and gene order conservation. As more mitochondrial genomes are sequenced, mitochondrial duplications and gene rearrangements have been frequently identified among diverse animal groups. Although several mechanisms of gene rearrangement have been proposed thus far, more observational evidence from major taxa is needed to validate specific mechanisms. In the current study, the complete mitochondrial DNA of sixteen bird species from the family Ardeidae was sequenced and the evolution of mitochondrial gene rearrangements was investigated. The mitochondrial genomes were then used to review the phylogenies of these ardeid birds.
The complete mitochondrial genome sequences of the sixteen ardeid birds exhibited four distinct mitochondrial gene orders in which two of them, named as “duplicate tRNAGlu–CR” and “duplicate tRNAThr–tRNAPro and CR”, were newly discovered. These gene rearrangements arose from an evolutionary process consistent with the tandem duplication - random loss model (TDRL). Additionally, duplications in these gene orders were near identical in nucleotide sequences within each individual, suggesting that they evolved in concert. Phylogenetic analyses of the sixteen ardeid species supported the idea that Ardea ibis, Ardea modesta and Ardea intermedia should be classified as genus Ardea, and Ixobrychus flavicollis as genus Ixobrychus, and indicated that within the subfamily Ardeinae, Nycticorax nycticorax is closely related to genus Egretta and that Ardeola bacchus and Butorides striatus are closely related to the genus Ardea.
The duplicate tRNAThr–CR gene order is found in most ardeid lineages, suggesting this gene order is the ancestral pattern within these birds and persisted in most lineages via concerted evolution. In two independent lineages, when the concerted evolution stopped in some subsections due to the accumulation of numerous substitutions and deletions, the duplicate tRNAThr–CR gene order was transformed into three other gene orders. The phylogenetic trees produced from concatenated rRNA and protein coding genes have high support values in most nodes, indicating that the mitochondrial genome sequences are promising markers for resolving the phylogenetic issues of ardeid birds when more taxa are added.
Five distinct mitochondrial (mt) gene orders have been described in birds since the complete mt genome of the domestic chicken (Gallus gallus) was presented [1–17]. There are differences among the various copies and arrangements of the control region (CR) and flanking genes among these five mt gene orders, which were assumed to have resulted from the evolutionary process, tandem duplication and random loss (TDRL). Gibb et al.  introduced a naming system of avian mt gene orders according to these variations, known as ancestral avian, remnant CR (2), duplicate CR and duplicate tRNAThr–CR. These were initially identified in the chicken , falcon , Amazona parrot  and albatross . In ancestral avian gene order, the gene content and arrangement in the region located between cytochrome b (Cytb) and tRNAPhe is Cytb, tRNAThr, tRNAPro, ND6, tRNAGlu and CR. These genes and the CR are tandem duplicated in the duplicate tRNAThr–CR gene order, but the second Cytb is only a partial copy (p-Cytb). In comparison with the duplicate tRNAThr–CR gene order, the first tRNAPro, ND6 and tRNAGlu and the second tRNAThr are lost or reduced in the duplicate CR and remnant CR (2) gene orders. Additionally, there is degeneration of the second CR in the remnant CR (2) gene order. Besides these four orders, Verkuil et al.  identified a novel mt gene order in the ruff, which differs from the duplicate tRNAThr–CR gene order in the degeneration of the second tRNAThr.
In TDRL model, usually one copy of the duplicated genes or CRs loses function and degrades or is eventually deleted . However, in birds it is common that the duplications have persisted for a long evolutionary time, where both copies of the duplicated genes or CRs are functional and have high sequence similarity. Such situation has generally been thought to be due to concerted evolution and multiple mechanisms accounting for concerted evolution were postulated [5, 7, 8, 10, 12–15, 19, 20].
The family Ardeidae is a group of large wading birds that is distributed across most parts of the world. According to a recent classification by Kushlan and Hancock , this family consists of five subfamilies, 17 genera and 62 species. Recent avian molecular phylogenies based on a large set of nuclear data  and complete mtDNA  have united the Ardeidae into a large taxonomic group that contains Pelecaniformes and Ciconiiformes. The duplicate tRNAThr–CR gene order is taxonomically widespread within this group [7, 11, 12, 14, 17], suggesting that this mitochondrial gene order might be a general pattern in these related birds. However, the recently published complete mitochondrial DNA (mtDNA) sequences of four ardeid species indicate that their gene orders are similar to the ancestral gene order [8, 23, 24].
In the current study, the complete mtDNA of 16 ardeid species was sequenced in order to explore the occurrence of derived, rather than ancestral avian, mt gene orders in ardeid mt genomes and to investigate the evolution of mt gene rearrangements in the family Ardeidae. The phylogenetic relationships within the surveyed ardeid species were also reviewed using the concatenated nucleotide sequences of two rRNA and 12 protein coding genes.
Tissue samples were collected from sixteen ardeid species (Table 1). The nomenclature used for ardeid species in this study is the taxonomic system proposed by Kushlan and Hancock . The muscle or liver tissue samples used in this study were collected from dead birds found in the wild during field investigations and stored in alcohol until DNA extraction. The Administration Center for Wildlife Conservation in Fujian Province (FJWCA-1208) approved the current research and all procedures involving animal tissue collection in the wild. The scientific license for access to the study site was issued by the Administration Department of Xiamen Egret Natural Reserve (XMENR-1005).
DNA extraction, PCR amplification and sequencing
Whole genomic DNA was extracted using the DNeasy Blood and Tissue Kit (Qiagen) following the manufacturer’s protocol.
To minimize the possibility of obtaining nuclear copies of mt genes, the entire mt genome was first amplified in two long overlapping fragments via long and accurate polymerase chain reaction (LA PCR) using primer pairs of LACOIF and LA16SR, LA12SF and LACOIIIR (Additional file 1)  and the Premix LA Taq kit version 2.0 (Takara, China). LA PCR was carried out in a Veriti 96 well thermocycler (Applied Biosystems). The final reaction volume of 50 μL contained 1 μL (about 100 ng) of genomic DNA, 2.5 U LA Taq Polymerase, 1× LA PCR buffer II, 0.4 mM of each dNTP, and 0.1 μM of each primer. PCR cycling conditions were as follows: 1 min denaturation at 94°C, followed by 30 cycles of 15 s at 94°C, 10 min at 68°C, with a final extension at 72°C for 10 min. Each PCR yielded a single band, which was detected in a 0.8% agarose gel. The band was extracted and purified using a DNA Gel Extraction Kit (Axygen).
The purified PCR products were used as templates for nested PCR amplification with 19 primer sets (Additional file 1) [23, 26], using Premix Taq® kit version 2.0 (Takara). The strategy employed to amplify the region between Cytb and 12S rRNA is shown in Additional file 2. All nested PCR reactions were carried out in a final reaction volume of 50 μL containing 1 μL of template DNA, 1.25 U Taq Polymerase, 1× PCR buffer, 1.5 mM MgCl2, 0.2 mM of each dNTP, 0.1 μM of each primer. PCR cycling was carried out in a Veriti 96 well thermocycler (Applied Biosystems). PCR condictions were as follows: 3 min denaturation at 94°C, followed by 30 cycles of 15 s at 94°C, 15 s at 52°C, 2 min at 72°C, with a final extension at 72°C for 7 min. The amplification products were separated on a 1% agarose gel and DNA fragments from the individual bands were purified, ligated into pMD 18-T vector (TaKaRa) and transformed into Escherichia coli DH5α. Positive clones were selected to sequence in both directions and each sequenced twice independently. Sequencing was performed on an ABI PRISM 3730 automatic sequencer (Applied Biosystems) using universal M13 sequencing primers and BigDye version 3.1 (Applied Biosystems).
Gene identification and genome analysis
Contiguous sequences that were obtained were assembled using the DNAMAN version 5.2 (Lynnon Biosoft). Most tRNA genes were identified using tRNAscan-SE 1.21 (http://lowelab.ucsc.edu/tRNAscan-SE/) with the vertebrate mitochondrial genetic code and ‘mito/chloroplast’ source. The tRNASer (AGY), which not found by the tRNA-SE1.21, and protein coding genes were identified by homology alignments with the corresponding known sequences of E. novaehollandiae, E. eulophotes, N. nycticorax, and I. cinnamomeus [GenBank: DQ780878, EU072995, JN018412, HQ690247, respectively] using Clustal × 2.0 . The boundaries of ribosomal RNA genes (rRNA) and CR2 were inferred from boundaries of flanking genes under the assumption that there are neither intergenic spacers nor overlaps. The boundary of the 5’ end of CR1 was inferred from the flanking tRNAGlu or sequence homology to CR2 if the flanking tRNAGlu was absent. The boundary of the 3’ end of CR1 was inferred from the sequence homology to CR2. The pair-wise alignments between putative pseudogenes and corresponding functional copy within an individual were performed in the LALIGN query program using default parameters (http://fasta.bioch.virginia.edu/fasta_www2/fasta_www.cgi?rm=lplalign) .
The following mtDNA from Threskiornithinae were retrieved from GenBank for phylogenetic analysis: Nipponia nippon [GenBank: NC_008132], Platalea minor [GenBank: NC_010962], P. leucorodia [GenBank: NC_012772] and Threskiornis aethiopicus [GenBank: NC_013146], which served as outgroups.
Nucleotide sequences of the two rRNAs and 12 protein coding genes encoded on heavy strands (excluding the stop codon) were used for phylogenetic analysis. Alignments were first built for all 14 genes individually using Clustal × 2.0 with default settings and then concatenated into a single alignment. Gaps and ambiguous areas in the alignments of the two rRNA genes were excluded using Gblocks 0.91b  with default parameters. The final alignment encompassed 20 taxa and 13, 184 nucleotide sites, of which 3, 882 sites were parsimony informative.
The concatenated alignment dataset was partitioned for phylogenetic analysis. First, the dataset was partitioned by 12 s rRNA, 16 s rRNA and each codon position of each protein coding gene. PartitionFinder v1.1.1  was then used to determine the best partitioning scheme and nucleotide substitution models according to the Bayesian information criterion (BIC) and “greedy” algorithm with branch lengths estimated as “unlinked”. The analysis suggested splitting the dataset into three partitions as the best partitioning scheme: (1) 12 s rRNA + 16 s rRNA + the first codon position of the 12 protein coding genes + the second codon position of ATP8, (2) the second codon position of the 11 protein coding genes, and (3) the third codon position of the 12 protein coding genes. The proposed best-fit nucleotide substitution model for partitions (1) and (3) was a general time-reversible model with gamma distribution substitution rates and proportion of invariable sites (GTR + I + G), and for partition (2) was the Hasegawa-Kishino-Yano model with a proportion of invariant sites and gamma distributed rate variation (HKY + I + G). These results were implemented in the following phylogenetic analyses.
Phylogenetic analyses were conducted using the Bayesian inference (BI) and maximum likelihood (ML) methods available on the CIPRES Science Gateway v3.3 . The BI analysis was performed using MrBayes v3.1.2  on XSEDE . Two sets of four chains were allowed to run simultaneously for 10,000,000 generations and each set was sampled every 1000 generations. Convergence and mixing of the chain of each analysis was evaluated using Tracer v1.6.1 (http://tree.bio.ed.ac.uk/software/tracer) to check that the ESS values were all superior to 200. A consensus tree was then calculated after omitting the first 25% trees as burn-in. The ML analysis was performed using RAxML-HPC2 on XSEDE v8.0.9  and 1000 bootstraps were used to estimate the node reliability. Additionally, the GTR + G + I model was used for all three partitions, since the HKY + I + G model is not implemented in RAxML.
Gene content and order of CR and flanking genes
In this study, the complete mitochondrial DNA sequences were obtained for sixteen ardeid birds, which had been deposited in NCBI GenBank under accession numbers KJ190944 to KJ190959. The size of deposited sequences varied substantially, from 18640 bp (I. cinnamomeus) to 20350 bp (B. stellaris). In the region between Cytb and tRNAPhe, all sixteen mitochondrial genomes contained two CRs, but the number of tRNAThr, tRNAPro, tRNAGlu and ND6 copies varied (Figure 1). Four gene order types were exhibited in the sixteen genomes due to these variations: 1) in ten species, including E. eulophotes, E. sacra, E. garzetta, B. striatus, A. bacchus, N. nycticorax, B. stellaris, I. sinensis, I. eurhythmus, and I. flavicollis, the gene order was similar to the duplicate tRNAThr–CR gene order and was characterized by a tandem duplicated region that spanned the last part of Cytb to the CR (Figure 1A); 2) in A. modesta, A. intermedia and A. ibis, the gene order was similar to the duplicate CR gene order, where tRNAPro1, ND61 and tRNAGlu1 as well as the partial Cytb and tRNAThr2 were replaced by putative pseudogenes with respect to the duplicate tRNAThr–CR gene order (Figure 1B); 3) in A. cinerea and A. purpurea, the gene order was similar to the duplicate CR gene order with exception of two copies of tRNAGlu (Figure 1C); 4) in I. cinnamomeus, the gene order differed from the duplicate tRNAThr–CR gene order in that tRNAPro1 was immediately followed by CR1, which began with a poly C stretch (Figure 1D). The latter two gene orders are novel for birds, and for the sake of congruence with Gibb et al. , they were named as a duplicate tRNAGlu–CR gene order and a duplicate tRNAThr–tRNAPro and CR gene order, respectively.
Comparison of the duplicated regions
There were duplicated CRs in the four gene orders, both of which were deemed as complete since they both contained CR-specific conserved sequence blocks previously described in other avian research (poly C stretch, TAS motif, F box, E box, D box, C box, B box, bird similarity box, and CSB1) [34–38] (Additional file 3). Sequence alignments indicated that the sequences near the poly C stretch were usually highly similar between the duplicated CRs within each individual, while the middle parts of domain I were significantly divergent except in I. flavicollis and B. stellaris. After this, approximately 200 bp (150 bp in A. bacchus and I. cinnamomeus) at the 3’ end of domain I and all of domain II (defined from the start of the F box to the start of the CSB1) continued nearly identical between CR1 and CR2. CR1 domain III also coincided with the 5’ end of CR2 domain III. After this region, there were several hundred additional nucleotides, composed primarily of complicated CA-rich repetitive sequences, in CR2. In seven species, another repetitive sequence with a remarkably long motif was found in domain III of both CR1 and CR2. The sizes of the motifs were 93 bp in A ibis, 92 bp in I. sinensis, 91 bp in B. stellaris and A. modesta, and 81 bp in the three Egretta species. In the surveyed specimens, the minimum number of repeats was 3.2 in CRs of A. modesta, and the maximum number was 10 in CR2 of B. stellaris although heteroplasmy was present in this case (Additional file 3).
The other duplicated regions in the duplicate tRNAThr–CR gene order, beginning from the 3’ end of Cytb through tRNAGlu (duplicated Cytb section vary between 81 and 113 bp in length among the species) were also identical or nearly identical to each other within each individual. However, CR1 and the p-Cytb were separated by a small non-coding spacer (varying in length between 13 and 40 bp among species), which was not present in CR2 or Cytb. In I. cinnamomeus that had the duplicate tRNAThr–tRNAPro and CR gene order, this intervening spacer was 26 bp in length. Additionally, in this individual, the duplicated region beginning from the last 88 bp of Cytb through tRNAPro, differed by one mismatch base.
In the duplicate tRNAGlu–CR gene order, the sizable non-coding section (NC1, 217 bp in A. cinerea and 220 bp in A. purpurea), located between tRNAThr and tRNAGlu1, was presumed to be pseudogenes, which corresponded to remnants of the degenerated tRNAPro and ND6. Moreover, another sizable non-coding section (NC2, 166 bp), located between CR1 and tRNAPro, was presumed to be pseudogenes, which corresponded to remnants of the degenerated Cytb and tRNAThr. in order to verify the reliability of this mechanism, the degree of similarity between the NCs and corresponding functional genes was examined by pair-wise alignments within individuals of respective species. In A. cinerea, there was a 132 bp portion in NC1that had 68.8% identity with tRNAPro and ND6, a 42 bp portion in NC1 that had 85.7% identity with ND6, and a 142 bp portion in NC2 that had 72.3% identity with Cytb and tRNAThr. In A. purpurea, there was a 194 bp portion in NC1 that had 63% identity with ND6 which especially exhibits 81.8% identity over the last 66 bp, and a 141 bp portion in NC2 that had 68% identity with Cytb and tRNAThr (Additional file 4).
Similarly, in the duplicate tRNAGlu–CR gene order, tRNAPro-like or ND6-like subsections were found in NC1 located between tRNAThr and CR1, and Cytb-like and tRNAThr-like subsections were found in NC2 (Additional file 4). Specifically, in A. modesta, a 76 bp portion in NC1 has 76.3% identity with tRNAPro and a 132 bp portion in NC2 has 66.0% identity with Cytb and tRNAThr. In A. intermedia, a 225 bp portion in NC1 has 63.6% identity with the tRNAPro and ND6, and a 122 bp portion in NC2 has 63.6% identity with Cytb and tRNAThr. In A. ibis, a 155 bp portion in NC1 has 58.8% identity with the ND6 and a 114 bp portion in NC2 has 62.6% identity with Cytb and tRNAThr.
The BI and ML trees had similar topologies and most nodes were supported by high posterior probabilities (PP) and bootstrap percentages (BP) (Figure 2). In the trees, the 16 ardeid birds were first split into two monophyletic clades corresponding to the subfamilies Botaurinae and Ardeinae. Within the Botaurinae clade, the Ixobrychus birds formed a monophyletic clade with B. stellaris as the basal sister group. Within the Ardeinae clade, N. nycticorax was closely related to Egretta; A. bacchus and B. striatus formed a group which was closely related to Ardea. Within Ardea, the A. ibis was basal and A. modesta was sister to A. intermedia.
Phylogenetic relationships among the surveyed ardeid birds
Among the surveyed ardeid birds, the major phylogenetic controversies have been related to the classifications of the A. ibis, A. modesta, A. intermedia and I. flavicollis and to the genus relationships within Ardeinae [39–45]. This study confirmed Sheldon’s postulate based on DNA-DNA hybridization data  that A. ibis, A. modesta and A. intermedia are members of the Ardea genus as they are more closely related to two typical Ardea species. I. flavicollis is generally received as a member of Ixobrychus [21, 43], but has sometimes been put in a monotypic genus Dupetor . The present results indicate that I. flavicollis should be included in the genus Ixobrychus since it nested in this group. Moreover, the relationship between the genera, Ardeola and Butorides, was found to be the closest since A. bacchus and B. striatus were the first to form a group with high support (PP = 1 and BP = 99). This is consistent with the result based on osteological characters reported by Payne and Risley . The group of Ardeola and Butorides was more closely related to the genus Ardea, which is in accord with the postulate of Kushlan and Hancock . The closest relationship between the genera Nycticorax and Egretta, a newly found in current study, was strongly supported by BI analysis (PP = 0.99), but only weakly supported by ML analysis (BP = 54) and it could not be ruled out as a result of the long branch attraction .
In brief, our results were broadly consistent with previously phylogenetic studies based on DNA hybridization , Cytb  and vocalization . These have been so far the only available datasets to estimate ardeid bird phylogeny. Although these methods are only rarely applied in current phylogenetic analyses, they have yielded highly congruent results . Furthermore, in this study, most nodes in the phylogenetic trees had high support values, especially in the BI tree. The complete mitochondrial genomes may be the promising markers for resolving ardeid phylogenetic relationships at low taxonomic levels. In order to break up some long branches, avoid long-branch attraction and improve phylogenetic accuracy, it will be necessary to add more taxa, particularly the inclusion of certain key groups such as genera Nyctanassa, Gorsachius, and Syrigma, and more species in Nycticorax and Ardeola. Moreover, because the entire mitochondrial genome is inherited as a single locus, it would be highly desirable to include further unlinked nuclear makers into future phylogenetic analyses on ardeid birds.
Ardeid mitochondrial gene orders
The complete mtDNAs of sixteen ardeid species exhibited four gene order types, all of which contained duplications. In light of the current results, we presume that mt gene orders with duplications were common in all ardeid birds, or at least in Ardeinae and Botaurinae subfamilies. The ancestral avian gene order previously found in E. eulophotes , N. nycticorax, and I. cinnamomeus  have been proven to be misidentifications in present study. Although the mt gene order of E. novaehollandiae  was not reexamined here, it is very likely that the ancestral avian gene order found in this species was also a misidentification. Indeed, in the sequencing of E. novaehollandiae mtDNA, Gibb et al. found indications of a potential duplication, but could not extensively test to completely exclude co-amplification of nuclear copies due to poor DNA quality .
Two of the four different mt gene orders in ardeid birds, the duplicate tRNAGlu–CR and duplicate tRNAThr–tRNAPro and CR, were newly discovered, whereas the other two had been found previously in other avian species. The duplicate tRNAThr–CR gene order had previously been reported in Philippine hornbills , albatrosses [7, 12], the black-faced spoonbill , and several birds of “core” pelecaniforms . The p-Cytb was significantly shorter in ardeid birds compared to these birds, which implies that the extant length of the p-Cytb in ardeid birds was not the initial duplicated piece, but resulted from degeneration of the 5’ end of p-Cytb. Such a case has been reported in Thalassarche albatrosses , where a degenerate partial copy of Cytb (d-Cytb, retaining 70% sequence similarity to the functional Cytb) was found to precede the p-Cytb (retaining 100% identity to the 3’ end portion in the functional copy). The duplicate CR gene order has previously been reported for parrots and cockatoos [5, 16], osprey , ivory-billed aracari , blackcap and reed warbler . In these cases, the p-Cytb and the pseudo copies of tRNAPro1, ND61, tRNAGlu1 and tRNAThr2 are usually decayed beyond recognition. The only exception was Amazona parrots , in which identifiable pseudo ND6 and tRNAGlu were still present. In current study, many recognizable pseudogenes were present and had more than 60% sequence identity with the portions of corresponding functional copies, making the extent of the duplication more easily defined and providing visible evidence that one copy of the duplicated genes was in the process of being eliminated.
Evolution of ardeid mt gene rearrangements
The existence of tandem duplications and pseudogenes in ardeid mt genomes made it clear that the mt gene rearrangements arose from a process consistent with the TDRL model. The duplicate tRNAThr–CR gene order was derived from the ancestral avian gene order by a tandem duplication of the region spanning from Cytb (or just the last part) to CR, followed by degeneration of the 3’ portion of the CR1 and 5’ portion of the Cytb. The other three gene orders were derived from the duplicate tRNAThr–CR gene order by further degenerations or deletions of one copy of specific duplicated genes.
Since the duplicate tRNAThr–CR gene order was shared by most lineages (Figure 2) and the p-Cytbs were nearly identical in length not only among closely related but also distantly related species, it was most parsimonious that the origin of the duplicate tRNAThr–CR gene order preceded before the divergence of these ardeid birds. When this hypothesis was accepted, it is easier to explain the variation in ardeid gene orders, since the other three gene orders could have been more easily derived from the duplicate tRNAThr–CR gene order than from the ancestral avian gene order via accumulation of substitutions and deletions. Alternatively, although tandem duplication might arise multiple times within ardeid birds, this was less likely because the same replication errors, same tandem duplications and same degenerations would have had to repeatedly occur in each species .
In I. cinnamomeus, complete deletion of ND61 and tRNAGlu1 as well as the 5’ end portion of CR1 (the portion upstream of C-stretch in CR2 is absent in CR1) had independently occurred, transforming the duplicate tRNAThr–CR gene order into the duplicate tRNAThr–tRNAPro and CR gene order. In the lineage leading to the five Ardea species, the degenerations or deletions of tRNAPro1, ND61, pCytb and tRNAThr2 might have independently occurred in each species, because the degeneration degrees and patterns were different among these species (Additional file 4). Alternatively, the degenerations could have originated before the divergence of the five species and then continued to accumulate numerous substitutions and deletions in each species. The deletion of tRNAGlu1 in the duplicate CR gene order could not have originated before the divergence of the A. ibis, A. modesta and A. intermedia, but likely occurred independently in two lineages, according to the evolutionary relationships among the five Ardea species shown in Figure 2, in which one is leading to A. ibis and the other is leading to A. modesta and A. intermedia,.
Maintenance of the duplicated regions
According to the TDRL model, one copy of each duplicated gene or CR usually degenerates and is eventually lost during the evolutionary process . However, persistent and stable duplications in mtDNA had been widely reported for several groups of vertebrates, such as fish [48, 49], frogs [20, 50], snakes [19, 51, 52], and birds [5, 7–15], and are usually interpreted to be concerted evolution. When duplications evolve in concert, paralogous sequences within individuals are more similar than orthologous sequences across closely related species [5, 47]. Likewise, in the ardeid mtDNAs with duplicate tRNAThr–CR gene order, we could observed that all duplications including the partial Cytb, tRNAThr, tRNAPro, ND6, tRNAGlu and CRs were nearly identical to each other within each individual, but very different between species (Additional file 3), suggesting that they were evolving in concert. In the mtDNAs of Ardea and I. cinnamomeus, the concerted evolution might have been stopped in some parts by large deletions, but still occurred between the duplicated CR.
The primary proposed mechanisms of concerted evolution include tandem replication slippage and gene conversion via general recombination [5, 19]. The tandem replication slippage sequentially homogenized sequences between duplications , therefore this mechanism cannot account for the observation in most ardeid species, where the homogenized duplications were interrupted by a divergent section nesting in the middle of CR domain I. In contrast, such uneven homogenized pattern could be easily created and maintained by gene conversion, similar to the previous reports in albatrosses [7, 12], the black-faced spoonbill , the ruff , boobies  and Philippine hornbills . In albatrosses, Abbott et al.  proposed that the gene conversion could have multiple recombination points, whereby certain duplicated portions were regularly homogenized while intervening sections remained unaffected and therefore evolved independently. In boobies and Philippine hornbills, Morris-Pocock et al.  and Sammler et al.  seperately suggested that the concerted evolution in those birds was well consistent with a novel gene conversion model outlined by Kumazawa et al.  if the intervening divergent section corresponds to putative Replication Fork Barrier (RFB) and the replication in avian mtDNA is initiated from multiple points. Based on this model, the gene conversion is mediated by RFB regions in CRs, and the duplicated CRs are easier to recombine than other duplicated portions, which resulting more stable of the homogenized CRs. This explanation matched with the observation in the ardeid species, where the duplicated CRs were maintained in all species while extra copies of different genes were degenerating or deleted in some species. The conflict with this model in the ardeid birds was no observation of the divergence between putative RFBs in both I. flavicollis and B. stellaris. To explain such conflict observed in an individual of the Luzon Tarictic Hornbill (Penelopides manillae), Sammler et al.  postulated a possible scenario that RFB itself might have been subjected to homogenization via strand exchange once the replication fork is erroneously not halted at the RFB in the evolutionary history.
In this study, the complete mtDNA of 16 ardeid birds were sequenced and four gene order types were found, including two that were newly discovered. These genomes strongly support the TDRL model of mt gene rearrangements and concerted evolution between duplications. Furthermore, based on the analysis of the phylogenetic relationships among the surveyed ardeid species using the concatenated nucleotide sequences of two rRNA and 12 protein coding genes, we found that our results were broadly congruent with previously studies using the DNA hybridization , Cytb  and vocalization . We also found that most of the nodes in phylogenetic study were supported by high PP and BP values, suggesting that complete mt genomics could be promising molecular markers for resolving deep phylogenetic relationships in ardeid birds.
Availability of supporting data
All supporting data is included as additional files. Complete mitochondrial DNA sequences have been deposited in NCBI GenBank under accession numbers KJ190944 to KJ190959.
Desjardins P, Morais R: Sequence and gene organization of the chicken mitochondrial genome. A novel gene order in higher vertebrates. J Mol Biol. 1990, 212 (4): 599-634.
Mindell DP, Sorenson MD, Dimcheff DE: Multiple independent origins of mitochondrial gene order in birds. Proc Natl Acad Sci U S A. 1998, 95 (18): 10693-10697.
Bensch S, Harlid A: Mitochondrial genomic rearrangements in songbirds. Mol Biol Evol. 2000, 17 (1): 107-113.
Haddrath O, Baker AJ: Complete mitochondrial DNA genome sequences of extinct birds: ratite phylogenetics and the vicariance biogeography hypothesis. Proc Biol Sci. 2001, 268 (1470): 939-945.
Eberhard JR, Wright TF, Bermingham E: Duplication and concerted evolution of the mitochondrial control region in the parrot genus Amazona. Mol Biol Evol. 2001, 18 (7): 1330-1342.
Haring E, Kruckenhauser L, Gamauf A, Riesing MJ, Pinsker W: The complete sequence of the mitochondrial genome of Buteo buteo (Aves, Accipitridae) indicates an early split in the phylogeny of raptors. Mol Biol Evol. 2001, 18 (10): 1892-1904.
Abbott CL, Double MC, Trueman JW, Robinson A, Cockburn A: An unusual source of apparent mitochondrial heteroplasmy: duplicate mitochondrial control regions in Thalassarche albatrosses. Mol Ecol. 2005, 14 (11): 3605-3613.
Gibb GC, Kardailsky O, Kimball RT, Braun EL, Penny D: Mitochondrial genomes and avian phylogeny: complex characters and resolvability without explosive radiations. Mol Biol Evol. 2007, 24 (1): 269-280.
Singh TR, Shneor O, Huchon D: Bird mitochondrial gene order: insight from 3 warbler mitochondrial genomes. Mol Biol Evol. 2008, 25 (3): 475-477.
Cadahia L, Pinsker W, Negro JJ, Pavlicev M, Urios V, Haring E: Repeated sequence homogenization between the control and pseudo-control regions in the mitochondrial genomes of the subfamily Aquilinae. J Exp Zool B Mol Dev Evol. 2009, 312B (3): 171-185.
Cho HJ, Eda M, Nishida S, Yasukochi Y, Chong JR, Koike H: Tandem duplication of mitochondrial DNA in the black-faced spoonbill. Platalea minor Genes Genet Syst. 2009, 84 (4): 297-305.
Eda M, Kuro-o M, Higuchi H, Hasegawa H, Koike H: Mosaic gene conversion after a tandem duplication of mtDNA sequence in Diomedeidae (albatrosses). Genes Genet Syst. 2010, 85 (2): 129-139.
Verkuil YI, Piersma T, Baker AJ: A novel mitochondrial gene order in shorebirds (Scolopacidae, Charadriiformes). Mol Phylogenet Evol. 2010, 57 (1): 411-416.
Morris-Pocock JA, Taylor SA, Birt TP, Friesen VL: Concerted evolution of duplicated mitochondrial control regions in three related seabird species. BMC Evol Biol. 2010, 10: 14-
Sammler S, Bleidorn C, Tiedemann R: Full mitochondrial genome sequences of two endemic Philippine hornbill species (Aves: Bucerotidae) provide evidence for pervasive mitochondrial DNA recombination. BMC Genomics. 2011, 12: 35-
Schirtzinger EE, Tavares ES, Gonzales LA, Eberhard JR, Miyaki CY, Sanchez JJ, Hernandez A, Mueller H, Graves GR, Fleischer RC, Wright TF: Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes. Mol Phylogenet Evol. 2012, 64 (2): 342-356.
Gibb GC, Kennedy M, Penny D: Beyond phylogeny: pelecaniform and ciconiiform birds, and long-term niche stability. Mol Phylogenet Evol. 2013, 68 (2): 229-238.
Boore JL: The duplication/random loss model for gene rearrangement exemplified by mitochondrial genomes of deuterostome animals. Computational biology series, vol.1. Edited by: Sankoff D, Dordrecht NJ. 2000, Netherlands: Kluwer Academic Publishers, 133-147.
Kumazawa Y, Ota H, Nishida M, Ozawa T: The complete nucleotide sequence of a snake (Dinodon semicarinatus) mitochondrial genome with two identical control regions. Genetics. 1998, 150 (1): 313-329.
Kurabayashi A, Sumida M, Yonekawa H, Glaw F, Vences M, Hasegawa M: Phylogeny, recombination, and mechanisms of stepwise mitochondrial genome reorganization in mantellid frogs from Madagascar. Mol Biol Evol. 2008, 25 (5): 874-891.
Kushlan JA, Hancock JA: Herons (Ardeidae) (Bird Families of the World). 2005, New York: Oxford University Press
Hackett SJ, Kimball RT, Reddy S, Bowie RC, Braun EL, Braun MJ, Chojnowski JL, Cox WA, Han KL, Harshman J, Huddleston CJ, Marks BD, Miglia KJ, Moore WS, Sheldon FH, Steadman DW, Witt CC, Yuri T: A phylogenomic study of birds reveals their evolutionary history. Science. 2008, 320 (5884): 1763-1768.
Zhou X, Wang Y, Chen X, Lin Q, Fang W, Wei D: PERMANENT GENETIC RESOURCES: A set of primer pairs for amplifying the complete mitochondrial DNA of endangered Chinese egret (Aves, Ardeidae, Egretta eulophotes). Mol Ecol Resour. 2008, 8 (2): 412-414.
Zhang L, Wang L, Gowda V, Wang M, Li X, Kan X: The mitochondrial genome of the Cinnamon Bittern, Ixobrychus cinnamomeus (Pelecaniformes: Ardeidae): sequence, structure and phylogenetic analysis. Mol Biol Rep. 2012, 39 (8): 8315-8326.
Nishibori M, Hayashi T, Tsudzuki M, Yamamoto Y, Yasue H: Complete sequence of the Japanese quail (Coturnix japonica) mitochondrial genome and its genetic relationship with related species. Anim Genet. 2001, 32 (6): 380-385.
Sorenson MD, Ast JC, Dimcheff DE, Yuri T, Mindell DP: Primers for a PCR-based approach to mitochondrial genome sequencing in birds and other vertebrates. Mol Phylogenet Evol. 1999, 12 (2): 105-114.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23 (21): 2947-2948.
Huang X, Miller W: A time-efficient, linear-space local similarity algorithm. Adv Appl Math. 1991, 12 (3): 373-381.
Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17 (4): 540-552.
Lanfear R, Calcott B, Ho SY, Guindon S: Partitionfinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol Biol Evol. 2012, 29 (6): 1695-1701.
Miller MA, Pfeiffer W, Schwartz T: Creating the CIPRES Science Gateway for inference of large phylogenetic trees. Proceedings of the Gateway Computing Environments Workshop (GCE): Nov 14, 2010. 2010, New Orleans, LA, 1-8.
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574.
Stamatakis A: RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014, 30 (9): 1312-1313.
Quinn TW, Wilson AC: Sequence evolution in and around the mitochondrial control region in birds. J Mol Evol. 1993, 37 (4): 417-425.
Ramirez V, Savoie P, Morais R: Molecular characterization and evolution of a duck mitochondrial genome. J Mol Evol. 1993, 37 (3): 296-310.
Ritchie PA, Lambert DM: A repeat complex in the mitochondrial control region of Adelie penguins from Antarctica. Genome. 2000, 43 (4): 613-618.
Ruokonen M, Kvist L: Structure and evolution of the avian mitochondrial control region. Mol Phylogenet Evol. 2002, 23 (3): 422-432.
Buehler DM, Baker AJ: Characterization of the red knot (Calidris canutus) mitochondrial control region. Genome. 2003, 46 (4): 565-572.
Bock WJ: A generic review of the family Ardeidae (Aves). Am Mus Novit. 1956, 1779: 1-49.
McCracken KG, Sheldon FH: Avian vocalizations and phylogenetic signal. Proc Natl Acad Sci U S A. 1997, 94 (8): 3833-3836.
Peters JL: Check-list of birds of the world, vol. 1. 1931, Cambridge, Massachusetts: Harvard University Press
McCracken KG, Sheldon FH: Molecular and osteological heron phylogenies: sources of incongruence. Auk. 1998, 115 (1): 127-141.
Payne RB, Risley CJ: Systematics and evolutionary relationships among the herons (Ardeidae). Misc Publ Mus Zool Univ Mich. 1976, 150: 1-115.
Sheldon FH: Phylogeny of herons estimated from DNA-DNA hybridization data. Auk. 1987, 104 (1): 97-108.
Sheldon FH, Jones CE, McCracken KG: Relative patterns and rates of evolution in heron nuclear and mitochondrial DNA. Mol Biol Evol. 2000, 17 (3): 437-450.
Bergsten J: A review of long‒branch attraction. Cladistics. 2005, 21 (2): 163-193.
Shao R, Barker SC, Mitani H, Aoki Y, Fukunaga M: Evolution of duplicate control regions in the mitochondrial genomes of metazoa: a case study with Australasian Ixodes ticks. Mol Biol Evol. 2005, 22 (3): 620-629.
Lee JS, Miya M, Lee YS, Kim CG, Park EH, Aoki Y, Nishida M: The complete DNA sequence of the mitochondrial genome of the self-fertilizing fish Rivulus marmoratus (Cyprinodontiformes, Rivulidae) and the first description of duplication of a control region in fish. Gene. 2001, 280 (1–2): 1-7.
Tatarenkov A, Avise JC: Rapid concerted evolution in animal mitochondrial DNA. Proc Biol Sci. 2007, 274 (1619): 1795-1798.
Sano N, Kurabayashi A, Fujii T, Yonekawa H, Sumida M: Complete nucleotide sequence of the mitochondrial genome of Schlegel’s tree frog Rhacophorus schlegelii (family Rhacophoridae): duplicated control regions and gene rearrangements. Genes Genet Syst. 2005, 80 (3): 213-224.
Kumazawa Y, Ota H, Nishida M, Ozawa T: Gene rearrangements in snake mitochondrial genomes: highly concerted evolution of control-region-like sequences duplicated and inserted into a tRNA gene cluster. Mol Biol Evol. 1996, 13 (9): 1242-1254.
Dong S, Kumazawa Y: Complete mitochondrial DNA sequences of six snakes: phylogenetic relationships and molecular evolution of genomic features. J Mol Evol. 2005, 61 (1): 12-22.
Sammler S, Ketmaier V, Havenstein K, Tiedemann R: Intraspecific rearrangement of duplicated mitochondrial control regions in the Luzon Tarictic Hornbill Penelopides manillae (Aves: Bucerotidae). J Mol Evol. 2013, 77 (5–6): 199-205.
We thank Fuming Lei (Institute of Zoology, Chinese Academy of Science), Wei Liang (Hainan Normal University), and Qing Chang (Nanjing Normal University), who provided some tissue samples for this study. We also thank Zen Wang, Yufei Dai, Wei Lei, Site Luo, Changfu Mao and Jing Lin for help in collecting some samples and Bo Wang and Kaijian Zhu for assistance in field work of this study. This research was funded by the National Natural Science Foundation of China (Grant # 30970380 to XC, 40876077 to WF, 31272333 to QL and 31000963 to XZ), the Fujian Natural Science Foundation of China (Grant # 2012 J01149 and 2010Y2007 to XC), and the Fundamental Research Funds for the Xiamen Universities (Grant # 2011121038 to XZ).
The authors declare that they have no competing interests.
XZ conceived, designed and performed the experiments, completed the analyses, wrote and revised the manuscript; XC and WZ directed, coordinated and conceived this study, helped with writing and revising the manuscript. QL collected the samples, assisted with analyses and lab work. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 3: Alignments of the duplicated CR within individuals of sixteen ardeid birds. Dots indicate identity of nucleotides to the reference sequence and dashes indicate gaps. The sequences of C stretch, TAS, F, E, D, C, BSB, B and CSB1 are highlighted in blue and are in bold. The sequences with long repeated motifs are highlighted in green and red. Eeul, E. eulophotes; Egar, E. garzetta; Esac, E. sacra; Abac, A. bacchus; Bstr, B. striatus; Nnyc, N. nycticorax; Bste, B. stellaris; Ifla, I. flavicollis; Ieur, I. eurhythmus; Isin, I. sinensis; Icin, I. cinnamomeus; Acin, A. cinerea; Apur, A. purpurea; Amod, A. modesta; Aint, A. intermedia; Aibi, A. ibis. (PDF 118 KB)
Additional file 4: Pair-wise alignments of subsections between the sizable no coding sections and corresponding functional genes within individuals of five species. NC1: no coding section between tRNAThr and tRNAGlu1 or CR1; NC2: no coding section between CR1 and tRNAPro. Numerals in brackets indicate the positions of the aligned sequences. Vertical lines indicate boundary of the gene. Dots indicate identity of nucleotides to the reference sequence; dashes indicate gaps. (PDF 31 KB)
About this article
Cite this article
Zhou, X., Lin, Q., Fang, W. et al. The complete mitochondrial genomes of sixteen ardeid birds revealing the evolutionary process of the gene rearrangements. BMC Genomics 15, 573 (2014). https://doi.org/10.1186/1471-2164-15-573
- Mitochondrial genome
- Gene rearrangement
- Concerted evolution