The complete mitochondrial genome of Flustra foliacea (Ectoprocta, Cheilostomata) - compositional bias affects phylogenetic analyses of lophotrochozoan relationships
© Nesnidal et al; licensee BioMed Central Ltd. 2011
Received: 6 May 2011
Accepted: 23 November 2011
Published: 23 November 2011
The phylogenetic relationships of the lophophorate lineages, ectoprocts, brachiopods and phoronids, within Lophotrochozoa are still controversial. We sequenced an additional mitochondrial genome of the most species-rich lophophorate lineage, the ectoprocts. Although it is known that there are large differences in the nucleotide composition of mitochondrial sequences of different lineages as well as in the amino acid composition of the encoded proteins, this bias is often not considered in phylogenetic analyses. We applied several approaches for reducing compositional bias and saturation in the phylogenetic analyses of the mitochondrial sequences.
The complete mitochondrial genome (16,089 bp) of Flustra foliacea (Ectoprocta, Gymnolaemata, Cheilostomata) was sequenced. All protein-encoding, rRNA and tRNA genes are transcribed from the same strand. Flustra shares long intergenic sequences with the cheilostomate ectoproct Bugula, which might be a synapomorphy of these taxa. Further synapomorphies might be the loss of the DHU arm of the tRNA L(UUR), the loss of the DHU arm of the tRNA S(UCN) and the unique anticodon sequence GAG of the tRNA L(CUN). The gene order of the mitochondrial genome of Flustra differs strongly from that of the other known ectoprocts. Phylogenetic analyses of mitochondrial nucleotide and amino acid data sets show that the lophophorate lineages are more closely related to trochozoan phyla than to deuterostomes or ecdysozoans confirming the Lophotrochozoa hypothesis. Furthermore, they support the monophyly of Cheilostomata and Ectoprocta. However, the relationships of the lophophorate lineages within Lophotrochozoa differ strongly depending on the data set and the used method. Different approaches for reducing heterogeneity in nucleotide and amino acid data sets and saturation did not result in a more robust resolution of lophotrochozoan relationships.
The contradictory and usually weakly supported phylogenetic reconstructions of the relationships among lophotrochozoan phyla based on mitochondrial sequences indicate that these alone do not contain enough information for a robust resolution of the relations of the lophotrochozoan phyla. The mitochondrial gene order is also not useful for inferring their phylogenetic relationships, because it is highly variable in ectoprocts, brachiopods and some other lophotrochozoan phyla. However, our study revealed several rare genomic changes like the evolution of long intergenic sequences and changes in the structure of tRNAs, which may be helpful for reconstructing ectoproct phylogeny.
Molecular systematics has dramatically changed the ideas about the phylogenetic relationships of the lophophorate lineages, ectoproct bryozoans, brachiopods and phoronids. Once considered the sister or paraphyletic stem-group of Deuterostomia based on embryological and morphological characters [1–5], molecular analyses almost unequivocally place them in Lophotrochozoa, a group established to accommodate the lophophorate lineages along with trochozoans, Annelida, Mollusca and relatives . These analyses rely on a spectrum of molecular data ranging from rDNA sequences [6–16], mitochondrial protein sequences [17–19], single nuclear protein-encoding genes [20, 21], Hox genes [22, 23], multiple nuclear protein-encoding sequences [24, 25] and phylogenomic approaches [26–34].
However, the relationships of the lophophorate lineages within Lophotrochozoa are still controversial, because analyses of different data sets resulted in conflicting topologies. Whereas analyses of rDNA data sets [7, 8, 14–16, 35, 36] and phylogenomic data sets [27, 29, 34] strongly support Brachiozoa, a clade including Brachiopoda and Phoronida, phylogenetic analyses of mitochondrial sequences most often indicated sister group relationships between Brachiopoda and Ectoprocta [19, 37–39] and between Phoronida and Entoprocta [37–39]. Thus, these analyses also contradict the monophyly of Bryozoa (= Polyzoa) including Ectoprocta and Entoprocta (and Cycliophora, of which no mitochondrial genome is available so far) as postulated based on morphological arguments by Nielsen [40, 41] and found in some recent analyses of phylogenomic data sets [26, 29–34] and of rDNA data sets [14–16], albeit with poor nodal support.
There is evidence that the inference of the relationships of the lophophorate lineages in phylogenomic analyses might be affected by systematic errors resulting from compositional bias . One possibility to check for systematic errors in phylogenetic analyses is the comparison of the results based on independent data sets. Therefore, we analysed a mitochondrial data set in this study and compared the phylogenetic results with those of phylogenomic analyses, in which no or only few mitochondrial data have been considered. We sequenced an additional mitochondrial genome of the most species-rich lophophorate lineage, the ectoprocts. Because there are large differences in the nucleotide composition of mitochondrial sequences of different lineages as well as in the amino acid composition of the encoded proteins [42–48], we applied several approaches for reducing compositional bias in the phylogenetic analyses. We reduced the compositional heterogeneity by excluding third codon positions from the nucleotide data set, by excluding taxa with strongly deviating amino acid composition and by recoding amino acids in bins. As an alternative to reducing compositional heterogeneity in the data, we applied phylogenetic inference methods with nonstationary models of evolution. Finally, we tried to mitigate saturation and long-branch-attraction problems by excluding fast evolving sites.
Results and Discussion
Organization of the mitochondrial genome of the ectoproct Flustra foliacea
Transfer RNA genes
A second putative tRNA gene for tryptophan as found here in Flustra foliacea (Figure 2) has neither been found in the other known mitochondrial genomes of ectoprocts nor in most other animal mitochondrial genomes. There is no similarity between the sequence of this putative tRNA gene and any of the other tRNA genes in the mitochondrial genome of Flustra. It is proximate to the major non-coding region. We cannot exclude the possibility that it is functionally part of the control region. Nevertheless, its structure is very similar to a tRNA and it is likely that it is at least derived from a tRNA. The two leucine and one of the serine tRNAs lack a DHU arm. The DHU arm of the tRNA L(UUR) is also missing in the cheilostomate Bugula, but not in the cheilostomate Watersipora and the ctenostomate Flustrellidra, whereas the DHU arm of the tRNA L(CUN) is also missing in Flustrellidra, but not in Bugula and Watersipora. Given the relations of these taxa, the loss of the DHU arm of the tRNA L(UUR) might be a synapomorphy of the lineages leading to Flustra and Bugula, whereas the loss of the DHU arm of the tRNA L(CUN) occurred most likely independently in Flustra and Flustrellidra. The DHU arm of the tRNA S(UCN) is also missing in Bugula, but not in Watersipora and might be another synapomorphy of the lineages leading to Flustra and Bugula. This tRNA has not been found in Flustrellidra.
The inferred anticodons of 21 tRNAs of Flustra foliacea (Figure 2) are the same as those in Bugula neritina. Only the anticodon of the tyrosine tRNA differs between Flustra and Bugula. The anticodon of tyrosine tRNA is GUA in Flustra, but AUA in Bugula. Because the anticodon of the Watersipora and Flustrellidra tyrosine tRNAs is also GUA, the change to AUA is probably an autapomorphy of the lineage leading to Bugula. The anticodon of the tRNA L(CUN) of Flustra and Bugula is GAG. This has not been found in any other metazoan so far. In Watersipora and Flustrellidra the anticodon of the tRNA L(CUN) is UAG. Thus, the sequence GAG may represent a unique synapomorphy of the lineages leading to Bugula and Flustra.
The most conserved region of all tRNAs is the anticodon stem and loop region (Figure 2). All other tRNA regions have a high level of variation within Ectoprocta. Especially the TΨC arm is highly variable. The tRNAs with the highest nucleotide conservation across the four ectoprocts are Gly, His, Ile, Met, Phe, Pro and Trp1. Less conserved tRNAs are Arg, Asn, Ser2, Lys, Thr and Trp2. As expected from the phylogenetic relationships, the tRNA sequences of Flustra are most similar to those of the cheilostomate entoprocts Watersipora (p-distance based on all concatenated tRNAs equals 0.357) and Bugula (p-distance 0.361), whereas those of the cheilostomate Flustrellidra are more dissimilar (p-distance 0.461).
Comparison of mitochondrial gene order
Breakpoint distance matrix between orders of mitochondrial protein coding genes and rDNAs of representatives of ectoprocts, entoprocts, brachiopods, phoronids, and molluscs.
Flustra foliacea (Ff)
Bugula neritina (Bn)
Watersipora subtorquata (Ws)
Flustrellidra hispida (Fh)
Loxosomella aloxiata (La)
Terebratulina retusa (Tr)
Laqueus rubellus (Lr)
Lingula anatina (Lia)
Phoronis psammophila (Pp)
Katharina tunicata (Kt)
Nucleotide composition and codon usage
Nucleotide composition and AT-and CG-skews of mitochondrial genomes.
Phoronis psammophila a
Lingula anatina b
Nucleotide composition and AT- and GC-skews of the mitochondrial protein-encoding and ribosomal RNA genes and the entire Flustra foliacea genome.
Proportion of nucleotides
Protein coding sequences
1st codon position
2nd codon position
3rd codon position
There is a high variation in AT- and GC-skews in metazoan mitochondrial genomes. In our data set AT-skews range from -0.430 (Echinococcus) to 0.209 (Trichinella) (Table 2). The range of GC-skews extends from -0.412 (Nautilus) to 0.515 (Echinococcus) (Table 2). Compared with other ectoprocts, Flustra foliacea is characterized by high AT- and GC-skews (Table 2). Among lophophorates, similarly high AT- and GC-skews have been found only in some brachiopods (Laqueus, Terebratalia). Nine genes of Flustra (atp6, cox3, cob, nad1, nad3, nad4, nad 4L, nad5, nad6) have an AT-skew higher than 0.3 and ten genes (atp8, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, nad6) have a GC-skew higher than 0.3 (Table 3). The GC-skew is positive for all 13 protein-encoding and the two ribosomal RNA genes, whereas the AT-skew is positive for all 13 protein-encoding genes, but negative for the two ribosomal RNA genes in Flustra foliacea mitochondria (Table 3).
Four-fold degenerate codon usage is A/T biased in the third position, and T is the preferred nucleotide (Additional file 1). T is also the preferred nucleotide in two-fold degenerate codons ending in T or C. The codon usage is less biased in two-fold degenerate codons ending in A or G, with A predominating in Leu1, Lys and Met, and G predominating in Gln, Glu, Trp and the termination codons.
Phylogenetic analyses of the relationships of the lophophorate lineages
Phylogenetic relationships of ectoprocts, brachiopods and phoronids according to different phylogenetic analyses (only sister group relationships with one other phylum; more complex relationships are not considered).
Maximum-likelihood (MtZoa+F model)
Amino acid data set, with Lingula
Additional file 2
Maximum-likelihood (GTR model)
Nucleotide data set
Additional file 3
Maximum-likelihood (GTR model)
Nucleotide data set (Gblocks edited)
Additional file 4
Maximum-likelihood (GTR model)
Nucleotide data set (direct nucleotide alignment)
Additional file 5
Maximum-likelihood (MtZoa+F model)
Amino acid data set
Additional file 6
Maximum-likelihood (MtZoa+F model)
Amino acid data set (Gblocks edited)
Additional file 7
Maximum-likelihood (GTR model)
1st and 2nd codon positions
Nucleotide data set; starting tree GTR tree
Additional file 8
Nucleotide data set; starting tree CAT tree
Additional file 9
Bayesian (CAT model)
Amino acid data set
Bayesian (CAT model)
Amino acid data set; 10 taxa with the most strongly differing amino acid composition excluded
Additional file 11
Maximum-likelihood (MtZoa+F model)
Amino acid data set; 10 taxa with the most strongly differing amino acid composition excluded
Additional file 12
Bayesian (CAT model)
Amino acid data set recoded using 9 minmax chi-squared bins'
Additional file 14
Maximum-likelihood (MULTIGAMMA model)
Amino acid data set recoded using 9 minmax chi-squared bins
Additional file 15
Bayesian (CAT model)
Amino acid data set recoded using 6 minmax chi-squared bins
Additional file 16
Maximum-likelihood (MULTIGAMMA model)
Amino acid data set recoded 6 minmax chi-squared bins
Additional file 17
Bayesian (CAT model)
Amino acid data set recoded using Dayhoff groups
Additional file 18
Maximum-likelihood (MULTIGAMMA model)
Amino acid data set recoded Dayhoff groups
Additional file 19
Bayesian (CAT+BP model)
Amino acid data set
Additional file 20
Maximum-likelihood (GTR model)
Nucleotide data set, 20% of the alignment positions with highest sitewise rates removed
Additional file 21
Maximum-likelihood (MtZoa+F model)
Amino acid data set; 10% of the alignment positions with highest sitewise rates removed
Additional file 22
Initially, we included all completely sequenced mitochondrial genomes of lophophorate lineages in the phylogenetic analysis (Additional file 2). However, the mitochondrial genes of the brachiopod Lingula are generally longer and deviate considerably in sequence from their orthologs in other animals . Therefore, these sequences introduced ambiguities into the alignments. Thus, we excluded this taxon from all further phylogenetic analyses.
The newly sequenced cheilostomate ectoproct Flustra clusters in all analyses with the two other included cheilostomate ectoprocts Bugula and Watersipora. Ectoprocta is also monophyletic in all analyses. In the majority of the analyses Flustra is sister group to Bugula. Only in some analyses Bugula is sister taxon to Watersipora instead. A closer relationship of Bugula to Flustra than to Watersipora (or other Lepraliomorpha, to which Watersipora belongs) is also supported by the presence of long intergenic sequences and the structure of some tRNAs in these taxa (see above) and by phylogenetic analyses based on 18S rDNA, 28S rDNA and cox1 sequences .
The lophophorate lineages are usually more closely related to trochozoan phyla than to deuterostomes or ecdysozoans confirming the Lophotrochozoa hypothesis. Only in a few of the analyses, ectoprocts cluster with a long-branch group including platyhelminths, nematodes and chaetognaths. However, the sister group relationships of the lophophorate lineages within Lophotrochozoa differ strongly depending on the data set, method and evolutionary model (Table 4). The different sister group relationships are not strongly supported by the data and may be affected by stochastic as well as systematic errors. Surprisingly, a sister group relationship between Ectoprocta and Brachiopoda as reconstructed in several other analyses of mitochondrial sequences [19, 37–39] was not recovered in any of our analyses. The same applies to the previously proposed sister group relationship between Ectoprocta and Chaetognatha [19, 37, 39, 49]. These vagaries indicate that there is no robust phylogenetic signal for such relationships in the mitochondrial sequences.
In the maximum likelihood tree (Additional file 3) calculated based on the nucleotide alignment derived from the amino acid alignment and edited with ALISCORE [53, 54] comprising 12,648 positions of 49 taxa using the GTR model implemented in RAxML, a sister group relationship between brachiopods and annelids is comparatively well-supported (86% bootstrap value). In this as well as in several of the following analyses platyhelminths, nematodes and chaetognaths, all of them characterized by high substitution rates, form a monophylum, so that neither Ecdysozoa nor Lophotrochozoa are monophyletic. Such long branch artefacts have also been found in most other phylogenetic analyses of mitochondrial nucleotide and amino acid sequences (e.g., [32, 38, 39, 55]). The topology of the maximum likelihood tree based on the nucleotide alignment edited with Gblocks  (including 6,839 positions) differs from that based on the alignment edited with ALISCORE only with regard to nodes that are not well supported in any of the trees (Additional file 4). The topology of the maximum likelihood tree based on a direct nucleotide alignment (edited with ALISCORE; including 12,648 positions; Additional file 5) does not differ from that based on the nucleotide alignment derived from the amino acid alignment in any strongly supported nodes.
Evaluation of compositional heterogeneity of mitochondrial nucleotide sequences and phylogenetic analyses accounting for it
A chi-square test indicates that the nucleotide composition of the used mitochondrial nucleotide sequences is significantly heterogeneous between lineages (chi-square = 23,209 (df = 144), P = 0.000). This is confirmed by the matched-pairs tests of symmetry, according to which 99.6% of the pairwise comparisons show significant (P < 0.050) heterogeneity. Although the nucleotide composition is heterogeneous at all codon positions, it is less pronounced at the first (chi-square = 5,814 (df = 144), P = 0.000; 97.5% significantly heterogeneous pairs) and second (chi-square = 2,990 (df = 144), P = 0.000; 90.7% significantly heterogeneous pairs) than at the third codon positions (chi-square 24,521 (df = 144), P = 0.000; 99.3% significantly heterogeneous pairs).
A maximum likelihood analysis based on the first and second codon positions only resulted in a reduction of the support for a brachiopod-annelid sister group relationship (Figure 5B), indicating that this grouping might be an artefact resulting from compositional bias.
Alternatively, we accounted for the compositional heterogeneity in the nucleotide sequences by using the nonstationary model implemented in nhPhyML-Discrete. This analysis requires a starting tree, for which we used the maximum likelihood tree obtained with the nucleotide data set and the GTR model as well as the Bayesian inference tree based on the amino acid sequences obtained with the CAT model (see below). The two analyses resulted in strongly different topologies (Additional file 8, 9). The tree obtained with the starting tree based on the nucleotide data set and the GTR model had a slightly higher likelihood (loglk = -375,007) than the tree obtained with the starting tree based on the amino acid data set (loglk = -375,103). In the latter platyhelminths are included in Lophotrochozoa and phoronids are sister group of ectoprocts, whereas in the former platyhelminths are the sister group of nematodes and Phoronis is nested in Nemertea.
Evaluation of compositional heterogeneity of mitochondrial amino acid sequences and phylogenetic analyses accounting for it
Results of posterior predictive tests indicating the ability of different approaches to reduce compositional bias in mitochondrial amino acid data sets.
Number of taxa with significantly deviating amino acid composition
Original data set
Exclusion of the 10 taxa with the most strongly differing amino acid composition
Recoding using 9 minmax chi-squared bins
Recoding using 6 minmax chi-squared bins
Recoding using Dayhoff groups
One approach to reduce the compositional heterogeneity of the data set is the exclusion of taxa with strongly deviating amino acid composition. Obviously, not all 40 taxa with significantly deviating amino acid composition can be removed from the phylogenetic analysis. After excluding the ten taxa with the most strongly deviating amino acid composition from the calculations (Additional files 11, 12), the CAT model is still significantly violated (global Z score 7.308; Table 5; Additional file 10) and the test statistic for individual taxa indicates that the amino acid composition of 32 taxa is significantly deviating. Remarkably, Ectoprocta and Entoprocta form a monophylum, Bryozoa, in the maximum likelihood tree based on the reduced data set as in some analyses of phylogenomic [26, 27, 29–34] and rDNA data sets [14–16], albeit with no nodal support (Additional file 12).
Another approach for reducing compositional heterogeneity is recoding of amino acids in bins. We determined bins that minimize compositional heterogeneity with the minmax method described by Susko and Roger . Whereas the minimum P values for 10 or more bins are smaller than 0.05 (Additional file 13), the minimum P value for 9 minmax chi-squared bins (D, PV, AIMSY, GFT, L, NH, W, RCQK, E) is 0.112, which indicates that compositional homogeneity cannot be rejected for these bins according to the chi-square test. However, a posterior predictive test shows that the compositional heterogeneity has not been reduced (global Z score 8.690) and that the CAT model is still significantly violated (Table 5; Additional file 10) if the amino acid sequences of the mitochondrial proteins were recoded using these bins. This contradiction between the results of the chi-square test and the posterior predictive test might be explained by the fact that the chi-square test does not consider correlation due to relatedness of the taxa on a tree or by the biasing effect of invariable sites on this test [58, 59]. A reduction of the categories to 6 minmax chi-squared bins resulted only in a minor reduction of the compositional heterogeneity (global Z score 7.196; Table 5; Additional file 10) despite the minimum P value for 6 bins (GFTW, AHILMSY, NPV, E, D, RCQK) being 0.21 according to the chi-square test.
Alternatively, we recoded the amino acid data into the six groups of amino acids (AGPST, C, DENQ, FWY, HKR, ILMV) that tend to replace one another . A posterior predictive test showed that the compositional heterogeneity even increased (global Z score 11.285) compared to the unrecoded data set (Table 5; Additional file 10).
The phylogenetic analyses of recoded data sets (Additional files 14, 15, 16, 17, 18, 19) yielded again contradictory results concerning the relationships of the lophophorate lineages (Table 4). None of the possible relationships of the lophophorate lineages is strongly supported.
We analysed the amino acid sequences also with a non-stationary model of sequence evolution by performing a Bayesian analysis with the CAT-BP model as implemented in the program nhPhyloBayes . We started 16 chains with the mitochondrial amino acid data set. The mean number of breakpoints N, at which the amino acid composition changes, varied between 34 and 47. Because the prior on N used in the CAT-BP model is conservative, an N as high as observed in our analysis confirms that there is compositional bias in the data. The high number of breakpoints reflects the result of the posterior predictive test that 40 taxa belonging to several different clades have amino acid compositions that significantly deviate from the assumptions of the CAT model (Additional file 10). Despite almost nine weeks of calculation for each chain on a 2.8 GHz processor no convergence of the chains was achieved. A consensus of all chains is shown for illustrative purposes (Additional file 20). Lophotrochozoa including Platyhelminthes is monophyletic, but the relationships between lophotrochozoan phyla are largely unresolved.
Phylogenetic analyses accounting for saturation
Finally, we tried to mitigate saturation and long-branch-attraction problems by excluding fast evolving sites. We removed 20% of the positions with high rates from the nucleotide alignment (10,118 nucleotides remaining) and 10% of the amino acid alignment positions (2,456 amino acid remaining). Despite the exclusion of the fastest evolving sites, the long-branch group including platyhelminths, nematodes and chaetognaths could not be broken up (Additional file 21, 22) and the relationships between the lophotrochozoan phyla could not be resolved more robustly. However, there is strong support (98% bootstrap probability) for a sister group relation between brachiopods and annelids in the tree based on the nucleotide data set.
Altogether, the results obtained in the phylogenetic analyses of the mitochondrial nucleotide and amino acid sequences are contradictory and weakly supported by the data (Table 4). Most of the results concerning the phylogenetic relationships of the lophophorate lineages are in strong contrast to the results of recent phylogenomic analyses [26, 27, 29–31, 33, 34] and phylogenetic analyses of nuclear rDNA [14–16] that support the monophyly of Bryozoa (= Polyzoa) including Ectoprocta and Entoprocta as well as the monophyly of Brachiozoa including Brachiopoda and Phoronida. Jang and Hwang  showed that a topology test based on mitochondrial amino acid data rejects both, Brachiozoa and Bryozoa. Thus, the differences between the phylogenetic results based on mitochondrial data and the phylogenomic analysis based mainly or exclusively on nuclear data cannot be attributed to stochastic errors alone. The posterior predictive tests indicate that the phylogenetic analyses of the mitochondrial amino acid sequences are strongly affected by compositional bias, a systematic error source that is not taken into account by topology tests. Thus, the apparent contradiction between the phylogenetic results based on mitochondrial amino acid data and the phylogenomic analyses may be due to compositional bias. This is supported by the results of the approaches to reduce compositional heterogeneity in the data sets respectively the analyses with non-stationary models (Table 4). Although Bryozoa including Ectoprocta and Entoprocta were rejected in the topology tests performed by Jang and Hwang  based on mitochondrial amino acid data, Bryozoa was found in our maximum likelihood analysis with the MtZoa+F model with the 39 taxa set, albeit with no nodal support (Additional file 12).
Phylogenetic analyses of nuclear protein sequence data of Metazoa are also affected by compositional bias [34, 62]. However, none of several approaches accounting for this bias supported a sister group relationship between Ectoprocta and Brachiopoda or between Phoronida and Entoprocta  as did some of the phylogenetic analyses of mitochondrial data ([19, 37–39]; Table 4).
The weak support for relationships between phyla in the analyses based on the mitochondrial data (Table 4) indicates that the information content of the mitochondrial sequence data set, which is almost one magnitude smaller than current phylogenomic data sets, is insufficient for a robust resolution of the divergences of the lophotrochozoan phyla (see also [19, 38]). In addition, the strong compositional bias in the mitochondrial data (Table 5; Additional file 10) complicates phylogenetic analyses of these data. The high variability of the gene order in some lophotrochozoan phyla like ectoprocts, brachiopods or molluscs undoes the hope that this character set may help to disentangle the relationships between lophotrochozoan phyla. With current methods and evolutionary models mitochondrial genome data can contribute little to resolving the relationships of the lophotrochozoan phyla.
However, our study revealed several rare genomic changes like the loss of the DHU arm and changes of the anticodon sequence of tRNAs and the evolution of long intergenic sequences, that may be helpful for reconstructing ectoproct phylogeny more robustly in future studies.
A sample of Flustra foliacea (Ectoprocta, Gymnolaemata) was obtained from the Biologische Anstalt Helgoland (Germany) and conserved at -70°C. Total genomic DNA was extracted with the QIAamp DNA Mini kit (Qiagen, Hilden, Germany) following the manufacturer's instructions for tissue.
PCR amplification, cloning and sequencing
Mitochondrial sequence fragments of the genes nad1, nad2, nad3, nad4, nad5, nad6, cob, cox1, cox2, cox3, atp6, and rrnS from an EST library of Flustra foliacea (Hausdorf et al., 2007) were used to design specific primers (Additional file 23). The complete mitochondrial genome of Flustra foliacea was amplified with these primers. All PCRs were done in an Eppendorf Mastercycler Gradient thermocycler. PCRs were carried out in 50 μl volumes (33.75 μl water, 10 μl 5× amplification buffer (Promega, Mannheim, Germany), 2 μl MgCl2 solution (25 mM), 0.25 μl Taq polymerase (5 U/μl), 1 μl dNTP mixture (25 mM each), 1 μl template DNA, 2 μl primer mixture (10 μM each)) using GoTaq polymerase (Promega, Mannheim, Germany). To minimize replication errors, proof-reading Pwo polymerase (Roche, Mannheim, Germany) was added to the reaction mix. Cycling conditions were as follows: 94°C for 120 s for initial denaturation, 35 cycles of 94°C for 30 s, 45-55°C for 30 s, 72°C for 180 s, followed by 72°C for 420 s for final elongation. If amplifications were not successful, DNA fragments were amplified with the PCR Extender System (5Prime, Darmstadt, Germany) in 50 μl volumes (38.1 μl water, 5 μl 10× tuning buffer, 0.4 μl PCR Extender Polymerase Mix, 2.5 μl dNTP mixture, 2 μl template DNA, 2 μl primer mixture (10 μM each)) under the following long PCR conditions: 93°C for 180 s for initial denaturation, 10 cycles of 93°C for 15 s, 55-62°C for 30 s, 68°C for 900 s, 20 cycles of 93°C for 15 s; 55-62°C for 30 s, 68°C for 900 s plus 20 s for each cycle. PCR fragments were excised from agarose gel and purified with the NucleoSpin Extract II kit (Macherey-Nagel, Düren, Germany). Dependent on the band intensity on the agarose gel, DNA was eluted in 20-50 μl elution buffer or ddH2O and stored at -20°C. Each purified fragment was ligated into the pCR2.1-TOPO cloning vector (Invitrogen, Karlsruhe, Germany) and transformed into Escherichia coli TOP10 cells (Invitrogen, Karlsruhe, Germany). Clones containing inserts of the correct size were sequenced on an automatic capillary sequencer. Large inserts were sequenced by primer walking (sequences available on request).
Sequence assembly and annotation
Sequence assembly was done with SeqMan (DNASTAR, Madison, WI). The average coverage of the genome by sequenced clones or EST contigs was 2.4×. Protein-encoding and ribosomal RNA genes were identified by BLAST (blastn, tblastx) searches of NCBI databases and by using the MITOS WebServer BETA (http://bloodymary.bioinf.uni-leipzig.de/mitos/index.py). Start and end positions of rRNA genes and MNCR were determined by boundaries of adjacent genes. The tRNA genes were detected via class-specific co-variance models using the MITOS WebServer BETA. Complementarily, tRNAscan-SE  and ARWEN  were used. The sequence data was deposited in GenBank with the accession number JQ061319. We used CRex  to analyse gene order data. GC- and AT-skew was calculated by using the formula of Perna and Kocher .
For phylogenetic analyses, we focused the taxon sampling (Additional file 24) on lophotrochozoan taxa. We assembled complete or nearly complete mitochondrial genomes available from members of the phyla Ectoprocta (4 species), Brachiopoda (4), Phoronida (1), Entoprocta (2), Nemertea (2), Chaetognatha (2) and selected 8 representative mitochondrial genomes from molluscs, 5 from annelids and 3 from platyhelminths. We added 12 ecdysozoan and 5 deuterostome species as well as 2 cnidarian taxa as outgroups.
The amino acid sequences of the mitochondrial protein-encoding genes of the selected taxa were individually aligned by the L-INS-i algorithm implemented in MAFFT [66, 67]. Because it is preferable to take the amino acid level into account during alignment of protein-coding DNA, the aligned amino acid sequences were used as a scaffold for constructing the corresponding nucleotide sequence alignment using RevTrans 1.4 . For comparison, the nucleotide sequences were aligned directly. We identified randomly similar sections in each gene alignment with ALISCORE [53, 54] on the nucleotide and amino acid level using default settings and maximal number of pairwise comparisons. In total, 15% of originally 14,968 nucleotide positions and 39% of originally 4,452 amino acid positions were excluded using ALICUT (http://www.utilities.zfmk.de) to increase the signal-to-noise ratio. The final alignments, spanning 12,648 nucleotide respectively 2,729 amino acid positions, were attained by concatenating all processed alignments. Alternatively to the ALISCORE evaluation of the sequences, we used Gblocks  with low stringency parameters (minimum block length 5; allowed gap positions with half) for eliminating poorly aligned positions and divergent regions resulting in concatenated alignments spanning 6,839 nucleotide respectively 1,862 amino acid positions. The final alignments have been deposited at TreeBASE and can be accessed at http://purl.org/phylo/treebase/phylows/study/TB2:S10996. Alignments with reduced taxa sets were obtained by removing taxa from the complete alignments. Unless otherwise noted, the alignments edited with ALISCORE were used.
Phylogenetic analyses and evaluation of model violation caused by compositional heterogeneity
We checked the homogeneity of nucleotide frequencies across taxa using the chi-square test implemented in PAUP* 4.0 beta 10 . However, this test ignores correlation resulting from phylogenetic structure. Therefore, we also measured the probability that the base composition of two sequences is homogeneous for each pair of sequences using the matched-pairs test of symmetry as implemented in SeqVis version 1.4 .
We performed maximum likelihood analyses using a parallel Pthreads-based version  of RAxML, version 7.2.8 . We used the GTR model for nucleotide sequences, the MtZoa+F model  for amino acid sequences, and the MULTIGAMMA model for recoded amino acid data (see below). Using a modified perl script for model selection based on likelihood calculations with RAxML (available from http://icwww.epfl.ch/~stamatak/index-Dateien/software/ProteinModelSelection.pl), the MtZoa+F model  was selected for amino acid sequences. Rate heterogeneity among sites was modelled using the gamma model. Confidence values for edges of the maximum likelihood tree were computed by rapid bootstrapping  (100 replications).
We performed Bayesian inference analyses of the amino acid sequences with the CAT model that adjusts for site-specific amino acid frequencies  as implemented in PhyloBayes version 3.2f (http://megasun.bch.umontreal.ca/People/lartillot/www/download.html). Eight independent chains were run for each analysis. The number of points of each chain, the number of points that were discarded as burn-in, and the largest discrepancy observed across all bipartitions (maxdiff) are listed in Additional file 25. Taking every tenth sampled tree, a 50%-majority rule consensus tree was computed using all chains.
We evaluated in how far the assumptions of the CAT model are violated by using posterior predictive tests. In posterior predictive tests the observed value of a given test statistic on the original data is compared with the distribution of the test statistic on data replicates simulated under the reference model using parameter values drawn from the posterior distribution (every tenth sampled tree). The reference model is rejected for that statistic if the observed value of the test statistic deviates significantly. We used two test statistics measuring compositional heterogeneity implemented in PhyloBayes. One measures the compositional deviation of each taxon by summing the absolute differences between the taxon-specific and global empirical frequencies over the 20 amino acids. This test statistic indicates which taxa deviate significantly, but raises a multiple-testing issue. Alternatively, the maximum deviation across taxa was used as a global statistic.
Approaches for reducing the potential impact of compositional bias
Because the third codon positions show the strongest compositional heterogeneity (see results) and because these positions become saturated first because of their higher substitution rates, we tried to reduce the potential impact of systematic errors on phylogenetic inference by excluding the third codon positions from the nucleotide data set.
We applied two approaches to reduce compositional heterogeneity in the amino acid data set. First, we excluded the taxa with the most strongly deviating amino acid composition as indicated by the posterior predictive test and repeated the Bayesian inference analysis as described. Secondly, we recoded the amino acid data into groups. Susko and Roger  developed an algorithm for constructing bins of amino acids in order to minimize compositional heterogeneity for a given alignment by minimizing the maximum chi-squared statistic for a taxon of the data set. We used the program minmax-chisq (http://www.mathstat.dal.ca/tsusko/software.cgi) to obtain these minmax chi-squared bins for the mitochondrial amino acid data set. In order to lose as little information as possible, we chose the largest number of bins for which the minimum P value is larger than 0.05, which indicates that compositional homogeneity cannot be rejected for this set of bins according to the chi-square test. Alternatively, we recoded the amino acid data into the six groups of amino acids (AGPST, C, DENQ, FWY, HKR, ILMV) that tend to replace one another .
As alternative to the approaches for reducing compositional heterogeneity in the data set, we used nonstationary models of evolution in phylogenetic inference analyses. We analysed the nucleotide data set using the nonstationary model of evolution developed by Galtier and Gouy  as implemented in nhPhyML-Discrete , limited to 3 base content frequency categories and with 8 categories for a discrete gamma model of among-site rate variation. Based on the amino acid data set, we performed a Bayesian analysis with the CAT-BP model  as implemented in nhPhyloBayes (http://www.lirmm.fr/mab/blanquart/), which accounts for compositional heterogeneity between lineages by introducing breakpoints along the branches of the phylogeny at which the amino acid composition is allowed to change. Sixteen independent chains were run for 10,000 points. Stationarity of the posterior probabilities of all chains were reached during the first 2,000 points. Thus, 2,000 points were discarded as burn-in for all chains. Taking every tenth sampled tree, a 50%-majority rule consensus tree was computed.
Approaches for reducing the potential impact of saturation and long-branch attraction
To mitigate the potential impact of saturation and long-branch attraction, we excluded the fastest evolving sites as determined by Treefinder, version of October 2008 [78, 79]. An appropriate model for nucleotide respectively protein evolution was determined with the 'propose model' option of Treefinder based on the Akaike Information Criterion with a correction term for small sample size. According to this criterion the GTR model with gamma-distributed rates was chosen for the nucleotide data set and a mixed model that is a linear combination of 14 empirical models of protein evolution and considering among-site rate variation with a five-category discrete gamma-distribution for rates was chosen for the amino acid data set. With the data sets and these models maximum likelihood trees were calculated with Treefinder. Finally, sitewise rates were calculated with the data sets, the models and the trees as input.
- atp6 and atp8:
genes encoding ATPase subunits 6 and 8
- cox1-3 :
genes encoding cytochrome C oxidase subunits I-III
- cob :
gene encoding cytochrome b
major non-coding region
- nad1-6 and nad4L:
genes encoding NADH dehydrogenase subunits 1-6 and 4L
- rrnS and rrnL:
genes encoding small (12S) and large (16S) rRNAs
We thank Alexander Donath for help with the MITOS WebServer and four anonymous referees for helpful comments and suggestions. This study was funded by the priority program "Deep Metazoan Phylogeny" of the Deutsche Forschungsgemeinschaft (HA 2763/5).
- Ax P: Das System der Metazoa I. 1995, Stuttgart: G. Fischer, 226-Google Scholar
- Lüter C, Bartolomaeus T: The phylogenetic position of Brachiopoda - a comparison of morphological and molecular data. Zoologica Scripta. 1997, 26: 245-253. 10.1111/j.1463-6409.1997.tb00414.x.View ArticleGoogle Scholar
- Lüter C: The origin of the coelom in Brachiopoda and its phylogenetic significance. Zoomorphology. 2000, 120: 15-28. 10.1007/s004359900019.View ArticleGoogle Scholar
- Sørensen MV, Funch P, Willerslev E, Hansen AJ, Olesen J: On the phylogeny of the Metazoa in the light of Cycliophora and Micrognathozoa. Zoologischer Anzeiger. 2000, 239: 297-318.Google Scholar
- Brusca RC, Brusca GJ: Invertebrates. 2003, Sinauer Associates, 2Google Scholar
- Halanych KM, Bacheller J, Aguinaldo A, Liva S, Hillis D, Lake J: Evidence from 18S ribosomal DNA that the lophophorates are protostome animals. Science. 1995, 267: 1641-1643. 10.1126/science.7886451.PubMedView ArticleGoogle Scholar
- Mackey LY, Winnepenninckx B, de Wachter R, Backeljau T, Emschermann P, Garey JR: 18S rRNA suggests that Entoprocta are protostomes, unrelated to Ectoprocta. J Mol Evol. 1996, 42: 552-559. 10.1007/BF02352285.PubMedView ArticleGoogle Scholar
- Littlewood DTJ, Telford MJ, Clough K, Rohde K: Gnathostomulida - An enigmatic metazoan phylum from both morphological and molecular perspectives. Mol Phylogenet Evol. 1998, 9: 72-79. 10.1006/mpev.1997.0448.PubMedView ArticleGoogle Scholar
- Giribet G, Distel D, Polz M, Sterrer W, Wheeler W: Triploblastic relationships with emphasis on the acoelomates and the position of Gnathostomulida, Cycliophora, Plathelminthes, and Chaetognatha: A combined approach of 18S rDNA sequences and morphology. Syst Biol. 2000, 49: 539-562. 10.1080/10635159950127385.PubMedView ArticleGoogle Scholar
- Peterson KJ, Eernisse DJ: Animal phylogeny and the ancestry of bilaterians: inferences from morphology and 18S rDNA gene sequences. Evol Dev. 2001, 3: 170-205. 10.1046/j.1525-142x.2001.003003170.x.PubMedView ArticleGoogle Scholar
- Mallatt J, Winchell C: Testing the new animal phylogeny: First use of combined large-subunit and small-subunit rRNA gene sequences to classify the protostomes. Mol Biol Evol. 2002, 19: 289-301.PubMedView ArticleGoogle Scholar
- Halanych KM: The new view of animal phylogeny. Annu Rev Ecol Evol Syst. 2004, 35: 229-256. 10.1146/annurev.ecolsys.35.112202.130124.View ArticleGoogle Scholar
- Passamaneck Y, Halanych KM: Lophotrochozoan phylogeny assessed with LSU and SSU data: evidence of lophophorate polyphyly. Mol Phylogenet Evol. 2006, 40: 20-28. 10.1016/j.ympev.2006.02.001.PubMedView ArticleGoogle Scholar
- Baguna J, Martinez P, Paps J, Riutort M: Back in time: a new systematic proposal for the Bilateria. Philos T Roy Soc B. 2008, 363: 1481-1491. 10.1098/rstb.2007.2238.View ArticleGoogle Scholar
- Paps J, Baguna J, Riutort M: Lophotrochozoa internal phylogeny: new insights from an up-to-date analysis of nuclear ribosomal genes. P R Soc B. 2009, 276: 1245-1254.View ArticleGoogle Scholar
- Mallatt J, Craig CW, Yoder MJ: Nearly complete rRNA genes assembled from across the metazoan animals: Effects of more taxa, a structure-based alignment, and paired-sites evolutionary models on phylogeny reconstruction. Mol Phylogenet Evol. 2010, 55: 1-17. 10.1016/j.ympev.2009.09.028.PubMedView ArticleGoogle Scholar
- Stechmann A, Schlegel M: Analysis of the complete mitochondrial DNA sequence of the brachiopod Terebratulina retusa places Brachiopoda within the protostomes. Proc Biol Sci. 1999, 266: 2043-2052. 10.1098/rspb.1999.0885.PubMed CentralPubMedView ArticleGoogle Scholar
- Helfenbein K, Boore J: The mitochondrial genome of Phoronis architecta--Comparisons demonstrate that phoronids are Lophotrochozoan protostomes. Mol Biol Evol. 2004, 21: 153-157.PubMedView ArticleGoogle Scholar
- Waeschenbach A, Telford MJ, Porter JS, Littlewood DTJ: The complete mitochondrial genome of Flustrellidra hispida and the phylogenetic position of Bryozoa among the Metazoa. Mol Phylogenet Evol. 2006, 40: 195-207. 10.1016/j.ympev.2006.03.007.PubMedView ArticleGoogle Scholar
- Ruiz-Trillo I, Paps J, Loukota M, Ribera C, Jondelius U, Baguna J, Riutort M: A phylogenetic analysis of myosin heavy chain type II sequences corroborates that Acoela and Nemertodermatida are basal bilaterians. Proc Natl Acad Sci USA. 2002, 99: 11246-11251. 10.1073/pnas.172390199.PubMed CentralPubMedView ArticleGoogle Scholar
- Anderson F, Cordoba A, Thollesson M: Bilaterian phylogeny based on analyses of a region of the sodium-potassium ATPase beta-subunit gene. J Mol Evol. 2004, 58: 252-268. 10.1007/s00239-003-2548-9.PubMedView ArticleGoogle Scholar
- de Rosa R, Grenier J, Andreeva T, Cook C, Adoutte A, Akam M, Carroll S, Balavoine G: Hox genes in brachiopods and priapulids and protostome evolution. Nature. 1999, 399: 772-776. 10.1038/21631.PubMedView ArticleGoogle Scholar
- Passamaneck YJ, Halanych KM: Evidence from Hox genes that bryozoans are lophotrochozoans. Evol Dev. 2004, 6: 275-281. 10.1111/j.1525-142X.2004.04032.x.PubMedView ArticleGoogle Scholar
- Paps J, Baguna J, Riutort M: Bilaterian phylogeny: A broad sampling of 13 nuclear genes provides a new Lophotrochozoa phylogeny and supports a paraphyletic basal Acoelomorpha. Mol Biol Evol. 2009, 26: 2397-2406. 10.1093/molbev/msp150.PubMedView ArticleGoogle Scholar
- Helmkampf M, Bruchhaus I, Hausdorf B: Multigene analysis of lophophorate and chaetognath phylogenetic relationships. Mol Phylogenet Evol. 2008, 46: 206-214. 10.1016/j.ympev.2007.09.004.PubMedView ArticleGoogle Scholar
- Hausdorf B, Helmkampf M, Meyer A, Witek A, Herlyn H, Bruchhaus I, Hankeln T, Struck TH, Lieb B: Spiralian phylogenomics supports the resurrection of Bryozoa comprising Ectoprocta and Entoprocta. Mol Biol Evol. 2007, 24: 2723-2729. 10.1093/molbev/msm214.PubMedView ArticleGoogle Scholar
- Hausdorf B, Helmkampf M, Nesnidal MP, Bruchhaus I: Phylogenetic relationships within the lophophorate lineages (Ectoprocta, Brachiopoda and Phoronida). Mol Phylogenet Evol. 2010, 55: 1121-1127. 10.1016/j.ympev.2009.12.022.PubMedView ArticleGoogle Scholar
- Dunn CW, Hejnol A, Matus DQ, Pang K, Browne WE, Smith SA, Seaver E, Rouse GW, Obst M, Edgecombe GD, Sørensen MV, Haddock SHD, Schmidt-Rhaesa A, Okusu A, Kristensen RM, Wheeler WC, Martindale MQ, Giribet G: Broad phylogenomic sampling improves resolution of the animal tree of life. Nature. 2008, 452: 745-U5. 10.1038/nature06614.PubMedView ArticleGoogle Scholar
- Helmkampf M, Bruchhaus I, Hausdorf B: Phylogenomic analyses of lophophorates (brachiopods, phoronids and bryozoans) confirm the Lophotrochozoa concept. P R Soc B. 2008, 275: 1927-1933. 10.1098/rspb.2008.0372.View ArticleGoogle Scholar
- Struck TH, Fisse F: Phylogenetic position of Nemertea derived from phylogenomic data. Mol Biol Evol. 2008, 25: 728-736. 10.1093/molbev/msn019.PubMedView ArticleGoogle Scholar
- Witek A, Herlyn H, Meyer A, Boell L, Bucher G, Hankeln T: EST based phylogenomics of Syndermata questions monophyly of Eurotatoria. BMC Evol Biol. 2008, 8: 345-10.1186/1471-2148-8-345.PubMed CentralPubMedView ArticleGoogle Scholar
- Bleidorn C, Podsiadlowski L, Zhong M, Eeckhaut I, Hartmann S, Halanych KM, Tiedemann R: On the phylogenetic position of Myzostomida: can 77 genes get it wrong?. BMC Evol Biol. 2009, 9: 150-10.1186/1471-2148-9-150.PubMed CentralPubMedView ArticleGoogle Scholar
- Hejnol A, Obst M, Stamatakis A, Ott M, Rouse GW, Edgecombe GD, Martinez P, Baguna J, Bailly X, Jondelius U, Wiens M, Mueller WEG, Seaver E, Wheeler WC, Martindale MQ, Giribet G, Dunn CW: Assessing the root of bilaterian animals with scalable phylogenomic methods. P R Soc B. 2009, 276: 4261-4270. 10.1098/rspb.2009.0896.View ArticleGoogle Scholar
- Nesnidal MP, Helmkampf M, Bruchhaus I, Hausdorf B: Compositional heterogeneity and phylogenomic inference of metazoan relationships. Mol Biol Evol. 2010, 27: 2095-2104. 10.1093/molbev/msq097.PubMedView ArticleGoogle Scholar
- Cohen BL: Monophyly of brachiopods and phoronids: reconciliation of molecular evidence with Linnaean classification (the subphylum Phoroniformea nov.). Proc Biol Sci. 2000, 267: 225-231. 10.1098/rspb.2000.0991.PubMed CentralPubMedView ArticleGoogle Scholar
- Cohen BL, Weydmann A: Molecular evidence that phoronids are a subtaxon of brachiopods (Brachiopoda: Phoronata) and that genetic divergence of metazoan phyla began long before the early Cambrian. Organisms Diversity & Evolution. 2005, 5: 253-273. 10.1016/j.ode.2004.12.002.View ArticleGoogle Scholar
- Yokobori S, Iseto T, Asakawa S, Sasaki T, Shimizu N, Yamagishi A, Oshima T, Hirose E: Complete nucleotide sequences of mitochondrial genomes of two solitary entoprocts, Loxocorone allax and Loxosomella aloxiata: implications for lophotrochozoan phylogeny. Mol Phylogenet Evol. 2008, 47: 612-628. 10.1016/j.ympev.2008.02.013.PubMedView ArticleGoogle Scholar
- Jang KH, Hwang UW: Complete mitochondrial genome of Bugula neritina (Bryozoa, Gymnolaemata, Cheilostomata): phylogenetic position of Bryozoa and phylogeny of lophophorates within the Lophotrochozoa. BMC Genomics. 2009, 10: 167-10.1186/1471-2164-10-167.PubMed CentralPubMedView ArticleGoogle Scholar
- Podsiadlowski L, Braband A, Struck TH, Döhren von J, Bartolomaeus T: Phylogeny and mitochondrial gene order variation in Lophotrochozoa in the light of new mitogenomic data from Nemertea. BMC Genomics. 2009, 10: 364-10.1186/1471-2164-10-364.PubMed CentralPubMedView ArticleGoogle Scholar
- Nielsen C: Animal phylogeny in the light of the trochaea theory. Biol J Linn Soc. 1985, 25: 243-299. 10.1111/j.1095-8312.1985.tb00396.x.View ArticleGoogle Scholar
- Nielsen C: Animal evolution: Interrelationships of the living phyla. Oxford. 2001, SecondGoogle Scholar
- Naylor GJ, Collins TM, Brown WM: Hydrophobicity and phylogeny. Nature. 1995, 373: 565-566.PubMedView ArticleGoogle Scholar
- Foster P, Jermiin L, Hickey D: Nucleotide composition bias affects amino acid content in proteins coded by animal mitochondria. J Mol Evol. 1997, 44: 282-288. 10.1007/PL00006145.PubMedView ArticleGoogle Scholar
- Foster P, Hickey D: Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions. J Mol Evol. 1999, 48: 284-290. 10.1007/PL00006471.PubMedView ArticleGoogle Scholar
- Gibson A, Gowri-Shankar V, Higgs PG, Rattray M: A comprehensive analysis of mammalian mitochondrial genome base composition and improved phylogenetic methods. Mol Biol Evol. 2005, 22: 251-264.PubMedView ArticleGoogle Scholar
- Hassanin A, Leger N, Deutsch J: Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst Biol. 2005, 54: 277-298. 10.1080/10635150590947843.PubMedView ArticleGoogle Scholar
- Sheffield NC, Song H, Cameron SL, Whiting MF: Nonstationary evolution and compositional heterogeneity in beetle mitochondrial phylogenomics. Syst Biol. 2009, 58: 381-394. 10.1093/sysbio/syp037.PubMedView ArticleGoogle Scholar
- Song H, Sheffield NC, Cameron SL, Miller KB, Whiting MF: When phylogenetic assumptions are violated: base compositional heterogeneity and among-site rate variation in beetle mitochondrial phylogenomics. Syst Entomol. 2010, 35: 429-448. 10.1111/j.1365-3113.2009.00517.x.View ArticleGoogle Scholar
- Sun M, Wu Z, Shen X, Ren J, Liu X, Liu H, Liu B: The complete mitochondrial genome of Watersipora subtorquata (Bryozoa, Gymnolaemata, Ctenostomata) with phylogenetic consideration of Bryozoa. Gene. 2009, 439: 17-24. 10.1016/j.gene.2009.03.003.PubMedView ArticleGoogle Scholar
- Bernt M, Merkle D, Ramsch K, Fritzsch G, Perseke M, Bernhard D, Schlegel M, Stadler PF, Middendorf M: CREx: inferring genomic rearrangements based on common intervals. Bioinformatics. 2007, 23: 2957-2958. 10.1093/bioinformatics/btm468.PubMedView ArticleGoogle Scholar
- Endo K, Endo K, Noguchi Y, Noguchi Y, Ueshima R, Ueshima R, Jacobs HT, Jacobs HT: Novel repetitive structures, deviant protein-encoding sequences and unidentified ORFs in the mitochondrial genome of the brachiopod Lingula anatina. J Mol Evol. 2005, 61: 36-53. 10.1007/s00239-004-0214-5.PubMedView ArticleGoogle Scholar
- Fuchs J, Obst M, Sundberg P: The first comprehensive molecular phylogeny of Bryozoa (Ectoprocta) based on combined analyses of nuclear and mitochondrial genes. Mol Phylogenet Evol. 2009, 52: 225-233. 10.1016/j.ympev.2009.01.021.PubMedView ArticleGoogle Scholar
- Misof B, Misof K: A Monte Carlo approach successfully identifies randomness in multiple sequence alignments: A more objective means of data exclusion. Syst Biol. 2009, 58: 21-34. 10.1093/sysbio/syp006.PubMedView ArticleGoogle Scholar
- Kück P, Meusemann K, Dambach J, Thormann B, Reumont von BM, Wägele JW, Misof B: Parametric and non-parametric masking of randomness in sequence alignments can be improved and leads to better resolved trees. Front Zool. 2010, 7: 10-10.1186/1742-9994-7-10.PubMed CentralPubMedView ArticleGoogle Scholar
- Bleidorn C, Eeckhaut I, Podsiadlowski L, Schult N, Mchugh D, Halanych KM, Milinkovitch MC, Tiedemann R: Mitochondrial genome and nuclear sequence data support Myzostomida as part of the annelid radiation. Mol Biol Evol. 2007, 24: 1690-1701. 10.1093/molbev/msm086.PubMedView ArticleGoogle Scholar
- Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17: 540-552.PubMedView ArticleGoogle Scholar
- Susko E, Roger AJ: On reduced amino acid alphabets for phylogenetic inference. Mol Biol Evol. 2007, 24: 2139-2150. 10.1093/molbev/msm144.PubMedView ArticleGoogle Scholar
- Foster P: Modeling compositional heterogeneity. Syst Biol. 2004, 53: 485-495. 10.1080/10635150490445779.PubMedView ArticleGoogle Scholar
- Jermiin L, Ho S, Ababneh F, Robinson J, Larkum A: The biasing effect of compositional heterogeneity on phylogenetic estimates may be underestimated. Syst Biol. 2004, 53: 638-643. 10.1080/10635150490468648.PubMedView ArticleGoogle Scholar
- Dayhoff MO, Schwartz RM, Orcutt BC: A model of evolutionary change in proteins. Atlas of protein sequence and structure. 1978, Washington: National Biomedical Research Foundation, 5 (Suppl 3): 345-352. Dayhoff MO
- Blanquart S, Lartillot N: A site-and time-heterogeneous model of amino acid replacement. Mol Biol Evol. 2008, 25: 842-858. 10.1093/molbev/msn018.PubMedView ArticleGoogle Scholar
- Lartillot N, Philippe H: Improvement of molecular phylogenetic inference and the phylogeny of Bilateria. Philos T Roy Soc B. 2008, 363: 1463-1472. 10.1098/rstb.2007.2236.View ArticleGoogle Scholar
- Lowe T, Eddy S: tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research. 1997, 25: 955-964. 10.1093/nar/25.5.955.PubMed CentralPubMedView ArticleGoogle Scholar
- Laslett D, Canback B: ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics. 2008, 24: 172-175. 10.1093/bioinformatics/btm573.PubMedView ArticleGoogle Scholar
- Perna NT, Kocher TD: Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J Mol Evol. 1995, 41: 353-358. 10.1007/BF01215182.PubMedView ArticleGoogle Scholar
- Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research. 2002, 30: 3059-3066. 10.1093/nar/gkf436.PubMed CentralPubMedView ArticleGoogle Scholar
- Katoh K, Toh H: Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinformatics. 2008, 9: 286-298. 10.1093/bib/bbn013.PubMedView ArticleGoogle Scholar
- Wernersson R, Pedersen AG: RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Research. 2003, 31: 3537-3539. 10.1093/nar/gkg609.PubMed CentralPubMedView ArticleGoogle Scholar
- Swofford DL: PAUP*. Phylogenetic Analysis Using Parsimony *and other methods. Version 4.0b10. 2002, Sunderland MA: Sinauer AssociatesGoogle Scholar
- Ho JWK, Adams CE, Bin Lew J, Matthews TJ, Ng CC, Shahabi-Sirjani A, Tan LH, Zhao Y, Easteal S, Wilson SR, Jermiin LS: SeqVis: Visualization of compositional heterogeneity in large alignments of nucleotides. Bioinformatics. 2006, 22: 2162-2163. 10.1093/bioinformatics/btl283.PubMedView ArticleGoogle Scholar
- Ott M, Zola J, Stamatakis A, Aluru S: Large-scale maximum likelihood-based phylogenetic analysis on the IBM BlueGene/L. 2007, New York, New York, USA: ACM, 4-Google Scholar
- Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22: 2688-2690. 10.1093/bioinformatics/btl446.PubMedView ArticleGoogle Scholar
- Rota-Stabelli O, Yang Z, Telford MJ: MtZoa: A general mitochondrial amino acid substitutions model for animal evolutionary studies. Mol Phylogenet Evol. 2009, 52: 268-272. 10.1016/j.ympev.2009.01.011.PubMedView ArticleGoogle Scholar
- Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol. 2008, 57: 758-771. 10.1080/10635150802429642.PubMedView ArticleGoogle Scholar
- Lartillot N, Philippe H: A Bayesian mixture model for across-site heterogeneities in the amino acid replacement process. Mol Biol Evol. 2004, 21: 1095-1109. 10.1093/molbev/msh112.PubMedView ArticleGoogle Scholar
- Galtier N, Gouy M: Inferring pattern and process: Maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis. Mol Biol Evol. 1998, 15: 871-879.PubMedView ArticleGoogle Scholar
- Boussau B, Gouy M: Efficient likelihood computations with nonreversible models of evolution. Syst Biol. 2006, 55: 756-768. 10.1080/10635150600975218.PubMedView ArticleGoogle Scholar
- Jobb G, Haeseler von A, Strimmer K: TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics. BMC Evol Biol. 2004, 4: 18-10.1186/1471-2148-4-18.PubMed CentralPubMedView ArticleGoogle Scholar
- Jobb G: TREEFINDER version October 2008. 2008, Distributed by the author at www.treefinder.deGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.