Thirteen is enough: the myosins of Dictyostelium discoideum and their light chains
© Kollmar; licensee BioMed Central Ltd. 2006
Received: 14 March 2006
Accepted: 20 July 2006
Published: 20 July 2006
Skip to main content
© Kollmar; licensee BioMed Central Ltd. 2006
Received: 14 March 2006
Accepted: 20 July 2006
Published: 20 July 2006
Dictyostelium discoideum is one of the most famous model organisms for studying motile processes like cell movement, organelle transport, cytokinesis, and endocytosis. Members of the myosin superfamily, that move on actin filaments and power many of these tasks, are tripartite proteins consisting of a conserved catalytic domain followed by the neck region consisting of a different number of so-called IQ motifs for binding of light chains. The tails contain functional motifs that are responsible for the accomplishment of the different tasks in the cell. Unicellular organisms like yeasts contain three to five myosins while vertebrates express over 40 different myosin genes. Recently, the question has been raised how many myosins a simple multicellular organism like Dictyostelium would need to accomplish all the different motility-related tasks.
The analysis of the Dictyostelium genome revealed thirteen myosins of which three have not been described before. The phylogenetic analysis of the motor domains of the new myosins placed Myo1F to the class-I myosins and Myo5A to the class-V myosins. The third new myosin, an orphan myosin, has been named MyoG. It contains an N-terminal extension of over 400 residues, and a tail consisting of four IQ motifs and two MyTH4/FERM (my osin t ail h omology 4/band 4.1, e zrin, r adixin, and m oesin) tandem domains that are separated by a long region containing an SH3 (s rc h omology 3) domain. In contrast to previous analyses, an extensive comparison with 126 class-VII, class-X, class-XV, and class-XXII myosins now showed that MyoI does not group into any of these classes and should not be used as a model for class-VII myosins.
The search for calmodulin related proteins revealed two further potential myosin light chains. One is a close homolog of the two EF-hand motifs containing MlcB, and the other, CBP14, phylogenetically groups to the ELC/RLC/calmodulin (e ssential l ight c hain/r egulatory l ight c hain) branch of the tree.
Dictyostelium contains thirteen myosins together with 6–8 MLCs (m yosin l ight c hain) to assist in a variety of actin-based processes in the cell. Although they are homologous to myosins of higher eukaryotes, the myosins of Dictyostelium should be considered with care as models for specific functions of vertebrate myosins.
Three kinds of molecular motor proteins have been identified so far: myosins, kinesins, and dyneins [1, 2]. While kinesins and dyneins move on microtubule tracks, myosins are the only motors that use the energy of ATP hydrolysis to power movement along actin filaments. The first myosin was identified in skeletal muscle tissue, and subsequently a large number of proteins containing the myosin motor domain have been discovered in eukaryotic cells where they fulfill a variety of cellular functions from cell division, cellular locomotion, vesicle transport to muscle contraction .
Myosin proteins are typically divided into three major domains. The motor domain that is usually found at the N-terminus contains the nucleotide and actin binding sites. The neck domain, following the motor domain, consists of a helical segment that binds specific myosin light chains or calmodulin. The target sequence for light chain binding is based on the consensus sequence IQxxxRGxxxR  and therefore termed the IQ motif. Myosins may have zero to over 15 IQ motifs in the neck region. The third domain is called the tail domain and contains class specific functional motifs that are responsible for the accomplishment of the different tasks of the myosins in the cell . The classification of the myosins is based on the phylogenetic relation of the myosin motor domains. Altogether, up to 20 myosin classes have been assigned in recent reviews [4, 5]. In addition, there are many myosins for which close homologs have not been found and which are therefore termed orphan myosins. Recently, two analyses of myosin proteins describing conflicting findings have been published [6, 7]. Both disagree with previously established models of myosin evolution [reviewed in ], because of the erroneous data sets and analysis methods used. However, we have performed an exhaustive analysis of 1910 manually annotated myosins from 303 species that will be referred to in the analysis of the Dictyostelium discoideum myosins (F. Odronitz and M. Kollmar, submitted).
Different organisms contain only a subset of the classes but in many cases several homologs of the same class. For example, the Entamoeba histolytica genome reveals one class-I and one class-II myosin , while the Saccharomyces cerevisiae genome contains two class-I, one class-II, and two class-V myosins . In contrast, more complex genomes show more diversity. The Caenorhabditis elegans genome contains 17 myosins belonging to seven classes, the Drosophila melanogaster genome contains 13 myosins belonging to ten classes, and the Homo sapiens genome reveals 40 myosins belonging to twelve classes. Plants do not have a large repertoire of classes but many homologs of the class-VIII and class-XI myosins (e.g. Arabidopsis thaliana contains 4 class-VIII and 13 class-XI myosin genes).
For the lower eukaryot Dictyostelium discoideum, 14 myosins or potential myosins have been reported so far. The first myosin identified, a class-II myosin, has been named mhcA (myosin heavy chain A) and all subsequently discovered myosins where named alphabetical (MyoA-M). For MyoF and MyoH only small fragments of the motor domain have been obtained while for MyoG and MyoL only potential loci have been identified. The class-I myosins (MyoA-E, MyoK) and the class-II myosin have unambiguously been assigned in the past [e.g. [5, 10]]. MyoI has been classified as a class-VII myosin [5, 11], although the phylogenetic grouping has been very weak, while no similar myosin has been found to MyoM that has therefore been added to the orphan class. The closest homologues found for MyoJ came from the class-XI myosins, a class only containing plant myosins, and it was therefore grouped to them [12, 13].
Almost all myosins exist at least as heterodimers by binding of light chains to the neck region. The light chains bind to the IQ motif and have an essential role in stabilizing the neck, so it can function as a rigid lever arm that swings relative to the motor domain to generate movement . In addition, the light chains can be important regulatory sites, either through phosphorylation or by binding of Ca2+ . The light chains belong to the family of calmodulin-related proteins. Most myosins are expected to bind calmodulins, while the class-II myosins always bind two specific calmodulin-related proteins, the essential and the regulatory myosin light chains. Recently, calmodulin-related proteins have been identified that specifically bind to a certain myosin, e.g. a light chain binding to Toxoplasma gondii Myo14A [TgMLC-1, ], or the light chain binding to Accanthamoeba castellanii Myo1C [AcMICLC, ]. For Dictyostelium discoideum, in addition to the essential and the regulatory myosin light chains two further specific light chains have been discovered. The class-I myosin MyoD binds a light chains that is phylogenetically related the Ac Myo1C light chain [MlcD, ], and the class-I myosin MyoB forms a heterodimer with a light chain that is unique as it consists of only the half of a calmodulin-related protein [MlcB, ].
Dictyostelium is one of the most famous model organisms for studying motile processes in cells, especially those related to the actin cytoskeleton. Recently, the question has been raised how many myosins a simple multicellular organism like Dictyostelium would need to accomplish all the diverse motility-related tasks . Here, the complete repertoire of myosin family proteins in the slime mold Dictyostelium discoideum is presented. The analysis revealed thirteen myosin proteins of which three have not been described so far. The new myosin family members are described and the already published myosins revised and partially reclassified. In addition, all members of the calmodulin-related protein family in Dictyostelium have been identified and analyzed to reveal the complete repertoire of myosin light chains.
Members of the myosin gene superfamily in D. discoideum.
Size in amino acids
No. of introns
Gene accession (GeneBank)a
Protein accession (GenBank)b
The Dictyostelium cDNA database in Japan  was searched for the new myosin genes to confirm their expression (exclusion of pseudogenes) and gene structure, as the newly identified myoF, myoH, and myoG genes contain several introns (Table 1). However, the cDNA clones only cover the region around the last of the introns of Myo5A (MyoH). The extremely high AT content of Dictyostelium introns and the help of a multiple sequence alignment of over 1700 myosin motor domains (M. Kollmar, unpublished data) nevertheless allowed the unambiguous identification of the introns and, subsequently, the protein coding regions.
Differences in the sequences of previously published genes and the genes obtained from the genome sequencing project.
Sequence differences. The first residue/sequence is from the genome project, the numbers indicate the position in the protein sequence.
Example cDNA sequences covering the revised residues.
3T instead of E;211SSEEE215 instead of HQG; 239CFTA242 instead of WFSLP; 270EI instead of KS; 583R instead of S; 638L instead of S; 809ATLKR813 instead of DALPS; 819AVLQL823 instead of DCSSKR
AHM490, VFH810, AHH186, VHP545
207A instead of R; 365R instead of K
AFK241, AHL184, AHE272
479GADQKLLQSIAVCKSNPHFDTR500 instead of VPIKSYFNPLPFVNQIHISILV 813SSKSQVMVHPI823 instead of LQISSHGTPN
SLH381, SHG368, AHF290
613 no insertion of FGRI; 694A instead of R; 837T instead of R
26E instead of D; 48T instead of R; 77M instead of I; 137LD instead of IRFQ; 215D instead of N; 371IINCTTEKGP instead of LSIVHREGT; 427VRE instead of K, 440N in addition, 498I instead of N; 604V instead of D; 681N instead of I; 683T instead of R; 735K instead of N; 763H instead of D; 828S instead of W; 889K instead of N; 979NQ instead of KE
VHK623, SHL808, AHG510,
191F instead of L; 284T instead of A; 291R instead of G; 550K instead of N; 865QQ instead of HH; 1041K instead of N; 1389Q instead of P
The cDNA database is based on libraries of cells obtained from several different stages of the developmental cycle of Dictyostelium . One library contains data from clones obtained at the so-called first finger stage (14 h – 16 h) of development. A second library was constructed from vegetatively growing cells. A third library contains clones derived from "sexually competent cells" (cells are cultured in liquid medium in the dark, and are competent for fusion with opposite mating type cells), cells that are roughly equivalent to growth phase cells. The data supposed to contain full-length genes is based on libraries from cells at the following stages of the developmental cycle: axenically growing cells, cells developed on nitrocellulose filter to aggregation stage (8 hours), slug stage (16 hours) and early culmination stage (20 hours). For Myo5A (MyoH) and MyoG cDNA fragments have been obtained in all libraries containing the potentially full-length genes indicating that these two myosins are expressed in all stages of the developmental cycle. However, only one gene fragment has been obtained for Myo1F from the library of full-length genes of axenically growing cells. The cDNA data is not supposed to reveal the complete expression pattern of all proteins at the different developmental stages. But the number of obtained clones and the occurrence in a specific library indicates that Myo1F might not be strongly expressed, and primarily expressed in vegetative cells. These conclusions have of course to be confirmed by further experimental data.
This myosin is the seventh class-I myosin found in the Dictyostelium genome. Based on the short fragment of the motor domain which has been obtained in an earlier PCR-based screen of the genome  it was already supposed to group to this subfamily. Its motor domain sequence is most similar to that of Myo1E. Unlike the other class-I myosins that contain only small N-terminal extensions to the motor domain of 8 to 15 residues it has an N-terminal domain of 50 amino acids. However, based on its sequence this small domain is unlikely to fold into a similar structure like the N-terminal domain of class-II myosins. Directly following the motor domain, Myo1F has two consecutive IQ motifs that are strong indicators for binding of calmodulin or a calmodulin-like myosin light chain. A short coiled-coil region has been predicted for the small region between the two IQ motifs. The short distance of only about 20 residues, however, makes it unlikely that Myo1F would be able to dimerize in that area. The remainder of the tail is similar to those of Myo1A or Myo1E predicted to be all α-helical. In the centre of the tail there is a domain rich in basic residues and it is therefore outlined as membrane binding domain in analogy to those of the other class-I myosins. Whether this region really binds to membranes has to be shown.
Myo5A seems to be the smaller brother of Myo5B (MyoJ). Both are phylogenetically closely related and show similar domain organisations. Altogether, Myo5A (MyoH) is 480 aa shorter in length. It also has an N-terminal SH3-like domain, but the N-terminal extension is not as long as for Myo5B (MyoJ). In contrast to Myo5B (MyoJ), it has three additional long insertions into surface loops of the motor domain that are not observed in any other Dictyostelium myosin sequence. It has five instead of six IQ motives for binding of light chains, and the coiled-coil region is predicted to be considerably shorter than that of Myo5B (MyoJ). Except for the DIL domain at the C-terminus of the tail there is no further sequence similarity between the two Myo5 tail sequences.
MyoG does not group to any of the class-VII, class-X, class-XV, or class-XXII myosins, or any other myosin, and is therefore designated an orphan myosin. MyoG is one of the longest myosins of the whole family. It has an N-terminal domain of 440 residues that does not show any homology do other proteins. The sequence in the N-terminal domain contains long stretches of consecutive asparagines and serines that are typical for many Dictyostelium proteins . The head domain is followed by four IQ motifs. The C-terminal tail is characterised by two MyTH4/FERM tandem domains that are separated by a long region containing an SH3 domain and a short predicted coiled-coil region. The regions between these recognised domains also contain many stretches of consecutive polar residues. An outstanding case is the nine-fold consecutive repeat of the motif 'SQQQQ'. The C-terminal end of the tail contains a second predicted coiled-coil region. The coiled-coil domains in myosins are normally located directly behind the IQ motifs and are responsible for dimerisation. MyoG might also exist as a dimer in vivo, but the heads are not expected to move on actin filaments in a similar hand-over-hand mechanism as has been found for other myosin dimers like class-V myosins .
Loop-2 has been shown to be involved in both weak and strong binding interactions with actin . According to this study, especially positively charged residues strengthen the binding to actin. MyoG and MyoM contain long extensions of loop-2 compared to MhcA, but the sequences contain mainly glycines, prolines, and polar amino acids. Thus, both myosins are not expected to have considerably different actin-binding properties. Loop-4 is the loop that is furthest removed from the actin surface as has been suggested from actomyosin models derived from electron microscopy. Except for some class-I, the insect class-V, some class-XVII, and some apicomplexa myosins, almost all myosins have a loop-4 of similar length. The loop has been suggested to be involved in either interactions with actin or regulatory proteins that are bound to actin . For the class-I myosin myr1 from rat, it has been shown that a head fragment localizes to the same highly dynamic actin structures at the cell cortex as the full-length construct and not to the actin filaments that are regulated by tropomyosin . An extended loop-4 might be responsible for this localisation as it might hinder binding to the tropomyosin stabilized less dynamic actin filaments as they occur in stress fibers. Myo5A (MyoH) contains one of the longest loop-4 of all myosins and might therefore only bind to the dynamic actin structures at the cell cortex and not to actin structures that are stabilized by other proteins.
MyoM has an extended loop at the same position where the class-VI myosins have one of their prominent insertions. This loop has been suggested to affect nucleotide binding by changing the conformation of a following loop . This loop therefore protrudes within the nucleotide-binding pocket, resulting in a decrease in nucleotide accessibility. The functions of the other two surface loops, for which Myo5A (MyoH) has long extensions, have not been analysed so far.
Members of the myosin light chain, calmodulin and calmodulin-related gene superfamily in D. discoideum.
Size in aa
No. of introns
Gene accession (GenBank)
Protein accession (GeneBank)
, cluster with cbpC, cbpG, cbpF, CBP14
cluster with cbpK
According to the phylogenetic tree, 16 of the Dictyostelium CBPs (CBP1-13, calfumirin) do not belong to an already named class but are closely related to the frequenins. CBP4c is very similar to CBP4a and CBP4b, but the N-terminus could not be identified. CBP13 is a protein fragment, missing the N-terminus as well as the C-terminus, and is very similar to CBP12. CBP4c and CBP13 are therefore most probably pseudogenes. Those members of this group, that have already been identified, belong to developmentally regulated genes. Their distinct spatial expression patterns suggested that they might be involved in morphogenesis . Seven of the Dictyostelium CBPs belong to the frequenin class. The members of the frequenin class are also highly developmentally regulated. An interaction with a myosin family protein has not been described in the literature for any member of this group of any organism. Dictyostelium also contains two centrins and two proteins grouping to the calcineurin family.
So far, only the ELC and RLC myosin light chains as well as calmodulin subfamily proteins have been shown to bind to the IQ motifs of myosins. The exceptions are the myosin light chain of Toxoplasma gondii Myo14A [TgMLC-1, ], the light chain of Accanthamoeba castellanii Myo1C [AcMIMLC, ], and, identified in Dictyostelium, a light chain of Myo1D that is phylogenetically related the Ac Myo1C light chain [MlcD, ], and a light chain that binds to Myo1B [MlcB, ]. Next to MlcD and MlcB, Dictyostelium contains one ELC and one RLC myosin light chain that bind to MhcA. For Saccharomyces cerevisiae it has been shown that the ELC (also termed Mlc1p) not only binds to the class-II myosin, but also to the class-V myosin Myo2p. Chicken myosin-5A even binds two different ELCs next to calmodulin. Therefore, the Dictyostelium ELC is also expected to participate in binding to the class-V myosins. Dictyostelium also contains two members of the calmodulin subfamily that have not specifically been shown to bind to myosins, but are highly expected to in accordance with results obtained for other organisms. The analysis also revealed another CBP containing two EF-hand motifs that is most closely related to MlcB. It is therefore also highly expected to bind to a myosin, and termed MLC-1. As Myo1B and Myo1D bind a specific light chain each, and Myo1C is the closest homolog of the remaining class-I myosins, MLC-1 might be the specific light chain for Myo1C, but this has of course to be proven by biochemical experiments.
CBP14 does not group to any specific class but is more similar to the members of the calmodulin/ELC/RLC part of the phylogenetic tree then to the developmentally related CBPs. If it were able to function as a myosin light chain, then it would be the founding member for another specific myosin light chain class.
The analysis of the Dictyostelium discoideum genome revealed thirteen members of the myosin family of which three have not been described before. The phylogenetic analysis of their motor domains placed seven myosins to the class-I myosins (Myo1A to Myo1G, Myo1F is a new member), one to the class-II myosins (MhcA), and two, of which Myo5A (MyoH) has newly been identified, to the class-V myosins. Three myosins (MyoG, MyoI, and MyoM) do not have a close homolog in any other organism and could therefore not be classified. In contrast to previous analyses, an extensive comparison with 126 class-VII, class-X, class-XV, and class-XXII myosins now showed that MyoI does not group into any of these classes and can not be used as a model for class-VII myosins. The third new myosin has been named MyoG. It contains an N-terminal extension of over 400 residues, and a tail consisting of four IQ motifs and two MyTH4/FERM tandem domains that are separated by a long region containing an SH3 domain. Although its tail organisation is similar to that of class-VII myosins, the motor domain of MyoG does not group into any existing class.
Four specific myosin light chains have been identified so far (ELC, RLC, MlcB, MlcD) next to two calmodulins. The analysis of the genome revealed another protein containing two EF-hand motifs that is closely related to MlcB. Based on its phylogenetic relationship it is highly expected to be a myosin light chain. A further calmodulin-related protein, termed CBP14, phylogenetically groups to the ELC/RLC/calmodulin branch of the tree and might therefore also be a myosin light chain, although it does not have a close homolog in other model organisms.
The full-length sequences of ten of the thirteen Dictyostelium myosins have been reported in the literature (Table 1). Partial sequences of Myo1F (MyoF) and Myo5A (MyoH) have already been reported  and were used as basis for the manual assembly of the genes from clones published by the Dictyostelium Genome Sequencing Project. These genes are in consistence with the assembly of the recently published genome . The two sequences, as well as MyoG that has also been derived from genomic data , are therefore predicted sequences that have not been verified by complete cDNAs. The Japanese cDNA project [21, 23] includes only small parts of these new myosins, which are, however, consistent with the predicted sequences. The Japanese cDNA project also includes at least fragments of all other myosins. No additional myosin genes have been found in the genome of Dictyostelium, and thus the reported "myoL" gene locus  might have originated from experimental artefacts.
The Dictyostelium calmodulin-related genes have been identified in an iterated TBLASTN search of the completed Dictyostelium genome starting with the protein sequence of CalA. Thus, all solely EF-hand motif-containing proteins have been collected (Table 3). The predicted sequences have been verified by searches against the Japanese cDNA database . The EF-hand containing proteins from Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, Drosophila melanogaster, and Homo sapiens have been obtained in iterative TBLASTN searches of the corresponding genomes. Specific myosin light chains from other organisms have been obtained from the protein database at NCBI.
The complete analysis of the myosin motor domains derived from the NCBI non-redundant database and the EST and genomic sequences of more than 270 eukaryotes will be published elsewhere (Kollmar, unpublished data). All these myosin sequences, including the Dictyostelium myosins, together with their accession numbers and additional references will be accessible through the newly designed CyMoBase [, F. Odronitz and M. Kollmar, submitted]. The database comprises over 1700 myosin sequences (Feb. 2006) that have been used for the phylogenetic classification of the Dictyostelium myosins. The underlying phylogenetic tree has been built of a structure-guided multiple sequence alignment. The phylogenetic trees of the class-V/-VIII/-XI and the class-VII/-X/-XV/-XXII myosins have been constructed from corresponding sequences of this alignment. All phylogenetic trees are unrooted and were generated using neighbour joining and the Bootstrap (1,000 replicates) method as implemented in ClustalW [standard settings, ] and drawn by using TreeView .
The phylogenetic tree of the calmodulin-related proteins has been calculated based on a manual sequence alignment. The manual sequence alignment has been improved by iteratively creating phylogenetic trees and adjusting the alignment. The resulting phylogenetic tree is unrooted and was generated using neighbour joining and the Bootstrap (1,000 replicates) method as implemented in ClustalW and drawn by using TreeView.
Protein domains were predicted using the SMART [42, 43] and Pfam [44, 45] web server. The prediction of protein motifs (coiled coils, leucine zipper, prenyl-group binding motifs) is mainly based on the results of the predict-protein server [46, 47]. The IQ-motifs and N-terminal domains were predicted manually based on the homology to similar domains of other myosins included in the multiple sequence alignment of the myosins. The recognition motifs included in the SMART and Pfam databases are too restrictive, as the motifs have been created based on the small datasets available some years ago.
src homology 3
band 4.1, ezrin, radixin, and moesin
myosin tail homology 4
Rho GDP/GTP exchange factor
myosin light chain
essential light chain
regulatory light chain
M.K. is supported by a Liebig Stipendium of the Fonds der Chemischen Industrie, which is in part financed by the BMBF.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.