- Research article
- Open Access
The serine/threonine kinase 33 is present and expressed in palaeognath birds but has become a unitary pseudogene in neognaths about 100 million years ago
BMC Genomics volume 16, Article number: 543 (2015)
Serine/threonine kinase 33 (STK33) has been shown to be conserved across all major vertebrate classes including reptiles, mammals, amphibians and fish, suggesting its importance within vertebrates. It has been shown to phosphorylate vimentin and might play a role in spermatogenesis and organ ontogenesis. In this study we analyzed the genomic locus and expression of stk33 in the class Aves, using a combination of large scale next generation sequencing data analysis and traditional PCR.
Within the subclass Palaeognathae we analyzed the white-throated tinamou (Tinamus guttatus), the African ostrich (Struthio camelus) and the emu (Dromaius novaehollandiae). For the African ostrich we were able to generate a 62,778 bp long genomic contig and an mRNA sequence that encodes a protein showing highly significant similarity to STK33 proteins from other vertebrates. The emu has been shown to encode and transcribe a functional STK33 as well. For the white-throated tinamou we were able to identify 13 exons by sequence comparison encoding a protein similar to STK33 as well.
In contrast, in all 28 neognath birds analyzed, we could not find evidence for the existence of a functional copy of stk33 or its expression. In the genomes of these 28 bird species, we found only remnants of the stk33 locus carrying several large genomic deletions, leading to the loss of multiple exons. The remaining exons have acquired various indels and premature stop codons.
We were able to elucidate and describe the genomic structure and the transcription of a functional stk33 gene within the subclass Palaeognathae, but we could only find degenerate remnants of stk33 in all neognath birds analyzed. This led us to the conclusion that stk33 became a unitary pseudogene in the evolutionary history of the class Aves at the paleognath-neognath branch point during the late cretaceous period about 100 million years ago. We hypothesize that the pseudogenization of stk33 might have become fixed in neognaths due to either genetic redundancy or a non-orthologous gene displacement and present potential candidate genes for such an incident.
Serine/threonine kinase 33 (stk33) was first discovered in the course of comparative genome analyses of human chromosome 11p15.3 and its orthologous region in the mouse [1–3]. This region is highly syntenic (see Additional file 1: Figure S1) and known to be associated with several diseases including different types of cancer [4–8]. Stk33 is predominantly expressed in testis, certain brain regions and embryonic organs such as brain, heart and spinal cord in human and mouse, implying that it might be involved in spermatogenesis and organ ontogenesis . Its protein product has been classified as a member of the calcium/calmodulin-dependent kinase group . STK33 has been shown to co-localize with the intermediate filament protein vimentin in tanycytes of mouse, rat and hamster  and kinase assays demonstrated that STK33 is able to phosphorylate vimentin in vitro . STK33 and vimentin are associated in vivo and this interaction is not mediated by any additional protein as demonstrated by co-immunoprecipitation and co-sedimentation assays . Thus, STK33 might be involved in the dynamics of the intermediate filament cytoskeleton by phosphorylating vimentin . Furthermore, stk33 has attracted attention since it has been identified as an essential component for the survival of KRAS-dependent cancer cells by high-throughput RNA interference (RNAi) screens [12–14]. It has also been associated with pancreatic cancer . However, the role of stk33 in KRAS-dependent cancer cells remains disputed because other research groups were not able to find a correlation between the knockdown or inhibition of stk33 and KRAS-dependent cancer cell survival, neither using RNAi nor small molecule inhibitors [16–19]. Nevertheless, recent studies found an association of stk33 with colorectal cancer , hepatocellular carcinoma , hypopharyngeal squamous cell carcinoma  and lung cancer .
To date, stk33 has been found only in the subphylum Vertebrata. So far it has been found in multiple vertebrate classes including mammals, reptiles, amphibians and fish, but not in birds (A phylogeny of all known STK33 proteins has been built, see Additional file 1: Figure S3). Neither the analyses of the International Chicken Genome Sequencing consortium , nor our own screening of the chicken (Gallus gallus) genome for stk33 revealed a functional copy (see below). So far, this was the only vertebrate class with no indication for a functional stk33. Since stk33 genes are present as well in mammalians as in phylogenetically older vertebrate species (see Additional file 1: Figure S3), we wanted to know whether birds do not have stk33 genes in general or if the chicken is an exception. Therefore, we analyzed the genomes and transcriptomes of more than 30 different bird species using deep sequencing and publicly available next generation sequencing data .
The class of Aves comprises about 10,500 extant species  and the majority of these species arose from a rapid radiation after a mass extinction event about 66 million years ago within neonaves [27, 28]. It is divided into the clade Palaeognathae, including the flightless ratites and the tinamous, and the clade Neognathae, including all other extant bird species [27, 29, 30].
In this article, we report that stk33 is present and expressed in paleognath birds, but has become a unitary pseudogene within the subclass Neognathae during the cretaceous period about 100 million years ago.
First, we analyzed the chicken (Gallus gallus) genome for the presence of a functional copy of stk33. In human (Homo sapiens) and mouse (Mus musculus) stk33 is located in a highly syntenic region on chromosome 11p15.3 and chromosome 7E3, respectively . In the chicken this region could be identified on chromosome 5, whereas in the chicken no stk33 has been annotated in this region (see Additional file 1: Figure S1). Therefore we screened the whole genome and this region in particular for the presence of stk33. These genomic screens revealed the presence of only eight remnant exons containing both frameshifting indels and multiple stop codons (Fig. 3) in this region and multiple large genomic deletions that have led to the deletion of several exons. Extensive BLASTn searches of the chicken genome did not reveal a duplication or a retrotransposition event for stk33.
This led us to the conclusion that stk33 has become a unitary pseudogene in the chicken, raising the question if the chicken is an exception or if stk33 is missing in birds in general.
Stk33 is present and expressed in the subclass Palaeognathae
To answer this question, we first reconstructed the stk33 genomic locus and the intron-exon structure in the most basal extant bird, the African ostrich (Struthio camelus) which belongs to the subclass Palaeognathae. Illumina sequencing of total RNA generated from both cerebellum and testis yielded 102,624,190 reads (Table 2). Since no genomic data was available as a reference at the time, we performed de novo assemblies of the Illumina reads using k-mer sizes between 20 and 64. These assemblies yielded between 42,726 and 67,174 contigs with an average length between 781 and 1010 bp. All contigs from all assemblies were analyzed by BLASTn searches against the non-redundant nucleotide collection (nr/nt) database. A total of 16 assemblies contained stk33 relevant contigs, which were assembled into a single scaffold of 4,020 bp length. For a preliminary annotation of exons on this scaffold the publicly available stk33 sequence from the green anole (Anolis carolinensis) [GenBank:XM_008117649] was used. These exons were verified by RT-PCR and the intron-exon boundaries were determined using intron-spanning PCRs on genomic DNA followed by Sanger sequencing. We obtained the full length mRNA sequence and identified two alternative transcriptional start sites as well as two different polyadenylation sites by RACE PCR. In summary, a stk33 mRNA sequence of 3,366 bp with a total of 14 exons was established (Fig. 1), all flanked by consensus splice sites (GT/AG). For the genomic structure of stk33 all introns and the promoter region were amplified by PCR. Finally a genomic contig of 62,778 bp was assembled [GenBank:KP072780]. In the promoter region, two GC-box elements were identified at positions −43 and -55. For verification, the Illumina transcriptome reads were mapped back to the genomic reference contig (Fig. 1).
The ostrich stk33 mRNA sequence contains a single open reading frame (ORF) of 1,635 bp encoding a 544 amino acid protein of 59.9 kDa that shows highly significant (E = 0.0) similarity to STK33 of the American alligator (Alligator mississippiensis), green sea turtle (Chelonia mydas), Chinese softshell turtle (Pelodiscus sinensis) and other reptiles. Protein analysis employing InterPro  showed the presence of a serine/threonine protein kinase catalytic domain, an ATP binding site and an active site, suggesting that all features of an active kinase are present (Fig. 2). The ostrich STK33 catalytic domain shows high similarity to STK33 proteins from other species (Fig. 2). A GC-rich region of 241 bp was found at the 5′ end of the transcribed region with a GC content of 81 % that seems to be unique for the ostrich (Fig. 1). Mappings of the transcriptomic data to the genomic reference contig revealed two alternative splice variants, skipping either exon 5 or exon 13 (Fig. 1). All splice variants were verified by PCR using splice variant specific primers. The existence of a splice variant lacking both exon 5 and exon 13 was precluded by PCR. The splice variant lacking exon 13 shows no change within the functional domains, whereas the splice variant lacking exon 5 contains a by 35 aa shortened serine/threonine kinase catalytic domain. However both the active site and the ATP binding site are not affected in both splice variants.
Encouraged by our findings that stk33 is expressed in the African ostrich, we investigated if the genomes of other paleognath birds contain a functional stk33 gene. For this purpose we chose the closely related emu (Dromaius novaehollandiae) and the white-throated-tinamou (Tinamus guttatus). For the emu we used transcriptomic Illumina data of the embryonic brain with a total of 51.1 Mio reads (Illumina) and 182,922 reads (454) from adult brain (Table 2). Since the publicly available genomic data [SRA: SRP019803] was of insufficient quality for the generation of a suitable reference sequence, we mapped the emu transcriptomic reads to the ostrich stk33 mRNA sequence and extracted the consensus sequence. The transcriptomic reads were mapped back to this consensus sequence for verification. This led to an mRNA sequence of 2,560 bp in length split into 14 exons. The mRNA sequence contains a single open reading frame of 1,635 bp which shows 94 % identity (E = 0.0) with the ostrich mRNA sequence. The emu ORF encodes a 544 aa protein of 60.1 kDa. Amino acid sequence alignments of the ostrich and emu STK33 proteins show 97 % identity (E = 0.0) within the catalytic domain (Fig. 2). Across the whole protein sequence the similarity is 91 %. The emu STK33 also contains a serine/threonine protein kinase catalytic domain, an ATP binding site and an active site.
For the white-throated-tinamou we focused on the genomic analysis, since no transcriptomic SRA data is available. This analysis showed that the white-throated-tinamou stk33 contains only 13 Exons instead of the 14 seen in the African ostrich and emu, due to a large genomic deletion of 2.5 kb, including exon 13 [GenBank:BK008887]. However, the deletion of exon 13 does not interrupt the open reading frame and there are no indels or point mutations present elsewhere in the coding region, that would lead to frameshifts or premature stop codons. The same applies to a 6 bp deletion within exon 12 (Fig. 3). Furthermore, the African ostrich is expressing a splice variant lacking exon 13, which is not crucial for the function of STK33 as a kinase, because this exon is not part of the serine/threonine kinase catalytic domain. In summary, we can conclude that stk33 is present and expressed in all birds of the subclass of Palaeognathae.
Stk33 has become a unitary pseudogene in Neognathae
Since we had evidence that a functional copy of stk33 is missing in the genome of the chicken, we wanted to know whether this is a particular case for galliformes or whether this is generally the case for all neognath birds. Therefore, we examined a total of 28 neognaths (Additional file 1: Table S2) on a genomic level and four on a transcriptional level in order to achieve a good sample distribution over the Neognathae pedigree (see Additional file 1: Figure S2) [29, 27, 30]. The genomic studies were carried out as outlined for the white-throated-tinamou.
All of the 28 neognaths analyzed showed only remnants of non-functional stk33 sequences. Frequent large genomic deletions of several kilobases within the stk33 locus were observed, leading to the loss of several exons. BLASTx searches of the remnant exons showed multiple frameshifting indels and frequent SNPs, leading to the generation of multiple premature stop codons within the stk33 ORF (Fig. 3 and Additional file 2: Figure S5). Even the closest relative to the paleognath birds in our list, the mallard (Anas platyrhynchos) shows a severely degenerated stk33, including the deletion of six exons, several indels and multiple premature stop codons (Fig. 3). In the Anna’s hummingbird (Calypte anna) an additional genomic inversion of a 3 kb fragment of the stk33 gene is present, leading to the inversion of the 5′ end of exon 2 (Fig. 3). All birds examined belonging to the order Passeriformes (golden-collared manakin, American crow, Tibetan ground tit, collared flycatcher, zebra finch, white-throated sparrow, medium ground finch), which represents about 60 % of all extant avian species, showed deletions of the exons 3, 5, 8, 12 and 13, several indels and premature stop codons (see Fig. 3 and Additional file 1: Figure S1). All other neognaths show similar severe mutations (see Additional file 1: Figure S1). Interestingly, all neognaths examined share the deletion of exon 3.
We have investigated whether stk33 is still transcribed in spite of the many mutations found in the neognath stk33 genes. We also looked for the possibility that in neognaths stk33 might have potentially acquired new exons in the course of evolution and thus, a functional variant of stk33 might have been restored. For this analysis, we downloaded several transcriptomic SRA data sets (Table 2) of tissues in which transcription of stk33 has been described previously [9, 10]. The data included more than 1.5 Billion reads from chicken (Gallus gallus), about 200 million reads from the white-throated sparrow (Zonotrichia albicollis) and 50 million reads from the collared flycatcher (Ficedula albicollis). In addition, we sequenced cDNA generated from zebra finch (Taeniopygia guttata) brain, yielding 9.5 million reads (Table 2). These data sets were then mapped to the appropriate genomic stk33 target region and to the stk33 mRNA sequence from the African ostrich. In all this data we could not detect any stk33-related sequences. Furthermore, RT-PCR from cDNA using pairs of primers designed to the remnant exon sequences did not yield any products.
In order to exclude that a duplication or a retrotransposition of stk33 occurred during the course of evolution, we performed both transcriptional de novo assemblies and extensive BLASTn searches of the WGS assemblies. These analyses did not reveal any duplicated or retrotransposed stk33 copies elsewhere in the genome, so that we can conclude that stk33 turned into a unitary pseudogene in neognaths and is no longer expressed.
In this study we used a combination of large scale next generation sequencing data analysis and PCR to elucidate the genomic structure and the transcription of the stk33 gene in the phylogenetic class of Aves. In both ostrich and emu, belonging to the phylogenetically older Palaeognathae, we were able to prove the presence of a functional stk33 which is also undoubtedly transcribed. In the third paleognath, the white-throated tinamou, genomic analysis showed the presence of 13 exons with no frameshifting indels or premature stop codons within the coding region. We therefore conclude that the white-throated tinamou, as ostrich and emu, carries a functional copy of stk33. In the future this will need to be proven by transcriptomic and/or proteomic analyses. In summary, we can conclude that stk33 is present and expressed in all paleognath birds.
In neognaths we found only non-functional remnants of stk33. There were no transcripts of these stk33 remnants detectable. Genomic deletions, indels and premature stop codons clearly led to non-functionality of the degenerate stk33 remnants (Fig. 3 and Additional file 2: Figure S5). Furthermore, there is neither evidence for a functional stk33 copy elsewhere in the genome nor are new exons acquired to compensate for those that have been lost. This leads us to the conclusion that stk33 has become a pseudogene in neognaths during the evolutionary history of the class Aves. Although there is ample evidence that no gene duplication occurred, a copy of stk33 could have been retrotransposed back into the genome and somehow become a functional substitute for the original stk33. But since these so called processed pseudogenes are generally considered to be “dead on arrival” [32, 33], due to the lack of a promoter, this scenario would be considered highly unlikely. In addition, all attempts to identify any copies of stk33 elsewhere in any neognath genome failed, so that we are rather confident that this subclass of birds is the only one among vertebrates lacking stk33, as can be inferred from those vertebrates with a present STK33 (see Additional file 1: Figure S3). Such gene loss is a common and important evolutionary process and has been described in bacteria , eukaryotes  and archaea . It is very abundant in some lineages such as bacteria or tunicates, that have undergone considerable genome reduction during their evolution [34, 37–40]. Birds also have undergone gene loss and genome reduction compared to other tetrapod classes, but not to the same extent as in bacteria or tunicates [41–43].
It is interesting that all neognaths lack exon 3, indicating that this chromosomal deletion may have been the initial mutation leading to the degeneration of stk33. This mutation most likely occurred at a very early stage in the diversification of the class of Aves during the late cretaceous period about 100 million years ago, at the branching point of Palaeognathae and Neognathae (see Additional file 1: Figure S2) [27, 29, 30]. Interestingly, a current large scale comparative genomics study in birds could show that the origin of neognath birds was accompanied by an elevated rate of genomic rearrangements . This and the long time span are in accordance with the drastic changes seen in the stk33 locus in neognaths. There are cases where pseudogenes have gained new functions expressing non-coding RNAs that can have a variety of different regulatory functions . However, in spite of intensive searches we could not detect any stk33 related transcripts in any of the neognath bird tissues analyzed. Therefore, a function of the stk33 pseudogene is highly unlikely. This leads to the question, which gene could have taken over the function of stk33 in neognaths or if its function has become dispensable.
In the following we would like to discuss different hypothesis about how the loss of stk33 could have been tolerated and become fixed in neognath birds. One possibility could be, that the function of the stk33 gene could have become dispensable, leading to the fixation of the pseudogene by genetic drift . We believe this to be unlikely since stk33 has been implicated in critical functions such as spermatogenesis and organogenesis  and there is no obvious indication to assume that neognath birds can manage without these functions of stk33 whereas paleognath birds and other vertebrates cannot. Following this assumption, stk33 could have undergone subfunctionalization, a process where a pair of genes that resulted from a duplication event retained different subfunctions of the original gene [46, 47]. If one of these subfunctions should have become dispensable, the corresponding paralogous gene would have acquired the observed deleterious mutations. But since we could not find any evidence for a stk33 duplication event in neognath birds, we can preclude this possibility.
A third possibility is given by the “less is more” hypothesis. It states that gene loss can be an advantageous event and thus can serve as an engine of evolutionary change [48, 49]. Even though this may seem contradictory, since it suggests evolutionary innovation by degeneration, there are several examples of such a positive selection for gene loss [50–52]. One could argue that the loss of stk33 may be beneficial, since it has been identified as an essential component in several types of cancer [12–15, 20–23]. However, the role of stk33 in cancer remains controversial [16–19] and we do not have any indication for a beneficial effect due to the lack of stk33, making this a rather unlikely hypothesis. Nevertheless, analyzing if neognaths display any form of immunity to these types of cancer could represent an interesting approach, since they constitute natural knockouts for stk33.
Another possible explanation for the toleration of gene loss is the phenomenon of genetic redundancy, which can be caused by either gene duplication or convergent evolutionary processes, leading to proteins that are performing the same function, but are unrelated in sequence [53, 54]. Due to the lack of selective pressure on these genes, genetic redundancy would be considered to be evolutionary unstable. However, genetic redundancy appears to be quite frequent in genomes of higher organisms, implying that redundant genes can be evolutionary stable under certain conditions [54–58]. One model describing a stable genetic redundancy takes pleiotropy into consideration. If gene A performs function f1 and gene B performs both functions f1 and f2, but f1 with a slightly reduced efficacy, this genetic redundancy would be considered as evolutionary stable . The presence of different splice variants in ostrich, human and mouse  strongly indicates that STK33 has more than one function and that vimentin may not be its only substrate . This is supported by the fact that STK33 can also be found in vimentin-negative cells . If we presume stk33 to be gene B and assume that the function f2 has become irrelevant to the neognath lifestyle, stk33 would have inevitably been lost in neognath birds.
A variant of the latter possibility is that stk33 may have become dispensable, because it has been functionally substituted by a non-orthologous, but functionally analogous kinase in neognaths, a process called non-orthologous gene displacement (NOD). Several examples of such a NOD have been reported [60–64].
Regardless of whether genetic redundancy or non-orthologous gene displacement is causal for the pseudogenization of stk33 in neognath birds, a second kinase that is able to phosphorylate vimentin has to be involved. For this, only kinases that are able to phosphorylate vimentin in the same regions as STK33 are considered as candidates. So far, the exact phosphorylation sites of STK33 have not yet been determined, but they have been circumscribed to at least one phosphorylation site between amino acids 30 and 42 and at least a second phosphorylation site located between amino acids 50 and 80 within the amino-terminal non-α-helical head domain by kinase assays using truncated vimentin mutants . A total of ten kinases has been shown to phosphorylate vimentin (see Table 1) [65–80]. All these kinases are present in birds . Since vimentin could be regulated differently in neognaths or birds in general, an alignment of the vimentin amino-terminal head domain has been built. This alignment shows that most of the phosphorylation sites are conserved between human, mouse and birds (Additional file 1: Figure S4). This suggests that vimentin is regulated in a similar way compared to human and mouse.
The phosphorylation sites of two kinases, Plk1 and p37, are located outside the amino-terminal head domain [79, 80] and thus probably are no candidates that could substitute stk33. Three other kinases (Pak1, CDK1, Aurora kinase B) are also not suitable candidates since they only have a single phosphorylation site [73–78]. Even though MAPKAP kinase 2 and CAMKII have at least one phosphorylation site within the same region that is phosphorylated by STK33, they both also phosphorylate Ser-82 [68, 69, 71, 72], making them unlikely substituents for STK33. The three remaining kinases, Rho-kinase, protein kinase C and protein kinase A all have 2 phosphorylation sites within the same region as STK33 (Table 1) [65–67, 70]. However the vimentin alignment clearly shows that Ser-50, which is phosphorylated by protein kinase C [66, 67], is not conserved in birds (Additional file 1: Figure S4), making it a rather unlikely candidate. Thus Rho-kinase and protein kinase A represent the best candidates to compensate for the loss of stk33 in neognath birds, since their phosphorylation sites are within the same region as those for stk33 and are highly conserved in birds (Additional file 1: Figure S4).
Further studies about stk33 on the genomic and protein level, especially about the exact phosphorylation sites of STK33, are necessary to further elucidate its function and to give additional indications towards the actual substituent. Without further knowledge all hypothesis remain speculative.
Here, we elucidate and describe the genomic structure and the transcription of the serine/threonine kinase 33 (stk33) in paleognath birds, such as the African ostrich (Struthio camelus), emu (Dromaius novaehollandiae) and the white-throated tinamou (Tinamus guttatus) using both large scale next generation sequencing data analysis and PCR. We were able to show that all three paleognath birds analyzed carry functional and expressed copies of stk33 and thus, we concluded that stk33 is present and expressed in paleognath birds in general, as it is the case in all other vertebrate classes.
Surprisingly, in neognaths birds we found only degenerate remnants of stk33 that have become non-functional by genomic deletions, indels and premature stop codons (Fig. 3 and Additional file 2: Figure S5). Furthermore, no transcripts of these remnant exons could be detected. Interestingly, all neognath birds share the deletion of exon 3. These results lead us to the conclusion that stk33 became a unitary pseudogene in the evolutionary history of the class Aves during the late cretaceous period about 100 million years ago, at the branching point of Palaeognathae and Neognathae that was accompanied by an elevated rate of genomic rearrangements in neognaths [27, 29, 30]. We hypothesize that this pseudogenization of stk33 might have become fixed in Neognathae due to either genetic redundancy or a non-orthologous gene displacement (NOD). Rho-kinase and protein kinase A have been identified as the most promising candidate genes by comparing their vimentin phosphorylation sites to those of STK33.
All procedures concerning animals were performed in accordance with the published Directive 2010/63/EU of the European Parliament under a protocol approved by the local Administration District Official Committee. Moreover, all efforts were made to minimize the number of animals and their suffering.
Tissues expected to have high stk33 expression were obtained from three bird species: zebra finch (Taeniopygia guttata), chicken (Gallus gallus) and African ostrich (Struthio camelus).
Adult male zebra finch (Taeniopygia guttata) were obtained from a local pet store, anaesthetized with carbon dioxide, killed by cervical dislocation and used for the preparation of brain and testis. Fertilized chicken (Gallus gallus) eggs were obtained from a local breeder and incubated for 6–17 days at 38 °C. After incubation eggs were opened and either whole embryos or the embryonic brain were isolated. Testes and complete heads from the African ostrich (Struthio camelus) were obtained from a local breeder. Ostrich heads were cut in half for preparation of the Cerebellum. After preparation all material was stored at −80 °C until further use.
Sample preparation and sequencing
Tissue material was homogenized using a syringe and cannula. RNA isolation was performed using the RNeasy Mini Kit (Qiagen, Hilden, Germany), High Pure RNA Tissue Kit (Roche, Penzberg, Germany), PureLink® RNA Mini Kit (Life Technologies, Carlsbad, USA) or the GTC-method . Subsequently, the RNA quality was tested using the 2100 Bioanalyzer (Agilent, Santa Clara, USA). DNA was isolated with the peqGOLD Tissue DNA Mini Kit (PeqLab, Erlangen, Germany).
1.5 μg of total RNA isolated from ostrich cerebellum and testis were each used for the library preparation using the TruSeq RNA Sample PrepV2 Kit (Illumina, San Diego, USA) and a 100 bp paired-end run was performed on an Illumina HiSeq 2000 (IMSB, Mainz, Germany). For zebra finch adult brain RNA sequencing, library preparation was carried out by GENterprise Genomics (Mainz, Germany) using 1.5 μg of total RNA and samples were sequenced in a 150 bp paired-end run on an Illumina HiSeq 2500 (IMSB, Mainz, Germany).
Handling of SRA data
Genomic and transcriptomic data was downloaded from the NCBI SRA archive (http://www.ncbi.nlm.nih.gov/sra) and converted to fastq.gz files using the SRA Toolkit v2.3.5-2 . An overview of all SRA files used in this study is shown in Table 2. The fastq.gz files were imported into CLC Genomics Workbench version 6.5.1 (CLC bio, Aarhus, Denmark) and trimmed as described below.
Trimming of Illumina data
Sequencing reads were trimmed for quality and adapter sequences using CLC Genomics Workbench (CLC bio, Aarhus, Denmark). Quality trimming was done using an error probability limit of 0.01 and a maximum of 1 ambiguous base. All sequences below a read length of 15 bp were discarded.
De novo assemblies and mappings
De novo assemblies of genomic and transcriptomic reads were conducted in CLC Genomics Workbench (CLC bio, Aarhus, Denmark) using standard parameters. Stk33 contigs were identified using local BLASTn/BLASTx searches against a stk33 database containing all available stk33 sequences.
The feature “map reads to reference” of the CLC Genomics Workbench (CLC bio, Aarhus, Denmark) was used to map transcriptomic reads to transcriptomic contigs and “large gap read mapping” was employed to map transcriptomic data to genomic contigs. For intraspecies mappings a similarity of 98 % for at least 95 % of the read length was required, for interspecies mappings this similarity was reduced to 80 % similarity across 80 % of the read length. Mismatches were given a penalty of 2 and insertions or deletions were given a penalty of 3.
Stk33 is located in a highly syntenic region spanning from st5 to ric3 (Additional file 1: Figure S1). In order to obtain the genomic target regions for stk33 of the different birds we downloaded the annotated genomic contigs containing the region between genes st5 and tub from UniGene. If these were not available we downloaded all assembled genomic contigs of the respective genome assembly project from the NCBI genome database (Additional file 1: Table S1) and identified contigs located in the stk33 target region by local BLASTn searches with the genomic stk33 contig from ostrich as reference. Putative exons of stk33 were identified by comparison with the ostrich exons. Premature stop codons were identified by local BLASTx searches of single exons with the full length STK33 protein sequence from ostrich as reference.
PCR and Sanger sequencing
cDNA was generated employing SuperScript® III reverse transcriptase (Invitrogen, Carlsbad, USA). Traditional PCRs were carried out using GoTaq® Polymerase (Promega, Fitchburg, USA) or Q5® High-Fidelity DNA Polymerase (NEB, Ipswich, USA). Takara LA Taq® DNA Polymerase (Takara, Otsu, Japan) was used for long Range PCRs. Primers were purchased from Invitrogen (Carlsbad, USA) and Sigma Aldrich (St. Louis, USA). DNA was denatured at 98 °C for 30 s, annealing was carried out at 60 °C for 30 s and extension was performed at 72 °C with an extension time of 1 min/kb. RACE PCRs were done using the GeneRacer™ Kit (Invitrogen, Carlsbad, USA). Sanger sequencing was carried out by StarSeq (Mainz, Germany).
Availability of supporting data
All sequencing data generated in this study is available from the SRA-Archive (http://www.ncbi.nlm.nih.gov/sra) under the accessions numbers SRX738987 for the zebra finch, SRX738984 and SRX736627 for the African ostrich. Stk33 sequences generated have been deposited into the GenBank database (http://www.ncbi.nlm.nih.gov/genbank/). Accession numbers are available from Additional file 1: Table S2. Due to GenBank limitations all sequences have also been included in Additional file 3.
Mujica AO, Hankeln T, Schmidt ER. A novel serine/threonine kinase gene, STK33, on human chromosome 11p15.3. Gene. 2001;1–2:175–81.
Amid C, Bahr A, Mujica A, Sampson N, Bikar SE, Winterpacht A, et al. Comparative genomic sequencing reveals a strikingly similar architecture of a conserved syntenic region on human chromosome 11p15.3 (including gene ST5) and mouse chromosome 7. Cytogenet Cell Genet. 2001;3–4:284–90.
Cichutek A, Brueckmann T, Seipel B, Hauser H, Schlaubitz S, Prawitt D, et al. Comparative architectural aspects of regions of conserved synteny on human chromosome 11p15.3 and mouse chromosome 7 (including genes WEE1 and LMO1). Cytogenet Cell Genet. 2001;3–4:277–83.
Redeker E, Alders M, Hoovers JM, Richard CW, Westerveld A, Mannens M. Physical mapping of 3 candidate tumor suppressor genes relative to Beckwith-Wiedemann syndrome associated chromosomal breakpoints at 11p15.3. Cytogenet Cell Genet. 1995;3–4:222–5.
Bepler G, Koehler A. Multiple chromosomal aberrations and 11p allelotyping in lung cancer cell lines. Cancer Genetics and Cytogenetics. 1995;1:39–45.
Nowak NJ, Shows TB. Genetics of chromosome 11: loci for pediatric and adult malignancies, developmental disorders, and other diseases. Cancer Invest. 1995;6:646–59.
Karnik P, Paris M, Williams BR, Casey G, Crowe J, Chen P. Two distinct tumor suppressor loci within chromosome 11p15 implicated in breast cancer progression and metastasis. Human molecular genetics. 1998;5:895–903.
Karnik P, Chen P, Paris M, Yeger H, Williams BRG. Loss of heterozygosity at chromosome 11p15 in Wilms tumors: identification of two independent regions. Oncogene. 1998;2:237–40.
Mujica AO, Brauksiepe B, Saaler-Reinhardt S, Reuss S, Schmidt ER. Differential expression pattern of the novel serine/threonine kinase, STK33, in mice and men. FEBS J. 2005;19:4884–98.
Brauksiepe B, Baumgarten L, Reuss S, Schmidt ER. Co-localization of serine/threonine kinase 33 (Stk33) and vimentin in the hypothalamus. Cell Tissue Res. 2014;1:189–99.
Brauksiepe B, Mujica AO, Herrmann H, Schmidt ER. The Serine/threonine kinase Stk33 exhibits autophosphorylation and phosphorylates the intermediate filament protein Vimentin. BMC Biochem. 2008;9:25.
Scholl C, Fröhling S, Dunn IF, Schinzel AC, Barbie DA, Kim SY, et al. Synthetic lethal interaction between oncogenic KRAS dependency and STK33 suppression in human cancer cells. Cell. 2009;5:821–34.
Azoitei N, Hoffmann CM, Ellegast JM, Ball CR, Obermayer K, Gößele U, et al. Targeting of KRAS mutant tumors by HSP90 inhibitors involves degradation of STK33. J Exp Med. 2012;4:697–711.
Frohling S, Scholl C. STK33 Kinase is not essential in KRAS-dependent cells-letter. Cancer Research. 2011;24:7716.
Carter H, Samayoa J, Hruban RH, Karchin R. Prioritization of driver mutations in pancreatic cancer using cancer-specific high-throughput annotation of somatic mutations (CHASM). Cancer Biol Ther. 2010;6:582–7.
Babij C, Zhang Y, Kurzeja RJ, Munzli A, Shehabeldin A, Fernando M, et al. STK33 Kinase activity is nonessential in KRAS-dependent cancer cells. Cancer Research. 2011;17:5818–26.
Luo T, Masson K, Jaffe JD, Silkworth W, Ross NT, Scherer CA, et al. STK33 kinase inhibitor BRD-8899 has no effect on KRAS-dependent cancer cell viability. Proceedings of the National Academy of Sciences. 2012;8:2860–5.
Weïwer M, Spoonamore J, Wei J, Guichard B, Ross NT, Masson K, et al. A potent and selective quinoxalinone-based STK33 inhibitor does not show synthetic lethality in KRAS-dependent cells. ACS Med Chem Lett. 2012;12:1034–8.
Spoonamore J, Weïwer M, Wei J, Guichard B, Ross NT, Masson K, et al. Screen for Inhibitors of STK33 Kinase Activity. [http://www.ncbi.nlm.nih.gov/books/NBK133418/]
Moon JW, Lee SK, Lee JO, Kim N, Lee YW, Kim SJ, et al. Identification of novel hypermethylated genes and demethylating effect of vincristine in colorectal cancer. J Exp Clin Cancer Res. 2014;33:4.
Yang T, Song B, Zhang J, Yang G, Zhang H, Yu W, et al. STK33 promotes hepatocellular carcinoma through binding to c-Myc. Gut;2014. doi:10.1136/gutjnl-2014-307545. [Epub ahead of print].
Huang L, Chen C, Zhang G, Ju Y, Zhang J, Wang H, et al. STK33 overexpression in hypopharyngeal squamous cell carcinoma: possible role in tumorigenesis. BMC cancer. 2015;1:13.
Wang P, Cheng H, Wu J, Yan A, Zhang L. STK33 plays an important positive role in the development of human large cell lung cancers with variable metastatic potential. Acta biochimica et biophysica Sinica. 2015;3:214–23.
International Chicken Genome Sequencing Consortium. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004;7018:695–716.
The NCBI Sequence Read Archive. [http://www.ncbi.nlm.nih.gov/Traces/sra/]
Frank Gill, David Donsker. IOC World Bird List. [http://www.worldbirdnames.org/]
Jarvis ED, Mirarab S, Aberer AJ, Li B, Houde P, Li C, et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science. 2014;6215:1320–31.
Feduccia A. ‘Big bang’ for tertiary birds? Trends in Ecology & Evolution. 2003;4:172–6.
Jetz W, Thomas GH, Joy JB, Hartmann K, Mooers AO. The global diversity of birds in space and time. Nature. 2012;7424:444–8.
Zhang G, Li C, Li Q, Li B, Larkin DM, Lee C, et al. Comparative genomics reveals insights into avian genome evolution and adaptation. Science. 2014;6215:1311–20.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, et al. InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2012;Database issue:D306–12.
Petrov DA, Lozovskaya ER, Hartl DL. High intrinsic rate of DNA loss in Drosophila. Nature. 1996;6607:346–9.
Shemesh R, Novik A, Edelheit S, Sorek R. Genomic fossils as a snapshot of the human transcriptome. Proc Natl Acad Sci USA. 2006;5:1364–9.
Koskiniemi S, Sun S, Berg OG, Andersson DI. Selection-driven gene loss in bacteria. PLoS genetics. 2012;6, e1002787.
Krylov DM, Wolf YI, Rogozin IB, Koonin EV. Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. Genome Research. 2003;10:2229–35.
Wolf YI, Makarova KS, Yutin N, Koonin EV. Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer. Biology direct. 2012;7:46.
Vienne A. Metaphylogeny of 82 gene families sheds a new light on chordate evolution. Int J Biol Sci. 2006;2(2):32–7.
Berná L, Alvarez-Valin F. Evolutionary genomics of fast evolving tunicates. Genome biology and evolution. 2014;7:1724–38.
Dehal P, Satou Y, Campbell RK, Chapman J, Degnan B, de Tomaso A, et al. The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins. Science (New York, NY). 2002;5601:2157–67.
Holland LZ, Gibson-Brown JJ. The Ciona intestinalis genome: when the constraints are off. BioEssays. 2003;6:529–32.
Lovell PV, Wirthlin M, Wilhelm L, Minx P, Lazar NH, Carbone L, et al. Conserved syntenic clusters of protein coding genes are missing in birds. Genome Biol. 2014;12:117.
Hughes AL, Friedman R. Genome size reduction in the chicken has involved massive loss of ancestral protein-coding genes. Molecular biology and evolution. 2008;12:2681–8.
Newman SA, Mezentseva NV, Badyaev AV. Gene loss, thermogenesis, and the origin of birds. Annals of the New York Academy of Sciences. 2013;1289:36–47.
Groen JN, Capraro D, Morris KV. The emerging role of pseudogene expressed non-coding RNAs in cellular functions. Int J Biochem Cell Biol. 2014;54:350–5.
Zhang ZD, Frankish A, Hunt T, Harrow J, Gerstein M. Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primates. Genome biology. 2010;3:R26.
Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J. Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999;4:1531–45.
Lynch M, Force A. The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000;1:459–73.
Olson MV. When less is more: gene loss as an engine of evolutionary change. American journal of human genetics. 1999;1:18–23.
Olson MV, Varki A. Sequencing the chimpanzee genome: insights into human evolution and disease. Nature reviews Genetics. 2003;1:20–8.
Galili U, Swanson K. Gene sequences suggest inactivation of alpha-1,3-galactosyltransferase in catarrhines after the divergence of apes from monkeys. Proceedings of the National Academy of Sciences of the United States of America. 1991;16:7401–4.
Chou H, Hayakawa T, Diaz S, Krings M, Indriati E, Leakey M, et al. Inactivation of CMP-N-acetylneuraminic acid hydroxylase occurred prior to brain expansion during human evolution. Proceedings of the National Academy of Sciences of the United States of America. 2002;18:11736–41.
Stedman HH, Kozyak BW, Nelson A, Thesier DM, Su LT, Low DW, et al. Myosin gene mutation correlates with anatomical changes in the human lineage. Nature. 2004;6981:415–8.
Galperin MY, Walker DR, Koonin EV. Analogous enzymes: independent inventions in enzyme evolution. Genome Research. 1998;8:779–90.
Kafri R, Levy M, Pilpel Y. The regulatory utilization of genetic redundancy through responsive backup circuits. Proceedings of the National Academy of Sciences of the United States of America. 2006;31:11653–8.
Enns LC, Kanaoka MM, Torii KU, Comai L, Okada K, Cleland RE. Two callose synthases, GSL1 and GSL5, play an essential and redundant role in plant and pollen development and in fertility. Plant molecular biology. 2005;3:333–49.
Pearce AC, Senis YA, Billadeau DD, Turner M, Watson SP, Vigorito E. Vav1 and vav3 have critical but redundant roles in mediating platelet activation by collagen. The Journal of biological chemistry. 2004;52:53955–62.
Kafri R, Springer M, Pilpel Y. Genetic redundancy: new tricks for old genes. Cell. 2009;3:389–92.
Carracedo S, Sacher F, Brandes G, Braun U, Leitges M. Redundant role of protein kinase C delta and epsilon during mouse embryonic development. PLoS ONE. 2014;8, e103686.
Nowak MA, Boerlijst MC, Cooke J, Smith JM. Evolution of genetic redundancy. Nature. 1997;6638:167–71.
Koonin EV, Mushegian AR, Bork P. Non-orthologous gene displacement. Trends in genetics. 1996;9:334–6.
Marsh JJ, Lebherz HG. Fructose-bisphosphate aldolases: an evolutionary history. Trends in biochemical sciences. 1992;3:110–3.
Rutter WJ. Evolution of aldolase. Fed Proc. 1964;23:1248–57.
Perham RN. The fructose-1,6-bisphosphate aldolases: same reaction, different enzymes. Biochemical Society transactions. 1990;2:185–7.
Houten SM, Waterham HR. Nonorthologous gene displacement of phosphomevalonate kinase. Molecular genetics and metabolism. 2001;3:273–6.
Goto H. Phosphorylation of Vimentin by Rho-associated Kinase at a Unique Amino-terminal site that is specifically phosphorylated during cytokinesis. Journal of Biological Chemistry. 1998;19:11728–36.
Ogawara M, Inagaki N, Tsujimura K, Takai Y, Sekimata M, Ha MH, et al. Differential targeting of protein kinase C and CaM kinase II signalings to vimentin. J Cell Biol. 1995;4:1055–66.
Takai Y, Ogawara M, Tomono Y, Moritoh C, Imajoh-Ohmi S, Tsutsumi O, et al. Mitosis-specific phosphorylation of vimentin by protein kinase C coupled with reorganization of intracellular membranes. Journal of cell biology. 1996;1:141–9.
Stefanovic S, Windsor M, Nagata K, Inagaki M, Wileman T. Vimentin rearrangement during African swine fever virus infection involves retrograde transport along microtubules and phosphorylation of vimentin by calcium calmodulin kinase II. Journal of virology. 2005;18:11766–75.
Oguri T, Inoko A, Shima H, Izawa I, Arimura N, Yamaguchi T, et al. Vimentin-Ser82 as a memory phosphorylation site in astrocytes. Genes Cells. 2006;5:531–40.
Eriksson JE, He T, Trejo-Skalli AV, Härmälä-Braskén A, Hellman J, Chou Y, et al. Specific in vivo phosphorylation sites determine the assembly dynamics of vimentin intermediate filaments. J Cell Sci. 2004;Pt 6:919–32.
Cheng TJ, Lai YK. Identification of mitogen-activated protein kinase-activated protein kinase-2 as a vimentin kinase activated by okadaic acid in 9L rat brain tumor cells. Journal of cellular biochemistry. 1998;2:169–81.
Cheng T, Tseng Y, Chang W, Chang MD, Lai Y. Retaining of the assembly capability of vimentin phosphorylated by mitogen-activated protein kinase-activated protein kinase-2. Journal of cellular biochemistry. 2003;3:589–602.
Goto H, Yasui Y, Kawajiri A, Nigg EA, Terada Y, Tatsuka M, et al. Aurora-B regulates the cleavage furrow-specific vimentin phosphorylation in the cytokinetic process. The Journal of biological chemistry. 2003;10:8526–30.
Chou Y, Bischoff JR, Beach D, Goldman RD. Intermediate filament reorganization during mitosis is mediated by p34cdc2 phosphorylation of vimentin. Cell. 1990;6:1063–71.
Chou YH, Ngai KL, Goldman R. The regulation of intermediate filament reorganization in mitosis. p34cdc2 phosphorylates vimentin at a unique N-terminal site. The Journal of biological chemistry. 1991;12:7325–8.
Tsujimura K, Ogawara M, Takeuchi Y, Imajoh-Ohmi S, Ha MH, Inagaki M. Visualization and function of vimentin phosphorylation by cdc2 kinase during mitosis. The Journal of biological chemistry. 1994;49:31097–106.
Tang DD, Bai Y, Gunst SJ. Silencing of p21-activated kinase attenuates vimentin phosphorylation on Ser-56 and reorientation of the vimentin network during stimulation of smooth muscle cells by 5-hydroxytryptamine. Biochemical Journal. 2005;Pt 3:773–83.
Wang R, Li Q, Anfinogenova Y, Tang DD. Dissociation of Crk-associated substrate from the vimentin network is regulated by p21-activated kinase on ACh activation of airway smooth muscle. Am J Physiol Lung Cell Mol Physiol. 2007;1:L240–8.
Yamaguchi T, Goto H, Yokoyama T, Silljé H, Hanisch A, Uldschmid A, et al. Phosphorylation by Cdk1 induces Plk1-mediated vimentin phosphorylation during mitosis. The Journal of cell biology. 2005;3:431–6.
Chou YH, Opal P, Quinlan RA, Goldman RD. The relative roles of specific N- and C-terminal phosphorylation sites in the disassembly of intermediate filament in mitotic BHK-21 cells. Journal of Cell Science. 1996;109(Pt 4):817–26.
Chomczynski P, Sacchi N. Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem. 1987;1:156–9.
We thank Benjamin Rieger and Dr. Steffen Rapp for their bioinformatics support. We are grateful to Sabine Fischer and Dr. Romina Petersen for their helpful comments on the manuscript.
Financial support by the NMFZ (Naturwissenschaftlich-medizinisches Forschungszentrum) from the University Medical Center, Johannes Gutenberg-University Mainz is greatly acknowledged.
The authors declare that they have no competing interests.
TL participated in the design of the study, carried out the genomic and transcriptomic studies and drafted the manuscript. SL carried out the sample and library preparation for the ostrich and participated in the generation of the ostrich contig. DS provided material from chicken and participated in the analysis of the chicken. ERS conceived the study and participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.
Contains the following items: Table S1 All genomic contigs used in this study. Table S2 All birds examined in this study and their GenBank accessions numbers. Figure S1 Representation of the syntenic genomic locus containing stk33 in human, mouse and the chicken. Figure S1 Phylogenetic tree of the class Aves with indications of analyzed bird species. Figure S2 Phylogeny for STK33. Figure S3 Alignment of the vimentin amino-terminal head domain from different bird species with human and mouse. Figure S4 Alignment of the vimentin amino-terminal head domain from different bird species with human and mouse.
Sequence alignments of identified stk33 exons of remaining neognaths to African ostrich stk33 mRNA.
Contains the stk33 sequences from all birds examined in this study.
About this article
Cite this article
Lautwein, T., Lerch, S., Schäfer, D. et al. The serine/threonine kinase 33 is present and expressed in palaeognath birds but has become a unitary pseudogene in neognaths about 100 million years ago. BMC Genomics 16, 543 (2015). https://doi.org/10.1186/s12864-015-1769-9
- Serine/threonine kinase 33
- Genetic redundancy
- Non-orthologous gene displacement