Skip to main content
  • Research article
  • Open access
  • Published:

Developmentally regulated expression, alternative splicing and distinct sub-groupings in members of the Schistosoma mansoni venom allergen-like (SmVAL) gene family



The Sperm-coating protein/Tpx-1/Ag5/PR-1/Sc7 (SCP/TAPS) domain is found across phyla and is a major structural feature of insect allergens, mammalian sperm proteins and parasitic nematode secreted molecules. Proteins containing this domain are implicated in diverse biological activities and may be important for chronic host/parasite interactions.


We report the first description of an SCP/TAPS gene family (S chistosoma m ansoni venom allergen-like (SmVALs)) in the medically important Platyhelminthes (class Trematoda) and describe individual members' phylogenetic relationships, genomic organization and life cycle expression profiles. Twenty-eight SmVALs with complete SCP/TAPS domains were identified and comparison of their predicted protein features and gene structures indicated the presence of two distinct sub-families (group 1 & group 2). Phylogenetic analysis demonstrated that this group 1/group 2 split is zoologically widespread as it exists across the metazoan sub-kingdom. Chromosomal localisation and PCR analysis, coupled to inspection of the current S. mansoni genomic assembly, revealed that many of the SmVAL genes are spatially linked throughout the genome. Quantitative lifecycle expression profiling demonstrated distinct SmVAL expression patterns, including transcripts specifically associated with lifestages involved in definitive host invasion, transcripts restricted to lifestages involved in the invasion of the intermediate host and transcripts ubiquitously expressed. Analysis of SmVAL6 transcript diversity demonstrated statistically significant, developmentally regulated, alternative splicing.


Our results highlight the existence of two distinct SCP/TAPS protein types within the Platyhelminthes and across taxa. The extensive lifecycle expression analysis indicates several SmVAL transcripts are upregulated in infective stages of the parasite, suggesting that these particular protein products may be linked to the establishment of chronic host/parasite interactions.


Schistosomes are dioecious metazoan parasites of the phylum Platyhelminthes, which are estimated to infect more than 200 million people worldwide, with a further 600 million individuals living in the tropics and sub-tropics at risk of infection. The deposition of schistosome eggs within host tissues and the subsequent immune response elicited are the principal causes of chronic schistosomiasis, which can lead to a range of morbidities such as periportal fibrosis and granulomatous inflammation [1]. Despite the availability of a highly effective chemotherapeutic agent (praziquantel), recent reassessment of disease-related morbidity shows schistosomiasis to be a far greater public health problem than previously estimated [2]. This reappraisal of the impact of schistosomiasis and the potential emergence of praziquantel-resistant strains argues strongly for the identification and characterisation of novel vaccine and drug targets.

Schistosoma mansoni is one of three schistosome species that cause the vast majority of human infections and is the most extensively studied in the laboratory. Large-scale sequencing projects have created extensive S. mansoni expressed sequence tag (EST) and genomic databases leading to the identification of thousands of new genes, as well as providing a repository of information useful for post-genomic activities [3, 4]. In our search for novel chemotherapeutic and immunoprophylactic targets, we have utilised these sequence databases for construction of DNA microarrays to identify gender-associated and developmentally-regulated S. mansoni transcripts [5, 6]. One interesting finding from these investigations was the identification of two adult male-associated transcripts bearing sequence similarity to the SCP/Tpx-1/Ag 5/Pr-1/Sc7 (SCP/TAPS) family.

Members of the SCP/TAPS family (Pfam accession number no. PF00188; [7]) encode structurally related proteins found throughout the eukaryotic kingdom. All members contain a unique SCP/TAPS protein domain, which varies in length between 120 and 170 amino acids. Tertiary structural studies have demonstrated that this domain adopts a highly conserved α-β-α sandwich conformation [813]. The strong conservation of the tertiary structure and of particular residues within the domain have suggested that all SCP/TAPS domain containing proteins share a common biological activity [9]. However, no specific function has yet been ascribed to the SCP/TAPS domain, despite some biological roles having been linked to member proteins within the superfamily. Specifically, superfamily members have been linked to diverse processes including immune responses [1416], testis/sperm development [7, 17], envenomation [1820] and parasitic nematode invasion of definitive hosts [2123].

Collectively these data suggest that SCP/TAPS proteins participate in various biological activities across phyla and, as such, warrant further study in the Platyhelminthes as potential modulators of immune function, components of sexual development and candidates for novel vaccine strategies. Towards this end, we present the molecular characterisation of 13 SCP/TAPS family members in S. mansoni, hereafter referred to as Schistosoma mansoni Venom allergen-like 1–13 (SmVAL1-13). We additionally describe a further 15 members of the SmVAL family predicted in the most recent S. mansoni genome assembly [24] and confirm the transcription of eleven of these. Our comparison of the 28 genes provides evidence for the partitioning of the SmVAL family into two groups based on phylogenetic analysis, genomic structure comparison and specific protein feature inclusion. Furthermore, examples of both group 1 and group 2 SCP/TAPS protein types are identified in several metazoan species, supporting an evolutionarily conserved superfamily division across phyla. Analysis of SmVAL1-13 transcription across the schistosome lifecycle demonstrates a range of distinct expression patterns, with a subset exhibiting transcription tightly associated with the invasive stages of the parasite. The proposed functional role of individual SmVAL family members in host/parasite interactions and parasite-specific activities is discussed.


cDNA cloning and identification of SmVAL family members

To identify S. mansoni SCP/TAPS domain-containing family members, the Vespula vulgaris Wasp Venom Allergen 5 (Ves v 5, [Genbank:AAA30333]) protein sequence was used in a tBLASTn search of S. mansoni ESTs and Phat [25] predicted genes from version 1 of the publicly available S. mansoni genomic database [24]. Thirteen different members with significant sequence similarity to Ves v 5 were identified, twelve originating from EST contigs and one from a Phat predicted gene. After PCR confirmation of the sequences from parasite-derived cDNA, these transcripts were named Schistosoma mansoni Venom allergen-like transcripts (SmVAL1-13). The molecular details of these thirteen sequences are shown in Table 1. The available Hidden-Markov Model (HMM) consensus of the SCP/TAPS domain (PF00188; [26]) was subsequently used in a search of the most recent release of the genomic database (version 4). This identified 15 additional family members with complete SCP/TAPS domains, and a further 7 with incomplete SCP/TAPS domains. The molecular details of these sequences are shown in Table 2.

Table 1 Molecular Details/Characteristics of SmVAL1-13.
Table 2 Molecular Details/Characteristics of SmVAL14-28.

Complete open-reading frames (ORFs) for SmVAL1-13 were predicted by inspection of genomic sequence data, and these representative members (out of the total 28 with full SCP/TAPS domains) were confirmed by PCR amplification from parasite-derived cDNA. Additional full-length mRNA sequences for SmVAL1 (isolated from mixed sex cercariae RNA), SmVAL2 (isolated from mixed sex miracidia RNA), SmVAL4 (isolated from mixed sex cercariae RNA), SmVAL6 (isolated from mixed sex, 7-week adult RNA), SmVAL7 (isolated from mixed sex, 7-week adult RNA) and SmVAL11 (isolated from mixed sex, 7-week adult RNA) were obtained by Rapid Amplification of cDNA Ends (RACE) and these amplicons all contained the confirmed ORFs as well as associated 5' and 3' untranslated regions (UTRs).

Complete ORFs for SmVAL14-28 were not physically obtained, although partial cDNAs for a subset (11 out of 15, Table 2) were identified during reverse-transcription PCR of schistosome cDNA to demonstrate they were indeed expressed. Subsequent bioinformatics analysis presented herein for SmVAL14-28 is based on the predicted full length ORF obtained from S. mansoni GeneDB (version 4).

Protein domain prediction using the SMART database [27] identified a single SCP/TAPS domain (SMART accession number: SM00198) in all but one SmVAL transcript, with SmVAL11 containing two. No other SMART predicted protein domains were identified in any of the SmVAL proteins. The SCP/TAPS domains ranged between 135 and 153 AA in length and typically comprised about 60% of the total amino-acid sequence of an individual SmVAL. SmVAL6 is an exception, as it contained a large extension (~295AA) C-terminal to the SCP/TAPS domain.

Prediction of hydrophobic signal peptide sequences using the SignalP program [28] identified twenty-one SmVALs with strong evidence for signal peptides (D-score > 0.43), two SmVALs (5 and 15) encoding possible signal peptides (D-score > 0.38 but < 0.43) and five SmVALs (6, 11, 13, 16 and 17) showing no evidence of putative signal peptides (D-score < 0.15) (Table 1 &2).

Comparative sequence and phylogenetic analysis of SmVALs

Alignment of the amino acid sequences encoded by SmVAL1-28 revealed significant similarity only over the putative SCP/TAPS domains (Fig. 1). Here, seven residues were found to be invariant among the SmVAL family members (shaded amino acids, Fig. 1). The SMART-derived consensus sequence (indicated on the top of the multiple sequence alignment, Fig. 1) for SCP/TAPS domains indicates five of these seven invariant residues are conserved across phyla (found in 80% or more of superfamily members) and the other two are moderately conserved (found in 50–79% or more of superfamily members). The amino acids (displayed in black boxes) present at the positions of the four residues proposed in structural studies to be the SCP/TAPS domain putative active site (e.g. His72, Glu77, Glu98 and His117 in Lycopersicon esculentum P14a [29]) are shown. These putative active site residues show low levels of conservation within the SmVAL family with only SmVAL6, 13, 16 and 17 containing all four residues. Sequence identity over the SmVAL SCP/TAPS domain averages around 34% identity and ranges from very similar (SmVAL1 is over 90% identical to SmVAL2) to weakly related (SmVAL6 has less than 20% sequence identity with SmVAL19).

Figure 1
figure 1

Alignment of the 29 SCP/TAPS domains present in the deduced amino-acid sequences of SmVAL1-28 demonstrates a clear segregation of the family into two distinct groups. Group 1 and group 2 members of the SmVAL superfamily are indicated to the right of the MUSCLE-generated alignment. The row labelled 'consensus' represents the SMART consensus sequence of the SCP/TAPS domain (SM00198) where conserved amino acids (50–79%) are indicated by lower-case and highly conserved residues (> 80%) by upper-case letters. The amino acid numbering (indicated at right of alignment) begins at the first residue of the SCP/TAPS domains. Invariant amino acids are highlighted in grey and the group 1 conserved cysteines are boxed and labelled C1 to C6. Amino acids found at the positions of the four residues comprising the putative SCP/TAPS domain active site are coloured in black with white text. The proposed conserved active site contains histidine residues at the first and fourth positions and glutamic acid residues at the second and third positions. The regions of insertion and deletion within the group 2 proteins are boxed and labelled I1 (insertion) and D1 (deletion).

The multiple sequence alignment also indicated that the SmVAL family could be divided into two groups based on features contained within the SCP/TAPS domain. The first feature is the conservation of one deletion (D1, Fig. 1) and one insertion (I1, Fig. 1) in SmVAL6, SmVAL11, SmVAL13, SmVAL16 and SmVAL17. The second feature is the absolute conservation of six cysteine residues (C1-C6, Fig. 1) in twenty-three SmVAL encoded proteins, which are absent from SmVAL6, SmVAL11, SmVAL13, SmVAL16 and SmVAL17. Together with signal peptide presence/absence, these data support the partitioning of SmVALs into 2 distinct sub-groups (Table 1 &2): group 1, which harbour a signal peptide, contain conserved cysteine residues and lack the D1 and I1 primary amino acid elements and group 2 (SmVAL6, SmVAL11, SmVAL13, SmVAL16 and SmVAL17), which lack a signal peptide, lack the 6 conserved cysteine residues but contain the I1 and D1 regions.

Phylogenetic analysis of SCP/TAPS proteins

To investigate the phylogenetic relationship within the SmVAL family, the predicted 29 complete SCP/TAPS domains were used to construct a phylogram using Bayesian analysis of a conserved (within all 29 SCP/TAPS domains) 84 amino acid region (split across six regions) identified by GBLOCKs [30] (Fig. 2A). The derived phylogram contained two main SmVAL clades (with 100% support) – one harbouring the six SCP/TAPS domains from the group 2 SmVALs and the other containing the 23 SCP/TAPS domains from the group 1 SmVALs (Fig. 2A). These two clades are also well-supported (99% bootstrap value) in the consensus phylogenetic tree constructed by the Neighbor-Joining algorithm (see Additional file 1). Phylogenetic analyses of this type have also been performed for SCP/TAPS domain- containing proteins within Drosophila melanogaster [7] and clearly demonstrate that distinct group 1/2 sub-divisions exist within SCP/TAPS family members of other species.

Figure 2
figure 2

Phylogenetic analysis of SmVALs and representative metazoan SCP/TAPS proteins reveal a clear and evolutionary distant split between group 1 and group 2 proteins. Phylogenetic trees were inferred by MrBayes 3.1.2 and illustrated by Treeview as described in Methods. Relative branch lengths are indicated, as are the Bayesian posterior probability support values that are greater than 0.5. A) A rooted phylogram is illustrated for the 29 SmVAL SCP/TAPS domain-containing proteins. B) An unrooted phylogram is illustrated for the representative metazoan SCP/TAPS domain-containing proteins. The non-S. mansoni protein sequences used have the following GenBank accession numbers: Vv Vesv5 [AAA30333], Dm AGR [AAB92563], Na ASP2 [AAP41952], Ce SCL1 [NP_502502], Ct Tex31 [CAD36507], Bm VAL1 [AAK12274], Ce CE09088 [NP_494312], Dm CG4270 [NP_608663], Hs GAPR1 [Q9H4G4], Ss NIE [AAD46493], Hh Helothermine [2122236A] and Hs CRISP1 [CAC34980]. Branch length represents the number of amino acid changes calculated by the WAG model and is indicated by the scale bars.

The presence of two distinct groups of SCP/TAPS domains within both S. mansoni and D. melanogaster led us to question whether this relationship was conserved across diversely related phyla. To study this split further, fourteen representative SCP/TAPS family members from species within Platyhelminthes, Nematoda, Arthropoda, Gastropoda and Chordata were used for extended phylogenetic analysis (Fig. 2B). The derived phylogram again contains two main clades (100% support), with the representative SmVAL group 1 member (SmVAL4) and the representative D. melanogaster group 1 member (DmAGR) found in one clade whereas the group 2 representatives of these species are clustered in the other clade (SmVAL6 and Dm CG4270). An additional larger analysis examining SCP/TAPS family members from 44 (out of 56 queried) eukaryotic genomes using different phylogenetic methods demonstrated the wide-scale presence and independent segregation of group 1 and group 2 proteins throughout the metazoan subkingdom (data not shown) and further supported this observation.

Homology modelling of SmVALs

To explore the tertiary structural characteristics within the SCP/TAPS domain of the SmVAL family, we created homology models of the sequence verified SmVAL1-13 members using MODELLER [31] (Fig. 3). Two templates, Necator americanus Na-ASP 2 [PDB:1u53] and L. esculentum p14a [PDB:1cfe] were used to model the group 1 SmVAL SCP/TAPS domains (represented in Fig. 3A by SmVAL1), and one template, Homo sapiens GAPR-1 [PDB:1smb] was used to model the group 2 SmVAL SCP/TAPS domain (represented in Fig. 3B by SmVAL13). Importantly, these derived models suggested that all SmVAL SCP/TAPS domains will adopt the α-β-α sandwich conformation common to all superfamily members across phyla. Secondly, these models demonstrated that the six conserved cysteine residues within the group 1 SmVALs are capable of forming three disulphide bonds (yellow residues, C1-C5, C2-C3, C4-C6, Fig. 3A). Thirdly, these models illustrated that the SmVAL group 2-specific insertion and deletion domains are both found in the loop regions between the α-β-α secondary structural elements (white regions, I1 and D1, Fig. 3). Finally, homology model surface analysis demonstrated that both SmVAL1 and SmVAL13 SCP/TAPS domains possess the large cleft proposed to contain the active site [29] (Fig. 3C and 3D). Within the cleft, the SmVAL1 model (Fig. 3C) contains three of the four putative active site residues (His84, Glu115, His131), with the first glutamic acid of the active site being substituted by a glutamine (Gln97). In the SmVAL13 model, all four proposed active site residues (His51, Glu59, Glu84 and His100) are present on the surface within the cleft (Fig. 3D). In both models, these four residues are found occupying a similar orientation to orthologous residues identified in previous crystallization studies of SCP/TAPS family members [12, 13].

Figure 3
figure 3

Homology modelling of representative group 1 and group 2 SmVAL proteins demonstrates the SCP/TAPS domain of both groups possess an α-β-α tertiary structure with a large surface cleft containing the proposed family active site. Models were constructed for all SmVAL SCP/TAPS domains using MODELLER as described in Methods and are represented here by SmVAL1 (group 1, panels A and C) and SmVAL13 (group 2, panels B and D). A cartoon representation of the core α-β-α sandwich structure is coloured in red for SmVAL1 (A) and blue for SmVAL13 (B). Conserved cysteines and the disulphide bonds are coloured yellow, with the cysteines involved in the formation of each disulphide bond labelled C1-C6. The two regions I1 (insertion) and D1 (deletion) are coloured white on each structure. Panel C (SmVAL1) and D (SmVAL13) are semi-transparent visualisations of the protein surface with the four putative active site residues shown in white.

SmVAL gene characteristics

SmVAL1-28 genomic sequences were obtained from the S. mansoni genome database [24] by comparing the laboratory derived cDNA sequences to the assembled genomic sequences (version 4). To verify the current assembly, complete genomic sequences for SmVAL1, SmVAL2, SmVAL3, SmVAL5 and SmVAL7 and partial genomic sequences for SmVAL4, SmVAL6 and SmVAL8-13 were obtained by PCR amplification of S. mansoni genomic DNA. When compared, these laboratory-derived sequences confirmed the assembled genomic sequences for each SmVAL gene from S. mansoni GeneDB. The gene sizes of SmVAL1-28 ranged from 1182 bp (SmVAL23) to ~60 kbp (SmVAL6) (Table 1 &2). The gene structures of the SmVAL family members were investigated by aligning the cDNA sequences with the gene sequences from the genomic database. With three exceptions, all exon/intron junctions conformed to the consensus (GT/AG) splice donor/acceptor sequences for eukaryotes. Exon 10 of SmVAL6 and exon 3 of SmVAL11 use a 5' GC splice donor, and in exon 38 of SmVAL6 the splice donor has been replaced by AC. Given that cDNA sequences of SmVAL6 and SmVAL11 are found to be spliced at these locations, these divergent splice sites appear to be functional.

As with the group 1/group 2 split in conserved cysteines, signal-peptide predictions and phylogenetic tree, the group 1 SmVALs differed from the group 2 SmVALs in gene structure (Fig. 4). All group 1 SmVAL genes possess five exons interrupted by four introns (Fig. 4A). The positioning and phase of each of the four introns in relation to the coding sequence of the genes was common to all (Fig. 4A). The group 2 SmVAL genes varied in the number of exons, but within the coding region for the SCP/TAPS domain, two intron positions and their phases were conserved throughout all members (Fig. 4B).

Figure 4
figure 4

SmVAL group 1 and group 2 gene structures are unique and likely evolved from a common ancestral gene within the genus. All SmVAL gene structures were manually assembled by comparison of laboratory derived cDNA sequences to assembled genomic sequences (SchistoGeneDB, version 4). A) Representative group 1 SmVAL gene structure and B) representative group 2 SmVAL gene structure over the SCP/TAPS encoding exons. Exons are represented by boxes with introns represented by lines. The phases of introns are indicated above the arrows. Exons are coloured to show regions encoding important protein features; predicted signal peptides (yellow), SCP/TAPS domain (red) and the remaining ORF (blue). The position of the six conserved cysteines in the group 1 SmVAL proteins is indicated. The single asterisk (*) indicates an intron conserved in all group 2 genes except SmVAL11.

Despite the conserved intron/exon structure (within the group 2 SmVALs) comprising the SCP/TAPS domain of SmVAL6, the intron/exon structure coding for the C-terminus of SmVAL6 is notable in its complexity. Unlike all other family members, the large and extended genomic DNA (gDNA) region 3' of the SCP/TAPS domain is composed of numerous small exons. In total, the gene encoding the SmVAL6 1302 bp transcript is comprised of 38 exons, with 17 smaller than 20 bp (Fig. 5).

Figure 5
figure 5

SmVAL6 genomic structure analysis reveals a highly complex gene composed of 38 exons. SmVAL6 exons are represented by boxes, with exon length (in base pairs) given above each box. Introns are represented by lines, with intron length (in base pairs) shown below each line. Exons are coloured to indicate: SCP/TAPS domain (red), untranslated region (white) and remaining ORF (blue). Each successive exon has been ascribed a number (underlined numbers below the exons). Exons that introduce a premature stop codon are highlighted with an asterisk and dashed lines represent separate exons created by competing 3' splice sites. The exon specific annealing positions of primers VAL6 var F and VAL6 var R, used for developmentally-regulated alternative splicing analysis, are indicated.

SmVAL genomic clusters

Version 4 of the S. mansoni genomic assembly predicted several SmVAL gene clusters to be spatially linked to distinct chromosomal regions (Fig. 6A). To experimentally verify the existence of these distinct SmVAL gene clusters, BAC clones spanning the genome assembly were sought for independent analysis. One BAC clone, Sm1-41J19, was subsequently identified and PCR analysis of its DNA sequence confirmed that the SmVAL2/SmVAL8/SmVAL12 gene cluster was indeed present (Fig. 6B). Subsequent fluorescence in situ hybridisation (FISH) localized Sm1-41J19 (containing SmVAL2/SmVAL8/SmVAL12) to the long arm of chromosome 6 and the short arm of chromosome W (Fig. 6C).

Figure 6
figure 6

SmVAL gene clusters exist throughout the genome with SmVAL2, SmVAL8 and SmVAL12 genetically linked to chromosome 6 and W. A) Genomic regions containing two or more SmVAL genes were identified from interrogation of the current Schisto GeneDB v4 assembly. Schisto GeneDB v4 scaffold ID for each region is shown. SmVAL genes are shown as labelled boxes with the direction of transcription indicated by a triangle at the stop codon. The genomic sequence between SmVAL genes is represented with a dashed line and the length in base pairs shown below. B) Genomic linkage of SmVAL2, 8 and 12 was established by PCR amplification of gene specific regions from BAC clone Sm1-41J19. SmAT1 (alpha tubulin, M80214) was not contained on the BAC clone and was only amplified from adult worm cDNA. C) FISH analysis indicates positive signal (arrowheads) for Sm1-41J19 on chromosome 6 and W (W in inset). Bar indicates 10 μm.

Developmental expression of SmVALs

Lifecycle expression profiles of the sequence-verified SmVAL family members (SmVAL1-13) were obtained by real-time RT-PCR using cDNA created from the mRNA of selected S. mansoni life-stages (Fig. 7). Here, distinct examples of normalised SmVAL gene expression throughout the lifecycle were revealed including those displaying either developmental or ubiquitous patterns. One developmentally regulated pattern of SmVAL transcription associated with invasion of the definitive host was found for genes encoding SmVAL1, SmVAL4 and SmVAL10. Specifically, SmVAL1 expression was minimal in all stages examined except mother sporocyst and cercaria, with expression levels estimated to be 15-fold higher in cercaria compared to mother sporocyst. SmVAL4 and SmVAL10 also displayed peak expression in stages associated with vertebrate invasion (cercaria and 3 hour schistosomule) although other life stages show minimal expression.

Figure 7
figure 7

SmVAL transcription throughout the schistosome life cycle includes both developmental and constitutive patterns. Total RNA from indicated life-stages was obtained as described in Methods and utilized for real time quantitative PCR analysis to determine SmVAL1-13 transcript abundance. For each SmVAL transcript, a bar graph is displayed indicating relative abundance (compared to SmAT1) across the S. mansoni lifecycle. On the x-axis, each specific life-stage cDNA being tested is indicated. The y-axis represents the ratio of SmVAL gene expression relative to that of SmAT1 (reference gene). Data are presented as mean ratios (+/- standard deviation) from technical duplicates.

The real time PCR results also revealed a second pattern of developmental regulation with SmVAL2, SmVAL3, SmVAL5 and SmVAL9 showing expression associated with the invasion of the intermediate host. Specifically, SmVAL2, SmVAL3, SmVAL5 and SmVAL9 transcription was restricted to egg, miracidia and sporocyst life stages, with the miracidial stage showing an expression peak for all four gene products. A third type of developmentally regulated transcriptional profile was observed for SmVAL8 and SmVAL12 mRNAs. Here, both genes were up-regulated during worm maturation with peak levels of transcription occurring in the 5-week worm. Finally, transcription of SmVAL6, SmVAL7, SmVAL11 and SmVAL13 appeared to be far less restricted, with measurable levels of mRNA found in most life stages examined. SmVAL6 and SmVAL7 were minimally transcribed in the egg, miracidial and mother sporocyst stages, dramatically up-regulated in the cercaria and intermediately expressed in all subsequent somule/worm samples. Ubiquitous (but variable) transcription for SmVAL11 and SmVAL13 are seen in all parasite samples with expression peaks observed in the miracidial (SmVAL11) and adult female worm (SmVAL13) life stages.

Developmentally regulated alternative splicing of SmVAL6

In the process of cloning SmVAL6, and likely due to its genomic complexity (Fig. 5), many different individual full-length ORFs were identified. Interestingly, all sequence variability was limited to the region C-terminal to the SCP/TAPS domain. We used primers (indicated in Fig. 5) designed to amplify this specific region, both to further examine the level of SmVAL6 variation and to compare the diversity of SmVAL6 products expressed in different developmental stages. Analysis of restriction digests from SmVAL6 PCR amplicons throughout the life cycle showed that the greatest qualitative differences in RFLP (restriction fragment length polymorphism) patterns existed between cercarial and 7-week adult worms (data not shown). Therefore, clones derived from cercarial and adult 7-week mixed-sex cDNA were subsequently isolated and sequenced. Analysis of the randomly-selected clones, 32 from 7-week adult worm samples and 35 from cercarial samples, showed that the SmVAL6 transcript is highly variable in both developmental stages (Fig. 8). From the 67 clones sequenced, 35 separate isoforms were observed (Fig. 8A). The variation observed was found to be due either to the absence/presence of exons or alternative 3' splicing within an exon with none of the single nucleotide polymorphisms detected shared by more than one clone. No isoform was predominant; the most abundant isoform accounting for only 7 clones (data not shown). To represent the different isoforms identified in this study, each exon found in the SmVAL6 gene was numbered (according to Fig. 5) and each isoform was scored for the presence or absence of each exon (Fig. 8A). Using the χ2 test, statistically significant developmental regulation of exon expression was found for two exons – 20 (p < 0.001) and 26 (p < 0.001) (Fig. 8B). Interestingly, these two exons were found specifically associated with SmVAL6 transcripts expressed by the adult 7-week life stage and encode very similar amino acid pentamers (KDDQY & KDEQY, respectively) (Fig 8B).

Figure 8
figure 8

SmVAL6 exhibits developmentally-regulated alternative splicing . Sixty-seven individual SmVAL6 cDNA clones (spanning exons 4–38) were isolated and sequenced from parasite material: 32 clones were derived from 7-week mixed-sex adult worm cDNA and 35 clones from mixed-sex cercarial cDNA (see Methods). A) Thirty-five distinct SmVAL6 isoforms were identified from the sixty-seven clones sequenced. Columns represent exon number (as described in Fig. 5) and rows indicate the 35 detected isoforms. Filled, grey boxes represent presence of exon, whereas empty, white boxes represent absence of exon in each of the 35 isoforms. The presence of an asterisk indicates an exon encoding a premature stop codon. The diagonal lines in the exon 35 column indicate that this exon has been detected in previous studies but was not observed in any of the transcripts in this experiment. B) Presence of exons 20 and 26 is associated with adult cDNA clones. Frequency of exon 20 and 26 usage in adult (black bars) and cercariae clones (grey bars) is shown with the χ2 test p values indicated. The amino acids encoded by each exon are shown below, with amino acids spanning splice acceptor/donator sites indicated in parentheses.


Critical to the development of novel schistosome intervention strategies is the identification of exploitable parasite gene products that functionally participate in important biological niches such as host invasion, chronic host/parasite interactions or immunomodulation. With the availability of genomic information to support the use of transcriptomic and proteomic tools [32], the discovery of exploitable schistosome gene products can now proceed at a previously unattainable rate. We have capitalised on this wealth of information to lead a study culminating in the molecular description of twenty-eight S. mansoni SCP/TAPS family members, designated SmVAL1-28. While the exact function of each SmVAL family member is currently unknown, the information provided in our study suggests potential roles in larval penetration, host immune response modulation and adult worm development.

One of the most striking observations in this study, mirroring a recent Drosophila melanogaster investigation [7], was the clear segregation of the SmVAL superfamily into two distinct phylogenetic groups (group1/group 2, Fig. 2A). Importantly, we now show this segregation was not solely limited to the Arthropoda and Platyhelminthes, as additional phylogenetic analyses demonstrated that the group 1/group 2 partition to be particularly well conserved and present in other phyla including Nematoda and Chordata (Fig. 2B). SmVAL amino acid features (e.g. signal peptides, cysteine residues, I1 and D1 regions; Table 1, Table 2, Fig. 1 and Fig. 3) also segregated well into the group 1/group 2 partition and further strengthened the phylogenetic predictions. Evidence to support that these group-specific amino acid features may impart distinct SmVAL tertiary structural characteristics originates from the solved crystal structures of both group 1 [8, 9, 1113, 33] and group 2 [10] members. Although similar α-β-α core SCP/TAPS domains can be formed, the domains found in the group 1 proteins are stabilised by disulphide bonds (mediated by the conserved cysteines), whereas the domains formed in group 2 proteins do not need these structural features. Disulfide bonds are frequently found in extracellular proteins, but only rarely in intracellular proteins. Therefore, the structural differences found in group 1 and group 2 proteins are in accordance with different localisations/fates for the group 1/group 2 SCP/TAPS domain containing proteins. This contention is supported by localization studies involving several SCP/TAPS domain-containing proteins. Golgi-associated PR-1 protein, GAPR-1 (which, like the group 2 SmVALs, lacks a signal peptide and conserved cysteines) demonstrated intracellular association with golgi membranes [34, 35] whereas Tex31, Allurin and Ac-ASP 2 (which, like the group 1 SmVALs, contain both signal peptides and conserved cysteines) are detected in extracellular locations [18, 36, 37]. Therefore, irrespective of the function(s), one potential advantage of encoding two distinct SCP/TAPS domains within the same organism is the ability to have one protein type specialised for the extracellular environment (with signal peptide for secretion and stabilising disulphide bonds) and another type specialised for functioning intracellularly. Indeed, diversification of this kind has recently been reported for the annelid globin gene family [38].

Comparison of the group 1 (SmVAL1) and group 2 (SmVAL13) SCP/TAPS domain homology models showed similar overall structures except for subtle differences in two of the loop regions (I1 and D1). Both proteins also contain the large cleft (Fig. 3C and 3D) which is thought to be the location of the SCP/TAPS domain active site [29]. Four of the amino acids (two histidines and two glutamic acids, positions indicated in Fig. 1 as black boxes) present within the cleft have been proposed to form an active site of undefined activity due to their location, surface accessibility and evolutionary conservation [29]. Interestingly, all group 2 SmVAL proteins except SmVAL11 contain all four residues (Fig. 1) within this surface cleft (e.g. SmVAL13, Fig. 3D) while none of the group 1 SmVAL proteins contain all four residues (Fig. 1) within this cleft (e.g. SmVAL1, Fig. 3C). This suggests that any potential activity (mediated by these four residues) across the family could be variable and therefore, diverse functions within the SmVALs would be substantial.

SmVAL group-specific differences are also observed in the expression of gene products during the parasite's life cycle (Fig. 7). The real-time PCR analysis of SmVAL1-13 showed that many group 1 genes displayed developmentally-regulated expression patterns while all examined group 2 genes demonstrated far less restricted transcription profiles. Group 1 members SmVAL2, SmVAL3, SmVAL5 and SmVAL9 all share a similar expression pattern with transcription peaking in miracidia, suggesting roles for these gene products in the free-living stage or in intermediate host invasion. The minimal presence of these transcripts in the egg stage may be due to our use of both immature & mature (miracidia-containing) egg material. However, recent proteomic studies of S. mansoni egg secretions have detected all four of these SmVALs, suggesting a potential secondary role in the egg [39]. The role of SmVAL2, SmVAL3, SmVAL5 and SmVAL9 secretion from unhatched eggs is currently being addressed as part of ongoing experiments into the protein secretion and localisation of SmVALs.

Unlike SmVAL2, SmVAL3, SmVAL5 and SmVAL9, the developmental expression profiles of group 1 members SmVAL1, SmVAL4, and SmVAL10 are linked to definitive host invasion – SmVAL1 to the cercarial stage; SmVAL4 and SmVAL10 to both cercariae and 3-hour schistosomula (Fig. 7). The SmVAL4 and SmVAL10 transcriptional results are corroborated by proteomic studies showing that these gene products (as well as SmVAL18) are indeed present in S. mansoni schistosomule secretions [40] further supporting a role in definitive host invasion. Overall, the presence of SmVAL2, SmVAL3, SmVAL4, SmVAL5, SmVAL9, SmVAL10 and SmVAL18 proteins in secreted samples supports the signal peptide predictions for these proteins and, on a more general level, suggests that many or all of the SmVAL group 1 proteins are released from the schistosome and capable of interacting with their immediate environment.

Release of group 1 SCP/TAPS proteins during parasitism may not be limited to the trematodes as Ancylostoma caninum (Ac)-ASP 1-6, Haemonchus contortus (Hc)-24 & 40, Ancylostoma ceylanicum (Ay)-NIF and Ac-HPI are also secreted during nematode infective processes [4144]. Given the biological activities of the hookworm proteins Ay-NIF (neutrophil inhibitory factor) and Ac-HPI (platelet inhibitory protein), it is tempting to speculate that some secreted group 1 SmVALs may have similar immunomodulatory roles acting, like NIF and HPI, via interactions with integrins. Other potential functions for group 1 SmVALs that may confer selective advantages during parasite invasion may relate to protease activity (Tex31 and RTVP-1) or protease inhibition (P25TI) [18, 45, 46]. Studies are ongoing to ascertain SmVAL group 1 specific functions.

While all secreted group 1 SmVALs may indeed participate in host/parasite interactions, there is also evidence suggesting that SmVAL1, SmVAL4, SmVAL10 and SmVAL18 (released into the definitive host) may be targets of the adaptive immune response. For example, the hymenopteran venom component Ves v5 is a specific SCP/TAPS allergen capable of activating basophils via an IgE dependent mechanism in sensitised patients [47, 48]. Furthermore, human IgE reactivity against the hookworm ASP-2 protein (component of the vertebrate infective L3 stage excretory/secretory products [21, 36]) is positively associated with light hookworm infections in heavily exposed endemic populations [49]. This data, as well as successful vaccine experiments in animal models [4952], has led to ASP-2 becoming a major human hookworm vaccine candidate [53]. Together, these observations suggest that secreted group 1 SmVALs (and possibly group 2 SmVALs [54]) may be specifically recognised by protective, anti-schistosome immune responses [55, 56] and imply that SmVAL1, SmVAL4, SmVAL10 and SmVAL18 proteins are potential S. mansoni vaccine candidates.

Although we report the identification of twenty-eight SCP/TAPS family members in this study, the complete number of SmVAL genes will be unknown until the final genome build is released. Preliminary evidence that one of the seven incomplete SCP/TAPS genes (smp_120670) is transcribed (data not shown) does suggest more SmVAL (with complete SCP/TAPS domains) genes will be identified as the genome assembly progresses. The completion of the genome will also help in solving questions relating to SmVAL gene duplication events. During the sequencing of both cDNA and gDNA, we have identified two different SmVAL8 transcripts, two different SmVAL7 genomic sequences and evidence for at least four variations of the SmVAL3/SmVAL23 family (data not shown). The FISH localisation of the SmVAL2/SmVAL8/SmVAL12 BAC clone to more than two chromosomal regions (Fig. 6C) also seems to provide evidence for duplications events, although recent reports of non-specific BAC clone hybridization to the short arm of chromosome W have been reported [57]. Additionally, within the twenty-eight SmVALs, there are several that display such striking sequence conservation that a recent gene duplication event is the most likely explanation. For example, genes encoding SmVAL26 and SmVAL27 (both transcriptionally confirmed, Table 2) are 90% identical over encoded amino acid sequences (182AAs) and 93% identical over complete genomic sequences. This high level of identity over both exons and introns, suggests such genes have been through recent gene duplication events and supports the contention that the SmVAL family was expanded from two ancestral genes containing group-specific intron/exon structures (Fig. 4).

In the absence of biological functions for the SmVAL family, it is currently unknown why gene duplications have persisted. However, arguments can be made for all three of the main evolutionary conditions for copy retention – beneficial increased production, subfunctionalisation or neofunctionalisation [58]. The extra amount of an SmVAL gene product gained by two gene copies could be advantageous despite functional redundancy between the two proteins. This has been suggested for gene copies of S. mansoni cercarial elastases, which all appear to encode proteins of the same substrate specificity [59]. Subfunctionalisation describes a scenario where a protein encoded by the ancestral gene either has two biological functions or is expressed in two separate locations (physically or temporally). After the duplication event, each copy can then specialise in a unique function or in a distinct location. Neofunctionalisation describes a situation where one gene retains the original function while the other evolves a new function. Both subfunctionalisation and neofunctionalisation could explain at least one SmVAL gene duplication event, that of the ancestral SmVAL1/SmVAL2 gene. SmVAL1 and SmVAL2 are very similar genes (85% genomic identity) but have radically different expression profiles (Fig. 7) with SmVAL1 associated with vertebrate invasion and SmVAL2 associated with invertebrate invasion. One hypothesis is that the ancestral SmVAL1/SmVAL2 gene was expressed both in the cercaria and the miracidia, and carried out the same function during both invasive processes. After the duplication event SmVAL1 expression became restricted to cercaria and SmVAL2 became specialised for the miracidial function. Alternatively, the SmVAL1/SmVAL2 ancestral gene may have been expressed in only one invasive stage and gene duplication led to one gene gaining a new function (though it could possess a very similar biological activity) in the other invasive stage. Further work will be needed to dissect the exact mechanism responsible for SmVAL genomic expansion.

Previous results utilizing DNA microarrays demonstrated SmVAL6 and SmVAL7 to be male-associated in the adult worm [6]. The quantitative PCR assays reported here confirmed this finding with transcript levels 36-fold (SmVAL6) and 3-fold higher (SmVAL7) in adult males compared to adult females. Quantitative PCR also showed an adult female-biased expression for SmVAL13 mRNA (7-fold over adult male worms). Interestingly, several SCP/TAPS proteins in other species are also gender-associated. These include the mammalian CRISP-1 and CRISP-2 proteins, which are localised to the developing sperm and the Xenopus Allurin protein, which is associated with the egg [37, 60]. Semi-quantitative reverse transcription PCR of the Drosophila SCP/TAPS family members also showed that the majority of the transcripts to be gender-associated with 18/26 showing male-associated expression and only one exhibiting female-associated expression [7]. Future experiments identifying SmVAL protein localisation within the adult worms will elucidate whether the connections between SCP/TAPS proteins with reproductive organs exist in the schistosome or the gender-associated transcription is due to other developmental differences.

A final observation of this study was the high level of developmentally-regulated alternative splicing found within the region encoding the long C-terminus of the group 2 member SmVAL6 (Fig. 8). The complex gene structure encoding the SmVAL6 C-terminus (Fig. 5) allows for alternative splicing of many small exons each in the same phase, avoiding frameshifts. Though the relevance of both the long C-terminus and the diversity observed within it is unknown, the developmental regulation of exon 20 and 26 suggests that the amino acid composition within the C-terminal region may be important for SmVAL6 function. Developmentally regulated alternative splicing occurs for a few other S. mansoni genes such as CA150 and SmHSF [61, 62], although these examples result in the introduction of premature stops changing any resulting protein radically, unlike the small changes of exon 20 and 26 within SmVAL6. One potential biological difference between cercarial and adult worm SmVAL6 transcripts is in phosphorylation. Exon 20 and 26 (statistically differentially expressed in the adult, Fig. 7) encode highly similar amino acid pentamers (KDDQY and KDEQY respectively) that are predicted to be phosphorylated at the tyrosine residue (data not shown). If these are functional phosphorylation sites, many SmVAL6-associated biological processes (protein-protein interactions, protein location within the cell, increased protein degradation and signalling) could be differentially affected in the cercarial versus the adult life stage. Interestingly, group 2 GAPR-1 can be differentially phosphorylated and this phenomenon has been associated with different cellular locations [63]. Further work to ascertain the relevance of SmVAL6 developmentally-regulated splicing, as it applies to function, is ongoing.


In this study, we identified twenty-eight S. mansoni genes encoding proteins containing SCP/TAPS domains, named SmVAL1-28. Examination of protein features and gene structures has revealed two distinct groups within the SmVAL gene family and phylogenetic analysis demonstrated both group 1 and group 2 type SCP/TAPS proteins to be present across the metazoan sub-kingdom. We also obtained developmental expression profiles of SmVAL1-13, which suggested that several of the proteins are involved in host invasion. Furthermore, we have discovered developmentally regulated alternative splicing within the 3' region of SmVAL6. Further work is aimed at functionally characterising this interesting schistosome protein family in terms of host/parasite and parasite/parasite interactions.


Parasite Materials

A Puerto Rican isolate of Schistosoma mansoni is maintained by passage through Biomphalaria glabrata snails and Tuck Ordinary (T.O.) mice (Harlan). The fifteen lifestages (mixed sex unless stated) providing RNA for Real-time quantitative developmental expression analysis were: Egg; Miracidia; Mother sporocyst; Cercariae; 3-hour schistosomule; 18-hour schistosomule; 3-day schistosomule; 6-day schistosomule; 3-week worm; 4-week worm; 5-week worm; 6-week worm; 7-week worm; 7-week male, and 7-week female.

Eggs were recovered from mouse livers infected 7 weeks prior with cercariae as previously described [64]. To isolate miracidia, the eggs were then hatched in non-chlorinated water and miracidia were separated phototrophically. Mother sporocyst RNA was provided by Dr. Tim Yoshino, of the University of Wisconsin [65]. Cercariae were shed from snails phototrophically, with schistosomula prepared by mechanical transformation of cercariae as previously described [66]. Schistosomula were cultured at 37°C in Dulbecco's Modified Eagle's medium (DMEM) (Sigma, UK) supplemented with 10% FCS and 100 ng/ml penicillin-streptomycin for the time periods indicated in an atmosphere of 5% CO2 [66]. Adult worm lifestages (3 wk-7 wk) were obtained by perfusion at the stated week from percutaneously infected T.O. mice after challenge with 250 cercariae. After perfusion, worms were counted and, in the case of 7 wk adults, separated on the basis of gender.

Total RNA isolation and cDNA synthesis

Total RNA was isolated using a modified TRIZOL (Invitrogen)/RNeasy (Qiagen) procedure [67] and subsequently treated with DNase I (Ambion Inc.) to remove contaminating genomic DNA. RNA quality from each sample was assessed by formaldehyde denaturing gel electrophoresis and quantity determined by spectrophotometry at 260 nm.

Total RNA from the fifteen lifecycle stages were used as templates to synthesise cDNA by reverse transcription as described previously [68]. The efficiency and quality of the cDNA obtained was tested using an alpha-tubulin (SmAT1) specific primer-pair [69] to amplify SmAT1 cDNA by polymerase chain reaction (PCR).

cDNA and gene sequence determination

Using S. mansoni Gene DB [24], the complete open reading frames (ORFs) of 28 SmVAL genes were predicted (SmVAL1-28). The sequences of SmVAL1-13 ORFs were confirmed by PCR amplification from parasite cDNA using proof-reading enzymes (HiFi Platinium Taq (Invitrogen) or Phusion (GRI)), cloned into pCR4-TOPO vector (Invitrogen) and sequenced. In addition, Rapid Amplification of cDNA ends (RACE) PCR was performed for selected SmVALs using RNA from parasite lifestages processed into cDNA according to the manufacturer's protocol (GeneRacer kit, Invitrogen). The ORF sequences for SmVAL1-13 have been submitted to Genbank, accession numbers are shown in Table 1. Additional S. mansoni Gene DB interrogation using the Pfam [26] HMM model for SCP/TAPS domain (PF00188) resulted in the discovery of a further 15 genes (SmVAL14-28) predicted to encode full SCP/TAPS domains and 7 predicted to encode incomplete SCP/TAPS domains (Table 2). PCR amplification from parasite cDNA and sequencing of the resulting fragments confirmed transcription for 11 of the 15 gene products containing full SCP/TAPS domains.

The genomic sequences of all SmVAL genes (start to stop codons, including nucleotide discrepancies 'N', where each 'N' was treated as an individual nucleotide) were derived from analysis of S. mansoni Gene DB version 4 with regions of genomic sequences for SmVAL 1-13 confirmed by PCR amplification. The genomic DNA used for PCR amplification of SmVAL1-13 was purified from S. mansoni adult worm and cercarial material according to the protocol provided with the DNeasy Tissue kit (Qiagen). Sequencing of the SmVAL cDNAs or gene fragments was performed at the Department of Genetics, University of Cambridge in both orientations using Big Dye v3.1 fluorescent chemistry and an Applied Biosystems 3100 Genetic Analyser.

Bioinformatic Analysis

Domain prediction was performed by the SMART sequence analysis software [27] and was used, with manual alignment comparison, to define the limits of the SCP/TAPS domain for each sequence.

The prediction of signal peptides was performed using the software SignalP 3.0 [28] and presence/absence of signal peptides was defined by the default Neural Network D-score threshold of 0.43.

Alignment and Phylogenetic trees

Alignment of the twenty-nine SCP/TAPS domains from the 28 SmVALs was created using MUSCLE software [70]. For comparison of sequence conservation within S. mansoni to a wide range of SCP/TAPS domain containing proteins, a consensus sequence was generated using the family alignment for the SCP/TAPS domain (SM00198) found in the SMART database [27].

For the phylogenetic analysis of the SmVAL family and other metazoan SCP/TAPS family members, SCP/TAPS domain regions were aligned using MUSCLE software and interrogated to determine regions of conservation using GBLOCKs 0.91b software [30]. For the SmVAL phylogenetic analysis six such regions covering a total of 84 amino acids were chosen, whereas the metazoan analysis used four regions containing a total of 54 amino acids. The phylogenetic analyses were performed as described previously [69] using MrBayes (version 3.1.2 [71]) software and the WAG protein substitution model [72]. The consensus phylograms were visualised using the TreeView 1.6.6 program [73]. Utilizing the same alignment and conserved regions, phylogenetic trees were also constructed using the Neighbor-Joining method and the Poisson correction model, accompanied by bootstrap analysis (1000 replications). The trees were produced by MEGA v4.0 [74].

Homology modelling

The Na-ASP 2 (pdb: 1u53[13]) and p14a (pdb: 1cfe[8]) structures were used as templates to build models for group 1 SmVAL proteins. The GAPR-1 structure (pdb: 1smb,[10]) was used as the template for the creation of group 2 SmVAL protein models. An alignment of the protein sequences was initially generated by MUSCLE software and refined manually. Models were created using MODELLER software [31] and the validity of the models chosen were tested using WHATCHECK and PROSAII [75, 76]. Models were visualized using MacPyMOL (DeLano Scientific LLC).


S. mansoni Gene DB version 1 was used to identify BAC clones (obtained from the Sm1 library [77]) containing SmVAL gene clusters. The presence of SmVAL2, SmVAL8 and SmVAL12 on Sm1-41J19 was confirmed by PCR amplification, agarose gel electrophoresis and automated DNA sequencing. Fluorescence in-situ hybridisation (FISH) was used to identify the chromosomal localisation of BAC clone Sm1-41J19, using a biotinylated probe made from the BAC clone. The probe was hybridised to S. mansoni chromosomes using established procedures [78].

Real-time RT-PCR analysis

The transcript abundance of each sequence-verified SmVAL (SmVAL1-13) mRNA was quantified relative to SmAT1 (alpha tubulin) in fifteen stages of the S. mansoni lifecycle using real-time RT-PCR analysis. The cDNA from 15 different life stages was assayed by real-time PCR in duplicate reactions using gene-specific primers. To confirm the absence of non-specific amplification in PCR reactions, control reactions lacking template were run in duplicate for each primer set. Reactions were performed on a MiniOpticon real-time PCR thermal cycler system (Bio-Rad) using iQ SYBR Green Master Mix (Bio-Rad) according to the manufacturer's instructions. The amplification efficiency (E) of each primer set was determined during assay development by plotting the cycle thresholds (Ct) from serial dilutions of a suitable cDNA sample and inputting the resulting slope in the equation, E = 10(-1/slope). For each real-time PCR reaction the following equation was used to calculate a normalised SmVAL expression ratio:

ratio = (ESmAt1)CtSmAT1/(ESmVALx)CtSmVALx

where ESmAt1 is the amplification efficiency of the reference gene (alpha tubulin), ESmVALx is the amplification efficiency of the target gene (SmVAL), CtSmVALx is the cycle threshold of the target gene and CtSmAT1 is the cycle threshold of the reference gene from the same cDNA sample [79].

SmVAL6 splicing study

A primer pair (VAL6 var F/VAL6 var R) designed to amplify the region containing all identified variability discovered in previous RACE and ORF sequencing of SmVAL6 transcripts was used to examine SmVAL6 sequence variation (see Fig. 5 for primer locations). PCR reactions using this primer pair with either cercarial or 7 wk mixed sex worm cDNA were carried out using Platinum High Fidelity Taq polymerase (Invitrogen). The PCR products were ligated into the pCR2.1 vector (Invitrogen). Thirty-two 7-week adult clones and thirty-five cercarial clones were picked at random and fully sequenced. The sequences were then aligned to ascertain the exon usage of each clone. Student's χ2 test was used to determine any significant associations between exon usage and developmental life stage.


  1. de Jesus AR, Magalhaes A, Miranda DG, Miranda RG, Araujo MI, de Jesus AA, Silva A, Santana LB, Pearce E, Carvalho EM: Association of type 2 cytokines with hepatic fibrosis in human Schistosoma mansoni infection. Infect Immun. 2004, 72 (6): 3391-3397. 10.1128/IAI.72.6.3391-3397.2004.

    Article  PubMed  Google Scholar 

  2. King CH, Dickman K, Tisch DJ: Reassessment of the cost of chronic helmintic infection: a meta-analysis of disability-related outcomes in endemic schistosomiasis. Lancet. 2005, 365 (9470): 1561-1569. 10.1016/S0140-6736(05)66457-4.

    Article  PubMed  Google Scholar 

  3. Verjovski-Almeida S, DeMarco R, Martins EA, Guimaraes PE, Ojopi EP, Paquola AC, Piazza JP, Nishiyama MY, Kitajima JP, Adamson RE, Ashton PD, Bonaldo MF, Coulson PS, Dillon GP, Farias LP, Gregorio SP, Ho PL, Leite RA, Malaquias LC, Marques RC, Miyasato PA, Nascimento AL, Ohlweiler FP, Reis EM, Ribeiro MA, Sa RG, Stukart GC, Soares MB, Gargioni C, Kawano T, Rodrigues V, Madeira AM, Wilson RA, Menck CF, Setubal JC, Leite LC, Dias-Neto E: Transcriptome analysis of the acoelomate human parasite Schistosoma mansoni. Nat Genet. 2003, 35 (2): 148-157. 10.1038/ng1237.

    Article  PubMed  Google Scholar 

  4. LoVerde PT, Hirai H, Merrick JM, Lee NH, El-Sayed N: Schistosoma mansoni genome project: an update. Parasitol Int. 2004, 53 (2): 183-192. 10.1016/j.parint.2004.01.009.

    Article  PubMed  Google Scholar 

  5. Fitzpatrick JM, Hoffmann KF: Dioecious Schistosoma mansoni express divergent gene repertoires regulated by pairing. Int J Parasitol. 2006, 36 (10-11): 1081-1089. 10.1016/j.ijpara.2006.06.007.

    Article  PubMed  Google Scholar 

  6. Fitzpatrick JM, Johnston DA, Williams GW, Williams DJ, Freeman TC, Dunne DW, Hoffmann KF: An oligonucleotide microarray for transcriptome analysis of Schistosoma mansoni and its application/use to investigate gender-associated gene expression. Mol Biochem Parasitol. 2005, 141 (1): 1-13. 10.1016/j.molbiopara.2005.01.007.

    Article  PubMed  Google Scholar 

  7. Kovalick GE, Griffin DL: Characterization of the SCP/TAPS gene family in Drosophila melanogaster. Insect Biochem Mol Biol. 2005, 35 (8): 825-835. 10.1016/j.ibmb.2005.03.003.

    Article  PubMed  Google Scholar 

  8. Fernandez C, Szyperski T, Bruyere T, Ramage P, Mosinger E, Wuthrich K: NMR solution structure of the pathogenesis-related protein P14a. J Mol Biol. 1997, 266 (3): 576-593. 10.1006/jmbi.1996.0772.

    Article  PubMed  Google Scholar 

  9. Henriksen A, King TP, Mirza O, Monsalve RI, Meno K, Ipsen H, Larsen JN, Gajhede M, Spangfort MD: Major venom allergen of yellow jackets, Ves v 5: structural characterization of a pathogenesis-related protein superfamily. Proteins. 2001, 45 (4): 438-448. 10.1002/prot.1160.

    Article  PubMed  Google Scholar 

  10. Groves MR, Kuhn A, Hendricks A, Radke S, Serrano RL, Helms JB, Sinning I: Crystallization of a Golgi-associated PR-1-related protein (GAPR-1) that localizes to lipid-enriched microdomains. Acta Crystallogr D Biol Crystallogr. 2004, 60 (Pt 4): 730-732. 10.1107/S0907444904002136.

    Article  PubMed  Google Scholar 

  11. Asojo OA, Loukas A, Inan M, Barent R, Huang J, Plantz B, Swanson A, Gouthro M, Meagher MM, Hotez PJ: Crystallization and preliminary X-ray analysis of Na-ASP-1, a multi-domain pathogenesis-related-1 protein from the human hookworm parasite Necator americanus. Acta Crystallograph Sect F Struct Biol Cryst Commun. 2005, 61 (Pt 4): 391-394. 10.1107/S1744309105007748.

    Article  Google Scholar 

  12. Shikamoto Y, Suto K, Yamazaki Y, Morita T, Mizuno H: Crystal structure of a CRISP family Ca2+ -channel blocker derived from snake venom. J Mol Biol. 2005, 350 (4): 735-743. 10.1016/j.jmb.2005.05.020.

    Article  PubMed  Google Scholar 

  13. Asojo OA, Goud G, Dhar K, Loukas A, Zhan B, Deumic V, Liu S, Borgstahl GE, Hotez PJ: X-ray structure of Na-ASP-2, a pathogenesis-related-1 protein from the nematode parasite, Necator americanus, and a vaccine antigen for human hookworm infection. J Mol Biol. 2005, 346 (3): 801-814. 10.1016/j.jmb.2004.12.023.

    Article  PubMed  Google Scholar 

  14. Kjeldsen L, Cowland JB, Johnsen AH, Borregaard N: SGP28, a novel matrix glycoprotein in specific granules of human neutrophils with similarity to a human testis-specific gene product and a rodent sperm-coating glycoprotein. FEBS Lett. 1996, 380 (3): 246-250. 10.1016/0014-5793(96)00030-0.

    Article  PubMed  Google Scholar 

  15. Alexander D, Goodman RM, Gut-Rella M, Glascock C, Weymann K, Friedrich L, Maddox D, Ahl-Goy P, Luntz T, Ward E, et al: Increased tolerance to two oomycete pathogens in transgenic tobacco expressing pathogenesis-related protein 1a. Proc Natl Acad Sci U S A. 1993, 90 (15): 7327-7331. 10.1073/pnas.90.15.7327.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Murphy EV, Zhang Y, Zhu W, Biggs J: The human glioma pathogenesis-related protein is structurally related to plant pathogenesis-related proteins and its gene is expressed specifically in brain tumors. Gene. 1995, 159 (1): 131-135. 10.1016/0378-1119(95)00061-A.

    Article  PubMed  Google Scholar 

  17. Maeda T, Sakashita M, Ohba Y, Nakanishi Y: Molecular cloning of the rat Tpx-1 responsible for the interaction between spermatogenic and Sertoli cells. Biochem Biophys Res Commun. 1998, 248 (1): 140-146. 10.1006/bbrc.1998.8918.

    Article  PubMed  Google Scholar 

  18. Milne TJ, Abbenante G, Tyndall JD, Halliday J, Lewis RJ: Isolation and characterization of a cone snail protease with homology to CRISP proteins of the pathogenesis-related protein superfamily. J Biol Chem. 2003, 278 (33): 31105-31110. 10.1074/jbc.M304843200.

    Article  PubMed  Google Scholar 

  19. Yamazaki Y, Morita T: Structure and function of snake venom cysteine-rich secretory proteins. Toxicon. 2004, 44 (3): 227-231. 10.1016/j.toxicon.2004.05.023.

    Article  PubMed  Google Scholar 

  20. Lu G, Villalba M, Coscia MR, Hoffman DR, King TP: Sequence analysis and antigenic cross-reactivity of a venom allergen, antigen 5, from hornets, wasps, and yellow jackets. J Immunol. 1993, 150 (7): 2823-2830.

    PubMed  Google Scholar 

  21. Hawdon JM, Jones BF, Hoffman DR, Hotez PJ: Cloning and characterization of Ancylostoma-secreted protein. A novel protein associated with the transition to parasitism by infective hookworm larvae. J Biol Chem. 1996, 271 (12): 6672-6678. 10.1074/jbc.271.12.6672.

    Article  PubMed  Google Scholar 

  22. Murray J, Gregory WF, Gomez-Escobar N, Atmadja AK, Maizels RM: Expression and immune recognition of Brugia malayi VAL-1, a homologue of vespid venom allergens and Ancylostoma secreted proteins. Mol Biochem Parasitol. 2001, 118 (1): 89-96. 10.1016/S0166-6851(01)00374-7.

    Article  PubMed  Google Scholar 

  23. Tawe W, Pearlman E, Unnasch TR, Lustigman S: Angiogenic activity of Onchocerca volvulus recombinant proteins similar to vespid venom antigen 5. Mol Biochem Parasitol. 2000, 109 (2): 91-99. 10.1016/S0166-6851(00)00231-0.

    Article  PubMed  Google Scholar 

  24. Schistosoma mansoni GeneDB. []

  25. Cawley SE, Wirth AI, Speed TP: Phat--a gene finding program for Plasmodium falciparum. Mol Biochem Parasitol. 2001, 118 (2): 167-174. 10.1016/S0166-6851(01)00363-2.

    Article  PubMed  Google Scholar 

  26. Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, Eddy SR, Sonnhammer EL, Bateman A: Pfam: clans, web tools and services. Nucleic Acids Res. 2006, 34 (Database issue): D247-51. 10.1093/nar/gkj149.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A. 1998, 95 (11): 5857-5864. 10.1073/pnas.95.11.5857.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Bendtsen JD, Nielsen H, von Heijne G, Brunak S: Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004, 340 (4): 783-795. 10.1016/j.jmb.2004.05.028.

    Article  PubMed  Google Scholar 

  29. Szyperski T, Fernandez C, Mumenthaler C, Wuthrich K: Structure comparison of human glioma pathogenesis-related protein GliPR and the plant pathogenesis-related protein P14a indicates a functional link between the human immune system and a plant defense system. Proc Natl Acad Sci U S A. 1998, 95 (5): 2262-2266. 10.1073/pnas.95.5.2262.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17 (4): 540-552.

    Article  PubMed  Google Scholar 

  31. Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, Sali A: Comparative protein structure modeling of genes and genomes. Annu Rev Biophys Biomol Struct. 2000, 29: 291-325. 10.1146/annurev.biophys.29.1.291.

    Article  PubMed  Google Scholar 

  32. Hokke CH, Fitzpatrick JM, Hoffmann KF: Integrating transcriptome, proteome and glycome analyses of Schistosoma biology. Trends Parasitol. 2007, 23 (4): 165-174. 10.1016/

    Article  PubMed  Google Scholar 

  33. Wang F, Li H, Liu MN, Song H, Han HM, Wang QL, Yin CC, Zhou YC, Qi Z, Shu YY, Lin ZJ, Jiang T: Structural and functional analysis of natrin, a venom protein that targets various ion channels. Biochem Biophys Res Commun. 2006, 351 (2): 443-448. 10.1016/j.bbrc.2006.10.067.

    Article  PubMed  Google Scholar 

  34. Eberle HB, Serrano RL, Fullekrug J, Schlosser A, Lehmann WD, Lottspeich F, Kaloyanova D, Wieland FT, Helms JB: Identification and characterization of a novel human plant pathogenesis-related protein that localizes to lipid-enriched microdomains in the Golgi complex. J Cell Sci. 2002, 115 (Pt 4): 827-838.

    PubMed  Google Scholar 

  35. Eisenberg I, Barash M, Kahan T, Mitrani-Rosenbaum S: Cloning and characterization of a human novel gene C9orf19 encoding a conserved putative protein with an SCP-like extracellular protein domain. Gene. 2002, 293 (1-2): 141-148. 10.1016/S0378-1119(02)00703-5.

    Article  PubMed  Google Scholar 

  36. Hawdon JM, Narasimhan S, Hotez PJ: Ancylostoma secreted protein 2: cloning and characterization of a second member of a family of nematode secreted proteins from Ancylostoma caninum. Mol Biochem Parasitol. 1999, 99 (2): 149-165. 10.1016/S0166-6851(99)00011-0.

    Article  PubMed  Google Scholar 

  37. Olson JH, Xiang X, Ziegert T, Kittelson A, Rawls A, Bieber AL, Chandler DE: Allurin, a 21-kDa sperm chemoattractant from Xenopus egg jelly, is related to mammalian sperm-binding proteins. Proc Natl Acad Sci U S A. 2001, 98 (20): 11205-11210. 10.1073/pnas.211316798.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Bailly X, Chabasse C, Hourdez S, Dewilde S, Martial S, Moens L, Zal F: Globin gene family evolution and functional diversification in annelids. Febs J. 2007, 274 (10): 2641-2652. 10.1111/j.1742-4658.2007.05799.x.

    Article  PubMed  Google Scholar 

  39. Cass CL, Johnson JR, Califf LL, Xu T, Hernandez HJ, Stadecker MJ, Yates JR, Williams DL: Proteomic analysis of Schistosoma mansoni egg secretions. Mol Biochem Parasitol. 2007

    Google Scholar 

  40. Curwen RS, Ashton PD, Sundaralingam S, Wilson RA: Identification of novel proteases and immunomodulators in the secretions of schistosome cercariae that facilitate host entry. Mol Cell Proteomics. 2006, 5 (5): 835-844. 10.1074/mcp.M500313-MCP200.

    Article  PubMed  Google Scholar 

  41. Zhan B, Liu Y, Badamchian M, Williamson A, Feng J, Loukas A, Hawdon JM, Hotez PJ: Molecular characterisation of the Ancylostoma-secreted protein family from the adult stage of Ancylostoma caninum. Int J Parasitol. 2003, 33 (9): 897-907. 10.1016/S0020-7519(03)00111-5.

    Article  PubMed  Google Scholar 

  42. Yatsuda AP, Krijgsveld J, Cornelissen AW, Heck AJ, de Vries E: Comprehensive analysis of the secreted proteins of the parasite Haemonchus contortus reveals extensive sequence variation and differential immune recognition. J Biol Chem. 2003, 278 (19): 16941-16951. 10.1074/jbc.M212453200.

    Article  PubMed  Google Scholar 

  43. Ali F, Brown A, Stanssens P, Timothy LM, Soule HR, Pritchard DI: Vaccination with neutrophil inhibitory factor reduces the fecundity of the hookworm Ancylostoma ceylanicum. Parasite Immunol. 2001, 23 (5): 237-249. 10.1046/j.1365-3024.2001.00383.x.

    Article  PubMed  Google Scholar 

  44. Del Valle A, Jones BF, Harrison LM, Chadderdon RC, Cappello M: Isolation and molecular cloning of a secreted hookworm platelet inhibitor from adult Ancylostoma caninum. Mol Biochem Parasitol. 2003, 129 (2): 167-177. 10.1016/S0166-6851(03)00121-X.

    Article  PubMed  Google Scholar 

  45. Rosenzweig T, Ziv-Av A, Xiang C, Lu W, Cazacu S, Taler D, Miller CG, Reich R, Shoshan Y, Anikster Y, Kazimirsky G, Sarid R, Brodie C: Related to testes-specific, vespid, and pathogenesis protein-1 (RTVP-1) is overexpressed in gliomas and regulates the growth, survival, and invasion of glioma cells. Cancer Res. 2006, 66 (8): 4139-4148. 10.1158/0008-5472.CAN-05-2851.

    Article  PubMed  Google Scholar 

  46. Yamakawa T, Miyata S, Ogawa N, Koshikawa N, Yasumitsu H, Kanamori T, Miyazaki K: cDNA cloning of a novel trypsin inhibitor with similarity to pathogenesis-related proteins, and its frequent expression in human brain cancer cells. Biochim Biophys Acta. 1998, 1395 (2): 202-208.

    Article  PubMed  Google Scholar 

  47. Binder M, Fierlbeck G, King T, Valent P, Buhring HJ: Individual hymenoptera venom compounds induce upregulation of the basophil activation marker ectonucleotide pyrophosphatase/phosphodiesterase 3 (CD203c) in sensitized patients. Int Arch Allergy Immunol. 2002, 129 (2): 160-168. 10.1159/000065875.

    Article  PubMed  Google Scholar 

  48. Hauswirth AW, Natter S, Ghannadan M, Majlesi Y, Schernthaner GH, Sperr WR, Buhring HJ, Valenta R, Valent P: Recombinant allergens promote expression of CD203c on basophils in sensitized individuals. J Allergy Clin Immunol. 2002, 110 (1): 102-109. 10.1067/mai.2002.125257.

    Article  PubMed  Google Scholar 

  49. Bethony J, Loukas A, Smout M, Brooker S, Mendez S, Plieskatt J, Goud G, Bottazzi ME, Zhan B, Wang Y, Williamson A, Lustigman S, Correa-Oliveira R, Xiao S, Hotez PJ: Antibodies against a secreted protein from hookworm larvae reduce the intensity of hookworm infection in humans and vaccinated laboratory animals. Faseb J. 2005, 19 (12): 1743-1745.

    PubMed  Google Scholar 

  50. Goud GN, Zhan B, Ghosh K, Loukas A, Hawdon J, Dobardzic A, Deumic V, Liu S, Dobardzic R, Zook BC, Jin Q, Liu Y, Hoffman L, Chung-Debose S, Patel R, Mendez S, Hotez PJ: Cloning, yeast expression, isolation, and vaccine testing of recombinant Ancylostoma-secreted protein (ASP)-1 and ASP-2 from Ancylostoma ceylanicum. J Infect Dis. 2004, 189 (5): 919-929. 10.1086/381901.

    Article  PubMed  Google Scholar 

  51. Sen L, Ghosh K, Bin Z, Qiang S, Thompson MG, Hawdon JM, Koski RA, Shuhua X, Hotez PJ: Hookworm burden reductions in BALB/c mice vaccinated with recombinant Ancylostoma secreted proteins (ASPs) from Ancylostoma duodenale, Ancylostoma caninum and Necator americanus. Vaccine. 2000, 18 (11-12): 1096-1102. 10.1016/S0264-410X(99)00371-0.

    Article  PubMed  Google Scholar 

  52. Ghosh K, Hawdon J, Hotez P: Vaccination with alum-precipitated recombinant Ancylostoma-secreted protein 1 protects mice against challenge infections with infective hookworm (Ancylostoma caninum) larvae. J Infect Dis. 1996, 174 (6): 1380-1383.

    Article  PubMed  Google Scholar 

  53. Loukas A, Bethony J, Brooker S, Hotez P: Hookworm vaccines: past, present, and future. Lancet Infect Dis. 2006, 6 (11): 733-741. 10.1016/S1473-3099(06)70630-2.

    Article  PubMed  Google Scholar 

  54. Ravi V, Ramachandran S, Thompson RW, Andersen JF, Neva FA: Characterization of a recombinant immunodiagnostic antigen (NIE) from Strongyloides stercoralis L3-stage larvae. Mol Biochem Parasitol. 2002, 125 (1-2): 73-81. 10.1016/S0166-6851(02)00214-1.

    Article  PubMed  Google Scholar 

  55. Dunne DW, Butterworth AE, Fulford AJ, Kariuki HC, Langley JG, Ouma JH, Capron A, Pierce RJ, Sturrock RF: Immunity after treatment of human schistosomiasis: association between IgE antibodies to adult worm antigens and resistance to reinfection. Eur J Immunol. 1992, 22 (6): 1483-1494. 10.1002/eji.1830220622.

    Article  PubMed  Google Scholar 

  56. Hagan P, Blumenthal UJ, Dunn D, Simpson AJ, Wilkins HA: Human IgE, IgG4 and resistance to reinfection with Schistosoma haematobium. Nature. 1991, 349 (6306): 243-245. 10.1038/349243a0.

    Article  PubMed  Google Scholar 

  57. Wu W, Niles EG, Hirai H, LoVerde PT: Evolution of a novel subfamily of nuclear receptors with members that each contain two DNA binding domains. BMC Evol Biol. 2007, 7: 27-10.1186/1471-2148-7-27.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Zhang J: Evolution by gene duplication: an update. Trends in Ecology & Evolution. 2003, 18 (6): 292-298. 10.1016/S0169-5347(03)00033-8.

    Article  Google Scholar 

  59. Salter JP, Choe Y, Albrecht H, Franklin C, Lim KC, Craik CS, McKerrow JH: Cercarial elastase is encoded by a functionally conserved gene family across multiple species of schistosomes. J Biol Chem. 2002, 277 (27): 24618-24624. 10.1074/jbc.M202364200.

    Article  PubMed  Google Scholar 

  60. Roberts KP, Ensrud KM, Wooters JL, Nolan MA, Johnston DS, Hamilton DW: Epididymal secreted protein Crisp-1 and sperm function. Mol Cell Endocrinol. 2006, 250 (1-2): 122-127. 10.1016/j.mce.2005.12.034.

    Article  PubMed  Google Scholar 

  61. DeMarco R, Oliveira KC, Venancio TM, Verjovski-Almeida S: Gender biased differential alternative splicing patterns of the transcriptional cofactor CA150 gene in Schistosoma mansoni. Mol Biochem Parasitol. 2006, 150 (2): 123-131. 10.1016/j.molbiopara.2006.07.002.

    Article  PubMed  Google Scholar 

  62. Ram D, Ziv E, Lantner F, Lardans V, Schechter I: Stage-specific alternative splicing of the heat-shock transcription factor during the life-cycle of Schistosoma mansoni. Parasitology. 2004, 129 (Pt 5): 587-596. 10.1017/S003118200400602X.

    Article  PubMed  Google Scholar 

  63. Serrano RL: Structural and Functional Characterization of GAPR-1, a Mammalian Plant Pathogenesis-related Protein in Lipidenriched Microdomains of the Golgi Complex. Department of Structural Biology. 2003, University of Heidelberg, PhD:

    Google Scholar 

  64. Fallon PG, Smith P, Dunne DW: Type 1 and type 2 cytokine-producing mouse CD4+ and CD8+ T cells in acute Schistosoma mansoni infection. Eur J Immunol. 1998, 28 (4): 1408-1416. 10.1002/(SICI)1521-4141(199804)28:04<1408::AID-IMMU1408>3.0.CO;2-H.

    Article  PubMed  Google Scholar 

  65. Yoshino TP, Laursen JR: Production of Schistosoma mansoni daughter sporocysts from mother sporocysts maintained in synxenic culture with Biomphalaria glabrata embryonic (Bge) cells. J Parasitol. 1995, 81 (5): 714-722. 10.2307/3283960.

    Article  PubMed  Google Scholar 

  66. Brink LH, McLaren DJ, Smithers SR: Schistosoma mansoni: a comparative study of artificially transformed schistosomula and schistosomula recovered after cercarial penetration of isolated skin. Parasitology. 1977, 74 (1): 73-86.

    Article  PubMed  Google Scholar 

  67. Hoffmann KF, Johnston DA, Dunne DW: Identification of Schistosoma mansoni gender-associated gene transcripts by cDNA microarray profiling. Genome Biol. 2002, 3 (8): RESEARCH0041-10.1186/gb-2002-3-8-research0041.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Wynn TA, Eltoum I, Cheever AW, Lewis FA, Gause WC, Sher A: Analysis of cytokine mRNA expression during primary granuloma formation induced by eggs of Schistosoma mansoni. J Immunol. 1993, 151 (3): 1430-1440.

    PubMed  Google Scholar 

  69. Fitzpatrick JM, Hirai Y, Hirai H, Hoffmann KF: Schistosome egg production is dependent upon the activities of two developmentally regulated tyrosinases. Faseb J. 2007, 21 (3): 823-835. 10.1096/fj.06-7314com.

    Article  PubMed  Google Scholar 

  70. Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17 (8): 754-755. 10.1093/bioinformatics/17.8.754.

    Article  PubMed  Google Scholar 

  72. Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001, 18 (5): 691-699.

    Article  PubMed  Google Scholar 

  73. Page RD: TreeView: an application to display phylogenetic trees on personal computers. Comput Appl Biosci. 1996, 12 (4): 357-358.

    PubMed  Google Scholar 

  74. Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.

    Article  PubMed  Google Scholar 

  75. Hooft RW, Vriend G, Sander C, Abola EE: Errors in protein structures. Nature. 1996, 381 (6580): 272-10.1038/381272a0.

    Article  PubMed  Google Scholar 

  76. Sippl MJ: Recognition of errors in three-dimensional structures of proteins. Proteins. 1993, 17 (4): 355-362. 10.1002/prot.340170404.

    Article  PubMed  Google Scholar 

  77. Le Paslier MC, Pierce RJ, Merlin F, Hirai H, Wu W, Williams DL, Johnston D, LoVerde PT, Le Paslier D: Construction and characterization of a Schistosoma mansoni bacterial artificial chromosome library. Genomics. 2000, 65 (2): 87-94. 10.1006/geno.2000.6147.

    Article  PubMed  Google Scholar 

  78. Hirai H, LoVerde PT: FISH techniques for constructing physical maps on schistosome chromosomes. Parasitol Today. 1995, 11 (8): 310-314. 10.1016/0169-4758(95)80048-4.

    Article  PubMed  Google Scholar 

  79. Freitas TC, Jung E, Pearce EJ: TGF-beta signaling controls embryo development in the parasitic flatworm Schistosoma mansoni. PLoS Pathog. 2007, 3 (4): e52-10.1371/journal.ppat.0030052.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We thank Susan Arnold and Frances Jones for excellent technical help with S. mansoni lifecycle maintenance, Dr. Peter Olson (Natural History Museum, London, UK) for assistance with phylogenetic analyses, Dr. Matt Berriman (Wellcome Trust Sanger Institute, UK) for guidance on SchistoGeneDB interrogation and manual editing and Dr. Mark Blaxter (University of Edinburgh, UK) for helpful discussions regarding SCP/TAPS family members within the Nematoda. We also thank Dr. Jennifer Fitzpatrick (University of Cambridge, UK) and Timothy Yoshino (University of Wisconsin, USA) for providing schistosome life cycle RNA, Dr. David Johnston (Natural History Museum, London, UK) for assisting in the SmVAL6 transcript study, Dr. Rhian Hayward for editorial proofreading and Prof. David Dunne (University of Cambridge, UK) for contributing many helpful comments. This work was supported by means of a Wellcome Trust Career Development Grant (RG35177) awarded to KFH and Biodiversity Research of the 21st Century COE (A14) and Global COE (A06) Grants awarded to HH.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Iain W Chalmers or Karl F Hoffmann.

Additional information

Authors' contributions

IWC, AJM and MAW cloned the SmVAL transcripts and gene fragments, HH performed the FISH experiments, RS helped create the homology models and RMRC identified homologous SCP/TAPS metazoan sequences. IWC analysed the results. KFH designed the study. IWC and KFH wrote the paper. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Neighbor-Joining phylogenetic tree derived from SmVAL protein sequences. Phylogenetic relations were examined using Neighbor-Joining method under the Poisson substitution model. Support values for the consensus tree were obtained by bootstrapping 1000 replicates and are shown. Branch length represents the number of amino acid changes calculated by the Poisson model and is indicated by the scale bar. (DOC 64 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chalmers, I.W., McArdle, A.J., Coulson, R.M. et al. Developmentally regulated expression, alternative splicing and distinct sub-groupings in members of the Schistosoma mansoni venom allergen-like (SmVAL) gene family. BMC Genomics 9, 89 (2008).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: