  • Research article
  • Open access

Transcriptome analysis of reproductive tissue and intrauterine developmental stages of the tsetse fly (Glossina morsitans morsitans)



Tsetse flies, vectors of African trypanosomes, undergo viviparous reproduction (the deposition of live offspring). This reproductive strategy results in a large maternal investment and the deposition of a small number of progeny during a female's lifespan. The reproductive biology of tsetse has been studied on a physiological level; however, the molecular analysis of tsetse reproduction requires deeper investigation. To build a foundation for molecular studies of tsetse reproduction, a cDNA library was generated from female tsetse (Glossina morsitans morsitans) reproductive tissues and the intrauterine developmental stages. A total of 3438 expressed sequence tags were sequenced and analyzed.


Analysis of a nonredundant catalogue of 1391 contigs resulted in 520 predicted proteins, 475 of which were full length. We predict that 412 of the predicted proteins are cytoplasmic while 57 are secreted. Comparison of these proteins with other tissue specific tsetse cDNA libraries (salivary gland, fat body/milk gland, and midgut) identified 51 that are unique to the reproductive/immature cDNA library. Eleven unique proteins were homologous to uncharacterized putative proteins within the NR database, suggesting the identification of novel genes associated with reproductive functions in other insects (hypothetical conserved). The analysis also yielded seven putative proteins without significant homology to sequences present in the public database (unknown genes). These proteins may represent unique functions associated with tsetse's viviparous reproductive cycle. RT-PCR analysis of hypothetical conserved and unknown contigs was performed to determine basic tissue and stage specificity of the expression of these genes.


This paper identifies 51 putative proteins specific to a tsetse reproductive/immature EST library. Eleven of these proteins correspond to hypothetical conserved genes and seven are tsetse specific.


Tsetse flies (Diptera: Glossinidae) are important agricultural and medical vectors responsible for the transmission of African trypanosomes, the agents of sleeping sickness disease in humans and nagana in animals. Human sleeping sickness has resurged in Africa necessitating the development and reevaluation of control strategies [1]. In the past, vector population control has been a successful strategy in reduction of trypanosome transmission. In particular, the success of population reduction based control strategies has resulted from the low reproductive potential of tsetse flies. The knowledge gained from molecular aspects of tsetse reproductive biology has the potential to yield new insights important for increasing the efficiency and decreasing the cost and complexity of the currently available tool set.

Tsetse flies have a unique reproductive physiology and developmental cycle. They undergo viviparous reproduction (intrauterine development and parturition of live offspring). Viviparous reproduction has arisen independently in other flies and in other orders of insects [2], but the process is highly specialized in tsetse. The entire larval developmental cycle is intrauterine and the mother supplies nutrients to her offspring in the form of a milk secretion from a specialized gland (milk gland) [3]. Female flies develop a single larva at a time. As a result, a single female has the capacity to generate ~8 offspring in her lifetime. This is significantly fewer than most other Diptera, such as mosquitoes, which are capable of generating hundreds of offspring in the span of a single life cycle. The low reproductive rate in tsetse represents a potential target for vector control, as disruption of this process could have dramatic effects on population density.

Previous gene discovery projects for tsetse have focused on the adult fat body/milk gland [4], midgut [5] and salivary gland [6] organs in the tsetse species Glossina morsitans morsitans. Multiple genes with potential importance for the reproductive cycle were identified in the fat body/milk gland library analysis. In particular four cDNAs have been characterized in more detail and identified as proteins synthesized by the milk gland for larval nourishment. These proteins include the major milk protein (gmmmgp1) [7], transferrin (gmmtsf) [8], milk gland protein 2 (gmmmgp2) and milk gland protein 3 (gmmmgp3) [9].

In order to perform an in depth study of reproduction associated genes, a new cDNA library was generated with a pool of mRNA from reproductive tissues (uterus, ovaries and spermatheca) and intrauterine offspring at all stages of development (embryo, 1st, 2nd and 3rd instar larvae). cDNAs from this library were randomly sequenced and resulting sequences were manually annotated. Many of the annotated sequences were orthologous to genes associated with oogenesis, embryogenesis and larvigenesis in other insects. Comparison of this library with other tsetse tissue specific libraries was informative in the identification of reproductive specific transcripts with uncharacterized orthologs in other insects, as well as transcripts of putative genes that show no homology to any genes in the NCBI non-redundant database.

Results and discussion

General description of the reproductive/immature cDNA library

A total of 3438 ESTs were sequenced from the reproductive library of which 3048 were used for contig generation after quality control screening. Following assembly, the total number of unique contigs derived from the ESTs was 1391. Average length of the contigs was 1125 bp with a maximum of 5302 bp and a minimum of 88 bp.
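The contig count and the average, maximum and minimum lengths reported above are simple summary statistics over the assembled sequences. The following is a minimal sketch of how such a summary could be computed from a FASTA file; the function names and the two toy contigs are illustrative assumptions, not the tooling or data used in this study.

```python
from statistics import mean


def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {identifier: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def length_summary(records):
    """Return count, mean, minimum and maximum sequence length."""
    lengths = [len(seq) for seq in records.values()]
    return {
        "count": len(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
        "max": max(lengths),
    }


# Toy input (hypothetical contigs, not sequences from the library).
example = """>contig_1
ATGCATGCAT
GCAT
>contig_2
ATGC
"""
summary = length_summary(read_fasta(example))
print(summary)
```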

The contigs from this library were combined with EST sequence data from the other tissue specific tsetse cDNA libraries. The combined library consists of 16509 contigs derived from published cDNA libraries including adult fat body/milk gland [4], midgut [5], salivary gland (in press) and the reproductive/immature ESTs described here. After assembly, reproductive/immature library specific contigs were identified and annotated.
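One way to picture the identification of library specific contigs is as a filter on the origin of each contig's member ESTs: a contig in the combined assembly is treated as specific to a library when every EST assembled into it came from that library. The sketch below illustrates that idea under this assumption; the contig names, library labels and function name are hypothetical, and the actual filtering criteria applied in this study may have differed.

```python
def library_specific(contig_to_libraries, library):
    """Return the contigs whose member ESTs all originate from `library`."""
    return sorted(
        contig
        for contig, libraries in contig_to_libraries.items()
        if libraries == {library}
    )


# Hypothetical mapping of contig -> set of source libraries of its ESTs.
contig_sources = {
    "contig_A": {"reproductive"},
    "contig_B": {"reproductive", "midgut"},
    "contig_C": {"salivary_gland"},
    "contig_D": {"reproductive"},
}

specific = library_specific(contig_sources, "reproductive")
print(specific)
```

A contig such as `contig_B`, which draws ESTs from both the reproductive and midgut libraries, is excluded by this rule.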

Analysis and categorization of proteins extrapolated from reproductive/immature EST sequences was performed (Figure 1). There were a total of 520 unique putative proteins identified in the reproductive/immature cDNA library; 475 transcripts represented full-length cDNAs, 41 were truncated and 4 were fragments. Identification of signal peptides and anchoring motifs associated with the putative proteins predicted that 412 are cytoplasmic proteins, 57 have signal peptides suggesting secretion and 18 are anchored to the cell membrane. Results for the 33 remaining sequences were difficult to interpret due to the fact that they were either fragments or the results were ambiguous.
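The two breakdowns above (by completeness and by predicted localization) should each account for all 520 putative proteins. A short consistency check, using only the counts stated in the text, can be sketched as follows; the dictionary keys and the function name are illustrative labels.

```python
TOTAL_PROTEINS = 520

# Counts as reported in the text.
completeness = {"full_length": 475, "truncated": 41, "fragment": 4}
localization = {
    "cytoplasmic": 412,
    "secreted": 57,
    "membrane_anchored": 18,
    "uninterpretable": 33,
}


def percentages(counts, total):
    """Check that the categories partition `total`, then return percentages."""
    if sum(counts.values()) != total:
        raise ValueError("categories do not sum to the stated total")
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


completeness_pct = percentages(completeness, TOTAL_PROTEINS)
localization_pct = percentages(localization, TOTAL_PROTEINS)
print(completeness_pct)
print(localization_pct)
```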

Figure 1

Statistics on all predicted reproductive/immature proteins. This graph represents all putative proteins within the reproductive/immature cDNA library. Light bars represent the percentage of predicted proteins categorized by function from the reproductive/immature library. Dark bars represent the number of ESTs associated with the genes within each category.

The profile of this library shows that hypothetical conserved proteins constitute 13% of the entire library, followed by proteins associated with metabolism and protein synthesis at 10% and 11%, respectively. The categories that had the highest number of transcripts differed from the ones that had the highest number of actual genes. Factors associated with protein synthesis were the most highly expressed in the library with 416 transcripts, followed by cytoskeletal genes at 206 transcripts and then reproductive proteins at 188 transcripts. These results are logical as reproductive tissues and immature stages synthesize a large amount of protein for oogenesis, embryogenesis and nutrient storage. High levels of expression of cytoskeletal genes such as tubulin and actin are required for the cellular changes undergone during oogenesis, embryogenesis and larvigenesis. Genes classified as reproductive encode proteins associated with processes such as vitellogenesis, embryonic development and larvigenesis. The reproductive classification was determined by homology to previously identified tsetse genes or to orthologs associated with these processes in other insects.

Predicted proteins unique to the reproductive/immature library were categorized and EST abundance for the categories was calculated (Figure 2). After filtration of nonspecific sequences, 51 library specific predicted proteins remained. Within this subgroup, reproduction associated genes contributed to 33% of the ESTs. This group of genes also had the highest number of transcripts among reproductive tract/immature exclusive genes with 169 transcripts. The hypothetical conserved (11 genes) and unknown (7 genes) categories had the second and third highest numbers of transcripts, with 51 and 17, respectively.

Figure 2

Statistics on predicted proteins unique to the reproductive/immature library. This graph represents putative proteins unique to the reproductive/immature cDNA library relative to fat body, midgut and salivary gland specific cDNA libraries. Light bars represent the percentage of library specific predicted proteins categorized by function. Dark bars represent the number of ESTs associated with the genes within each category.

The hypothetical conserved genes are of interest, as these genes are likely associated with reproduction in not only tsetse but in other insects as well. Characterization of these genes could provide new insights into processes associated with reproduction and development in insects. There were seven unknown genes, which returned no significant homologues from the NCBI database and thus appear to be unique to tsetse. Analysis of these genes and their products has the potential to reveal novel functions and processes associated with viviparous reproduction. These gene products may constitute potential targets for population control strategies. As these genes may be unique to tsetse, they may pose very specific targets for control mechanisms interrupting the viviparous reproductive cycle of tsetse.

Protein orthologs expressed exclusively in the reproductive/immature library: Table 1 (Additional file 1)

With the exception of cDNAs encoding unknown proteins, the rest of the library was orthologous to putative proteins identified in the NCBI non-redundant database. The majority of the genes were most closely related to orthologs from Drosophila species (81%). This is not unexpected as Drosophila is well characterized and is closely related to tsetse (both are in the suborder Brachycera/Muscomorpha) relative to other flies such as mosquitoes (which are Nematocera) [10]. We focused our analysis on tsetse cDNAs expressed exclusively in this library to identify putative proteins associated with tsetse's unique reproductive and developmental biology.

Nuclear regulation: Reproductive specific histones

We identified two putative histones limited to the reproductive library. Histone composition of nucleosomes has been shown to change in different tissues and at different developmental stages in Drosophila [11]. One of the histones (EZ421930) appears to be homologous to Histone H1. This particular orthologue of H1 could be associated with regulation of genomic DNA structure during specific developmental events in the developing oocyte or the developing embryo.

The second histone is homologous to histone H3 (EZ421945). A reproduction specific H3 variant (H3.3) appears to play a role in oocyte and embryonic development in mice. This variant is a maternal factor provided to the oocyte and is localized to active regions of the genome. It is thought to be associated with epigenetic reprogramming [12].

Protein export: gamma-SNAP

An important process that occurs during oogenesis in insects is vitellogenesis, the secretion of yolk proteins from either fat body cells or ovarian follicle cells and the uptake of those yolk proteins into the oocyte for use as raw materials during embryonic development. This process requires the transport of protein(s) across the membrane of the cell producing the protein (in the case of tsetse, the ovarian follicle cells) and the membrane of the developing oocyte. A reproductive specific protein homologous to the SNAP family of proteins was identified in the library (EZ421968). In vertebrates, the SNAP proteins are required for formation of the SNARE complex, which promotes the fusion of membrane lipid bilayers [13-15]. As the primary function of this protein is to mediate exocytosis and it is specific to this library, it is reasonable to hypothesize that it is a component of the machinery necessary for vitellogenesis. Further study of this gene could yield insights into the mechanisms of yolk protein secretion and vitellogenesis.

Lipid metabolism/hormone synthesis: 17-beta-hydroxysteroid dehydrogenase type 3

Two of the library exclusive cDNAs identified (EZ421953 and EZ421957) appear to be orthologues of a 17 beta-hydroxysteroid dehydrogenase in Drosophila melanogaster. These enzymes are associated with the biosynthesis and inactivation of steroid hormones [16]. Although the two cDNAs code for similar proteins, they appear to represent unique genes. Steroid hormones are associated with regulation of egg development in other insects. The role of steroid hormone function in tsetse reproduction is unknown. Analysis of these genes is a potential starting point for this area of research.

Yolk proteins

Vitellogenesis is a major process associated with egg development. It is the synthesis and deposition of yolk proteins into a developing oocyte. Characterization of vitellogenesis was previously performed in Glossina [7], where a yolk protein gene identified from a fat body/milk gland tissue library was characterized and found to be predominantly expressed in the reproductive tract. In nematoceran flies such as mosquitoes, yolk proteins are generated by the fat body. In contrast, cyclorrhaphan flies such as D. melanogaster and Sarcophaga bullata express their yolk proteins from both the fat body and the follicle cells of the ovary. In Glossina, yolk proteins appear to be predominantly expressed in the ovary and are likely expressed from the follicle cells. This particular gene is one of a few that have a large number of representative ESTs present in this library (16) relative to other library specific genes. This observation supports our previous characterization as well as observations by [17] that GmmYP is an ovary specific gene and is the only yolk protein identified in Glossina. Multiple yolk proteins have been identified and compared in D. melanogaster, Musca domestica and Sarcophaga bullata [18-20].

Gonadal proteases

Other important constituents of oocyte yolk are proteases required for the metabolism of yolk proteins and activation of other proteases during embryogenesis. One of the library specific genes appears to be homologous to a trypsin type protease (EZ421932). This trypsin could either be a component of the yolk or a larval midgut specific trypsin as the library does contain transcripts from immature stages. Other trypsin like transcripts were also identified (EZ421964 and EZ421960), however these transcripts were truncated making it difficult to perform a complete analysis. Phylogenetic comparison of trypsin EZ421932 with a previously identified midgut specific trypsin from Glossina suggests that these genes developed independently from each other (Figure 3a). Orthologs of the library specific trypsin were identified in other fly species. None of the homologues had been previously characterized. Alignment of this trypsin with its homologues and with midgut specific trypsins reveals structural conservation of key residues associated with the trypsin active site, substrate binding site and proteolytic cleavage site (Figure 3b). The two protein clades differ significantly in the N-terminal region of the protein. This region may be associated with secretion and/or activation of these trypsins.

Figure 3

Phylogenetic and alignment based analysis and comparison of a gonad specific trypsin (EZ421932) with a midgut specific trypsin (EU589384). Orthologous sequences were identified by PSI-BLAST analysis. Sequences were then aligned with standalone ClustalX followed by manual adjustments. Phylogenetic tree construction and bootstrap analysis were performed with MEGA 3.1.

Chorion proteins

Two putative chorion proteins (EZ421949 and EZ421934) were identified as library specific and are orthologous to chorion proteins from Drosophila. The protein encoded by EZ421949 is closely related to chorion protein 38 and is likely a component of the endochorion, as its Drosophila ortholog undergoes transcriptional activation late in oocyte development [21].

RNA localization

Correct localization of nurse cell generated maternal RNA within the developing oocyte is required for dorsal/ventral and anterior/posterior patterning and positioning of primordial germ cells in the developing embryo. A cDNA that is strongly orthologous to the Drosophila gene tsunagi was identified in the database (EZ421943). Tsunagi is an RNA binding protein that forms a complex with another protein called Mago Nashi. In Drosophila, Tsunagi knockout phenotypes include failure of the oocyte nucleus to migrate, failure of the maternal mRNA oskar to localize to the posterior pole, and dorsal/ventral patterning abnormalities [22].


There are many transcription factors associated with embryonic development in insects. The ortholog for one of these factors (encoded by a gene called apterous (ap) in Drosophila) was identified as a library specific transcript (EZ421958). The effects of the knockout of this gene in Drosophila are an absence of wing and haltere development, absence of development of a group of embryonic muscles and juvenile hormone deficiency due to defects in secretory cells in the corpora allata. Indirect effects due to the absence of juvenile hormone include female sterility due to arrest of oogenesis, abnormal larval fat body breakdown, aberrant sexual behavior and premature death in adults [23].

Study of this factor and its effects on the development of the tsetse endocrine system (specifically the corpora allata) could yield important information on how juvenile hormone regulates oogenesis and pregnancy in tsetse.


An essential factor for the development of the embryonic brain and nervous system is a protein called Doublecortin (dcx). Mutations in this gene in mammalian systems result in disruption of cortical neuronal migration [24]. An orthologous cDNA to dcx was identified as a library specific gene. A thorough investigation of how this gene functions in invertebrates has yet to be performed.

Cuticular proteins

Cuticular proteins were another group of larval development associated proteins exclusive to this library. Three putative proteins were identified (EZ421946, EZ421965 and EZ421951), two of which (EZ421946 and EZ421951) are orthologous to ecdysone dependent pupal cuticle proteins. Previous work has been performed on tsetse cuticle proteins expressed during the different developmental stages [25]. It was observed that a large number of proteins are synthesized by the late third instar larvae relative to first and second instar larvae. This is logical, as the intrauterine environment is hydrated and protective, so the larva does not need to protect itself from a hostile environment. The burst of expression of third instar cuticle proteins could be associated with larvae that are preparing for the stresses of parturition, wandering and pupation.

Fat body protein 2

Another transcript identified in the analysis of the library is an ortholog to the fat body Protein 2 (FBP2) gene (EZ421940) which is homologous to an alcohol dehydrogenase protein (ADH). This protein is thought to assist in the degradation of the fat body during metamorphosis [26].


Hexamerins (arylphorins) are larva specific storage proteins. Two hexamerin/arylphorin type proteins were identified as library specific (EZ421969/EZ421959 and EZ421955). These cDNAs were very abundant relative to other unique genes, with representative EST abundance of 23 and 4 transcripts, respectively. Phylogenetic analysis of EZ421969/EZ421959 was performed with hexamerins from other insect species and a hemocyanin from crabs as an outgroup (Figure 4). Tsetse hexamerin forms a group with the other cyclorrhaphan flies (Drosophila and Musca domestica) and then on a larger scale with the nematoceran Diptera (mosquitoes). The next closest group is the Lepidoptera (butterflies and moths) followed by the Coleoptera (beetles). This is followed by the more primitive hemimetabolous insects, Plecoptera (stoneflies), Dictyoptera (cockroaches and termites) and Orthoptera (grasshoppers). The relationship between these proteins follows that predicted by current morphological and molecular systematic analysis of insects at the "Tree of Life" [27]. Phylogenetic analysis of these genes shows them to be informative from a taxonomic viewpoint, as there are many hexamerin proteins present in the NCBI database. They are also of biological interest as they are developmentally and tissue specifically regulated [28] and are essential for the transition from immature to the adult stages of development.

Figure 4

Phylogenetic and alignment based analysis and comparison of a library specific hexamerin protein (EZ421969/EZ421959). Orthologous sequences were identified by PSI-BLAST analysis. Sequences were then aligned with standalone ClustalX followed by manual adjustments. Phylogenetic tree construction and bootstrap analysis were performed with MEGA 3.1.

Tissue and developmental specificity analysis of unknown and hypothetical conserved genes: Table 2 (Additional file 2)

The search for library specific sequences revealed 18 genes of uncharacterized function: 11 hypothetical conserved and 7 unknown. To verify the specificity of these transcripts, RT-PCR analysis was performed using cDNAs from pupa, larva, reproductive tract (ovaries, uterus, spermatheca) and the remaining carcass mRNA from 10 adult flies at varying ages (Table 2). The cDNAs used in this assay were generated from tissue samples independent of the ones used to generate the library. None of the genes were expressed in the carcass, confirming their library specific nature. A total of seven genes, five hypothetical conserved (EZ421939, EZ421950, EZ421936, EZ421937 and EZ421938) and two unknowns (EZ421944 and EZ421929), were found to be exclusive to the reproductive tract. These proteins could be associated with functions in adult tissues such as the ovaries (oogenesis), uterus or spermatheca. Identification of signal peptides and transmembrane domains in these proteins offers clues to their localization and function. EZ421939 and EZ421950 both have predicted secretion signals, suggesting that these proteins are secreted from the reproductive tract. Neither EZ421936 nor EZ421937 has a signal peptide; however, both have one transmembrane domain, with 76% of the protein predicted to be extracellular.

Three hypothetical conserved genes (EZ421923, EZ421935 and EZ421956) were identified as larva specific. All three putative proteins contain signal peptides suggesting secretion. A hypothetical conserved gene was found to be pupa specific (EZ421925) and has a predicted signal peptide and two transmembrane domains with 50% of the protein predicted to be extracellular. EZ421954 was detectable in both reproductive tract and larva and has a secretion signal.

Finally, three unknown genes were expressed in both larva and pupa specific cDNAs (EZ421961, EZ422129 and EZ421927). The localization of EZ421961 and EZ422129 could not be determined as they are partial cDNAs.
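The RT-PCR results described above amount to a presence/absence table of transcripts across sample types. As a minimal sketch, the table can be encoded and grouped by expression pattern as shown below. The accessions and patterns are those named in the text; the data structure and function name are illustrative assumptions and do not represent the authors' analysis code.

```python
from collections import defaultdict

RT = "reproductive tract"
LARVA = "larva"
PUPA = "pupa"

# Sample types in which each transcript was detected, as described in the text.
detected_in = {
    "EZ421939": {RT}, "EZ421950": {RT}, "EZ421936": {RT}, "EZ421937": {RT},
    "EZ421938": {RT}, "EZ421944": {RT}, "EZ421929": {RT},
    "EZ421923": {LARVA}, "EZ421935": {LARVA}, "EZ421956": {LARVA},
    "EZ421925": {PUPA},
    "EZ421954": {RT, LARVA},
    "EZ421961": {LARVA, PUPA}, "EZ422129": {LARVA, PUPA}, "EZ421927": {LARVA, PUPA},
}


def group_by_pattern(detected):
    """Group accessions that share the same set of expressing sample types."""
    groups = defaultdict(list)
    for accession, samples in detected.items():
        groups[frozenset(samples)].append(accession)
    return {pattern: sorted(genes) for pattern, genes in groups.items()}


patterns = group_by_pattern(detected_in)
for pattern, genes in patterns.items():
    print(" + ".join(sorted(pattern)), "->", ", ".join(genes))
```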

Hypothetical and unknown genes are of particular interest as they encode proteins either present in other systems with yet unknown functions, or proteins completely novel to the tsetse system. The evolutionary biology of the unknown genes is of particular interest. Determination of the origins and functions of these proteins will be important to understanding the evolutionary process by which tsetse developed viviparous reproduction. This line of research can also yield the identification of tsetse specific target genes, which could be blocked to disrupt reproduction with minimal environmental effects.


Determining tsetse's transcriptome from the reproductive/immature cDNA library has yielded new information on putative proteins expressed in the reproductive tract and immature intrauterine developmental stages of tsetse. Information associated with similar proteins characterized in other organisms can now be placed into the context of tsetse's reproductive and developmental biology to elucidate its reproductive physiology. This will be important to understand how these genes have been adapted to perform in a viviparous system. Also of importance is the identification of genes orthologous in other organisms that remain uncharacterized, as well as the identification of genes entirely novel to tsetse. Analysis and characterization of these genes will reveal information fundamental to insect reproduction and development. Given the low reproductive capacity of tsetse, molecular data on reproduction specific processes has the potential to reveal novel mechanisms which could be exploited to control tsetse population levels and trypanosomiasis transmission.



The Glossina morsitans morsitans colony maintained in the insectary at Yale University was originally established from puparia from fly populations in Zimbabwe. Flies are maintained at 24 ± 1°C with 50-55% relative humidity, and receive defibrinated bovine blood every 48 h using an artificial membrane system [29].

Tissue collection and RNA extraction

Reproductive tissues and immature developmental stages were dissected from female flies at various stages of pregnancy. Dissections were performed in PBS and tissues were snap frozen in liquid nitrogen and stored at -80°C. Reproductive tissues included ovaries, uterus and spermatheca. Immature stages included embryo, first, second and third instar larvae. Total RNA was isolated from samples using TRIzol® Reagent (Invitrogen, Carlsbad, CA) according to manufacturer's instructions.

cDNA library construction, sequencing, EST clustering and data analysis

The library was commercially constructed in Express 1 vector from 1 mg of total RNA derived from reproductive tissues and immature developmental stages (OPEN biosystems, Huntsville, AL). Each clone was sequenced using a T3 or T7 primer using ABI Big Dye terminator kits on an ABI 3730 sequencing machine. ESTs were trimmed of primer and vector sequences, clustered and compared with other databases as has been described previously [30]. The entire database can be downloaded from the Aksoy lab website

Functional annotation of the transcripts was performed using BLASTX to compare nucleotide sequences to the NR protein database at NCBI [31] as well as to the KOG [32] and GO [33] databases. Detection of conserved protein domains was performed using rpsBLAST [34] and Pfam [35]. The predicted protein translations were submitted to the SignalP server to help identify those products that could be secreted [36]. We compared these transcripts to several proteomes obtained from Flybase [37] (D. melanogaster) and ENSEMBL [38] (An. gambiae).
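To illustrate how similarity search results can separate annotated contigs from "unknown" ones, the sketch below parses BLAST results in the standard 12-column tabular layout (query, subject, percent identity, alignment length, mismatches, gap opens, query start/end, subject start/end, E-value, bit score) and keeps the best hit per contig. The E-value cutoff, contig names and subject names are assumptions made for illustration only; the output format and thresholds used in this study are not stated here.

```python
# Hypothetical significance threshold; the cutoff used in the study is not given.
E_VALUE_CUTOFF = 1e-5


def best_hits(tabular_text, cutoff=E_VALUE_CUTOFF):
    """Return {query: (subject, evalue, bitscore)} for the top-scoring hit per query."""
    best = {}
    for line in tabular_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > cutoff:
            continue  # not significant under the assumed cutoff
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, evalue, bitscore)
    return best


def classify(contig_ids, hits):
    """Label contigs with no significant hit as 'unknown'."""
    return {c: ("has_hit" if c in hits else "unknown") for c in contig_ids}


# Toy rows with made-up identifiers and scores.
rows = [
    ["contig_1", "hypothetical_protein_X", "62.5", "200", "75", "0",
     "1", "600", "1", "200", "2e-40", "160"],
    ["contig_1", "hypothetical_protein_Y", "40.0", "150", "90", "2",
     "10", "460", "5", "154", "3e-12", "70"],
    ["contig_2", "weak_match_Z", "28.0", "60", "43", "1",
     "1", "180", "30", "89", "0.5", "25"],
]
example = "\n".join("\t".join(row) for row in rows)

hits = best_hits(example)
labels = classify(["contig_1", "contig_2", "contig_3"], hits)
print(labels)
```

Under these assumptions, a contig whose only matches fall above the cutoff is treated the same as one with no matches at all.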

A subsequent tsetse EST assembly of the combined EST data dated 12/17/2008 has been produced by the International Glossina Genomics Initiative (IGGI) consortium and is available at The contig sequences in this paper reference the identifiers assigned in the initial clustering. The tables included in this manuscript and supplementary spreadsheets include the analogous GeneDB identifiers and hyperlinks to the appropriate data pages for each sequence.

Phylogenetic analysis

Phylogenetic analysis of the library specific hexamerin and trypsin genes was performed via multiple steps. Putative orthologues were identified by PSI-BLAST [34] search of the NCBI NR database and compiled within a FASTA file. Sequences were aligned using standalone ClustalX software [39]. Alignments were hand edited in BioEdit [40]. Incomplete and poorly aligned sequences were removed. Phylogenetic tree construction and bootstrap analysis were performed in MEGA 3.1 [41].
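Bootstrap analysis of a phylogeny rests on resampling alignment columns with replacement and rebuilding the tree from each pseudo-replicate. The sketch below shows only the resampling step on a toy alignment; it is a generic illustration of the technique, not a reproduction of the MEGA 3.1 implementation, and the sequence names and function name are hypothetical.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Return one bootstrap pseudo-replicate of an alignment.

    `alignment` maps sequence names to aligned strings of equal length.
    Columns are drawn with replacement, and the same column draw is applied
    to every sequence so that each sampled site stays intact.
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[name]) != length for name in names):
        raise ValueError("aligned sequences must have equal length")
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(alignment[name][i] for i in columns) for name in names}


# Toy alignment (made-up sequences).
toy_alignment = {"seq_a": "ACGTAC", "seq_b": "ACGTTC", "seq_c": "ACCTAC"}
replicate = bootstrap_alignment(toy_alignment, random.Random(0))
print(replicate)
```

Support for a clade is then estimated as the fraction of replicate trees in which that clade appears.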

RT-PCR expression analysis

cDNA pools used for RT-PCR analysis were generated from total RNA isolated from reproductive tract (uterus, spermatheca and ovaries), 1st - 3rd instar larvae, pupa and the remaining carcass after removal of reproductive tract and intrauterine offspring. Total RNA was isolated from samples using TRIzol® Reagent (Invitrogen, Carlsbad, CA) according to manufacturer's instructions. One μg of total RNA from each sample was used to synthesize each pool of cDNA using the Superscript Double Stranded cDNA Synthesis kit (Invitrogen). Tubulin levels were used to determine the dilution factor and final amount of cDNA from each pool used in the PCR reactions. After equilibration with tubulin, PCR analysis was performed with primers specific to each of the hypothetical conserved and unknown proteins identified within the library. Primer sequences used for the RT-PCR analysis are listed in Additional File 3. PCR conditions were as follows: 95°C for 3 min (1×); 95°C for 30 s, 60°C for 45 s and 72°C for 1 min (25×); and 72°C for 5 min (1×). PCR products were run on a 1% agarose gel and stained with ethidium bromide. Presence or absence of transcript was determined by visual examination of gel staining.
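The cycling program above can be written down as a small data structure, which makes the total programmed hold time easy to compute. The sketch below uses only the times and cycle count stated in the text; it ignores temperature ramping between steps, which depends on the thermocycler, and the variable and function names are illustrative.

```python
# Hold times in seconds, taken from the PCR conditions stated in the text.
INITIAL_DENATURATION_S = 3 * 60      # 95°C, once
CYCLE_STEPS_S = (30, 45, 60)         # 95°C, 60°C, 72°C
CYCLES = 25
FINAL_EXTENSION_S = 5 * 60           # 72°C, once


def total_hold_time_s(initial, steps, cycles, final):
    """Sum of programmed hold times, excluding ramp time between temperatures."""
    return initial + cycles * sum(steps) + final


total_s = total_hold_time_s(
    INITIAL_DENATURATION_S, CYCLE_STEPS_S, CYCLES, FINAL_EXTENSION_S
)
print(f"{total_s} s ({total_s / 60:.2f} min)")
```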



EST: expressed sequence tag

RT-PCR: reverse transcriptase polymerase chain reaction




  1. Smith DH, Pepin J, Stich AH: Human African trypanosomiasis: an emerging public health crisis. Br Med Bull. 1998, 54: 341-355.

  2. Meier R, Kotrba M, Ferrar P: Ovoviviparity and viviparity in the Diptera. Biol Rev Camb Philos Soc. 1999, 74 (3): 199-258. 10.1017/S0006323199005320.

  3. Denlinger DL, Ma WC: Dynamics of the pregnancy cycle in the tsetse Glossina morsitans. J Insect Physiol. 1974, 20 (6): 1015-1026. 10.1016/0022-1910(74)90143-7.

  4. Attardo GM, Strickler-Dinglasan P, Perkin SA, Caler E, Bonaldo MF, Soares MB, El-Sayeed N, Aksoy S: Analysis of fat body transcriptome from the adult tsetse fly, Glossina morsitans morsitans. Insect Mol Biol. 2006, 15 (4): 411-424. 10.1111/j.1365-2583.2006.00649.x.

  5. Lehane MJ, Aksoy S, Gibson W, Kerhornou A, Berriman M, Hamilton J, Soares MB, Bonaldo MF, Lehane S, Hall N: Adult midgut expressed sequence tags from the tsetse fly Glossina morsitans morsitans and expression analysis of putative immune response genes. Genome Biol. 2003, 4 (10): R63. 10.1186/gb-2003-4-10-r63.

  6. Alves-silva J, Ribeiro JM, Abbeele JV, Attardo GM, Hao Z, Haines LR, Soares MB, Berriman M, Aksoy S, Lehane MJ: An insight into the sialome of Glossina morsitans morsitans. BMC Genomics. 2010,

  7. Attardo GM, Guz N, Strickler-Dinglasan P, Aksoy S: Molecular aspects of viviparous reproductive biology of the tsetse fly (Glossina morsitans morsitans): Regulation of yolk and milk gland protein synthesis. J Insect Physiol. 2006, 52: 1128-1136. 10.1016/j.jinsphys.2006.07.007.

  8. Guz N, Attardo GM, Wu Y, Aksoy S: Molecular aspects of transferrin expression in the tsetse fly (Glossina morsitans morsitans). J Insect Physiol. 2007, 53 (7): 715-723. 10.1016/j.jinsphys.2007.03.013.

  9. Yang G, Attardo GM, Lohs C, Aksoy S: Molecular characterization of two novel milk proteins in the tsetse fly (Glossina morsitans morsitans). Insect Mol Biol. 2010, 19 (2): 253-262. 10.1111/j.1365-2583.2009.00987.x.

  10. Friedrich M, Tautz D: Evolution and phylogeny of the Diptera: a molecular phylogenetic analysis using 28S rDNA sequences. Syst Biol. 1997, 46 (4): 674-698.

  11. Holmgren P, Johansson T, Lambertsson A, Rasmuson B: Content of histone H1 and histone phosphorylation in relation to the higher order structures of chromatin in Drosophila. Chromosoma. 1985, 93 (2): 123-131. 10.1007/BF00293159.

  12. Torres-Padilla ME, Bannister AJ, Hurd PJ, Kouzarides T, Zernicka-Goetz M: Dynamic distribution of the replacement histone variant H3.3 in the mouse oocyte and preimplantation embryos. Int J Dev Biol. 2006, 50 (5): 455-461.

  13. Hanson PI, Heuser JE, Jahn R: Neurotransmitter release - four years of SNARE complexes. Curr Opin Neurobiol. 1997, 7 (3): 310-315. 10.1016/S0959-4388(97)80057-8.

  14. Koticha DK, Huddleston SJ, Witkin JW, Baldini G: Role of the cysteine-rich domain of the t-SNARE component, SYNDET, in membrane binding and subcellular localization. J Biol Chem. 1999, 274 (13): 9053-9060. 10.1074/jbc.274.13.9053.

  15. Koticha DK, McCarthy EE, Baldini G: Plasma membrane targeting of SNAP-25 increases its local concentration and is necessary for SNARE complex formation and regulated exocytosis. J Cell Sci. 2002, 115 (Pt 16): 3341-3351.

  16. Penning TM: Molecular endocrinology of hydroxysteroid dehydrogenases. Endocr Rev. 1997, 18 (3): 281-305. 10.1210/er.18.3.281.

  17. Hens K, Macours N, Claeys I, Francis C, Huybrechts R: Cloning and expression of the yolk protein of the tsetse fly Glossina morsitans morsitans. Insect Biochem Mol Biol. 2004, 34 (12): 1281-1287. 10.1016/j.ibmb.2004.08.006.

  18. Hens K, Lemey P, Macours N, Francis C, Huybrechts R: Cyclorraphan yolk proteins and lepidopteran minor yolk proteins originate from two unrelated lipase families. Insect Mol Biol. 2004, 13 (6): 615-623. 10.1111/j.0962-1075.2004.00520.x.

  19. White NM, Bownes M: Cloning and characterization of three Musca domestica yolk protein genes. Insect Mol Biol. 1997, 6 (4): 329-341. 10.1046/j.1365-2583.1997.00187.x.

  20. Bownes M: Three genes for three yolk proteins in Drosophila melanogaster. FEBS Lett. 1979, 100 (1): 95-98. 10.1016/0014-5793(79)81138-2.

  21. Spradling AC, Mahowald AP: Amplification of genes for chorion proteins during oogenesis in Drosophila melanogaster. Proc Natl Acad Sci USA. 1980, 77 (2): 1096-1100. 10.1073/pnas.77.2.1096.

  22. Mohr SE, Dillon ST, Boswell RE: The RNA-binding protein Tsunagi interacts with Mago Nashi to establish polarity and localize oskar mRNA during Drosophila oogenesis. Genes Dev. 2001, 15 (21): 2886-2899.

  23. Shtorch A, Werczberger R, Segal D: Genetic and molecular studies of apterous: a gene implicated in the juvenile hormone system of Drosophila. Arch Insect Biochem Physiol. 1995, 30 (2-3): 195-209. 10.1002/arch.940300209.

  24. Gleeson JG, Minnerath SR, Fox JW, Allen KM, Luo RF, Hong SE, Berg MJ, Kuzniecky R, Reitnauer PJ, Borgatti R, Mira AP, Guerrini R, Holmes GL, Rooney CM, Berkovic S, Scheffer I, Cooper EC, Ricci S, Cusmai R, Crawford TO, Leroy R, Andermann E, Wheless JW, Dobyns WB, Walsh CA: Characterization of mutations in the gene doublecortin in patients with double cortex syndrome. Ann Neurol. 1999, 45 (2): 146-153. 10.1002/1531-8249(199902)45:2<146::AID-ANA3>3.0.CO;2-N.

  25. Ochieng VO, Osir EO, Ochanda JO, Olembo NK: Temporal synthesis of cuticle proteins during larval development in Glossina morsitans. Comp Biochem Physiol B Biochem Mol Biol. 1993, 105 (2): 309-316. 10.1016/0305-0491(93)90234-V.

  26. Matsumoto N, Sekimizu K, Soma G, Ohmura Y, Andoh T, Nakanishi Y, Obinata M, Natori S: Structural analysis of a developmentally regulated 25-kDa protein gene of Sarcophaga peregrina. J Biochem (Tokyo). 1985, 97 (5): 1501-1508.

  27. Misof B, Niehuis O, Bischoff I, Rickert A, Erpenbeck D, Staniczek A: Towards an 18S phylogeny of hexapods: accounting for group-specific character covariance in optimized mixed nucleotide/doublet models. Zoology (Jena). 2007, 110 (5): 409-429.

  28. Jinwal UK, Zakharkin SO, Litvinova OV, Jain S, Benes H: Sex-, stage- and tissue-specific regulation by a mosquito hexamerin promoter. Insect Mol Biol. 2006, 15 (3): 301-311. 10.1111/j.1365-2583.2006.00644.x.

  29. Moloo SK: An artificial feeding technique for Glossina. Parasitology. 1971, 63 (3): 507-512. 10.1017/S0031182000080021.

  30. Valenzuela JG, Francischetti IM, Pham VM, Garfield MK, Ribeiro JM: Exploring the salivary gland transcriptome and proteome of the Anopheles stephensi mosquito. Insect Biochem Mol Biol. 2003, 33 (7): 717-732. 10.1016/S0965-1748(03)00067-5.

  31. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.

  32. Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA: The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003, 4: 41. 10.1186/1471-2105-4-41.

  33. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25 (1): 25-29. 10.1038/75556.

  34. Schaffer AA, Aravind L, Madden TL, Shavirin S, Spouge JL, Wolf YI, Koonin EV, Altschul SF: Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res. 2001, 29 (14): 2994-3005. 10.1093/nar/29.14.2994.

  35. Bateman A, Birney E, Durbin R, Eddy SR, Howe KL, Sonnhammer EL: The Pfam protein families database. Nucleic Acids Res. 2000, 28 (1): 263-266. 10.1093/nar/28.1.263.

  36. Nielsen H, Engelbrecht J, Brunak S, von Heijne G: Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng Des Sel. 1997, 10 (1): 1-6. 10.1093/protein/10.1.1.

  37. Tweedie S, Ashburner M, Falls K, Leyland P, McQuilton P, Marygold S, Millburn G, Osumi-Sutherland D, Schroeder A, Seal R, Zhang H: FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 2009, 37 (Database issue): D555-559. 10.1093/nar/gkn788.

  38. Lawson D, Arensburger P, Atkinson P, Besansky NJ, Bruggner RV, Butler R, Campbell KS, Christophides GK, Christley S, Dialynas E, Emmert D, Hammond M, Hill CA, Kennedy RC, Lobo NF, MacCallum MR, Madey G, Megy K, Redmond S, Russo S, Severson DW, Stinson EO, Topalis P, Zdobnov EM, Birney E, Gelbart WM, Kafatos FC, Louis C, Collins FH: VectorBase: a home for invertebrate vectors of human pathogens. Nucleic Acids Res. 2007, 35 (Database issue): D503-505. 10.1093/nar/gkl960.

  39. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.

  40. Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl Acids Symp Ser. 1999, 41: 95-98.

  41. Kumar S, Tamura K, Nei M: MEGA 3: Integrated Software for Molecular Evolutionary Genetics Analysis and Sequence Alignment. Brief Bioinform. 2004, 5: 150-163. 10.1093/bib/5.2.150.


This research was supported by grants from NIH AI51584 and AI081774 to SA, the NIH Ruth Kirschstein Postdoctoral Training Award F32 GM077964 to GMA and by the Intramural Research Program of the National Institute of Allergy and Infectious Diseases to JMCR. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organization imply endorsement by the government of the United States of America.

Because JMCR is a government employee and this is a government work, the work is in the public domain in the United States. Notwithstanding any other agreements, the NIH reserves the right to provide the work to PubMedCentral for display and use by the public, and PubMedCentral may tag or modify the work consistent with its customary practices. You can establish rights outside of the U.S. subject to a government use license.

MB is supported by the Wellcome Trust [grant number WT 085775/Z/08/Z]. This study is part of the International Glossina Genomics Initiative (IGGI), established in 2004 with support from WHO/TDR to promote knowledge on Glossina biology including host-pathogen interactions, genetics of vector competence, olfactory biology and population genetics to support vector control efforts.

Author information

Corresponding author

Correspondence to Serap Aksoy.

Additional information

Authors' contributions

GA prepared the biological material for library construction, supervised tissue expression experiments, performed data analysis and prepared the manuscript. JMCR performed data analysis and contributed to the manuscript. YW performed tissue-specific gene expression experiments. MB participated in sequencing the cDNA library. SA participated in library construction, data analysis and contributed to the manuscript. All authors have read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Table 1. This file contains the list of predicted (reproductive/immature library specific and non specific) reproduction associated proteins. (DOC 98 KB)


Additional file 2: Table 2. This file contains the list of reproductive/immature library specific hypothetical conserved and unknown proteins with RT-PCR expression data. (DOC 62 KB)


Additional file 3: Primer list supplement. This file contains a table of the primers used in the tissue specificity RT-PCR analysis from Table 2. (DOC 50 KB)

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Attardo, G.M., Ribeiro, J.M., Wu, Y. et al. Transcriptome analysis of reproductive tissue and intrauterine developmental stages of the tsetse fly (Glossina morsitans morsitans). BMC Genomics 11, 160 (2010).
