Skip to main content

BMP signaling components in embryonic transcriptomes of the hover fly Episyrphus balteatus (Syrphidae)



In animals, signaling of Bone Morphogenetic Proteins (BMPs) is essential for dorsoventral (DV) patterning of the embryo, but how BMP signaling evolved with changes in embryonic DV differentiation is largely unclear. Based on the extensive knowledge of BMP signaling in Drosophila melanogaster, the morphological diversity of extraembryonic tissues in different fly species provides a comparative system to address this question. The closest relatives of D. melanogaster with clearly distinct DV differentiation are hover flies (Diptera: Syrphidae). The syrphid Episyrphus balteatus is a commercial bio-agent against aphids and has been established as a model organism for developmental studies and chemical ecology. The dorsal blastoderm of E. balteatus gives rise to two extraembryonic tissues (serosa and amnion), whereas in D. melanogaster, the dorsal blastoderm differentiates into a single extraembryonic epithelium (amnioserosa). Recent studies indicate that several BMP signaling components of D. melanogaster, including the BMP ligand Screw (Scw) and other extracellular regulators, evolved in the dipteran lineage through gene duplication and functional divergence. These findings raise the question of whether the complement of BMP signaling components changed with the origin of the amnioserosa.


To search for BMP signaling components in E. balteatus, we generated and analyzed transcriptomes of freshly laid eggs (0-30 minutes) and late blastoderm to early germband extension stages (3-6 hours) using Roche/454 sequencing. We identified putative E. balteatus orthologues of 43% of all annotated D. melanogaster genes, including the genes of all BMP ligands and other BMP signaling components.


The diversification of several BMP signaling components in the dipteran linage of D. melanogaster preceded the origin of the amnioserosa.

[Transcriptome sequence data from this study have been deposited at the NCBI Sequence Read Archive (SRP005289); individually assembled sequences have been deposited at GenBank (JN006969-JN006986).]


Across animals, the Bone Morphogenetic Protein (BMP) signaling pathway plays a major role in specifying the dorsoventral (DV) axis [1, 2]. However, the components of the BMP pathway have been repeatedly modified through lineage specific gene duplications and gene losses [3, 4]. Whether some of these genetic changes correlate with the origin of species-specific morphological traits that develop under the control of the BMP pathway is unknown. Flies (Diptera) provide an excellent opportunity to address this question firstly because the BMP signaling pathway of Drosophila melanogaster has been studied in great detail [5, 6], and secondly because tissue specification presumably under the control of BMP signaling along the DV axis of dipterans has undergone significant change [7]. In D. melanogaster, dorsal blastoderm differentiates into a single extraembryonic epithelium, called amnioserosa, which closes the developing embryo dorsally [8]. This tissue is found in higher cyclorrhaphan flies (Schizophora), but in other dipterans, dorsal blastoderm gives rise to distinct serosal and amniotic epithelia [911]. Serosa and amnion develop from an amnioserosal fold at the margins of the gastrulating embryo. The outer cell layer of this fold becomes the serosa, which closes about the embryo. Its inner cell layer detaches from the serosa but retains continuity with the embryo while closing dorsally (lower cyclorrhaphan flies) or ventrally (non-cyclorrhaphan dipterans). The lower cyclorrhaphan syrphids represent the closest relatives of D. melanogaster that have been shown to develop distinct serosa and amnion tissues [9]. Therefore, they are of particular interest in efforts to understand how the origin of the amnioserosa as a new morphology is linked to changes in the underlying developmental gene network.

In previous studies we have characterized the role that the homeobox gene zerknüllt (zen) may have played in the origin of amnioserosa development [reviewed in 7]. The transcription factor Zen is regulated by BMP signaling and essential in serosa specification in non-schizophoran insects and amnioserosa specification in D. melanogaster[9, 1214]. In lower Cyclorrhapha and more distant relatives of D. melanogaster, zen expression in the serosa is maintained after gastrulation, i. e., when the serosa begins to spread over the embryo [9], whereas in D. melanogaster zen expression in the amnioserosa is down-regulated immediately after gastrulation [13]. In lower cyclorrhaphan flies, postgastrular down-regulation of zen abrogates serosa development and results in the formation of a single extraembryonic tissue with amniotic gene expression [15]. Thus, the repression of this single transcription factor may account for the morphological tissue reorganization that accompanied the origin of the amnioserosa. However, loss of postgastrular zen expression does not explain, why in lower cyclorrhaphan and non-cyclorrhaphan dipterans the patterning of the dorsal blastoderm results in the specification of two distinct extraembryonic tissues types as opposed to one in schizophoran (i. e. higher cyclorrhaphan) flies such as D. melanogaster.

In D. melanogaster, amnioserosa specification occurs at the dorsal midline and requires peak-levels of BMP activity [14, 16], which are provided through the interaction of two extracellular ligands, Dpp and Scw [1719]. Both ligands are secreted into the perivitelline space and transported towards the dorsal midline [17, 2024], where BMP-ligand dimers are released from antagonists to activate a receptor complex and initiate intracellular signaling [17, 20, 2527]. A cell autonomous autoregulatory loop further increases the BMP signal at the dorsal midline and generates a narrow and sharply delineated domain of BMP peak activity [20, 28]. Dpp is essential for BMP activity and controls the specification of all tissues that develop under the control of this pathway in the early embryo, including amnioserosa and dorsal ectoderm [16, 29]. Scw boosts BMP activity along the dorsal midline and is in particular required for amnioserosa specification [19].

In other dipterans, the molecular mechanisms that specify the amnion and serosa are not known. Expression studies in a mosquito suggest that a tighter expression of the Dpp antagonist short gastrulation (Sog) leads to broader BMP signaling, which in turn may allow for the specification of two versus one extraembryonic tissue type [11]. Additionally, Scw is absent from the genomes of mosquitoes and other insects, and it has been suggested that its origin may correlate with the origin of the amnioserosa [3]. Several other BMP signaling components of D. melanogaster resulted from gene duplications that have been mapped to the dipteran lineage, while others - known from the BMP pathways of vertebrates - were lost in the lineage leading to D. melanogaster[4]. Here we use embryonic transcriptome data of the hover fly Episyrphus balteatus (Syrphidae) to address the question of whether evolutionary changes in the complement of BMP signaling components occurred in correlation with the origin of the amnioserosa. Specifically, we found that with the possible exception of one gene-duplication (crossveinless/shrew) and one gene loss (DAN), genes encoding known BMP signaling components, including scw, are conserved across the schizophoran boundary of the dipteran tree. Thus, most or all of the gains and losses of BMP signaling genes in the dipteran lineage do not correlate with the origin of the amnioserosa, suggesting that the origin of amnioserosa specification was probably achieved by rearranging the interaction of established factors.

Results and Discussion

Putative Orthologues of 6013 E. balteatus Genes

We sequenced the transcriptome of E. balteatus embryos at two successive time points during early embryogenesis: 0-0.5 hrs old embryos to sample pre-blastoderm stages prior to the onset of zygotic transcription ("maternal library"), and 3-6 hrs old embryos to sample blastoderm and gastrulation stages after the onset of zygotic transcription ("zygotic library"). The cDNA libraries that we prepared from these developmental stages were normalized (Additional file 1A,B) and sequenced using the 454 GS FLX Titanium platform. Following removal of contaminants (see Material and Methods, Additional file 1C, D) reads from both libraries were pooled and assembled using the Newbler Assembler from Roche. Above our chosen cutoff of 100 nt, this assembly yielded a total of 16,950 contigs with an average length of 798 nt (13.5 MB) and 26,862 singletons with an average length of 264 nt (7.1 MB). This data set (20.6 MB total sequence data) was used in subsequent analyses (Figure 1A).

Figure 1
figure 1

Overview of E. balteatus transcriptome assembly and annotation. (A) Number of sequences per 200-nucleotide-bin (y-axis) is shown as function of assembled sequence lengths (x-axis). Numbers of assembled contigs (dark grey) and singletons (light grey) are shown on a logarithmic scale (left scale), and the proportion of annotated E. balteatus sequences (red graph) are shown in percent (right scale). Note that sequences below 100 nt were not used for further analyses and were excluded from bin 0-200. (B) Gene ontology assignments of annotated E. balteatus genes, red bars indicate the proportion of annotated E. balteatus sequences in percent.

To identify E. balteatus genes, we pooled sequence data from both libraries and performed reciprocal BLAST against annotated genes of D. melanogaster. Based on reciprocal hits, we identified putative D. melanogaster orthologues of 6013 E. balteatus genes. In total, about 8.1 MB (39%) of the assembled E. balteatus sequence data could be annotated (red line in Figure 1A), corresponding to 43% of annotated D. melanogaster genes. Specifically, we recovered 85% of genes annotated for translational control, over 60% of the genes known to encode gene-specific transcription factors (transcription factor - strict [30]), 50% of the genes associated with known and putative gene-specific transcription factors (transcription factor - putative [30]), about 50% of genes associated with structural functions (structure) and enzymes, and 30-40% of genes associated with receptor binding, molecular transporters, and signal transduction (transducer) (Figure 1B).

Assessment of Coverage

To estimate the coverage of developmental genes, we separately mapped the reads from each library back onto all of 14 previously described E. balteatus segmentation genes, which comprise orthologues of bicoid (Eba-bcd), caudal (Eba-cad), nanos (Eba-nos), torso (Eba-tor), orthodenticle (Eba-otd), hunchback (Eba-hb), Krüppel (Eba-Kr), knirps (Eba-kni), giant (Eba-gt), hairy (Eba-h), even-skipped (Eba-eve), zerknüllt (Eba-zen), tailless (Eba-tll), and huckebein (Eba-hkb) [9, 3133]. The combined maternal and zygotic coverage of all fourteen genes was on average 8.6-fold (Additional file 2), slightly less than the average coverage of the entire assembled transcriptome (~12-fold). However, coverage of 5'UTR sequences (2.0-fold) and 3'UTR sequences (1.6-fold) was considerably lower than the coverage of ORF sequences (12.5-fold). As the CG content of UTRs (20%) was notably lower than the CG content of the ORFs (43%), a systematic bias against AT rich sequences may have been introduced by less efficient annealing of the random hexamer primers during first strand cDNA synthesis. In any case, coverage of our E. balteatus transcriptome data set was high enough to identify at least fragments of genes known to be active during early embryonic development.

All fourteen genes were represented with at least one read from the zygotic library (blue lines in Figure 2A-N), which is consistent with our previous finding that all these genes are expressed in the 3-6 hours time window of embryonic development [9, 3133]. For six of these genes (Eba-bcd, Eba-cad, Eba-nos, Eba-tor, Eba-otd, Eba-hb) we also obtained reads from the maternal library (red lines in Figure 2A-F). Previous and new (Additional file 3A-C) in situ hybridization data indicated maternal expression of Eba-bcd, Eba-cad, Eba-nos, Eba-tor and Eba-otd, but not of Eba-hb. Quantitative PCR (qPCR) on non-normalized cDNA suggested an 18-fold increase of Eba-hb expression levels following the onset of zygotic transcription, whereas expression levels both of Eba-otd and Eba-cad increased by about 2-fold (orange bars in Additional file 3D). Coverage of these genes in the maternal and zygotic transcriptomes closely reflected our qPCR data (light grey bars in Additional file 3D), suggesting that, despite cDNA normalization, the coverage of these genes remained roughly proportional to their expression levels.

Figure 2
figure 2

Coverage of previously identified E. balteatus developmental genes. (A) Eba-bcd (GenBank accession: HM044914), (B) Eba-cad (FJ387230), (C) Eba-nos (FJ387226), (D) Eba-tor (HM044920), (E) Eba-otd (FJ387225), (F) Eba-hb (FJ387229), (G) Eba-Kr (HM044918), (H) Eba-kni (HM044916), (I) Eba-gt (HM044915), (J) Eba-h (AY645032), (K) Eba-eve (AY645031), (L) Eba-zen (DQ323932), (M) Eba-kkb (HM067828), and (N) Eba-tll (HM044919). Coverage (i.e., number of reads) is shown as function of nucleotide position in cDNA (x-axis; ORF in grey). Coverage in the maternal cDNA pool (0-0.5 hrs) is labeled in red, coverage in the zygotic cDNA pool (3-6 hrs) is labeled in blue. Average coverage is indicated by dotted horizontal lines. Fold-changes in coverage levels from maternal to zygotic pool are given in green (increase) or red (decrease) numbers. A listing of coverage values detailing ORF and UTR is given in Additional file 2. Scale bars (top right of panel) are 500 nt.

BMP Signaling Components in the E. balteatus Transcriptome Database

In D. melanogaster, BMP signaling at the dorsal side of the blastoderm is required to specify the amnioserosa as a single extraembryonic tissue (see Background). Based on mosquito data, it has been suggested that in lower dipterans restricted expression of the BMP antagonist Short gastrulation (Sog) may account for an expanded BMP signaling domain in the dorsal blastoderm, which resolves into serosa and amnion territories [11]. Furthermore, it has been suggested that the complement of BMP signaling components changed in the dipteran lineage in parallel with the origin of the amnioserosa [3]. We used our transcriptome database as a tool to test the latter idea by searching for E. balteatus homologues of specific BMP signaling components of D. melanogaster.

Specific BMP signaling components include (1) extracellular ligands, (2) transmembrane receptors, (3) intracellular signal transducers, and (4) extracellular modulators of ligands. The D. melanogaster genome contains a total of three genes encoding BMP ligands: decapentaplegic (dpp) [18], glass bottom boat (gbb) [34], and screw (scw) [19]. These are selectively used depending on the developmental context [5]. In other insects, only homologues of dpp and gbb have been found [3, 4]. We identified E. balteatus homologues of all three ligands (Figure 3A), indicating that these genes existed prior to the origin of the amnioserosa. Consistent with previous reports [3, 4], our gene tree supports a sister gene relationship between scw and gbb. As expected based on a comprehensive survey of TGF-β signaling components in the beetle Tribolium castaneum[4], we also identified E. balteatus homologues for each of the D. melanogaster TGF-β receptors Thickveins (Tkv) and Saxophone (Sax) [3538], Punt (Put) [39, 40], Baboon (Babo) [41], and Wishful thinking (Wit) [42, 43], and the SMAD transducers Mothers against Dpp (Mad), Medea, and Smox [4448], as well as other TGF-ß signaling components (Figure 3B,C; Additional file 4).

Figure 3
figure 3

Orthologies of BMP signaling components based on maximum likelihood gene trees using predicted amino acid sequences. (A) TGFß ligands (substitution model DCMut+i+g+g); (B) TGFβ type I and type II receptors (substitution model WAG+i+g+f); (C) SMAD family (substitution model LG+g+f); (D) Metalloproteases related to Tld and Tok (substitution model Dayhoff+g+f); (E) Kekkon family (substitution model LG+i+g+f); (F) Crossveinless family (substitution model Dayhoff+i+g+f); (G) DAN family (substitution model Dayhoff+i+g+f); scale bars indicate estimated changes/position. BeetleBase (T. castaneum) and GenBank accession numbers (all others): (A) Act (Activin-β; NM_143685), Tca Act (TC015806), Tca BMP10 (TC006506), Daw (dawdle; NM_078737), Tca Daw (TC04297), Dpp (decapentaplegic; NM_057963), Eba Dpp (JN006972), Tca Dpp (TC008466), Gbb (glass bottom boat; NM_057992), Eba Gbb (JN006973), Tca Gbb1 (TC014017), Tca Gbb2 (TC014018), Mav (maverick; NM_079887.2), Tca Mav (TC004299), Myo (myoglianin; NM_166786) Tca Myo (TC015805), Scw (screw; NM_080124.4), Eba Scw (JN006978); (B) Babo (baboon; NM_057652), Eba Babo (JN006969), Tca Babo (TC003240), Put (punt; NM_169591), Eba Put (JN006976), Tca Put (TC011357), Sax (saxophone; NM_078928), Eba Sax (JN006977), Tca Sax (TC015984), Tkv (thickveins; NM_175975), Eba Tkv (JN006980), Tca Tkv (TC006474), Wit (wishful thinking; NM_079953), Eba Wit (JN006986), Tca Wit (TC009314); (C) Mad (mothers against dpp; NM_057669), Eba Mad (JN006974), Tca Mad (TC014924), Med (Medea; NM_079871), Eba Med (JN006975), Tca Med (TC010848), Smox (smad on X; NM_078524), Eba Smox (JN006979), Tca Smox (TC010162); (D) Tld (tolloid; NM_079763), Eba Tld (JN006985), Tok (tolkin; NM_057531), Tca Tok (TC011197); (E) Kek1 (kekkon-1; NM_078835), Kek2 (NM_078827), Tca Kek2 (TC007053), Tca Kek2' (TC008448), Eba Kek2 (JN006983), Kek3 (NM_078851), Tca Kek3 (TC007226), Kek4 (NM_135771), Tca Kek4 (TC007110), Tca Kek4' (TC008070), Kek5 (NM_133154), Eba Kek5 (JN006984); (F) Cv (crossveinless; NM_080525), Eba Cv (JN006982), Eba Cv-like (JN006970), Srw (shrew; NM_139629), Tsg (twisted gastrulation; NM_078580), Eba Tsg (JN006981), Tca Tsg (TC003620); (G) Mmu Cerberus (Mus musculus; NM_009887), Mmu Dante (NM_201227), Mmu Gremlin (NM_011824), Tca Gremlin (TC007044), Eba DAN (JN006971), Mmu DAN (NM_008675.2), Tca DAN (TC014861).

In D. melanogaster, activity of BMP ligands is modulated by Sog [22, 23, 26], which in turn is regulated by the related metalloproteases Tolloid (Tld) and Tolkin (Tok) [25, 49, 50]. We identified E. balteatus homologues of sog as well as of tld. While we were not able to identify an orthologue of tok, the presence of a distinct tld orthologue in E. balteatus suggests that the dipteran gene duplication giving rise to tld and tok occurred before the origin of the amnioserosa (Figure 3D). BMP ligand activity in D. melanogaster is additionally modulated by Twisted gastrulation (Tsg) [51, 52], Crossveinless (Cv) [53, 54], Shrew (Srw) [26, 55], as well as the membrane associated factors Crossveinless-2 (Cv-2) [56, 57], Kekkon 5 (Kek5) [58], Pentagone (Pent) [59] and Larval Translucida (Ltl) [60]. We identified E. balteatus homologues of cv-2, kek5 (Figure 3E) as well as tsg, cv and an additional cv paralogue, Eba-cv-like (Figure 3F). Previous studies have suggested that tsg, cv, and srw originated by two successive duplications of a cv-like ancestor in the dipteran lineage [3]. Our gene tree analysis is consistent with this idea, but does not resolve whether Eba-cv-like is orthologous to srw, or whether it is the product of an independent gene duplication in E. balteatus. We did not identify orthologues of pent and ltl in E. balteatus, but putative orthologues of both genes are present in the genome of T. castaneum (data not shown). Thus, all currently known modulators of BMP ligand activity in D. melanogaster may have existed prior to the origin of the amnioserosa.

Putative orthologues of the vertebrate BMP ligands BMP10 [61] and Anti-Dorsalizing Morphogenetic Protein (ADMP) [6264], as well as the vertebrate BMP inhibitors BAMBI [65], DAN and Gremlin [66] have been found in beetles and/or wasps but not in D. melanogaster[4]. Among these, we were able to identify a putative orthologue of DAN in E. balteatus (Figure 3G). The loss of this gene may correlate with the origin of the amnioserosa. However, the function of DAN in insects remains unknown and its potential role in BMP signaling is therefore speculative.

Differences in maternal expression of BMP signaling components in D. melanogaster and E. balteatus

Based on our finding that coverage levels of segmentation genes in the two sequenced transcriptomes were roughly proportional to their expected expression levels (see above; Additional file 3D), we decided to globally compare maternal gene expression between E. balteatus and D. melanogaster. For this purpose, we approximated the maternal expression profiles of all annotated E. balteatus genes based on their coverage in the maternal transcriptome and compared them to maternal expression profiles of their D. melanogaster orthologues. Maternal coverage levels of all annotated E. balteatus genes were corrected for sequence lengths and sequencing depth (see Methods) and plotted against coverage levels of their D. melanogaster orthologues, which were estimated from available SOLiD total RNA sequencing data of 0-2 hour old embryos (i.e. stages when the zygotic transcriptome is still essentially silent) [67] (Figure 4). Coverage levels derived from the non-normalized D. melanogaster transcriptome spread by 5.5 orders of magnitude, while those of the normalized E. balteatus transcriptome spread by 3.6 orders of magnitude. A reduced breadth in coverage levels of E. balteatus genes was expected due to the normalization protocol. The coverage levels of the maternal genes nanos, bicoid, torso, and caudal were higher than 1 in the transcriptome of D. melanogaster and E. balteatus. In contrast, the zygotic genes huckebein, even skipped, giant, hairy, knirps, tailless, Krüppel, and zerknüllt showed coverage levels lower than 1 in both species (Figure 4A). Notably, the scatter plot correctly revealed the expression differences of hunchback (maternally expressed in D. melanogaster but not in E. balteatus) and orthodenticle (maternally expressed in E. balteatus but not in D. melanogaster). When restricting the data set to BMP components (Figure 4B) or transcription factors (Figure 4C), we readily identified additional candidate maternal expression differences in both gene groups. For example, the data suggest maternal expression of crossveinless-2 or kekkon-5 in E. balteatus but not in D. melanogaster, which might reflect different interactions of BMP signaling molecules and regulators in both species during early embryonic development.

Figure 4
figure 4

Coverage of maternal E. balteatus genes in comparison with transcriptome data from 0-2 hours old D. melanogaster embryos. (A) Scatter plot of orthologous genes for which we determined expression levels in E. balteatus (using 454 data from 0-0.5 hrs old embryos) and D. melanogaster (using RNAseq data from 0-2 hrs old embryos) on a log2 scale. The 14 previously identified E. balteatus genes are circled in red (if their D. melanogaster orthologues are known to have a maternal effect) or in blue (if their D. melanogaster orthologues are thought to lack a maternal effect). Asterisks demark genes that have been manually annotated. Genes not detected at the indicated developmental stage in one of the two species are plotted outside the scale (grey area). (B,C) Same as in (A) but limited to 15 identified genes encoding BMP signaling components (B) or 89 genes of gene specific transcription factors (C, list of genes based on flyTF, see Material and Methods). Genes that might lack significant maternal expression in one of the two species are indicated. Our analysis suggests that the zygotic D. melanogaster genes crossveinless-2, kekkon-5, doublesex, abdominal-A, vrille and Optix are expressed maternally in E. balteatus, while the maternal D. melanogaster transcription factor BEAF-32 may lack significant maternal expression in E. balteatus.


Comprising orthologous sequences of nearly half (43%) of all annotated D. melanogaster genes, the newly generated transcriptome data of E. balteatus provide a convenient tool to identify putative orthologues of conserved insect genes. Here we used the transcriptome data of E. balteatus to show that the novel dipteran BMP ligand Scw and other BMP signaling components of D. melanogaster existed prior to the origin of the amnioserosa (Figure 5). These findings suggest that the origin of amnioserosa development was accompanied by subtle changes in the expression of conserved BMP signaling components, rather than on the origin or loss of individual genes. Modification of the BMP pathway is expected to be constrained due to its multiple functions in development. However, the duplicated genes (gbb and scw; tld and tok; cv, tsg and srw) may have relaxed these constraints, because BMP activity could now be provided by non-identical sets of genes in early (blastoderm) and later developmental stages. We suspect that increasing the role of scw, tld, tsg and srw in early (blastoderm) development at the expense of gbb, tok and cv facilitated genetic accommodation of early DV patterning following the origin of the amnioserosa. Conversely, the entire complement of the duplicated BMP signaling genes might still be required for early DV patterning in lower cyclorrhaphan flies such as E. balteatus.

Figure 5
figure 5

Evolution of BMP signaling components and amnioserosa origin.


Preparation of Transcriptome Library

Total RNA was prepared by homogenizing embryos in Trizol (Inivtrogen), treated with DNaseI, and enriched for polyA containing transcripts using the Oligotex kit (Qiagen). First-strand cDNA was synthesized from approximately 1 μg of mRNA. Annealing of random hexamer primers (15 mM) was at 25°C for 10 minutes, cDNA was synthesized at 50°C for 1 hour and followed by inactivation of the reverse transcriptase (Superscript, Invitrogen) at 85°C for 5 minutes. Second-strand cDNA was synthesized using the first strand reaction with Klenow DNA Polymerase at 15°C for 1.5 hours, and terminated by the addition of 0.5 M EDTA, pH 8. cDNA was purified using the QIAquick MinElute Reaction Clean-up Kit (Qiagen). cDNA ends were filled in ("polished") using a mix of Klenow DNA polymerase and T4 polynucleotide kinase with dNTPs at 20°C for 30 minutes, after which the reactions were purified again using the QIAquick MinElute Reaction Clean-up Kit. "A" overhangs were created by incubating polished cDNA with 0.2 mM dATP 0.3 U/μl Klenow exo- at 37°C for 30 minutes. cDNA was purified using the QIAquick MinElute Reaction Clean-up Kit (Qiagen), ligated with a mix of AdaptorA and AdaptorB using T4 DNA ligase, and purified again. Adaptors were generated by annealing equimolar amounts of complementary oligos in 2x TNE buffer (20 mM Tris-Cl, pH 8, 0.2 mM EDTA, pH 8, 100 mM NaCl). Oligo sequences for both adaptors were adapted from 454 Sequencing Technical Bulletin No. 004-2009, ordered from Integrated DNA Technologies and HPLC purified. Amplification of the library was performed in triplicate using Platinum Taq DNA polymerase HiFi with AdapterA and AdapterB primers. AdapterA FW and AdapterB FW primers. Pooled volumes of the library were purified using the QIAquick MinElute Reaction Clean-up Kit.

Library Normalization and Fragment Size Selection

Libraries were normalized using the TRIMMER DIRECT cDNA Normalization Kit (Evrogen) and were carried out essentially as described in the user manual. Briefly, 400 ng of each library were suspended in hybridization buffer and split into four tubes. Following 5 hr incubation at 68°C, the aliquots were treated either with 4 units, 2 units, 1 unit, or no duplex-specific nuclease (DSN). After DSN digestion, the normalized cDNA libraries were amplified by PCR. Optimal amplification within the exponential phase of the PCR was determined visually after electrophoresing different amplification runs of the non-DSN treated sample, after which all aliquots were amplified by a total of 20 cycles. The normalization efficiency was assessed by quantitative PCR (qPCR) of hunchback (Eba-hb, 5'-CTCAGCCCGAATCCAAAT/5'-GGTTGTGGGAGTTGATGTTG, amplicon 137 bp), caudal (Eba-cad, 5'-GAAAGAATACTGCACCTCCC/5'-GTCGTTCCGATAGTTGAAGC, amplicon 79 bp), and alpha-tubulin as reference (Eba-tub, 5'-TGAGGCTCGTGAGGATTT/5'-TCACCATCTCCAGAATCCA, amplicon 71 bp). Primer efficiency was estimated from a standard curve using five different template concentrations (12.5-200 ng); all analyses were run simultaneously in triplicates. For both libraries, the normalization with 4 units DSN were chosen for sequencing. Based on optimal cycle numbers for amplification and degree of normalization, these libraries were size fractionated using agarose gel electrophoresis, excising fragments at roughly 500 bp, and then purifying them using the MinElute Gel Extraction Kit (Qiagen) prior to sequencing.

454 Sequencing

Transcriptome library sequencing was performed on the Roche/454 Life Sciences GS-FLX platform at the Institute for Genomics and Systems Biology's (IGSB) High-Throughput Genome Analysis Core (HGAC) at Argonne National Laboratory according to the Roche GS-FLX XLR70 Titanium emPCR and amplicon sequencing protocols. Each transcriptome library was sequenced on one region of a two region GS-FLX gasket using Roche GS-FLX XLR70 titanium sequencing reagents. An emPCR titration was initially performed on each library to determine the proper bead:library copy ratio that yielded optimal clonal bead percent enrichment to be used in the final bulk XLR70 emPCR reaction.

Sequence Assembly

Sequencing raw data was processed with gsRunProcessor (software release using standard quality filtering and trimming as defined by the default settings. We obtained 417,735 reads with a mean length of 243 nt for the maternal time point and 406,580 reads with a mean length of 278 nt for the zygotic time point, totaling 214.6 MB of sequence data. This raw data set was contaminated with sequences of the pea aphid Acyrthosiphon pisum, because E. balteatus requires aphids for egg deposition, and embryos were collected in batch from leaves heavily infested with aphids. For subsequent analyses, we removed all reads that matched published A. pisum sequences (mRNA + genomic, NCBI, 2009-07-13) [68] with 95% or higher identity. Removal of A. pisum sequences reduced the total sequence data by about 26% to 158.1 MB and the number of reads from each library by about 22%, resulting in 325,200 reads from the maternal library with a mean read length of 228 nt, and 311,906 reads from the zygotic library with a mean read length of 269 nt. The distribution of read lengths displayed two peaks, one at about 400 nt (360 nt for reads from the zygotic library) and one at less than 100 nt (Additional file 1C,D). The peak at 400 nt corresponded to the expected mean length using the Titanium chemistry. The peak slightly below 100 nt presumably resulted from our cDNA preparation protocol, which had not yet been optimized for the Titanium chemistry at the time of library preparation and lacked, for example, any additional size exclusion steps following gel electrophoreses.

All reads have been deposited as the E. balteatus transcriptome at the NCBI Sequence Read Archive (SRA, SRP005289). Assembly was based on combined SFF sequence files of the maternal and zygotic libraries using Newbler Assembler (software release 2.3). Newbler parameters were default except: minimal identity of 90% in overlaps (-mi 90), overlaps to be at least 30 nt in length (-ml 30), minimal length of contigs to be 100 nt (-l 100). Newbler was run as cDNA assembly (-cdna), which includes processing of contigs that are found to be variants of the same transcript into distinct isotigs. From a total of 637,080 reads, 544,776 reads (85.5%) were assembled (fully assembled reads: 432,160 reads, 67.8%; partially assembled reads: 112,616 reads, 17.7%). The remaining reads were singletons (56,625 reads; 8.9%) or excluded as either originating from repeat regions (221 reads; 0.03%), outliers (23,060 reads; 3.6%), or too short (< 50 base pairs: 12,398 reads; 1.9%) (see Additional file 4 for comparison with Newbler 2.5.3).

Sequence Annotation

Reported isotigs (12,296; 12.9 MB) and singletons of at least 100 nt length (26,862; 7.1 MB; identified from 454ReadStatus.txt) were combined, and reciprocal BLAST searches of the E. balteatus transcriptome were carried out against the translated transcriptome of D. melanogaster (dmel-all-translation-r5.29.faa, using blastx (E. balteatus query against D. melanogaster database) and tblastn (D. melanogaster query against E. balteatus database). Annotation was performed with an e-value threshold of 10 to screen for all putative orthologues ("no cutoff") and with an e-value threshold of 1e-10 to obtain a conservative list of high confidence orthologues ("1e-10") (see Additional file 5 for comparison of annotation based on assemblies with Newbler 2.3 and Newbler 2.5.3). Reciprocal hits of E. balteatus with D. melanogaster were assigned with the same flybase D. melanogaster CG identifiers. Gene ontology terms were assigned based on the current D. melanogaster gene ontology, with the exception of 'transcription factor - strict' and 'transcription factor - putative', which were based on a curated list of genes of known and putative gene specific transcription factors [30].

Sequence Coverage

To determine coverage levels for individual published E. balteatus genes, associated reads of the maternal and zygotic library sequences were identified by BLAST search (blastn) and assembled with the published sequence using CAP3 with standard parameters [69]. Coverage levels were then calculated for each nucleotide position of the published gene sequences. Fold-coverage in the maternal library was used to approximate maternal expression levels of all annotated E. balteatus genes. The assembled E. balteatus transcriptome (isotigs and singletons) was blasted against all maternal reads (blastn). Coverage of annotated transcripts was then calculated from all reads that matched with at least 95% identity and over the length of at least 50 nt. To approximate levels of gene expression, coverage of each gene was divided by its sequence length and the total RNAseq data of the maternal library (74 MB). Fold-coverage in the 0-2 hr time point of the D. melanogaster development transcriptome was used to approximate maternal expression levels of D. melanogaster genes [67]. Expression data of D. melanogaster genes in 0-2 hrs old embryos was downloaded from modEncode as coverage data mapped onto the entire genome (BC1_plus.wig, BC1_minus.wig). Expression of gene transcripts was retrieved from this genomic map by extracting coverage information of all exons for each annotated gene (BDGP/dm3). To account for potentially mis-annotated exon-intron structures of computationally predicted genes, gene expression was approximated by the coverage of the most strongly expressed exon longer than 500 nt, or by the coverage of the most strongly expressed transcript variant, whichever was higher. To approximate levels of gene expression, coverage of each gene was divided by its sequence length and the total RNAseq data of the 0-2 hr time point (3.4 GB, SRP001696).

Gene Discovery of BMP Signaling Components

A list of D. melanogaster and Tribolium castaneum BMP signaling compounds was matched against assembled and unassembled E. balteatus transcriptome data (tblastn, e-value of 0.001). Identified E. balteatus sequences were assembled in Sequencher, assemblies were manually corrected for ORF frame shifts by alignment with D. melanogaster protein sequence, and sequence orthology was confirmed by reciprocal blast against a D. melanogaster database. To exhaustively search for putative duplicates specific to the E. balteatus lineage, blast searches were repeated using lower cut-offs (increasingly higher e-values) until all newly identified and assembled E. balteatus reads matched a clearly non-orthologous sequence in available insect protein databases. All T. castaneum sequences were retrieved from BeetleBase (

Phylogenetic Gene Trees

Protein alignments were created using the Clustal algorithm with standard parameters (MegAlign). When more than half of the aligned sequences carried a gap at a given position, these positions were removed from the alignment. The amino acid substitution model was estimated using AIC in ProtTest [70]; maximum likelihood trees were calculated using PhyML [71]. Bootstrap values were based on 1000 replicas. Trees were plotted with drawtree (Phylip package) [72] and the newick-utils package [73].

Custom scripts (Perl, R) were used to automate blast searches and evaluation, calculate E. balteatus and D. melanogaster gene coverage, and prune sequence alignments. Scripts are available on request. Plots were prepared with gnuplot and finished with Freehand. Assembly, blast searches, and bootstrap analysis were computed on the computer cluster of the Department of Ecology & Evolution at the University of Chicago (


  1. 1.

    Niehrs C: On growth and form: a Cartesian coordinate system of Wnt and BMP signaling specifies bilaterian body axes. Development. 2010, 137 (6): 845-857. 10.1242/dev.039651.

    CAS  PubMed  Article  Google Scholar 

  2. 2.

    De Robertis EM: Spemann's organizer and the self-regulation of embryonic fields. Mech Dev. 2009, 126 (11-12): 925-941. 10.1016/j.mod.2009.08.004.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  3. 3.

    Fritsch C, Lanfear R, Ray RP: Rapid evolution of a novel signalling mechanism by concerted duplication and divergence of a BMP ligand and its extracellular modulators. Dev Genes Evol. 2010, 220 (9-10): 235-250. 10.1007/s00427-010-0341-5.

    CAS  PubMed  Article  Google Scholar 

  4. 4.

    Van der Zee M, da Fonseca RN, Roth S: TGFbeta signaling in Tribolium: vertebrate-like components in a beetle. Dev Genes Evol. 2008, 218 (3-4): 203-213. 10.1007/s00427-007-0179-7.

    CAS  PubMed  Article  Google Scholar 

  5. 5.

    O'Connor MB, Umulis D, Othmer HG, Blair SS: Shaping BMP morphogen gradients in the Drosophila embryo and pupal wing. Development. 2006, 133 (2): 183-193.

    PubMed  Article  Google Scholar 

  6. 6.

    Umulis D, O'Connor MB, Blair SS: The extracellular regulation of bone morphogenetic protein signaling. Development. 2009, 136 (22): 3715-3728. 10.1242/dev.031534.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  7. 7.

    Schmidt-Ott U, Rafiqi AM, Lemke S: Hox3/zen and the Evolution of Extraembryonic Epithelia in Insects. Hox Genes' Studies from the 20th to the 21st Century. Edited by: Deutsch J. 2010, Austin, TX: Landes Biosciences

    Google Scholar 

  8. 8.

    Campos-Ortega J, Hartenstein V: The embryonic development of Drosophila melanogaster. 1997, Berlin, Heidelberg, New York: Springer Verlag, 2

    Chapter  Google Scholar 

  9. 9.

    Rafiqi AM, Lemke S, Ferguson S, Stauber M, Schmidt-Ott U: Evolutionary origin of the amnioserosa in cyclorrhaphan flies correlates with spatial and temporal expression changes of zen. Proc Natl Acad Sci USA. 2008, 105 (1): 234-239. 10.1073/pnas.0709145105.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  10. 10.

    Goltsev Y, Rezende G, Vranizan K, Lanzaro G, Valle D, Levine M: Developmental and evolutionary basis for drought tolerance of the Anopheles gambiae embryo. Dev Biol. 2009, 330: 462-470. 10.1016/j.ydbio.2009.02.038.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  11. 11.

    Goltsev Y, Fuse N, Frasch M, Zinzen RP, Lanzaro G, Levine M: Evolution of the dorsal-ventral patterning network in the mosquito, Anopheles gambiae. Development. 2007, 134 (13): 2415-2424. 10.1242/dev.02863.

    CAS  PubMed  Article  Google Scholar 

  12. 12.

    van der Zee M, Berns N, Roth S: Distinct functions of the Tribolium zerknüllt genes in serosa specification and dorsal closure. Curr Biol. 2005, 15: 624-636. 10.1016/j.cub.2005.02.057.

    CAS  PubMed  Article  Google Scholar 

  13. 13.

    Rushlow C, Levine M: Role of the zerknüllt gene in dorsal-ventral pattern formation in Drosophila. Adv Genet. 1990, 27: 277-307.

    CAS  PubMed  Google Scholar 

  14. 14.

    Ray RP, Arora K, Nüsslein-Volhard C, Gelbart WM: The control of cell fate along the dorsal-ventral axis of the Drosophila embryo. Development. 1991, 113 (1): 35-54.

    CAS  PubMed  Google Scholar 

  15. 15.

    Rafiqi AM, Lemke S, Schmidt-Ott U: Postgastrular zen expression is required to develop distinct amniotic and serosal epithelia in the scuttle fly Megaselia. Dev Biol. 2010, 341 (1): 282-90. 10.1016/j.ydbio.2010.01.040.

    CAS  PubMed  Article  Google Scholar 

  16. 16.

    Ferguson EL, Anderson KV: Decapentaplegic acts as a morphogen to organize dorsal-ventral pattern in the Drosophila embryo. Cell. 1992, 71 (3): 451-461. 10.1016/0092-8674(92)90514-D.

    CAS  PubMed  Article  Google Scholar 

  17. 17.

    Shimmi O, Umulis D, Othmer H, O'Connor MB: Facilitated transport of a Dpp/Scw heterodimer by Sog/Tsg leads to robust patterning of the Drosophila blastoderm embryo. Cell. 2005, 120 (6): 873-886. 10.1016/j.cell.2005.02.009.

    CAS  PubMed  Article  Google Scholar 

  18. 18.

    Padgett RW, St Johnston RD, Gelbart WM: A transcript from a Drosophila pattern gene predicts a protein homologous to the transforming growth factor-beta family. Nature. 1987, 325 (6099): 81-84. 10.1038/325081a0.

    CAS  PubMed  Article  Google Scholar 

  19. 19.

    Arora K, Levine MS, O'Connor MB: The screw gene encodes a ubiquitously expressed member of the TGF-beta family required for specification of dorsal cell fates in the Drosophila embryo. Genes Dev. 1994, 8 (21): 2588-2601. 10.1101/gad.8.21.2588.

    CAS  PubMed  Article  Google Scholar 

  20. 20.

    Wang YC, Ferguson EL: Spatial bistability of Dpp-receptor interactions during Drosophila dorsal-ventral patterning. Nature. 2005, 434 (7030): 229-234. 10.1038/nature03318.

    CAS  PubMed  Article  Google Scholar 

  21. 21.

    Eldar A, Dorfman R, Weiss D, Ashe H, Shilo BZ, Barkai N: Robustness of the BMP morphogen gradient in Drosophila embryonic patterning. Nature. 2002, 419 (6904): 304-308. 10.1038/nature01061.

    CAS  PubMed  Article  Google Scholar 

  22. 22.

    Decotto E, Ferguson EL: A positive role for Short gastrulation in modulating BMP signaling during dorsoventral patterning in the Drosophila embryo. Development. 2001, 128 (19): 3831-3841.

    CAS  PubMed  Google Scholar 

  23. 23.

    Ashe HL, Levine M: Local inhibition and long-range enhancement of Dpp signal transduction by Sog. Nature. 1999, 398 (6726): 427-431. 10.1038/18892.

    CAS  PubMed  Article  Google Scholar 

  24. 24.

    Francois V, Solloway M, O'Neill JW, Emery J, Bier E: Dorsal-ventral patterning of the Drosophila embryo depends on a putative negative growth factor encoded by the short gastrulation gene. Genes Dev. 1994, 8 (21): 2602-2616. 10.1101/gad.8.21.2602.

    CAS  PubMed  Article  Google Scholar 

  25. 25.

    Marqués G, Musacchio M, Shimell MJ, Wünnenberg-Stapleton K, Cho KW, O'Connor MB: Production of a DPP activity gradient in the early Drosophila embryo through the opposing actions of the SOG and TLD proteins. Cell. 1997, 91 (3): 417-426. 10.1016/S0092-8674(00)80425-0.

    PubMed  Article  Google Scholar 

  26. 26.

    Ferguson EL, Anderson KV: Localized enhancement and repression of the activity of the TGF-beta family member, decapentaplegic, is necessary for dorsal-ventral pattern formation in the Drosophila embryo. Development. 1992, 114 (3): 583-597.

    CAS  PubMed  Google Scholar 

  27. 27.

    Shimell MJ, Ferguson EL, Childs SR, O'Connor MB: The Drosophila dorsal-ventral patterning gene tolloid is related to human bone morphogenetic protein 1. Cell. 1991, 67 (3): 469-481. 10.1016/0092-8674(91)90522-Z.

    CAS  PubMed  Article  Google Scholar 

  28. 28.

    Umulis DM, Serpe M, O'Connor MB, Othmer HG: Robust, bistable patterning of the dorsal surface of the Drosophila embryo. Proc Natl Acad Sci USA. 2006, 103 (31): 11613-11618. 10.1073/pnas.0510398103.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  29. 29.

    Wharton KA, Ray RP, Gelbart WM: An activity gradient of decapentaplegic is necessary for the specification of dorsal pattern elements in the Drosophila embryo. Development. 1993, 117 (2): 807-822.

    CAS  PubMed  Google Scholar 

  30. 30.

    Pfreundt U, James DP, Tweedie S, Wilson D, Teichmann SA, Adryan B: FlyTF: improved annotation and enhanced functionality of the Drosophila transcription factor database. Nucleic Acids Res. 2010, 38: 443-447. 10.1093/nar/gkp910.

    Article  Google Scholar 

  31. 31.

    Lemke S, Busch S, Antonopoulos D, Meyer F, Domanus M, Schmidt-Ott U: Maternal activation of gap genes in the hover fly Episyrphus. Development. 2010, 137 (10): 1709-1719. 10.1242/dev.046649.

    CAS  PubMed  Article  Google Scholar 

  32. 32.

    Lemke S, Schmidt-Ott U: Evidence for a composite anterior determinant in the hover fly Episyrphus balteatus (Syrphidae), a cyclorrhaphan fly with an anterodorsal serosa anlage. Development. 2009, 136: 117-127. 10.1242/dev.030270.

    CAS  PubMed  Article  Google Scholar 

  33. 33.

    Bullock SL, Stauber M, Prell A, Hughes JR, Ish-Horowicz D, Schmidt-Ott U: Differential cytoplasmic mRNA localisation adjusts pair-rule transcription factor activity to cytoarchitecture in dipteran evolution. Development. 2004, 131 (17): 4251-4261. 10.1242/dev.01289.

    CAS  PubMed  Article  Google Scholar 

  34. 34.

    Doctor JS, Jackson PD, Rashka KE, Visalli M, Hoffmann FM: Sequence, biochemical characterization, and developmental expression of a new member of the TGF-beta superfamily in Drosophila melanogaster. Dev Biol. 1992, 151 (2): 491-505. 10.1016/0012-1606(92)90188-M.

    CAS  PubMed  Article  Google Scholar 

  35. 35.

    Xie T, Finelli AL, Padgett RW: The Drosophila saxophone gene: a serine-threonine kinase receptor of the TGF-beta superfamily. Science. 1994, 263 (5154): 1756-1759. 10.1126/science.8134837.

    CAS  PubMed  Article  Google Scholar 

  36. 36.

    Brummel TJ, Twombly V, Marqués G, Wrana JL, Newfeld SJ, Attisano L, Massagué J, O'Connor MB, Gelbart WM: Characterization and relationship of Dpp receptors encoded by the saxophone and thick veins genes in Drosophila. Cell. 1994, 78 (2): 251-261. 10.1016/0092-8674(94)90295-X.

    CAS  PubMed  Article  Google Scholar 

  37. 37.

    Nellen D, Affolter M, Basler K: Receptor serine/threonine kinases implicated in the control of Drosophila body pattern by decapentaplegic. Cell. 1994, 78 (2): 225-237. 10.1016/0092-8674(94)90293-3.

    CAS  PubMed  Article  Google Scholar 

  38. 38.

    Penton A, Chen Y, Staehling-Hampton K, Wrana JL, Attisano L, Szidonya J, Cassill JA, Massagué J, Hoffmann FM: Identification of two bone morphogenetic protein type I receptors in Drosophila and evidence that Brk25D is a decapentaplegic receptor. Cell. 1994, 78 (2): 239-250. 10.1016/0092-8674(94)90294-1.

    CAS  PubMed  Article  Google Scholar 

  39. 39.

    Letsou A, Arora K, Wrana JL, Simin K, Twombly V, Jamal J, Staehling-Hampton K, Hoffmann FM, Gelbart WM, Massagué J: Drosophila Dpp signaling is mediated by the punt gene product: a dual ligand-binding type II receptor of the TGF beta receptor family. Cell. 1995, 80 (6): 899-908. 10.1016/0092-8674(95)90293-7.

    CAS  PubMed  Article  Google Scholar 

  40. 40.

    Ruberte E, Marty T, Nellen D, Affolter M, Basler K: An absolute requirement for both the type II and type I receptors, punt and thick veins, for dpp signaling in vivo. Cell. 1995, 80 (6): 889-897. 10.1016/0092-8674(95)90292-9.

    CAS  PubMed  Article  Google Scholar 

  41. 41.

    Brummel T, Abdollah S, Haerry TE, Shimell MJ, Merriam J, Raftery L, Wrana JL, O'Connor MB: The Drosophila activin receptor baboon signals through dSmad2 and controls cell proliferation but not patterning during larval development. Genes Dev. 1999, 13 (1): 98-111. 10.1101/gad.13.1.98.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  42. 42.

    Aberle H, Haghighi AP, Fetter RD, McCabe BD, Magalhães TR, Goodman CS: wishful thinking encodes a BMP type II receptor that regulates synaptic growth in Drosophila. Neuron. 2002, 33 (4): 545-558. 10.1016/S0896-6273(02)00589-5.

    CAS  PubMed  Article  Google Scholar 

  43. 43.

    Marqués G, Bao H, Haerry TE, Shimell MJ, Duchek P, Zhang B, O'Connor MB: The Drosophila BMP type II receptor Wishful Thinking regulates neuromuscular synapse morphology and function. Neuron. 2002, 33 (4): 529-543. 10.1016/S0896-6273(02)00595-0.

    PubMed  Article  Google Scholar 

  44. 44.

    Sekelsky JJ, Newfeld SJ, Raftery LA, Chartoff EH, Gelbart WM: Genetic characterization and cloning of mothers against dpp, a gene required for decapentaplegic function in Drosophila melanogaster. Genetics. 1995, 139 (3): 1347-1358.

    CAS  PubMed  PubMed Central  Google Scholar 

  45. 45.

    Wisotzkey RG, Mehra A, Sutherland DJ, Dobens LL, Liu X, Dohrmann C, Attisano L, Raftery LA: Medea is a Drosophila Smad4 homolog that is differentially required to potentiate DPP responses. Development. 1998, 125 (8): 1433-1445.

    CAS  PubMed  Google Scholar 

  46. 46.

    Hudson JB, Podos SD, Keith K, Simpson SL, Ferguson EL: The Drosophila Medea gene is required downstream of dpp and encodes a functional homolog of human Smad4. Development. 1998, 125 (8): 1407-1420.

    CAS  PubMed  Google Scholar 

  47. 47.

    Das P, Maduzia LL, Wang H, Finelli AL, Cho SH, Smith MM, Padgett RW: The Drosophila gene Medea demonstrates the requirement for different classes of Smads in dpp signaling. Development. 1998, 125 (8): 1519-1528.

    CAS  PubMed  Google Scholar 

  48. 48.

    Henderson KD, Andrew DJ: Identification of a novel Drosophila SMAD on the X chromosome. Biochem Biophys Res Commun. 1998, 252 (1): 195-201. 10.1006/bbrc.1998.9562.

    CAS  PubMed  Article  Google Scholar 

  49. 49.

    Srinivasan S, Rashka KE, Bier E: Creation of a Sog morphogen gradient in the Drosophila embryo. Dev Cell. 2002, 2 (1): 91-101. 10.1016/S1534-5807(01)00097-1.

    CAS  PubMed  Article  Google Scholar 

  50. 50.

    Serpe M, Ralston A, Blair SS, O'Connor MB: Matching catalytic activity to developmental function: tolloid-related processes Sog in order to help specify the posterior crossvein in the Drosophila wing. Development. 2005, 132 (11): 2645-2656. 10.1242/dev.01838.

    CAS  PubMed  Article  Google Scholar 

  51. 51.

    Mason ED, Konrad KD, Webb CD, Marsh JL: Dorsal midline fate in Drosophila embryos requires twisted gastrulation, a gene encoding a secreted protein related to human connective tissue growth factor. Genes Dev. 1994, 8 (13): 1489-1501. 10.1101/gad.8.13.1489.

    CAS  PubMed  Article  Google Scholar 

  52. 52.

    Ross JJ, Shimmi O, Vilmos P, Petryk A, Kim H, Gaudenz K, Hermanson S, Ekker SC, O'Connor MB, Marsh JL: Twisted gastrulation is a conserved extracellular BMP antagonist. Nature. 2001, 410 (6827): 479-483. 10.1038/35068578.

    CAS  PubMed  Article  Google Scholar 

  53. 53.

    Shimmi O, Ralston A, Blair SS, O'Connor MB: The crossveinless gene encodes a new member of the Twisted gastrulation family of BMP-binding proteins which, with Short gastrulation, promotes BMP signaling in the crossveins of the Drosophila wing. Dev Biol. 2005, 282 (1): 70-83. 10.1016/j.ydbio.2005.02.029.

    CAS  PubMed  Article  Google Scholar 

  54. 54.

    Vilmos P, Sousa-Neves R, Lukacsovich T, Marsh JL: crossveinless defines a new family of Twisted-gastrulation-like modulators of bone morphogenetic protein signalling. EMBO Rep. 2005, 6 (3): 262-267. 10.1038/sj.embor.7400347.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  55. 55.

    Bonds M, Sands J, Poulson W, Harvey C, Von Ohlen T: Genetic screen for regulators of ind expression identifies shrew as encoding a novel twisted gastrulation-like protein involved in Dpp signaling. Dev Dyn. 2007, 236 (12): 3524-3531. 10.1002/dvdy.21360.

    CAS  PubMed  Article  Google Scholar 

  56. 56.

    Serpe M, Umulis D, Ralston A, Chen J, Olson DJ, Avanesov A, Othmer H, O'Connor MB, Blair SS: The BMP-binding protein Crossveinless 2 is a short-range, concentration-dependent, biphasic modulator of BMP signaling in Drosophila. Dev Cell. 2008, 14 (6): 940-953. 10.1016/j.devcel.2008.03.023.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  57. 57.

    Conley CA, Silburn R, Singer MA, Ralston A, Rohwer-Nutter D, Olson DJ, Gelbart W, Blair SS: Crossveinless 2 contains cysteine-rich domains and is required for high levels of BMP-like activity during the formation of the cross veins in Drosophila. Development. 2000, 127 (18): 3947-3959.

    CAS  PubMed  Google Scholar 

  58. 58.

    Evans TA, Haridas H, Duffy JB: Kekkon5 is an extracellular regulator of BMP signaling. Dev Biol. 2009, 326 (1): 36-46. 10.1016/j.ydbio.2008.10.002.

    CAS  PubMed  Article  Google Scholar 

  59. 59.

    Vuilleumier R, Springhorn A, Patterson L, Koidl S, Hammerschmidt M, Affolter M, Pyrowolakis G: Control of Dpp morphogen signalling by a secreted feedback regulator. Nat Cell Biol. 2010, 12 (6): 611-617. 10.1038/ncb2064.

    CAS  PubMed  Article  Google Scholar 

  60. 60.

    Szuperák M, Salah S, Meyer EJ, Nagarajan U, Ikmi A, Gibson MC: Feedback regulation of Drosophila BMP signaling by the novel extracellular protein Larval Translucida. Development. 2011, 138 (4): 715-724. 10.1242/dev.059477.

    PubMed  Article  Google Scholar 

  61. 61.

    Chen H, Shi S, Acosta L, Li W, Lu J, Bao S, Chen Z, Yang Z, Schneider MD, Chien KR, et al: BMP10 is essential for maintaining cardiac growth during murine cardiogenesis. Development. 2004, 131 (9): 2219-2231. 10.1242/dev.01094.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  62. 62.

    Moos M, Wang S, Krinks M: Anti-dorsalizing morphogenetic protein is a novel TGF-beta homolog expressed in the Spemann organizer. Development. 1995, 121 (12): 4293-4301.

    CAS  PubMed  Google Scholar 

  63. 63.

    Joubin K, Stern CD: Molecular interactions continuously define the organizer during the cell movements of gastrulation. Cell. 1999, 98 (5): 559-571. 10.1016/S0092-8674(00)80044-6.

    CAS  PubMed  Article  Google Scholar 

  64. 64.

    Lele Z, Nowak M, Hammerschmidt M: Zebrafish admp is required to restrict the size of the organizer and to promote posterior and ventral development. Dev Dyn. 2001, 222 (4): 681-687. 10.1002/dvdy.1222.

    CAS  PubMed  Article  Google Scholar 

  65. 65.

    Onichtchouk D, Chen YG, Dosch R, Gawantka V, Delius H, Massagué J, Niehrs C: Silencing of TGF-beta signalling by the pseudoreceptor BAMBI. Nature. 1999, 401 (6752): 480-485. 10.1038/46794.

    CAS  PubMed  Article  Google Scholar 

  66. 66.

    Hsu DR, Economides AN, Wang X, Eimon PM, Harland RM: The Xenopus dorsalizing factor Gremlin identifies a novel family of secreted proteins that antagonize BMP activities. Mol Cell. 1998, 1 (5): 673-683. 10.1016/S1097-2765(00)80067-2.

    CAS  PubMed  Article  Google Scholar 

  67. 67.

    Graveley BR, Brooks AN, Carlson JW, Duff MO, Landolin JM, Yang L, Artieri CG, van Baren MJ, Boley N, Booth BW, et al: The developmental transcriptome of Drosophila melanogaster. Nature. 2010

    Google Scholar 

  68. 68.

    Consortium TIAG: Genome sequence of the pea aphid Acyrthosiphon pisum. PLoS Biol. 2010, 8 (2): e1000313-10.1371/journal.pbio.1000313.

    Article  Google Scholar 

  69. 69.

    Huang X, Madan A: CAP3: A DNA sequence assembly program. Genome Res. 1999, 9: 868-877. 10.1101/gr.9.9.868.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  70. 70.

    Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.

    CAS  PubMed  Article  Google Scholar 

  71. 71.

    Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.

    PubMed  Article  Google Scholar 

  72. 72.

    Felsenstein J: PHYLIP (Phylogeny Inference Package) version 3.6. Distributed by the author. Department of Genome Sciences, University of Washington, Seattle. 2005

    Google Scholar 

  73. 73.

    Junier T, Zdobnov EM: The Newick utilities: high-throughput phylogenetic tree processing in the Unix shell. Bioinformatics. 2010

    Google Scholar 

  74. 74.

    Van der Zee M, da Fonseca RN, Roth S: TGFbeta signaling in Tribolium: vertebrate-like components in a beetle. Dev Genes Evol. 2008, 218 (3-4): 203-213. 10.1007/s00427-007-0179-7.

    CAS  PubMed  Article  Google Scholar 

Download references


We thank Lucien Jarymowycz, IT systems manager in the Department of Organismal Biology at the University of Chicago, for his professional help. Funding was provided by NSF grants 0719445 and 0840687 to U. S.-O.

Author information



Corresponding authors

Correspondence to Steffen Lemke or Urs Schmidt-Ott.

Additional information

Authors' contributions

SL designed the research, performed the bioinformatic analyses and wrote the manuscript. DAA prepared the cDNA libraries for 454 sequencing, FM and MHD performed 454 sequencing. USO designed the research, analyzed the data, wrote the manuscript and obtained funding. All authors read and approved the final manuscript.

Electronic supplementary material

Normalization and 454 sequencing of transcriptome libraries

Additional file 1:. (A) Normalization of library of 0-0.5 hrs old embryos. (B) Normalization of library of 3-6 hrs old embryos. Normalization was assessed by quantitative PCR. Shown are transcript levels of E. balteatus alpha tubulin (Eba-tub) relative to E. balteatus orthologues of hunchback (Eba-hb) and caudal (Eba-cad). Bar heights indicate transcript levels, colors indicate levels in the non-normalized cDNA library (blue), and after normalization with 1 unit DSN (red), 2 units DSN (orange), and 4 units DSN (yellow). (C) Distribution of read lengths of library from 0-0.5 hrs old embryos. (D) Distribution of read lengths of library from 3-6 hrs old embryos. Read length (x-axis) is shown as a function of number of reads (y-axis) before (grey graph) and after removal of pea aphid sequence contaminations (black graph). (PDF 538 KB)


Additional file 2:Coverage of previously identified E. balteatus embryonic patterning genes in the libraries of 0-0.5 hrs old embryos (maternal) and 3-6 hrs old embryos (zygotic). (PDF 61 KB)

Expression of

Additional file 3: Eba-otd in ovarian follicles and early embryos. (A-C) Eba-otd in situ hybridizations of ovarian follicles (A,B) and an early embryo (C). Anterior is left, dorsal is up. (D) Increase of zygotic expression levels relative to maternal expression levels measured for Eba-otd, Eba-cad, and Eba-hb using qPCR (orange bars) and transcriptome coverage (grey bars). qPCR data are based on the average of two independent PCRs, each of which run in triplicate. Bars indicate variance between samples. (PDF 129 KB)

Schematic overview of TGF-β signaling in

Additional file 4: D. melanogaster. TGF-β ligands bind as dimers to a transmembrane receptor complex, which activates signal transducing protein that translocates to the nucleus to regulate gene activity (modified after [74]). Components of the BMP signaling cascade are shown in red, components of the Activin-β signaling cascade in blue, shared components in purple. In the case of generalized BMP signaling, extraecellular ligands (Dpp, Scw, Gbb) are regulated by BMP antagonists Sog and Tsg/Cv/Srw(?), and metalloprotease activity of Tld/Tok, which cleaves Sog and frees BMP. Additional BMP specific regulators are transmembrane protein CV-2, which directly binds BMPs, and membrane associated protein Kek5. Active BMPs signal through the receptor complex composed of Tkv, Sax and Put, which phophorylates Mad. Phophorylated Mad recruits Medea and translocates to the nucleus. For abbreviations, see legend to Figure 3. (PDF 281 KB)

Comparison of sequence annotation following de novo transcriptome assemblies produced by Newbler v2.3 and Newbler v2.5.3

Additional file 5:. During manuscript preparation, a new version of Newbler became available (v2.5). To address the concern of a potential underperformance of Newbler v2.3 in the case of our data set, we repeated the assembly and annotation with the latest available Newbler assembler (v2.5.3) and compared the results to the assembly with Newbler v2.3. We found the total number of assembled bases with Newbler v2.5.3 (10.5 Mb) decreased by almost 20% when compared with the number of assembled bases with Newbler v2.3 (12.9 Mb), suggesting that for our dataset Newbler v2.5.3 performed with more stringent assembly conditions. The observed differences did not affect our identification of BMP signaling components in E. balteatus as these genes were all identified by Blast and subsequent manual assembly. (PDF 263 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Lemke, S., Antonopoulos, D.A., Meyer, F. et al. BMP signaling components in embryonic transcriptomes of the hover fly Episyrphus balteatus (Syrphidae). BMC Genomics 12, 278 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Bone Morphogenetic Protein
  • Coverage Level
  • Bone Morphogenetic Protein Signaling
  • Dorsal Midline
  • Extraembryonic Tissue