Skip to main content
  • Research article
  • Open access
  • Published:

Annotation of novel neuropeptide precursors in the migratory locust based on transcript screening of a public EST database and mass spectrometry



For holometabolous insects there has been an explosion of proteomic and peptidomic information thanks to large genome sequencing projects. Heterometabolous insects, although comprising many important species, have been far less studied. The migratory locust Locusta migratoria, a heterometabolous insect, is one of the most infamous agricultural pests. They undergo a well-known and profound phase transition from the relatively harmless solitary form to a ferocious gregarious form. The underlying regulatory mechanisms of this phase transition are not fully understood, but it is undoubtedly that neuropeptides are involved. However, neuropeptide research in locusts is hampered by the absence of genomic information.


Recently, EST (Expressed Sequence Tag) databases from Locusta migratoria were constructed. Using bioinformatical tools, we searched these EST databases specifically for neuropeptide precursors. Based on known locust neuropeptide sequences, we confirmed the sequence of several previously identified neuropeptide precursors (i.e. pacifastin-related peptides), which consolidated our method. In addition, we found two novel neuroparsin precursors and annotated the hitherto unknown tachykinin precursor. Besides one of the known tachykinin peptides, this EST contained an additional tachykinin-like sequence. Using neuropeptide precursors from Drosophila melanogaster as a query, we succeeded in annotating the Locusta neuropeptide F, allatostatin-C and ecdysis-triggering hormone precursor, which until now had not been identified in locusts or in any other heterometabolous insect. For the tachykinin precursor, the ecdysis-triggering hormone precursor and the allatostatin-C precursor, translation of the predicted neuropeptides in neural tissues was confirmed with mass spectrometric techniques.


In this study we describe the annotation of 6 novel neuropeptide precursors and the neuropeptides they encode from the migratory locust, Locusta migratoria. By combining the manual annotation of neuropeptides with experimental evidence provided by mass spectrometry, we demonstrate that the genes are not only transcribed but also translated into precursor proteins. In addition, we show which neuropeptides are cleaved from these precursor proteins and how they are post-translationally modified.


The desert locust (Schistocerca gregaria) and the African migratory locust (Locusta migratoria) are insects with a well-recognised human impact and ecological and economical importance. They show the interesting phenomenon of phase polymorphism. In the harmless solitarious phase their population density is limited to a few individuals per hectare. Under certain circumstances, as a result of external stimuli and internal (endocrinological) changes, solitarious locusts can switch to gregarious behaviour and begin to aggregate, marching in 'hopper bands' and swarming as adults. This gregarious form destroys all vegetation with a devastating effect on the environment and economy, and has a severe impact on the residents of the afflicted regions (from acute allergic responses to famine). The neuro-hormonal mechanisms that drive and accompany this transition are still far from completely understood, however, it has been proven that several classical neurotransmitters/modulators [1], pheromones [2] and also neuropeptides [3, 4], play an important role. The latter is not surprising, as (neuro)peptides are the largest class of signalling molecules found in animals. They can act as a transmitter, modulator and/or hormone and are known to be involved in most, if not all, physiological processes in Metazoa. Neuropeptides originate from larger precursor proteins. These are processed in the endoplasmatic reticulum to produce the actual bioactive peptides, ranging in size from a few amino acids to around 100. Processing of neuropeptides in insects generally occurs at the dibasic sites KR, RR or KK or at a single R residue proceeded by a basic amino acid residue at position -4, -6 or -8 [5]. In many cases, one neuropeptide precursor contains multiple neuropeptide isoforms and/or related peptides often containing a consensus motif. Some precursors produce structurally related peptides possessing equivalent potencies, whereas others have been shown to yield peptides with different (occasionally opposite) activities [6]. Therefore, it is important to identify all peptides originating from a single precursor since each of these peptides might have a different physiological relevance.

A large number of neuropeptides has already been identified and characterised in locusts, however, only for a few peptides the precursor protein is known. In the first instance (eighties and early nineties) peptide identification was done by means of laborious extraction procedures in combination with numerous bioassays and purification steps (for review see [7]). At a given point the steady stream of newly bioactive neuropeptides stagnated due to the restrictions of the available bioassays. With the introduction of "peptidomics" at the start of the new millennium, neuropeptide research was boosted once again. With the combination of new mass spectrometric methods, genomic databases and bioinformatic tools, it became possible to identify new peptides without the necessity to sacrifice vast amounts of animals. This led to an explosion in the number of identified neuropeptides in for example D. melanogaster [8] and Caenorhabditis elegans [9]. Also in the locust, this new peptidomics technique led to the identification of several new neuropeptides [10, 11] and in addition dozens of partial sequences (unpublished results). The mass spectral analysis mostly generates a small sequence tag (a few amino acids), which is then compared with a genomic database to yield the full sequence. Unfortunately, for locusts very little genome sequence information is available and as the size of their genome is unusually large (it is expected to be up to 30 times larger than the genome of D. melanogaster [12]), no sequencing efforts are likely to be undertaken. This is a major pitfall for neuropeptide identification in locusts and the search for their presumed role in locust phase transition. In general, compared to the genome sequencing projects in the group of Holometabola, genome information in heterometabolous insects is scarce, hindering peptidome identifications.

Recently, an EST database from L. migratoria, comprising 76 012 raw EST data which were assembled into 12 161 clustered unigenes, was constructed and deposited in GenBank [13, 14]. Only 30% of the genes discovered were annotated based on sequence similarity with other species, among which only one group of (neuro)peptide genes (pacifastin-like sequences). Neuropeptides, because of their small size and very limited similarity (mostly the conserved active core comprises only a few amino acids), are often neglected in annotation studies. In this study, we thoroughly searched the locust EST database specifically for the presence of neuropeptide precursors using various bioinformatic tools bearing in mind the typical features of a neuropeptide precursor.

Results and discussion

In the present study, we examined the EST database of L. migratoria generated by Kang et al. [13] for sequences that encode putative neuropeptides and their respective precursors. Besides a whole body database (from gregarious fifth instar locusts), ESTs were generated from the head, hind legs and midgut from solitarious and gregarious locusts separately.

In first instance, we performed BLAST searches using all known neuropeptide precursors from L. migratoria and a related species, S. gregaria, as queries. In a second search, neuropeptides from L. migratoria, for which the precursor proteins are not known, were used as an input query in the BLAST searches. In this analysis, neuropeptide sequences that are expected to originate from a single precursor were combined (all possible combinations) and each neuropeptide sequence was flanked by typical processing sites [(G)KR, (G)(R)R or (G)KK]. As a third group of queries, neuropeptide precursors from D. melanogaster were used for screening the locust EST database.

The resulting EST sequences were further analysed to identify start and stop codons and typical neuropeptide precursor features (length, possible signal peptide (SignalP), cleavage sites, post-translational modifications). This way, we found several new Locusta neuropeptide precursors, which are discussed below. This strategy also provided interesting and valuable information concerning the link between heterometabolous and holometabolous insects at the peptide level.


The pacifastins are a family of low molecular weight serine protease inhibitors from arthropods. They display sequence similarities with nine cysteine rich domains in the light chain of pacifastin, a heterodimeric serine protease inhibitor isolated from the hemolymph of the crayfish Pacifastacus leniusculus, hence their name pacifastins [15]. They are characterised by a conserved pattern of six cysteine residues Cys-Xaa9–12-Cys-Asn-Xaa-Cys-Xaa-Cys-Xaa2–3-Gly-Xaa3–6-Cys-Thr-Xaa3-Cys. They were first discovered in locusts, but in silico data mining of expressed sequence tags databases revealed the existence of additional pacifastin-like polypeptides in Lepidoptera, Diptera, Coleoptera, and Siphonaptera [16].

Thus far, 10 pacifastin-related precursors encoding together 22 different peptides have been characterised in L. migratoria and S. gregaria [1618]]. In L. migratoria 3 pacifastin-related precursors were cloned and sequenced (LMPP 1–3) [19, 20]. Blasting the LMPP sequences against the EST databases resulted in more than 140 hits (E-values ranging from e-112 to e-4), signifying the presence of pacifastin mRNAs in whole-body, midgut, hind leg and head. The presence of pacifastin mRNA in the head provides additional evidence that the brain is one of the sites of pacifastin biosynthesis. Although pacifastin peptides had been shown in several brain areas before, so far there was always controversy whether or not this was due to contamination by the hemolymph, where the pacifastins are known to be present in high concentrations [21, 22]. Most of the hits represent different length ESTs corresponding to one of the three previously cloned LMPP precursor genes (LMPP 1–3) [19, 20], which again shows the high abundance of these pacifastins in locusts. These findings are a good indication that the predefined parameters are chosen well and allow an adequate analysis.

LMPP-1 contains two protease inhibitor peptides (LMPI-1, also called PMP-D2 and LMPI-2, also called PMP-C), that were both isolated and sequenced before [21]. LMPP-2 displays three potential peptide sequences, one of which was isolated before (LMPI-3, previously named HI or hemolymph inhibitor) [23]. The LMPP-3 gene encodes for four putative additional protease inhibitor peptides, none of which was isolated before.

Only the ESTs corresponding to the LMPP-3 gene differ from the previously cloned LMPP-3 gene in one amino acid within the LMPI-7 peptide, as a result of one different nucleotide (AGA coding for R in ESTs instead of ACA coding for T in cDNA cloning). As none of the sequenced LMPP-3 ESTs shows a threonine residue at that position, we can assume that SEEQCTPGTTFKKDCNTCSCGNDGR AAVCTLKACRELTTDQAGSRA is the correct sequence for LMPI-7 (Fig. 1). It cannot be excluded that the long list of hits also includes ESTs corresponding to hitherto unknown Locusta pacifastin precursors.

Figure 1
figure 1

Locusta migratoria Pacifastin Precursor 3. Amino acid sequence alignment of LMPP-3 and the ORF of CO827143 and CO827145 (most identical sequences). The signal peptide is indicated in italic, dibasic cleavage sites are shown in bold and pacifastin-like peptides are marked. Identical residues are marked with an asterisk.

Based on the number of pacifastin ESTs that were found in both phases, the pacifastin genes are predicted to be differentially expressed in the solitary and gregarious phase with a higher prevalence in solitarious locusts [13]. This was previously also shown in a peptide differential display study by Clynen et al. [4].


The neuroparsins are a family of structurally related peptides from arthropods (crustaceans, insects, chelicerates) that is characterised by a pattern of positional conserved cysteine residues and shows striking similarities to the vertebrate insulin-like growth factor binding proteins (IGFBP). The neuroparsins were first discovered in L. migratoria, where they are produced by neurosecretory cells in the pars intercerebralis region of the brain and then transported to the corpora cardiaca, where they are stored prior to release into the hemolymph [24]. They are multifunctional neurohormones displaying antijuvenile [25], antidiuretic [26], hyperglycemic, hyperlipemic [27] and neuritogenic effects [28]. In L. migratoria one neuroparsin precursor has been cloned and sequenced [29]. From this precursor several N-terminally truncated polypeptide isoforms (neuroparsin-A 1–4 and neuroparsin-B) originate.

In the desert locust S. gregaria 4 different neuroparsin precursor cDNAs (Scg-NPP 1–4) were cloned and sequenced [30, 31]. While NPP-1 and 2 are restricted to the brain, NPP-3 and 4 also occur in peripheral tissue (fat body, reproductive system). All 4 neuroparsin precursors are identical in the N-terminal region (first 61 amino acid residues) and contain at least 10 cysteine residues. Recently, evidence was provided for a phase-dependent transcriptional regulation of neuroparsin-encoding genes [32, 33]. It was shown that neuroparsin transcripts are generally more abundant in solitarious locusts. In addition, in contrast to the gregarious phase, solitarious animals also contain detectable transcript levels of NPP 1–2 in the fat body.

A BLAST search with the L. migratoria neuroparsin precursor resulted in 5 hits with E-value < 5e-26. Three hits (CO849956, CO849957 and CO849958) were derived from the solitarious phase head cDNA library, one from the gregarious head (CO832876) and one from the gregarious whole-body (CO821291) cDNA library.

None of these hits, however, gave an exact match, meaning that they represent novel Locusta neuroparsin-like sequences (Fig. 2). This is not surprising, as also in the desert locust 4 different neuroparsin precursor cDNAs (Scg-NPP 1–4) are present.

Figure 2
figure 2

Locusta migratoria Neuroparsin Precursors. Multiple sequence alignment (ClustalW 1.82) of CO849956, CO849957, CO832876, CO849958, CO821291 and the known Locusta neuroparsin precursor (Lom-NPP). Translation initiation sites are indicated in black and cysteine residues are shaded.

Four of the EST tags (CO849956, CO849957, CO832876 and CO821291) most likely represent the same gene, as their protein-coding sequence is identical except for one amino acid in the signal peptide (A versus G in CO849956 as a result of one different nucleotide), which is probably due to a sequencing error. This novel neuroparsin-like sequence in L. migratoria shows 83% identity with NPP-4 of S. gregaria, both containing 10 cysteine residues.

The protein-coding sequence of CO849958 differs substantially from the other ESTs. Although both the start and stop codon is missing, this EST clearly represents a member of the neuroparsin family. The lack of a start codon is probably due to a sequencing error at the beginning of the EST, while at the 3' end the protein-coding sequence is incomplete. The partial sequence displays 76% identity with NPP-2 from S. gregaria. An attempt to complete the 3' end by the contig assembly procedure failed. Blasting the Schistocerca neuroparsin precursors against the Locusta EST database yielded no additional significant hits.


The tachykinins belong to an evolutionary conserved family of peptide neurotransmitters that play major roles in signalling in the nervous system and intestine of both vertebrates and invertebrates. The invertebrate tachykinin family is characterised by a common C-terminal sequence -FXGXRamide (X being a variable amino acid residue) analogous to the vertebrate consensus -FXGLMamide. The first members of the invertebrate tachykinin family were isolated as myotropic peptides from the central nervous system of L. migratoria [34, 35]. All four Locusta tachykinins contain the C-terminal -FXGVRamide (X being Y or H) sequence. Their precursor protein was as yet unknown.

When blasting the four tachykinin sequences separately, only tachykinin-4 gave one hit with a gregarious phase midgut cDNA sequence (CO848687), though with a relatively high E-value (0.13). Blasting the four tachykinin peptides in line (separated by GKR) against the EST database gave the same hit with a lower E-value 1e-7. This clearly demonstrates the benefit of combining several peptides that are expected to originate from the same precursor. The same hit was found when blasting the D. melanogaster tachykinin precursor (1e-4). The translated protein has all typical features of a neuropeptide precursor: SignalP predicts the presence of a signal peptide and several possible dibasic cleavage sites are present (Fig. 3). The EST is incomplete at the 3' side, as no stop codon is present. Of the four previously characterised Locusta tachykinin peptides, only TK-4 gives a perfect match and is present twice in this Locusta tachykinin precursor. The sequences of TK 1–3 were not found within this precursor, meaning that either there is more than one tachykinin precursor present in L. migratoria, or TK 1–3 are encoded by the missing part of the precursor at the 3' end. The latter assumption is strengthened by the fact that in all invertebrate species investigated so far, a single tachykinin precursor gene encodes multiple tachykinin forms [36]. The D. melanogaster genome contains a single tachykinin precursor, coding for six putative tachykinin sequences, five of which were identified in a peptide extract of Drosophila central nervous systems [37]. Also in cockroaches, up to 15 tachykinins are encoded by the same precursor gene [38]. A second argument is that the novel (partial) Locusta tachykinin precursor is only 141 amino acid residues in length, while all known insect tachykinin precursors vary between 289 and 366 amino acid residues.

Figure 3
figure 3

Locusta migratoria Tachykinin Precursor (top) and MS spectrum displaying the monoisotopic masses of 5 different Tachykinins (bottom). Amino acid sequence corresponding to the ORF of CO848687. The signal peptide as predicted by SignalP is shown in italic, possible amidation and dibasic cleavage sites are indicated in bold, possible tachykinin-like peptides are marked. MALDI-TOF mass spectrum of the first abdominal ganglion of Locusta migratoria. Masses corresponding to tachykinin peptides are marked (r.i. is relative intensity, m/z is mass to charge ratio).

Besides Lom-TK-4, the Locusta tachykinin precursor contains another putative tachykinin-like peptide (APMRGFQSVRamide), separated by dibasic cleavage sites. The C-terminus of this peptide resembles the -FXGXRamide consensus sequence for invertebrate tachykinins, except for the G, which is replaced by S. The mass of this putative peptide was calculated (1146.6 Da) and we searched MALDI-TOF mass spectra from the entire Locusta neuroendocrine system for the presence of this peptide. The exact mass was found in peptide extracts from the brain (proto-, deutero- and tritocerebrum), circumoesophageal connectives, suboesophageal ganglion, frontal ganglion, hypocerebral ganglion, pro-, meso and metathoracic ganglion and all five abdominal ganglia (Fig. 3). The novel tachykinin mass was neither found in the corpora allata nor in the neurohemal organs of the head (corpora cardiaca) and the abdomen (perisympathetic organs). The exact masses of the other four known tachykinins (TK 1–4) were found in the same peptide extracts. The fact that the distribution of the novel tachykinin is identical to that of the other four Locusta tachykinins presents an additional argument that all these Locusta tachykinins most likely originate from the same precursor. The presence of the precursor in the midgut (midgut EST library) is consistent with earlier immunocytochemical studies and the purification of a tachykinin from the midgut of S. gregaria [39, 40].

To confirm that the 1146.6 Da peptide found in the MALDI-spectra genuinely corresponds to the tachykinin-like peptide, we analysed a peptide extract of 10 ventral nerve cords by ESI-Q-TOF MSMS and fragmented the ion corresponding to the 1146.6 Da peptide. Hereby we confirmed its sequence. This proves that this peptide is produced from the tachykinin precursor through a common endoproteolytic pathway as seen for most neuropeptides. Hence, this peptide was designated as tachykinin-5 (TK-5). The presence of a peptide with an aberrant C-terminus it not unusual, as also in Leucophaea maderae, Periplaneta americana, Apis mellifera and D. melanogaster tachykinins with a substitution in the C-terminal consensus sequence occur. In both cockroach species it was proven that these peptides (-FXAXRamides) are also translated in vivo by demonstrating their presence in tissue extracts by mass spectrometry [38].

Ecdysis-triggering hormone

The ecdysis-triggering hormone (ETH) is one of the three principal neuroendocrine components known to regulate ecdysis behaviour, together with eclosion hormone (EH) and crustacean cardioactive peptide (CCAP). To date, ETHs have only been identified in the lepidopterans Manduca sexta and Bombyx mori and in the dipterans D. melanogaster and Anopheles gambiae [4145]]. The ETH-genes encode two peptides with ETH-like structure and biological activity: ETH and pre-ETH in Lepidoptera and ETH-1 and 2 in Diptera.

Blasting the D. melanogaster ETH precursor against the L. migratoria EST database gave one significant hit (E-value 6e-6) in the gregarious phase whole body cDNA library (CO822155). There are several indications that this EST actually corresponds to the L. migratoria ETH precursor: the translated protein has all features of a typical neuropeptide precursor, displaying a signal sequence and several dibasic cleavage sites. In addition, the precursor contains two peptides having a C-terminal -PRIamide, which is a typical consensus sequence for the ETH peptides (-PRVamide in Mas-PETH and Bom-PETH, -PRMamide in Mas-ETH and Bom-ETH, -PRIamide in Drm-ETH-2 and Aga-ETH-1 and 2 and -PRLamide in Drm-ETH-1). The Locusta EST sequence does not contain a stop codon. However, blasting the nucleotide sequence of CO822155 against the EST database gave two hits with an overlap in the C-terminal region (CO833211 and CO822156). This way we were able to complete the EST precursor sequence (Fig. 4).

Figure 4
figure 4

Locusta migratoria Ecdysis Triggering Hormone Precursor. Amino acid sequence alignment of CG18105 (the Drosophila ETH-precursor) and the novel Locusta ETH-precursor (assembly of ORF from CO822155 and CO833211). The signal peptide is indicated in italic, amidation sites and dibasic cleavage sites are marked in bold and (putative) ETH-peptides are marked.

The two putative Locusta ETH-peptides were designated as Lom-ETH-1 (SDFFLKTAKSVPRIamide) and Lom-ETH-2 (SDLFLKSAKSVPRIamide). They show most sequence similarity with the Anopheles ETHs (Fig. 5), sharing the C-terminal KSVPRIamide sequence. This EST provides the first biochemical evidence for the presence of ETHs in a heterometabolous insect.

Figure 5
figure 5

Amino acid sequence alignment of all known ETH-like peptides.

The ETHs are produced by the epitracheal organs, which are distributed along the tracheae and mainly consist of Inka cells. In D. melanogaster, the Inka cells are referred to as peritracheal cells. The peritracheal cell system is widely conserved in insects. Several holometabolous and heterometabolous insect orders show PETH-immunoreactivity among peritracheal cells [42]. The morphology and distribution along the tracheae varies considerably. In Orthoptera, tracheae have numerous immunopositive cells scattered on both the major and minor branches. We dissected tracheae from fifth instar L. migratoria just before adult ecdysis and analysed a peptide extract of these tracheae by nanoLC-Q-TOF MS(MS). Both the predicted masses of Lom-ETH-1 and Lom-ETH-2 (respectively 1606.9 Da and 1558.9 Da) were present and their sequences were confirmed by MSMS fragmentation analysis (Fig. 6), proving that these mature ETH-peptides are processed from the ETH-precursor.

Figure 6
figure 6

MSMS spectra of the two locust ETH-peptides. MSMS fragmentation spectrum of the triple charged ion at m/z 536.67, corresponding to ETH-1 (SDFFLKTAKSVPRIamide) and the double charged ion at m/z 780.45, corresponding to ETH-2 (SDLFLKSAKSVPRIamide). A-type, b-type, y-type and z-type fragment ions are shown. The theoretical fragment ion masses found in the spectrum are indicated in bold and the mass difference between the expected and observed fragment ion masses is shown below.

Neuropeptide F

Neuropeptide Y, the most abundant neuropeptide in the mammalian nervous system, is a highly conserved 36 amino acid neuromodulator [46]. The invertebrate NPY-related peptides are divided into two groups based on their size: the shortNPFs ranging in size from 8 to 11 amino acid residues and the (long)NPFs ranging in size from 36 to 40 amino acid residues. The longNPFs are considered as the invertebrate homologues of vertebrate NPY. Whereas the vertebrate NPYs C-terminally end with an amidated tyrosine (Y) residue, all NPFs end with an amidated phenylalanine (F) residue. The NPFs have been found in platyhelminths [47, 48], molluscs [49, 50] and insects [51]. In D. melanogaster, both the short NPF (sNPF; CG13968) and the (long) Drosophila NPF (dNPF; CG10342) gene have been characterised. dNPF regulates larval feeding behaviour [52]. The dNPF gene is highly expressed in larvae attracted to food and is turned off in older larvae that exhibit food aversion, hypermobility (wandering stage) and co-operative burrowing. Overexpression of the dNPF gene in older larvae prolonged feeding whereas down regulation in young larvae induced food aversion. Also sNPF was found to control food intake and regulate body size [53]. These findings suggest a functional similarity with vertebrates as NPY also exerts a central role in the regulation of feeding behaviour [54].

In locusts, recently two shortNPF peptides were characterised (unpublished results). Until now, nothing was known about longNPF in locusts, or in any other heterometabolous insect. Blasting the dNPF precursor against the L. migratoria EST database gave one significant hit (E-value 7e-6) with the solitarious phase head cDNA library (CO854418). There are several indications that this EST in fact corresponds to the L. migratoria longNPF precursor: the translated protein has all features of a typical neuropeptide precursor, displaying a signal sequence immediately followed by the NPF sequence (33 amino acid residues) with the typical -RXRFamide C-terminal motif, an amidation signal and a dibasic cleavage site (Fig. 7). The presence of this long neuropeptide F in the locust nervous system could not be demonstrated by mass spectrometry as yet.

Figure 7
figure 7

Locusta migratoria Neuropeptide F. Amino acid sequence alignment of CG10342 (the Drosophila NPF-precursor) and the ORF of CO854418. The signal peptide is indicated in italic, possible amidation and dibasic cleavage sites are marked in bold and a (putative) NPF-peptide is marked.


The allatostatins were originally isolated from brain tissue as inhibitors of juvenile hormone biosynthesis by the corpora allata, hence their name allatostatins [55]. After the discovery of additional allatostatins in a variety of insect orders they were classified in three different peptide groups based on their sequence similarity. There is the large group of the allatostatin A-type that have the common Y/FXFGL/Iamide C-terminal sequence, the allatostatin B-type which share a -WX6Wamide C-terminal sequence and the allatostatin C-Type with a common -PISCF C-terminal sequence. Besides the allatostatic activity of these three groups, which is confined to a specific insect order (Dictyoptera), they all appear to be pleiotropic in function [56, 57]. Thus far, only in D. melanogaster the three genes that encode the three different types of allatostatins have been identified [5860]]. Blasting with the D. melanogaster allatostatin precursors resulted in a good similarity for allatostatin-C with an EST sequence from gregarious head cDNA (CO835369). Although the E-value turned out to be rather high (4e-4) and the translated sequence yielded a number of stop codons, the peptide sequence itself is very good preserved (QV RYRQ CYFNPISCF in D. melanogaster and QL RYYR CYFNPISCF in L. migratoria) (Fig. 8). Also the Lepidopteran allatostatin-C (QV RFRQ CYFNPISCF), the only other allatostatin-C identified thus far, is almost identical. This is the first report of an allatostatin-C sequence in locusts and in heterometabolous insects in general. The allatostatin B-type was discovered as a myoinhibiting peptide in L. migratoria [61], whereas several A-type allatostatins have been purified and sequenced in S. gregaria, for which the precursor has also been cloned and sequenced [62, 63]. The presence of all three types of allatostatins in a representative of the insect order Diptera (D. melanogaster) and Orthoptera (L. migratoria) strengthens the assumption that all insects (holometabolous and heterometabolous) have peptides belonging to the three different allatostatin peptide families. The mass corresponding to this novel allatostatin-C peptide displaying one disulphide bridge (1969.9 Da) was found in peptide extracts of the ventral nerve cord, indicating that this mature peptide is processed from the presumed allatostatin-C precursor.

Figure 8
figure 8

Locusta migratoria Allatostatin-C. Amino acid sequence corresponding to the ORF of CO835369. Possible dibasic cleavage sites and translation stops are indicated in bold, a possible allatostatin-C peptide is marked.


Heretofore 9 neuropeptide precursors had been identified in L. migratoria, including the pacifastin-like precursor 1–3, adipokinetic hormone precursor 1–3, insulin-related peptide precursor, neuroparsin precursor and ion transport peptide precursor. In this study this number was increased to 15, as we annotated the peptide precursor transcripts of two novel neuroparsin precursors, the tachykinin precursor, ecdysis-triggering hormone precursor, long neuropeptide F precursor and allatostatin-C precursor in the EST database published by Kang et al [13]. These annotations were based on a profound sequence homology with neuropeptide precursors that have been cloned and sequenced previously in locusts (pacifastins and neuroparsins) [19, 20] or, for the smaller peptides, were supported by mass spectrometry data from this study (tachykinins, ecdysis-triggering hormones and allatostatin-C). Our search method was validated by the detection of three pacifastin-like precursors that were previously cloned and sequenced [19, 20]. Although blasting the neuroparsin peptide precursor did not result in an exact match, it did reveal the presence of two novel neuroparsin-related precursors. These novel precursors can be considered as paralogues, which probably evolved differently after a gene duplication event in evolution. Both newly annotated neuroparsin transcripts show high similarity with previously identified neuroparsin precursors from S. gregaria.

The BLAST search program is not optimal for detecting small peptides. In fact, no specialised search tools for small peptides are available to date. To circumvent this problem, we combined several peptide isoforms and (post-translational) processing sites in a single search. This way, the hitherto unknown Locusta tachykinin precursor was identified, which not only contained two copies of a known tachykinin peptide (TK-4), but in addition a putative novel tachykinin-related peptide (TK-5). Mass spectrometrical analyses of peptide fractions from the Locusta central nervous system revealed that this peptide is actually generated as an endogenous peptide via post-translational cleavage at the dibasic cleavage sites. Although the other 3 known tachykinins were not contained within this newly predicted precursor, the absence of a stop codon leaves open the possibility that the 3 other known tachykinin sequences may be located more C-terminally within this same precursor.

Blasting neuropeptide precursors from D. melanogaster also proved to be a successful approach. This way, we were able to annotate the long neuropeptide F, ecdysis-triggering hormone, and allatostatin-C precursor orthologues in L. migratoria. This is the first time that these peptide precursors have been demonstrated in a heterometabolous insect. L. migratoria (heterometabolous) and D. melanogaster (holometabolous) have followed different evolutionary paths, meaning that these peptides have been well conserved during evolution.

Several other neuropeptide(precursors) were not found within our EST database search. It is possible that these are not present in the EST database (which does not cover the entire transcriptome) or that these neuropeptides are not encoded at all in the genome of L. migratoria. We also have to keep in mind that our analysis of the EST database is biased towards some factors. It is estimated that 3% of ESTs contain sequencing errors. Because of these errors, translated sequences could be wrong or insignificant at first sight. A misread nucleotide can cause the formation of a stop codon or a missing nucleotide can disturb the reading frame.

This study is a nice example of how bioinformatics can offer a solution for biological questions. New precursors and peptides are now found in a relatively easy manner, compared with the months and years sometimes needed for a biologist to characterise just one peptide precursor. In the future these EST data can be used to design primers for cloning and full-length sequencing of the respective genes. The characterisation of the precursor protein can lead to the characterisation of novel peptide isoforms encoded by the same precursor protein (see tachykinin). Moreover, comparison of precursor sequences enables us to obtain more crucial information about the evolutionary and/or interphyletic relationship than does comparison of peptide sequences alone. Comparable studies have been conducted for holometabolous insects (D. melanogaster and Anopheles gambiae), however, here genomic data were mined to annotate neuropeptide precursors instead of EST data [6467]]. Our study is also unique in that respect that for three neuropeptide precursors our data were consolidated by mass spectrometry.


Database searches

All presently known neuropeptide (precursor) sequences from L. migratoria,S. gregaria and D. melanogaster were used in a homology search with the recently generated EST databases of L. migratoria, deposited in the GenBank database under the accession nos. CO819675–CO832059 and CO832067–C0865130[13]. For this purpose, tblastn, a program of the BLAST family, was used. The database was set to EST_others and was limited to the Locusta EST databases created by Kang et al. By entering the accession numbers under entrez query. For all other parameters default values were used. Signal peptide cleavage sites were predicted by using SignalP 3.0 [68]. Putative neuropeptide precursors were aligned by using CLUSTALW [69].

Animals and peptide extraction

L. migratoria was raised under laboratory conditions [70], under a 13 h light, 11 h dark photoperiod at a constant temperature of 32°C and relative humidity between 40 and 60%. They were kept under gregarious conditions in cages of 38 cm × 38 cm × 38 cm that contained at least 100 locusts and fed daily with fresh grass and oatmeal ad libitum. Both male and female locusts were decapitated without previous anaesthesia. Neural tissues (brain parts, suboesophageal ganglia, circumoesophageal connectives, corpora cardiaca, corpora allata, frontal ganglia, hypocerebral ganglia, thoracic ganglia, abdominal ganglia, abdominal perisympathetic organs) from adult locusts (5 equivalents) and tracheae from fifth instar larvae (approximately 15 equivalents) were dissected in Ringer solution and transferred to a 0.5-ml Eppendorf tube on ice, containing 50 μl methanol/water/acetic acid (90/9/1) for neural tissues and 200 μl of the same solution for tracheae (Ringer solution, 9.82 g/l NaCl, 0.32 g/l CaCl2, 0.48 g/l KCl, 0.73 g/l MgCl2, 0.25 g/l NaHCO3, 0.19 g/l NaH2PO4, pH 6.5). The samples were sonicated (on ice) three times for 1 min and the remaining solid fraction was centrifuged down (10 min at 9500 g).

Prior to the MS analysis the samples (neural tissues) were concentrated and desalted using ZipTipC18 pipette tips (Millipore, 15 μm). For this purpose, the supernatants were dried in a vacuum centrifuge and reconstituted in 50 μl of 0.1% aqueous TFA. The ZipTipC18 was pre-equilibrated for sample binding using 0.1% aqueous TFA containing 50% CH3CN, followed by 0.1% aqueous TFA. The sample was loaded onto the ZipTipC18 and, after flushing with 0.1% aqueous TFA to remove salts and other impurities, eluted in 4 μl 0.1% aqueous formic acid (FA) containing 70% CH3CN.

For nanoLC-Q-TOF MS(MS) (tracheae), the supernatant was filtered through a spindown filter (Ultrafree-MC, 0.22 μm, Millipore). The filtrate was dried in a vacuum centrifuge (Speedvac Concentrator SVC200H, Savant) and subsequently redissolved in 0.1% aqueous FA.

Mass spectrometry

Matrix-assisted laser desorption/ionisation (MALDI) time-of-flight (TOF) mass spectrometry was performed on a Reflex IV (Bruker Daltonic GmbH), equipped with a N2 laser and pulsed ion extraction accessory. One μl of the sample solution was transferred to a ground steel target plate, mixed with 0.5 μl of a saturated solution of α-cyano-4-hydroxycinnamic acid in acetone and air-dried. The instrument was calibrated using a standard peptide mixture (Bruker Daltonic GmbH). Spectra were recorded in the reflectron mode within a mass range from m/z 500 to m/z 3000.

Nanoflow electrospray ionisation (ESI) double quadrupole (Qq) orthogonal acceleration (oa) time-of-flight (TOF) mass spectrometry was performed on a Q-tof instrument (Micromass, UK). One microliter of the sample solution was loaded into a gold-coated capillary (Long NanoES spray capillaries for Micromass Q-tof, Protana Engineering A/S). The sample was sprayed at a typical flow rate of 30 nl/min giving extended analysis time in which MS spectra as well as several MSMS spectra were acquired. During MSMS or tandem mass spectrometry, fragment ions are generated from a selected precursor ion by collision-induced dissociation.

Nanoflow liquid chromatography (nanoLC) Q-TOF MS(MS) was conducted using a Famos autosampler, a Switchos column-switching device and an Ultimate HPLC pump (all LC Packings) coupled to a Q-TOF mass spectrometer (Micromass) as described earlier [8]. Ten μl of sample (corresponding to approximately 15 tracheae) was loaded. The separation was done on a NanoEase Atlantis™ Dc18 column (3 μm, 100 μm × 100 mm, Waters) using a linear gradient from 95% solvent A, 5% solvent B to 40% solvent A, 60% solvent B in 50 minutes, with a constant flow rate of 150 nl/min (solvent A: 0.1% FA in water; solvent B: 0.1% FA in CH3CN). The outlet of the HPLC was connected to the electrospray interface of the Q-TOF mass spectrometer. The column eluent was directed through a stainless steel nano-bore emitter (Proxeon). Tandem mass spectrometry was conduced in an automated fashion. Doubly and triply charged ions above a certain threshold were selected for fragmentation by collision-induced dissociation. The applied collision energy was chosen automatically depending on the number of charges and the mass to charge ratio of the selected ion.



Expressed Sequence Tag

D. melanogaster :

Drosophila melanogaster

L. migratoria :

Locusta migratoria


Basic Local Alignment Search Tool

S. gregaria :

Schistocerca gregaria


Locusta migratoria Pacifastin-related Precursor


Locusta migratoria Pacifastin-related Inhibitor


Insulin-like Growth Factor Binding Protein


Schistocerca gregaria


Neuroparsin Precursor




Locusta migratoria


Matrix-Assisted Laser Desorption/Ionisation


Electrospray Ionisation-Quadrupole


Time of Flight


Mass Spectrometry


Tandem Mass Spectrometry


Ecdysis-Triggering Hormone


Eclosion Hormone


Crustacean Cardioactive Peptide


Manduca sexta


Pre-Ecdysis-Triggering Hormone


Bombyx mori


Drosophila melanogaster


Anopheles gambiae


Liquid Chromatography


Neuropeptide Y (C-terminal tyrosine residue)


Neuropeptide F (C-terminal phenylalanine residue)


short Neuropeptide F


Drosophila Neuropeptide F


trifluoroacetic acid


formic acid


Open Reading Frame


  1. Rogers SM, Matheson T, Sasaki K, Kendrick K, Simpson SJ, Burrows M: Substantial changes in central nervous system neurotransmitters and neuromodulators accompany phase change in the locust. J Exp Biol. 2004, 207: 3603-3617. 10.1242/jeb.01183.

    Article  PubMed  CAS  Google Scholar 

  2. Ferenz HJ, Seidelmann K: Pheromones in relation to aggregation and reproduction in desert locusts. Physiol Entomol. 2003, 28: 1-18. 10.1046/j.1365-3032.2003.00318.x.

    Article  Google Scholar 

  3. Breuer M, Hoste B, De Loof A: The endocrine control of phase transition: some new aspects. Physiol Entomol. 2003, 28: 3-10. 10.1046/j.1365-3032.2003.00313.x.

    Article  CAS  Google Scholar 

  4. Clynen E, Stubbe D, De Loof A, Schoofs L: Peptide differential display: a novel approach for phase transition in locusts. Comp Biochem Physiol B. 2002, 132: 107-115. 10.1016/S1096-4959(01)00538-3.

    Article  PubMed  Google Scholar 

  5. Veenstra JA: Mono-and dibasic proteolytic cleavage sites in insect neuroendocrine peptide precursors. Arch Insect Biochem Physiol. 2000, 43: 49-63. 10.1002/(SICI)1520-6327(200002)43:2<49::AID-ARCH1>3.0.CO;2-M.

    Article  PubMed  CAS  Google Scholar 

  6. Zhang JV, Ren PG, Avsian-Kretchmer O, Luo CW, Rauch R, Klein C, Hsueh AJ: Obestatin, a peptide encoded by the ghrelin gene, opposes ghrelin's effects on food intake. Science. 2005, 310: 996-999. 10.1126/science.1117255.

    Article  PubMed  CAS  Google Scholar 

  7. Schoofs L, Veelaert D, Vanden Broeck J, De Loof A: Peptides in the locusts, Locusta migratoria and Schistocerca gregaria. Peptides. 1997, 18: 145-156. 10.1016/S0196-9781(96)00236-7.

    Article  PubMed  CAS  Google Scholar 

  8. Baggerman G, Cerstiaens A, De Loof A, Schoofs L: Peptidomics of the larval Drosophila melanogaster central nervous system. J Biol Chem. 2002, 277: 40368-40374. 10.1074/jbc.M206257200.

    Article  PubMed  CAS  Google Scholar 

  9. Husson SJ, Clynen E, Baggerman G, De Loof A, Schoofs L: Discovering neuropeptides in the nematode Caenorhabditis elegans by two dimensional nanoscale liquid chromatography and mass spectrometry. Biochem Biophys Res Commun. 2005, 335: 76-86. 10.1016/j.bbrc.2005.07.044.

    Article  PubMed  CAS  Google Scholar 

  10. Clynen E, Baggerman G, Huybrechts J, Vanden Bosch L, De Loof A, Schoofs L: Peptidomics of the locust corpora allata: identification of novel pyrokinins (-FXPRLamides). Peptides. 2003, 24: 1493-1500. 10.1016/j.peptides.2003.10.006.

    Article  PubMed  CAS  Google Scholar 

  11. Clynen E, Huybrechts J, De Loof A, Schoofs L: Mass spectrometric analysis of the perisympathetic organs in locusts: identification of novel periviscerokinins. Biochem Biophys Res Commun. 2003, 300: 422-428. 10.1016/S0006-291X(02)02869-3.

    Article  PubMed  CAS  Google Scholar 

  12. Animal genome size database. []

  13. Kang L, Chen X, Zhou Y, Liu B, Zheng W, Li R, Wang J, Yu J: The analysis of large-scale gene expression correlated to the phase changes of the migratory locust. Proc Natl Acad Sci USA. 2004, 101: 17611-17615. 10.1073/pnas.0407753101.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ma Z, Yu J, Kang L: LocustDB: a relational database for the transcriptome and biology of the migratory locust (Locusta migratoria). BMC Genomics. 2006, 7: 11-10.1186/1471-2164-7-11.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Liang Z, Sottrup-Jensen L, Aspan A, Hall M, Soderhall K: Pacifastin, a novel 155-kDa heterodimeric proteinase inhibitor containing a unique transferrin chain. Proc Natl Acad Sci USA. 1997, 94: 6682-6687. 10.1073/pnas.94.13.6682.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  16. Simonet G, Claeys I, Franssens V, De Loof A, Vanden Broeck J: Genomics, evolution and biological functions of the pacifastin peptide family: a conserved serine protease inhibitor family in arthropods. Peptides. 2003, 24: 1633-1644. 10.1016/j.peptides.2003.07.014.

    Article  PubMed  CAS  Google Scholar 

  17. Simonet G, Breugelmans B, Proost P, Claeys I, Van Damme J, De Loof A, Vanden Broeck J: Characterization of two novel pacifastin-like peptide precursor isoforms in the desert locust (Schistocerca gregaria): cDNA cloning, functional analysis and real-time RT-PCR gene expression studies. Biochem J. 2005, 388: 281-289. 10.1042/BJ20041414.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  18. Hamdaoui A, Wataleb S, Devreese B, Chiou SJ, Vanden Broeck J, Van Beeumen J, De Loof A, Schoofs L: Purification and characterization of a group of five novel peptide serine protease inhibitors from theovaries of the desert locust, Schistocerca gregaria. FEBS Lett. 1998, 422: 74-78. 10.1016/S0014-5793(97)01585-8.

    Article  PubMed  CAS  Google Scholar 

  19. Kromer E, Nakakura N, Lagueux M: Cloning of a Locusta cDNA encoding a precursor peptide for two structurally related proteinase inhibitors. Insect Biochem Mol Biol. 1994, 24: 329-331. 10.1016/0965-1748(94)90013-2.

    Article  PubMed  CAS  Google Scholar 

  20. Simonet G, Claeys I, Vanderperren H, November T, De Loof A, Vanden Broeck J: cDNA cloning of two different serine protease inhibitor precursors in the migratory locust, Locusta migratoria. Insect Mol Biol. 2002, 11: 249-256. 10.1046/j.1365-2583.2002.00331.x.

    Article  PubMed  CAS  Google Scholar 

  21. Nakakura N, Hietter H, Van Dorsselaer A, Luu B: Isolation and structural determination of three peptides from the insect Locusta migratoria. Identification of a deoxyhexose-linked peptide. Eur J Biochem. 1992, 204: 147-153. 10.1111/j.1432-1033.1992.tb16617.x.

    Article  PubMed  CAS  Google Scholar 

  22. Clynen E, Baggerman G, Veelaert D, Cerstiaens A, Van der Horst D, Harthoorn L, Derua R, Waelkens E, De Loof A, Schoofs L: Peptidomics of the pars intercerebralis-corpus cardiacum complex of the migratory locust, Locusta migratoria. Eur J Biochem. 2001, 268: 1929-1939. 10.1046/j.1432-1327.2001.02067.x.

    Article  PubMed  CAS  Google Scholar 

  23. Kellenberger C, Boudier C, Bermudez I, Bieth JG, Luu B, Hietter H: Serine protease inhibition by insect peptides containing a cysteine knot and a triple-stranded β-sheet. J Biol Chem. 1995, 270: 25514-25519. 10.1074/jbc.270.43.25514.

    Article  PubMed  CAS  Google Scholar 

  24. Girardie J, Boureme D, Girardie A: Production sites of the three neurosecretory proteins characterised in the corpora cardiaca of the migratory locust. Insect Biochem. 1987, 17: 29-36. 10.1016/0020-1790(87)90140-5.

    Article  CAS  Google Scholar 

  25. Girardie J, Boureme D, Couillaud F, Tamarelle M, Girardie A: Anti-juvenile effect of neuroparsin-A, a neuroprotein isolated from locust corpora cardiaca. Insect Biochem. 1987, 17: 977-983. 10.1016/0020-1790(87)90106-5.

    Article  CAS  Google Scholar 

  26. Fournier B, Girardie J: A new function for the locust neuroparsins – stimulation of water reabsorption. J Insect Physiol. 1988, 34: 309-313. 10.1016/0022-1910(88)90141-2.

    Article  CAS  Google Scholar 

  27. Moreau R, Gourdoux L, Girardie J: Neuroparsin – a new energetic neurohormone in the African locust. Arch Insect Biochem Physiol. 1988, 8: 135-145. 10.1002/arch.940080207.

    Article  CAS  Google Scholar 

  28. Vanhems E, Delbos M, Girardie J: Insulin and neuroparsin promote neurite outgrowth in cultured locust CNS. Eur J Neurosci. 1990, 2: 776-782. 10.1111/j.1460-9568.1990.tb00468.x.

    Article  PubMed  Google Scholar 

  29. Lagueux M, Kromer E, Girardie J: Cloning of a Locusta cDNA encoding neuroparsin A. Insect Biochem Mol Biol. 1992, 22: 511-516. 10.1016/0965-1748(92)90027-C.

    Article  CAS  Google Scholar 

  30. Janssen T, Claeys I, Simonet G, De Loof A, Girardie J, Vanden Broeck J: cDNA cloning and transcript distribution of two different neuroparsin precursors in the desert locust,Schistocerca gregaria. Insect Mol Biol. 2001, 10: 183-189. 10.1046/j.1365-2583.2001.00257.x.

    Article  PubMed  CAS  Google Scholar 

  31. Claeys I, Simonet G, Van Loy T, De Loof A, Vanden Broeck J: cDNA cloning and transcript distribution of two novel members of the neuroparsin family in the desert locust, Schistocerca gregaria. Insect Mol Biol. 2003, 12: 473-481. 10.1046/j.1365-2583.2003.00431.x.

    Article  PubMed  CAS  Google Scholar 

  32. Claeys I, Simonet G, Breugelmans B, Van Soest S, Franssens V, Sas F, De Loof A, Vanden Broeck J: Quantitative real-time RT-PCR analysis in desert locusts reveals phase dependent differences in neuroparsin transcript levels. Insect Mol Biol. 2005, 14: 415-422. 10.1111/j.1365-2583.2005.00572.x.

    Article  PubMed  CAS  Google Scholar 

  33. Claeys I, Breugelmans B, Simonet G, Van Soest S, Sas F, De Loof A, Vanden Broeck J: Neuroparsin transcripts as molecular markers in the process of desert locust (Schistocerca gregaria) phase transition. Biochem Biophys Res Commun. 2006, 341: 599-606. 10.1016/j.bbrc.2006.01.011.

    Article  PubMed  CAS  Google Scholar 

  34. Schoofs L, Holman GM, Hayes TK, Nachman RJ, De Loof A: Locustatachykinin-I and locustatachykinin-II, 2 novel insect neuropeptides with homology to peptides of the vertebrate tachykinin family. FEBS Lett. 1990, 261: 397-401. 10.1016/0014-5793(90)80601-E.

    Article  PubMed  CAS  Google Scholar 

  35. Schoofs L, Holman GM, Hayes TK, Kochansky JP, Nachman RJ, De Loof A: Locustatachykinin-III and locustatachykinin-IV: 2 additional insect neuropeptides with homology to peptides of the vertebrate tachykinin family. Regul Pept. 1990, 31: 199-212. 10.1016/0167-0115(90)90006-I.

    Article  PubMed  CAS  Google Scholar 

  36. Satake H, Kawada T, Nomoto K, Minakata H: Insight into tachykinin-related peptides, their receptors and invertebratetachykinins: a review. Zool Sci. 2003, 20: 533-549. 10.2108/zsj.20.533.

    Article  PubMed  CAS  Google Scholar 

  37. Baggerman G, Boonen K, Verleyen P, De Loof A, Schoofs L: Peptidomic analysis of the larval Drosophila melanogaster central nervous system by two-dimensional capillary liquid chromatography quadrupole time-of-flight mass spectrometry. J Mass Spectrom. 2005, 40: 250-260. 10.1002/jms.744.

    Article  PubMed  CAS  Google Scholar 

  38. Predel R, Neupert S, Roth S, Derst C, Nässel DR: Tachykinin-related peptide precursors in two cockroach species. Molecular cloning and peptide expression in brain neurons and intestine. FEBS J. 2005, 272: 3365-3375. 10.1111/j.1742-4658.2005.04752.x.

    Article  PubMed  CAS  Google Scholar 

  39. Pabla N, Lange AB: The distribution and myotropic activity of locustatachykinin-like peptides in locust midgut. Peptides. 1999, 20: 1159-1167. 10.1016/S0196-9781(99)00119-9.

    Article  PubMed  CAS  Google Scholar 

  40. Veelaert D, Baggerman G, Derua R, Waelkens E, Meeusen T, Van deWater G, De Loof A, Schoofs L: Identification of a new tachykinin from the midgut of the desert locust, Schistocerca gregaria, by ESI-Qq-oa-TOF mass spectrometry. Biochem Biophys Res Commun. 1999, 266: 237-242. 10.1006/bbrc.1999.1808.

    Article  PubMed  CAS  Google Scholar 

  41. Zitnan D, Kingan TG, Hermesman JL, Adams ME: Identification of ecdysis-triggering hormone from an epitracheal endocrine system. Science. 1996, 271: 88-91.

    Article  PubMed  CAS  Google Scholar 

  42. Adams ME, Zitnan D: Identification of ecdysis-triggeringhormone in the silkworm Bombyx mori. Biochem Biophys Res Commun. 1997, 230: 188-191. 10.1006/bbrc.1996.5915.

    Article  PubMed  CAS  Google Scholar 

  43. Park Y, Zitnan D, Gill SJ, Adams ME: Molecular cloning and biological activity of ecdysis-triggering hormones in Drosophila melanogaster. FEBS Lett. 1999, 463: 133-138. 10.1016/S0014-5793(99)01622-1.

    Article  PubMed  CAS  Google Scholar 

  44. Zitnan D, Hollar L, Spalovska I, Takac P, Zitnanova I, Gill SS, Adams ME: Molecular cloning and function of ecdysis-triggeringhormones in the silkworm Bombyx mori. J Exp Biol. 2002, 205: 3459-3473.

    PubMed  CAS  Google Scholar 

  45. Zitnan D, Zitnanova I, Spalovska I, Takac P, Park Y, Adams ME: Conservation of ecdysis-triggering hormone signalling in insects. J Exp Biol. 2003, 206: 1275-1289. 10.1242/jeb.00261.

    Article  PubMed  CAS  Google Scholar 

  46. Minth CD, Bloom SR, Polak JM, Dixon JE: Cloning, characterization, and DNA sequence of a human cDNA encoding neuropeptide tyrosine. Proc Natl Acad Sci USA. 1984, 81: 4577-4581. 10.1073/pnas.81.14.4577.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  47. Curry WJ, Shaw C, Johnston CF, Thim L, Buchanan KD: Neuropeptide F: primary structure from the tubellarian, Artioposthia triangulata. Comp Biochem Physiol C. 1992, 101: 269-274. 10.1016/0742-8413(92)90272-9.

    Article  PubMed  CAS  Google Scholar 

  48. Maule AG, Shaw C, Halton DW, Brennan GP, Johnston CF, Moore S: Neuropeptide F (Moniezia expansa): localization and characterization using specific antisera. Parasitology. 1992, 105: 505-512.

    Article  PubMed  CAS  Google Scholar 

  49. Leung PS, Shaw C, Maule AG, Thim L, Johnston CF, Irvine GB: The primary structure of neuropeptide F (NPF) from the garden snail, Helix aspersa. Regul Pept. 1992, 41: 71-81. 10.1016/0167-0115(92)90515-V.

    Article  PubMed  CAS  Google Scholar 

  50. Rajpara SM, Garcia PD, Roberts R, Eliassen JC, Owens DF, Maltby D, Myers RM, Mayeri E: Identification and molecular cloning of a neuropeptide Y homolog that produces prolonged inhibition in Aplysia neurons. Neuron. 1992, 9: 505-513. 10.1016/0896-6273(92)90188-J.

    Article  PubMed  CAS  Google Scholar 

  51. Brown MR, Crim JW, Arata RC, Cai HN, Chun C, Shen P: Identification of a Drosophila brain-gut peptide related to the neuropeptide Y family. Peptides. 1999, 20: 1035-1042. 10.1016/S0196-9781(99)00097-2.

    Article  PubMed  CAS  Google Scholar 

  52. Wu Q, Wen T, Lee G, Park JH, Cai HN, Shen P: Developmental control of foraging and social behavior by the Drosophila neuropeptide Y-like system. Neuron. 2003, 39: 147-161. 10.1016/S0896-6273(03)00396-9.

    Article  PubMed  CAS  Google Scholar 

  53. Lee KS, You KH, Choo JK, Han YM, Yu K: Drosophila short neuropeptide F regulates food intake and body size. J Biol Chem. 2004, 279: 50781-50789. 10.1074/jbc.M407842200.

    Article  PubMed  CAS  Google Scholar 

  54. Beck B: KO's and organisation of peptidergic feeding behavior mechanisms. Neurosci Biobehav Rev. 2001, 25: 143-158. 10.1016/S0149-7634(01)00003-3.

    Article  PubMed  CAS  Google Scholar 

  55. Woodhead AP, Stay B, Seidel SL, Khan MA, Tobe SS: Primary structure of four allatostatins: neuropeptide inhibitors of juvenile hormone synthesis. Proc Natl Acad Sci USA. 1989, 86: 5997-6001. 10.1073/pnas.86.15.5997.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  56. Bendena WG, Donly BC, Tobe SS: Allatostatins: a growing family of neuropeptides with structural and functional diversity. Ann NY Acad Sci. 1999, 897: 311-329. 10.1111/j.1749-6632.1999.tb07902.x.

    Article  PubMed  CAS  Google Scholar 

  57. Gäde G, Hoffmann K-H: Neuropeptides regulating development and reproduction in insects. Physiol Entomol. 2005, 30: 103-121. 10.1111/j.1365-3032.2005.00442.x.

    Article  Google Scholar 

  58. Lenz C, Williamson M, Grimmelikhuijzen CJP: Molecular cloning and genomic organization of an allatostatin preprohormone from Drosophila melanogaster. Biochem Res Commun. 2000, 273: 1126-1131. 10.1006/bbrc.2000.3062.

    Article  CAS  Google Scholar 

  59. Williamson M, Lenz C, Winther AME, Nässel DR, Grimmelikhuijzen CJP: Molecular cloning, genomic organization, and expression of aB-type (cricket-type) allatostatin preprohormone from Drosophila melanogaster. Biochem Biophys Res Commun. 2001, 281: 544-550. 10.1006/bbrc.2001.4402.

    Article  PubMed  CAS  Google Scholar 

  60. Williamson M, Lenz C, Winther AME, Nässel DR, Grimmelikhuijzen CJP: Molecular cloning, genomic organization, and expression of a C-type (Manduca sexta -type) allatostatin preprohormone from Drosophila melanogaster. Biochem Biophys Res Commun. 2001, 282: 124-130. 10.1006/bbrc.2001.4565.

    Article  PubMed  CAS  Google Scholar 

  61. Schoofs L, Holman GM, Hayes TK, Nachman RJ, De Loof A: Isolation, identification and synthesis of locustamyoinhibiting peptide (Lom-MIP), a novel biologically active neuropeptide from Locusta migratoria. Regul Pept. 1991, 36: 111-119. 10.1016/0167-0115(91)90199-Q.

    Article  PubMed  CAS  Google Scholar 

  62. Veelaert D, Devreese B, Schoofs L, Van Beeumen J, Vanden BroeckJ, Tobe SS, De Loof A: Isolation and characterization of eight myoinhibiting peptides from the midgut of the desert locust, Schistocerca gregaria : new members of the cockroach allatostatin family. Mol Cell Endocrinol. 1996, 122: 183-190. 10.1016/0303-7207(96)03889-0.

    Article  PubMed  CAS  Google Scholar 

  63. Vanden broeck J, Veelaert D, Bendena WGH, Tobe SS, De Loof A: Molecular cloning of the precursor cDNA for schistostatins, locust allatostatin-like peptides with myoinhibiting properties. Mol Cell Endocrinol. 1996, 122: 191-198. 10.1016/0303-7207(96)03890-7.

    Article  PubMed  CAS  Google Scholar 

  64. Vanden Broeck J: Neuropeptides and their precursors in the fruitfly, Drosophila melanogaster. Peptides. 2001, 22: 241-254. 10.1016/S0196-9781(00)00376-4.

    Article  PubMed  CAS  Google Scholar 

  65. Hewes RS, Taghert PH: Neuropeptides and neuropeptide receptors in the Drosophila melanogaster genome. Genome Res. 2001, 11: 1126-1142. 10.1101/gr.169901.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  66. Liu F, Baggerman G, D'Hertog W, Verleyen P, Schoofs L, Wets G: In silico identification of new secretory peptide genes in Drosophila melanogaster. Mol Cell Proteomics. 2006, 5: 510-522. 10.1074/mcp.M400114-MCP200.

    Article  PubMed  CAS  Google Scholar 

  67. Riehle MA, Garczynski SF, Crim JW, Hill CA, Brown MR: Neuropeptides and peptide hormones in Anopheles gambiae. Science. 2002, 298: 172-175. 10.1126/science.1076827.

    Article  PubMed  CAS  Google Scholar 

  68. Signal P. []

  69. ClustalW. []

  70. Ashby GJ: Locusts. UFAW Handbook on the care and management of laboratory animals. Edited by: Poole T. 1972, Edinburgh: Churchill Livingstone, 582-587.

    Google Scholar 

Download references


We are grateful to J. Dombrecht and K. Theys for computing assistance. This research was supported by FWO (Fund for Scientific Research-Flanders, Belgium) grants G.0.146.03 and G.0444.05 and by the K.U. Leuven Research foundation (GOA/2005/09). J. Huybrechts, E. Clynen and P. Verleyen are postdoctoral fellows of the FWO.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Jurgen Huybrechts.

Additional information

Authors' contributions

EC, JH and PV performed the in silico analyses and verified manually all significant BLAST hits prior to annotation. JH obtained the tissues for the MS(MS) surveys performed by EC. EC and JH drafted the manuscript. LS was involved in design and coordination of the study and revision of the manuscript. LS and ADL obtained funding for the study. All authors read and approved the final manuscript.

Elke Clynen, Jurgen Huybrechts contributed equally to this work.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Clynen, E., Huybrechts, J., Verleyen, P. et al. Annotation of novel neuropeptide precursors in the migratory locust based on transcript screening of a public EST database and mass spectrometry. BMC Genomics 7, 201 (2006).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: