Skip to main content

A scaffold-level genome assembly of a minute pirate bug, Orius laevigatus (Hemiptera: Anthocoridae), and a comparative analysis of insecticide resistance-related gene families with hemipteran crop pests

Abstract

Background

Orius laevigatus, a minute pirate bug, is a highly effective beneficial predator of crop pests including aphids, spider mites and thrips in integrated pest management (IPM) programmes. No genomic information is currently available for O. laevigatus, as is the case for the majority of beneficial predators which feed on crop pests. In contrast, genomic information for crop pests is far more readily available. The lack of publicly available genomes for beneficial predators to date has limited our ability to perform comparative analyses of genes encoding potential insecticide resistance mechanisms between crop pests and their predators. These mechanisms include several gene/protein families including cytochrome P450s (P450s), ATP binding cassette transporters (ABCs), glutathione S-transferases (GSTs), UDP-glucosyltransferases (UGTs) and carboxyl/cholinesterases (CCEs).

Methods and findings

In this study, a high-quality scaffold level de novo genome assembly for O. laevigatus has been generated using a hybrid approach with PacBio long-read and Illumina short-read data. The final assembly achieved a scaffold N50 of 125,649 bp and a total genome size of 150.98 Mb. The genome assembly achieved a level of completeness of 93.6% using a set of 1658 core insect genes present as full-length genes. Genome annotation identified 15,102 protein-coding genes - 87% of which were assigned a putative function.

Comparative analyses revealed gene expansions of sigma class GSTs and CYP3 P450s. Conversely the UGT gene family showed limited expansion. Differences were seen in the distributions of resistance-associated gene families at the subfamily level between O. laevigatus and some of its targeted crop pests. A target site mutation in ryanodine receptors (I4790M, PxRyR) which has strong links to diamide resistance in crop pests and had previously only been identified in lepidopteran species was found to also be present in hemipteran species, including O. laevigatus.

Conclusion and significance

This assembly is the first published genome for the Anthocoridae family and will serve as a useful resource for further research into target-site selectivity issues and potential resistance mechanisms in beneficial predators. Furthermore, the expansion of gene families often linked to insecticide resistance may be an indicator of the capacity of this predator to detoxify selective insecticides. These findings could be exploited by targeted pesticide screens and functional studies to increase effectiveness of IPM strategies, which aim to increase crop yields by sustainably, environmentally-friendly and effectively control pests without impacting beneficial predator populations.

Peer Review reports

Background

Loss of crops to insect pests can account for ~ 10% of potential yield, as a result of both direct feeding damage and the transfer of viral plant diseases [1]. Thus, to maximise crop yields and sustain food production for a growing world population, pests need to be controlled. At present this control relies mainly on the use of synthetic pesticides, many of which are non-selective and are therefore toxic to both their target pest species and to beneficial predators and parasitoids. As a result there may be a reduction in the predator populations to a level where they are no longer able to contribute natural pest control. This, along with the development of insecticide resistance in pests, can lead to pest populations surging, sometimes to even higher levels than pre-pesticide application [2,3,4]. Beneficial predators, such as those in the genus Orius, have proven to be especially effective in the biological control of crop pests [5]. As generalist predators, Orius species target a wide variety of pest species including aphids, beet armyworm, leafhoppers, mites, thrips and whiteflies, many of which are the world’s most damaging crop pests [6, 7]. Some Orius species are commercially available as biological control agents and are widely used for this purpose as part of integrated pest management (IPM) strategies, especially in covered crops [8,9,10].

Whole genome sequences of insects are helping us to understand many aspects of their biology and behaviour, and this can be applied to potential insecticide resistance mechanisms in pest insects and their natural enemies. However, only a few genomes of beneficial predator species have been published to date, including a phytoseiid mite, Galendromus occidentalis [11]; three parasitoid wasps, Nasonia giraulti, Nasonia longicornis and Nasonia vitripennis [12] and two lady beetles, Harmonia axyridis and Coccinella septempunctata [13]. To date there are no published genomes for species of the Hemiptera: Anthocoridae (i.e. minute pirate bug) family of predators. In contrast, a growing number of genomes of crop pests are available [14,15,16,17,18,19,20,21,22,23,24,25,26]. This larger number of pest genomes, relative to beneficial predator genomes could be in part because up until recently, the genomes of the pests themselves have appeared more useful in terms of developing targeted pesticides and investigating mechanisms of pesticide resistance. However, agriculture is now moving increasingly away from pesticide use – particularly with the Directive on Sustainable Use of Pesticides 2009/128/EC [27] - and towards IPM strategies, which includes the use of beneficial predators. Future studies of pesticide resistance mechanisms should therefore include beneficial predator genomes alongside pest genomes in order to help select targeted pesticides which do not harm beneficials and subsequently improve the efficacy of IPM strategies [28,29,30,31,32].

The aim of the work reported here was to develop a high-quality genome assembly for O. laevigatus, to serve as a resource for research into this species as well as the wider Anthocoridae family, which consists of 400–600 mostly predaceous minute pirate bug species - a potentially valuable source of biological control agents [33]. The O. laevigatus genome was then used for comparative analyses between beneficial predators and crop pests, focusing on genes encoding potential insecticide resistance mechanisms.

There are two main types of insecticide resistance mechanisms: increased expression of genes encoding protein families involved in metabolic resistance and point mutations in genes encoding insecticide target proteins [34]. Gene families involved in insecticide resistance in pest species are known to include cytochrome P450 monooxygenases (P450s), ATP binding cassette transporters (ABCs), glutathione S-transferases (GSTs), UDP-glucosyltransferases (UGTs) and carboxyl/cholinesterases (CCEs) [35,36,37,38,39,40]. Comparisons of the genes/proteins which may be involved in insecticide resistance in crop pests with the corresponding genes in beneficial insects, could aid the development of insecticides which target crop pests but have limited impact on beneficial predator populations. This could prove key to developing successful IPM strategies which exploit differences in the ability of predators and crop pests to tolerate pesticides. Improving the availability of beneficial predator genomes could also help the selection of beneficial predators with genes/mutations for inherent insecticide resistance before being released in the field for biological control [41].

The results presented here provide a comprehensive foundation for further study of potential insecticide resistance mechanisms in beneficial predators and how they compare to crop pests.

Methods

Sample preparation and sequencing

Orius laevigatus (commonly known as a minute pirate bug) were obtained from ‘Bioline AgroSciences’. CO2 was used for anaesthesia to allow the insects to be sorted from the substrate. Both adults and nymphs were then flash frozen with liquid N2 and stored at − 80 °C. The whole process was done within 48 h of arrival.

~ 1000 individuals were pooled for genomic DNA/RNA extractions, which were carried out in-house at Rothamsted Research. The commercial DNAzol reagent was used for the DNA extractions, and the Bioline Isolate II RNA Mini Kit was used for the RNA extractions. The DNA and RNA were sent for library preparation and sequencing by Genewiz (New Jersey, US).

The genome assembly was developed using a hybrid assembly strategy with both Illumina short reads and Pacific Biosciences (PacBio) long reads.

Short reads were sequenced using 2 mg of DNA and a library with an insert size of 200 bp. Sequencing was done using Illumina HiSeq 4000 with a 2x150bp paired-end configuration. 413,143,574 reads were obtained with a total length of 123 Gb (820x). Raw reads are available under SRA accession: ERR6994870. K-mer counting of the raw Illumina DNA data was done using Jellyfish 2.2.6 [42]. Canonical (−C) 21-mers (−m 21) were counted and a histogram of k-mer frequencies outputted. GenomeScope 2.0 [43] was used to process this histogram with ‘ploidy’ set at 2 and ‘maximum k-mer coverage cut-off’ set at 10,000.

To obtain long read PacBio data, 3.7 mg of DNA first underwent blue pippin size selection (> = 10 kb) to remove low molecular weight DNA. < 500 ng of DNA remained after size selection, and so a low input protocol was used for library construction with an insert size of 20 kb. Sequencing was done using the PacBio Sequel I platform and 537,651 reads were obtained with a total length of 6Gb (44x) and an N50 of 11,287 bp. Raw reads are available under SRA accession: ERR6941611.

Transcriptome sequencing used 10 mg of RNA and a library construction with an insert size of 150 bp and PolyA selection for rRNA removal. Sequencing was done using Illumina HiSeq 4000 with a 2x150bp paired-end configuration. 413,137,378 reads were obtained. Raw reads are available under SRA accession: ERR7012629.

FastQC v.0.11.8 [44]. was used for quality checks on the raw Illumina HiSeq DNA and RNA sequence data. Adapters were trimmed, low-quality bases (below a score of 3) were removed from the start and end of reads and any reads with a length less than 36 bases were also removed. Trimmomatic v.0.38 [45]. was used for these trimming steps. Quality trimming of reads using Trimmomatic resulted in a 0.2% loss of reads for whole genome sequencing and a 5% loss of reads for transcriptome sequencing (Table 1).

Table 1 Number of paired-end Illumina HiSeq DNA sequences present before and after trimming

Genome quality assessment

Basic metrics from the genome assembly were calculated using a script developed for the ‘Assemblathon’ [46]. These metrics include scaffold/contig N50, longest and shortest scaffold length, number of scaffolds exceeding a range of lengths and number of gaps/N’s in the assembly.

The completeness of the genome assembly and annotation for Orius laevigatus was assessed using the Benchmarking Universal Single-Copy Orthologs (BUSCO) [47] of the insect gene set (insecta odb9). ‘Genome’ mode was used to assess the assembly, and ‘protein’ mode to assess the annotation. ‘Fly’ was used as the training species for Augustus gene prediction. BUSCO assessments were then run with default parameters.

De novo genome assembly

The overall assembly pipeline is shown in Fig. 1. The raw PacBio long reads were assembled into contigs with the Flye v2.5. de novo assembler [48, 49]. Rascaf was then used to improve the Flye genome assembly with RNA-seq data [50]. Contigs were also produced with the raw PacBio long reads using Canu v1.8 [51] as well as with FALCON v1.3.0 and FALCON-Unzip, which is recommended for heterozygous/outbred organisms with diploid or higher ploidy (and also includes phased-polishing with Arrow) [52, 53].

Fig. 1
figure 1

The assembly pipeline for the Orius laevigatus genome

QuickMerge v0.3 [54] was used to merge the assemblies, with Flye as the reference assembly. BUSCO outputs were compared between the merged assembly and the standalone assemblies to identify genes which had been lost during the merging process. Full-length contigs containing these missing genes were extracted from the standalone assemblies and added to the merged assembly, based on the assumption that these contigs would also contain other missed genes (i.e. those not included in BUSCO’s list of 1658 core insect genes). Multiple rounds of Pilon error polishing [55] were performed, using the Illumina short read data, until no further improvement in BUSCO score was seen.

Redundans [56] was used for scaffolding and redundant contig removal. Redundans is geared towards highly heterozygous genomes. Some redundant regions had to be removed manually, as Redundans does not detect redundancy when only part of the contig is duplicated. The nucmer tool from the MUMmer4 package [57] was used to detect these redundant regions through a whole genome self-alignment.

A BLAST search against the NCBI Reference Sequence (Refseq) database release 93 [58], was performed using the Tera-BLAST algorithm on a TimeLogic DeCypher system (Active Motif Inc., Carlsbad, CA). The results were processed with Megan [59] to identify any bacterial or viral sequences which were then removed manually in Geneious v10.2.6.

The mitochondrial genome sequence was identified and extracted by running a BLAST search of the O. laevigatus genome against the Orius sauteri mitochondrial genome which is publicly available at NCBI, GenBank accession No. KJ671626 [60].

Genome annotation

Gene prediction was performed using the MAKER v2.31.8 pipeline [61] through the incorporation of both transcriptome evidence and ab initio gene prediction as well as a custom repeat library (see below). MAKER was run using Augustus v3.3.1 [62], GeneMark-ES v4.32 [63] and FGeneSH v8.0.0 [64] as well as EVidenceModeler v1.1.1 [65] with default masking options.

A de novo species specific repeat library was constructed using RepeatModeler v1.0.7 [66] to identify repeat models. These models were searched against the GenBank non-redundant (nr) protein database for Arthropoda (e value < 10− 3) using Blastx to remove any potential protein-coding genes. This was combined with transposon data to create a custom library. Transposons were identified from the transcriptome assembly by running HMMER: hmmscan [67] against the Pfam database [68] and filtering the resultant Pfam descriptions for those containing “transposon”. A search for transposons was also done on transcripts produced from MAKER and these transposons were then added to the custom repeat library which was used for a second round of MAKER. RepeatMasker v4.0.7 [69] was used to mask repeats in the genome assembly using these repeat libraries, as well as to estimate the abundances of all predicted repeats.

RNA-seq reads were mapped to the genome with HISAT2 v2.0.5 [70] for assembly with StringTie v1.0.1 [71]. A de novo assembly was also done using Trinity v2.5.1 [72]. The best transcripts (classified by reasonable transcript size and homology to other species) were selected from the Trinity and StringTie assemblies using Evigene v19.jan01 [73].

Evidence from assembled transcripts was transferred to the genome assembly via MAKER. The output from this was then used to produce a high confidence level gene model training set - overlapping and redundant gene models were removed. Augustus and GeneMark were trained using this training set prior to being used for ab initio gene predictions. FGeneSH was run based on the Drosophila melanogaster genome.

The best transcripts from both the ab initio gene prediction annotation and the transcriptome-based annotation were selected using Evigene (classified by reasonable protein size and homology to other species) and combined to create the final annotation.

Orius laevigatus protein sequences were aligned using Blastp against the non-redundant (nr) NCBI protein database for Arthropoda. InterProscan searches were run against several databases (CDD, HAMAP, HMMPAnther, HMMPfam, HMMPIR, FPrintScan, BlastProDom, ProfileScan, HMMTigr) for functional annotation. BLAST2GO [74] was used to assign gene ontology (GO annotations). Infernal v1.1.2 [75] was used to predict and annotate non-coding RNAs.

The mitochondrial genome was annotated using MITOS2 [76] with reference database ‘RefSeq 81 Metazoa’ and genetic code ‘5 Invertebrate’.

Comparative genomics and phylogenetic analysis

To produce the species tree, orthogroup gene trees were produced using Orthofinder [77] and the tree was inferred from these using the STAG method [78].

In order to identify genes potentially involved in insecticide resistance, the PFAM domains assigned to gene models during annotation (as described in the ‘Genome Annotation’ methods section) were used as follows: CCEs (PF00135/IPR002018), GSTs (IPR004045/PF02798), (IPR004046/PF00043), P450s (IPR001128/PF00067), ABCs (IPR003439/PF00005) and UGTs (IPR002213/PF00201). Proteins from UniProt for the classes of interest, from hemipteran species, were used for BLAST queries against O. laevigatus to identify any missed genes and to assist with subfamily assignment within these classes. Subfamily assignment for O. laevigatus gene families was finalised using phylogenetic trees produced using MAFFT alignments [79, 80] and RaxML v8.2.11 [81]. The GAMMA LG protein model [82] was used and a bootstrap consensus tree was inferred from 100 replicates.

Manual checks and curation were performed for genes potentially involved in insecticide resistance. Increased copy numbers of these genes often led to adjacent tandem duplications being incorrectly annotated as one gene model, therefore curation was important to prevent incorrect gene numbers being reported in later analyses. The exon/intron boundaries and start/stop codons of the genes were confirmed through visualization in IGV [83] of RNAseq data mapped to the genome using HISAT2 v2.0.5 [70] and the gene models were edited in Geneious where necessary.

The P450s were classified and named by Dr. David Nelson [84]. The UGTs were classified and named by Dr. Michael Court [85]. Nomenclature of P450s and UGTs is based on the evolutionary relationships of the sequences. P450 and UGT sequences were BLAST searched against named insect sequences and were assigned to known families if they were > 40% (for P450 families) or > 45% (for UGT families) identical. Other sequences were assigned to new families based on their clustering on trees and their percent identity to each other.

Results and discussion

Sequencing

In order to produce enough DNA and RNA for sequencing, ~ 1000 individuals of O. laevigatus were required. Because they were obtained commercially, the level of inbreeding of the culture was not known. However, all individuals were obtained from a single colony within the rearing facility. A high heterozygosity level was therefore a possibility and this was kept in mind when making decisions during the assembly process.

Genome metrics evaluation based on raw reads

The raw read k-mer analysis with GenomeScope 2.0 estimated a haploid genome size of ~ 141 Mb (Table 2), in line with the final assembly size of 151 Mb. A genome size estimate using methods such as flow cytometry would have provided a more accurate estimate, however, such data was not available for the Orius genus. This could be considered a limitation to the study, as 141 Mb was provided as a genome size estimate to Canu, Flye and FALCON-Unzip which may have affected the outputted assemblies. Genome repeat length was 20 Mb, 16.5% of the total estimated genome size.

Table 2 Genome characteristics obtained from GenomeScope v2.0 Using a k-mer length of 21 and a maximum k-mer coverage of 10,000

The heterozygosity rate ranged from 1.20 to 1.30%. This alongside the small ‘shoulder’ to the left of the main ‘full-model’ peak (Fig. 2), indicates a fairly high level of heterozygosity, which was taken into consideration in the assembly strategy.

Fig. 2
figure 2

GenomeScope v2.0 profile plots of A: a transformed linear plot of k-mer frequency; and B: a transformed log plot of k-mer coverage at a k-mer length of 21 and a maximum k-mer coverage of 10,000

Assembly

Flye, FALCON and Canu were used to produce 3 separate genome assemblies. The statistics for these assemblies, as well as for subsequent versions of the assembly outlined in this section are shown in Additional file 2. Rascaf improved the contiguity of the Flye assembly through alignment of the RNA-seq data to the genome, likely because it is less affected by the use of multiple individuals versus genome assembly tools which include non-conserved sequences from a population of individuals. FALCON-Unzip improved the FALCON assembly contiguity with a 4.5-fold decrease in the total number of scaffolds (although this coincided with a ~ 9% loss of complete gene models found using BUSCO and suggests that FALCON-Unzip may have been too stringent for this genome - perhaps because it was designed with plant and fungal genomes in mind [52, 53]).

Flye (both with and without Rascaf) had the best assembly statistics in terms of scaffold N50 and BUSCO score. However, FALCON-Unzip achieved the largest ‘longest scaffold’ of the three assemblers.

Quickmerge was used to merge the FALCON-Unzip assembly, Rascaf improved Flye assembly and the Canu assembly. The resultant merged assembly had better continuity than any of the stand-alone assemblies, however, the BUSCO completeness was slightly worse than the standalone Flye assembly (and worsened with the second round of Quickmerge). This was likely due to mis-assemblies in the component assemblies causing alignment issues, which resulted in sections of the misassembled contigs being discarded.

Pilon was used for error polishing and improved the BUSCO completeness score. Redundans (redundancy removal and scaffolding/gap-closing) improved the scaffold N50 and removed redundant scaffolds.

A comparison of the gene models (core insect genes from the insecta odb9 BUSCO gene set) found in the original Flye / FALCON-Unzip / Canu assemblies versus the merged assembly showed that some of the gene models were found in at least one of the original assemblies, but were missing in the merged assembly. Of the 154 missing or fragmented genes in the merged assembly (out of a total 1658 core insect genes), 5 were found in the FALCON-Unzip assembly, 5 in the Flye assembly and 46 in the Canu assembly. Manual editing to bring the full-length contigs containing these missing genes into the merged assembly took the BUSCO completeness score up by 5%. A final round of Pilon improved this score by an additional 0.5% (further rounds of Pilon did not improve the score).

This brought the final assembly statistics to 93.6% BUSCO (insecta) complete, scaffold N50: 125,649 bp, the longest scaffold: 2,051,674 bp and 89.4% of scaffolds > 10 k in length (Table 3). The final assembly is available under GenBank accession: GCA_018703685.1. Transcriptome sequences are available under accessions: HBWI01000001-HBWI01209903.

Table 3 Final assembly statistics for the O. laevigatus genome

Annotation

Gene prediction with MAKER identified 15,102 protein-coding genes with the encoded proteins having a mean length of 464 amino acids. Of these, 12,949 (86%) had a match to NCBI’s non-redundant (nr) database and 11,616 (77%) contained InterPro motifs, domains or signatures. In total, 13,112 (87%) were annotated with either blastp or InterPro and 10,192 were annotated with a GO ID. More information on the InterPro member database annotations is given in additional file 1. The longest protein found was an ‘egf-like protein’ at 14,628 amino acids. The resultant gene set was 84.5% BUSCO (insecta) complete.

From the Infernal tool inference of RNA alignments, a total of 791 non-coding RNA elements and 269 cis-regulatory elements were found in the genome (Table 4).

Table 4 Number of ncRNAs predicted in the Orius laevigatus genome

Repeat annotation

Transposable and repetitive elements made up 27.07% of the assembled O. laevigatus genome (Table 5) and the majority of these (20.4%) were unclassified repeats. This is close to the reported repeat content of other hemipteran species, for example: Cimex lectularius - 31.63% [86] and Acyrthosiphon pisum - 38% [15], an exception is Rhodnius prolixus which has an unusually low repeat content of 5.6% [87].

Table 5 Summary of transposable and repetitive elements in the Orius laevigatus genome

Mitochondrial genome

A circularized mitochondrial genome of 16,246 bp, assembled and annotated using MITOS2, consisted of 13 protein coding genes, 19 tRNA genes, 2 rRNA genes and an A + T rich region with a length of 1460 bp and an A + T content of 72.7% (Fig. 3). This closely matches the Orius sauteri mitochondrial genome, which is also 16,246 bp and has 13 protein-coding genes, 22 tRNA genes, 2rRNA genes and an A + T rich region of 1758 bp and an A + T content of 73.5% [60].

Fig. 3
figure 3

The mitochondrial genome for Orius laevigatus, visualised using Geneious and annotation track obtained using MITOS2. The innermost graphs represent AT content shown in green, and GC content shown in blue

Phylogeny

OrthoFinder assigned 318,985 genes (88.8% of total) to 27,481 orthogroups. There were 1621 orthogroups with all species present and 45 of these consisted entirely of single-copy genes.

Phylogenetic analysis correctly clustered O. laevigatus within the hemipteran clade (Fig. 4) and identified Cimex lectularius as its closest relative.

Fig. 4
figure 4

Phylogeny and divergence of Insecta. Nodes are coloured by order, blue = Hemiptera, red = Lepidoptera, yellow = Diptera, purple = Hymenoptera, green = Coleoptera, orange = Thysanoptera, black = Chelicerata. Produced using the STAG tree inference method and full proteomes of the following species: C. lectularius: PRJNA167477, L. hesperus: PRJNA284294, R. prolixus: PRJNA13648, T. infestans: PRJNA589079, A. craccivora: PRJNA558689, A. pisum: PRJNA13657, B. tabaci: PRJNA312470, T. vaporariorum: PRJNA553773, D. citri: PRJNA2944, C. suppressalis: PRJNA506136, B. mori: PRJNA205630, D. melanogaster: PRJNA13812, A. mellifera: PRJNA471592, N. vitripennis: PRJNA575073, T. castaneum: PRJNA12540, F. occidentalis: PRJNA203209, T. palmi: PRJNA607431, T. urticae: PRJNA315122

Comparative genomics

ABC transporters

ATP-binding cassette transporters (ABCs), the largest known group of active transporters, can eliminate xenobiotic compounds - such as secondary metabolites produced by plants or insecticides - through translocation [36]. These transporters are subdivided into eight subfamilies: ABCA-H. ABCB, ABCC and ABCG are the subfamilies most associated with resistance to a variety of insecticides including pyrethroids, carbamates, organophosphates and neonicotinoids [88]. Forty-one of the 64 transporters in O. laevigatus belong to these 3 class-specific expansions (Table 6) which could confer resistance to insecticides (a phylogenetic tree showing relationships of ABC transporters in O. laevigatus is included in Additional file 3, full sequences are included in Additional file 4).

Table 6 Numbers of ABC transporter genes annotated in Orius laevigatus (this study), Cimex lectularius [86], Lygus hesperus [89], Frankliniella occidentalis [90], Thrips palmi [91], Aphis gossypii [92], Trialeurodes vaporariorum [93], Diuraphis noxia and Bemisia tabaci [94]

Table 6 shows a comparison of numbers of ABC transporter genes found in the current study with those reported for some pest species. The gene family expansions were generally seen in the ABCC and ABCG classes for all hemipteran species and slightly larger expansions were seen in some crop pests compared to O. laevigatus for the ABCC class, however, the expansions were of very similar size for both crop pests and O. laevigatus in the ABCG class. Overall, the total numbers of ABC transporter genes were similar across all the hemipteran species compared.

Glutathione S-Transferases

The glutathione S-transferases (GSTs) protein family is large and functionally diverse, and is known to confer resistance to all main insecticide classes. GST-mediated detoxification of insecticides takes place via several different mechanisms, including protecting against oxidative stress, binding and sequestration of the insecticide, and by catalysing the conjugation of glutathione to the insecticide to reduce their toxicity [37].

The number of GST genes in O. laevigatus was fairly similar to other hemipteran close relatives, with the exception of the sigma class, which was notably larger (Table 7, full sequences included in Additional file 4). Of the 16 genes in the sigma class, 9 genes (mRNA13082 and mRNA13086–13,093) were adjacent on the same scaffold, indicating a lineage specific expansion (Fig. 5). Expansions in this class have been reported in several hemipteran species including Triatoma infestans, Myzus persicae, Halyomorpha halys and Murgantia histrionica [23, 24, 95, 96]. The sigma class has been found to play an important role in detoxification of organophosphorus insecticides in hemipteran species [97], therefore this expansion could potentially confer some tolerance to organophosphates in O. laevigatus. The delta and epsilon classes of GSTs are linked to insecticide resistance to pyrethroids [98, 99]. The delta class is much larger in several crop pests compared to O. laevigatus and its close relatives which suggests these crop pests could exhibit a higher level of delta class GST-mediated pyrethroid resistance. The epsilon class has previously been thought to be specific to Holometabola [100], and whilst Trialeurodes vaporariorum has a single member, the epsilon class is absent from all other Hemiptera species, suggesting potential epsilon class GST-mediated pyrethroid resistance is most likely absent in O. laevigatus and its close relatives, as well as most Hemiptera crop pests.

Table 7 Numbers of glutathione S-transferase genes annotated in Orius laevigatus (this study), Cimex lectularius [101], Rhodnius prolixus, Triatoma Infestans [96], Thrips palmi [91], Myzus persicae, Acyrthosiphon pisum, Trialeurodes vaporariorum, Bemisia tabaci, Halyomorpha halys [102] and Murgantia histrionica [24]
Fig. 5
figure 5

Phylogenetic tree of the Orius laevigatus glutathione S-transferases. Amino acid sequences were aligned using MAFFT and analysed using RAxML (the GAMMA LG protein model was used). The bootstrap consensus tree was inferred from 100 replicates. Coloured stars on branches indicate tandem duplications, with each colour representing a different scaffold/set of tandem duplications in the O. laevigatus assembly

Carboxyl/cholinesterases

Many carboxyl/cholinesterases (CCEs) are linked to detoxification of organophosphorus, carbamate and pyrethroid insecticides and acetylcholinesterase (AChE) is the target for organophosphate and carbamate insecticides, with amino acid substitutions being linked to resistance [39]. Thirty-two members of the CCE superfamily, including 1 AChE gene, were found in the O. laevigatus genome (Table 8, full sequences included in Additional file 4) which is a similar number to that reported for Cimex lectularius, which had 30 CCE genes and 2 AChE genes [86].

Table 8 Numbers of Carboxyl/cholinesterases annotated in Orius laevigatus (this study), Cimex lectularius [101], Rhodnius prolixus [103], Triatoma infestans [96], Frankliniella occidentalis [90], Myzus persicae [95], Acyrthosiphon pisum, Bemisia tabaci [104] and Trialeurodes vaporariorum [105] and their distribution across classes and clades

The dietary class of CCEs is involved in insecticide and xenobiotic detoxification [106]. O. laevigatus has no genes within this class, in line with T. infestans and C. lectularius, whereas the crop pest species (i.e. thrips, aphids and whiteflies) all have at least 5 members in this class (Table 8). R. prolixus has 22 genes which have been classed as dietary; however this assignment was based heavily on a species-specific expansion which is characteristic of the dietary class. The real number of genes in the dietary class for R. prolixus may be 0, since this clade of 22 genes clusters with the hormone/semiochemical class in both the R. prolixus study [103] and this study (Additional file 5). A lack of dietary esterases in R. prolixus would make sense, as R. prolixus, C. lectularius and T. infestans are all blood-sucking insects and do not require dietary esterases to process the secondary metabolites found in plants. This could also explain why O. laevigatus, a beneficial predator of crop pests in both nymph and adult life stages, does not require dietary esterases.

The dietary class is involved in pyrethroid resistance [107]; however, T. infestans exhibits pyrethroid esterase activity despite having no dietary esterases [108]. O. laevigatus has also shown the ability to develop pyrethroid resistance - although the exact mechanism of this resistance is not yet known [109]. The hormone and semiochemical processing class is also involved in insecticide metabolism, due to the presence of β-esterases [110, 111]. There may be some redundancy in genes potentially involved in insecticide detoxification from the dietary and hormone/semiochemical processing classes. This might explain why only one of these classes shows an increased number of genes for each of these hemipteran species (Table 8), as having increased numbers of both classes would be redundant, whilst very low numbers of both classes would be detrimental. The lack of the dietary class may therefore not impact the xenobiotic resistance abilities of O. laevigatus, as it has 16 genes within the hormone/semiochemical processing class.

The remaining CCEs in O. laevigatus belong to the neurodevelopmental class and include the neuroligins, gliotactins, glutactins and neurotactins, which are non-catalytic due to the lack of a critical serine residue. Acetylcholinesterase is the only protein in this class which has been linked to organophosphate resistance [112, 113].

UGTs

UDP-glucosyltransferases (UGTs) are detoxification enzymes speculated to be involved in insecticide metabolism. Although the exact mechanisms of UGT-mediated resistance have not yet been identified, their upregulation has been shown in resistant strains of P. xylostella [35] and they have been linked to diamide resistance in Chilo suppressalis [114], neonicotinoid resistance in Diaphorina citri [115] and they also contribute to insecticide detoxification via the elimination of oxidative stress in Apis cerana [116].

The number of UGT genes in O. laevigatus was much lower than for other hemipteran species (Table 9). The UGTs were submitted to Dr. Michael Court for naming and are included in Additional file 4. Numbers of UGTs have been reported to be lower in non-phytophagous insects [91], which could explain the low numbers seen in O. laevigatus and R. prolixus compared to crop pests. This suggests that UGT-mediated detoxification may be lower in O. laevigatus than in crop pests.

Table 9 Numbers of UDP glucuronosyltransferase genes found in O. laevigatus (this study), Rhodnius prolixus, Tetranychus urticae, Nilaparvata lugens, Acyrthosiphon pisum, Bemisia tabaci [19], Myzus persicae [117] and Trialeurodes vaporariorum [118]

Cytochrome P450s

Cytochrome P450s are a diverse superfamily capable of metabolizing a huge variety of endogenous and exogenous substrates. In insects they are associated with growth and development, metabolism of pesticides and plant toxins as well as the production and metabolism of insect hormones and pheromones. P450s are associated with resistance to insecticides from a variety of classes, including pyrethroids, carbamates and neonicotinoids. They are also linked to the activation of organophosphates and other pro-insecticides (a form of insecticide which is metabolized into an active form inside the host) [38]. Upregulation of P450s in insects has been shown to confer insecticide resistance [119,120,121,122], and conversely downregulation occurs in response to pro-insecticides [123, 124].

A total of 58 full-length P450 genes were identified in the O. laevigatus genome, 11 P450 fragment genes were also found as well as 1 pseudogene. (Sequences are included in Additional file 4). These sequences were named by Dr. David Nelson using his in-house pipeline [84]. The majority of these genes (34) belonged to the diverse CYP3 class, which was a similar size to other hemipteran species (Table 10).

Table 10 Total numbers of Cytochrome P450 genes annotated in Orius laevigatus (this study), Cimex lectularius [86], Rhodnius prolixus, Triatoma infestans [96], Frankliniella occidentalis, Thrips palmi [90], Myzus persicae, Acyrthosiphon pisum [95], Trialeurodes vaporariorum [93], Bemisia tabaci [125], Halyomorpha halys [23] and Murgantia histrionica [24]

The CYP3 clade is currently the P450 clade most associated with insecticide resistance - notably the CYP6 and CYP9 families [128]. Interestingly the CYP9 family was not present in O. laevigatus, as found for T. infestans, R. prolixus, M. histrionica and H. halys [23, 24, 96]. Further investigation into the assignment of classes within the CYP3 clade suggests the lack of the CYP9 class could be a common feature within Hemiptera (Table 10).

Expansion of the CYP397 gene family was seen in O. laevigatus, (Fig. 6) with 7 full-length CYP397 genes and 1 fragment. CYP397B1, CYP397B2, CYP397B6 and CYP397C1 were directly adjacent on the same scaffold, indicating tandem duplications. Sequence similarity of the CYP397 genes to CYP397B1 ranged from 52 to 86%, which suggests a variation in ages of these tandem duplications. Cimex lectularius also showed an increased copy number of CYP397 with 6 copies (A1-A6) [86]. CYP397A1 is significantly upregulated (> 36 fold) in pyrethroid-resistant strains of C. lectularius [127], therefore the expansion of this gene family could potentially confer some tolerance to pyrethroids in O. laevigatus.

Fig. 6
figure 6

Phylogenetic tree of the Orius laevigatus cytochrome P450s. The Cyp397 gene family is a member of clan 3. Amino acid sequences were aligned using MAFFT and analysed using RAxML (the GAMMA LG protein model was used). The bootstrap consensus tree was inferred from 100 replicates

A previous study [129] looked at the effect of insecticide synergists on Orius tristicolor (another minute pirate bug of the Anthocoridae family), and found that PBO (an inhibitor of P450s and esterases) significantly increased the mortality rate when combined with indoxacarb (an oxadiazine insecticide). Whereas inhibition of solely GSTs or esterases did not reduce mortality. Upregulation of P450s, esterases and GSTs have all been seen in response to oxadiazines [130], therefore the fact that only P450 inhibition had an impact on mortality rate suggests P450s may be the primary detoxification mechanism of O. laevigatus.

Target site mutations

Point mutations resulting in amino acid substitutions in the target proteins of insecticides have been characterised in many insecticide resistant insect species, including in the sodium channel gene para which confers resistance to pyrethroids [131]; the acetylcholinesterase-1 (ace-1) enzyme associated with organophosphate resistance [132] and the acetyl-coenzyme A Carboxylase (ACC) enzyme linked to keto-enol (spirotetramat) resistance [133]. Despite these mutations having been observed in a variety of hemipteran crop pests, none were observed in this O. laevigatus assembly. Although, it is important to note that the O. laevigatus assembly was a consensus of ~ 1000 individuals, therefore differences in target sites would likely only be apparent if they were present in the majority of the population. Overall, tolerance of insecticides by O. laevigatus resulting from target site differences seems unlikely compared to what is seen in crop pests, where there has been intensive selection pressure.

The ryanodine receptor (RyR) is the target of diamide insecticides, and two target site resistance mutations conferring amino acid substitutions (I4790M and G4946E - numbering according to Plutella xylostella, PxRyR) have been identified in lepidopteran pests [21, 134]. Interestingly, O. laevigatus has the I4790M substitution (full sequence for Ryanodine in Additional file 4) which has been shown to confer varying levels of resistance to diamides. This point mutation was also present in other hemipteran species as shown in Fig. 7 (except for Lygus hesperus which had an I > L mutation). I4790M has been detected in lepidopteran populations across the globe and is considered to be a ‘selectivity switch’ for diamides [135]. O. tristicolor showed high levels of resistance to chlorantraniliprole (a diamide insecticide) with < 5% mortality [129] with the I4790M substitution being the main cause [136]. It is therefore possible that I4790M may confer some tolerance to diamides in O. laevigatus, and indeed, diamide resistance has been reported in O. laevigatus [137]. However, I4790M could potentially also confer diamide tolerance in crop pests - diamide resistance has already been shown in F. occidentalis [137]. Therefore this would likely not be an exploitable difference for IPM strategies.

Fig. 7
figure 7

An alignment of amino acid sequences to compare transmembrane domain 3 of the conserved ryanodine receptor (RyR) from Plutella xylostella with hemipteran species. The box indicates the I4790M RyR point mutation linked to diamide resistance (numbered according to PxRyR). A strain of P. xylostella without the point mutation was used for the alignment. RyR sequences were obtained from the UniProt database, excluding Orius laevigatus. UniProt entry names are as follows: Cimex lectularius: A0A7E4RNZ4_CIMLE, Triatoma infestans: A0A023F678_TRIIF, Lygus hesperus: A0A146M8X0_LYGHE, Dialeurodes citri: A0A141BN13_DIACT, Bemisia tabaci: A0A1U9JHP1_BEMTA, Toxoptera citricida: A0A0H3XSN1_TOXCI, Sipha flava: A0A2S2QG40_9HEMI, Plutella xylostella: I3NWV8_PLUXY, Myzus persicae: A0A0A7RS32_MYZPE, Acyrthosiphon pisum: X1WXB1_ACYPI, Aphis glycines: A0A6G0U418_APHGL

Conclusions

PacBio long-read technology combined with low error-rate short-read Illumina sequencing enabled the production of a high-quality genome and mitochondrial assembly for O. laevigatus. Whilst genome continuity may not be as good as an assembly generated from a single insect, the genome completeness is still of a sufficient quality to aid with comparative and functional genomics analyses and provides a useful first reference genome for the Anthocoridae family. An experimental estimate to confirm genome size and Hi-C based scaffolding would likely be the next best steps to significantly improve this genome in the future.

Comparative analyses of O. laevigatus with hemipteran crop pests showed evidence of possible differences in xenobiotic tolerance, including a potential increase in GST-mediated tolerance of organophosphates in O. laevigatus, whilst GST-mediated pyrethroid tolerance may be more prevalent in crop pests. There may also be less UGT-mediated tolerance to diamides and neonicotinoids in O. laevigatus compared to crop pests - although, the I4790M target site mutation may confer some degree of diamide insensitivity to O. laevigatus.

A recent study shows that there is significant variation in the susceptibility of O. laevigatus to pyrethroids when a variety of wild and commercial populations are assessed [109]. This suggests that beneficial predators such as O. laevigatus are certainly capable of developing insecticide resistance, but a combination of factors result in resistance developing slower than in pest species. This could be due to beneficial predators having smaller population sizes, longer life cycles, less exposure to pesticides and a lack of continuous selection pressure - beneficial predators often need to be re-released each season as populations migrate to new areas in search of food sources. These differences will have resulted in a lesser degree of selection for resistance mechanisms in O. laevigatus and therefore any observed differences in potential sensitivity would only be at low levels. Further comparisons looking at differences in gene expansions, expression levels and key target site mutations between resistant and susceptible strains of O. laevigatus would provide more concrete evidence for the findings in this study.

In conclusion, this study indicates differences in potential mechanisms of resistance between crop pests and O. laevigatus which could be exploited when selecting targeted insecticides. An increase in the number of pesticides which are safe for beneficial predators such as O. laevigatus would be of significant impact to pest management, especially at a time when the list of pesticides effective against crop pests is growing ever shorter. The findings also suggest that O. laevigatus has the ability to develop resistance to a variety of insecticides which could be used to our advantage through the selective breeding and selection of resistant strains of O. laevigatus for use in IPM strategies.

Availability of data and materials

The genome assembly generated in this study, as well as the raw PacBio and Illumina data used, are available under the BioProject accessions: PRJNA721944 and PRJEB38143. The curated Orius laevigatus genes used for comparative analyses are included in the additional files.

References

  1. 1.

    Oerke E-C. Crop losses to pests. J Agric Sci. 2006;144:31–43.

    Google Scholar 

  2. 2.

    Geiger F, Bengtsson J, Berendse F, Weisser WW, Emmerson M, Morales MB, et al. Persistent negative effects of pesticides on biodiversity and biological control potential on European farmland. Basic Appl Ecol. 2010;11:97–105.

    CAS  Google Scholar 

  3. 3.

    Bottrell DG, Schoenly KG. Resurrecting the ghost of green revolutions past: the brown planthopper as a recurring threat to high-yielding rice production in tropical Asia. J Asia Pac Entomol. 2012;15:122–40.

    Google Scholar 

  4. 4.

    Debach P, Rosen D. Biological control by natural enemies (second edition). J Trop Ecol. 1992;8:216–6.

    Google Scholar 

  5. 5.

    Hernandez LM. A review of the economically important species of the genus Orius (Heteroptera: Anthocoridae) in East Africa. J Nat Hist. 1999;33:543–68. https://doi.org/10.1080/002229399300245.

    Article  Google Scholar 

  6. 6.

    Kelton LA. The Anthocoridae of Canada and Alaska: Heteroptera. Agriculture Canada: Anthocoridae; 1978.

    Google Scholar 

  7. 7.

    Malais M. Knowing and recognizing: the biology of glasshouse pests and their natural enemies; 1992.

    Google Scholar 

  8. 8.

    Chambers RJ, Long S, Helyer NL. Effectiveness of Orius laevigatus(hem.: Anthocoridae) for the control of Frankliniella occidentalis on cucumber and pepper in the UK. Biocontrol Sci Tech. 1993;3:295–307.

    Google Scholar 

  9. 9.

    Sanchez JA, Sanchez JA, Lacasa A. Modelling population dynamics of Orius laevigatus and O. albidipennis (Hemiptera: Anthocoridae) to optimize their use as biological control agents of Frankliniella occidentalis (Thysanoptera: Thripidae). Bull Entomol Res. 2002;92:77–88.

    CAS  PubMed  Google Scholar 

  10. 10.

    Tommasini MG, Maini S, Nicoli G. Advances in the integrated pest management in protected-eggplant crops by seasonal inoculative releases of Orius laevigatus. Adv Hortic Sci. 1997;11:182–8.

    Google Scholar 

  11. 11.

    Hoy MA, Waterhouse RM, Wu K, Estep AS, Ioannidis P, Palmer WJ, et al. Genome sequencing of the Phytoseiid predatory mite Metaseiulus occidentalis reveals completely atomized Hox genes and Superdynamic intron evolution. Genome Biol Evol. 2016;8:1762–75.

    PubMed  PubMed Central  Google Scholar 

  12. 12.

    Werren JH, Richards S, Desjardins CA, Niehuis O, Gadau J, Colbourne JK, et al. Functional and evolutionary insights from the genomes of three parasitoid Nasonia species. Science. 2010;327:343–8.

    CAS  PubMed  Google Scholar 

  13. 13.

    Ando T, Matsuda T, Goto K, Hara K, Ito A, Hirata J, et al. Repeated inversions within a pannier intron drive diversification of intraspecific colour patterns of ladybird beetles. Nat Commun. 2018;9:1–13.

    Google Scholar 

  14. 14.

    Consortium TGS, Richards S, Gibbs RA, Weinstock GM, Brown SJ, Denell R, et al. The genome of the model beetle and pest Tribolium castaneum. Nature. 2008;452:949–55.

    Google Scholar 

  15. 15.

    International Aphid Genomics Consortium. Genome sequence of the pea aphid Acyrthosiphon pisum. PLoS Biol. 2010;8:e1000313.

    Google Scholar 

  16. 16.

    You M, Yue Z, He W, Yang X, Yang G, Xie M, et al. A heterozygous moth genome provides insights into herbivory and detoxification. Nat Genet. 2013;45:220–5.

    CAS  PubMed  Google Scholar 

  17. 17.

    Xue J, Zhou X, Zhang C-X, Yu L-L, Fan H-W, Wang Z, et al. Genomes of the rice pest brown planthopper and its endosymbionts reveal complex complementary contributions for host adaptation. Genome Biol. 2014;15:521.

    PubMed  PubMed Central  Google Scholar 

  18. 18.

    Nicholson SJ, Nickerson ML, Dean M, Song Y, Hoyt PR, Rhee H, et al. The genome of Diuraphis noxia , a global aphid pest of small grains. BMC Genomics. 2015;16:1–16.

    CAS  Google Scholar 

  19. 19.

    Chen W, Hasegawa DK, Kaur N, Kliot A, Pinheiro PV, Luan J, et al. The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance. BMC Biol. 2016;14:1–15.

    CAS  PubMed  PubMed Central  Google Scholar 

  20. 20.

    Wenger JA, Cassone BJ, Legeai F, Johnston JS, Bansal R, Yates AD, et al. Whole genome sequence of the soybean aphid. Aphis glycines Insect Biochem Mol Biol. 2020;123:102917.

    CAS  PubMed  Google Scholar 

  21. 21.

    Wang X, Cao X, Jiang D, Yang Y, Wu Y. CRISPR/Cas9 mediated ryanodine receptor I4790M knockin confers unequal resistance to diamides in Plutella xylostella. Insect Biochem Mol Biol. 2020;125:103453.

    CAS  PubMed  Google Scholar 

  22. 22.

    Schoville SD, Chen YH, Andersson MN, Benoit JB, Bhandari A, Bowsher JH, et al. A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae). Sci Rep. 2018;8:1–18.

    CAS  Google Scholar 

  23. 23.

    Sparks ME, Bansal R, Benoit JB, Blackburn MB, Chao H, Chen M, et al. Brown marmorated stink bug, Halyomorpha halys (Stål), genome: putative underpinnings of polyphagy, insecticide resistance potential and biology of a top worldwide pest. BMC Genomics. 2020;21:227.

    PubMed  PubMed Central  Google Scholar 

  24. 24.

    Sparks ME, Rhoades JH, Nelson DR, Kuhar D, Lancaster J, Lehner B, et al. A Transcriptome survey spanning life stages and sexes of the harlequin bug, Murgantia histrionica. Insects. 2017;8. https://doi.org/10.3390/insects8020055.

  25. 25.

    Cao C, Sun L, Wen R, Shang Q, Ma L, Wang Z. Characterization of the transcriptome of the Asian gypsy moth Lymantria dispar identifies numerous transcripts associated with insecticide resistance. Pestic Biochem Physiol. 2015;119:54–61.

    CAS  PubMed  Google Scholar 

  26. 26.

    Sparks ME, Nelson DR, Haber AI, Weber DC, Harrison RL. Transcriptome Sequencing of the Striped Cucumber Beetle, Acalymma vittatum (F.), Reveals Numerous Sex-Specific Transcripts and Xenobiotic Detoxification Genes. BioTech. 2020;9:21.

    CAS  Google Scholar 

  27. 27.

    European Commission. Directive 2009/128/EC on the sustainable use of pesticides. Official Journal of the European Union; 2009. 10.2861/78.

    Google Scholar 

  28. 28.

    Cameron PJ, Walker GP, Hodson AJ, Kale AJ, Herman TJB. Trends in IPM and insecticide use in processing tomatoes in New Zealand. Crop Prot. 2009;28:421–7.

    CAS  Google Scholar 

  29. 29.

    Kranthi KR, Russell DA. Changing trends in cotton Pest management. In: Peshin R, Dhawan AK, editors. Integrated Pest management: innovation-development process: volume 1. Dordrecht: Springer Netherlands; 2009. p. 499–541.

    Google Scholar 

  30. 30.

    Meissle M, Mouron P, Musa T, Bigler F, Pons X, Vasileiadis VP, et al. Pests, pesticide use and alternative options in European maize production: current status and future prospects. J Appl Entomol. 2009;134:357–75.

    Google Scholar 

  31. 31.

    Hillocks RJ. Farming with fewer pesticides: EU pesticide review and resulting challenges for UK agriculture. Crop Prot. 2012;31:85–93.

    Google Scholar 

  32. 32.

    Lechenet M, Dessaint F, Py G, Makowski D, Munier-Jolain N. Reducing pesticide use while preserving crop productivity and profitability on arable farms. Nature Plants. 2017;3:1–6.

    Google Scholar 

  33. 33.

    Lattin JD, BIONOMICS OF, THE ANTHOCORIDAE. Annu Rev Entomol. 1999;44:207–31.

    CAS  PubMed  Google Scholar 

  34. 34.

    Heckel DG. Insecticide resistance after silent spring. Science. 2012;337:1612–4.

    PubMed  Google Scholar 

  35. 35.

    Li X, Shi H, Gao X, Liang P. Characterization of UDP-glucuronosyltransferase genes and their possible roles in multi-insecticide resistance in Plutella xylostella (L.). Pest Manag Sci. 2018;74:695–704.

    CAS  PubMed  Google Scholar 

  36. 36.

    Merzendorfer H. Chapter One - ABC Transporters and Their Role in Protecting Insects from Pesticides and Their Metabolites. In: Cohen E, editor. Advances in Insect Physiology, vol. 46, Academic Press; 2014, p. 1–72.

  37. 37.

    Pavlidi N, Vontas J, Van Leeuwen T. The role of glutathione S-transferases (GSTs) in insecticide resistance in crop pests and disease vectors. Curr Opin Insect Sci. 2018;27:97–102.

    PubMed  Google Scholar 

  38. 38.

    Scott JG. Cytochromes P450 and insecticide resistance. Insect Biochem Mol Biol. 1999;29:757–77.

    CAS  PubMed  Google Scholar 

  39. 39.

    Sogorb MA, Vilanova E. Enzymes involved in the detoxification of organophosphorus, carbamate and pyrethroid insecticides through hydrolysis. Toxicol Lett. 2002;128:215–28.

    CAS  PubMed  Google Scholar 

  40. 40.

    Rane RV, Ghodke AB, Hoffmann AA, Edwards OR, Walsh TK, Oakeshott JG. Detoxifying enzyme complements and host use phenotypes in 160 insect species. Curr Opin Insect Science. 2019;31:131–8.

    Google Scholar 

  41. 41.

    Roderick GK, Navajas M. Genes in new environments: genetics and evolution in biological control. Nat Rev Genet. 2003;4:889–99.

    CAS  PubMed  Google Scholar 

  42. 42.

    Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27:764–70.

    PubMed  PubMed Central  Google Scholar 

  43. 43.

    Rhyker Ranallo-Benavidez T, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020;11:1–10.

    Google Scholar 

  44. 44.

    Andrews S. FastQC n.d. https://github.com/s-andrews/FastQC (accessed 20 Apr 2021).

  45. 45.

    Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.

    CAS  PubMed  PubMed Central  Google Scholar 

  46. 46.

    Bradnam K. A script to calculate a basic set of metrics from a genome assembly 2011. http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/assemblathon_stats.pl.

    Google Scholar 

  47. 47.

    Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–2.

    PubMed  Google Scholar 

  48. 48.

    Kolmogorov M, Yuan J, Lin Y, Pevzner PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37:540–6.

    CAS  PubMed  Google Scholar 

  49. 49.

    Lin Y, Yuan J, Kolmogorov M, Shen MW, Chaisson M, Pevzner PA. Assembly of long error-prone reads using de Bruijn graphs. Proc Natl Acad Sci U S A. 2016;113:E8396–405.

    CAS  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Song L, Shankar DS, Florea L. Rascaf: improving genome assembly with RNA sequencing data. Plant Genome. 2016;9. https://doi.org/10.3835/plantgenome2016.03.0027.

  51. 51.

    Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–36.

    CAS  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Chin C-S, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods. 2013;10:563–9.

    CAS  PubMed  Google Scholar 

  53. 53.

    Chin C-S, Peluso P, Sedlazeck FJ, Nattestad M, Concepcion GT, Clum A, et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat Methods. 2016;13:1050–4.

    CAS  PubMed  PubMed Central  Google Scholar 

  54. 54.

    Chakraborty M, Baldwin-Brown JG, Long AD, Emerson JJ. Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. Nucleic Acids Res. 2016;44:e147.

    PubMed  PubMed Central  Google Scholar 

  55. 55.

    Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9:e112963.

    PubMed  PubMed Central  Google Scholar 

  56. 56.

    Pryszcz LP, Gabaldón T. Redundans: an assembly pipeline for highly heterozygous genomes. Nucleic Acids Res. 2016;44:e113–3.

    PubMed  PubMed Central  Google Scholar 

  57. 57.

    Marçais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, Zimin A. MUMmer4: A fast and versatile genome alignment system. PLoS Comput Biol. 2018;14:e1005944.

    PubMed  PubMed Central  Google Scholar 

  58. 58.

    O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44:D733–45.

    PubMed  Google Scholar 

  59. 59.

    Huson DH, Auch AF, Qi J, Schuster SC. MEGAN analysis of metagenomic data. Genome Res. 2007;17:377–86.

    CAS  PubMed  PubMed Central  Google Scholar 

  60. 60.

    Du B-Z, Niu F-F, Wei S-J. The complete mitochondrial genome of the predatory bug Orius sauteri (Poppius) (Hemiptera: Anthocoridae). Mitochondrial DNA Part A. 2016;27:777–8.

    CAS  Google Scholar 

  61. 61.

    Holt C, Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics. 2011;12:1–14.

    Google Scholar 

  62. 62.

    Stanke M, Steinkamp R, Waack S, Morgenstern B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 2004;32:W309–12.

    CAS  PubMed  PubMed Central  Google Scholar 

  63. 63.

    Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005;33:6494–506.

    CAS  PubMed  PubMed Central  Google Scholar 

  64. 64.

    Solovyev V. Statistical approaches in eukaryotic gene prediction. In: Balding DJ, Bishop M, Cannings C, editors. Handbook of statistical genetics. Chichester: Wiley; 2001. p. 83–127.

    Google Scholar 

  65. 65.

    Haas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 2008;9:1–22.

    Google Scholar 

  66. 66.

    Smit AFA, Hubley R. RepeatModeler Open-1.0 2008–2015. http://www.repeatmasker.org.

  67. 67.

    Wheeler TJ, Eddy SR. Nhmmer: DNA homology search with profile HMMs. Bioinformatics. 2013;29:2487–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  68. 68.

    Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42:D222–30.

    CAS  PubMed  Google Scholar 

  69. 69.

    Smit AFA, Hubley R, Green P. RepeatMasker Open-4.0 2013–2015. http://www.repeatmasker.org.

  70. 70.

    Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12:357–60.

    CAS  PubMed  PubMed Central  Google Scholar 

  71. 71.

    Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33:290–5.

    CAS  PubMed  PubMed Central  Google Scholar 

  72. 72.

    Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29:644–52.

    CAS  PubMed  PubMed Central  Google Scholar 

  73. 73.

    Gilbert D. EvidentialGene - evidence directed gene construction for eukaryotes 2013. https://sourceforge.net/projects/evidentialgene/.

    Google Scholar 

  74. 74.

    Götz S, García-Gómez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36:3420–35.

    PubMed  PubMed Central  Google Scholar 

  75. 75.

    Nawrocki EP, Eddy SR. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013;29:2933–5.

    CAS  PubMed  PubMed Central  Google Scholar 

  76. 76.

    Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, et al. MITOS: Improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 2013;69:313–9.

    PubMed  Google Scholar 

  77. 77.

    Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:1–14.

    Google Scholar 

  78. 78.

    Emms DM, Kelly S. STAG: Species Tree Inference from All Genes. bioRxiv 2018:267914. https://doi.org/10.1101/267914.

  79. 79.

    Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.

    CAS  PubMed  PubMed Central  Google Scholar 

  80. 80.

    Katoh K, Misawa K, Kuma K-I, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–66.

    CAS  PubMed  PubMed Central  Google Scholar 

  81. 81.

    Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.

    CAS  PubMed  PubMed Central  Google Scholar 

  82. 82.

    Le SQ, Gascuel O. An improved general amino acid replacement matrix. Mol Biol Evol. 2008;25:1307–20.

    CAS  Google Scholar 

  83. 83.

    Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.

    CAS  PubMed  PubMed Central  Google Scholar 

  84. 84.

    Nelson DR. The cytochrome P450 homepage. Hum Genomics. 2009;4:1–7.

    Google Scholar 

  85. 85.

    UGT Committee. UGT Committee Home UGT Committee home n.d. https://prime.vetmed.wsu.edu/resources/udp-glucuronsyltransferase-homepage (accessed 25 Mar 2021).

  86. 86.

    Rosenfeld JA, Reeves D, Brugler MR, Narechania A, Simon S, Durrett R, et al. Genome assembly and geospatial phylogenomics of the bed bug Cimex lectularius. Nat Commun. 2016;7:1–10.

    Google Scholar 

  87. 87.

    Mesquita RD, Vionette-Amaral RJ, Lowenberger C, Rivera-Pomar R, Monteiro FA, Minx P, et al. Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection. Proc Natl Acad Sci U S A. 2015;112:14936–41.

    CAS  PubMed  PubMed Central  Google Scholar 

  88. 88.

    Dermauw W, Van Leeuwen T. The ABC gene family in arthropods: comparative genomics and role in insecticide transport and resistance. Insect Biochem Mol Biol. 2014;45:89–110.

    CAS  PubMed  Google Scholar 

  89. 89.

    Joe Hull J, Chaney K, Geib SM, Fabrick JA, Brent CS, Walsh D, et al. Transcriptome-based identification of ABC transporters in the Western tarnished plant bug Lygus hesperus. PLoS One. 2014;9:e113046.

    PubMed  Google Scholar 

  90. 90.

    Rotenberg D, Baumann AA, Ben-Mahmoud S, Christiaens O, Dermauw W, Ioannidis P, et al. Genome-enabled insights into the biology of thrips as crop pests. BMC Biol. 2020;18:142.

    CAS  PubMed  PubMed Central  Google Scholar 

  91. 91.

    Guo S-K, Cao L-J, Song W, Shi P, Gao Y-F, Gong Y-J, et al. Chromosome-level assembly of the melon thrips genome yields insights into evolution of a sap-sucking lifestyle and pesticide resistance. Mol Ecol Resour. 2020;20:1110–25.

    CAS  PubMed  Google Scholar 

  92. 92.

    Pan Y, Zeng X, Wen S, Gao X, Liu X, Tian F, et al. Multiple ATP-binding cassette transporters genes are involved in thiamethoxam resistance in Aphis gossypii glover. Pestic Biochem Physiol. 2020;167:104558.

    CAS  PubMed  Google Scholar 

  93. 93.

    Pym A, Singh KS, Nordgren Å, Emyr Davies TG, Zimmer CT, Elias J, et al. Host plant adaptation in the polyphagous whitefly, Trialeurodes vaporariorum , is associated with transcriptional plasticity and altered sensitivity to insecticides. BMC Genomics. 2019;20:1–19.

    Google Scholar 

  94. 94.

    Tian L, Song T, He R, Zeng Y, Xie W, Wu Q, et al. Genome-wide analysis of ATP-binding cassette (ABC) transporters in the sweetpotato whitefly, Bemisia tabaci. BMC Genomics. 2017;18:1–16.

    Google Scholar 

  95. 95.

    Ramsey JS, Rider DS, Walsh TK, De Vos M, Gordon KHJ, Ponnala L, et al. Comparative analysis of detoxification enzymes in Acyrthosiphon pisum and Myzus persicae. Insect Mol Biol. 2010;19(Suppl 2):155–64.

    CAS  PubMed  Google Scholar 

  96. 96.

    Traverso L, Lavore A, Sierra I, Palacio V, Martinez-Barnetche J, Latorre-Estivalis JM, et al. Comparative and functional triatomine genomics reveals reductions and expansions in insecticide resistance-related gene families. PLoS Negl Trop Dis. 2017;11:e0005313.

    PubMed  PubMed Central  Google Scholar 

  97. 97.

    Gawande ND, Subashini S, Murugan M, Subbarayalu M. Molecular screening of insecticides with sigma glutathione S-transferases (GST) in cotton aphid Aphis gossypii using docking. Bioinformation. 2014;10:679–83.

    PubMed  PubMed Central  Google Scholar 

  98. 98.

    Lumjuan N, Rajatileka S, Changsom D, Wicheer J, Leelapat P, Prapanthadara L-A, et al. The role of the Aedes aegypti Epsilon glutathione transferases in conferring resistance to DDT and pyrethroid insecticides. Insect Biochem Mol Biol. 2011;41:203–9.

    CAS  PubMed  Google Scholar 

  99. 99.

    Vontas JG, Small GJ, Hemingway J. Glutathione S-transferases as antioxidant defence agents confer pyrethroid resistance in Nilaparvata lugens. Biochem J. 2001;357:65–72.

    CAS  PubMed  PubMed Central  Google Scholar 

  100. 100.

    Friedman R. Genomic organization of the glutathione S-transferase family in insects. Mol Phylogenet Evol. 2011;61:924–32.

    CAS  PubMed  Google Scholar 

  101. 101.

    Adelman ZN, Kilcullen KA, Koganemaru R, Anderson MAE, Anderson TD, Miller DM. Deep sequencing of pyrethroid-resistant bed bugs reveals multiple mechanisms of resistance within a single population. PLoS One. 2011;6:e26228.

    CAS  PubMed  PubMed Central  Google Scholar 

  102. 102.

    Aidlin Harari O, Santos-Garcia D, Musseri M, Moshitzky P, Patel M, Visendi P, et al. Molecular evolution of the glutathione S-Transferase family in the Bemisia tabaci species complex. Genome Biol Evol. 2020;12:3857–72.

    PubMed  PubMed Central  Google Scholar 

  103. 103.

    Schama R, Pedrini N, Juárez MP, Nelson DR, Torres AQ, Valle D, et al. Rhodnius prolixus supergene families of enzymes potentially associated with insecticide resistance. Insect Biochem Mol Biol. 2016;69:91–104.

    CAS  PubMed  Google Scholar 

  104. 104.

    Xia J, Xu H, Yang Z, Pan H, Yang X, Guo Z, et al. Genome-wide analysis of Carboxylesterases (COEs) in the whitefly, (Gennadius). Int J Mol Sci. 2019;20 https://doi.org/10.3390/ijms20204973.

  105. 105.

    Karatolos N. Molecular mechanisms of insecticide resistance in the greenhouse whitefly, Trialeurodes vaporariorum. PhD. University of Exeter, 2011. https://doi.org/https://ore.exeter.ac.uk/repository/bitstream/handle/10036/3350/KaratolosN.pdf.

  106. 106.

    Oakeshott J, Claudianos C, Campbell PM, Newcomb RD, Russell RJ. Biochemical genetics and genomics of insect esterases. In: Gilbert LI, Gill SS, Iatrou K, editors. Insect pharmacology: channels, Receptors, Toxins and Enzymes, Elsevier; 2010. p. 229–81.

    Google Scholar 

  107. 107.

    Flores AE, Albeldaño-Vázquez W, Salas IF, Badii MH, Becerra HL, Garcia GP, et al. Elevated α-esterase levels associated with permethrin tolerance in Aedes aegypti (L.) from Baja California, Mexico. Pestic Biochem Physiol. 2005;82:66–78.

    CAS  Google Scholar 

  108. 108.

    Orihuela PLS, Vassena CV, Zerba EN, Picollo MI. Relative contribution of Monooxygenase and esterase to Pyrethroid resistance in Triatoma infestans (Hemiptera: Reduviidae) from Argentina and Bolivia. J Med Entomol. 2014;45:298–306.

    Google Scholar 

  109. 109.

    Balanza V, Mendoza JE, Cifuentes D, Bielza P. Selection for resistance to pyrethroids in the predator Orius laevigatus. Pest Manag Sci. 2021;77:2539–46.

    CAS  PubMed  Google Scholar 

  110. 110.

    Ganesh KN, Vijayan VA, Urmila J, Gopalan N, Prakash S. Role of esterases and monooxygenase in the deltamethrin resistance in Anopheles stephensi Giles (1908), at Mysore. Indian J Exp Biol. 2002;40:583–8.

    CAS  PubMed  Google Scholar 

  111. 111.

    Prasad KM, Raghavendra K, Verma V, Velamuri PS, Pande V. Esterases are responsible for malathion resistance in Anopheles stephensi: a proof using biochemical and insecticide inhibition studies. J Vector Borne Dis. 2017;54:226.

    CAS  PubMed  Google Scholar 

  112. 112.

    Jyoti SNK, Singh H, Singh NK, Rath SS. Multiple mutations in the acetylcholinesterase 3 gene associated with organophosphate resistance in Rhipicephalus (Boophilus) microplus ticks from Punjab, India. Vet Parasitol. 2016;216:108–17.

    CAS  PubMed  Google Scholar 

  113. 113.

    Zhang Y, Li S, Xu L, Guo HF, Zi J, Wang L, et al. Overexpression of carboxylesterase-1 and mutation (F439H) of acetylcholinesterase-1 are associated with chlorpyrifos resistance in Laodelphax striatellus. Pestic Biochem Physiol. 2013;106:8–13.

    CAS  Google Scholar 

  114. 114.

    Zhao J, Xu L, Sun Y, Song P, Han Z. UDP-Glycosyltransferase genes in the striped Rice stem borer, (Walker), and their contribution to Chlorantraniliprole resistance. Int J Mol Sci. 2019;20 https://doi.org/10.3390/ijms20051064.

  115. 115.

    Tian F, Wang Z, Li C, Liu J, Zeng X. UDP-Glycosyltransferases are involved in imidacloprid resistance in the Asian citrus psyllid, Diaphorina citri (Hemiptera: Lividae). Pestic Biochem Physiol. 2019;154:23–31.

    CAS  PubMed  Google Scholar 

  116. 116.

    Cui X, Wang C, Wang X, Li G, Liu Z, Wang H, et al. Molecular mechanism of the UDP-Glucuronosyltransferase 2B20-like gene (AccUGT2B20-like) in pesticide resistance of Apis cerana cerana. Front Genet. 2020;11:592595.

    CAS  PubMed  PubMed Central  Google Scholar 

  117. 117.

    Pan Y, Xu P, Zeng X, Liu X, Shang Q. Characterization of UDP-Glucuronosyltransferases and the potential contribution to nicotine tolerance in Myzus persicae. Int J Mol Sci. 2019;20:3637.

    CAS  PubMed Central  Google Scholar 

  118. 118.

    Xie W, He C, Fei Z, Zhang Y. Chromosome-level genome assembly of the greenhouse whitefly (Trialeurodes vaporariorum Westwood). Mol Ecol Resour. 2020;20:995–1006.

    CAS  PubMed  Google Scholar 

  119. 119.

    Karunker I, Benting J, Lueke B, Ponge T, Nauen R, Roditakis E, et al. Over-expression of cytochrome P450 CYP6CM1 is associated with high resistance to imidacloprid in the B and Q biotypes of Bemisia tabaci (Hemiptera: Aleyrodidae). Insect Biochem Mol Biol. 2008;38:634–44.

    CAS  PubMed  Google Scholar 

  120. 120.

    Liang X, Xiao D, He Y, Yao J, Zhu G, Zhu KY. Insecticide-mediated up-regulation of cytochrome P450 genes in the red flour beetle (Tribolium castaneum). Int J Mol Sci. 2015;16:2078–98.

    CAS  PubMed  PubMed Central  Google Scholar 

  121. 121.

    Puinean AM, Foster SP, Oliphant L, Denholm I, Field LM, Millar NS, et al. Amplification of a cytochrome P450 gene is associated with resistance to neonicotinoid insecticides in the aphid Myzus persicae. PLoS Genet. 2010;6:e1000999.

    PubMed  PubMed Central  Google Scholar 

  122. 122.

    Yang T, Liu N. Genome analysis of cytochrome P450s and their expression profiles in insecticide resistant mosquitoes, Culex quinquefasciatus. PLoS One. 2011;6:e29418.

    CAS  PubMed  PubMed Central  Google Scholar 

  123. 123.

    Main BJ, Everitt A, Cornel AJ, Hormozdiari F, Lanzaro GC. Genetic variation associated with increased insecticide resistance in the malaria mosquito, Anopheles coluzzii. Parasit Vectors. 2018;11:1–9.

    Google Scholar 

  124. 124.

    Vlogiannitis S, Mavridis K, Dermauw W, Snoeck S, Katsavou E, Morou E, et al. Reduced proinsecticide activation by cytochrome P450 confers coumaphos resistance in the major bee parasite Varroa destructor. Proc Natl Acad Sci U S A. 2021;118. https://doi.org/10.1073/pnas.2020380118.

  125. 125.

    Ilias A, Lagnel J, Kapantaidaki DE, Roditakis E, Tsigenopoulos CS, Vontas J, et al. Transcription analysis of neonicotinoid resistance in Mediterranean (MED) populations of B. tabaci reveal novel cytochrome P450s, but no nAChR mutations associated with the phenotype. BMC Genomics. 2015;16:1–23.

    Google Scholar 

  126. 126.

    Bai X, Mamidala P, Rajarapu SP, Jones SC, Mittapalli O. Transcriptomics of the bed bug (Cimex lectularius). PLoS One. 2011;6:e16336.

    CAS  PubMed  PubMed Central  Google Scholar 

  127. 127.

    Zhu F, Gujar H, Gordon JR, Haynes KF, Potter MF, Palli SR. Bed bugs evolved unique adaptive strategy to resist pyrethroid insecticides. Sci Rep. 2013;3:1–8.

    Google Scholar 

  128. 128.

    Feyereisen R. Evolution of insect P450. Biochem Soc Trans. 2006;34:1252–5.

    CAS  PubMed  Google Scholar 

  129. 129.

    Pereira RR, Picanço MC, Santana PA Jr, Moreira SS, Guedes RNC, Corrêa AS. Insecticide toxicity and walking response of three pirate bug predators of the tomato leaf minerTuta absoluta. Agric For Entomol. 2014;16:293–301.

    Google Scholar 

  130. 130.

    Shi L, Shi Y, Zhang Y, Liao X. A systemic study of indoxacarb resistance in Spodoptera litura revealed complex expression profiles and regulatory mechanism. Sci Rep. 2019;9:1–13.

    Google Scholar 

  131. 131.

    Martinez-Torres D, Foster SP, Field LM, Devonshire AL, Williamson MS. A sodium channel point mutation is associated with resistance to DDT and pyrethroid insecticides in the peach-potato aphid, Myzus persicae (Sulzer) (Hemiptera: Aphididae). Insect Mol Biol. 1999;8:339–46.

    CAS  PubMed  Google Scholar 

  132. 132.

    Alon M, Alon F, Nauen R, Morin S. Organophosphates’ resistance in the B-biotype of Bemisia tabaci (Hemiptera: Aleyrodidae) is associated with a point mutation in an ace1-type acetylcholinesterase and overexpression of carboxylesterase. Insect Biochem Mol Biol. 2008;38:940–9.

    CAS  PubMed  Google Scholar 

  133. 133.

    Lueke B, Douris V, Hopkinson JE, Maiwald F, Hertlein G, Papapostolou K-M, et al. Identification and functional characterization of a novel acetyl-CoA carboxylase mutation associated with ketoenol resistance in Bemisia tabaci. Pestic Biochem Physiol. 2020;166:104583.

    CAS  PubMed  Google Scholar 

  134. 134.

    Zuo Y-Y, Ma H-H, Lu W-J, Wang X-L, Wu S-W, Nauen R, et al. Identification of the ryanodine receptor mutation I4743M and its contribution to diamide insecticide resistance in Spodoptera exigua (Lepidoptera: Noctuidae). Insect Sci. 2020;27:791–800.

    CAS  PubMed  Google Scholar 

  135. 135.

    Richardson EB, Troczka BJ, Gutbrod O, Davies TGE, Nauen R. Diamide resistance: 10 years of lessons from lepidopteran pests. J Pest Sci. 2020;93:911–28.

    Google Scholar 

  136. 136.

    Nauen R, Steinbach D. Resistance to Diamide insecticides in Lepidopteran pests. In: Horowitz AR, Ishaaya I, editors. Advances in insect control and resistance management. Cham: Springer International Publishing; 2016. p. 219–40.

    Google Scholar 

  137. 137.

    Dáder B, Colomer I, Adán Á, Medina P, Viñuela E. Compatibility of early natural enemy introductions in commercial pepper and tomato greenhouses with repeated pesticide applications. Insect Sci. 2020;27:1111–24.

    PubMed  Google Scholar 

Download references

Acknowledgements

Acknowledgements go to Dr. David R. Nelson of the International Committee on the Nomenclature for Cytochrome P450 Enzymes for classification of Orius laevigatus P450s and to Dr. Michael H. Court of the UGT Nomenclature Committee for classification of Orius laevigatus UGTs.

Funding

This research was funded by the Pest Genomics Initiative, a collaborative project between Rothamsted Research, Bayer Crop Science and Syngenta AG. Rothamsted Research gratefully acknowledges strategic funding from the UK Biotechnology and Biological and Sciences Research Council (UKRI-BBSRC). The content is solely the responsibility of the authors and does not necessarily reflect the official views of the funding bodies.

Author information

Affiliations

Authors

Contributions

The Pest Genomics Initiative (BB, CR, EG, KH-P, LF, RK and RN) devised the original conceptual ideas. EB performed the DNA and RNA extractions with assistance from MW. EB assembled and annotated the genome with guidance from RK and DH. EB performed the comparative analyses with input from RN. FM, RK and KH-P supervised the project. EB wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Emma Bailey.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

Numbers of proteins in the Orius laevigatus genome annotated by the InterPro member databases. (.docx file)

Additional file 2

The assembly statistics at each stage in the assembly pipeline of the Orius laevigatus genome. (.docx file)

Additional file 3

Phylogenetic tree of the Orius laevigatus ATP-binding cassette (ABC) transporters. (.docx file)

Additional file 4

Protein sequences of Orius laevigatus insecticide resistance genes. (.xlsx file)

Additional file 5

Phylogenetic tree of Orius laevigatus, Rhodnius prolixus and Acyrthosiphon pisum Carboxyl/cholinesterases (CCEs). (.docx file)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Bailey, E., Field, L., Rawlings, C. et al. A scaffold-level genome assembly of a minute pirate bug, Orius laevigatus (Hemiptera: Anthocoridae), and a comparative analysis of insecticide resistance-related gene families with hemipteran crop pests. BMC Genomics 23, 45 (2022). https://doi.org/10.1186/s12864-021-08249-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12864-021-08249-y

Keywords

  • Orius laevigatus
  • Pirate bug
  • PacBio
  • Illumina
  • Whole genome sequencing
  • Beneficial predator
  • Insecticide resistance
  • Comparative genomics
  • Hemiptera
  • Crop pests