- Research
- Open access
- Published:
A chromosome-level genome assembly of the soybean pod borer: insights into larval transcriptional response to transgenic soybean expressing the pesticidal Cry1Ac protein
BMC Genomics volume 25, Article number: 355 (2024)
Abstract
Background
Genetically modified (GM) crop plants with transgenic expression of Bacillus thuringiensis (Bt) pesticidal proteins are used to manage feeding damage by pest insects. The durability of this technology is threatened by the selection for resistance in pest populations. The molecular mechanism(s) involved in insect physiological response or evolution of resistance to Bt is not fully understood.
Results
To investigate the response of a susceptible target insect to Bt, the soybean pod borer, Leguminivora glycinivorella (Lepidoptera: Tortricidae), was exposed to soybean, Glycine max, expressing Cry1Ac pesticidal protein or the non-transgenic parental cultivar. Assessment of larval changes in gene expression was facilitated by a third-generation sequenced and scaffolded chromosome-level assembly of the L. glycinivorella genome (657.4 Mb; 27 autosomes + Z chromosome), and subsequent structural annotation of 18,197 RefSeq gene models encoding 23,735 putative mRNA transcripts. Exposure of L. glycinivorella larvae to transgenic Cry1Ac G. max resulted in prediction of significant differential gene expression for 204 gene models (64 up- and 140 down-regulated) and differential splicing among isoforms for 10 genes compared to unexposed cohorts. Differentially expressed genes (DEGs) included putative peritrophic membrane constituents, orthologs of Bt receptor-encoding genes previously linked or associated with Bt resistance, and those involved in stress responses. Putative functional Gene Ontology (GO) annotations assigned to DEGs were significantly enriched for 36 categories at GO level 2, respectively. Most significantly enriched cellular component (CC), biological process (BP), and molecular function (MF) categories corresponded to vacuolar and microbody, transport and metabolic processes, and binding and reductase activities. The DEGs in enriched GO categories were biased for those that were down-regulated (≥ 0.783), with only MF categories GTPase and iron binding activities were bias for up-regulation genes.
Conclusions
This study provides insights into pathways and processes involved larval response to Bt intoxication, which may inform future unbiased investigations into mechanisms of resistance that show no evidence of alteration in midgut receptors.
Background
The development of resistance to insecticidal agents by insect pests is a threat to sustainable agricultural production practices [1, 2] and human health [3]. The number of insect species with resistance to one or more insecticides has increased rapidly since the post-World War II development and subsequent widespread use of broadcast synthetic chemical agents [4, 5]. These resistant phenotypes result from mutations that disrupt the insecticidal mode of action (MoA), including those that alter the kinetics of target receptor binding, increase rates of detoxification and excretion, or cause behavioral avoidance [6]. Genetically modified (GM) cotton and maize that express one or more pore-forming 3-domain crystalline (Cry) pesticidal proteins or vegetative insecticidal protein 3A (Vip3A) derived from Bacillus thuringiensis (Bt) [7] are currently planted on most cultivated hectares in North and South America [8, 9]. Field populations of target insect pests have henceforth evolved practical resistance to crops that express Bt proteins [10]. Bt resistance in field populations has occurred despite implementation of insect resistance management (IRM) plans which rely on high-dose refuge (HDR) strategies [11,12,13]. Resistance arguably occurs when one or more assumptions of the HDR strategy are violated [14], and include instances when resistance alleles are non-recessive [15, 16], insects feed on plant tissues with a dose of Bt that does not cause mortality among heterozygous genotypes carrying one resistance allele resulting in functional resistance [17,18,19], or resistant phenotypes lacked fitness costs that were proposed to delay the onset of widespread resistance [20, 21]. These factors raise concerns for the long-term sustainability and efficacy of Bt technology [22].
The proposed MoA for Cry1A proteins includes the sequential binding model where pesticidal proteins bind membrane-anchored midgut receptors that potentiate the formation of pore channels that in-turn cause cell swelling and death via osmotic imbalance [23, 24]. Additionally, a signal transduction model proposes intracellular signaling and induction of the protein kinase A (PKA) pathway causes cell death (apoptosis) [25, 26]. Among species of Lepidoptera, resistance in selected laboratory colonies and field populations is associated with structural or functional changes in midgut receptors [27], including ATP binding cassette (ABC) transporters [28, 29], cadherin, tetraspanin, alkaline phosphatase and aminopeptidase N [30]. Genes in the mitogen-activated protein kinase (MAPK) pathway also have been linked or associated with resistance [31,32,33,34], wherein this pathway may impact the expression of Bt receptors themselves [35]. Additional evidence suggests that up-regulation of genes encoding constituents of insect peritrophic membranes may decrease susceptibility when exposed to Bt proteins [36, 37] or Bt bacterial infections [38]. Additionally, increased proteolytic degradation of Bt in the gut or repair mechanisms have been proposed [30].
The efficacy of the HDR strategy and other IRM tactics may arguably be enhanced through the greater understanding of the molecular mechanisms involved in resistance [1]. Furthermore, cellular responses to Bt intoxication even in susceptible insects may provide insights into Bt MoA and points at which disruption might lead to resistance [39]. Among such comparisons between susceptible or resistant insects exposed or unexposed to Bt proteins, hundreds or thousands of transcripts are often differentially expressed [39,40,41,42,43,44,45,46,47]. In some instances, differentially expressed transcripts are enriched for those putatively functioning in general stress response pathways (e.g. cytochrome P450 monooxygenases, esterases, oxidases, and peroxidases) and having transporter function [45], involved in cell survival mechanisms [39] or cell repair and immune function [47]. Additionally, orthologs and paralogs of genes putatively functioning as Bt receptors in the insect midgut (cadherin, aminopeptidases N and ABC transporters) are differentially expressed between Bt intoxicated susceptible insects compared to unintoxicated cohorts [39, 43, 45]. Role of these genes in response to intoxication remains uncertain, but may be informed following further studies [39].
The soybean pod borer, Leguminivora glycinivorella (Matsumura) (Lepidoptera: Tortricidae), is a destructive agricultural pest insect of cultivated soybean, Glycine max (Fabales: Fabaceae) (L.) [48]. This insect has a univoltine life cycle with mature 4th instars that enter the soil and form cocoons in hollowed chambers during mid- to late-September wherein they diapause. After approximately 8–9 months larvae pupate and emerge as adults in early summer [49]. Female oviposition and larval L. glycinivorella feeding primarily occurs on a narrow range of leguminous plants including G. max, wild soybean, G soja, and the shrubby relative of pea, Sophora flavescens. Larval feeding primarily occurs on the pods of G. max, which causes substantial reductions in grain yield and quality [50, 51]. High infestations can lead to 10–30% yield loss and exceed 50% during severe outbreaks [52]. The endemic geographic range of this species includes Japan and Korea, and across eastern China. The most severe outbreaks tend to be in the northeastern Heilongjiang, Jilin, and Liaoning Provinces, which coincides with areas where approximately two-thirds of soybeans are grown in China [53], making L. glycinivorella a primary pest of this crop. Although foliar insecticides are applied, effective contact and reduction in feeding damage is difficult to achieve due to protection of larva under G. max leaf canopies and within pods [54]. Use of systemic insecticides can lead to more consistent reductions in L. glycinivorella feeding damage, but ecological concerns limit their use especially given evidence of impacts on pollinator health [55].
Cultivars of G. max selected for host plant traits can reduce levels of L. glycinivorella feeding damage [56], but do not always prevent significant economic loss to growers. Transgenic expression of insecticidal molecules by G. max show promise as an L. glycinivorella control tactic. For instance, damage is reduced on plants that express double stranded RNA (dsRNA) that elicits an RNA interference (RNAi)-based knockdown of a serine protease in L. glycinivorella larvae [57, 58]. Pesticidal Bt proteins expressed by transgenic G. max reduce damage by several lepidopteran pests [59,60,61]. Commercial use of Bt cotton expressing the Bt Cry1Ac protein was approved by China in 1997 [62]. Since 2016 Chinese government directives aim to commercialize GE cotton, maize and soybean for domestic use [63] and has culminated in recent changes that allow for their widespread use [64]. Bt IRM strategies for cotton pests have been established in China [65]. Investigations into non-target effects and ecological assessments of Bt crops [66] and target insect baseline Bt susceptibilities have also been conducted pre-commercialization for many crops in China [67]. However, investigation into potential for Bt resistances among many target pests is lacking.
In this study, we used L. glycinivorella as a model to determine the effects of Bt intoxication on larval gene expression following exposures to a transgenic G. max cultivar that expresses the Bt Cry1Ac pesticidal protein [61], informing future research into Bt MoA and potential mechanisms of resistance. This was accomplished by predicting differentially expressed genes (DEGs) between susceptible L. glycinivorella larvae feeding for 2 days on Bt Cry1Ac compared to conventional non-Bt cultivars. An annotated chromosome-level genome assembly for L. glycinivorella was generated and provided resources to investigate transcriptional effects of sublethal Bt exposures. Changes in gene expression among target insects following exposure to Bt transgenic crops, such as those documented in this study, provide insights into cellular responses. Such insight may also inform future research regarding potential adaptive mechanisms that lead to field-evolved resistance.
Results
Genomic library construction, sequencing, and read filtering
Output from the Oxford Nanopore PromethION P48 provided 6.4 million reads spanning 66.6 Gb of raw read data for library LglyNP generated from a single male larva collected in Jilin Province, China (Fig. 1) (Table S1; N50 = 19.6 kb L50 = 1.1 M), of which 61.6 Gb among 5.7 million reads remained post-filtering (N50 = 10.8 kb L50 = 1.0 M). A high proportion of short reads failed to surpass the q > 7 cutoff (Fig S1A), but the number of filtered bases was less skewed (Fig S1B). The final set of filtered reads had a mean q = 12.35 (Fig S1C) with read lengths up to 246.3 kb. A total of 386.7 M short 150 bp PE reads were generated covering 58.0 Gb of nucleotide data (Table S1) of which 97.29% had a q ≥ 20. All raw genomic read data are available in the NCBI Sequence Read Archive (SRA; Table S1) under BioProject PRJNA759210.
K-mer based genome size estimates
Both GCE and GenomeScope2 output showed a frequency distribution with a large number with low coverages at kmer = 19 representing sequencing error within the short Illumina PE reads, along with a homozygous (1 N) peak at a kmer depth of 60 (Fig. 2). The greater frequency of different kmers in the peak at a depth of 36 represented the heterozygous portion for this diploid organism. The two additional peaks at 92 and 126 corresponding to duplicated heterozygous and homozygous fractions, respectively. Genome size estimates differed between GenomeScope2 (587.2 Mb; Fig. 2A) and GCE (652 Mb; Fig. 2B) while percentage unique sequence remained consistent at 47.8% (52.8% repetitive). An estimated heterozygosity of 2.05% was obtained from both GenomeScope2 and GCE.
Contig assembly
A final set of 225 contigs were obtained from the initial NextDenovo assembly of Illumina and Oxford Nanopore reads after duplicates were purged and error corrected using PE short read data. Contigs in this 657.4 Mb L. glycinivorella haploid assembly, ilLegGlyc1.0, ranged from 0.13 to 37.63 Mb in size with an N50 of 4.2 Mb (Table 1) and contained no gaps (Ns). Alignment of filtered short PE 150 bp reads to these 225 contigs resulted in 99.74% coverage at a mean depth of 85.48-fold among the 99.3% of reads that aligned. BUSCO assessment of genome completeness as gauged by representation of the lepidoptera_odb10 set of orthologs (Table 2) indicated 92.9% were complete [91.0% single copy (S) and 1.9% duplicated (D)] and 0.8% fragmented BUSCOs (F), while 6.3% were missing (M).
Scaffolding
Long range contacts predicted from Hi-C data aligned to the initial 225 contigs resulted in breaking of 11 misjoins in ilLegGlyc1.0. This refined set of 237 contigs were then joined into 40 scaffolds (NCBI accessions JAKXMO010000001-JAKXMO010000040) under the L. glycinivorella WGS Project JAKXMO01 of which 28 scaffolds were assigned to chromosomes (Chr; CM041121-CM041148; 27 autosomes + Z chromosome) and corresponding RefSeq chromosomes (NC_062971- NC_062998.1; Table S2; Fig. 3A). This chromosome-level scaffolded L. glycinivorella haploid assembly, ilLegGlyc1.1, had an N50 and longest scaffold (largest chromosome) of 25.3 Mb and 55.4 Mb, respectively (Table 1). Among genes in the BUSCO lepidoptera_odb10 set, scaffolds in ilLegGlyc1.1 contained 93.0% complete (91.5% S and 1.5% D) and 0.8% fragmented BUSCOs (F), while 6.2% were missing (M) (Table 2). BUSCO scores for ilLegGlyc1.1 were similar to those for assemblies from other Tortricid moths. This final scaffolded assembly, ilLegGlyc1.1, was submitted to DDBJ/ENA/GenBank under BioProject PRJNA759210 with assembly accession GCA_023078275.1 (Locus tag prefix: K7X69).
Whole genome alignment between scaffolded assemblies for L. glycinivorella, ilLegGlyc1.1 and C. splendana, ilCydSple1.2 contained 5,651 alignments (Table S3), of which 5,531 alignments had a one-to-one correspondence between chromosomes from the two assemblies (proportion ≥ 91.5% across chromosomes; range 91.5 to 100.0%) (Table S4). These alignments spanned a total of 9.75 Mb, with a range of 0.06 to 0.80 Mb between chromosomes, and corresponding mean identities and lengths of 1519.54 ± 613.66 bp and ≥ 87.90 ± 2.52%, respectively (Table S4). The number (count) of these one-to-one alignments were generally associated with corresponding chromosomal lengths, with exception of ilLepGlyc1.1 chr06 and 07 with ilCydSple1.2 chr05 and chr08, respectively (Figure S2). These results were used to propose putative orthology among 27 autosomes and the Z chromosome from assemblies ilLepGlyc1.1 and ilCydSple1.2 (Fig. 3B; Figure S2). Specifically, putative orthology between ilCydSple1.2 OU342871.1 (Z chromosome) and a single ilLegGlyc1.1 scaffold, NC_062998.1, was based on 467 alignments of 1711 ± 939 bp with 89.5 ± 3.0% identity spanning 0.8 Mb. This evidence was used to define NC_062998.1 as the Z chromosome in the L. glycinivorella Refseq assembly (GCF_023078275.1; scaffold accession JAKXMO010000028.1; chromosome accession: CM041148.1).
Genome annotation
Based on ab initio and evidence-based predictions, 18,197 RefSeq gene models were constructed by the NCBI automated Eukaryotic Genomic Annotation Pipeline (Annotation Release 100; https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Leguminivora_glycinivorella/100/; Table 1), among which 15,681 and 2,382 were protein coding and non-coding, respectively. The corresponding 26,672 RefSeq transcript models encoded 23,735 mRNAs (Table 1), with the remaining including long non-coding RNAs (lncRNAs; n = 1,042), tRNAs (n = 912), rRNAs (n = 75), snRNAs n = 51), and snoRNAs (n = 17). Among the 23,735 mRNAs, 21,155 (89.1%) were fully-supported by our RNA-seq evidence (Table S1; 485.3 million reads) and/or from read data generated in other experiments (BioSample IDs: SAMN07498241 and SAMN07498246 to SAMN0749824; 314.7 million reads). NCBI performed internal corrections of putative artifactual premature stop codons, frameshifts and insertion/deletions for 959 mRNAs, and 59 mRNAs remained as partial or incomplete sequences (remaining data not shown). WindowMasker predicted 48.7% of ilLepGlyc1.1 (assembly accession GCF_023078275.1) as repetitive DNA.
Larval transcriptome response to feeding on transgenic G. max expressing Cry1Ac
Preprocessing filters removed transcript alignments with low read coverage and representation across replicates, and resulted in “smoothed” mean variance plots at gene (Figure S3A) and transcript levels (Figure S3B). Subsequent read counts normalized by log2(CPM) reduced inter-specific variance for the remaining 13,092 genes (Figure S4) and 18,561 transcripts (Figure S4B). A majority of the 13,092 gene models (n = 10,077; 77.0%) had single assigned transcripts. Principal Component Analysis (PCA) of mean log2(CPM) values, following correction for outlier effects, showed intraspecific clustering of replicates within Cry1Ac and non-Bt treatments at the gene level (Fig. 4A), where PC1 and PC2 accounted for 41.24 and 25.62% of the variance, respectively. From these estimates of read abundances among the 13,092 filtered gene set, there was a range of -12.40 ≥ Log2(fold-change) ≤ 14.47 between larval L. glycinivorella Cry1Ac and non-Bt treatment groups (Table S5a). A total of 204 genes were predicted to show significant DGE (Fig. 4B; Table S5b), and 10 genes to have significant DAS (Fig. 4C; Table S5c). Among the significant DGE predictions, 64 and 140 were up- and down-regulated (Fig. 5), respectively. The gene LOC125237008, a putative GTP cyclohydrolase, with significant DGE was also predicted to have significant DAS between encoded transcripts XM_048143924.1 and XM_048143925.1. Additionally, 12 significant DTU events were predicted (Table S5d), which by nature of their definitions, encompassed transcript isoforms assigned to the 10 genes with significant DAS (Table S5e; Fig. 5). Transcripts from 5 genes were predicted to show both DAS and DTU. DAS between transcripts XM_048143924.1 and XM_048143925.1 from LOC125237008 was based on DTU, and only predicted with DGE, DAS and DTU. Furthermore, transcripts XM_048149292.1 and XM_048149297.1 from LOC12524102, a probable mitochondrial zinc-binding oxidoreductase, and transcript XM_048149794.1 from LOC125241351 encoding a putative metalloprotease tolloid (TLD) domain-containing protein showed only significant DTU compared to other transcripts derived from respective loci.
There were 104 unique Pfam domains predicted in the 204 (PfamScan E-values ≤ 1.0 × 10–11; remaining data not shown), from which 203 associated curated GO terms were retrieved across CC, BP and MF categories from level 1 to 4 (Table S6). These encompassed 2, 22, and 12 CC, BP and MF categories at GO level 2, respectively (Table 3). The most significantly enriched terms for CC and BP categories were vacuolar part and microbody, and secondary metabolic and organic hydroxy compound metabolic processes, respectively. Most significant enrichment was predicted for fatty acid synthase activity and oxidoreductase activities among MF terms. Simultaneous assessment of significance, directional bias of DEGs (up- or down-regulated), and number of DEGs in each enriched GO category (Fig. 6) showed that BP GO:0008610 (lipid biosynthetic process), GO:0015849 (organic transport), GO:0032787 (monocarboxylic acid metabolism), and GO:0046394 (carboxylic acid biosynthesis) are most significantly enriched and represent groups of genes biased for reduced expression. The MF terms GO:0003924 (GTPase activity) and GO:0005506 (iron binding) contained genes with a positive (increased) expression bias, whereas DEGs in GO:0004312 (fatty acid synthase activity) and GO:0016903 (oxidoreductase activity) were mostly down-regulated in Cry1Ac exposed larvae. Six genes in enriched GO categories are putatively in cytochrome P450 superfamilies, all of which were significantly down-regulated. Genes with putative function in chitin maintenance were significantly up-regulated [chitinase (LOC125235932, LOC125235952, and LOC125235918 Table S7a-c); and deacetylase activities (LOC125236753)]. Terms for DEGs encoding constituents of the peritrophic membrane and cuticle [peritrophin-1-like (LOC125238640); and cuticle protein LCP-17-like (LOC125230076)], although their common MF term, chitin binding (GO:0008061)] were not among those that were significantly enriched.
Additionally, some genes previously identified as involved in Bt resistance among species of Lepidoptera showed significant DGE (Table 4; Table S5b; Table S7a-c). These included two ABC transporters in subfamily C and G members and 4 tetraspanin orthologs (Table 4), all of which were assigned to significantly enriched GO categories. Significant DGE was also predicted for an up-regulated membrane-bound aminopeptidase N (apn) and down-regulated alkaline phosphatase (alp) orthologs. The L. glycinivorella gene, LOC125230228, encoding two RefSeq annotated alp transcript isoforms, XM_048135319.1 and XM_048135320.1, was significantly down-regulated ion in Cry1Ac exposed larvae (Log2FC = -2.10; FDR = 0.0136; Table S5a). An amino acid sequence alignment with Heliothis virescens mALPs (HvmALP) predicted L. glycinivorella mALP protein XP_047991276.1 (isoform X1) encoded by XM_048135319.1 and XP_047991277.1 (isoform X2) encoded by XM_048135340.1, had 48.12 and 46.99 percent identity with HvmALP1 and HvmALP2, respectively (remaining results not shown).
Phylogenetic reconstruction of the aminopeptidase N gene family among orthologs from B. mori, L. glycinivorella, and O. nubilalis supported 18 putative clades, of which 16 corresponded to those in B. mori (Figure S5). The two additional clusters corresponded to putative L. glycinivorella-specific APN03- and APN07-like clades. The APN07-like clade contained aminopeptide N-like proteins XP_48003237.1 and XP_48003238.1 from LOC125242528, where LOC125242528 was significantly up-regulated in Cry1Ac exposed L. glycinivorella larvae (Log2FC = 2.06; FDR = 0.0167; Table S5a).
Discussion
Chromosome-level genome resources
Investigation of Bt intoxication in this study was facilitated by L. glycinivorella genomic resources comprised of a structurally annotated chromosome-level genome assembly. The 27 autosomes and a Z-chromosome in the scaffolded assembly, ilLegGlyc1.1, is expected based on karyotype data suggesting the ancestral chromosome number (N) among Lepidoptera in the Family Tortricidae, N = 30. This chromosome number is reduced to 28 in Subfamily Olethreutinae [68] to which L. glycinivorella belongs (Taxomony ID 10351111). Due to the ZW/ZZ female/male sex determination system of Lepidoptera, the homogametic male assembly ilLegGlyc1.1 lacks a W-chromosome, as does assemblies from other tortricids (Table 2). Although not investigated here, the Z-chromosome of tortricid moths is of interest due to predicted ancestral autosomal fusion [69], and involvement in speciation [70] and ecotype variation among Lepidoptera [71]. Nearly 90% of the 23,735 NCBI RefSeq gene annotations for ilLegGlyc1.1 are supported by transcript evidence (Table 1), and provides a resource for current and future research.
Transcriptional responses to transgenic Bt pesticidal plant exposure
The development of phenotypes in insect populations that are resistant to Bt insecticidal proteins reduce the efficacy of control strategies and pose a threat to the sustainability of current agricultural production practices. Mechanism(s) of lepidopteran resistance to Bt crystalline (Cry) insecticidal proteins are reported to involve structural or functional changes to the proteins that mediate binding and pore formation in the gut, alter proteolytic cleavage of native Bt toxins, or increased immune, cellular regeneration, and toxin sequestration capacities [30, 72,73,74,75]. Additionally, intracellular protein kinase signaling pathways are involved in Bt MoA [25] and are altered in resistance insects [34]. The role of mutations in genes tetraspanin [76] and kinesin [77] in Bt Cry1Ac toxin resistant Helicoverpa sp. and MoA of Bt intoxication remain enigmatic. Even though Bt Cry1A proteins may interact with different midgut receptor proteins compared to Cry1F [78], and the Bt vegetative insecticidal protein, Vip3, interacts with a different set of midgut receptors compared to Cry1 proteins [79, 80], a meta-analysis provided evidence for weak cross resistance between Cry1 and Vip3A [81]. These prior studies suggest that Bt MOA and points wherein disruption(s) lead to resistance are not yet fully understood, which could be assisted by further investigation of molecular and cellular impacts of Bt intoxication [37, 39]. Specifically, this would involve expanding knowledge of the molecular interactions and pathways that invoke cell death or recovery and repair following Bt intoxication, which ultimately determine organismal survival or mortality, and be informed by molecular responses of susceptible insects when exposed to Bt proteins.
In our study, relatively few genes (n = 204) were predicted to be significantly differentially expressed between L. glycinivorella larvae feeding on Cry1Ac (cultivar GP03-8–23) and a non-Bt cultivar compared to analogous results from prior studies involving species of Lepidoptera. Specifically, prior analyses predicted thousands of DEGs between Bt and non-Bt exposed lepidopteran larvae [41,42,43, 47]. Also, despite 104 of these 204 predicted DEGs being significantly down-regulated in Cry1Ac exposed L. glycinivorella (68.6%), there was a bias for down-regulation among genes assigned to significantly enriched GO categories (Fig. 6; 89.4% (17 of 19), 78.3 (54 of 69), 78.8% (78 of 99) in enriched CC, MF, and BP categories, respectively). Only exceptions were genes in MF categories GO:0003924 (GTPase activity) and GO:0016247 (iron ion binding) that were weakly biased for up-regulation. Although not investigated further, differences with prior studies in Lepidoptera may be influenced by different larval exposure times, level of exposure (dose), or other unaccounted for factor(s). Due to use of on-plant treatments (exposures) in our study, the absolute dose remains unknown compared to prior reports where exposures were to known concentrations of purified Bt protein in artificial diet bioassays [41,42,43, 47]. Regardless, clustering of PCA of Log2CPM estimates suggests relative responses that are homogeneous within and heterogeneous between L. glycinivorella larval treatment groups (Fig. 4A), indicating on-plant assays invoke a reproducible molecular response. It remains possible that transcriptional responses to undescribed differences in host plant resistance (HPR) factors between Bt and conventional cultivars may overlap with responses to Bt Cry1Ac exposures or be induced in both our treatments. This confounding factor could relegate a portion of genes involved in general response to Bt and unrelated HRP traits as non-significant and may be partially explanatory of the relatively small number of DEGs predicted in this study.
Changes in pathways associated with cell stress responses
Our results provide insights into the effects of feeding on transgenic Bt crops on gene expression by a susceptible pest insect and suggest a set of putative genes and pathways may be involved in physiological responses. Although functional GO annotations are extrapolated from model organisms, they are yet to be validated in L. glycinivorella. Enriched CC, BP and MF categories offer a baseline for interpreting potential systemic effects of feeding by susceptible insects on Bt compared to non-Bt plants. For instance, the MF category oxidoreductase activity was the second most significantly enriched GO category among L. glycinivorella DEGs (Table 3), agreeing with prior evidence that pathways which remediate oxidative stress are affected by Bt intoxication [82]. Genes encoding cytochrome P450 monooxygenases were down-regulated in Cry1Ac exposed L. glycinivorella, suggesting further cellular efforts to reduce the accumulation of reactive oxygen species (ROS). A number of BP categories for metabolic processes are modulated after Cry1Ac exposure. Metabolic changes were also predicted in response to oxidative stress in H. armigera [83], and more generally following exposures to insecticidal toxins [39, 84, 85]. Cellular repair and survival mechanisms are also accompanied by increased metabolic rates [86, 87] and are mechanisms by which Cry1Ac resistant H. armigera respond to Cry1Ac exposure [47]. These lines of evidence may support a hypothesis that changes in metabolic processes predicted in this study could be a consequence of or in support of stress response and repair mechanisms induced by Cry1Ac exposure in susceptible L. glycinivorella larvae.
The two enriched CC categories in our study, vacuole part and microbody, interestingly were also significantly enriched among genes down-regulated in larvae of the coleopteran D. v. virgifera following exposures to transgenic Bt Cry3Bb1- and Tpp34/Gpp35Ab1 maize [39]. This commonality could point to involvement of recycling damaged cell components and adaptation to stress in Bt exposure responses. Specifically, microbodies were described as a suite of single membrane-enclosed organelles [88] that encompass lysosomes and peroxisomes [89]. Peroxisomes are involved in lipid biosynthesis and house oxidative reactions that form peroxides [90], and are self-replicating and provisioned by importation of enzymes. Lysosomes are endosomal vesicles produced by the trans-Golgi network that function in autolysis of intracellular components and houses machinery involved in autophagy, a conserved eukaryotic mechanism that degrades and recycles damaged cellular components to maintain homeostasis [91]. For terminology’s sake, the vacuole (yeast and plants) is synonymous with the lysosome (animalia) [92]. Aspects of cell stress are mediated by functions of lysosomes [93] and peroxisomes [94], which may include autophagy as a response to Bt infection or toxin exposure [95, 96].
Other studies implicate an apoptotic response to Bt toxins [39, 82, 97, 98]. Similar to a prior study in D. v. virgifera [39], a putative lifeguard-1-like protein (LG1) with a B-cell-lymphoma protein 2 (Bcl-2)-associated X (BAX) inhibitor (Bax1-I) domain was significantly up-regulated in L. glycinivorella response to Bt exposure. Bax1-I domain proteins are involved in the negative regulation of apoptotic signaling pathways through the remediation of cell stress, where lifeguard does so by reduction of stress in the Golgi and endoplasmic reticulum [99]. In our study, LG1 is encoded by L. glycinivorella LOC125231679 and is significantly up-regulated following Cry1Ac exposure. When considered in conjunction with modulation of pathways that remediate damage by ROS, LG1-mediated suppression of apoptosis may indicate survival mechanisms are engaged following Bt-induced cellular damage [39], and similarly occur among Cry1Ac-exposed L. glycinivorella larvae. Granted these general conclusions are drawn upon comparisons among a relatively limited set of experiments, but meta-analyses of these and future studies may shed light on conserved underlying pathways involved in Bt intoxication and potentially inform steps whereby resistance might develop.
Differential-regulation of putative Bt binding protein coding genes
Proteins in the midgut of Lepidoptera function as receptors in a sequential binding model for pore-forming Cry1A pesticidal proteins [23, 24, 100]. Interestingly, genes encoding orthologs of some of these receptors show significant differential expression between susceptible L. glycinivorella fed on Cry1Ac transgenic G. max compared to those fed on a non-Bt cultivar (Table 4). It should be cautioned that changes in transcript levels are not necessarily reflected in corresponding protein levels, and latter have not been validated. Regardless, two transcripts encoding putative glycosylphosphatidylinositol (GPI)-anchored membrane-bound Bt receptor proteins, alkaline phosphatase (mALP) and aminopeptidase (APN), are differentially regulated. Specifically, we predicted that a L. glycinivorella malp is significantly down-regulated in Cry1Ac exposed larvae (Table S5a), and that two RefSeq protein isoforms encoded by L. glycinivorella malp are orthologous to H. virescens HvmALP1 and HvmALP2 that show a strong correlation between reduced transcript and protein levels in Cry1Ac resistant strains of H. virescens [101]. Additional evidence shows that mALP is a functional receptor for Cry1A and Cry2Ab2 proteins [102], but not Vip3A [103], where N-acetylgalactosamine modification is required for Cry1Ac binding [78]. Furthermore, constitutively reduced mALP or malp levels are documented in Cry1Ac resistant laboratory strains of Helicoverpa armigera and Spodoptera frugiperda [101], Cry1Ab resistant Ostrinia nubilalis [43] and Cry1C resistant Chilo suppressalis [104], but not in Cry1Ab resistant Diatraea saccharalis [96]. Ours is the first known report of induced malp down-regulation following Cry protein exposure among Lepidoptera, but expression of several alp gene family members decreased in Aedes aegypti following intoxication with insecticidal Bt subsp. israelensis [105]. The role or consequence of these changes following Bt intoxication in L. glycinivorella and other insects remains unknown.
Our results also predicted the significant up-regulation of gene LOC125242528 in Cry1Ac cultivar exposed larvae (Table 4), which encodes a membrane-bound alanyl aminopeptidase (APN). Phylogenetic reconstructions among members of this gene family in lepidopteran species predict 8 [106] or 16 clades [107]. Our corresponding interspecific phylogenetic reconstruction predicts 16 major clades corresponding to those in B. mori, wherein L. glycinivorella gene expansion is most abundant in an APN05 clade with three paralogs as compared to one in each B. mori [108] and O. nubilalis [109] (Figure S5). Protein products from LOC125242528 cluster adjacent to members of an APN07 clade that also includes products from a L. glycinivorella APN-encoding gene, LOC125242529. Our phylogenetic analysis also indicates that the up-regulated L. glycinivorella APN7-like gene LOC125242528 is not orthologous with APN1 and APN3, where one or both apn1 and apn3 are down-regulated in Cry1A resistant Helicoverpa armigera [110], Trichoplusia ni [111], O. nubilalis [112], O. furnacalis [44], and Cry1C resistant S. exigua [113]. Although some APN orthologs are functional Cry1A receptors [114], confer resistance in knock-out strains of Plutella xylostella [115], and interact with oligomerized Bt protein pre-pore structures to facilitate membrane insertion [23, 24], APNs have a diversity of physiological roles or are not expressed in gut tissues [107]. Specifically, apn1, 2, 4, 5, 6, 9, and 12 are expressed in midgut of larval B. mori, and all of these but apn12 are up-regulated at 6 h post-infection with a strain of Bacillus bombyseptieus [107]. Initial apn transcript up-regulation similarly occurred in analogous experiments, but significantly down-regulated by 24-h post B. bombyseptieus infection [116]. Our study shows a putative L. glycinivorella APN07-like transcript being significantly up-regulated 48-h post exposure to Cry1Ac-expressing G. max. Since there is no evidence of this APN07-like protein or orthologs functioning as Bt receptors, observed changes could be indicative of proteolytic changes involved in other cellular processes [117, 118]. Although not investigated further, differences in expression among APN gene family members following Bt exposure may reside in differences in exposures times and method of protein exposure (bacteria, transgenic plant, or artificial diet incorporated).
We predicted that the expression of an ABC transporter from subfamily C (ABCC) was reduced in Cry1Ac G. max exposed L. glycinivorella larvae (Table 4). ABC transporters are implicated in Bt resistance following discovery of an insertional knockout of ABCC2 linked to Cry1Ac resistance in H. armigera [119] and subsequent associations shown in other species [30, 120]. Interestingly, down-regulation of abcc2 and abcc3 was associated with Cry1Ac resistance in Plutella xylostella [121], where these paralogs show functional redundancy as Bt receptors in H. armigera [122]. Despite this evidence, the ABCC subfamily member down-regulated in L. glycinivorella Cry1Ac-exposed larvae was assigned as a member 4 ortholog (ABCC4), which has no evidence as being a functional Cry1Ac receptor among species of Lepidoptera. Regardless, ABC transporters function the eflux of various cellular substrates [123] including metabolites from xenobiotic breakdown in insects [28], and are involved in mammalian stress-response pathways [124]. The down-regulation of ABCC4 in L. glycinivorella Cry1Ac-exposed larvae remains perplexing, but was not investigated further in this study.
Tetraspanin paralogs are significantly up- (n = 3) or down-regulated (n = 1) in Cry1Ac-exposed L. glycinivorella compared to unexposed cohorts (Table 4). Tetraspanins are an evolutionarily conserved gene family of proteins structurally composed of four transmembrane domains, and intra- and extracellular domains [125]. Binding of different ligands to a unique extracellular domain of each paralog transduce signals responsible for cellular responses such as migration, adhesion, and intracellular trafficking [88]. There are 36 paralogs in the D. melanogaster genome [126], and functional redundancies among paralogs have made it difficult to resolve independent roles in cellular processes [125, 127]. Mutational change at an amino acid position (L31S) in the first transmembrane domain of an H. armigera tetraspanin, HaTSPAN1, is linked to Cry1Ac resistance, and although HaTSPAN1 transcript levels are also reduced 2.7-fold there is no alteration in Cry1Ac binding [76]. Transcripts encoding tetraspanin orthologs were also up-regulated among susceptible D. v. virgifera larvae feeding on transgenic Cry3Bb1 and Gpp34/Tpp35Ab1 maize [39]. Although members of the tetraspanin gene family are involved in insect immune response [128] and recent evidence of differential regulation of immunity genes in Cry1Ac exposed strains of H. armigera [47], there is no analogous evidence from our study to suggest tetraspanin effects on immunity (Table 3; Table S5b). Thus, the role of tetraspanins in larval L. glycinivorella response to transgenic Cry1Ac G. max remains unknown.
It should be noted that there is a functional difference between Bt binding proteins and Bt receptor, where the latter is defined as proteins that facilitate membrane insertion [129]. Within this framework, ABC transporters and cadherin are purported to be major and “accessory’ functional receptors, respectively. Thus, changes in these two receptors my incur changes in resistance. Despite this, there instances where resistance has evolved outside of modulation of known receptor interactions [30, 76, 77], suggesting that gains in understanding of the Bt MoA might facilitate deciphering those mechanisms involved.
Materials and methods
Genomic library construction, sequencing, and read filtering
A hybrid short- and long-read approach was used to assemble L. glycinivorella contigs. For this, greater than 20 live 4th instar larvae were collected from a cultivated non-Bt G. max field at the Jilin Academy of Agricultural Sciences, Northeast Agricultural Research Center (JAAS-NARC), Gongzhuling, Jilin Province, P.R. China on 20 August 2020 (BioSample SAMN21160035; Isolate SPB_JAAS2020; Table S1). Samples were brought to the lab and frozen alive in a -80 ℃ freezer. A subset of samples then shipped on dry ice to Benagen Tech Solutions Ltd. (Wuhan, China) where genomic libraries were constructed and sequenced. In brief, total genomic DNA was extracted from a single male L. glycinivorella larva (larva #1; Table S1) using an in-house modified CTAB protocol, quality checked by 0.75% agarose gel electrophoresis, and quantified on a Qubit 3.0 Fluorometer (Life Technologies, Carlsbad, CA, USA). Genomic DNA was selected for fragments > 15 kb using AMPure XP beads (Beckman-Coulter Life Sciences, Indianapolis, IN, USA), repaired using the NEBNext FFPE DNA Repair Mix (New England Biolabs, Ipswich, MA, USA), and used as input for the SQK-LSK109 Ligation Sequencing Kit according to manufacturer instructions (Oxford Nanopore, Oxford, UK). Following quantification and purification, the large-insert library was loaded into R9.4 Spot-On Flow Cells and run on a PromethION P48 Sequencer (Oxford Nanopore). GUPPY v 4.0.2 (Oxford Nanopore) was used to convert image files to base calls and remove failed reads from the raw data, then trim reads of adapters and those with a Phred quality score (q) < 7.
Remaining DNA (< 15 kb) from L. glycinivorella larva #1 (Table S1) was sheared and fragments used to construct an indexed short insert library with the Nextera DNA Flex Library Prep Kit according to manufacturer instructions (Illumina, San Diego, CA, USA). Library quality was assessed on an Agilent 2100 Bioanalyzer Instrument (Agilent Technologies, Santa Clara, CA, USA), and quantitative PCR (qPCR) was used to estimate effective concentration as described previously [130]. Then 150 bp paired end (PE) reads were generated from the short-insert library on a single S4 flow cell lane of a NovaSeq 6000 (Illumina). Fastq formatted short read data was trimmed of adapters and sequence with low quality (q < 5), and PCR duplicates removed using in-house scripts at Benagen Tech Solutions Ltd.
K-mer based genome size estimates
A kmer depth-frequency distribution was generated for short Illumina reads using kmer_freq from which the mean heterozygosity, repetitive fraction, duplication level, and genome size were estimated using the Genome Character Estimator (GCE) v.1.0.0 (https://github.com/fanagislab/GCE) [131] with default settings. This k-mer distribution was used as input to estimate genome size using the Lander-Waterman equation [132] and other parameters the R script GenomeScope2 [133].
Contig assembly
The string graph-based de novo assembler for long reads, NextDeNovo v.2.5.0 (GrandOmics, Beijing, China), that applies an initial internal error correction of long reads prior to assembly, was used with default settings to generate a primary contig assembly. The resulting contigs were subjected to two successive rounds of error correction (e.g. “polishing”) with short Illumina PE read data (Table S1; post-filtered) using default settings of Racon v.1.4.11 (https://github.com/lbcb-sci/racon), followed by two additional rounds of polishing using Pilon v.1.23 [134]. The final draft contig assembly was produced following removal of duplicated contigs with Purge_haplotigs v.1.0.4 [135] with input generated from the long read aligner, minimap2 (https://github.com/lh3/minimap2) [63]. Estimated coverage and completeness was assessed by mapping rate of aligned filtered short Illumina PE reads to the draft L. glycinivorella contig assembly using bwa v.0.7.17-r1188 [136] with default settings, and coverage depth defined using the -depth command of SamTools v.1.9 (https://github.com/samtools/samtools) [137]. Representation and completeness of the contig assembly was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO) v.5.2.2 against the set of 5,286 orthologs in lepidoptera_odb10 [138,139,140] using default parameters with the MetaEuk search algorithm [141].
Scaffolding
Chromatin Conformation Capture (Hi-C) libraries were prepared by Benagen Tech Solutions Ltd. (Wuhan, China) according to methods described previously [142]. For this, chromatin was cross linked to DNA by formaldehyde treatment in vitro, then DNA digested with DpnII, and the 5’ overhangs blunted by DNA pol I Kelnow fragment incorporation of biotinylated nucleotides. Following ligation of proximal blunted DNA ends and removal of histone crosslinks, genomic DNA was purified to remove protein and unincorporated biotinylated nucleotides. These purified ligation products were then sheared into 350 to 700 bp fragments, and short read libraries constructed using the NEB Next Ultra Library Prep kit (New England BioLabs, Ipswich, MA, USA), followed by purification using streptavidin beads and PCR enrichment. The library was quantified using Quant-iT™ PicoGreen™ dsDNA Assay Kit (Thermo-Fisher, Wilmington, DE, USA) on a Qubit® 2.0 fluorometer (Thermo-Fisher) and a qPCR-based method [130]. Library insert sizes were estimated following separation on an Agilent 2100 BioAnalyzer (Agilent Technologies Inc.). PE 150 bp reads were then generated from library inserts on a single S2 flow cell lane of an Illumina NovaSeq 6000 sequencer (Illumina).
All post-processing of Hi-C read data was performed using HiCUP v.0.7.2 [143] to generate di-tags (PE reads located on different contigs) and used for downstream scaffolding. Specifically, HiCUP v.0.7.2 was used to trim raw reads of sequencing adapters, linker sequence, and low-quality nucleotides (q < 30) followed by truncation of reads to sequence upstream of the DpnII site. Trimmed reads were then aligned to the draft contig assembly using Bowtie 2 [144]. Any di-tags resulting from self-circularization, representing dangling ends or internal fragments (map to the same DpnII site), too distant from a DpnII site in the primary contig assembly, or repetitive PCR duplicates were also removed. The valid unique aligned reads were then used to predict inter- and intra-contig interactions, and define putative order and orientation along chromosomes. Juicebox v1.11.08 (https://github.com/phasegenomics/juicebox_scripts) was used to manually curate the contact-map wherein mis-joins were eliminated, inverted contigs re-oriented, and contigs joined into scaffolds.
The final set of L. glycinivorella scaffolds were aligned to 28 chromosome-level scaffolds of the 630.6 Mb Cydia splendana (Lepidoptera: Tortricidae; Family Olethreutinae) genome assembly ilCydSple1.1 (Welcome Sager Institute, unpublished; GenBank: GCA_910591565.1 WGS Project: CAJUYE01) using Nucmer in the MUMmer4 package [145]. Nucmer alignments were generated using default parameters except anchored using maximal unique matching sequence (–mum) of 65 nt, followed by use of delta-filter to filter the subsequent delta (alignment) file for minimum identity (-i) of 75, minimum length (-l) 1000, minimum uniqueness (-u) of 50, and maximum alignment overlap (-o) of 100. Coordinates of the filtered delta file were output using the MUMmer4 show-coords application, and 2D plots generated from mummerplot in postscript format. Completeness of this final scaffolded assembly was assessed against the lepidoptera_odb10 gene set using BUSCO v.5.2.2 as described above. For comparison, BUSCO scores were similarly produced for other assemblies from species in the lepidopteran Family Tortricidae: Cydia splendana, ilCydSple1.1, Pammene fasciana, ilPamFasc1.1 (GenBank: GCA_911728535.1), Notocelia uddmanniana, ilNotUddm1.1 (GenBank: GCA_905163555.1), and Apotomis turbidana, ilApoTurb1.1 (GenBank: GCA_905147355.1).
Gene annotation
The final set of chromosome-assigned L. glycinivorella scaffolds were submitted to the National Center for Biotechnology Information (NCBI) for automated eukaryotic genome annotation pipeline [146]. Evidence used to predict NCBI reference sequence (RefSeq) gene models were based on 135.4 million RNA-seq reads in the NCBI Sequence Read Archive (SRA) runs SRR5985984—SRR5985989 previously generated from 1st instar L. glycinivorella larvae [147, 148]. Additional evidence was generated in this study as life-stage specific reads (Libraries Lgly00 to Lgly06c; Table S1) from samples of eggs, larvae, and adults collected from fields of cultivated G. max at JAAS-NARC during July and early August, 2021 (n = 5; BioSamples SAMN24169476 to SAMN24169480). For the latter, samples were immediately flash frozen in liquid nitrogen in the field, then sent to Benagen Tech Solutions Ltd (Wuhan, China) for RNA extraction, construction of indexed RNA-seq libraries, and generation of 150 bp PE read data on a single S4 flow cell lane of a NovaSeq 6000 (Illumina). De novo repeat detection relied upon NCBI running of WindowMasker [149].
Further phylogenetic analyses performed for putative candidate Bt resistance genes, membrane-bound aminopeptidase N (apn) and alkaline phosphatase (alp), where orthology was uncertain based on RefSeq structural annotation. The Clustal W algorithm was used in the MEGA8.0 alignment utility [150] to construct a multiple protein sequence alignment of 16 APN proteins from B. mori (n = 16; [107]), O. nubilalis (n = 9; [109, 112]) and 27 L. glycinivorella RefSeq protein models with putative alanyl-aminopeptidase annotations encoded by 20 RefSeq gene models. A query of the Conserved Domain Database with the O. nubilalis APN1 (accession ACJ64827.1) predicted a 460 aa Peptidase M1 domain that was used to define the orthologous region in the multiple sequence alignment. Variation in the Peptidase M1 domain was subsequently used for subsequent phylogenetic analysis. The BIC score was maximized for this trimmed alignment in the LG model of protein sequence evolution [151] with an empirically-determined gamma distribution (LG + G), and MEGA8.0 [150] used to construct a phylogeny from among sites with ≥ 80% representation among aligned residues using Maximum Likelihood (ML) based on a consensus of 1,000 bootstrap pseudo-replicates.
For the L. glycinivorella gene LOC125230228 (transcript isoforms XM_048135319.1 and XM_048135320.1 encoding proteins XP_047991276.1 and XP_047991277.1, respectively), amino acid sequence homology was predicted based on simple alignment with Heliothis virescens mALPs (HvmALP) orthologs. Specifically, L. glycinivorella XP_047991276.1 and XP_047991277.1 were aligned with Heliothis virescens mALP accessions EU729322.1 (HvmALP1) and EU729323.1 (HvmALP1; [78]) using the Clustal W algorithm with default parameters and percent identities used to define orthology among putative isoforms.
Larval transcriptome response to feeding on transgenic G. max expressing Cry1Ac
The G. max cultivar, GP03-8–23, that express 3.75 μg/g of the Bt Cry1Ac pesticidal protein [61] and a conventional (non-Bt) cultivar were grown to R3 stage in a greenhouse at JAAS at 24 ± 1 °C, 60% relative humidity (RH), and 16:8 h light:dark (L:D). Leguminivora glycinivorella adults were collected from non-Bt G. max fields at JAAS-NARC in July 2021, placed into a cage, and eggs collected. Subsequent neonates were fed detached conventional (non-Bt) G. max leaves placed on moistened filter paper in 15-cm Petri plates and incubated in a growth chamber (26 ± 1 °C, 60–70% RH, and 16:8 L:D photoperiod). Second instars were transferred to Cry1Ac GP03-8–23 or non-Bt G. max plants (treatments), one larva per plant, across three replicates of 20 plants per treatment (60 plants per treatment; 120 total). Larvae were allowed to feed for 48 h. Recovered larvae were pooled from within each of the 3 replicates for Cry1Ac GP03-8–23 and non-Bt treatments (n = 7 larvae per treatment: n = 42 total larvae) for a total of 6 pools (Table S1; SAMN24169481 to SAMN24169486). Pools were flash frozen in liquid nitrogen and shipped to Benagen Tech Solutions Ltd. (Wuhan, China) where RNA was extracted and indexed RNA-seq libraries constructed for each Cry1Ac GP03-8–23 (Glyc05a_1Ac to Glyc05c_1Ac) and non-Bt treatment (Lgly06a_nBt to Lgly06a_nBt). Equimolar amounts of each library loaded on a single S4 lane of a NovaSeq 6000 (Illumina) on which 150 bp PE reads were generated.
Raw non-normalized fastq formatted read data were trimmed to remove nucleotide sequence with a q < 20 and Illumina adapters using Trimmomatic v0.39 [152]. Only PE reads from each library that survived filtering were used for generating pseudoalignments to L. glycinivorella transcript models (NCBI accession GCF_023078275.1) with Kallisto v0.46.1 [153] (parameters: –fragment-length = 200; –sd 20; –bootstrap-samples = 100, –seed = 42). Estimated read counts for each transcript pseudoalignment was used to predict differential gene expression (DGE; aggregated estimate of transcript isoforms), differential alternative splicing (DAS), and differential transcript use (DTU) events between treatments of L. glycinivorella fed Cry1Ac cultivar GP03-8–23 (Lgly05a_1Ac to Lgly 05c_1Ac) vs. non-Bt cultivar (Lgly06a_nBt to Lgly06a_nBt; Table S1) using the 3D RNA-seq package v 1.0.0 (https://github.com/wyguo/ThreeDRNAseq) [154] run in R 4.4.2 [155] via the integrated development environment, Rstudio 2022.07.2 + 576 [156]. A transcript mapping file consisting of ≥ 1 RefSeq mRNA model (XP_) from each gene (LOC|GeneID) was provided as input [23,735 transcripts (XM_) and 2,055 non-coding RNAs (XR_) from 17,151 RefSeq gene models]. Pre-processing involved conversion of read counts to length-scaled transcripts per million (lengthScaledTPM) [156], and filtered to remove genes and transcripts with a decreasing mean–variance trend; those in which ≥ 1 of 6 replicates with counts per million (CPM) ≥ 1. Read counts were normalized to log2CPM via trimmed mean of M-values (TMM) [157]. Then counts were corrected for batch outlier effects using the RUVr package [158] run in R 4.4.2 [155], and Principal Component Analysis (PCA) used to assess corrected mean log2CPM estimates among genes (aggregate of estimates for all transcripts for each gene) from each replicate along principal coordinates 1 (PC1) and 2 (PC2). Limma-voom [159] was then used to estimate the log2(fold-change) of transcript abundances based on CPM values between treatment groups [Cry1Ac (Lgly05a_1Ac to Lgly 05c_1Ac) vs. non-Bt (Lgly06a_nBt to Lgly06a_nBt)] and determined a false discovery rate (FDR) adjusted for multiple comparisons using the Benjamini-Hochberg (BH) method [108]. Significance for DGE was defined at an FDR threshold ≤ 0.05 and log2(fold-change) ≥ 2.0. DAS and DTU events were predicted based on change in percentage spliced transcripts (∆PS; change in ratio of mean transcript abundances weighted by the mean abundance for all transcripts for the given gene) [160] as calculated in the 3D RNA-seq package [154]. In brief, ∆PS was calculated for genes with > 1 transcript and DAS only events defined as instances where proportional changes in transcript abundances are predicted without corresponding significant changes in expression when all transcript isoform levels are considered. DTU (DGE + DAS) events are those when significant changes in overall gene expression and relative transcript abundances are predicted. Significance of DAS and DTU estimates were determined using an F-test with a BH adjusted FDR ≤ 0.05 and ∆PS ≥ 0.1.
Functional annotation was performed for significant differentially expressed RefSeq transcripts by prediction of protein domains by searching the Pfam-A database [161] with PfamScan (https://www.ebi.ac.uk/Tools/pfa/pfamscan/) using each corresponding RefSeq proteins as the query, with results filtered for E-values ≤ 10–10. To avoid biased representation in instances when multiple transcripts were derived from same gene (LOC/GeneID), the longest RefSeq protein model was used to query Pfam-A. Pfam domains were then used to retrieve curated gene ontology (GO) terms and perform enrichment analyses at GO level 2 using the dcGOR 1.0.6 package [162] based Z-scores calculated on hypergeometric distributions of terms and a BH significance threshold (FDR) ≤ 1.0–4 applied for cellular component (CC) and molecular function (MF), and ≤ 1.0–6 biological process (BP) category terms. GOplot [163] was used to construct a bubble plot (GObubble) where -log(FDR) from GO enrichment was plotted against a Z-score that took into account expression bias among DEGs [Z-score = (count up-regulated – down-regulated genes)/sqrt(total number of genes)] with positive and negative values indicative of greater proportional of up- and down-regulated genes in a given GO category, respectively. Size of bubbles proportional to number of DEGs in each category.
Availability of data and materials
All read data generated in this study are under PRJNA759210 (co-listed under the i5K Umbrella Project, PRJNA163973). This includes sequence read archive (SRA) submissions for whole genome sequence reads (SRR15680761 to SRR15680763), and RNA-seq reads generated across developmental stages used for evidence-based gene predictions (SRR17269421 to SRR17269426) and prediction of differential expression (SRR17269416 to SRR17269424). Scaffolds from the final assembly LegGlyc1.1 are available in DDBJ/ENA/GenBank in accession JAKXMO000000000 and RefSeq genome assembly accession GCF_023078275.1 (WGS Project: JAKXMO01). Information from Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Leguminivora_glycinivorella/100/) for transcript, CDS and protein sequences can be downloaded via FTP NCBI. The primary assembly and annotations are also available for view through WebApollo at the i5k Workspace@NAL [164].
References
Coates BS, Poelchau M, Childers C, Evans JD, Handler A, Guerrero F, Skoda S, Hopper K, Wintermantel WM, Ling KS, Hunter WB. Arthropod genomics research in the United States Department of Agriculture-Agricultural Research Service: Current impacts and future prospects. Trends Entomol. 2015;11(12):12–27.
Coates BS. Bacillus thuringiensis toxin resistance mechanisms among Lepidoptera: progress on genomic approaches to uncover causal mutations in the European corn borer, Ostrinia nubilalis. Curr Opin Insect Sci. 2016;15:70–7.
Karunamoorthi K, Sabesan S. Insecticide resistance in insect vectors of disease with special reference to mosquitoes: a potential threat to global public health. Health Scope. 2013;2:4–18.
Mallet J. The evolution of insecticide resistance: have the insects won? Trends Ecol Evol. 1989;4(11):336–40.
Sparks TC, Crossthwaite AJ, Nauen R, Banba S, Cordova D, Earley F, Ebbinghaus-Kintscher U, Fujioka S, Hirao A, Karmon D, Kennedy R. Insecticides, biologics and nematicides: Updates to IRAC’s mode of action classification-a tool for resistance management. Pestic Biochem Phys. 2020;167:104587.
Ffrench-Constant RH. The molecular genetics of insecticide resistance. Genetics. 2013;194(4):807–15.
Crickmore N, Berry C, Panneerselvam S, Mishra R, Connor TR, Bonning BC. A structure-based nomenclature for Bacillus thuringiensis and other bacteria-derived pesticidal proteins. J Invertebr Pathol. 2021;186:107438.
Briefs IS. Global status of commercialized biotech/GM crops in 2017: Biotech crop adoption surges as economic benefits accumulate in 22 years. ISAAA Brief. 2017;53:25–6.
James C. International Service for the Acquisition of Agribiotech Applications Brief 53. In: Global status of commercialized biotech/GM crops. Ithaca. 2017.
Tabashnik BE, Carrière Y. Surge in insect resistance to transgenic crops and prospects for sustainability. Nat Biotechnol. 2017;35(10):926–35.
Gould F. Sustainability of transgenic insecticidal cultivars: integrating pest genetics and ecology. Annu Rev Entomol. 1998;43(1):701–26.
Tabashnik BE. Delaying insect resistance to transgenic crops. PNAS. 2008;105(49):19029–30.
Onstad DW, Crespo AL, Pan Z, Crain PR, Thompson SD, Pilcher CD, Sethi A. Blended refuge and insect resistance management for insecticidal corn. Environ Entomol. 2018;47(1):210–9.
Gould F. Potential and problems with high-dose strategies for pesticidal engineered crops. Biocontrol Sci Technol. 1994;4(4):451–61.
Zhang H, Tian W, Zhao J, Jin L, Yang J, Liu C, Yang Y, Wu S, Wu K, Cui J, Tabashnik BE. Diverse genetic basis of field-evolved resistance to Bt cotton in cotton bollworm from China. PNAS. 2012;109(26):10275–80.
Campagne P, Kruger M, Pasquet R, Le Ru B, Van den Berg J. Dominant inheritance of field-evolved resistance to Bt corn in Busseola fusca. PLoS ONE. 2013;8(7):e69675.
Mallet J, Porter P. Preventing insect adaptation to insect-resistant crops: are seed mixtures or refugia the best strategy? Proc R Soc B: Biol Sci. 1992;250(1328):165–9.
Tabashnik BE, Gould F, Carrière Y. Delaying evolution of insect resistance to transgenic crops by decreasing dominance and heritability. J Evol Biol. 2004;17(4):904–12.
Horikoshi RJ, Bernardi D, Bernardi O, Malaquias JB, Okuma DM, Miraldo LL, Amaral FS, Omoto C. Effective dominance of resistance of Spodoptera frugiperda to Bt maize and cotton varieties: implications for resistance management. Sci Rep. 2016;6(1):34864.
Liu Y, Tabashnik BE. Inheritance of resistance to the Bacillus thuringiensis toxin Cry1C in the diamondback moth. Appl Environ Microbiol. 1997;63(6):2218–23.
Gassmann AJ, Carrière Y, Tabashnik BE. Fitness costs of insect resistance to Bacillus thuringiensis. Annu Rev Entomol. 2009;54:147–63.
Tabashnik BE, Brévault T, Carrière Y. Insect resistance to Bt crops: lessons from the first billion acres. Nat Biotechnol. 2013;31(6):510–21.
Pigott CR, Ellar DJ. Role of receptors in Bacillus thuringiensis crystal toxin activity. Microbiol Mol Biol Rev. 2007;71(2):255–81.
Vachon V, Laprade R, Schwartz JL. Current models of the mode of action of Bacillus thuringiensis insecticidal crystal proteins, a critical review. J Invertebr Pathol. 2012;111(1):1–12.
Zhang X, Candas M, Griko NB, Rose-Young L, Bulla LA Jr. Cytotoxicity of Bacillus thuringiensis Cry1Ab toxin depends on specific binding of the toxin to the cadherin receptor BT-R1 expressed in insect cells. Cell Death Differ. 2005;12(11):1407–16.
Zhang X, Candas M, Griko NB, Taussig R, Bulla LA. A mechanism of cell death involving an adenylyl cyclase/PKA signaling pathway is induced by the Cry1Ab toxin of Bacillus thuringiensis. Proc Natl Acad Sci USA. 2006;103(26):9897–902.
Soberón M, Monnerat R, Bravo A. Mode of action of cry toxins from Bacillus thuringiensis and resistance mechanisms. Microb Toxins. 2018:15–27.
Wu C, Chakrabarty S, Jin M, Liu K, Xiao Y. Insect ATP-binding cassette (ABC) transporters: roles in xenobiotic detoxification and Bt insecticidal activity. Int J Mol Sci. 2019;20(11):2829.
Heckel DG. The essential and enigmatic role of ABC transporters in Bt resistance of noctuids and other insect pests of agriculture. Insects. 2021;12(5):389.
Jurat-Fuentes JL, Heckel DG, Ferré J. Mechanisms of resistance to insecticidal proteins from Bacillus thuringiensis. Annu Rev Entomol. 2021;66:121–40.
Cancino-Rodezno A, Alexander C, Villaseñor R, Pacheco S, Porta H, Pauchet Y, Soberón M, Gill SS, Bravo A. The mitogen-activated protein kinase p38 is involved in insect defense against Cry toxins from Bacillus thuringiensis. Insect Biochem Mol Biol. 2010;40(1):58–63.
Gao J, Cao M, Ye W, Li H, Kong L, Zheng X, Wang Y. PsMPK7, a stress-associated mitogen-activated protein kinase (MAPK) in Phytophthora sojae, is required for stress tolerance, reactive oxygenated species detoxification, cyst germination, sexual reproduction and infection of soybean. Mol Plant Pathol. 2015;16(1):61–70.
Qiu L, Fan J, Liu L, Zhang B, Wang X, Lei C, Lin Y, Ma W. Knockdown of the MAPK p38 pathway increases the susceptibility of Chilo suppressalis larvae to Bacillus thuringiensis Cry1Ca toxin. Sci Reports. 2017;7(1):43964.
Guo Z, Kang S, Sun D, Gong L, Zhou J, Qin J, Guo L, Zhu L, Bai Y, Ye F, Wu Q. MAPK-dependent hormonal signaling plasticity contributes to overcoming Bacillus thuringiensis toxin action in an insect host. Nat Commun. 2020;11(1):3003.
Guo Z, Kang S, Wu Q, Wang S, Crickmore N, Zhou X, Bravo A, Soberón M, Zhang Y. The regulation landscape of MAPK signaling cascade for thwarting Bacillus thuringiensis infection in an insect host. PLoS Pathog. 2021;17(9):e1009917.
Guo W, Kain W, Wang P. Effects of disruption of the peritrophic membrane on larval susceptibility to Bt toxin Cry1Ac in cabbage loopers. J Insect Physiol. 2019;117:103897.
Mitsuhashi W, Miyamoto K. Interaction of Bacillus thuringiensis Cry toxins and the insect midgut with a focus on the silkworm (Bombyx mori) midgut. Biocontrol Sci Technol. 2020;30(1):68–84.
Fang S, Wang L, Guo W, Zhang X, Peng D, Luo C, Yu Z, Sun M. Bacillus thuringiensis bel protein enhances the toxicity of Cry1Ac protein to Helicoverpa armigera larvae by degrading insect intestinal mucin. Appl Environ Microbiol. 2009;75(16):5237–43.
Coates BS, Deleury E, Gassmann AJ, Hibbard BE, Meinke LJ, Miller NJ, Petzold-Maxwell J, French BW, Sappington TW, Siegfried BD, Guillemaud T. Up-regulation of apoptotic-and cell survival-related gene pathways following exposures of western corn rootworm to Bacillus thuringiensis crystalline pesticidal proteins in transgenic maize roots. BMC Genomics. 2021;22:1–27.
Oppert B, Dowd SE, Bouffard P, Li L, Conesa A, Lorenzen MD, Toutges M, Marshall J, Huestis DL, Fabrick J, Oppert C. Transcriptome profiling of the intoxication response of Tenebrio molitor larvae to Bacillus thuringiensis Cry3Aa protoxin. PLoS ONE. 2012;7(4):e34624.
Bel Y, Jakubowska AK, Costa J, Herrero S, Escriche B. Comprehensive analysis of gene expression profiles of the beet armyworm Spodoptera exigua larvae challenged with Bacillus thuringiensis Vip3Aa toxin. PLoS ONE. 2013;8:e81927.
Sparks ME, Blackburn MB, Kuhar D, Gundersen-Rindal DE. Transcriptome of the Lymantria dispar (gypsy moth) larval midgut in response to infection by Bacillus thuringiensis. PLoS ONE. 2013;8(5):e61190.
Vellichirammal NN, Wang H, Eyun S, Moriyama E, Coates BS, Miller NJ, Siegfried BD. Transcriptional analysis of susceptible and resistant European corn borer strains and their response to Cry1F protoxin. BMC Genomics. 2015;16(1):558.
Zhang T, Coates BS, Wang Y, Wang Y, Bai S, Wang Z, He K. Down-regulation of aminopeptidase N and ABC transporter subfamily G transcripts in Cry1Ab and Cry1Ac resistant Asian corn borer, Ostrinia furnacalis (Lepidoptera: Crambidae). Int J Biol Sci. 2017;13(7):835–51.
Rault LC, Siegfried BD, Gassmann AJ, Wang H, Brewer GJ, Miller NJ. Investigation of Cry3Bb1 resistance and intoxication in western corn rootworm by RNA sequencing. J Appl Entomol. 2018;142(10):921–36.
Zhao Z, Meihls LN, Hibbard BE, Ji T, Elsik CG, Shelby KS. Differential gene expression in response to eCry3.1Ab ingestion in an unselected and eCry3.1Ab-selected western corn rootworm (Diabrotica virgifera virgifera LeConte) population. Sci Reports. 2019;9(1):4896.
Yu S, Wang C, Li K, Yang Y, He Y-Z, Wu Y. Transcriptional analysis of cotton bollworm strains with different genetic mechanisms of resistance and their response to Bacillus thuringiensis Cry1Ac toxin. Toxins. 2022;14(6):366.
Edmonds RP, Borden JH, Angerilli NP, Rauf A. A comparison of the developmental and reproductive biology of two soybean pod borers, Etiella spp. in Indonesia. Entomol Exp Appl. 2000;97(2):137–47.
Sakagami SF, Tanno K, Tsutsui H, Honma K. The role of cocoons in overwintering of the soybean pod borer Leguminivora glycinivorella (Lepidoptera: Tortricidae). J Kans Entomol Soc. 1985;2:240–7.
Le VV, Ishitani M, Komai F, Yamamoto M, Ando T. Sex pheromone of the soybean pod borer, Leguminivora glycinivorella (Lepidoptera: Tortricidae): Identification and field evaluation. Appl Entomol Zool. 2006;41(3):507–13.
Zhao XL. Methods for controlling soybean moth (Leguminivora glycinivorella). Soybean Sci. 2004;23(1):77–80.
Wu BZ. Study of new methods of prevention and control of Leguminirora glycinivorella. Soybean Bulletin. 2001;3(9).
Zheng XL, Zhang BS, Kou H. Main problems and developing countermeasures of soybean in northeast China. Heilongjiang Agric Sci. 2012;2:146–9.
Talekar NS. Insects damaging soybean in Asia. In: Soybeans for the tropics: research production, and utilization. New York: Wiley. 1987; 25–45.
Gill RJ, Ramos-Rodriguez O, Raine NE. Combined pesticide exposure severely affects individual-and colony-level traits in bees. Nature. 2012;491(7422):105–8.
Zhao G, Wang J, Han Y, Teng W, Sun G, Li W. Identification of QTL underlying the resistance of soybean to pod borer, Leguminivora glycinivorella (Mats.) obraztsov, and correlations with plant, pod and seed traits. Euphytica. 2008;164:275–82.
Meng FL, Ran RX, Li Y, Li N, Li HZ, Wang ZK, Li WB. RNAi-mediated knockdown of a serine protease gene (Spbtry1) from SPB (soybean pod borer) affects the growth and mortality of the pest. Fla Entomol. 2017;100(3):607–15.
Meng F, Li Y, Zang Z, Li N, Ran R, Cao Y, Li T, Zhou Q, Li W. Expression of the double-stranded RNA of the soybean pod borer Leguminivora glycinivorella (Lepidoptera: Tortricidae) ribosomal protein P0 gene enhances the resistance of transgenic soybean plants. Pest Manag Sci. 2017;73(12):2447–55.
Yu H, Li Y, Li X, Romeis J, Wu K. Expression of Cry1Ac in transgenic Bt soybean lines and their efficiency in controlling lepidopteran pests. Pest Manag Sci. 2013;69(12):1326–33.
Dourado PM, Bacalhau FB, Amado D, Carvalho RA, Martinelli S, Head GP, Omoto C. High susceptibility to Cry1Ac and low resistance allele frequency reduce the risk of resistance of Helicoverpa armigera to Bt soybean in Brazil. PLoS ONE. 2016;11(8):e0161388.
Qian X, Su Y, Yao Y, Li Q, Guo D, Dong Y. Soybean transformation with insect-resistant Cry1Ac/Ab mediated by Agrobacterium tumefaciens. J Jilin Agric Univ. 2017;39(5):527–33.
Pray CE, Huang J, Hu R, Rozelle S. Five years of Bt cotton in China–the benefits continue. Plant J. 2002;31(4):423–30.
Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–100.
Xiao Z, Kerr WA. Biotechnology in China–regulation, investment, and delayed commercialization. GM Crops Food. 2022;13(1):86–96.
Wu KM, Guo YY. The evolution of cotton pest management practices in China. Annu Rev Entomol. 2005;50:31–52.
Yu HL, Li YH, Wu KM. Risk assessment and ecological effects of transgenic Bacillus thuringiensis crops on non-target organisms F. J Integr Plant Biol. 2011;53(7):520–38.
Li Y, Peng Y, Hallerman EM, Wu K. Biosafety management and commercial use of genetically modified crops in China. Plant Cell Rep. 2014;33:565–73.
Šíchová J, Nguyen P, Dalikova M, Marec F. Chromosomal evolution in tortricid moths: conserved karyotypes with diverged features. PLoS ONE. 2013;8(5):e64520.
Nguyen P, Sýkorová M, Šíchová J, Kůta V, Dalíková M, Čapková Frydrychová R, Neven LG, Sahara K, Marec F. Neo-sex chromosomes and adaptive potential in tortricid pests. PNAS. 2013;110(17):6931–6.
Presgraves DC. Patterns of postzygotic isolation in Lepidoptera. Evolution. 2002;56(6):1168–83.
Coates BS, Dopman EB, Wanner KW, Sappington TW. Genomic mechanisms of sympatric ecological and sexual divergence in a model agricultural pest, the European corn borer. Curr Opin Insect Sci. 2018;26:50–6.
Griffitts JS, Aroian RV. Many roads to resistance: how invertebrates adapt to Bt toxins. BioEssays. 2005;27(6):614–24.
Heckel DG, Gahan LJ, Baxter SW, Zhao JZ, Shelton AM, Gould F, Tabashnik BE. The diversity of Bt resistance genes in species of Lepidoptera. J Invertebr Pathol. 2007;95(3):192–7.
Adang MJ, Crickmore N, Jurat-Fuentes JL. Diversity of Bacillus thuringiensis crystal toxins and mechanism of action. Adv Insect Physiol. 2014;47:39–87.
Pinos D, Andrés-Garrido A, Ferré J, Hernández-Martínez P. Response mechanisms of invertebrates to Bacillus thuringiensis and its pesticidal proteins. Microbiol Mol Biol Rev. 2021;85(1):e00007-20.
Jin L, Wang J, Guan F, Zhang J, Yu S, Liu S, Xue Y, Li L, Wu S, Wang X, Yang Y. Dominant point mutation in a tetraspanin gene associated with field-evolved resistance of cotton bollworm to transgenic Bt cotton. PNAS. 2018;115(46):11760–5.
Benowitz KM, Allan CW, Degain BA, Li X, Fabrick JA, Tabashnik BE, Carrière Y, Matzkin LM. Novel genetic basis of resistance to Bt toxin Cry1Ac in Helicoverpa zea. Genetics. 2022;221(1):iyac037.
Jurat-Fuentes JL, Adang MJ. The Heliothis virescens cadherin protein expressed in Drosophila S2 cells functions as a receptor for Bacillus thuringiensis Cry1A but not Cry1Fa toxins. Biochemistry. 2006;45(32):9688–95.
Abdelkefi-Mesrati L, Rouis S, Sellami S, Jaoua S. Prays oleae midgut putative receptor of Bacillus thuringiensis vegetative insecticidal protein Vip3LB differs from that of Cry1Ac toxin. Mol Biotechnol. 2009;43:15–9.
Sena JA, Hernández-Rodríguez CS, Ferré J. Interaction of Bacillus thuringiensis Cry1 and Vip3A proteins with Spodoptera frugiperda midgut binding sites. Appl Environ Microbiol. 2009;75(7):2236–7.
Tabashnik BE, Carrière Y. Evaluating cross-resistance between Vip and Cry toxins of Bacillus thuringiensis. J Econ Entomol. 2020;113(2):553–61.
Muita BK, Baxter SW. Temporal exposure to Bt insecticide causes oxidative stress in larval midgut tissue. Toxins. 2023;15(5):323.
Apirajkamol NB, James B, Gordon KHJ, Walsh TK, McGaughran A. Oxidative stress delays development and alters gene expression in the agricultural pest moth, Helicoverpa armigera. Ecol Evol. 2020;10(12):5680–93.
Wang X, Martínez MA, Wu Q, Ares I, Martínez-Larrañaga MR, Anadón A, Yuan Z. Fipronil insecticide toxicology: oxidative stress and metabolism. Crit Rev Toxicol. 2016;46(10):876–99.
Wang X, Anadón A, Wu Q, Qiao F, Ares I, Martínez-Larrañaga MR, Yuan Z, Martínez MA. Mechanism of neonicotinoid toxicity: impact on oxidative stress and metabolism. Annu Rev Pharmacol Toxicol. 2018;58:471–507.
Mason EF, Rathmell JC. Cell metabolism, an essential link between cell growth and apoptosis. Biochimica et Biophysica Acta (BBA)-Mol Cell Res. 2011;1813(4):645–54.
Eming SA, Wynn TA, Martin P. Inflammation and metabolism in tissue repair and regeneration. Science. 2017;356(6342):1026–30.
Boucheix C, Rubinstein E. Tetraspanins. CMLS. 2001;58:1189–205.
Bowers WE. Christian de Duve and the discovery of lysosomes and peroxisomes. Trends Cell Biol. 1998;8(8):330–3.
Pracharoenwattana I, Smith SM. When is a peroxisome not a peroxisome? Trends Plant Sci. 2008;13(10):522–5.
Marshall RS, Vierstra RD. Autophagy: the master of bulk and selective recycling. Annu Rev Plant Bioly. 2018;69:173–208.
Wada Y. Vacuoles in mammals: a subcellular structure indispensable for early embryogenesis. BioArchitecture. 2013;3(1):13–9.
Saftig P, Puertollano R. How lysosomes sense, integrate, and cope with stress. Trends Biochem Sci. 2021;46(2):97–112.
He A, Dean JM, Lodhi IJ. Peroxisomes as cellular adaptors to metabolic and environmental stress. Trends Cell Biol. 2021;31(8):656–70.
Chen HD, Kao CY, Liu BY, Huang SW, Kuo CJ, Ruan JW, Lin YH, Huang CR, Chen YH, Wang HD, Aroian RV, Chen CS. HLH-30/TFEB-mediated autophagy functions in a cell-autonomous manner for epithelium intrinsic cellular defense against bacterial pore-forming toxin in C. elegans. Autophagy. 2017;13(2):371–85.
Yang Y, Huang X, Yuan W, Xiang Y, Guo X, Wei W, Soberón M, Bravo A, Liu K. Bacillus thuringiensis cry toxin triggers autophagy activity that may enhance cell death. Pestic Biochem Phys. 2021;171:104728.
Porta H, Muñoz-Minutti C, Soberón M, Bravo A. Induction of Manduca sexta larvae caspases expression in midgut cells by Bacillus thuringiensis Cry1Ab toxin. Psyche. 2011;938249:1–7.
Hernández-Martínez P, Gomis-Cebolla J, Ferré J, Escriche B. Changes in gene expression and apoptotic response in Spodoptera exigua larvae exposed to sublethal concentrations of Vip3 insecticidal proteins. Sci Reports. 2017;7(1):1–12.
Rojas-Rivera D, Hetz C. TMBIM protein family, ancestral regulators of cell death. Oncogene. 2015;34(3):269–80.
Liu L, Li Z, Luo X, Zhang X, Chou SH, Wang J, He J. Which is stronger? A continuing battle between Cry toxins and insects. Front Microbiol. 2021;12:665101.
Jurat-Fuentes JL, Karumbaiah L, Jakka SR, Ning C, Liu C, Wu K, Jackson J, Gould F, Blanco C, Portilla M, Perera O. Reduced levels of membrane-bound alkaline phosphatase are common to lepidopteran strains resistant to Cry toxins from Bacillus thuringiensis. PLoS ONE. 2011;6(3):e17606.
Yuan X, Zhao M, Wei J, Zhang W, Wang B, Khaing MM, Liang G. New insights on the role of alkaline phosphatase 2 from Spodoptera exigua (Hübner) in the action mechanism of Bt toxin Cry2Aa. J Insect Physiol. 2017;98:101–7.
Pinos D, Chakroun M, Millán-Leiva A, Jurat-Fuentes JL, Wright DJ, Hernández-Martínez P, Ferré J. Reduced membrane-bound alkaline phosphatase does not affect binding of Vip3Aa in a Heliothis virescens resistant colony. Toxins. 2020;12(6):409.
Chen G, Wang Y, Liu Y, Chen F, Han L. Differences in midgut transcriptomes between resistant and susceptible strains of Chilo suppressalis to Cry1C toxin. BMC Genomics. 2020;21(1):1–9.
Stalinski R, Laporte F, Després L, Tetreau G. Alkaline phosphatases are involved in the response of Aedes aegypti larvae to intoxication with Bacillus thuringiensis subsp. israelensis cry toxins. Environ Microbiol. 2016;18(3):1022–36.
Hughes AL. Evolutionary diversification of aminopeptidase N in Lepidoptera by conserved clade-specific amino acid residues. Mol Phylogenet Evol. 2014;76:127–33.
Lin P, Cheng T, Jin S, Jiang L, Wang C, Xia Q. Structural, evolutionary and functional analysis of APN genes in the Lepidoptera Bombyx mori. Gene. 2014;535(2):303–11.
Benjamini Y, Hochberg Y. Controlling the false discovery rate, A practical and powerful approach to multiple testing. J R Stat Soc B (Methodological). 1995;57(1):289–300.
Crava CM, Bel Y, Lee SF, Manachini B, Heckel DG, Escriche B. Study of the aminopeptidase N gene family in the lepidopterans Ostrinia nubilalis (Hübner) and Bombyx mori (L.): Sequences, mapping and expression. Biochem Mol Biol. 2010;40(7):506–15.
Zhang S, Cheng H, Gao Y, Wang G, Liang G, Wu K. Mutation of an aminopeptidase N gene is associated with Helicoverpa armigera resistance to Bacillus thuringiensis Cry1Ac toxin. Insect Biochem Mol Biol. 2009;39(7):421–9.
Tiewsiri K, Wang P. Differential alteration of two aminopeptidases N associated with resistance to Bacillus thuringiensis toxin Cry1Ac in cabbage looper. PNAS. 2011;108(34):14037–42.
Coates BS, Sumerford DV, Siegfried BD, Hellmich RL, Abel CA. Unlinked genetic loci control the reduced transcription of aminopeptidase N 1 and 3 in the European corn borer and determine tolerance to Bacillus thuringiensis Cry1Ab toxin. Insect Biochem Mol Biol. 2013;43(12):1152–60.
Herrero S, Gechev T, Bakker PL, Moar WJ, de Maagd RA. Bacillus thuringiensis Cry1Ca-resistant Spodoptera exigua lacks expression of one of four Aminopeptidase N genes. BMC Genomics. 2005;6(1):1.
Knight PJ, Crickmore N, Ellar DJ. The receptor for Bacillus thuringiensis CrylA (c) delta-endotoxin in the brush border membrane of the lepidopteran Manduca sexta is aminopeptidase N. Mol Microbiol. 1994;11(3):429–36.
Sun D, Zhu L, Guo L, Wang S, Wu Q, Crickmore N, Zhou X, Bravo A, Soberón M, Guo Z, Zhang Y. A versatile contribution of both aminopeptidases N and ABC transporters to Bt Cry1Ac toxicity in the diamondback moth. BMC Biol. 2022;20(1):33.
Huang L, Cheng T, Xu P, Cheng D, Fang T, Xia Q. A genome-wide survey for host response of silkworm, Bombyx mori, during pathogen Bacillus bombyseptieus infection. PLoS ONE. 2009;4(12): e8098.
Terra WR, Ferreira C. Insect digestive enzymes: properties, compartmentalization and function. Comp Biochem Physiol B, Comp Biochem. 1994;109(1):1–62.
Adang MJ. Insect aminopeptidase N. In: Handbook of proteolytic enzymes. Academic. 2004; 296–299.
Gahan LJ, Pauchet Y, Vogel H, Heckel DG. An ABC transporter mutation is correlated with insect resistance to Bacillus thuringiensis Cry1Ac toxin. PLoS Genet. 2010;6(12):e1001248.
Heckel DG. Learning the ABCs of Bt: ABC transporters and insect resistance to Bacillus thuringiensis provide clues to a crucial step in toxin mode of action. Pestic Biochem Phys. 2012;104:103–10.
Guo Z, Kang S, Chen D, Wu Q, Wang S, Xie W, Zhu X, Baxter SW, Zhou X, Jurat-Fuentes JL, Zhang Y. MAPK signaling pathway alters expression of midgut ALP and ABCC genes and causes resistance to Bacillus thuringiensis Cry1Ac toxin in diamondback moth. PLoS Genet. 2015;11(4):e1005124.
Wang H, Liu B, Zhang Y, Jiang F, Ren Y, Yin L, Liu H, Wang S, Fan W. Estimation of genome size using k-mer frequencies from corrected long reads. arXiv preprint arXiv:2003.11817. 2020 Mar 26.
Hollenstein K, Dawson RJ, Locher KP. Structure and mechanism of ABC transporter proteins. Curr Opin Struct Biol. 2007;17(4):412–8.
Lynch J, Fukuda Y, Krishnamurthy P, Du G, Schuetz JD. Cell survival under stress is enhanced by a mitochondrial ATP-binding cassette transporter that regulates hemoproteins. Cancer Res. 2009;69(13):5560–7.
Huang S, Yuan S, Dong M, Su J, Yu C, Shen Y, Xie X, Yu Y, Yu X, Chen S, Zhang S. The phylogenetic analysis of tetraspanins projects the evolution of cell–cell interactions from unicellular to multicellular organisms. Genomics. 2005;86(6):674–84.
Todres E, Nardi JB, Robertson HM. The tetraspanin superfamily in insects. Insect Mol Biol. 2000;9(6):581–90.
Murungi EK, Kariithi HM, Adunga V, Obonyo M, Christoffels A. Evolution and structural analyses of Glossina morsitans (Diptera; Glossinidae) tetraspanins. Insects. 2014;5(4):885–908.
Zhuang S, Kelo L, Nardi JB, Kanost MR. An integrin-tetraspanin interaction required for cellular innate immune responses of an insect. Manduca sexta J Biol Chem. 2007;282(31):22563–72.
Endo H. Molecular and kinetic models for pore formation of Bacillus thuringiensis Cry toxin. Toxins. 2022;14:433.
Buehler B, Hogrefe HH, Scott G, Ravi H, Pabón-Peña C, O’Brien S, Formosa R, Happe S. Rapid quantification of DNA libraries for next-generation sequencing. Methods. 2010;50(4):S15–8.
Liu B, Shi Y, Yuan J, Hu X, Zhang H, Li N, Li Z, Chen Y, Mu D, Fan W. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. arXiv preprint arXiv:1308.2012. 2013.
Lander ES, Waterman MS. Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics. 1988;2(3):231–9.
Vurture GW, Sedlazeck FJ, Nattestad M, Underwood CJ, Fang H, Gurtowski J, Schatz MC. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics. 2017;33(14):2202–4.
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9(11):e112963.
Roach MJ, Schmidt SA, Borneman AR. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC bioinforms. 2018;19(1):1.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60.
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008.
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2.
Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness. Gene Prediction: Methods and Protocols. 2019;1962:227–45.
Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021;38(10):4647–54.
Levy Karin E, Mirdita M, Söding J. MetaEuk—sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics. Microbiome. 2020;8:1–5.
Lieberman-Aiden E, Van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–93.
Wingett S, Ewels P, Furlan-Magaril M, Nagano T, Schoenfelder S, Fraser P, Andrews S. HiCUP: pipeline for mapping and processing Hi-C data. F1000Research. 2015;4:1310.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9.
Marçais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, Zimin A. MUMmer4: A fast and versatile genome alignment system. PLoS Computat Biol. 2018;14(1):e1005944.
Thibaud-Nissen F, DiCuccio M, Hlavina W, Kimchi A, Kitts PA, Murphy TD, Pruitt KD, Souvorov A. P8008 the NCBI eukaryotic genome annotation pipeline. J Anim Sci. 2016;94(4):184.
Ran R, Li T, Liu X, Ni H, Li W, Meng F. RNA interference-mediated silencing of genes involved in the immune responses of the soybean pod borer Leguminivora glycinivorella (Lepidoptera: Olethreutidae). Peer J. 2018;6:e4931.
Meng F, Yang M, Li Y, Li T, Liu X, Wang G, Wang Z, Jin X, Li W. Functional analysis of RNA interference-related soybean pod borer (Lepidoptera) genes based on transcriptome sequences. Front Physiol. 2018;9:383.
Morgulis A, Gertz EM, Schäffer AA, Agarwala R. WindowMasker: window-based masker for sequenced genomes. Bioinformatics. 2006;22(2):134–41.
Kumar S, Nei M, Dudley J, Tamura K. MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008;9(4):299–306.
Le SQ, Gascuel O. An improved general amino acid replacement matrix. Mol Biol Evol. 2008;25(7):1307–20.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525–7.
Guo W, Tzioutziou NA, Stephen G, Milne I, Calixto CP, Waugh R, Brown JW, Zhang R. 3D RNA-seq: a powerful and flexible tool for rapid and accurate differential expression and alternative splicing analysis of RNA-seq data for biologists. RNA Biol. 2021;18(11):1574–87.
R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. https://www.R-project.org/. 2022
Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research. 2015;4:1520.
Bullard JH, Purdom E, Hansen KD, Dudoit S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinform. 2010;11(1):1–3.
Risso D, Ngai J, Speed TP, Dudoit S. Normalization of RNA-seq data using factor analysis of control genes or samples. Nat Biotechnol. 2014;32(9):896–902.
Law CW, Chen Y, Shi W, Smyth GK. Voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15(2):1–7.
Calixto CP, Guo W, James AB, Tzioutziou NA, Entizne JC, Panter PE, Knight H, Nimmo HG, Zhang R, Brown JW. Rapid and dynamic alternative splicing impacts the Arabidopsis cold response transcriptome. Plant Cell. 2018;30(7):1424–44.
Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer EL, Tosatto SC, Paladin L, Raj S, Richardson LJ, Finn RD. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021;49(D1):D412–9.
Fang H. dcGOR, An R package for analysing ontologies and protein domain annotations. PloS Comput Biol. 2014;10(10):e1003929.
Walter W, Sánchez-Cabo F, Ricote M. GOplot: an R package for visually combining expression data with functional analysis. Bioinformatics. 2015;31(17):2912–4.
Poelchau M, Childers C, Moore G, Tsavatapalli V, Evans J, Lee CY, Lin H, Lin JW, Hackett K. The i5k Workspace@ NAL—enabling genomic data access, visualization and curation of arthropod genomes. Nucl Acids Rese. 2015;43(D1):D714–9.
Wang Y, Li Q, Du Q, Guo D, Zhang Y, Dong Y, Gao Y, Zhang Z. Specification for evaluation of the genetically modified soybean resistance to soybean pod borer. Regional standard for assessing transgenic soybean, DB22/T2556–2016, published by Jilin Provincial Bureau of Quality and Technical Supervision. 2016.
Acknowledgements
We are grateful to Jinfeng Luan for involving in bioassay and obtaining resistant soybean pod borer. The research was supported by Jilin Academy of Agricultural Sciences and in cooperation with the United States Department of Agriculture (USDA), Agricultural Research Service (ARS). Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. USDA-ARS is an equal opportunity employer and provider.
Funding
This research was supported by Jilin Academy of Agricultural Sciences through the grants of “Agricultural Science and Technology Innovation Project” (CXGC2021ZY028) and “Industrial Technology Research and Development Project of Jilin Province” (2022C037-7). This research was performed in cooperation with the United States Department of Agriculture (USDA), Agricultural Research Service (ARS) (CRIS Project 5030–22000-019-00D) “Ecologically-based Management of Arthropods in the Maize Agroecosystem”. This research used high performance computing resources provided by USDA-ARS SCINet (project number 0500–00093-001–00-D).
Author information
Authors and Affiliations
Contributions
Y.W. conceived the study, performed experimental treatments, and collected samples. Y.W. and B.S.C designed the experiments; B.S.C. carried out data analysis and drafted the manuscript; D.G obtained funding and supplied genetically modified soybean; Y.Y. and X.Q cultivated genetically modified soybean.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Research was performed according to guidelines for good scientific conduct. All experiments were performed following relevant guidelines and adhering to Regional Transgenic Crop Assessment Standard (DB22/T2556-2016) [165].
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Supplementary Figure S1a-c.
Additional file 2: Supplementary Figure S2.
NucmerSummaryStats
Additional file 3: Supplementary Figure S3.
RNAseqPreProc_MeanVarPlots
Additional file 4: Supplementary Figure S4.
RNAseq_Normalization
Additional file 5: Supplementary Figure S5.
APNphylogeny
Additional file 7: Supplementary Table S2.
ScaffoldAssignments
Additional file 8: Supplementary Table S3.
Updated_Nucmer_show-coords_LglyVsCydSple
Additional file 9: Supplementary Table S4.
NucmerSummaryV2
Additional file 10: Supplementary Table 5a-e.
DE gene testing statistics+13902+Pfam+GO_05222023
Additional file 11: Supplementary Table S6.
DE transcripts testing statistics
Additional file 12: Supplementary Table S7.
GOsAllLevels_Significant.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Wang, Y., Yao, Y., Zhang, Y. et al. A chromosome-level genome assembly of the soybean pod borer: insights into larval transcriptional response to transgenic soybean expressing the pesticidal Cry1Ac protein. BMC Genomics 25, 355 (2024). https://doi.org/10.1186/s12864-024-10216-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12864-024-10216-2