Transcriptome assembly and microarray construction for Enchytraeus crypticus, a model oligochaete to assess stress response mechanisms derived from soil conditions
© Castro-Ferreira et al.; licensee BioMed Central Ltd. 2014
Received: 26 August 2013
Accepted: 17 April 2014
Published: 23 April 2014
The soil worm Enchytraeus crypticus (Oligochaeta) is an ecotoxicology model species that, until now, was without genome or transcriptome sequence information. The present research aims at studying the transcriptome of Enchytraeus crypticus, sampled from multiple test conditions, and the construction of a high-density microarray for functional genomic studies.
Over 1.5 million cDNA sequence reads were obtained representing 645 million nucleotides. After assembly, 27,296 contigs and 87,686 singletons were obtained, from which 44% and 25% are annotated as protein-coding genes, respectively, sharing homology with other animal proteomes. Concerning assembly quality, 84% of the contig sequences contain an open reading frame with a start codon while E. crypticus homologs were identified for 92% of the core eukaryotic genes. Moreover, 65% and 77% of the singletons and contigs without known homologs, respectively, were shown to be transcribed in an independent microarray experiment. An Agilent 180 K microarray platform was designed and validated by hybridizing cDNA from 4 day zinc- exposed E. crypticus to the concentration corresponding to 50% reduction in reproduction after three weeks (EC50). Overall, 70% of all probes signaled expression above background levels (mean signal + 1x standard deviation). More specifically, the probes derived from contigs showed a wider range of average intensities when compared to probes derived from singletons. In total, 522 significantly differentially regulated transcripts were identified upon zinc exposure. Several significantly regulated genes exerted predicted functions (e.g. zinc efflux, zinc transport) associated with zinc stress. Unexpectedly, the microarray data suggest that zinc exposure alters retro transposon activity in the E. crypticus genome.
An initial investigation of the E. crypticus transcriptome including an associated microarray platform for future studies proves to be a valuable resource to investigate functional genomics mechanisms of toxicity in soil environments and to annotate a potentially large number of lineage specific genes that are responsive to environmental stress conditions.
KeywordsEcotoxicogenomics Next-generation pyrosequencing Invertebrate Zinc Annelid 454 sequencing
The ecologically relevant soil-living worm Enchytraeus crypticus (Annelida; Oligochaeta; Enchytraeidae) has been widely used as model species in soil ecotoxicology for over two decades, both under International Organization for Standardization (ISO) and Organization for Economic Co-operation and Development (OECD) guidelines [1–4]. Ecotoxicological tests generally focus on survival and reproduction endpoints, i.e. organism effects, whereas in ecotoxicogenomics, responses at the transcriptome level are studied aiming to reconstitute the mechanisms preceding effects at organismal and/or population level (e.g. reproduction). In-depth information about the molecular mechanisms is provided allowing a better understanding of molecular pathways activated by toxicity, which will not be possible when using the classical tests alone .
Previously, transcript information with classical Sanger sequencing was generated for the test species Enchytraeus albidus and has been very useful to further explore the response mechanisms of enchytraeids to various stressors, such as pesticides , heavy metals, nanoparticles , and soil properties . However, the drawback of the current E. albidus microarray is that it contains a low number of transcripts, so a full transcriptomic assessment of stress response cannot be executed for this model. Thanks to the progress in sequencing technology, transcriptome and genome information can now be obtained for any non-genomic model species . Transcriptome sequencing is of special interest, since it is directly targeted to obtain sequence information from transcribed regions in the genome [11, 12]. To pursue a comprehensive transcriptome of an enchytraeid test species we selected E. crypticus instead of the more widely used E. albidus, despite the seemingly lower sensitivity to toxicants  of this species. Several advantages over the use of E. albidus have been recently identified, such as easy culturing conditions in agar media, higher reproductive rate, lack of cryptic speciation, and a shorter reproduction cycle (3 weeks instead of 6) .
Given the clear advantages of using E. crypticus for soil quality assessment, the study presented here aims at studying its transcriptome by de novo transcriptome pyrosequencing and annotation. A previous transcriptome study showed that it is beneficial to pool RNA from multiple treatments in order to obtain a wider range of expressed transcript sequences . Thus, we decided to set up a similar approach to get transcriptome-wide information from E. crypticus and sampled mRNA from multiple exposure conditions, including chemical , physicochemical (moisture, temperature, pH) and developmental stages. The quality of the assembled transcriptome was critically assessed according to the guideline provided by Gibson et al.  by evaluating sequence quality, assembly quality, overall transcriptome completeness, and transcriptional activity. Subsequently, a high-density microarray platform was constructed and validated using zinc-exposed organisms. This platform will be a valuable resource to conduct gene expression studies in future ecotoxicogenomic research.
Test organism culturing conditions and experiment overview
The annelid Enchytraeus crypticus (Oligochaeta: Enchytraeidae), commonly known as potworm, has been cultured for several years in laboratory at the VU University Amsterdam. Cultures are kept in agar substrate prepared with aqueous soil extract, fed ad libitum with ground oatmeal, at 16°C with 75% relative humidity and 16/8 h light/dark photoperiod [1, 3, 4]. For this study we selected, for each biological replicate per exposure, 5–20 Enchytraeus crypticus adults with developed clitella, similar size, and an average fresh weight of 1.4 (±0.2) mg per adult. In total, 40 different experimental conditions were included: 12 chemical exposures 12 temperature treatments, 5 soil moisture treatments, 3 pH treatments, 4 different developmental stages, plus associated control conditions (Additional file 1).
LUFA 2.2 soil is a loamy sand soil sieved at 2 mm with pH (CaCl2) 5.5, cation exchange capacity (CEC) 10 meq/100 g, maximum water-holding capacity (WHCmax) of 46%, organic matter content of 4%, grain size distribution of 6% clay, 14% silt and 80% sand. For testing, control moisture was adjusted to 50% of the WHCmax.
The OECD artificial soil was prepared according to the standard procedure  and further used to test various pHs adjusting it by adding predetermined amounts of calcium carbonate (CaCO3).
Toxicity tests were performed following the standard guidelines [1, 4]. Chemical test concentrations were based on previous results  from which the 3-week reproduction EC50 was estimated. This EC50 was used as a reference exposure concentration in the short-term (48 h) experiments from which the mRNA pool was derived. Test chemicals (Additional file 2) included cadmium chloride hemipentahydrate (Cd), carbendazim (Car), phenanthrene (Phe), pentachloroaniline (PCA) and 3,5-dichloroaniline (DCA). Chemical formulas, chemical abstract service (CAS) numbers and concentrations are described in Additional file 2. Non-water soluble chemicals were solubilized in acetone. For the acetone control and acetone chemical spiking the following was performed: 25 gr of dry LUFA 2.2 soil was spiked with acetone (10 mL; CAS No. 67-64-1; purity ≥ 99.8%; supplied by Sigma Aldrich) or the appropriate organic-acetone solution for Phe, Car, PCA and DCA, equilibrated for 24 h, left to evaporate overnight, mixed with dry soil (75 g) and homogeneously moistened to 50% of the WHCmax. The soil batches were used immediately after being prepared. For the metal exposures (Additional file 1), dry LUFA 2.2 soil (100 g) was spiked with the specific metal-aqueous solution (10 mL) to reach the nominal concentration and 50% of the WHCmax in the soil batch. The soil was kept under standard conditions while equilibrating for 21 days prior to exposure. Two replicates were prepared per test condition (Additional file 1).
Per replicate, twenty adult E. crypticus were introduced into each glass vial (100 mL) containing 30 g of soil prepared as described above. Food (2 mg) was supplied and vessels were closed with perforated aluminum foil. The exposures lasted 48 h at 20°C, with 75% relative humidity and 16/8 h light/dark photoperiod. After 48 h of exposure, the individuals in each replicate were sampled, carefully rinsed in deionised water, transferred to a cryotube, snap-frozen with liquid nitrogen and stored at -80°C.
Zinc (Zn) exposures were performed in order to validate the microarray platform. The EC50 for inhibition of reproduction was used (De Boer et al., unpublished data). For the microarray analysis, E. crypticus adults were exposed to 145 mg Zn/kg soil (EC50) and control soil for 4 days using four replicates, containing 5–7 individuals per replicate.
Temperature, moisture and pH treatments were performed according to the guidelines, with standard test conditions as described in the previous paragraph, except for the specific factor being studied (Additional file 1). The temperature exposures lasted 0.5, 24, 48 or 96 h, at the temperatures -10, 0, 10, 20, 30 and 40°C. The moisture exposures lasted 24, 48 or 96 h, using soils moistened to 20, 50 and 90% of WHCmax. The pH exposures lasted 48 h, using soils with pH 3, 4.5 and 6.
Enchytraeus crypticus samples in distinct developmental stages were taken from synchronization of cultures, which were obtained by transferring thirty adults with developed clitella to fresh culture boxes that were kept at identical culturing conditions as the source culture. Every 6 hours, cocoons were collected (20 and 50 cocoons per 6 hour time-point), immediately snap-frozen with liquid nitrogen and stored at -80°C. Twenty synchronized cocoons were carefully transferred into fresh culture media and kept at 20°C. After twenty days twenty juveniles were collected into a cryotube, immediately snap-frozen with liquid nitrogen and stored at -80°C.
RNA isolation, RNA pool and cDNA library construction
Total RNA was collected from the 90 individual samples of Enchytraeus crypticus as listed in Additional file 1. All RNA extractions were performed using the SV Total RNA Isolation System (Promega) following manufacturer’s protocol. All samples were homogenized with liquid nitrogen in the cryotube before RNA extraction. Addition of sterilized quartz sand to the liquid nitrogen was needed to disrupt and homogenize cocoons sufficiently.
Transcriptome sequencing and assembly
Cytochrome oxidase I and 18S rRNA gene sequencing of 10 individuals yielded only one polymorphic site (Roelofs et al., unpublished data), providing strong evidence that this strain is highly inbred. Thus, the assembly procedure is not hampered by the presence of genetic variation within the cDNA library. This library was submitted to one full-plate sequencing run using a 454 Roche GS FLX pyrosequencing instrument using Titanium chemistry (454 Life Sciences Corporation), following manufacturer’s protocol and methods previously described . After 454 sequencing, the generated sequence reads were assembled using Newbler (software v. 2.5.3 from 454 Life Sciences Corporation) in de novo mode and default parameters.
Transcriptome annotation was performed through the ISGA transcriptome analysis pipeline, available from the Center for Genomics and Bioinformatics at Indiana University (http://isga.cgb.indiana.edu/Pipeline/View?pipeline=248). First, sequence homology to known metazoan proteins was obtained by submitting contigs and singletons to BLASTx searches against NCBI’s non-redundant database and dbEST . Moreover, protein domains were identified among the 6 frame translations of the assembled sequences using Pfam, TIGRfam and HMMER3 searches. Open reading frames were determined with the ORFpredictor software on the proteomics server of Youngstown state university . Orthology and paralogy was assigned by BLASTx against orthoMCL databases . Assembly quality was tested by retrieving BLASTx hits against the Drosophila orthologs in the CEGMA core eukaryotic genes dataset . Transcriptome quality assessment was performed according to the guidelines described by Gibson et al. . In short, open reading frame (ORF) lengths, EST nucleotide lengths and GC contents of annotated and unannoted ESTs were compared using a Welch’s t-test in R version 2.15. Furthermore, transcript abundance (microarray intensity) was compared between contigs and singletons using Welch’s t-test. Finally, the interactive pathway explorer iPATH was used to identify metabolic, regulatory and biosynthesis pathways based on enzyme code (EC) numbers .
Probe design for microarray construction
Oligonucleotide probes were designed according to the thermodynamics/technical features of gene expression microarrays within the eArray webtool of Agilent (Agilent Technologies). The workflow is summarized into these consecutive steps: the target sequences were selected by length threshold ≥300 bases, converted into sense orientation, and then submitted to eArray for probe design. In order to avoid cross-hybridization, the probes that could cross-hybridize with any sequence other than the specific target (indicated by a probe sequence similarity of >90%), were discarded. A final microarray design was created from the probe group (86,412 probes) that were printed in duplicate (one probe per contig/singleton represented twice) onto a 4 X188 K platform. The array design was supplemented with 4,854 additional Agilent control probes (random probes for background correction plus spike-in positive control probes), so that in total 177,678 probes were present.
Microarray validation and data analysis
RNA samples were extracted from control and EC50 Zn-exposed E. crypticus adults. The RNA was then precipitated with ethanol to remove any Guanidine Thiocyanate traces from the SV RNA extraction procedure and to increase its concentration. RNA concentration and quality was determined on the Nanodrop nd1000 spectrophotometer (Thermo Scientific) and Bioanalyzer 2100 (Agilent Technologies). RNA samples were labeled using the Low Input Quick Amp kit (Agilent Technologies) according to the manufacturer’s instructions and as described . Four replicates were labeled using Cy-3/Cy-5 dye swaps. Labeled samples were hybridized to the microarray as described above. Microarrays were scanned using an Agilent microarray scanner and raw data was extracted with Feature Extraction software version 10.7.3 (Agilent Technologies). Microarray raw intensities were LOESS normalized with a background subtraction according to the Edwards method which contained a minimal offset of 30 [23, 24]. Significant differential expression of genes was determined by directly contrasting the control signal to the EC50 exposure samples and fitting a linear model through the data all using the Limma package for R/Bioconductor  after which pP-values were adjusted for multiple testing according to the Benjamini-Hochberg method. A Gene Set Enrichment analysis was performed in Blast2GO using a Fisher’s exact test again using a p-value correction with the false discovery rate (FDR) method by Benjamini-Hochberg .
Results and discussion
Transcriptome sequencing and assembly
Sequencing results and assembly statistics (using Newbler) for E. crypticus transcriptome sequences
Number of reads1
Number of bases
Number of sequences
Number of bases
Average sequence size (bp)
Length of N50 sequence (bp)2
Largest sequence size (bp)
In the present study the largest singleton was 1,195 bp long. The largest contig was 6,693 bp long and 13,063 contigs were over 1,000 bp long. N50 is a weighted median statistic, such that 50% of all bases are contained in sequences not inferior than the N50 length. The N50 for contig length was 1,354 bp. The assembly procedure using Newbler considers alternative splicing, and this resulted in the integration of 27,296 contigs into 24,748 isotigs representing candidate transcripts. Furthermore, the assembler indicated that the isotigs are represented by 21,721 isogroups, defined as candidate gene sequences. The N50 for isotigs was 1,443 bp and the largest isotig was 7,689 bp long. The largest contig count was 13 per single isotig and 28 contigs per single isogroup. Moreover 80% of isotigs and 90% of isogroups comprised one single contig.
Quality control of transcriptome completeness
Comparison of annotation success between contigs and singletons
Mean ORF length
Containing start codon (%)
Mean GC-content (%)
Functional annotation of the E. crypticustranscriptome
Summary of E. crypticus transcriptome annotation
With GO terms
With mapped GO terms
With enzyme codes
No. GO terms for…
Most singletons and contigs showed high similarity with invertebrate deuterostomes such as: the chordate Branchiostoma floridae (lancelet, GenBank Taxonomy ID: 7739), the hemichordate acorn worm Saccoglossus kowalevskii (ID: 10224), and the purple sea urchin Strongylocentrotus purpuratus (ID: 7668) (Figure 3). BLASTx top-hits for 1,810 singletons and 1,428 contigs were obtained by comparison with B. floridae, while top-hits for 1,457 singletons and 1,298 contigs were retrieved by comparison with S. kowalevskii. The large number of matches between E. crypticus and chordates confirms the suggestion that the lophotrochozoan gene complement represents a rather complete bilaterian genome and that relatively few genes were lost in comparison to ecdysozoan genomes [30, 31]. This was recently confirmed by investigating the syntheny in three spiralian genomes (molluscs and annelids). They showed that the analysed worm and snail genomes are more similar to invertebrate deuterostomes when considering macrosynteny (large chromosomal regions), microsynteny (a group of 3 or more genes whose orthologous are linked in two or more genomes) of 469 gene clusters including the Hox gene complex , despite the fact that molluscs and annelids are evolutionary more closely related to protostomes such as arthropods, nematodes and flatworms.
Additional file 6 illustrates the general distribution of GO terms across the directed acyclic graph (DAG) structure at level 4, which is based on the percentages of mapped sequences for singletons and contigs with a cutoff at 7%. In total, thirty GO terms represented the most abundant annotation of singletons and contigs. When contigs are compared to singletons, we observe that GO term distribution is balanced with regard to biological process as well as cellular compartment. In contrast, GO terms related to the molecular functions cation binding and nucleotide binding are clearly more present within contigs, while GO terms related to the molecular functions protein binding, small molecule binding, hydrolase activity, ion binding, transferase activity and nucleic acid binding are more present within singletons. This suggests that certain molecular functions may be associated with transcript length, but further research should provide more detailed information on this.
Finally, the enzyme numbers, retrieved from the annotation of contigs and singletons, were submitted to iPATH interactive pathway explorer to verify to what extent metabolic pathways and biosynthesis pathways are represented in the transcriptome. As shown in Additional file 7, the present E. crypticus transcriptome comprises most of all essential metabolic pathways. Metabolism for terpenoids is not represented as such pathways are exclusive to plants.
Microarray validation and data analysis
Unknown sequences may include intronic regions or untranscribed sequences that remained in the library despite DNAse I treatment followed by reverse transcription and cDNA synthesis. If this is the case, probes derived from such sequences do not represent functional genes and will not show a hybridization signal. Remarkably, 77% of probes derived from unknown contigs and 66% probes derived from unknown singletons showed detectable levels of transcription. This suggested that most unannotated sequences belong to genomic regions that are being transcribed.
Additional file 8 shows MA plots of all probes (A), contigs (B) and singletons (C). Log2 average intensity values range between 4.9 and 18, which is very well in line with average microarray hybridization data . The fact that M is independent of A over a large range demonstrates sufficient linearity of the responses over a large dynamic range of microarrays signal intensity. Still, a substantial collection of probes seem to show very low average intensity values. As mentioned above, these low intensity probes were most abundant among singletons.
Exposure to zinc resulted in 522 significantly regulated transcripts (Benjamini-Hochberg adjusted p < 0.05), of which 210 are annotated (Additional file 9). Figure 4B represents the volcano plot of this experiment. Among these transcripts, two were annotated as zinc efflux proteins and are both up-regulated between 1.4 and 8.8 fold (contig19147 and singleton GYUW2K402JE6WU). Additionally, a zinc transporter 10 was identified (contig24978), which was 4.4 fold up-regulated. Also, a zinc cadmium resistance protein (singleton GYUW2K401AMILL) was 1.6 fold up-regulated. These up-regulated transcripts are associated with zinc stress, indicating that an active efflux mechanism is induced aiming to remove excess zinc ions [33, 34].
GO term enrichment analysis from zinc exposure
SRP-dependent cotranslational protein targeting to membrane
Cotranslational protein targeting to membrane
Protein targeting to ER
Establishment of protein localization in endoplasmic reticulum
Protein localization in endoplasmic reticulum
Protein targeting to membrane
Cytosolic large ribosomal subunit
Viral infectious cycle
Establishment of protein localization to organelle
Viral reproductive process
Nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
Large ribosomal subunit
Viral genome expression
Cellular protein complex disassembly
Nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
Protein complex disassembly
mRNA catabolic process
Unexpectedly, five GO terms representing biological processes associated with viral transcription and reproduction were significant in the enrichment analysis. Closer inspection of the transcripts from these GO terms showed that most of these genes are associated with transposon activity (e.g. up-regulated singleton transcript GYUW2K402IXZSO, RNA-directed DNA polymerase from mobile element jockey). Jockey is a Non-Long Terminal Repeat retrotransposon (N-LTRs), commonly found in invertebrates and amplifies via an RNA intermediate . The autonomous retrotransposition mechanism of N-LTRs is facilitated by a single open reading frame containing polymerases with zinc-finger domains for nucleic acid binding . Apparently, zinc exposure inactivates retrotransposition, since all these transcripts are down-regulated (Additional file 9). Our results suggest that excess zinc ions may impair proper nucleic acid binding facilitated by the zinc-finger domains of the N-LTR open reading frame. The final three processes among the GO-enriched terms (Table 4) are involved in RNA catabolism.
Finally, we compared the E. crypticus transcriptome to transcript sequences of the sister species E. albidus available in Genbank and EnchyBASE  by using BLASTx. A total of 2100 E. albidus sequences clustered into 1124 unique gene clusters, of which 856 showed highly significant BLASTx similarities to contigs in the E. crypticus transcriptome. Novais et al.  studied the impact of Cadmium and Zinc on E. albidus and also included a 4 day EC50 zinc treatment. Although this microarray study only assessed a limited amount (1124 sequences) of E. albidus sequences we were able to retrieve 10 orthologous transcripts that show highly comparable significant gene regulation upon Zn exposure. Two of the ten transcripts were cysteine rich proteins, which contained metallothionein-6 clusters denoting putative metallothionein genes. Other genes of interest are tropomyosin-2 and calcium binding alpha-actinin, which are both involved in the cell-skeleton integrity. Moreover, homologs of several gene families show comparable significant regulation in E. albidus as well as E. crypticus Zn exposure studies. Seven members of the ribosomal protein family are significantly up regulated in E. crypticus, while two ribosomal protein genes are up regulated in E. albidus. Also, both species show up regulation of actin, glycine N-methyltransferase and nicotinic acetylcholine receptor alpha subunit precursor homologs (Additional file 9, ). This indicates that substantial degree of conservation of transcriptional regulation in response to Zn stress can be observed among related species.
In conclusion we showed that the current microarray platform is functional in terms of identification of transcripts associated with toxicity. Zinc-exposed animals were not included in the initial RNA pool. Still, we identified differentially expressed genes responding to zinc toxicity, which were identified in previous biochemical studies to be important in zinc detoxification. Additionally, microarray data suggest that transposon activity is altered by zinc toxicity, which are highly interesting avenues for further (functional) research. However, this requires a more elaborate experimental design that addresses developmental programming, dose–response measurements and inter-individual variation. Also. we deduce from the scientific literature that expression microarrays are nowadays more and more supplanted by RNASeq analysis. According to Robbens et al.  lack of standardization and cross-platform validation prevents acceptance of genomic data integration into risk assessment. However, more recently, the microarray quality control consortium provided such standardization and cross-validation data specifically for microarray platforms [41, 42]. This led to acceptance of this tool in clinical and pharmaceutical testing, while RNAseq technology currently lacks such high level of standardization. It also initiated case studies that successfully integrated microarray data into environmental risk assessment [43, 44], which indicates that microarray analysis will remain an important tool, especially in 21st century risk assessment practice.
Availability of supporting data
Raw sequence read data were submitted to the NCBI Sequence Read Archive under accession number [SRP024387]. The assembled E. crypticus contigs were submitted to the NCBI Transcriptome Shotgun Assembly (TSA) Sequence Database under the TSA accession number [GALF01000000]. Raw microarray data and its MIAME compliant metadata were deposited at NCBI Gene-Expression Omnibus under accession number [GSE51838] and platform ID [GPL17851]. All three datasets are organized under Bio-project [PRJNA207507].
Thanks to Dr. K. Mokaitis, Dr J.-H. Choi, Z. Smith, J. Ford, A. Buechlein at The Centre for Genomics and Bioinformatics, Indiana University, for sequencing and bioinformatics; Janine Mariën and Rudo A. Verweij at VU University Amsterdam, for technical support in the lab. This study was financially supported by FEDER funding through COMPETE Programa Operacional Factores de Competitividade, and by Portuguese funding through Fundação para a Ciência e Tecnologia (FCT), within the research project NANOkA FCOMP-01-0124-FEDER-008944 (Ref. FCT PTDC/BIA-BEC/103716/2008), and through the FCT PhD grant to M.P. Castro-Ferreira (SFRH/BD/46759/2008). The Ecogenomics consortium “Ecolinc” research project was funded by Netherlands Organisation for Scientific Research (NWO). Additional support was provided by the METACyt Initiative of Indiana University, funded in part through a major grant from the Lilly Endowment, Inc.
- Guidelines for Testing of Chemicals – Enchytraeid Reproduction Test. OECD 220. 2004, Paris: The Organisation for Economic Co-operation and Development
- Didden W, Römbke J: Enchytraeids as indicator organisms for chemical stress in terrestrial ecosystems. Ecotoxicol Environ Saf. 2001, 50: 25-43. 10.1006/eesa.2001.2075.PubMedView ArticleGoogle Scholar
- Castro-Ferreira MP, Roelofs D, van Gestel CAM, Verweij RA, Soares AMVM, Amorim MJB: Enchytraeus crypticus as model species in soil ecotoxicology. Chemosphere. 2012, 87: 1222-1227. 10.1016/j.chemosphere.2012.01.021.PubMedView ArticleGoogle Scholar
- Soil Quality — Effects of Pollutants on Enchytraeidae (Enchytraeus sp.) – Determination of Effects on Reproduction and Survival. ISO 16387. 2004, Geneva: International Standards Organization
- van Straalen N, Roelofs D: Genomics technology for assessing soil pollution. J Biol. 2008, 7: 19-10.1186/jbiol80.PubMed CentralPubMedView ArticleGoogle Scholar
- Novais SC, Arrais J, Lopes P, Vandenbrouck T, De Coen W, Roelofs D, Soares AMVM, Amorim MJB: Enchytraeus albidus microarray: enrichment, design, annotation and database (EnchyBASE). PLoS One. 2012, 7: e34266-10.1371/journal.pone.0034266.PubMed CentralPubMedView ArticleGoogle Scholar
- Novais SC, De Coen W, Amorim MJB: Gene expression responses linked to reproduction effect concentrations (EC10,20,50,90) of dimethoate, atrazine and carbendazim, in enchytraeus albidus. PLoS One. 2012, 7: e36068-10.1371/journal.pone.0036068.PubMed CentralPubMedView ArticleGoogle Scholar
- Gomes SIL, Novais SC, Scott-Fordsmand JJ, De Coen W, Soares AMVM, Amorim MJB: Effect of Cu-nanoparticles versus Cu-salt in enchytraeus albidus (Oligochaeta): differential gene expression through microarray analysis. Comp Biochem Phys C. 2012, 155: 219-227.Google Scholar
- Gomes SIL, Novais SC, Soares AMVM, Amorim MJB: Effects of soil properties and time of exposure on gene expression of Enchytraeus albidus (Oligochaeta). Soil Biol Biochem. 2011, 43: 2078-2084. 10.1016/j.soilbio.2011.06.006.View ArticleGoogle Scholar
- Cahais V, Gayral P, Tsagkogeorga G, Melo-Ferreira J, Ballenghien M, Weinert L, Chiari Y, Belkhir K, Ranwez V, Galtier N: Reference-free transcriptome assembly in non-model animals from next-generation sequencing data. Mol Ecol Res. 2012, 12: 834-845. 10.1111/j.1755-0998.2012.03148.x.View ArticleGoogle Scholar
- Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH: Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol. 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.x.PubMedView ArticleGoogle Scholar
- Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA: Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics. 2010, 11: 180-10.1186/1471-2164-11-180.PubMed CentralPubMedView ArticleGoogle Scholar
- Timmermans MJTN, de Boer ME, Nota B, de Boer TE, Marien J, Klein-Lankhorst RM, van Straalen NM, Roelofs D: Collembase: a repository for springtail genomics and soil quality assessment. BMC Genomics. 2007, 8: 341-10.1186/1471-2164-8-341.PubMed CentralPubMedView ArticleGoogle Scholar
- Gibson AK, Smith Z, Fuqua C, Clay K, Colbourne JK: Why so many unknown genes? Partitioning orphans from a representative transcriptome of the lone star tick Amblyomma americanum. BMC Genomics. 2013, 14: 135-10.1186/1471-2164-14-135.PubMed CentralPubMedView ArticleGoogle Scholar
- Meyer E, Aglyamova GV, Wang S, Buchanan-Carter J, Abrego D, Colbourne JK, Willis BL, Matz MV: Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx. BMC Genomics. 2009, 10: 219-10.1186/1471-2164-10-219.PubMed CentralPubMedView ArticleGoogle Scholar
- Margulies M, Egholm M, Altman W, Attiya S, Bader J, Bemben L, Berka J, Braverman M, Chen Y, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, et al: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.PubMed CentralPubMedGoogle Scholar
- Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralPubMedView ArticleGoogle Scholar
- Min XJ, Butler G, Storms R, Tsang A: OrfPredictor: predicting protein-coding regions in EST-derived sequences. Nucleic Acids Res. 2005, 33: W677-W680. 10.1093/nar/gki394.PubMed CentralPubMedView ArticleGoogle Scholar
- Li L, Stoeckert CJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13: 2178-2189. 10.1101/gr.1224503.PubMed CentralPubMedView ArticleGoogle Scholar
- Parra G, Bradnam K, Korf I: CEGMA: a pipeline to accurately annotate core genes in eukaryotic genornes. Bioinformatics. 2007, 23: 1061-1067. 10.1093/bioinformatics/btm071.PubMedView ArticleGoogle Scholar
- Yamada T, Letunic I, Okuda S, Kanehisa M, Bork P: iPath2.0: interactive pathway explorer. Nucleic Acids Res. 2011, 39: W412-W415. 10.1093/nar/gkr313.PubMed CentralPubMedView ArticleGoogle Scholar
- Nota B, Verweij RA, Molenaar D, Ylstra B, van Straalen NM, Roelofs D: Gene expression analysis reveals a gene set discriminatory to different metals in soil. Toxicol Sci. 2010, 115: 34-40. 10.1093/toxsci/kfq043.PubMedView ArticleGoogle Scholar
- Edwards D: Non-linear normalization and background correction in one-channel cDNA microarray studies. Bioinformatics. 2003, 19: 825-833. 10.1093/bioinformatics/btg083.PubMedView ArticleGoogle Scholar
- Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP: Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 2002, 30: e15-10.1093/nar/30.4.e15.PubMed CentralPubMedView ArticleGoogle Scholar
- Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: 1-26.Google Scholar
- Conesa A, Gotz S, Garcia-Gomez J, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21: 3674-3676. 10.1093/bioinformatics/bti610.PubMedView ArticleGoogle Scholar
- Zdobnov EM, von Mering C, Letunic I, Torrents D, Suyama M, Copley RR, Christophides GK, Thomasova D, Holt RA, Subramanian GM, Mueller HM, Dimopoulos G, Law JH, Wells MA, Birney E, Charlab R, Halpern AL, Kokoza E, Kraft CL, Lai Z, Lewis S, Louis C, Barillas-Mury C, Nusskern D, Rubin GM, Salzberg SL, Sutton GG, Topalis P, Wides R, Wincker P, et al: Comparative genome and proteome analysis of anopheles gambiae and Drosophila melanogaster. Science. 2002, 298: 149-159. 10.1126/science.1077061.PubMedView ArticleGoogle Scholar
- Zdobnov EM, Apweiler R: InterProScan – an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17: 847-848. 10.1093/bioinformatics/17.9.847.PubMedView ArticleGoogle Scholar
- Chen F, Mackey AJ, Stoeckert CJ, Roos DS: OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res. 2006, 34: D363-D368. 10.1093/nar/gkj123.PubMed CentralPubMedView ArticleGoogle Scholar
- Miller DJ, Ball EE: The gene complement of the ancestral bilaterian - was Urbilateria a monster?. J Biol. 2009, 8: 89-10.1186/jbiol192.PubMed CentralPubMedView ArticleGoogle Scholar
- Simakov O, Marletaz F, Cho SJ, Edsinger-Gonzales E, Havlak P, Hellsten U, Kuo DH, Larsson T, Lv J, Arendt D, Savage R, Osoegawa K, de Jong P, Grimwood J, Chapman JA, Shapiro H, Aerts A, Otillar RP, Terry AY, Boore JL, Grigoriev IV, Lindberg DR, Seaver EC, Weisblat DA, Putnam NH, Rokhsar DS: Insights into bilaterian evolution from three spiralian genomes. Nature. 2013, 493: 526-531.PubMed CentralPubMedView ArticleGoogle Scholar
- Roelofs D, Morgan J, Stürzenbaum S: The significance of genome-wide transcriptional regulation in the evolution of stress tolerance. Evol Ecol. 2010, 24: 527-539. 10.1007/s10682-009-9345-x.View ArticleGoogle Scholar
- Persans MW, Nieman K, Salt DE: Functional activity and role of cation-efflux family members in Ni hyperaccumulation in Thlaspi goesingense. Proc Natl Acad Sci U S A. 2001, 98: 9995-10000. 10.1073/pnas.171039798.PubMed CentralPubMedView ArticleGoogle Scholar
- Palmiter RD, Cole TB, Findley SD: ZnT-2, a mammalian protein that confers resistance to zinc by facilitating vesicular sequestration. EMBO J. 1996, 15: 1784-1791.PubMed CentralPubMedGoogle Scholar
- Berg JM, Shi Y: The galvanization of biology: a growing appreciation for the roles of Zinc. Science. 1996, 271: 1081-1085. 10.1126/science.271.5252.1081.PubMedView ArticleGoogle Scholar
- Biémont C, Vieira C, Borie N, Lepetit D: Transposable elements and genome evolution: the case of Drosophila simulans. Genetica. 1999, 107: 113-120. 10.1023/A:1003937603230.PubMedView ArticleGoogle Scholar
- Heras SR, López MC, García-Pérez JL, Martin SL, Thomas MC: The L1Tc C-terminal domain from Trypanosoma cruzi non-long terminal repeat retrotransposon codes for a protein that bears two C2H2 zinc finger motifs and is endowed with nucleic acid chaperone activity. Mol Cell Biol. 2005, 25: 9209-9220. 10.1128/MCB.25.21.9209-9220.2005.PubMed CentralPubMedView ArticleGoogle Scholar
- Novais SC, De Coen W, Amorim MJ: Transcriptional responses in Enchytraeus albidus (Oligochaeta): comparison between cadmium and zinc exposure and linkage to reproduction effects. Environ Toxicol Chem. 2012, 31: 2289-2299. 10.1002/etc.1946.PubMedView ArticleGoogle Scholar
- Novais SC, Soares AMVM, De Coen W, Amorim MJB: Exposure of Enchytraeus albidus to Cd and Zn - changes in cellular energy allocation (CEA) and linkage to transcriptional, enzymatic and reproductive effects. Chemosphere. 2013, 90: 1305-1309. 10.1016/j.chemosphere.2012.09.030.PubMedView ArticleGoogle Scholar
- Robbens J, van der Ven K, Maras M, Blust R, De Coen W: Ecotoxicological risk assessment using DNA chips and cellular reporters. Trends Biotech. 2007, 25: 460-466. 10.1016/j.tibtech.2007.08.005.View ArticleGoogle Scholar
- Shi L, Campbell G, Jones WD, Campagne F, Wen Z, Walker SJ, Su Z, Chu TM, Goodsaid FM, Pusztai L, Shaughnessy JD, Oberthuer A, Thomas RS, Paules RS, Fielden M, Barlogie B, Chen W, Du P, Fischer M, Furlanello C, Gallas BD, Ge X, Megherbi DB, Symmans WF, Wang MD, Zhang J, Bitter H, Brors B, Bushel PR, Bylesjo M, et al: The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. Nat Biotechnol. 2010, 28: 827-838. 10.1038/nbt.1665.PubMedView ArticleGoogle Scholar
- Shi L, Reid LH, Jones WD, Shippy R, Warrington JA, Baker SC, Collins PJ, de Longueville F, Kawasaki ES, Lee KY, Luo Y, Sun YA, Willey JC, Setterquist RA, Fischer GM, Tong W, Dragan YP, Dix DJ, Frueh FW, Goodsaid FM, Herman D, Jensen RV, Johnson CD, Lobenhofer EK, Puri RK, Schrf U, Thierry-Mieg J, Wang C, Wilson M, Wolber PK, et al: The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat Biotechnol. 2006, 24: 1151-1161. 10.1038/nbt1239.PubMedView ArticleGoogle Scholar
- Roelofs D, de Boer M, Agamennone V, Bouchier P, Legler J, van Straalen N: Functional environmental genomics of a municipal landfill soil. Front Genet. 2012, 3: 85-PubMed CentralPubMedView ArticleGoogle Scholar
- Euling SY, Thompson CM, Chiu WHA, Benson R: An approach for integrating toxicogenomic data in risk assessment: the dibutyl phthalate case study. Toxicol Appl Pharm. 2013, 271: 324-335. 10.1016/j.taap.2013.03.013.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.