The potential role of Alu Y in the development of resistance to SN38 (Irinotecan) or oxaliplatin in colorectal cancer
- Xue Lin1,
- Jan Stenvang2,
- Mads Heilskov Rasmussen3,
- Shida Zhu4,
- Niels Frank Jensen2,
- Line S Tarpgaard5,
- Guangxia Yang4,
- Kirstine Belling6,
- Claus Lindbjerg Andersen3Email author,
- Jian Li1, 4, 7Email author,
- Lars Bolund†1, 4Email author and
- Nils Brünner†2Email author
© Lin et al.; licensee BioMed Central. 2015
Received: 9 August 2014
Accepted: 17 April 2015
Published: 22 May 2015
Irinotecan (SN38) and oxaliplatin are chemotherapeutic agents used in the treatment of colorectal cancer. However, the frequent development of resistance to these drugs represents a considerable challenge in the clinic. Alus as retrotransposons comprise 11% of the human genome. Genomic toxicity induced by carcinogens or drugs can reactivate Alus by altering DNA methylation. Whether or not reactivation of Alus occurs in SN38 and oxaliplatin resistance remains unknown.
We applied reduced representation bisulfite sequencing (RRBS) to investigate the DNA methylome in SN38 or oxaliplatin resistant colorectal cancer cell line models. Moreover, we extended the RRBS analysis to tumor tissue from 14 patients with colorectal cancer who either did or did not benefit from capecitabine + oxaliplatin treatment. For the clinical samples, we applied a concept of ‘DNA methylation entropy’ to estimate the diversity of DNA methylation states of the identified resistance phenotype-associated methylation loci observed in the cell line models. We identified different loci being characteristic for the different resistant cell lines. Interestingly, 53% of the identified loci were Alu sequences- especially the Alu Y subfamily. Furthermore, we identified an enrichment of Alu Y sequences that likely results from increased integration of new copies of Alu Y sequence in the drug-resistant cell lines. In the clinical samples, SOX1 and other SOX gene family members were shown to display variable DNA methylation states in their gene regions. The Alu Y sequences showed remarkable variation in DNA methylation states across the clinical samples.
Our findings imply a crucial role of Alu Y in colorectal cancer drug resistance. Our study underscores the complexity of colorectal cancer aggravated by mobility of Alu elements and stresses the importance of personalized strategies, using a systematic and dynamic view, for effective cancer therapy.
Colorectal cancer is a common and often lethal disease . FOLFIRI (folinic acid, 5-fluorouracil and irinotecan) , FOLFOX (folinic acid, 5-fluorouracil and oxaliplatin)  and XELOX (capecitabine and oxaliplatin)  are commonly used chemotherapeutic combinations used to treat colorectal cancer. However, a considerable subpopulation of patients will experience disease recurrence due to acquired resistance to treatment. The molecular mechanisms underlying acquired resistance to these drugs remain elusive.
Cancer cells usually harbour numerous genomic and epigenomic aberrations, thereby presenting high diversities of genotypes and phenotypes as well as cell fate dynamics. For somatic cells, cell fate is well defined and stably maintained by epigenetic mechanisms. DNA methylation is a long-term stable epigenetic mechanism. Additionally, DNA methylation represses the activity of mobile genetic elements and maintains genome integrity.
The concept ‘eukaryotic genomes are dynamic’ has been well accepted  since mobile genetic elements were first discovered in maize . One remarkable feature in the human genome is that the DNA consists of at least 45% mobile genetic elements, including short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and long terminal repeats (LTRs) . LINE-1 (L1) is a predominant member of LINEs and Alu is the largest family of SINEs in the human genome . Moreover, L1s are the only autonomous retrotransposable element in the human genome. Alus are non-autonomous retrotransposable elements, which depend on an L1 coded protein ORP2 (endonuclease and reverse transcriptase) to mediate their mobility. Alus are primate-specific sequences sharing a typical 282-nucleotide consensus sequence and a characteristic structure . There are more than one million Alu family members, constituting 11% of human DNA . The members are ubiquitously dispersed throughout the genome but preferentially overrepresented in GC-rich and high gene density regions . It is thought that about 75% of the total number of genes in the genome are associated with Alus . The presence of Alu sequences is strongly correlated with multifractality in human genome sequences . Alu elements are also associated with more than 25% of all the simple repetitive sequences in primate genomes, including microsatellites . It is reported that Alu and L1 initiate the spread of CpG methylation, and the length of CpG islands is associated with the distribution of Alu and L1 retrotransposons . Moreover, Alu elements are supposed to act as global modifiers of gene expression through changes in their own methylation state . It is estimated that there are about 80-100 active L1s and 2000-3000 active Alus in the human genome per individual [13,14]. Mobilization of Alus mainly occurs during the production of gametes or at early stages of embryo development . In contrast to germ line retrotransposition, the activity of Alus and other mobile genetic elements in somatic cells is mostly silenced by DNA methylation and post-transcriptional mechanisms mediated by piwi-interacting RNAs, siRNAs, miRNAs and AID/APOBEC gene family members [5,15-18]. However, genomic toxicity induced by carcinogens or drugs can reactivate Alus by altering DNA methylation . Accordingly, Alu and L1 have been shown to display DNA methylation alterations in colorectal cancers compared with matched normal tissues [20,21]. Additionally, Alu elements pose the largest transposon-based mutagenic threat to the human genome . A recent study, which intensively sequenced 43 cancer and matched germ line genomes, revealed that colorectal cancers and other cancers of epithelial cell origin show activity of somatic L1 and Alu transpositions .
Among Alu sequences, the Alu Y subfamily is the youngest Alu sequence  with an evolutionary age of ~15-20 million years (Mya) . Even though the copy number of Alu Y (~125,000 copies) is less than that of Alu S (550,000 copies, at evolutionary age ~40-50 Mya ) and Alu J (~160,000 copies, at evolutionary age ~55 Mya ), the Alu Y subfamily harbours the largest number of functionally intact Alu core elements that are more active than the older Alus [14,24]. Activation of Alus can have many important biological consequences: Alus can reshuffle the genome, generating transposon-mediated mutagenesis , inducing genomic instability , and increasing recombination between elements , thereby contributing to genetic population diversity [8,27] as well as to heterogeneity in tumorigenesis. Alus can also remodel the epigenome and alter gene expression patterns by changing epigenetic marks of neighbouring genes at new insertion sites, introducing ectopic promoters of transcription factor binding sites, and generating novel alternative splicing. Integration of Alu sequences and subsequent remodelling of DNA methylation might lead to epigenetic reprogramming  as well as pluripotency induction and maintenance by A-to-I RNA editing of Alu sequences . Whether Alu retrotransposition occurs during chemotherapy with SN38 or oxaliplatin, and thereby plays a potential role in the development of chemotherapy resistance, remains unknown.
We hypothesized that development of drug resistance in colorectal cancer follows a linear step-wise progressive model and in the present study, we applied reduced representation bisulfite sequencing (RRBS) assay to analyse the DNA methylome from 3 established SN38-resistant and 3 established oxaliplatin-resistant human colorectal cancer cell line models . Our results indicate a potential role of Alu elements, especially the Alu Y subfamily, in the resistance to SN38 and oxaliplatin. To validate the findings from the cell line models, we extended our RRBS analysis to 14 clinical colorectal cancer samples. Based on the analyses of the cell lines and clinical samples, we have attempted to delineate the influence of altered DNA methylation on activation of retrotransposons as a model for colorectal cancer chemotherapy resistance.
Global methylome and non-CpG methylation in the cell line models and clinical samples
We applied the QDMR software  with a concept of ‘DNA methylation entropy’ adopted from the ‘Shannon entropy’ , to identify differentially methylated cytosines (DMCs) by estimating variability of DNA methylation states between all colorectal cancer cells and clinical samples. DMCs in all samples in the context of CpG as well as CHG and CHH (where H means A, T or C), were identified. Unsupervised clustering using DMCs in the context of CpG, CHG and CHH were performed (Additional file 1: Figure S1A, 1B and 1C). The drug-resistant cell lines clustered with their parental cell origin in the dendrogram representing the methylome profiles. This clustering represents the phenomenon “somatic memory” and is in accord with data from gene expression profiles from the cell lines . Thus, the somatic memory leads to clustering of the resistant and parental cells rather than clustering according to specific drug-resistance. Also, all cell lines merged in a big cluster separated from the clinical samples in unsupervised clustering, suggesting that the colorectal cancer cell lines might show similar features of DNA methylome, whereas sporadic clinical samples show high diversity between individual methylomes. In the cell line studies, analyses were performed after chemotherapy-induced resistance while in the clinical cancers the samples for analyses were obtained prior to any chemotherapy. This could also explain separation between the cell line samples and clinical samples in the unsupervised cluster analysis. An additional factor distinguishing the clinical samples from the cell lines is the clinical samples contain a mixture of cells including cancer cells, stromal cells and endothelial cells whereas the cell lines are much less heterogeneous.
There were a certain number of non-CpG cytosine (CHH and CHG) methylations in all colorectal cancer cell lines and sporadic colorectal cancer samples. Clustering based on non-CpG cytosine methylation data was largely consistent with that based on CpG cytosine methylation data (Additional file 1: Figure S1B and 1C). This suggests that both CpG methylation and non-CpG methylation reflect the general somatic memory of DNA methylation modification. From the identified DMCs, we selected the non-CpG cytosine loci (CHH and CHG) shared by both the cell line models and the 14 clinical samples with a methylation level of at least 50% or higher in every sample. There were totals of 19 and 29 cytosine loci identified, in CHG and CHH formats respectively. Among the CHG methylated cytosine loci, 12 loci were located in gene bodies (exon, intron, promoter and TSS (transcription start site)) and 7 loci in intergenic regions. Among the CHH methylated cytosine loci, 15 loci were located in coding genes, 1 locus in a microRNA gene, and 13 in intergenic regions. For the CHG and CHH loci located in intergenic regions, most of them were in repeat sequences. For example, there were 3 and 6 loci harbouring in Alu elements in CHG and CHH formats, respectively. Interestingly, 2 of these (out of 3) and 5 (out of 6), respectively, belonged to the Alu Y subfamily. The information concerning the identified CHG and CHH methylated loci is available in the Additional file 2: Table S1A and 1B, respectively.
The cytosine loci uniquely presented in the different drug-sensitive or drug-resistant phenotypes enrich Alu elements
We found that sets P, O and S are highly enriched in Alu elements, which accounted for 48.8%, 60.1% and 53.3% of all identified cytosine loci, respectively. Notably, many identified Alu elements belong to the youngest Alu subfamily – Alu Y – accounting for 32.1%, 35.8% and 34.3% in the identified Alu elements in the set P, O and S, respectively. Subsequently, we performed RRBS analysis for the 14 clinical samples, and then selected the cytosine loci commonly found in these clinical samples, defined as set C. We further selected the cytosine loci by only keeping the loci that were commonly found in both set C and the united set of P, O and S and defined a new set E, contains only the commonly shared cytosine loci. We identified 48,944 loci in set E. Information about these set E cytosine loci is available in the Additional file 4: Table S3. Notably, the percentage of Alu elements among the identified loci in set E was 53.4%, and the percentage of the Alu Y subfamily in all identified Alu elements in the set E was increased to 46.3% (Figure 2B). Subsequently, we performed DMCs analysis for the identified loci and found that the Alu Y subfamily accounted for 48.4% of the identified DMCs Alu loci.
To exclude the possibility that the enriched Alu Y sequences came from a possible bias of the RRBS technique, we performed an in-silico simulation. We used the same reference genome (GRCh37) that was used for RRBS mapping as the virtual test genome, subjecting it to MspI digestion and recovery of the resulting DNA from the gel selection by keeping the proper size of the digested DNA fragments in the in-silico stimulation. According to our experimental protocol of RRBS library generation, there was a maximum of about 28% sequences from Alu repeats in the in-silico stimulation, of which 39.5% belonged to the Alu Y subfamily. Furthermore, we compared the number of Alu Y loci in the data set P, O, S and the number of Alu Y loci in the simulated data set, which demonstrated that the Alu Y enrichment in the set of P, O, S was statistically significant (Fisher’s exact test, p-value < 2.2e-16).
In consideration of the potential variance of cutting sections of agarose gel from the smear of the digested genomic DNA, we applied a sliding window by moving a fixed size selection section from the simulated smear of the digested genomic DNA in both directions (towards smaller selection size or bigger selection size) and we also extended the selected section by extending additional 50 bp towards the bigger size or towards the smaller size. Neither sliding the fixed size selection section nor extending selection size showed significant variance in the amount of Alu sequences (Figure 2C). We could not further simulate the alignment because it is hard to estimate potential DNA methylation state for the simulated cytosines. Since Alus are repetitive sequences, alignment of simulated digested genomic DNA to the human reference genome could lead to further decrease in the proportion of Alu sequences. This is due to problems with low mapping quality, which results from repetitive sequences mapping to multiple locations in the human genome reference and/or the low priority for annotation (the priority order is exon, intron, promoter, intergenic region, and finally, repetitive).
Moreover, we calculated the percentage of the Alu subfamilies (Alu Y, Alu J and Alu S) in the selected Alu elements according to the use of sliding selection window and extending selection windows with different size in the simulations. The percentages of the Alu subfamilies were generally consistent in all simulations (Figure 2C). We also calculated the variance in the range of the percentages of Alu elements and the Alu Y subfamily in the above simulations and compared these variances to the variance of both the percentage of Alu elements and the Alu Y subfamily among all the RRBS data, including all colorectal cancer cell models and the 14 clinical colorectal cancer samples. Clearly, the variance of the Alu Y subfamily in all the RRBS samples was much higher than that in the simulations, which indicates that bias from RRBS technology cannot explain the observed Alu Y enrichment in the sets P, O and S. The information of variance in all the RRBS samples and variance in the simulations are available in Additional file 5: Table S4.
Alu sequences can be activated and propagate into new loci of the human genome triggered by genotoxic stress [5,33]. Especially, Alu Y sequences are the biggest active subfamily of Alu elements in the human genome. The most likely explanation of our observation in the cell lines is that Alu Y elements were reactivated and spread their copies in the genome when triggered by genotoxic stress of OxPt or SN38. In the RRBS library generation, MspI digested the genomic DNA of the drug-resistant cell line that carried many insertions of Alus, mainly Alu Y elements. The newly inserted Alus in the drug-resistant cell lines will change the constitution of digested genomic fragments. When the digested genomic DNA underwent gel selection, the portion of the digested DNA that can be finally selected by the selection section window (40-300 bp) will be changed. Consequently, the sequenced part of the genome in the RRBS libraries from the parental cell lines and the drug resistant cell lines will be different. Taken together, the results from RRBS might reflect Alu Y subfamily retrotransposition in the drug-resistant cell lines.
Identifying flanking sequence motif of Alu sequences
The identified cytosine loci in Set E highlight the high diversity of DNA methylation in SOX1 in the clinical samples
We identified 48,944 loci in set E, which were related to a total of 5,816 genes. Thus, many genes harbour more than one identified cytosine locus. For example, the TSPYL2 gene harbours 90 cytosine loci, ranking first among the identified total of 5,816 genes. This gene encodes a testis specific protein, Y-encoded-like 2 (TSPYL2), which is a nucleosome assembly protein and plays a role in chromatin remodelling to determine gene expression, cell proliferation, and terminal differentiation . Notably, SOX1 harbours 35 of the identified cytosine loci, ranking fifteenth among the 5,816 genes. In addition to SOX1, other SOX gene members including SOX2, SOX3, SOX4, SOX6, SOX8, SOX11, SOX14, SOX18, SOX21 and SOX30 have been identified to harbour at least one identified cytosine locus. Thus there was a clear enrichment of the SOX gene family in this data set. The gene list and the number of harboured loci are available in Additional file 6: Table S5.
We extracted the DNA methylation information of the 14 clinical samples on the basis of the cytosine loci in set E. Furthermore, we estimated the diversity of DNA methylation states across the 14 clinical samples by measuring the DNA methylation entropy using QDMR . Interestingly, many cytosine loci located in the SOX1 gene region showed high diversity of DNA methylation states across all the 14 clinical samples (Figure 6A and B). The cytosine loci harboured in the TSPYL2 gene and their DNA methylation level and entropy are shown in Additional file 7: Figure S2A and B.
Identifying differentially methylated cytosines (DMCs) commonly presenting in both the parental cell lines and the OxPt-resistant cell lines
In addition to the analysis of the uniquely presenting loci in the parental cell lines (set P) or in drug-resistant cell lines (sets O and S), we extracted the loci that were commonly presented in the OxPt-resistant cell lines and their parental cell lines. DNA methylation entropy analysis  was applied to identify the differentially methylated cytosines that presented high diversity of DNA methylation state across all the cell line samples. There were 1,089,634, 2,105,795 and 726,658 cytosine loci identified as OxPt-resistant phenotype associated methylation loci in the context of CpG, CHH and CHG, respectively. We hypothesized that colorectal cancer cells either from in vitro samples (the cell line models) or from in vivo samples (the clinical samples) share common epigenetic alterations, which are epigenetic changes responsible for the development of an OxPt-resistant phenotype. Thus, we transferred the extracted OxPt-resistant phenotype associated methylation loci from the analysis of the cell line models to the clinical samples.
Because the methylomes of the patient samples represented the DNA methylation profiles prior to drug treatment, we tested whether the identified ‘OxPt resistant phenotype associated methylation loci’ in the cell line models could classify the patients into good or poor outcome groups correctly.
Briefly, we extracted the ‘OxPt resistant phenotype-associated methylation loci’ from the 14 RRBS clinical samples. Subsequently, we performed unsupervised clustering analysis based on the extracted loci to see whether the identified loci in the cell line models could group the patients according to good or poor outcome to treatment (complete response (CR) and partial response (PR) versus no change (NC) and progressive disease (PD)). However, we did not obtain a clearly distinguishable grouping according to good and poor phenotypes (data not shown).
In the next step, we performed a prediction analysis for individual patients to see whether the identified ‘OxPt resistant phenotype associated methylation loci’ could correctly predict the outcome for each patient. We used the DNA methylation information of the identified loci extracted from all the 14 patients as a training data set to select key features and then build a predictor using K-Nearest Neighbour (KNN) . Then, we performed a leave-one-out validation to estimate the accuracy of the prediction. For the ‘OxPt resistant phenotype associated loci’ in the context of CpG, we got an accuracy of 35.7% for the clinical samples to be predicted as good or poor outcome group correctly (Fisher exact test, p-value = 0.59). The OxPt resistant phenotype associated CHH and CHG loci showed 35.7% (p-value = 0.3) and 64.3% (p-value = 0.5804) accuracies, respectively. These results show that it is hard to precisely predict the outcome for individual patients simply based on the DNA methylation state in certain regions identified by the limited number of cell line models. This suggests that DNA methylomes of sporadic clinical samples may show a large diversity in epigenetic reprogramming during the development of drug resistance. The high variability of inter-individual epigenomic profiles poses a big challenge for the selection of useful epigenetic markers for clinical practice.
Chemotherapeutic agents triggering genotoxic stress
Irinotecan is activated by hydrolysis to SN38 which is a topoisomerase I inhibitor . Inhibition of topoisomerase I by SN38 can result in repression of both DNA replication and transcription . Oxaliplatin is a platinum-based chemotherapeutic agent , which exerts its effects by interfering with the DNA replication and transcription machinery through nuclear DNA adduct formation . In the clinic, oxaliplatin’s efficacy depends on combined use with 5-fluorouracil (5-FU) . Capecitabine is a prodrug, that is enzymatically converted to 5-fluorouracil , which inhibits the production of nucleotide thymidine by inhibiting the enzyme thymidylate synthase . These chemotherapeutic agents are able to kill the bulk of cancer cells by introducing stress. However, in some cases, stress also can reactivate retrotransposition in somatic cells. For instance, Hagan et al., reported that Alu retrotransposition can be induced by exposure to a variety of genotoxic stressors including the topoisomerase II inhibitor etoposide . In addition, non-genotoxic stress such as hypoxia, contributing to the cancer phenotype including drug resistance and genomic instability, can increase transcription of SINEs (mainly Alu elements) and LINEs by global demethylation . Through the analysis of RRBS data for the cell line models, we found the enrichment of Alu sequences, especially the Alu Y subfamily, in the SN38- and oxaliplatin-resistant cell lines, which provides evidence of reactivation of Alu retrotransposition during the development of drug resistance in colorectal cancer cells. This finding sheds light on the potential role of mobility of Alu elements in colorectal cancer chemotherapeutic resistance by presenting a genomic response to environmental stress.
At the molecular level, cancers are complex diseases attributed to the accumulation of multiple risk factors, from genetic predisposition to environmental factors such as diet, lifestyle and exposure to toxic compounds [44,45]. Epidemiological studies suggest that the environment influences cancer aetiology far more decisively than genetics in many types of cancers [44,45]. DNA methylation, as an important and long-term stable epigenetic mechanism, defines cell fate by maintaining gene expression patterns and stabilizing genetic mobile elements. During development, germ line cells and embryonic stem cells show high cell fate dynamics and activity of mobile genetic elements. Accordingly, DNA methylation also shows dynamic change. In somatic cells, cell fate shows a stable differentiated state and mobile genetic elements present in silent states, partly due to DNA methylation locks. However, DNA methylation, as a reversible chemical modification of DNA sequences can also be changed according to environmental changes involving endogenous or exogenous (bio)chemical molecules. DNA methylation changes could lead to instability of cell fate and reactivation of retrotransposons. At the cell level, somatic cells can become dedifferentiated and heterogeneous, through reshuffling of the genome and remodelling of the epigenome by reactivation of retrotransposons. Through the analysis of DNA methylomes from 421 individuals, ranging in age from 14 to 94 year old, Johansson et al., recently demonstrated that aging at least affects DNA methylation of 29% of investigated sites, of which 60.5% are hypomethylated and 39.6% are hypermethylated . Notably, they also found that a higher fraction of sites in repetitive regions is not affected by the process of aging . This observation suggests that reactivation of Alu retrotransposition presented in our study is not a passive outcome of aging but reflecting a response from mobile genetic elements to environmental stress in line with the finding that the expression of Alu RNAs is shown to increase in response to cellular stress, viral and translational inhibition . Of particular interest is that many prior observations support a correlation between alterations of DNA methylation of retrotransposons and colorectal cancers [20,21,47,48].
Correlation between Alu retrotransposition and cell stemness
An increasing body of evidence indicates that integration of L1 and Alu elements occurs in germ cells or during early embryonic development . Furthermore, it is reported that the most expressed Alu elements are enriched for the youngest subfamily Y in hESCs , which is also in agreement with their recent evolutionary amplification in humans . Additionally, non-CpG methylation has been reported to occur in an asymmetric, strand-specific manner in SINEs and LINEs in hESCs and iPSCs, which is a characteristic property of pluripotent cells .
The SOX1 and other SOX gene family members being representative of stemness-related genes were identified loci in set E. Furthermore, high diversity of DNA methylation states of SOX1 and other SOX genes presented in the clinical samples suggests that the cancer cells from different patients are variably dedifferentiated. Notably, SOX1, SOX2 and SOX3 compose the SOXB1 gene subfamily, which shares more than 90% amino acid identity with respect to the DNA binding high-mobility group (HMG) box (a key characteristic sequence feature for defining SOX gene family) and also a high degree of sequence similarity outside the HMG box . Moreover, SOX1 or SOX3 can substitute for SOX2 to produce iPS cells . The SOXB1 genes are frequently co-expressed in development and exhibit high biological redundancy. In our previous studies, SOX2 and other SOX gene family members were implicated in the development of resistance to the anti-cancer drug tamoxifen . Moreover, the tamoxifen-resistant breast cancer cell lines shared some common features with iPSCs (Li et al., in preparation). By sequencing small cell lung cancers, Rudin et al., reported that a considerable portion of mutations occurring in SOX2 and other SOX genes indicated a correlation between lung cancers and cell stemness . Our observations in the colorectal cancers and in the tamoxifen-resistant breast cell line models (; Li et al., in preparation) also suggest that the SOX gene family might play an important role in tumorigenesis and drug resistance.
Alu retrotransposition increases uncertainty for cancer progression
In the present study, we initially hypothesized that development of drug resistance in colorectal cancer follows a linear step-wise progressive model. In this hypothesis, we assume that all colorectal cancer cells undergo a common path to develop drug resistance, thereby presenting recurrent landmarks of epigenetic alterations. Based on this hypothesis, we should be able to use the selected ‘OxPt resistant phenotype-associated methylation loci’ from the cell line models to predict the outcome of response to oxaliplatin for the clinical samples based on the information of DNA methylation from the primary tumors. However, we could not find such statistically significant predictors to precisely predict the outcome of treatment for the patients. An alternative hypothesis is that drug resistance may follow a non-linear model, in which changes in genomes, epigenomes and cell fate could happen as part of the same mechanism, i.e., retrotransposition, but individual patients could show a diversity of reshaped genome and epigenome and high dynamics of cell fate states because of their different initial conditions and potential stochastic events during the development of drug resistance.
Alus act as endogenous genomic parasites using a ‘copy-and-paste’ mechanism to spread their copies to new locations in the human genome, which can bring three main biological consequences: reshuffling genomes, remodelling epigenomes and reprogramming cell fates. All of these will contribute to heterogeneity of cancers, posing a big challenge for cancer therapy. A given chemotherapy can kill many, even most, of the cancer cells. On the other hand, cancer chemotherapy using certain molecules targeting a given single target or pathway might lose some of their effectiveness if the cancer cells have acquired increased genetic diversity and changed cell fate. In the clinic, one failed therapeutic protocol will be replaced by another one with new chemotherapeutic agent(s). This strategy usually is effective at the beginning, because a new environmental stress is introduced to the cancer cells. However, retrotransposition might act as a genomic response to environmental stress again and eventually lead to resistance to the second treatment as well. If a tumor can be detected at a very early phase, the number of malignant cells is still limited. The probability of development of fitness phenotypes by retrotransposition from the limited number of cells is lower than that from large number of cells in a late phase tumor during treatment. This non-linear model thus fits the clinical notion that ‘earlier detection leads to better outcome’. Additionally, this model also fits the McClintock doctrine, that increases in mobile genetic element transcription that are caused by environmental stress lead to higher levels of mobile genetic element integration and these insertions have an impact on host phenotypes and/or survival [5,33].
Consequently, our study underscores the uniqueness of individual cancers, dynamic tracking of cancer progression and new therapy strategies targeting the entire cell system.
SN38- and OxPt- resistant cell line models
Cell culture and generation of drug resistant cell lines
The cell lines HCT116 and HT29 were obtained from the NCI/Development Therapeutics Program, while LoVo was obtained from the American Tissue Culture Collection. Cells were maintained at 37°C, 5% CO2 in RPMI 1640 + Glutamax growth medium (Invitrogen, Naerum, Denmark) supplemented with 10% foetal calf serum (Invitrogen). Oxaliplatin or SN-38 resistant cell lines were generated in our laboratory over a period of 8-10 months by continuous exposure to gradually increasing concentrations of drug . The cell lines were passaged three times at each drug concentration and cell vials were frozen at each increase in drug concentration. Prior to subsequent experiments, the cells were maintained in drug-free growth medium for at least 1 week. The strategy to establish the three drug resistant cell line models is presented in Additional file 8: Figure S3.
The cell line identity of parental and resistant cell lines were confirmed using a short tandem repeat DNA analysis (IdentiCell – Cell Line Authentication Service, Aarhus University Hospital, Aarhus, Denmark). In addition, all cell lines were confirmed to be mycoplasma-free (Mycoplasma PCR Detection Kit, Minerva Biolabs, Berlin, Germany).
Oxaliplatin (Eloxatin, 5 mg/ml, Sanofi-Aventis, Paris, France) was stored at 4°C protected from light. SN-38 (Sigma-Aldrich, Copenhagen, Denmark) was dissolved in dimethyl sulfoxide (DMSO) at a concentration of 10 mM and stored at -20°C. Drugs were diluted in growth medium immediately prior to use.
The clinical colorectal cancer samples
The timeline of the 14 patients under medical care is shown in Additional file 9: Figure S4. Fresh frozen tumor samples were obtained from a previously published cohort , and were collected prior to any chemotherapy. The clinicopathological information from the 14 colorectal cancer patients has been displayed in Additional file 10: Table S6. According to outcome of the therapy (CR and PR versus NC and PD), the 14 patients were divided into a ‘benefited’ and a ‘not benefited’ group (Additional file 10: Table S6). By histological examination, the percentage of tumor cells were evaluated to account for more than 70% (except MOMA5 and MOMA22 with 60% tumor cells) of the cells in each sample. The genomic DNA was isolated from the samples and passed the quality control for construction of RRBS libraries. Written informed consent was obtained from all patients and was approved by The Regional Ethics Committee (DK: 1999/4678).
RRBS library generation and sequencing
RRBS was performed as previously described . Briefly, 5 μg genome DNA from the cell line models and the clinical samples was digested by restriction enzyme, MspI (New England BioLabs) over night at 37°C and QIAGEN Mini Purification kit was used to purify the digested products. End repair was performed, adding A and adaptors in which the cytosines in the paired end adaptor sequence were methylated. The ligated product was subjected to size selection in 2% agarose gel (Bio-RAD) at 100 V for 2 hours. Agarose gel bands with the inserted genomic DNA size 40 ~ 110 bp and the inserted genomic DNA size 110 ~ 220 bp were excised, so that two libraries were generated from each of the MOMA3, MOMA4, MOMA5, MOMA7, MOMA8 and MOMA9 samples (one consisting of 40 ~ 110 bp target sequences and the other of 110 ~ 220 bp target sequences). The rest of the clinical samples and all cell line samples were generated with a single library with inserted DNA fragments of 40 ~ 300 bp length. The DNA from the excised gel pieces was recovered with the QIAGEN Gel Extraction Purification Kit, followed by bisulfite treatment using ZYMO EZ DNA Methylation-Gold kit. The resulting converted DNA was amplified by PCR and purified. The RRBS libraries were subjected to paired-end 50 nt sequencing with HiSeq 2000 (Illumina).
The adaptor sequences were filtered out before the subsequent analysis and the resulting reads were aligned using Bismark software . Only uniquely mapped reads, which had the restriction enzyme cutting site at the 5’ end were used in the subsequent analysis. The sequencing depth and the percentages of methylated cytosines/total investigated cytosines for each C location were calculated. The genomic annotation information was based on the hg19 human genome (http://genome.ucsc.edu). Differentially methylated regions (DMRs) were identified by quantitative differentially methylated regions (QDMR) . The QDMR is a quantitative approach to quantify methylation difference and identify DMRs from genome-wide methylation profiles with a concept of ‘DNA methylation entropy’ . The ‘DNA methylation entropy’ adapting Shannon entropy was used to estimate diversity (or variety) of DNA methylation states for a given locus across samples . We applied WebLogo 3.3  to extract the flanking sequence (up- and down- stream 20 bp) motifs for the investigated genomic sequences. We applied GeneCluster 2.0  to perform the supervised cluster analysis and prediction analysis.
Availability of supporting data
The data set supporting the results of this article is available in the NCBI Gene Expression Omnibus database accession number (for the raw data and metadata of RRBS for the three colorectal cancer drug-resistant cell line models and the 14 sporadic clinical colorectal cancer samples) is GSE56269. http://www.ncbi.nlm.nih.gov/gds/?term=GSE56269.
This work was supported by the Danish Council for Strategic Research (09-065177/DSF), the Danish Cancer Society (R72-A-4566-B214 and R20-A-1087-B214), the Vigo and Kathrine Skovgaard Foundation, Sawmill-owner Jeppe Juul and Wife Foundation, Director Ib Henriksens Foundation, the John and Birthe Meyer Foundation, and the IMK Foundation.
- GLOBOCAN 2012: Estimated cancer incidence, mortality and prevalence worldwide in 2012. http://globocan.iarc.fr/Pages/fact_sheets_population.aspx.
- Van Cutsem E, Kohne CH, Hitre E, Zaluski J, Chang Chien CR, Makhson A, et al. Cetuximab and chemotherapy as initial treatment for metastatic colorectal cancer. N Engl J Med. 2009;360:1408–17.View ArticlePubMedGoogle Scholar
- Goldberg RM, Sargent DJ, Morton RF, Fuchs CS, Ramanathan RK, Williamson SK, et al. A randomized controlled trial of fluorouracil plus leucovorin, irinotecan, and oxaliplatin combinations in patients with previously untreated metastatic colorectal cancer. J Clin Oncol. 2004;22:23–30.View ArticlePubMedGoogle Scholar
- Cassidy J, Tabernero J, Twelves C, Brunet R, Butts C, Conroy T, et al. XELOX (capecitabine plus oxaliplatin): active first-line therapy for patients with metastatic colorectal cancer. J Clin Oncol. 2004;22:2084–91.View ArticlePubMedGoogle Scholar
- Levin HL, Moran JV. Dynamic interactions between transposable elements and their hosts. Nat Rev Genet. 2011;12:615–27.View ArticlePubMed CentralPubMedGoogle Scholar
- McClintock B. The origin and behavior of mutable loci in maize. Proc Natl Acad Sci U S A. 1950;36:344–55.View ArticlePubMed CentralPubMedGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921.View ArticlePubMedGoogle Scholar
- Batzer MA, Deininger PL. Alu repeats and human genomic diversity. Nat Rev Genet. 2002;3:370–9.View ArticlePubMedGoogle Scholar
- Grover D, Mukerji M, Bhatnagar P, Kannan K, Brahmachari SK. Alu repeat analysis in the complete human genome: trends and variations with respect to genomic composition. Bioinformatics. 2004;20:813–7.View ArticlePubMedGoogle Scholar
- Moreno PA, Velez PE, Martinez E, Garreta LE, Diaz N, Amador S, et al. The human genome: a multifractal analysis. BMC Genomics. 2011;12:506.View ArticlePubMed CentralPubMedGoogle Scholar
- Jurka J, Pethiyagoda C. Simple repetitive DNA sequences from primates: compilation and analysis. J Mol Evol. 1995;40:120–6.View ArticlePubMedGoogle Scholar
- Kang MI, Rhyu MG, Kim YH, Jung YC, Hong SJ, Cho CS, et al. The length of CpG islands is associated with the distribution of Alu and L1 retroelements. Genomics. 2006;87:580–90.View ArticlePubMedGoogle Scholar
- Brouha B, Schustak J, Badge RM, Lutz-Prigge S, Farley AH, Moran JV, et al. Hot L1s account for the bulk of retrotransposition in the human population. Proc Natl Acad Sci U S A. 2003;100:5280–5.View ArticlePubMed CentralPubMedGoogle Scholar
- Bennett EA, Keller H, Mills RE, Schmidt S, Moran JV, Weichenrieder O, et al. Active Alu retrotransposons in the human genome. Genome Res. 2008;18:1875–83.View ArticlePubMed CentralPubMedGoogle Scholar
- Maksakova IA, Mager DL, Reiss D. Keeping active endogenous retroviral-like elements in check: the epigenetic perspective. Cell Mol Life Sci. 2008;65:3329–47.View ArticlePubMedGoogle Scholar
- Slotkin RK, Martienssen R. Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet. 2007;8:272–85.View ArticlePubMedGoogle Scholar
- Yang N, Kazazian Jr HH. L1 retrotransposition is suppressed by endogenously encoded small interfering RNAs in human cultured cells. Nat Struct Mol Biol. 2006;13:763–71.View ArticlePubMedGoogle Scholar
- Chiu YL, Greene WC. The APOBEC3 cytidine deaminases: an innate defensive network opposing exogenous retroviruses and endogenous retroelements. Annu Rev Immunol. 2008;26:317–53.View ArticlePubMedGoogle Scholar
- Hagan CR, Sheffield RF, Rudin CM. Human Alu element retrotransposition induced by genotoxic stress. Nat Genet. 2003;35:219–20.View ArticlePubMedGoogle Scholar
- Rodriguez J, Vives L, Jorda M, Morales C, Munoz M, Vendrell E, et al. Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells. Nucleic Acids Res. 2008;36:770–84.View ArticlePubMed CentralPubMedGoogle Scholar
- Ogino S, Nosho K, Kirkner GJ, Kawasaki T, Chan AT, Schernhammer ES, et al. A cohort study of tumoral LINE-1 hypomethylation and prognosis in colon cancer. J Natl Cancer Inst. 2008;100:1734–8.View ArticlePubMed CentralPubMedGoogle Scholar
- Lee E, Iskow R, Yang L, Gokcumen O, Haseley P, Luquette 3rd LJ, et al. Landscape of somatic retrotransposition in human cancers. Science. 2012;337:967–71.View ArticlePubMed CentralPubMedGoogle Scholar
- Wagstaff BJ, Kroutter EN, Derbes RS, Belancio VP, Roy-Engel AM. Molecular reconstruction of extinct LINE-1 elements and their interaction with nonautonomous elements. Mol Biol Evol. 2013;30:88–99.View ArticlePubMed CentralPubMedGoogle Scholar
- Mills RE, Bennett EA, Iskow RC, Devine SE. Which transposable elements are active in the human genome? Trends Genet. 2007;23:183–91.View ArticlePubMedGoogle Scholar
- Iskow RC, McCabe MT, Mills RE, Torene S, Pittard WS, Neuwald AF, et al. Natural mutagenesis of human genomes by endogenous retrotransposons. Cell. 2010;141:1253–61.View ArticlePubMed CentralPubMedGoogle Scholar
- Ade C, Roy-Engel AM, Deininger PL. Alu elements: an intrinsic source of human genome instability. Curr Opin Virol. 2013;3:639–45.View ArticlePubMed CentralPubMedGoogle Scholar
- Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nat Rev Genet. 2009;10:691–703.View ArticlePubMed CentralPubMedGoogle Scholar
- Estecio MR, Gallegos J, Dekmezian M, Lu Y, Liang S, Issa JP. SINE retrotransposons cause epigenetic reprogramming of adjacent gene promoters. Mol Cancer Res. 2012;10:1332–42.View ArticlePubMed CentralPubMedGoogle Scholar
- Germanguz I, Shtrichman R, Osenberg S, Ziskind A, Novak A, Domev H, et al. A-to-I RNA editing of Alu sequences is involved in the regulation of pluripotency induction and maintenance. Stem Cells Dev. 2014 Mar 1;23(5):443-56. doi:10.1089/scd.2013.0206. Epub 2013 Dec 14.Google Scholar
- Jensen NF, Stenvang J, Beck MK, Hanáková B, Belling KC, Do KN, et al. Establishment and characterization of models of chemotherapy resistance in colorectal cancer: Towards a predictive signature of chemoresistance. Mol Oncol. 2015 Feb 24. doi:10.1016/j.molonc.2015.02.008. [Epub ahead of print] PMID: 25759163.Google Scholar
- Zhang Y, Liu H, Lv J, Xiao X, Zhu J, Liu X, et al. QDMR: a quantitative method for identification of differentially methylated regions by entropy. Nucleic Acids Res. 2011;39:e58.View ArticlePubMed CentralPubMedGoogle Scholar
- Shannon CE. The mathematical theory of communication. 1963. MD Comput. 1997;14:306–17.PubMedGoogle Scholar
- McClintock B. The significance of responses of the genome to challenge. Science. 1984;226:792–801.View ArticlePubMedGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–90.View ArticlePubMed CentralPubMedGoogle Scholar
- Tao KP, Fong SW, Lu Z, Ching YP, Chan KW, Chan SY. TSPYL2 is important for G1 checkpoint maintenance upon DNA damage. PLoS One. 2011;6:e21602.View ArticlePubMed CentralPubMedGoogle Scholar
- Reich M, Ohm K, Angelo M, Tamayo P, Mesirov JP. GeneCluster 2.0: an advanced toolset for bioarray analysis. Bioinformatics. 2004;20:1797–8.View ArticlePubMedGoogle Scholar
- Hsiang YH, Liu LF. Identification of mammalian DNA topoisomerase I as an intracellular target of the anticancer drug camptothecin. Cancer Res. 1988;48:1722–6.PubMedGoogle Scholar
- Raymond E, Faivre S, Woynarowski JM, Chaney SG. Oxaliplatin: mechanism of action and antineoplastic activity. Semin Oncol. 1998;25:4–12.PubMedGoogle Scholar
- Woynarowski JM, Chapman WG, Napier C, Herzig MC, Juniewicz P. Sequence- and region-specificity of oxaliplatin adducts in naked and cellular DNA. Mol Pharmacol. 1998;54:770–7.PubMedGoogle Scholar
- Alian OM, Azmi AS, Mohammad RM. Network insights on oxaliplatin anti-cancer mechanisms. Clin Transl Med. 2012;1:26.View ArticlePubMed CentralPubMedGoogle Scholar
- Tabata T, Katoh M, Tokudome S, Hosakawa M, Chiba K, Nakajima M, et al. Bioactivation of capecitabine in human liver: involvement of the cytosolic enzyme on 5’-deoxy-5-fluorocytidine formation. Drug Metab Dispos. 2004;32:762–7.View ArticlePubMedGoogle Scholar
- Longley DB, Harkin DP, Johnston PG. 5-fluorouracil: mechanisms of action and clinical strategies. Nat Rev Cancer. 2003;3:330–8.View ArticlePubMedGoogle Scholar
- Pal A, Srivastava T, Sharma MK, Mehndiratta M, Das P, Sinha S, et al. Aberrant methylation and associated transcriptional mobilization of Alu elements contributes to genomic instability in hypoxia. J Cell Mol Med. 2010;14:2646–54.View ArticlePubMed CentralPubMedGoogle Scholar
- Lichtenstein P, Holm NV, Verkasalo PK, Iliadou A, Kaprio J, Koskenvuo M, et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland. N Engl J Med. 2000;343:78–85.View ArticlePubMedGoogle Scholar
- Sorensen TI, Nielsen GG, Andersen PK, Teasdale TW. Genetic and environmental influences on premature death in adult adoptees. N Engl J Med. 1988;318:727–32.View ArticlePubMedGoogle Scholar
- Johansson A, Enroth S, Gyllensten U. Continuous Aging of the Human DNA Methylome Throughout the Human Lifespan. PLoS One. 2013;8:e67378.View ArticlePubMed CentralPubMedGoogle Scholar
- Baba Y, Huttenhower C, Nosho K, Tanaka N, Shima K, Hazra A, et al. Epigenomic diversity of colorectal cancer indicated by LINE-1 methylation in a database of 869 tumors. Mol Cancer. 2010;9:125.View ArticlePubMed CentralPubMedGoogle Scholar
- Ogino S, Kawasaki T, Nosho K, Ohnishi M, Suemoto Y, Kirkner GJ, et al. LINE-1 hypomethylation is inversely associated with microsatellite instability and CpG island methylator phenotype in colorectal cancer. Int J Cancer. 2008;122:2767–73.View ArticlePubMed CentralPubMedGoogle Scholar
- Macia A, Munoz-Lopez M, Cortes JL, Hastings RK, Morell S, Lucena-Aguilar G, et al. Epigenetic control of retrotransposon expression in human embryonic stem cells. Mol Cell Biol. 2011;31:300–16.View ArticlePubMed CentralPubMedGoogle Scholar
- Guo W, Chung WY, Qian M, Pellegrini M, Zhang MQ. Characterizing the strand-specific distribution of non-CpG methylation in human pluripotent cells. Nucleic Acids Res. 2014 Mar;42(5):3009-16. doi:10.1093/nar/gkt1306. Epub 2013 Dec 16.Google Scholar
- Miyagi S, Kato H, Okuda A. Role of SoxB1 transcription factors in development. Cell Mol Life Sci. 2009;66:3675–84.View ArticlePubMedGoogle Scholar
- Lin X, Li J, Yin G, Zhao Q, Elias D, Lykkesfeldt AE, et al. Integrative analyses of gene expression and DNA methylation profiles in breast cancer cell line models of tamoxifen-resistance indicate a potential role of cells with stem-like properties. Breast Cancer Res. 2013;15:R119.View ArticlePubMed CentralPubMedGoogle Scholar
- Rudin CM, Durinck S, Stawiski EW, Poirier JT, Modrusan Z, Shames DS, et al. Comprehensive genomic analysis identifies SOX2 as a frequently amplified gene in small-cell lung cancer. Nat Genet. 2012;44:1111–6.View ArticlePubMed CentralPubMedGoogle Scholar
- Xu Y, Hu B, Choi AJ, Gopalan B, Lee BH, Kalady MF, et al. Unique DNA methylome profiles in CpG island methylator phenotype colon cancers. Genome Res. 2012;22:283–91.View ArticlePubMed CentralPubMedGoogle Scholar
- Weisenberger DJ, Siegmund KD, Campan M, Young J, Long TI, Faasse MA, et al. CpG island methylator phenotype underlies sporadic microsatellite instability and is tightly associated with BRAF mutation in colorectal cancer. Nat Genet. 2006;38:787–93.View ArticlePubMedGoogle Scholar
- Rasmussen MH, Jensen NF, Tarpgaard LS, Qvortrup C, Romer MU, Stenvang J, et al. High expression of microRNA-625-3p is associated with poor response to first-line oxaliplatin based treatment of metastatic colorectal cancer. Mol Oncol. 2013;7:637–46.View ArticlePubMedGoogle Scholar
- Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011;27:1571–2.View ArticlePubMed CentralPubMedGoogle Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.