Retrotransposon-centered analysis of piRNA targeting shows a shift from active to passive retrotransposon transcription in developing mouse testes
© Mourier; licensee BioMed Central Ltd. 2011
Received: 15 April 2011
Accepted: 1 September 2011
Published: 1 September 2011
Piwi-associated RNAs (piRNAs) bind transcripts from retrotransposable elements (RTE) in mouse germline cells and seemingly act as guides for genomic methylation, thereby repressing the activity of RTEs. It is currently unknown if and how Piwi proteins distinguish RTE transcripts from other cellular RNAs. During germline development, the main target of piRNAs switch between different types of RTEs. Using the piRNA targeting of RTEs as an indicator of RTE activity, and considering the entire population of genomic RTE loci along with their age and location, this study aims at further elucidating the dynamics of RTE activity during mouse germline development.
Due to the inherent sequence redundancy between RTE loci, assigning piRNA targeting to specific loci is problematic. This limits the analysis, although certain features of piRNA targeting of RTE loci are apparent. As expected, young RTEs display a much higher level of piRNA targeting than old RTEs. Further, irrespective of age, RTE loci near protein-coding coding genes are targeted to a greater extent than RTE loci far from genes. During development, a shift in piRNA targeting is observed, with a clear increase in the relative piRNA targeting of RTEs residing within boundaries of protein-coding gene transcripts.
Reanalyzing published piRNA sequences and taking into account the features of individual RTE loci provide novel insight into the activity of RTEs during development. The obtained results are consistent with some degree of proportionality between what transcripts become substrates for Piwi protein complexes and the level by which the transcripts are present in the cell. A transition from active transcription of RTEs to passive co-transcription of RTE sequences residing within protein-coding transcripts appears to take place in postnatal development. Hence, the previously reported increase in piRNA targeting of SINEs in postnatal testis development does not necessitate widespread active transcription of SINEs, but may simply be explained by the prevalence of SINEs residing in introns.
Retrotransposable elements (RTE) constitute a significant proportion of mammalian genomes. The RTEs proliferate through an RNA stage that is subsequently reverse transcribed back to genomic DNA . The high level of divergence in RTE insertions between closely related organisms [2–5] and the link between RTE insertions and diseases [6–8] witness the ongoing activity of RTEs in mammalian genomes. Several genomic mechanisms are devised to minimize the proliferation of RTEs acting both at pre- and post-transcriptional levels [9–11].
Mouse retrotransposable elements
Around 40 percent of the mouse genome consists of RTE sequence, slightly lower than observed for the human genome, although this presumably is a result of the higher substitution rate in mouse, limiting the identification of old RTE sequence [12, 13]. RTEs are divided according to the presence or absence of long terminal repeats (LTR). Mammalian LTR elements consist mainly of endogenous retroviruses (ERV) that at some point during evolution have been inserted in the germline and fixed. Although the amount of sequence occupied by LTR elements is comparable between human and mouse, the level of de novo mutations caused by LTR element activity is extensively higher in mouse than in human [8, 14]. The most abundant ERV class in the mouse genome (~5.5%) is the Class III ERVs, which in the RepeatMasker  annotation - upon which this study is based - is broadly divided in two groups, the ERVL and MaLR elements. The latter is a non-autonomous transposon, meaning that the elements do not encode the enzymatic machinery required for its own transposition. The Class II ERVs (~4% of the genome), annotated as ERVK in RepeatMasker, is believed to be younger than Class III ERVs  and consists of a broad range of clades, including the IAP elements (Intracisternal A-type Particles). Class I ERVs (ERV1 in RepeatMasker) cover less than 1% of the mouse genome. Through ectopic recombination between the flanking LTR sequences, solitary LTR sequences may be formed. In RepeatMasker, terminal LTR sequences and the internal sequences (residing between the terminal LTRs in a complete LTR retrotransposon) are annotated independently. Although the terminal and internal sequences may in many cases be determined to form a single LTR retrotransposon, for simplicity, the two annotations (termed 'LTRter' and 'LTRint', respectively) are analysed independently in this study.
Non-LTR retrotransposons are divided into LINEs (Long INterspersed Elements) that are autonomous, and SINEs (Short INterspersed Elements) that are non-autonomous. LINEs occupy roughly 20% of the mouse genome. The majority of mouse LINE elements belong to the L1 superfamily, which contains sub-families that are still active [17–19]. Despite the comparable levels of genome occupied by LINE sequences in human and mouse [12, 13], there are more than 15 times as many full-length L1 elements with intact open reading frames in the mouse genome . Almost 1.5 million SINE elements are present in the mouse genome, making up approximately 8% of the total genome size. Unlike the human genome where a single SINE, the Alus, is dominating , the mouse genome harbours two successful superfamilies of SINEs, Alu and B2 that are present in equal numbers . The evolutionary histories of the mouse SINEs are truly different; Alus are derived from a 7SL RNA, whereas B2s evolved from a tRNA sequence [21, 22].
Piwi proteins and small RNAs
Piwi-associated RNAs (piRNAs) are small (24-30 nucleotides long) RNAs that bind Piwi proteins of the Argonaute family [23, 24]. The mouse genome encodes 3 Piwi proteins, MILI, MIWI and MIWI2 that all binds piRNAs in the male germline [25, 26]. Initially, piRNAs from adult mouse testis were found to contain less RTE sequence than would be expected from the genomic content of RTEs, suggesting that piRNAs were not specifically targeting RTEs [27, 28]. However, a later study on piRNAs from an earlier (pre-pachytene) stage showed a high content of RTE sequence in piRNAs . Further evidence for the involvement of mouse piRNAs in controlling RTE activity came with the finding that knockout of Mili and Miwi2 resulted in reduced piRNA levels and increased RTE transcription [29, 30]. Knockout mice further showed decreased DNA methylation levels at RTE loci [31, 32]. As the temporal expression of Piwi proteins in developing mouse testis coincides with the resetting of genomic methylation , it is hypothesised that piRNAs act as guides for the methylation machinery [29, 31, 32].
By analysing the piRNAs bound to MIWI2 and MILI, Aravin and colleagues  suggested the following scenario: In prenatal development (16.5 days postcoitum, dpc), transcripts from full-length active RTEs are the main substrates for piRNAs that primarily associate with MILI (and to a lesser extent to MIWI2). Available transcripts containing antisense RTE sequence bind this complex and antisense RTE piRNAs are formed which in turn associate primarily with MIWI2 (and MILI, respectively). Both complexes may bind complementary RTE transcripts, entering the so-called ping-pong amplification cycle of piRNAs, in which Piwi-bound piRNAs pair with complementary transcripts that are subsequently cut into new piRNAs having a 10 nucleotide overlap with the template piRNAs [31, 34]. In prenatal development, piRNAs are primarily targeting L1 and IAP RTEs, for which activity has been reported at this stage [35, 36]. In postnatal development (10 days postpartum, dpp) MIWI2 is no longer detectable, whereas MILI is present throughout germline development [31, 37, 38]. The overall level of piRNA targeting of RTE sequences drops at 10 dpp, but interestingly, a relative increase in the piRNAs targeting B1 SINEs (members of the Alu superfamily) was observed .
This raises two fundamental questions. Firstly, do Piwi proteins discriminate between transcripts and how is RTE sequences then distinguished from other transcripts? The finding of piRNAs targeting supported a scenario with limited discrimination . Secondly, what lies behind the apparent shift in RTEs being targeted by piRNAs during development in male mouse germline? By analysing to extent to which genomic RTE loci are targeted by piRNAs in developing mouse testes, the present study aims at assessing the transcriptional dynamics of RTE during development, and consider the relationship between RTE activity and piRNA generation further.
The data for such analysis should meet a range of criteria. Although numerous mouse RNA libraries are available, only a small subset is derived from wild-type developing mouse testes . Further, as the prevalent transcription of RTEs will results in a large population of fragmented transcripts of sizes similar to piRNAs, analysis should be restricted to libraries of RNAs associated with Piwi proteins. This limits the available data to libraries from the above-mentioned study by Aravin and colleagues .
Results and Discussion
Theoretical piRNA coverage of individual RTE loci
Piwi-RNA sequence read libraries
Percent of mapped reads covering:
Raw reads (× 1000)
Mapped reads (× 1000)
Higher piRNA coverage of younger elements
Gene expression levels and piRNA coverage
For some RTE types, the relative number of young and old loci differ between the vicinities of highly expressed genes and lowly expressed genes, (Additional File 1, Figure S2), suggesting that the higher levels of piRNA coverage of RTE near highly expressed genes could simply be explained by the age of the RTE sequences. However, when repeating the analysis without the youngest RTE sequences, essentially similar results and significance levels are found (Additional File 1, Table S3).
Interestingly, when assessing the piRNA coverage of RTE sequence near transcription start sites (TSS), peaks are observed immediately upstream of TSS on the reverse strand, and for piRNAs not targeting RTEs, also immediately downstream of TSS on the forward strand (Additional File 1, Figure S3). Such a pattern resembles that of the recently discovered short transcripts generated around TSS (the TSS-associated RNAs) [45, 46] suggesting that these piRNAs may in fact be TSS-associated RNAs. It is uncertain if this represents experimental contamination of non-piRNAs or if these TSS-associated RNAs provide the transcripts that a processed into piRNAs, although the presence of RNA reads smaller than the usual 24-30 nucleotides - especially among early MILI piRNAs - hints that a contribution from the former scenario cannot be ruled out (Additional File 1, Figure S4). Assuming all RNAs mapping within 1000 base pairs of an annotated TSS are TSS-associated RNAs and removing these from the analysis does not change any of the presented conclusions (data not shown).
piRNA coverage and distance to genes
Strand bias in piRNA coverage of genic RTEs
As reported previously, RTE families are targeted very differently by piRNAs in developing mouse testes. By focusing on the total population of RTE loci, the present reanalysis of published data reveal further differences in piRNA targeting between individual members of RTE families. The available data for this analysis is arguably limited and the presented data relies on a single set of experiments. Although deep-sequencing techniques ideally should provide sequences from all available transcripts in a neutral fashion, biases may be introduced experimentally, especially during construction of libraries . Furthermore, considerable biological differences in RTE sequences have been reported between mouse strains [49, 50]. Nevertheless, the vast majority of RTE sequences will be shared among all extant mice, and the results presented here are all of a global genomic character with no predictions for individual loci, suggesting a fair generality of the findings.
Transcriptional activity is correlated between genomic regions residing near each other [51–53], and the observation that piRNA targeting of RTEs is higher around highly expressed genes, may simply reflect that transcription of RTEs is more permissible near highly expressed genes. A correlation between transcription levels of LTR sequences and their neighbouring genes has previously been reported in fission yeast . This further supports the notion that RTE transcripts are not specifically recognized as RTEs by the Piwi proteins, but are largely triggering the piRNA response in a manner proportional to their presence. It should be stressed that the reported preference by MILI for sense RTE sequences and the corresponding preference by MIWI2 for antisense sequences  suggest some level of discrimination of transcripts.
In postnatal testis development, piRNA targeting is shifted towards loci residing in introns of protein-coding genes. If, as assumed, active transcription of RTE loci is repressed at this stage, one would expect a higher proportion of RTE sequences in the total transcriptome to be derived from co-transcription of intronic RTE loci. This observation could at least in part explain the previously observed increase in piRNAs targeting SINE elements in postnatal stages , as SINE elements are the most abundant RTEs in introns (Additional File 1, Figure S5). Therefore, the increased piRNA response directed at SINE sequences does not necessitate transcription of active SINE elements in postnatal development. In fact, as SINE elements are non-autonomous, presumably using the enzymatic machinery provided by LINE elements [55, 56], there should be no basis for SINE proliferation in postnatal development if the prenatal silencing of LINE persists. Yet, SINE transcription may take place without subsequent transposition, and the known functional effects of mammalian SINE transcription [57–59] and the recently reported SINE RNA toxicity  suggest both active SINE transcription in later development, and the possible need for regulation.
On an evolutionary time scale, RTE activity has contributed hugely to the evolution of mammalian genomes [61–64], and when attempting to understand the diversity of present eukaryotic life it is essential to include the history and activity of RTEs. However, RTEs are not just silent passengers that occasionally spring into action, but have to be dealt with within each individual's life history. In this respect, the indirect approach of analysing small RNAs generated to repress RTE activity in the germline may produce further valuable knowledge on the activity of RTEs during development.
Data and Annotations
Small RNA libraries accession numbers GSM319953 (MILI late), GSM319956 (MILI early) and GSM319957 (MIWI2 early) were retrieved through DeepBase (http://deepbase.sysu.edu.cn/)  and mapped to the mouse genome (mm9 assembly) using bowtie . Prior to mapping, reads were filtered and sequences with ambiguous base calls and low-complexity sequences were removed. The latter was done measuring the linguistic complexity  of the sequences in 16 bp windows, and excluding reads with an average complexity of less than 0.75. Preliminary tests showed that this would remove highly repetitive reads with very large numbers of genomic mappings (not shown). For each library, this procedure filtered out between 0.14 and 0.17% of all raw reads. RepeatMasker and known gene annotations were downloaded from the UCSC Genome Browser [15, 40, 67]. A set of non-overlapping TSS was selected by grouping all known genes according to their assignment to ENSEMBL genes . For each ENSEMBL gene, the most abundant genomic start point was selected. If more than one point was found to have the highest abundance, the one furthest upstream of these was chosen. Gene expression levels were assessed from the 'testis' signal intensities in the Mouse GNF1M Gene Atlas from BioGPS (http://biogps.gnf.org/) .
Mapping and coverage
For each RTE loci the number of reads mapped within the locus were recorded. Reads were assigned a score of 1/(number of genomic mappings of read), so that only uniquely mapping reads scored 1. The read score were then divided by the size of the RTE loci (in kilo-base pairs). Finally, scores were divided by the total number (in millions) of mapped reads from the library in question.
To test for difference between RTE loci from different genomic regions (data presented in Figure 5), all RTE families were first split in 3 groups based on age where after members in each group were divided according to their genomic context (genic, proximal, distal). Hence, for each RTE family, nine sets of loci were formed, and the average piRNA coverage for each set was recorded. To test for difference between two groups (for example, between old LINE loci being genic or distal), pairs of average values were collected for the 90 LINE families (Additional File 1, Table S1) and tested using a Wilcoxon Signed-Rank Test. Bonferroni corrections (n = 108 in Figure 5) were calculated as: pcorrected = 1-(1-p)n. All statistical analyses were carried out using R .
This work was supported by a grant from the Lundbeck Foundation. I am indebted to JianHua Yang for invaluable advice on small RNAs, for preparing and providing RNA reads from DeepBase, and for critical reading of the manuscript.
- Kazazian HH: Mobile Elements: Drivers of Genome Evolution. Science. 2004, 303 (5664): 1626-1632. 10.1126/science.1089670.PubMedView ArticleGoogle Scholar
- Yohn CT, Jiang Z, McGrath SD, Hayden KE, Khaitovich P, Johnson ME, Eichler MY, McPherson JD, Zhao S, Paabo S, et al: Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans. PLoS Biol. 2005, 3 (4): e110-10.1371/journal.pbio.0030110.PubMed CentralPubMedView ArticleGoogle Scholar
- Akagi K, Li J, Stephens RM, Volfovsky N, Symer DE: Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition. Genome Research. 2008, 18 (6): 869-880. 10.1101/gr.075770.107.PubMed CentralPubMedView ArticleGoogle Scholar
- Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SE: Natural genetic variation caused by transposable elements in humans. Genetics. 2004, 168 (2): 933-951. 10.1534/genetics.104.031757.PubMed CentralPubMedView ArticleGoogle Scholar
- Mills RE, Bennett EA, Iskow RC, Luttig CT, Tsui C, Pittard WS, Devine SE: Recently mobilized transposons in the human and chimpanzee genomes. American journal of human genetics. 2006, 78 (4): 671-679. 10.1086/501028.PubMed CentralPubMedView ArticleGoogle Scholar
- Callinan PA, Batzer MA: Retrotransposable elements and human disease. Genome Dyn. 2006, 1: 104-115.PubMedView ArticleGoogle Scholar
- Deininger PL, Batzer MA: Alu repeats and human disease. Molecular genetics and metabolism. 1999, 67 (3): 183-193. 10.1006/mgme.1999.2864.PubMedView ArticleGoogle Scholar
- Kazazian HH: Mobile elements and disease. Current Opinion in Genetics & Development. 1998, 8 (3): 343-350. 10.1016/S0959-437X(98)80092-0.View ArticleGoogle Scholar
- Maksakova IA, Mager DL, Reiss D: Keeping active endogenous retroviral-like elements in check: the epigenetic perspective. Cell Mol Life Sci. 2008, 65 (21): 3329-3347. 10.1007/s00018-008-8494-3.PubMedView ArticleGoogle Scholar
- Schumann GG: APOBEC3 proteins: major players in intracellular defence against LINE-1-mediated retrotransposition. Biochemical Society transactions. 2007, 35 (Pt 3): 637-642.PubMedView ArticleGoogle Scholar
- Zamudio N, Bourc'his D: Transposable elements in the mammalian germline: a comfortable niche or a deadly trap?. Heredity. 2010, 105 (1): 92-104. 10.1038/hdy.2010.53.PubMedView ArticleGoogle Scholar
- Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, et al: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420 (6915): 520-562. 10.1038/nature01262.PubMedView ArticleGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038/35057062.PubMedView ArticleGoogle Scholar
- Maksakova IA, Romanish MT, Gagnier L, Dunn CA, van de Lagemaat LN, Mager DL: Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line. PLoS genetics. 2006, 2 (1): e2-10.1371/journal.pgen.0020002.PubMed CentralPubMedView ArticleGoogle Scholar
- Smit AFA, Hubley R, Green P: RepeatMasker Open-3.0. 1996, [http://www.repeatmasker.org]Google Scholar
- Stocking C, Kozak CA: Murine endogenous retroviruses. Cell Mol Life Sci. 2008, 65 (21): 3383-3398. 10.1007/s00018-008-8497-0.PubMedView ArticleGoogle Scholar
- Mears ML, Hutchison CA: The evolution of modern lineages of mouse L1 elements. J Mol Evol. 2001, 52 (1): 51-62.PubMedView ArticleGoogle Scholar
- Goodier JL, Ostertag EM, Du K, Kazazian HH: A novel active L1 retrotransposon subfamily in the mouse. Genome Res. 2001, 11 (10): 1677-1685. 10.1101/gr.198301.PubMed CentralPubMedView ArticleGoogle Scholar
- Naas TP, DeBerardinis RJ, Moran JV, Ostertag EM, Kingsmore SF, Seldin MF, Hayashizaki Y, Martin SL, Kazazian HH: An actively retrotransposing, novel subfamily of mouse L1 elements. Embo J. 1998, 17 (2): 590-597. 10.1093/emboj/17.2.590.PubMed CentralPubMedView ArticleGoogle Scholar
- Penzkofer T, Dandekar T, Zemojtel T: L1Base: from functional annotation to prediction of active LINE-1 elements. Nucl Acids Res. 2005, 33 (suppl_1): D498-500.PubMed CentralPubMedGoogle Scholar
- Batzer MA, Deininger PL: Alu repeats and human genomic diversity. Nat Rev Genet. 2002, 3 (5): 370-379. 10.1038/nrg798.PubMedView ArticleGoogle Scholar
- Daniels GR, Deininger PL: Repeat sequence families derived from mammalian tRNA genes. Nature. 1985, 317 (6040): 819-822. 10.1038/317819a0.PubMedView ArticleGoogle Scholar
- O'Donnell KA, Boeke JD: Mighty Piwis defend the germline against genome intruders. Cell. 2007, 129 (1): 37-44. 10.1016/j.cell.2007.03.028.PubMed CentralPubMedView ArticleGoogle Scholar
- Moazed D: Small RNAs in transcriptional gene silencing and genome defence. Nature. 2009, 457 (7228): 413-420. 10.1038/nature07756.PubMed CentralPubMedView ArticleGoogle Scholar
- Tang F: Small RNAs in mammalian germline: Tiny for immortal. Differentiation. 2010, 79 (3): 141-146. 10.1016/j.diff.2009.11.002.PubMedView ArticleGoogle Scholar
- Klattenhoff C, Theurkauf W: Biogenesis and germline functions of piRNAs. Development. 2008, 135 (1): 3-9.PubMedView ArticleGoogle Scholar
- Girard Al, Sachidanandam R, Hannon GJ, Carmell MA: A germline-specific class of small RNAs binds mammalian Piwi proteins. Nature. 2006, 442 (7099): 199-202.PubMedGoogle Scholar
- Lau NC, Seto AG, Kim J, Kuramochi-Miyagawa S, Nakano T, Bartel DP, Kingston RE: Characterization of the piRNA complex from rat testes. Science. 2006, 313 (5785): 363-367. 10.1126/science.1130164.PubMedView ArticleGoogle Scholar
- Aravin AA, Sachidanandam R, Girard A, Fejes-Toth K, Hannon GJ: Developmentally Regulated piRNA Clusters Implicate MILI in Transposon Control. Science. 2007, 316 (5825): 744-747. 10.1126/science.1142612.PubMedView ArticleGoogle Scholar
- Carmell MA, Girard A, van de Kant HJ, Bourc'his D, Bestor TH, de Rooij DG, Hannon GJ: MIWI2 is essential for spermatogenesis and repression of transposons in the mouse male germline. Dev Cell. 2007, 12 (4): 503-514. 10.1016/j.devcel.2007.03.001.PubMedView ArticleGoogle Scholar
- Aravin AA, Sachidanandam R, Bourc'his D, Schaefer C, Pezic D, Toth KF, Bestor T, Hannon GJ: A piRNA pathway primed by individual transposons is linked to de novo DNA methylation in mice. Molecular cell. 2008, 31 (6): 785-799. 10.1016/j.molcel.2008.09.003.PubMed CentralPubMedView ArticleGoogle Scholar
- Kuramochi-Miyagawa S, Watanabe T, Gotoh K, Totoki Y, Toyoda A, Ikawa M, Asada N, Kojima K, Yamaguchi Y, Ijiri TW, et al: DNA methylation of retrotransposon genes is regulated by Piwi family members MILI and MIWI2 in murine fetal testes. Genes Dev. 2008, 22 (7): 908-917. 10.1101/gad.1640708.PubMed CentralPubMedView ArticleGoogle Scholar
- Weaver JR, Susiarjo M, Bartolomei MS: Imprinting and epigenetic changes in the early embryo. Mamm Genome. 2009, 20 (9-10): 532-543. 10.1007/s00335-009-9225-2.PubMedView ArticleGoogle Scholar
- Aravin AA, Hannon GJ, Brennecke J: The Piwi-piRNA pathway provides an adaptive defense in the transposon arms race. Science. 2007, 318 (5851): 761-764. 10.1126/science.1146484.PubMedView ArticleGoogle Scholar
- Trelogan SA, Martin SL: Tightly regulated, developmentally specific expression of the first open reading frame from LINE-1 during mouse embryogenesis. Proc Natl Acad Sci USA. 1995, 92 (5): 1520-1524. 10.1073/pnas.92.5.1520.PubMed CentralPubMedView ArticleGoogle Scholar
- Dupressoir A, Heidmann T: Germ line-specific expression of intracisternal A-particle retrotransposons in transgenic mice. Molecular and cellular biology. 1996, 16 (8): 4495-4503.PubMed CentralPubMedView ArticleGoogle Scholar
- Kuramochi-Miyagawa S, Kimura T, Yomogida K, Kuroiwa A, Tadokoro Y, Fujita Y, Sato M, Matsuda Y, Nakano T: Two mouse piwi-related genes: miwi and mili. Mech Dev. 2001, 108 (1-2): 121-133. 10.1016/S0925-4773(01)00499-3.PubMedView ArticleGoogle Scholar
- Kuramochi-Miyagawa S, Kimura T, Ijiri TW, Isobe T, Asada N, Fujita Y, Ikawa M, Iwai N, Okabe M, Deng W, et al: Mili, a mammalian member of piwi family gene, is essential for spermatogenesis. Development. 2004, 131 (4): 839-849. 10.1242/dev.00973.PubMedView ArticleGoogle Scholar
- Yang JH, Shao P, Zhou H, Chen YQ, Qu LH: deepBase: a database for deeply annotating and mining deep sequencing data. Nucleic Acids Res. 2010, 38 (Database): D123-130. 10.1093/nar/gkp943.PubMed CentralPubMedView ArticleGoogle Scholar
- Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J: Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005, 110 (1-4): 462-467. 10.1159/000084979.PubMedView ArticleGoogle Scholar
- Lowe CB, Bejerano G, Haussler D: Thousands of human mobile element fragments undergo strong purifying selection near developmental genes. Proc Natl Acad Sci USA. 2007, 104 (19): 8005-8010. 10.1073/pnas.0611223104.PubMed CentralPubMedView ArticleGoogle Scholar
- Tsirigos A, Rigoutsos I: Alu and b1 repeats have been selectively retained in the upstream and intronic regions of genes of specific functional classes. PLoS Comput Biol. 2009, 5 (12): e1000610.-PubMed CentralPubMedView ArticleGoogle Scholar
- Abrusan G, Giordano J, Warburton PE: Analysis of transposon interruptions suggests selection for L1 elements on the × chromosome. PLoS genetics. 2008, 4 (8): e1000172-10.1371/journal.pgen.1000172.PubMed CentralPubMedView ArticleGoogle Scholar
- Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, et al: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci USA. 2004, 101 (16): 6062-6067. 10.1073/pnas.0400782101.PubMed CentralPubMedView ArticleGoogle Scholar
- Jacquier A: The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs. Nat Rev Genet. 2009, 10 (12): 833-844.PubMedView ArticleGoogle Scholar
- Seila AC, Calabrese JM, Levine SS, Yeo GW, Rahl PB, Flynn RA, Young RA, Sharp PA: Divergent transcription from active promoters. Science. 2008, 322 (5909): 1849-1851. 10.1126/science.1162253.PubMed CentralPubMedView ArticleGoogle Scholar
- Medstrand P, van de Lagemaat LN, Mager DL: Retroelement distributions in the human genome: variations associated with age and proximity to genes. Genome Res. 2002, 12 (10): 1483-1495. 10.1101/gr.388902.PubMed CentralPubMedView ArticleGoogle Scholar
- McCormick KP, Willmann MR, Meyers BC: Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments. Silence. 2011, 2 (1): 2.-10.1186/1758-907X-2-2.PubMed CentralPubMedView ArticleGoogle Scholar
- Akagi K, Li J, Stephens RM, Volfovsky N, Symer DE: Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition. Genome Res. 2008, 18 (6): 869-880. 10.1101/gr.075770.107.PubMed CentralPubMedView ArticleGoogle Scholar
- Chang-Yeh A, Mold DE, Brilliant MH, Huang RC: The mouse intracisternal A particle-promoted placental gene retrotransposition is mouse-strain-specific. Proc Natl Acad Sci USA. 1993, 90 (1): 292-296. 10.1073/pnas.90.1.292.PubMed CentralPubMedView ArticleGoogle Scholar
- Batada NN, Hurst LD: Evolution of chromosome organization driven by selection for reduced gene expression noise. Nat Genet. 2007, 39 (8): 945-949. 10.1038/ng2071.PubMedView ArticleGoogle Scholar
- Ebisuya M, Yamamoto T, Nakajima M, Nishida E: Ripples from neighbouring transcription. Nat Cell Biol. 2008, 10 (9): 1106-1113. 10.1038/ncb1771.PubMedView ArticleGoogle Scholar
- Batada NN, Urrutia AO, Hurst LD: Chromatin remodelling is a major source of coexpression of linked genes in yeast. Trends in Genetics. 2007, 23 (10): 480-484. 10.1016/j.tig.2007.08.003.PubMedView ArticleGoogle Scholar
- Mourier T, Willerslev E: Large-scale transcriptome data reveals transcriptional activity of fission yeast LTR retrotransposons. BMC genomics. 2010, 11: 167-10.1186/1471-2164-11-167.PubMed CentralPubMedView ArticleGoogle Scholar
- Esnault C, Maestre J, Heidmann T: Human LINE retrotransposons generate processed pseudogenes. Nat Genet. 2000, 24 (4): 363-367. 10.1038/74184.PubMedView ArticleGoogle Scholar
- Dewannieux M, Esnault C, Heidmann T: LINE-mediated retrotransposition of marked Alu sequences. Nat Genet. 2003, 35 (1): 41-48. 10.1038/ng1223.PubMedView ArticleGoogle Scholar
- Espinoza CA, Allen TA, Hieb AR, Kugel JF, Goodrich JA: B2 RNA binds directly to RNA polymerase II to repress transcript synthesis. Nat Struct Mol Biol. 2004, 11 (9): 822-829. 10.1038/nsmb812.PubMedView ArticleGoogle Scholar
- Allen TA, Von Kaenel S, Goodrich JA, Kugel JF: The SINE-encoded mouse B2 RNA represses mRNA transcription in response to heat shock. Nat Struct Mol Biol. 2004, 11 (9): 816-821. 10.1038/nsmb813.PubMedView ArticleGoogle Scholar
- Mariner PD, Walters RD, Espinoza CA, Drullinger LF, Wagner SD, Kugel JF, Goodrich JA: Human Alu RNA Is a Modular Transacting Repressor of mRNA Transcription during Heat Shock. Molecular cell. 2008, 29 (4): 499-509. 10.1016/j.molcel.2007.12.013.PubMedView ArticleGoogle Scholar
- Kaneko H, Dridi S, Tarallo V, Gelfand BD, Fowler BJ, Cho WG, Kleinman ME, Ponicsan SL, Hauswirth WW, Chiodo VA, et al: DICER1 deficit induces Alu RNA toxicity in age-related macular degeneration. Nature. 2011, 471 (7338): 325-330. 10.1038/nature09830.PubMed CentralPubMedView ArticleGoogle Scholar
- Mourier T: Reverse transcription in genome evolution. Cytogenet Genome Res. 2005, 110 (1-4): 56-62. 10.1159/000084938.PubMedView ArticleGoogle Scholar
- Krull M, Brosius J, Schmitz J: Alu-SINE exonization: en route to protein-coding function. Mol Biol Evol. 2005, 22 (8): 1702-1711. 10.1093/molbev/msi164.PubMedView ArticleGoogle Scholar
- Medstrand P, van de Lagemaat LN, Dunn CA, Landry JR, Svenback D, Mager DL: Impact of transposable elements on the evolution of mammalian gene regulation. Cytogenet Genome Res. 2005, 110 (1-4): 342-352. 10.1159/000084966.PubMedView ArticleGoogle Scholar
- Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, Ng HH, Bourque G: Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet. 2010, 42 (7): 631-634. 10.1038/ng.600.PubMedView ArticleGoogle Scholar
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10 (3): R25-10.1186/gb-2009-10-3-r25.PubMed CentralPubMedView ArticleGoogle Scholar
- Gabrielian A, Bolshoy A: Sequence complexity and DNA curvature. Comput Chem. 1999, 23 (3-4): 263-274. 10.1016/S0097-8485(99)00007-8.PubMedView ArticleGoogle Scholar
- Kuhn RM, Karolchik D, Zweig AS, Trumbower H, Thomas DJ, Thakkapallayil A, Sugnet CW, Stanke M, Smith KE, Siepel A, et al: The UCSC genome browser database: update 2007. Nucleic Acids Res. 2007, 35 (Database): D668-673. 10.1093/nar/gkl928.PubMed CentralPubMedView ArticleGoogle Scholar
- Hubbard TJ, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, et al: Ensembl 2007. Nucleic Acids Res. 2007, 35 (Database): D610-617. 10.1093/nar/gkl996.PubMed CentralPubMedView ArticleGoogle Scholar
- R Development Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL. 2007, [http://www.R-project.org]Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.