Identification of novel endogenous antisense transcripts by DNA microarray analysis targeting complementary strand of annotated genes
BMC Genomics volume 10, Article number: 392 (2009)
Recent transcriptomic analyses in mammals have uncovered the widespread occurrence of endogenous antisense transcripts, termed natural antisense transcripts (NATs). NATs are transcribed from the opposite strand of the gene locus and are thought to control sense gene expression, but the mechanism of such regulation is as yet unknown. Although several thousand potential sense-antisense pairs have been identified in mammals, examples of functionally characterized NATs remain limited. To identify NAT candidates suitable for further functional analyses, we performed DNA microarray-based NAT screening using mouse adult normal tissues and mammary tumors to target not only the sense orientation but also the complementary strand of the annotated genes.
First, we designed microarray probes to target the complementary strand of genes for which an antisense counterpart had been identified only in human public cDNA sources, but not in the mouse. We observed a prominent expression signal from 66.1% of 635 target genes, and 58 genes of these showed tissue-specific expression. Expression analyses of selected examples (Acaa1b and Aard) confirmed their dynamic transcription in vivo. Although interspecies conservation of NAT expression was previously investigated by the presence of cDNA sources in both species, our results suggest that there are more examples of human-mouse conserved NATs that could not be identified by cDNA sources. We also designed probes to target the complementary strand of well-characterized genes, including oncogenes, and compared the expression of these genes between mammary cancerous tissues and non-pathological tissues. We found that antisense expression of 95 genes of 404 well-annotated genes was markedly altered in tumor tissue compared with that in normal tissue and that 19 of these genes also exhibited changes in sense gene expression. These results highlight the importance of NAT expression in the regulation of cellular events and in pathological conditions.
Our microarray platform targeting the complementary strand of annotated genes successfully identified novel NATs that could not be identified by publically available cDNA data, and as such could not be detected by the usual "sense-targeting" microarray approach. Differentially expressed NATs monitored by this platform may provide candidates for investigations of gene function. An advantage of our microarray platform is that it can be applied to any genes and target samples of interest.
There is a growing body of evidence that natural antisense transcripts (NATs) play important regulatory roles in various biological processes. NATs are usually transcribed from the opposite strand of a particular gene locus, and they are thought to regulate sense gene expression [1, 2]. One of the proposed models of NAT-mediated regulation is for the antisense transcript to act as a cis-repressor of gene expression from the sense strand. For example, in early embryogenesis, transcription of the antisense genes Tsix and Air determines the fate of expression of their sense partners Xist and Igf2r, respectively [3, 4]. The appearance of NATs within several imprinted loci suggests that NATs may regulate gene expression by controlling the epigenetic status of surrounding genes [5–7]. Moreover, NATs may function in pathological conditions by causing epigenetic alterations such as histone modification and DNA methylation [8, 9].
The other primary model of NAT-mediated gene regulation is induction of the production of small RNAs from NAT loci and their subsequent function in RNA interference (RNAi) pathways. Endogenous small interfering RNA (endo-siRNA) molecules, generated from NAT loci, are induced specifically under conditions of salt stress and immune response in plants [10–15]. Recent experimental data also suggests the presence of NAT-associated endo-siRNA molecules in animals [16–18].
Although the number of NATs thought to have biological functions has gradually increased, the functions of most NATs discovered in recent large-scale in silico studies are unknown. Computational identification of NATs is based mostly on the analysis of cDNA and EST sequence collections by sequence alignment, and this process has identified several thousand sense-antisense pairs . However, in principle, cDNA sequencing accumulates data on transcripts with poly(A)-stretches and does not access the non-poly-adenylated population of transcripts. A recent genome-wide tiling array study of the human genome revealed that many genomic regions that could not be identified from cDNA collections are apparently transcribed and tend not to be poly-adenylated . This finding indicates that antisense transcriptome analyses based solely on cDNA information may be inefficient. In addition, most publicly available cDNA sequences are derived from normal cellular conditions, such as normal adult tissues, and thus are not useful for the identification of NATs specific to abnormal cellular conditions.
To discover novel NATs expressed under various biological conditions, we proposed a microarray-based technique involving the use of 60-mer oligonucleotide DNA probes selected from the complementary sequences of cDNAs (i.e., known genes), referred to as artificial antisense sequence (AFAS) probes. This approach has the ability to detect antisense expression that cannot be identified by using information from the cDNA and EST collections and has the advantage of compatibility with the computational methodology widely used for sense gene expression analysis . We performed microarray analyses with AFAS probes by using oligo-dT and random primed target samples to provide a comprehensive approach for the detection of novel non-poly-adenylated transcripts in the antisense transcriptome.
Here, we designed AFAS probes to correspond to the antisense strand of well-studied selected genes, including oncogenes and tumor suppressor genes, imprinted genes, and human-mouse orthologous genes. We studied the expression profiles of targeted transcripts in normal mouse adult tissues and in mouse mammary tumor virus (MMTV)-induced mammary tumors. This technique is applicable to all genes and sample types and can be used for antisense expression identification that is not possible by using conventional cDNA information alone.
AFAS probes detect previously known NATs
To verify whether our methods can detect NAT expression, we initially examined the signal intensities of AFAS probes that targeted previously identified antisense transcripts. For example, AFAS probes designed for Tsix, which reflects the abundance of Xist RNA, detected expression in the 11 adult mouse tissues (mixed males and females), but not in the testis, as expected (see Additional file 1). Such expression patterns were detected only for probes corresponding to the exonic-overlapping regions between Tsix and Xist, and also for the sense probes corresponding to Xist RNA (see Additional file 1). This finding suggests that AFAS probes for Tsix can identify not only the presence of its antisense counterpart, but also its exonic regions. In addition, AFAS probes for several imprinted genes (Igf2r, Kcnq1, Gnas, Dio3, and Ube3a), which are known to give rise to antisense transcripts , also gave prominent signals (see Additional file 2). Moreover, previously known antisense transcripts, such as those arising from Myc (myelocytomatosis oncogene)  and Tgfb2 (transforming growth factor beta2) , were also detected by our microarray platform (data not shown). Although the number of documented NAT examples in normal mouse adult tissues is limited, the mean signal intensities generated by AFAS probes corresponding to these genes were higher than those for the negative control genes. The control genes comprise a set of randomly selected genes, without cDNA, EST, and CAGE tags in the antisense orientation (Figure 1A, P = 5.1e-12 by Welch's t-test). These data indicate that endogenous NAT expression of known genes is potentially detectable by probes designed for the complementary strand of known genes.
Global analyses of AFAS probes
Before screening for novel NATs using AFAS probes, we first analyzed the global tendency of signal intensities from all AFAS probes applied to our custom microarray platform. Because Northern blot analyses for particular gene loci have previously shown that NATs tend to be poly(A)-negative , we checked whether our AFAS probes also showed this tendency in normal mouse tissue expression profiling. A significantly higher number of AFAS probes than sense probes detected transcripts only within random-primed samples, but not among the oligo-dT primed targets (P < 2.2e-16, Fisher's exact test, see Additional file 3). This result indicates that transcripts detected by AFAS probes also lack poly(A)-tails, similar to the finding for NATs characterized by Northern blot analyses . Also, the number of sense probes detecting transcripts in both oligo-dT and random primed samples was higher than that of antisense probes (P < 2.2e-16, Fisher's exact test, see Additional file 3). This finding indicates that sense transcripts with poly(A)-tails can be identified by both priming methods, because sense probes target the protein-coding strand of the mRNA, which is expected to have a poly(A)-tail. Another characteristic of endogenous NATs is their nuclear localization . A distribution comparison of AFAS probe signals between nuclear and cytoplasmic fractions clearly showed nuclear enrichment of detected transcripts (Figure 1B, P < 2.2e-16, Median test).
Several large-scale studies using information in cDNA and EST collections and from genome-wide tiling arrays in yeast previously showed that NATs tend to be transcribed from the 3' region of its counterpart mRNA [25–27], thus implying the presence of regulatory mechanisms involving tail-to-tail overlapping. We also observed this characteristic for the AFAS probes, because the AFAS probe signals clearly showed positional preference relative to the sense mRNA (Figure 1C). This result indicates that AFAS probes indeed detect the positional bias of antisense transcription. Similarly, we also observed higher signals within 5' regions (Figure 1C), thus suggesting that NATs may also arise near the transcriptional start site, as previously shown for head-to-head overlapping NATs such as WT1, Sphk1, and Tsix [28–30].
Novel conserved NAT detection by normal tissue profiling
To test the ability of AFAS probes to detect novel NATs, we initially applied our microarray approach to the human-mouse orthologous gene set. In many studies, inter-species conservation of NATs is implied by the presence of common cDNA sequences between the two species [26, 31, 32]. However, recent genome-wide tiling array and CAGE analyses revealed that a large fraction of the genome is transcribed [33, 34], indicating that current cDNA collection is not sufficient for comprehensive comparative genomics, including comparative antisense transcriptome analyses. In this situation, the use of AFAS probes corresponding to genes for which the antisense counterpart has been identified in humans, but not in mice, may lead to the detection of novel conserved NATs (Figure 2).
We designed AFAS probes corresponding to 635 mouse orthologous partners, for which the antisense counterpart has been identified in humans, but not in mice (one sense and one antisense probe were designed per gene). We then profiled the expression of these genes to detect antisense expression within 12 normal mouse tissues. We identified 420 (66.1%) probes that gave a signal (signal intensity ≥100, which is our empirically defined criterion), at least in a single particular tissue, and 58 of these (9.2%) showed tissue-specific expression. Probes of 120 genes gave signals with a higher than average intensity according to inter-array normalization (see Additional file 4). These results suggest that many NATs identified only in the human cDNA collection may also be expressed in mice.
We attempted to validate the expression of two candidate conserved NATs (antisense of Acaa1b and Aard) by performing Northern and in situ hybridization (ISH) analyses. Whereas human ACAA1 (acetyl-Coenzyme A acyltransferase 1) overlaps with DLEC1 (deleted in lung and esophageal cancer 1) in a tail-to-tail overlapping manner, its orthologous counterpart in the mouse genome (Acaa1b and Dlec1) shows a tail-to-tail relationship but not a reciprocal overlapping relationship, according to the annotated gene structure (see Additional file 5). Both microarray and Northern analyses confirmed that the Acaa1b sense transcript is expressed within liver and kidney (see Additional file 5). Northern analyses were not able to detect the antisense transcript of Acaa1b from either poly(A)+ or total RNA (data not shown), but quantitative RT-PCR, ISH and microarray analyses were able to detect this transcript within the testis and kidney (see Additional file 5). This result implies that NATs detected by microarray analysis using AFAS probes are transcribed in vivo.
We also analyzed the expression of Aard (alanine- and arginine-rich domain-containing protein), which is a functionally uncharacterized gene but is known to be expressed within the adult testis and XY fetal gonad . In humans, exons of AARD (also known as C8orf85) overlap with that of an unnamed uterus EST (GenBank: AK093981), whereas mouse Aard has no EST arising from the antisense strand (Figure 3B). Northern analysis confirmed that expression of the sense transcript of Aard was testis-specific (Figure 3C); however, Northern analysis of the antisense transcript showed laddered hybridization patterns for total RNA, but not for poly(A)+ RNA isolated from all samples (Figure 3D). By comparison, both the sense and antisense transcripts (Aard-AS) were detected by ISH within a particular region of the seminiferous tubules (Figure 4A,B), thus confirming that the Aard-AS is also expressed in the testis. In addition, Aard-AS was most likely located within the nucleus, whereas Aard was located within the cytoplasm (Figure 4C,D). Because ISH shows that Aard-AS is expressed in a particular region of the seminiferous tubules, we checked our microarray data on fractionated testis samples that reflected the three steps of spermatogenesis (i.e., pachytene spermatocytes, round spermatids, and elongated spermatids). We found that Aard-AS was expressed within the early period of spermatogenesis, whereas the sense transcript appeared at a later phase (Figure 4E). This finding shows that sense and antisense transcripts of Aard are transcribed exclusively and in a mutually antagonistic fashion during spermatogenesis. In addition, Aard-AS expression was detected only in the random-primed target sample, not in the oligo-dT primed target (Figure 4E), indicating that Aard-AS tends to be poly(A)-negative and nuclear-localized.
These data clearly confirm that AFAS probes can detect the expression of antisense transcripts in normal tissues, and that they can also identify transcripts expressed in a tissue- and cell-type-specific manner. Detection of such expression dynamics for antisense transcripts is possible only by using the analytical platform targeting the complementary strand of the annotated genes. Thus, AFAS probes, when used within appropriate biological samples and combined with other analytical modalities, can be used to discover genuine functional NATs; this is an advantage over conventional approaches that depend on publicly available cDNA data.
Detection of novel NATs differentially expressed under pathological conditions
We next checked whether AFAS probes have the ability to detect antisense transcripts in cancerous tissues. Examples of functional antisense transcripts identified in abnormal cells are CDKN2B, WT1, and HBA2 [8, 9, 29]. These antisense transcripts control the epigenetic status of surrounding genes by DNA methylation or histone modification and thus are thought to affect the expression of their sense partners. To confirm this notion, we applied the AFAS probe technique to the 404 well-characterized genes including oncogenes and tumor suppressors (1752 AFAS probes were successfully designed, giving 4.4 probes per gene on average). We used these probes in microarray experiments based on the GRS/A mouse strain, which frequently suffers from (MMTV)-induced mammary tumors .
For the probes designed to detect the sense transcripts, we identified 57 genes showing differential expression. Among these, 48 were up-regulated and 9 were down-regulated within tumor regions, compared with in normal regions, according to a set statistical threshold (P ≤ 0.05 by Student's t-test) (Figure 5 and Additional file 6). Among the up-regulated genes in tumors, 12 genes (Pdcd6 is shown as an example in Additional file 7) showed loss of antisense expression (Figure 5A, right lower), whereas among the down-regulated genes Nr2c2 showed up-regulation of its antisense expression in an anti-correlated manner with the sense transcript counterpart (Figure 5A, left upper). These genes are reminiscent of the model in which antisense transcription may lead to the silencing of sense gene expression, such as cyclin-dependent kinase inhibitor (CDKN2B) and its antisense counterpart . These genes may be regulated through an antisense-mediated pathway.
Interestingly, the expression of antisense transcripts representing 37 genes (Thbd is shown as an example in Figure 6A) was found to increase, despite the absence of changes in expression of their sense transcript counterparts (Figure 5B). We also identified down-regulated antisense transcripts corresponding to 45 genes (Drd4 is shown as an example in Additional file 7) for which there were no changes in expression of their corresponding sense transcripts. Because ISH using cancerous tissues, like microarray analysis, can detect antisense expression arising from Thbd (thrombomodulin) (Figure 6B–D), there might be more examples of genes for which antisense expression is altered in cancerous tissue but cannot be detected by microarray analysis that targets expression from the sense strand of genes.
This paper shows that microarray probes targeting transcription from the complementary strand of known genes can identify novel NATs, an approach that has not been possible solely on the basis of publicly available cDNA data. Recently described high-density oligonucleotide tiling-array platforms are designed to overview the transcriptional landscape of specific genomic regions at high resolution. By comparison, our platform uses multiple probes to specifically screen for transcription from the antisense strand of known genes. Many previous studies have attempted to identify NATs by DNA microarray analysis using cDNA-oriented custom microarrays or commercially available microarray platforms [37–41]. Since our microarray platform is custom-made and not commercial, it can be applied to any genes or gene loci of interest. Furthermore, our method does not introduce bias from cDNA synthesis between sense and antisense profiling because it does not require specific protocols for target cDNA synthesis for NAT detection. In addition, our microarray platform approach can simultaneously profile sense and antisense expression in one microarray hybridization experiment.
Many NATs detected by AFAS probes were appeared only in the random-primed targets. This was concordance with previous cDNA-based microarray profiling of NAT expression . Whereas poly(A)-plus RNA population is roughly represented by oligo-dT primed cDNAs, whole transcriptome (including the poly(A)-minus RNA population) is represented by cDNAs synthesized by random primers. Therefore, NATs detected by our analysis tend to be poly(A)-negative. Although oligo-dT primers can pick the internal poly(A)-stretches, this is not an issue at the level of microarray-based NAT screening, because the vast majority of the poly(A)-stretch (approximately 90%) is located within the 3' end of the transcripts (data not shown).
By designating AFAS probes to human-mouse orthologous genes, we identified many probes showing positive signals. Two of these probes identified transcripts for which in vivo expression was confirmed. Thus, our approach may reveal more, as yet unidentified, conserved NATs; this has not been possible by conventional approaches, as previously reported using cDNA data [26, 31, 32]. Of the individually validated examples (Acaa1b and Aard), expression of Aard-AS was localized to the nucleus and was detected only in random-primed target samples. In addition, multiple-size hybridized bands pattern was observed especially for total RNA membrane, not for poly(A)+ RNA membrane. This observation is similar to that of previously identified antisense transcripts , and this is probably due to heterogeneously sized molecules of Aard-AS transcripts. Because ISH and the microarray data on other antisense transcript examples also show nuclear localization and poly(A)-avoidance (data not shown), it is possible that these features are general characteristics of the antisense transcriptome.
We also designed AFAS probes for well-characterized genes and identified several examples of correlated and anti-correlated expression between the NATs and the corresponding sense transcript within MMTV-induced mammary tumors. We observed differentially expressed genes for which expression of the antisense transcript had changed, whereas that of the sense transcript had not. Given that differential antisense expression might induce changes in epigenetic status, for example in CDKN2B and CDKN2BAS , antisense transcription may cause changes in the methylation status of neighboring genes. This notion can be tested by using methylated DNA immunoprecipitation (MeDIP) and chromatin immunoprecipitation (ChIP) on chip analyses to further characterize the antisense transcriptome and to determine whether specific NATs function as epigenetic regulators. Whereas this study revealed NATs specific to mouse tumors, human clinical samples have also been analyzed to screen for novel NATs by the same methodology; this new study has identified many antisense transcripts showing increased or decreased expression in human colon cancer tissues compared with controls (Saito R., Kohno K., Okada Y., Osada Y., Numata K., Watanabe K., Nakaoka H., Yamamoto N., Kanai A., Yasue H. et al., manuscript in preparation).
Although next-generation high-throughput transcriptome sequencing (RNA-seq) might replace microarray-based expression analyses, antisense transcriptome analysis by sequencing is still under development because of the laborious nature of strand-specific library construction . DNA microarray-based profiling makes it possible to gain a detailed view of specific genes or gene loci and can also provide expression profiles of both poly(A)-plus and poly(A)-minus RNAs.
We showed here that probes targeting the complementary strand of the annotated genes successfully identify novel NAT expression, including those altered tissue- and tumor-specifically. The results suggest that there are more examples of NATs that cannot not be collected from public cDNA sources. Further functional investigation is required for such dynamically expressed NATs, and the use of microarray platforms targeting both strands of the gene locus will help to narrow down the proper candidates for further functional analyses.
Custom microarray construction
The AFAS probes for detecting NATs were designed to detect antisense transcription originating from genes categorized into three groups: (1) 48 genes in which antisense transcription has been previously reported and 87 imprinted genes in mice, (2) 404 selected well-annotated genes, (3) orthologous genes in NAT loci (detailed definition given below), and (4) randomly selected genes for which there were no cDNA, EST, and CAGE tags in the antisense orientation. For categories (1) and (2), the AFAS probes were designed to correspond to every 500 bases of the antisense strand of the exonic regions of each gene. For category (3), the AFAS probes were designed to correspond to a single specific sequence in each transcript. For category (4), two AFAS probes were designed per transcript. Target region selection for the probe design is summarized in Additional file 8. All probes were computationally designed by using the OligoWiz program  and were used in the Agilent 44K custom oligoarray platform for single-color microarray analysis.
Target sample preparation for the microarray analysis
Total RNA for the mouse (C57BL/6J) microarray experiments was isolated from NIH3T3 cells (fibroblast cell line), SL10 cells (fibroblast cell line), brain, heart, intestine, kidney, liver, lung, placenta (d.p.c. 10.5 and 13.5), spleen, stomach, testis, and thymus. Testis was from C57BL/6J males (8 to 10 weeks), placenta was from pregnant mice, and the other tissue was from both male and female mice. Nuclear and cytoplasmic fractionation of NIH3T3 cells was carried out according to the Protein and RNA Isolation System (PARIS) instructions (Ambion Inc.). For the microarray analysis of murine mammary tumors, RNA samples were collected from normal and cancerous mammary glands of dissected GRS/A mice .
Data processing and the accessibility
Numerical processed signal values (gProcessedSignal) of the Agilent Feature Extraction File were obtained as representative expression levels for each probe within the array. If a spot had an intensity value lower than five, or if there was no prominent difference between foreground and background signals, then the intensity value was adjusted to five and the corresponding probe was treated as an "absent probe". To perform normalization of signal intensity distribution between multiple arrays, the whole mean signal of every hybridization experiment was adjusted to that of the data from SL10 cells by oligo-dT priming. Probes with intensity values lower than five, as well as being flagged as "saturated", were discarded for the inter-array-normalization step. Tissue-specificity of the expression signals was evaluated according to τ measurement . The raw data from the microarray analyses were deposited in the NCBI Gene Expression Omnibus (GEO) under accession number GSE14568 . Expression data as well as a simplified genomic structure can be accessed via an originally constructed viewer .
In silico identification of orthologous genes in NAT loci
To identify orthologous genes in NAT loci (Figure 2), we initially performed in silico identification of sense-antisense pairs by the same procedures as previously published , by using the latest full-length cDNA collections [33, 48], NCBI RefSeq mRNA  and the UniGene collection . This identified 3524 and 5351 exon-overlapping sense-antisense pairs in humans and mice, respectively. Genomic synteny data between human and mouse (defined by BLASTZ derived from UCSC ) was then exploited to determine whether each identified pair was located within the syntenic region between the two species. Those pairs located within the syntenic regions were retained for the orthologous relationship validation. The orthologous relationship between the genes located within the syntenic regions was defined according to the orthologous gene table from the BioMart Project . Finally, 648 genes are identified as orthologous genes for which NAT was identified in human cDNAs but not in mouse cDNAs. AFAS probes for these (635 of 648) were successfully designed.
Northern hybridization analyses
RNA from mouse tissues (C57BL/6J, 8 to 10 weeks, male and female mixed), and the NIH3T3 was isolated by using Trizol reagent (Invitrogen Corporation). Northern analyses were performed as previously described . Loading of equal amounts of RNA samples was confirmed by visualization of ethidium bromide-stained RNA in the gel. Probes specific for sense and antisense of Acaa1b (NM_146230), Aard (NM_175503), and Thbd (NM_009378) were amplified by the PCR (see Additional file 9). All the probe sequences contained their corresponding microarray probe sequences. cDNA fragments were cloned to the pGEM-T Easy Vector (Promega Corporation), and strand-specific cRNA was prepared for hybridization.
In situ hybridization
Probes specific for sense and antisense of Acaa1b (NM_146230), Aard (NM_175503), and Thbd (NM_009378) were amplified by the PCR (see Additional file 9). All the probe sequences contained their corresponding microarray probe sequences. The amplified fragment was sub-cloned into pGEMT-Easy vector (Promega) and was used for generation of sense or antisense RNA probes. Paraffin-embedded testis sections (6 μm) of normal adult mouse (C57BL/6 mouse, male, 8 weeks) were obtained from Genostaff Co., Ltd. For in situ hybridization the sections were hybridized with digoxigenin-labeled RNA probes at 60°C for 16 h. The bound label was detected using NBT-BCIP, an alkaline phosphate color substrate. The sections were counterstained with Kernechtrot (Muto Pure Chemicals Co., Ltd.). Probe sequence of negative control experiment was selected from Oryza sativa putative leaf protein (NM_197207) (see Additional file 5 and 10).
Real-time quantitative RT-PCR
cDNA was initially synthesized with gene-specific reverse primers (Acaa1b-AS and Gapdh) from selected tissue RNA (Brain, Testis, Kidney, and Liver), then subjected to quantitative RT-PCR. Gene expression level was normalized with Gapdh. Primers are listed in Additional file 11.
Lapidot M, Pilpel Y: Genome-wide natural antisense transcription: coupling its regulation to its different regulatory mechanisms. EMBO Rep. 2006, 7 (12): 1216-1222. 10.1038/sj.embor.7400857.
Lavorgna G, Dahary D, Lehner B, Sorek R, Sanderson CM, Casari G: In search of antisense. Trends Biochem Sci. 2004, 29 (2): 88-94. 10.1016/j.tibs.2003.12.002.
Shibata S, Lee JT: Tsix transcription- versus RNA-based mechanisms in Xist repression and epigenetic choice. Curr Biol. 2004, 14 (19): 1747-1754. 10.1016/j.cub.2004.09.053.
Sleutels F, Zwart R, Barlow DP: The non-coding Air RNA is required for silencing autosomal imprinted genes. Nature. 2002, 415 (6873): 810-813.
O'Neill MJ: The influence of non-coding RNAs on allele-specific gene expression in mammals. Hum Mol Genet. 2005, 14 (Spec No 1): R113-120. 10.1093/hmg/ddi108.
Pandey RR, Mondal T, Mohammad F, Enroth S, Redrup L, Komorowski J, Nagano T, Mancini-Dinardo D, Kanduri C: Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. Mol Cell. 2008, 32 (2): 232-246. 10.1016/j.molcel.2008.08.022.
Babak T, Deveale B, Armour C, Raymond C, Cleary MA, Kooy van der D, Johnson JM, Lim LP: Global survey of genomic imprinting by transcriptome sequencing. Curr Biol. 2008, 18 (22): 1735-1741. 10.1016/j.cub.2008.09.044.
Tufarelli C, Stanley JA, Garrick D, Sharpe JA, Ayyub H, Wood WG, Higgs DR: Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease. Nat Genet. 2003, 34 (2): 157-165. 10.1038/ng1157.
Yu W, Gius D, Onyango P, Muldoon-Jacobs K, Karp J, Feinberg AP, Cui H: Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA. Nature. 2008, 451 (7175): 202-206. 10.1038/nature06468.
Borsani O, Zhu J, Verslues PE, Sunkar R, Zhu JK: Endogenous siRNAs derived from a pair of natural cis- antisense transcripts regulate salt tolerance in Arabidopsis. Cell. 2005, 123 (7): 1279-1291. 10.1016/j.cell.2005.11.035.
Jin H, Vacic V, Girke T, Lonardi S, Zhu JK: Small RNAs and the regulation of cis-natural antisense transcripts in Arabidopsis. BMC Mol Biol. 2008, 9: 6-10.1186/1471-2199-9-6.
Katiyar-Agarwal S, Gao S, Vivian-Smith A, Jin H: A novel class of bacteria-induced small RNAs in Arabidopsis. Genes Dev. 2007, 21 (23): 3123-3134. 10.1101/gad.1595107.
Katiyar-Agarwal S, Morgan R, Dahlbeck D, Borsani O, Villegas A, Zhu JK, Staskawicz BJ, Jin H: A pathogen-inducible endogenous siRNA in plant immunity. Proc Natl Acad Sci USA. 2006, 103 (47): 18002-18007. 10.1073/pnas.0608258103.
Lu C, Jeong DH, Kulkarni K, Pillay M, Nobuta K, German R, Thatcher SR, Maher C, Zhang L, Ware D, et al: Genome-wide analysis for discovery of rice microRNAs reveals natural antisense microRNAs (nat-miRNAs). Proc Natl Acad Sci USA. 2008, 105 (12): 4951-4956. 10.1073/pnas.0708743105.
Zhou X, Sunkar R, Jin H, Zhu JK, Zhang W: Genome-wide identification and analysis of small RNAs originated from natural antisense transcripts in Oryza sativa. Genome Res. 2009, 19 (1): 70-78. 10.1101/gr.084806.108.
Carlile M, Swan D, Jackson K, Preston-Fayers K, Ballester B, Flicek P, Werner A: Strand selective generation of endo-siRNAs from the Na/phosphate transporter gene Slc34a1 in murine tissues. Nucleic Acids Res. 2009, 37 (7): 2274-2282. 10.1093/nar/gkp088.
Okamura K, Balla S, Martin R, Liu N, Lai EC: Two distinctmechanisms generate endogenous siRNAs from bidirectional transcription in Drosophila melanogaster. Nat Struct Mol Biol. 2008, 15 (6): 581-590. 10.1038/nsmb.1438.
Watanabe T, Totoki Y, Toyoda A, Kaneda M, Kuramochi-Miyagawa S, Obata Y, Chiba H, Kohara Y, Kono T, Nakano T, et al: Endogenous siRNAs from naturally formed dsRNAs regulate transcripts in mouse oocytes. Nature. 2008, 453: 539-543. 10.1038/nature06908.
Zhang Y, Liu XS, Liu QR, Wei L: Genome-wide in silico identification and analysis of cis natural antisense transcripts (cis-NATs) in ten species. Nucleic Acids Res. 2006, 34 (12): 3465-3475. 10.1093/nar/gkl473.
Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, Stern D, Tammana H, Helt G, et al: Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science. 2005, 308 (5725): 1149-1154. 10.1126/science.1108625.
Steinhoff C, Vingron M: Normalization and quantification of differential expression in gene expression microarrays. Brief Bioinform. 2006, 7 (2): 166-177. 10.1093/bib/bbl002.
Nepveu A, Marcu KB: Intragenic pausing and anti-sense transcription within the murine c-myc locus. EMBO J. 1986, 5 (11): 2859-2865.
Coker RK, Laurent GJ, Dabbagh K, Dawson J, McAnulty RJ: A novel transforming growth factor beta2 antisense transcript in mammalian lung. Biochem J. 1998, 332 (Pt 2): 297-301.
Kiyosawa H, Mise N, Iwase S, Hayashizaki Y, Abe K: Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. Genome Res. 2005, 15 (4): 463-474. 10.1101/gr.3155905.
David L, Huber W, Granovskaia M, Toedling J, Palm CJ, Bofkin L, Jones T, Davis RW, Steinmetz LM: A high-resolution map of transcription in the yeast genome. Proc Natl Acad Sci USA. 2006, 103 (14): 5320-5325. 10.1073/pnas.0601091103.
Numata K, Okada Y, Saito R, Kiyosawa H, Kanai A, Tomita M: Comparative analysis of cis- encoded antisense RNAs in eukaryotes. Gene. 2007, 392 (1–2): 134-141. 10.1016/j.gene.2006.12.005.
Sun M, Hurst LD, Carmichael GG, Chen J: Evidence for a preferential targeting of 3'-UTRs by cis- encoded natural antisense transcripts. Nucleic Acids Res. 2005, 33 (17): 5533-5543. 10.1093/nar/gki852.
Imamura T, Yamamoto S, Ohgane J, Hattori N, Tanaka S, Shiota K: Non-coding RNA directed DNA demethylation of Sphk1 CpG island. Biochem Biophys Res Commun. 2004, 322 (2): 593-600. 10.1016/j.bbrc.2004.07.159.
Dallosso AR, Hancock AL, Malik S, Salpekar A, King-Underwood L, Pritchard-Jones K, Peters J, Moorwood K, Ward A, Malik KT, et al: Alternately spliced WT1 antisense transcripts interact with WT1 sense RNA and show epigenetic and splicing defects in cancer. RNA. 2007, 13 (12): 2287-2299. 10.1261/rna.562907.
Ohhata T, Hoki Y, Sasaki H, Sado T: Crucial role ofantisense transcription across the Xist promoter in Tsix-mediated Xist chromatin modification. Development. 2008, 135 (2): 227-235. 10.1242/dev.008490.
Engstrom PG, Suzuki H, Ninomiya N, Akalin A, Sessa L, Lavorgna G, Brozzi A, Luzi L, Tan SL, Yang L, et al: Complex loci in human and mouse genomes. PLoS Genet. 2006, 2 (4): e47-10.1371/journal.pgen.0020047.
Galante PA, Vidal DO, de Souza JE, Camargo AA, de Souza SJ: Sense-antisense pairs in mammals: functional and evolutionary considerations. Genome Biol. 2007, 8 (3): R40-10.1186/gb-2007-8-3-r40.
Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al: The transcriptional landscape of the mammalian genome. Science. 2005, 309 (5740): 1559-1563. 10.1126/science.1112014.
Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, et al: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007, 447 (7146): 799-816. 10.1038/nature05874.
Svingen T, Beverdam A, Verma P, Wilhelm D, Koopman P: Aard is specifically up-regulated in Sertoli cells during mouse testis differentiation. Int J Dev Biol. 2007, 51 (3): 255-258. 10.1387/ijdb.062219ts.
van Nie R, Hilgers J: Genetic analysis of mammary tumor induction and expression of mammary tumor virus antigen in hormone-treated ovariectomized GR mice. J Natl Cancer Inst. 1976, 56 (1): 27-32.
Werner A, Schmutzler G, Carlile M, Miles CG, Peters H: Expression profiling of antisense transcripts on DNA arrays. Physiol Genomics. 2007, 28 (3): 294-300.
Oeder S, Mages J, Flicek P, Lang R: Uncovering information on expression of natural antisense transcripts in Affymetrix MOE430 datasets. BMC Genomics. 2007, 8: 200-10.1186/1471-2164-8-200.
Vallon-Christersson J, Staaf J, Kvist A, Medstrand P, Borg A, Rovira C: Non-coding antisense transcription detected by conventional and single-stranded cDNA microarray. BMC Genomics. 2007, 8: 295-10.1186/1471-2164-8-295.
Gyorffy A, Surowiak P, Tulassay Z, Gyorffy B: Highly expressed genes are associated with inverse antisense transcription in mouse. J Genet. 2007, 86 (2): 103-109. 10.1007/s12041-007-0015-x.
Ge X, Rubinstein WS, Jung YC, Wu Q: Genome-wide analysis of antisense transcription with Affymetrix exon array. BMC Genomics. 2008, 9: 27-10.1186/1471-2164-9-27.
Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009, 10 (1): 57-63. 10.1038/nrg2484.
Wernersson R, Nielsen HB: OligoWiz 2.0-integrating sequence feature annotation into the design of microarray probes. Nucleic Acids Res. 2005, W611-615. 10.1093/nar/gki399. 33 Web Server
Yanai I, Benjamin H, Shmoish M, Chalifa-Caspi V, Shklar M, Ophir R, Bar-Even A, Horn-Saban S, Safran M, Domany E, et al: Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics. 2005, 21 (5): 650-659. 10.1093/bioinformatics/bti042.
National Center for Biotechnology Information Gene Expression Omnibus. [http://www.ncbi.nlm.nih.gov/geo/]
Antisense Viewer. [http://www.brc.riken.jp/archives/Kiyosawa/BMC_Genomics09/]
Okada Y, Tashiro C, Numata K, Watanabe K, Nakaoka H, Yamamoto N, Okubo K, Ikeda R, Saito R, Kanai A, et al: Comparative expression analysis uncovers novel features of endogenous antisense transcription. Hum Mol Genet. 2008, 17 (11): 1631-1640. 10.1093/hmg/ddn051.
Ota T, Suzuki Y, Nishikawa T, Otsuki T, Sugiyama T, Irie R, Wakamatsu A, Hayashi K, Sato H, Nagai K, et al: Complete sequencing and characterization of 21,243 full-length human cDNAs. Nat Genet. 2004, 36: 40-45. 10.1038/ng1285.
Pruitt KD, Tatusova T, Maglott DR: NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007, D61-65. 10.1093/nar/gkl842. 35 Database
Wheeler DL, Church DM, Federhen S, Lash AE, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Tatusova TA, et al: Database resources of the National Center for Biotechnology. Nucleic Acids Res. 2003, 31 (1): 28-33. 10.1093/nar/gkg033.
Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W: Human-mouse alignments with BLASTZ. Genome Res. 2003, 13 (1): 103-107. 10.1101/gr.809403.
BioMart Project. [http://www.biomart.org/]
Bellve AR, Cavicchia JC, Millette CF, O'Brien DA, Bhatnagar YM, Dym M: Spermatogenic cells of the prepuberal mouse. Isolation and morphological characterization. J Cell Biol. 1977, 74 (1): 68-85. 10.1083/jcb.74.1.68.
Romrell LJ, Bellve AR, Fawcett DW: Separation of mouse spermatogenic cells by sedimentation velocity. A morphological characterization. Dev Biol. 1976, 49 (1): 119-131. 10.1016/0012-1606(76)90262-1.
The authors acknowledge Yutaka Watanabe and Takahiro Doi (RIKEN BRC) for their helpful discussions, Hidemasa Kato (Saitama Medical University) for critical reading of the manuscript, Shinichi Kashiwabara (Tsukuba University) for fractionation of the testis germ cells, and Toutai Mitsuyama (Computational Biology Research Center, Advanced Industrial Science and Technology, Japan) and Kiyoshi Asai (Tokyo University) for bioinformatic support. We also thank Naoto Kaneko (Tsukuba University), Kouichi Tatsuguchi and Yukiaki Kikuta (C's Lab Co. Ltd), members of Genostaff Inc., and staff at Hokkaido System Science Co. Ltd for technical and experimental support. This work was supported in part by grants from the Non-coding RNA Project by the New Energy and Industrial Technology Development Organization (NEDO) of Japan; and by a Research Fellowship of the Japan Society for the Promotion of Science (JSPS) for Young Scientists to K.N.
KN wrote the manuscript, with editing by RS, AK, KA, and HK Microarray design and bioinformatics analyses were performed by KN, YOs, and YOk GRS/A mice were prepared and dissected by NH. HN and NY developed the original viewer for the expression data. KW performed the microarray experiments, and KO performed the in-situ hybridization experiments. CK performed quantitative RT-PCR analysis. HK organized and directed the project.
Electronic supplementary material
Additional file 1: Signal intensities from AFAS probes for Tsix. AFAS probes designed for Tsix, which reflects the abundance of Xist RNA, detected expression in the 11 adult mouse tissues (mixed males and females), but not in the testis. (PDF 215 KB)
Additional file 2: Antisense expression of mouse imprinted genes. AFAS probes for several imprinted genes (Igf2r, Kcnq1, Gnas, Dio3, and Ube3a), which are known to give rise to antisense transcripts, gave prominent signals. (PDF 123 KB)
Additional file 3: Numbers of valid probes in adult mouse tissue profiling. A significantly higher number of AFAS probes than sense probes detected transcripts only within random-primed samples, but not among the oligo-dT primed targets. (PDF 83 KB)
Additional file 4: Highest signal intensities from expression profiling of the 12 normal adult tissues. Probes of 120 genes gave signals with a higher than average intensity according to inter-array normalization. (PDF 65 KB)
Additional file 5: Expression analyses of sense and antisense transcripts of Acaa1b. Quantitative RT-PCR, ISH and microarray analyses were able to detect this transcript within the testis and kidney. (PDF 2 MB)
Additional file 6: List of genes for which expression of the antisense transcript and sense transcript markedly changed in tumors. Forty-eight were up-regulated and nine were down-regulated within tumor regions, compared with in normal regions. (PDF 34 KB)
Additional file 8: Selection of target regions for microarray probe design. Target region selection for the probe design is summarized. (PDF 92 KB)
Additional file 9: List of primers for PCR amplification of cDNA fragments to generate probes for Northern blot analysis and in situ hybridization. Primers to amplify probes specific for sense and antisense of Acaa1b, Aard, and Thbd are listed. (PDF 41 KB)
Additional file 11: Primers for real-time quantitative RT-PCR. Primers for real-time quantitative RT-PCR (Gapdh and Acaa1b-AS) are listed. (PDF 28 KB)
Authors’ original submitted files for images
About this article
Cite this article
Numata, K., Osada, Y., Okada, Y. et al. Identification of novel endogenous antisense transcripts by DNA microarray analysis targeting complementary strand of annotated genes. BMC Genomics 10, 392 (2009). https://doi.org/10.1186/1471-2164-10-392