Intron retention and transcript chimerism conserved across mammals: Ly6g5b and Csnk2b-Ly6g5b as examples
© Hernández-Torres et al.; licensee BioMed Central Ltd. 2013
Received: 24 July 2012
Accepted: 13 March 2013
Published: 22 March 2013
Alternative splicing (AS) is a major mechanism for modulating gene expression of an organism, allowing the synthesis of several structurally and functionally distinct mRNAs and protein isoforms from a unique gene. Related to AS is the Transcription Induced Chimerism (TIC) or Tandem Chimerism, by which chimeric RNAs between adjacent genes can be found, increasing combinatorial complexity of the proteome. The Ly6g5b gene presents particular behaviours in its expression, involving an intron retention event and being capable to form RNA chimera transcripts with the upstream gene Csnk2b. We wanted to characterise these events more deeply in four tissues in six different mammals and analyse their protein products.
While canonical Csnk2b isoform was widely expressed, Ly6g5b canonical isoform was less ubiquitous, although the Ly6g5b first intron retained transcript was present in all the tissues and species analysed. Csnk2b-Ly6g5b chimeras were present in all the samples analysed, but with restricted expression patterns. Some of these chimeric transcripts maintained correct structural domains from Csnk2b and Ly6g5b. Moreover, we found Csnk2b, Ly6g5b, and Csnk2b-Ly6g5b transcripts that present exon skipping, alternative 5' and 3' splice site and intron retention events. These would generate truncated or aberrant proteins whose role remains unknown. Some chimeric transcripts would encode CSNK2B proteins with an altered C-terminus, which could affect its biological function broadening its substrate specificity. Over-expression of human CSNK2B, LY6G5B, and CSNK2B-LY6G5B proteins, show different patterns of post-translational modifications and cell distribution.
Ly6g5b intron retention and Csnk2b-Ly6g5b transcript chimerism are broadly distributed in tissues of different mammals.
To date, a high number of eukaryotic genomes have been sequenced. Surprisingly, it is interesting to observe that Homo sapiens and Caenorhabditis elegans genomes contain a similar number of protein-coding genes (~21.000), according to the Ensembl (http://www.ensembl.org/) database. Initially, these findings disconcerted researchers who thought the number of genes should be correlated with developmental and physiological complexity, and made them realize that other mechanisms should be involved in this evolutionary variety. Alternative splicing (AS) is a major mechanism for modulating gene expression of an organism, and enables a single gene to increase its expression capacity, allowing the synthesis of several structurally and functionally distinct mRNAs and protein isoforms from a unique gene (For reviews see [1–3]). This mechanism, which was initially described in viruses [4–6], is now known to affect 95% of all human genes  and has been proposed as a primary driver of the evolution of phenotypic complexity in mammals [8–10]. The human Major Histocompatibility Complex (MHC) is located on chromosome 6, and is ~4 Mb in length. It is composed by three regions, the class I and class II regions flanking the central class III region. The class III region is ~0.9 Mb in length and contains 62–64 genes and 2–3 pseudogenes, depending on the haplotype [11, 12]. Previously, our group precisely defined the AS patterns of a five gene cluster from the Lymphocyte antigen-6 (LY-6) superfamily  and characterised the expression of the corresponding proteins  in human and mouse. LY-6 superfamily members are cysteine-rich, generally GPI-anchored, cell surface proteins, which have definite or putative immune-related roles . Among these LY-6 MHC class III region genes, Ly6g5b showed a particular behaviour in the regulation of its expression [13, 16], involving an intron retention event in human and mouse, the rarest form of alternative splicing found in metazoan species . The intron retained is the first one after the initial exon and interrupts the open reading frame (ORF) just after the signal peptide introducing a premature stop codon (PSC). The presence of a PSC at this position should cause this intron-retained transcript to undergo Nonsense Mediated Decay (NMD) [18, 19]. However, this transcript seemed to escape NMD and was more abundant than the correctly spliced mRNA [13, 16]. In addition, findings in our laboratory showed the presence of LY6G5B gene exons in transcripts derived from the upstream gene CSNK2B, which encodes the Casein Kinase II β subunit, a ubiquitous protein kinase which regulates metabolic pathways, signal transduction, transcription, translation, and replication [20, 21]. The Transcription Induced Chimerism (TIC) or Tandem Chimerism, is a phenomenon whose mechanism still remains largely unknown, although it is being promoted as a novel way to increase combinatorial complexity of the proteome [22, 23]. At least 4%-5% of the tandem gene pairs in the human genome can be eventually transcribed into a single RNA sequence encoding a putative chimeric protein  but recent bioinformatic analyses, partially supported by experimental data, show that this phenomenon could be quite frequent [24, 25]. Recently, it has been showed that these chimeras significantly exploit signal peptides and transmembrane domains, which could alter the cellular localisation of cognate proteins, and that chimeric RNAs are more tissue specific than non-chimeric transcripts . Thus, this novel mechanism directly related to AS could have an important role in evolution divergence. In this regard, the majority of these studies give relevance to this phenomenon in Homo sapiens, but detailed comparative analyses among different species are, to our knowledge, not described. Here, we have deeply characterised the expression of Ly6g5b and Csnk2b transcripts independently and of the Csnk2b-Ly6g5b chimeric transcripts in four defined tissues among six different mammals. We conclude that Ly6g5b intron retention and Csnk2b-Ly6g5b chimerism is present in the tissues of the analysed mammalian group. In addition, we have made a comparative analysis of human CSNK2B, LY6G5B, and CSNK2B-LY6G5B protein expressions.
Canonical Ly6g5b ORF orthologue sequences from Homo sapiens (NM_021221), Macaca mulata (XR_014070), Sus scrofa (XM_001926307), Bos taurus (XM_585827), Rattus norvegicus (NM_001001934) and Mus musculus (NM_148939) were compared (Figure 1B). Although some differences in amino acid sequence can be detected, a LY-6 protein domain conservation in these Ly6g5b orthologues is clearly present (Figure 1B). This domain is composed of ~80 amino acids and is characterised by a conserved pattern of eight to ten cysteine residues that have a defined disulfide-bounding pattern . Through RT-PCR analysis, we found four different transcripts for Ly6g5b in Homo sapiens, two in Macaca mulata, three in Sus scrofa, three in Bos taurus, four in Rattus norvegicus and two in Mus musculus (Figure 2 and Additional file 1). The majority of these transcripts were not available on databases; even the canonical sequences (see Additional file 2: Table S1). Curiously, the presence of the canonical Ly6g5b transcript was only detected in three of the analysed species (Homo sapiens, Rattus norvegicus and Mus musculus). Similarly to what happened for Csnk2b, different transcript variants are generated for Ly6g5b through AS. These transcripts present a remarkable specificity among the analysed tissues and species, and assuming that they could be translated into proteins starting from the first canonical start codon, only truncated or aberrant proteins could be generated by them. Nevertheless, there is an interesting feature that should be stressed, the retention of the first intron in Ly6g5b transcripts, giving rise to a particular isoform (exon 1, intron 1, exon 2 and exon 3) that is present in all the tissues and analysed species (Figure 2), indicating conservation, and presenting the highest expression levels (data not shown). This isoform contains a PSC after the canonical start codon, in the middle of the retained intron, and therefore should be degraded through control mechanisms like NMD [18, 19]. However this seems not to be the case, as we have previously described in human .
Csnk2b-Ly6g5bChimeric transcript analysis
It is interesting to note that several (23/35) chimeric transcripts could encode Csnk2b truncated proteins (7/35) or with modifications in their C-terminus (16/35). The CSNK2B C-terminal part is involved in homodimerization and binding to CSNK2A subunit (see Figure 1) . Some of these detected chimeric transcripts are generated by replacing the canonical sequence of exon 7 by sequences encoded by total or partial exon 2 or 3 of Ly6g5b but not corresponding to LY-6 amino-acid sequences due to changes in the reading frames. These are chimeras HsCsnk2b-Ly6g5b-1181, MamCsnk2b-Ly6g5b-1218, SsCsnk2b-Ly6g5b-728, MumCsnk2b-Ly6g5b-979 and MumCsnk2b-Ly6g5b-1108, with variable tissue distribution, except MumCsnk2b-Ly6g5b-979 that is expressed in the four tissues. Other transcripts are altered on Csnk2b exon 6 or 7, lacking the zinc-finger domain, α6 and C-terminal regions of CSNK2B (see Figure 1), such as MamCsnk2b-Ly6g5b-1141 (only expressed in brain) and BtCsnk2b-Ly6g5b-1049 (expressed in brain and lung) which maintain the same exon-intron structure (lack of Csnk2b exon 6 and Ly6g6b exon 1), and SsCsnk2b-Ly6g5b-538 (expressed in the four tissues) (see Figure 2). In addition, there are some chimeric transcripts that would encode a complete CSNK2B protein considering that they contain all exons (1-7) of Csnk2b including the stop codon and in which the Ly6g5b nucleotide sequences will act as 3´ UTRs. These are MamCsnk2b-Ly6g5b-1331, MamCsnk2b-Ly6g5b-2338, BtCsnk2b-Ly6g5b-1239, RnCsnk2b-Ly6g5b-2050, RnCsnk2b-Ly6g5b-2275 and RnCsnk2b-Ly6g5b-2531. They present variable tissue distribution and exon-intron structure (see Figure 2).
Csnk2b, Ly6g5b and Csnk2b -Ly6g5bChimera protein analysis
On the other hand, anti-PentaHis antibodies showed similar pattern for CSNK2B protein to that showed by anti-V5 antibodies, but no signal of LY6G5B protein or CSNK2B-LY6G5B protein was detected (Figure 4B). This lack of detection on LY6G5B could be due to a C-terminal processing cleaving also the Tag. This cleavage signal sequence would also be present in CSNK2B-LY6G5B protein and for that also prone to be processed.
Cellular CSNK2B protein distribution has been described before in cytoplasm, nuclei, and other organelles . Our results, through immunoflourescent confocal microscopy experiments by using anti-V5 antibodies under permeabilised conditions, showed mainly cytoplasmic cellular CSNK2B protein distribution, in agreement with previous data . Under the same conditions, LY6G5B showed a protein distribution clearly related with ER pattern, and not extracellular staining, as previously described . Here, for the first time, we show CSNK2B-LY6G5B protein distribution, which is quite similar to the one presented by CSNK2B and which clearly differs from the one of the LY6G5B protein.
Although the LY6G5B protein belongs to a GPI-anchored protein family, it has not been found to be located on the outside of the cellular membrane . In addition, it is known that CSNK2B can be exported to the external side of the cellular membrane , and CSNK2B-LY6G5B presents CSNK2B domains needed for its exportation to cell surface and/or its excretion, as well as a mature Ly-6 domain (Figure 3). To know whether CSNK2B-LY6G5B could be on the cell surface we carried out two experimental strategies. The first one consisted of immunoflourescent confocal microscopy experiments under non-permeabilised conditions. Our results showed the absence of CSNK2B, LY6G5B as well as CSNK2B-LY6G5B chimeric proteins in the cell surface (Figure 4C), in COS 7 cells. The second one was to test CSNK2B, LY6G5B and/or CSNK2B-LY6G5B proteins presence in supernatant by western-blot experiment. It was not possible to observe expression of these proteins in the supernatant, either when loaded directly on a gel or when TCA-precipitated (data not shown).
After the challenge supposed by the human genome sequencing, as well as of other organisms with medical or commercial interest, the determination of the transcriptome of these species is a prerequisite for fully understanding not only their molecular biology, but also for translating this information to technical applications in medicine, pharmacology and biotechnology. In this sense, high throughput technologies supported by systematic cDNA libraries sequencing has been the main approach used for transcriptome characterisation. Thus, sequencing of random transcript cDNA clones that results in short partial sequences, known as expressed sequence tags  (EST) or, more recently, methods for full length isolation and sequencing of random clones from cDNA libraries have been used [33, 34]. In addition, in the last few years massive sequencing technologies have also been developed for transcriptome analysis [7, 24, 35]. All these high throughput techniques present the advantage to generate a considerable volume of information in a relative short time, but these techniques seem to be inefficient for discovering relative rare transcripts. This affirmation is supported by recent data that suggest the existence of a wealth of transcripts which had, so far, escaped detection through systematic sequencing of cDNA libraries . In a recent published work  combining EST and RNAseq data, it has been showed that chimeras are lowly expressed transcripts. Thus, here we present that nested RT-PCR shows to be an efficient tool to discover a number of transcripts expressed from a concrete locus, not only those highly expressed but also those expressed at lower levels. In fact, 62 of 76 transcripts detected in our experiments had never been described before. This supposes 81.5% of novel sequences, showing how powerful this technique is in order to detect transcripts from a concrete locus. In this respect, our results show that current gene and transcript annotation sets might cover only a small fraction of the total transcriptional output of the human and other organism described genomes, and in this sense, a major enforce should be taken in order to detect more transcripts of a particular organism.
A detailed human and mouse Ly6g5b transcript analysis has been previously described by our group [13, 16]. However, here we have extended this comparative analysis among different mammalian species, showing that Ly6g5b first intron retention is a ubiquitous and important characteristic conserved in mammals. Its apparent capacity to escape NMD together with its conservation in the mammalian group studied points to an important role for this non-coding RNA, which should be investigated. So, we could conclude that Ly6g5b gene presents a double expression pattern. The first one, quite similar among tissues and species, is constituted by Ly6g5b transcripts with first intron retention event. The second one seems to be tissue and species specific and is constituted by canonical Ly6g5b transcripts with a complete Ly-6 ORF as well as aberrant and non conserved transcripts, with no common pattern distribution.
For Csnk2b gene expression we can also describe a double expression pattern. The first one constituted by the canonical isoform, which is quite similar among tissues and species, and the second one which seems to be tissue and species specific and is constituted by aberrant and non conserved transcripts, with no common pattern distribution.
CSNK2B-LY6G5B human chimerism was initially described by Calvanesse et al (2008) by using different human cell lines . However, here we show the first comparative analysis of this TIC event using RNA from different mammalian tissues and species. Our results show that, far to be a human characteristic, Csnk2b-Ly6g5b chimerism is widely conserved in mammals. Its conservation among all the species analysed in this study shows how Csnk2b-Ly6g5b chimerism is not a trivial event. The majority of them lack the last exon of the upstream gene as well as the first exon of the downstream gene which is consistent with previous reported data [22, 23]. This eliminates the stop codon as well as the molecular targets present in the 3’ UTR region of the transcripts of the upstream gene. We also agree with these authors that run-off is the most likely mechanism involved in the origin of TIC, since some chimeric transcripts detected in our study maintain the intergenic region. Other mechanisms proposed for generating chimeric transcripts, like trans-splicing, are not likely to maintain these intergenic regions.
In addition to the canonical Csnk2b, Ly6g5 and Csnk2b-Ly6g5b transcripts with a coherent ORF, other transcripts detected in our study present exon skipping as well as intron retention events which allow to generate, assuming that all these transcripts could be translated into protein by using canonical start codon, truncated or aberrant proteins. Others are non-coding RNAs. The observation that there are tens of thousands of non-coding RNA (ncRNA) expressed in mammals, and that most of the genome is transcribed, confronts and contradicts the traditional protein-centric view of genetic information and genome organisation , . Thus, there are two opposing alternatives either the bulk of the transcription which does not yield mRNAs is ‘transcriptional noise’ and/or the residue of evolutionary baggage retained or accumulated within genes, or this transcription comprises another level of expression and transaction of RNA information that is important to the evolution and developmental ontogeny of the higher organisms . If one assumes that all this is transcriptional noise and that all these transcripts are the result of transcriptional machinery mistakes while it is working, they should not be distributed in a specific manner.
Some authors defend that chimerism might generate bi-functional proteins having properties from both original proteins . Through different analyses of our results, we could identify chimeras which maintain the ORFs of Csnk2b and Ly6g5b susceptible to form bi-functional chimeric proteins in Homo sapiens, Macaca mulata, and Sus scrofa. The fact that these were not found in cow, rat and mouse does not indicate functional chimera absence in these species, since they could be present in other tissues not analysed in this study. These hypothetical new chimeric proteins all carry N-terminus domains from Csnk2b involved in structural aspects that are required for Csnk2b exportation to the cellular surface  and/or its regulation , as well as Ly-6 domain amino acid sequence [13, 14] at their C-terminus. However, the chimera “bi-functional” protein will affect the juxta-dimer interface region  containing the zinc-finger involved in the homo-dimerisation, as well as all the C-terminal domain involved in the interaction with the CSNK2A subunits and the crucial last 20 amino-acids also involved in the homo-dimerisation. In addition, the Ly-6 domain, with 10 Cysteins, could not be folded as such due to the intracellular localisation. These two facts indicate that the “bi-functional” chimera would not be formed, and would only have one function: the one of CSNK2B, although possibly binding to other kinase different to CSNK2A due to the alterations produced on the C-terminal domain commented above. Other chimeras would only be affected from exon 7, containing then the juxta-dimer interface region but differing at the very end C-terminal region and not containing the Ly-6 domain sequence due to frameshifts, and probably only affecting the binding to CSNK2A. It has been proposed that CSNK2B might bind other kinases such as Ras-1 and Mos to modify their catalytic affinity in a CK2-independent fashion. The alternative C-terminal ends, generated by the chimeric transcripts, could increment the binding repertoire of CSNK2B to other kinases or non-kinase proteins converting CSNK2B in even a wider “wild-card” regulator subunit than previously proposed .
In addition, we have found chimeric transcripts that would encode a complete CSNK2B protein, but with different 3´UTR due to the Ly6g5b sequence, in Macaca mulata, Bos taurus and Rattus norvegicus. These transcripts could have different mRNA localisations or stabilities which could have altered protein functional implications [40, 41]. It has been described that some 3´UTR contain “localization elements” or “zip codes” which target mRNAs to specific subcellular sites.
Alternative splicing has an important role in Csnk2b and Ly6g5b gene expression. Ly6g5b intron retention and Csnk2b-Ly6g5b chimera transcripts are present in many tissues of different mammals. The data and analysis we have performed should serve as a valuable resource for further characterizing the possible functional role of TIC and the mechanisms that affect it.
Csnk2b and Ly6g5b orthologue ORF sequences Homo sapiens, Macaca mulata, Sus scrofa, Bos taurus, Rattus norvegicus and Mus musculus were obtained from the National Centre for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/BLAST/) . Multiple alignments of sequences were performed with ClustalW2 software (http://www.ebi.ac.uk/Tools/msa/clustalw2/) [43, 44]. Protein sequences analyses were carried out using Pfam (http://www.sanger.ac.uk/resources/databases/pfam.html) , SMART (http://smart.embl-heidelberg.de/) and InterProScan (http://www.ebi.ac.uk/InterProScan/) [46–49] databases. The sequencing results were evaluated using the BLAST algorithm at the NCBI web page and the MultAlin alignment software (http://bioinfo.genotoul.fr/multalin/multalin.html) . Primers were designed using the Primer3 software (http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi).
mRNA extraction and retrotranscription
Homo sapiens, Macaca mulata, Sus scrofa, Bos taurus, Rattus norvegicus and Mus musculus brain, kidney, liver and lung tissue total RNAs were obtained from BioChain® (USA) http://www.biochain.com through one of their Europe distributor "AMS" http://www.amsbio.com (UK). One μg of total RNA from each tissue was used for oligo-dT primed cDNA synthesis which was performed using the ImProm-II(TM) Reverse Transcription System (Promega) in a 20 μl reaction volume following the manufacturer's instructions.
Expression constructs primers
Long primers for V5 epitope cloning in pcDNA3.1/Zeo(-)
Primers for CSNK2B, LY6G5B and CSNK2B-LY6G5B Chimera ORFs cloning in pcDNA3.1/V5-His-TOPO
Primers for final construct in pcDNA3.1/Zeo(-)
Transfections and western blot analyses
Transfections of COS-7 cells were carried in 24-well plates, with 0.5 μg of plasmid per well, by using TransIT® COS Transfection kit (Mirus-BioNova) following the manufacturer´s instructions. For Western Blot analyses, two days after transfection, cells were harvested in Laemmli´s SDS sample buffer. Samples were resolved on SDS 12% (w/v) polyacrylamide gels and the proteins were transferred onto nitrocellulose membranes (Protran). After blocking the membrane 30 minutes in PBST (PBS-0,05% (v/v) Tween) containing 5% (w/v) skimmed milk powder (blocking solution), the blot was first incubated for 90 min either in a 1/2000 dilution of mouse anti-Hisx5 (Qiagen), a 1/10000 dilution of mouse anti-V5 (Sigma) or a 1/5000 dilution of mouse anti-GAPDH monoclonal antibodies in blocking solution, washed three times for 10 min with PBST, and then incubated for 30 min in a 1:5000 dilution of peroxidase-conjugated anti-mouse IgG antibody (Sigma) in PBST. Membrane was washed twice for 10 min with PBST followed by a final wash in PBS for 10 min. Bound antibodies were detected by ECL Plus Western Blotting (Amershan). Secreted proteins were TCA-precipitated from the culture media after the addition of 1 volume of TCA to 4 volumes of media, incubated 10 min at 4°C, washed with 200 μl cold acetone, resuspended in Laemmli´s SDS sample buffer, and analysed by SDS-PAGE followed by Western blotting, as described above.
Transfections were performed as described above. Two days after transfection, cells were fixed for 15 min at room temperature with 4% (v/v) paraformaldehyde in PBS. Following fixation, cells were washed three times for 15 min with PBS, treated with ammonium chloride 50 mM for 30 min and permeabilised for 20 min with PBS containing 0.1% (v/v) Triton X-100 (wash solution). Cells were blocked for 30 min in wash solution with 5% (w/v) BSA (Bovine Serum Albumin, blocking solution) and then incubated for 1 h with 1:400 dilution of mouse anti-V5 monoclonal antibody (Sigma) in blocking solution. After that, cells were washed three times for 15 min with wash solution and then incubated with a secondary antibody coupled to Alexa 488 (anti mouse IgG, Invitrogen) in a 1:500 dilution for 1 h in blocking solution. Cells were washed three times for 15 min with wash solution and nucleic acids were stained with To-Pro3 in a 1:500 dilution in blocking solution. Finally, cells were washed three times for 15 min with wash solution, mounted with Prolong Gold Antifade Reagent (Molecular Probes) and photographed with a Leika confocal microscope. Non-permeabilised immunofluorescence experiments were carried out under the same conditions described above but avoiding Triton X-100 addition.
Reverse transcription polymerase chain reaction
Major Histocompatibility Complex
Open reading frame
Premature stop codon
Nonsense mediated decay
Transcription induced chimerism
Expressed sequence tag
We are grateful to Fernando Carrasco from the Genomics Service at CBMSO for his technical expertise and advice. This work was supported by grants from the Ministerio de Educación y Ciencia (BFU2005-03683), Ministerio de Ciencia e Innovación (BFU 2008-03126, BFU2009-09117), Comunidad de Madrid (GR/SAL/0670/2004, and 200620 M078) and Fundación Ramón Areces. F Hernández-Torres was funded by Genoma España and A Rastrojo by a postgraduate fellowship (FPU) from the Ministerio de Educación y Ciencia. B Aguado held a Programa Ramón y Cajal (MEC) contract and an Amarouto (Comunidad Madrid-Fundación Severo Ochoa) contract. We acknowledge support of the publication fee by the CSIC Open Access Publication Support Initiative through its Unit of Information Resources for Research (URICI). The CBMSO receives an institutional grant from Fundación Ramón Areces.
- Villate O, Rastrojo A, Lopez-Diez R, Hernandez-Torres F, Aguado B: Differential splicing, disease and drug targets. Infect Disord Drug Targets. 2008, 8 (4): 241-251. 10.2174/187152608786734188.View ArticlePubMed
- Irimia M, Blencowe BJ: Alternative splicing: decoding an expansive regulatory layer. Curr Opin Cell Biol. 2012, 24 (3): 323-332. 10.1016/j.ceb.2012.03.005.View ArticlePubMed
- Kalsotra A, Cooper TA: Functional consequences of developmentally regulated alternative splicing. Nat Rev Genet. 2011, 12 (10): 715-729. 10.1038/nrg3052.PubMed CentralView ArticlePubMed
- Chow LT, Gelinas RE, Broker TR, Roberts RJ: An amazing sequence arrangement at the 5' ends of adenovirus 2 messenger RNA. Cell. 1977, 12 (1): 1-8. 10.1016/0092-8674(77)90180-5.View ArticlePubMed
- Gelinas RE, Roberts RJ: One predominant 5'-undecanucleotide in adenovirus 2 late messenger RNAs. Cell. 1977, 11 (3): 533-544. 10.1016/0092-8674(77)90071-X.View ArticlePubMed
- Berget SM, Moore C, Sharp PA: Spliced segments at the 5' terminus of adenovirus 2 late mRNA. Proc Natl Acad Sci USA. 1977, 74 (8): 3171-3175. 10.1073/pnas.74.8.3171.PubMed CentralView ArticlePubMed
- Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF: Schroth GP. 2008, Burge CB: Alternative isoform regulation in human tissue transcriptomes. Nature
- Johnson JM, Castle J, Garrett-Engele P, Kan Z, Loerch PM, Armour CD, Santos R, Schadt EE, Stoughton R, Shoemaker DD: Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays. Science. 2003, 302 (5653): 2141-2144. 10.1126/science.1090100.View ArticlePubMed
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038/35057062.View ArticlePubMed
- Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA: The sequence of the human genome. Science. 2001, 291 (5507): 1304-1351. 10.1126/science.1058040.View ArticlePubMed
- Xie T, Rowen L, Aguado B, Ahearn ME, Madan A, Qin S, Campbell RD, Hood L: Analysis of the gene-dense major histocompatibility complex class III region and its comparison to mouse. Genome Res. 2003, 13 (12): 2621-2636. 10.1101/gr.1736803.PubMed CentralView ArticlePubMed
- The MHC sequencing consortium: Complete sequence and gene map of a human major histocompatibility complex. Nature. 1999, 401 (6756): 921-923. 10.1038/44853.View Article
- Mallya M, Campbell RD, Aguado B: Transcriptional analysis of a novel cluster of LY-6 family members in the human and mouse major histocompatibility complex: five genes with many splice forms. Genomics. 2002, 80 (1): 113-123. 10.1006/geno.2002.6794.View ArticlePubMed
- Mallya M, Campbell RD, Aguado B: Characterization of the five novel Ly-6 superfamily members encoded in the MHC, and detection of cells expressing their potential ligands. Protein Sci. 2006, 15 (10): 2244-2256. 10.1110/ps.062242606.PubMed CentralView ArticlePubMed
- Stroncek DF, Caruccio L, Bettinotti M: CD177: A member of the Ly-6 gene superfamily involved with neutrophil proliferation and polycythemia vera. J Transl Med. 2004, 2 (1): 8-10.1186/1479-5876-2-8.PubMed CentralView ArticlePubMed
- Calvanese V, Mallya M, Campbell RD, Aguado B: Regulation of expression of two LY-6 family genes by intron retention and transcription induced chimerism. BMC Mol Biol. 2008, 9: 81-10.1186/1471-2199-9-81.PubMed CentralView ArticlePubMed
- Kim E, Magen A, Ast G: Different levels of alternative splicing among eukaryotes. Nucleic Acids Res. 2007, 35 (1): 125-131. 10.1093/nar/gkm529.PubMed CentralView ArticlePubMed
- Lejeune F, Maquat LE: Mechanistic links between nonsense-mediated mRNA decay and pre-mRNA splicing in mammalian cells. Curr Opin Cell Biol. 2005, 17 (3): 309-315. 10.1016/j.ceb.2005.03.002.View ArticlePubMed
- Conti E, Izaurralde E: Nonsense-mediated mRNA decay: molecular insights and mechanistic variations across species. Curr Opin Cell Biol. 2005, 17 (3): 316-325. 10.1016/j.ceb.2005.04.005.View ArticlePubMed
- Jakobi R, Voss H, Pyerin W: Human phosvitin/casein kinase type II. Molecular cloning and sequencing of full-length cDNA encoding subunit beta. Eur J Biochem. 1989, 183 (1): 227-233. 10.1111/j.1432-1033.1989.tb14917.x.View ArticlePubMed
- Rodriguez FA, Contreras C, Bolanos-Garcia V, Allende JE: Protein kinase CK2 as an ectokinase: the role of the regulatory CK2beta subunit. Proc Natl Acad Sci USA. 2008, 105 (15): 5693-5698. 10.1073/pnas.0802065105.PubMed CentralView ArticlePubMed
- Parra G, Reymond A, Dabbouseh N, Dermitzakis ET, Castelo R, Thomson TM, Antonarakis SE, Guigo R: Tandem chimerism as a means to increase protein complexity in the human genome. Genome Res. 2006, 16 (1): 37-44.PubMed CentralView ArticlePubMed
- Akiva P, Toporik A, Edelheit S, Peretz Y, Diber A, Shemesh R, Novik A, Sorek R: Transcription-mediated gene fusion in the human genome. Genome Res. 2006, 16 (1): 30-36.PubMed CentralView ArticlePubMed
- Nacu S, Yuan W, Kan Z, Bhatt D, Rivers CS, Stinson J, Peters BA, Modrusan Z, Jung K, Seshagiri S: Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples. BMC Med Genomics. 2011, 4: 11-10.1186/1755-8794-4-11.PubMed CentralView ArticlePubMed
- Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, Alioto T, Manzano C, Chrast J: Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions. Genome Res. 2007, 17 (6): 746-759. 10.1101/gr.5660607.PubMed CentralView ArticlePubMed
- Frenkel-Morgenstern M, Lacroix V, Ezkurdia I, Levin Y, Gabashvili A, Prilusky J, Del Pozo A, Tress M, Johnson R, Guigo R: Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts. Genome Res. 2012, 22 (7): 1231-1242. 10.1101/gr.130062.111.PubMed CentralView ArticlePubMed
- Bolanos-Garcia VM, Fernandez-Recio J, Allende JE, Blundell TL: Identifying interaction motifs in CK2beta–a ubiquitous kinase regulatory subunit. Trends Biochem Sci. 2006, 31 (12): 654-661. 10.1016/j.tibs.2006.10.005.View ArticlePubMed
- Kumar-Sinha C, Kalyana-Sundaram S, Chinnaiyan AM: SLC45A3-ELK4 chimera in prostate cancer: spotlight on cis-splicing. Cancer discovery. 2012, 2 (7): 582-585. 10.1158/2159-8290.CD-12-0212.PubMed CentralView ArticlePubMed
- Zhang Y, Gong M, Yuan H, Park HG, Frierson HF, Li H: Chimeric transcript generated by cis-splicing of adjacent genes regulates prostate cancer cell proliferation. Cancer discovery. 2012, 2 (7): 598-607. 10.1158/2159-8290.CD-12-0042.View ArticlePubMed
- Ackerman P, Glover CV, Osheroff N: Stimulation of casein kinase II by epidermal growth factor: relationship between the physiological activity of the kinase and the phosphorylation state of its beta subunit. Proc Natl Acad Sci USA. 1990, 87 (2): 821-825. 10.1073/pnas.87.2.821.PubMed CentralView ArticlePubMed
- Rodriguez F, Allende CC, Allende JE: Protein kinase casein kinase 2 holoenzyme produced ectopically in human cells can be exported to the external side of the cellular membrane. Proc Natl Acad Sci USA. 2005, 102 (13): 4718-4723. 10.1073/pnas.0501074102.PubMed CentralView ArticlePubMed
- Adams MD, Soares MB, Kerlavage AR, Fields C, Venter JC: Rapid cDNA sequencing (expressed sequence tags) from a directionally cloned human infant brain cDNA library. Nat Genet. 1993, 4 (4): 373-380. 10.1038/ng0893-373.View ArticlePubMed
- Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C: The transcriptional landscape of the mammalian genome. Science. 2005, 309 (5740): 1559-1563.View ArticlePubMed
- Kawai J, Shinagawa A, Shibata K, Yoshino M, Itoh M, Ishii Y, Arakawa T, Hara A, Fukunishi Y, Konno H: Functional annotation of a full-length mouse cDNA collection. Nature. 2001, 409 (6821): 685-690. 10.1038/35055500.View ArticlePubMed
- Mercer TR, Gerhardt DJ, Dinger ME, Crawford J, Trapnell C, Jeddeloh JA, Mattick JS, Rinn JL: Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Nat Biotechnol. 2012, 30 (1): 99-104.View Article
- Djebali S, Kapranov P, Foissac S, Lagarde J, Reymond A, Ucla C, Wyss C, Drenkow J, Dumais E, Murray RR: Efficient targeted transcript discovery via array-based normalization of RACE libraries. Nat Methods. 2008, 5 (7): 629-635. 10.1038/nmeth.1216.PubMed CentralView ArticlePubMed
- Ecker JR, Bickmore WA, Barroso I, Pritchard JK, Gilad Y, Segal E: Genomics: ENCODE explained. Nature. 2012, 489 (7414): 52-55. 10.1038/489052a.View ArticlePubMed
- Amaral PP, Clark MB, Gascoigne DK, Dinger ME, Mattick JS: lncRNAdb: a reference database for long noncoding RNAs. Nucleic Acids Res. 2011, 39 (Database issue): D146-D151.PubMed CentralView ArticlePubMed
- Mattick JS, Makunin IV: Non-coding RNA. Hum Mol Genet. 2006, 15 (Spec No 1): R17-R29.View ArticlePubMed
- Holt CE, Bullock SL: Subcellular mRNA localization in animal cells and why it matters. Science. 2009, 326 (5957): 1212-1216. 10.1126/science.1176488.PubMed CentralView ArticlePubMed
- Martin KC, Ephrussi A: mRNA localization: gene expression in the spatial dimension. Cell. 2009, 136 (4): 719-730. 10.1016/j.cell.2009.01.044.PubMed CentralView ArticlePubMed
- Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL: BLAST+: architecture and applications. BMC Bioinformatics. 2009, 10: 421-10.1186/1471-2105-10-421.PubMed CentralView ArticlePubMed
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23 (21): 2947-2948. 10.1093/bioinformatics/btm404.View ArticlePubMed
- Goujon M, McWilliam H, Li W, Valentin F, Squizzato S, Paern J, Lopez R: A new bioinformatics analysis tools framework at EMBL-EBI. Nucleic Acids Res. 2010, 38 (Web Server issue): W695-W699.PubMed CentralView ArticlePubMed
- Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K: The Pfam protein families database. Nucleic Acids Res. 2010, 38 (Database issue): D211-D222.PubMed CentralView ArticlePubMed
- Bucher P, Karplus K, Moeri N, Hofmann K: A flexible motif search technique based on generalized profiles. Comput Chem. 1996, 20 (1): 3-23. 10.1016/S0097-8485(96)80003-9.View ArticlePubMed
- Scordis P, Flower DR, Attwood TK: FingerPRINTScan: intelligent searching of the PRINTS motif database. Bioinformatics. 1999, 15 (10): 799-806. 10.1093/bioinformatics/15.10.799.View ArticlePubMed
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14 (9): 755-763. 10.1093/bioinformatics/14.9.755.View ArticlePubMed
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMed
- Corpet F: Multiple sequence alignment with hierarchical clustering. Nucleic Acids Res. 1988, 16 (22): 10881-10890. 10.1093/nar/16.22.10881.PubMed CentralView ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.