- Research article
- Open Access
A human 3′UTR clone collection to study post-transcriptional gene regulation
BMC Genomics volume 16, Article number: 1036 (2015)
3′untranslated regions (3′UTRs) are poorly understood portions of eukaryotic mRNAs essential for post-transcriptional gene regulation. Sequence elements in 3′UTRs can be target sites for regulatory molecules such as RNA binding proteins and microRNAs (miRNAs), and these interactions can exert significant control on gene networks. However, many such interactions remain uncharacterized due to a lack of high-throughput (HT) tools to study 3′UTR biology. HT cloning efforts such as the human ORFeome exemplify the potential benefits of genomic repositories for studying human disease, especially in relation to the discovery of biomarkers and targets for therapeutic agents. Currently there are no publicly available human 3′UTR libraries. To address this we have prepared the first version of the human 3′UTRome (h3′UTRome v1) library. The h3′UTRome is produced to a single high quality standard using the same recombinational cloning technology used for the human ORFeome, enabling universal operating methods and high throughput experimentation. The library is thoroughly sequenced and annotated with simple online access to information, and made publically available through gene repositories at low cost to all scientists with minimal restriction.
The first release of the h3′UTRome library comprises 1,461 human 3′UTRs cloned into Gateway® entry vectors, ready for downstream analyses. It contains 3′UTRs for 985 transcription factors, 156 kinases, 171 RNA binding proteins, and 186 other genes involved in gene regulation and in disease. We demonstrate the feasibility of the h3′UTRome library by screening a panel of 87 3′UTRs for targeting by two miRNAs: let-7c, which is implicated in tumorigenesis, and miR-221, which is implicated in atherosclerosis and heart disease. The panel is enriched with genes involved in the RAS signaling pathway, putative novel targets for the two miRNAs, as well as genes implicated in tumorigenesis and heart disease.
The h3′UTRome v1 library is a modular resource that can be utilized for high-throughput screens to identify regulatory interactions between trans-acting factors and 3′UTRs, Importantly, the library can be customized based on the specifications of the researcher, allowing the systematic study of human 3′UTR biology.
3′untranslated regions (3′UTRs) are the sequences located immediately downstream of from the STOP codon of mature mRNAs. Although historical attention focused on protein coding sequences and upstream regions, 3′UTRs have recently become subject to intense study because they are targets of a variety of regulatory molecules, including RNA binding proteins (RBPs) and small non-coding RNAs (ncRNAs), that recognize small cis-elements present in the 3′UTRs. These cis-elements play critical roles in deciding the fate of the mRNA via various mechanisms, including co-transcriptional processing, modulating protein translation, mRNA localization and trafficking, and mRNA degradation and stability . Disruption of these processes is known to affect diverse developmental and metabolic processes, and contributes to various diseases, including neurodegenerative diseases, diabetes, and cancer [2–5].
RBPs play a role in every aspect of mRNA biogenesis, such as stability, localization, translation and decay. The human transcriptome contains approximately ~400 proteins with distinguishable RNA binding domains , and their deregulation is linked to major neurodegenerative disorders, cancer, and muscular dystrophies. Compared to transcription factors, which generally bind highly specific linear DNA sequence elements, elements in 3′UTRs targeted by RBPs are generally more degenerate and difficult to identify bioinformatically because RNA is a single-stranded molecule and RBP binding is mostly dictated by local folding and polarity . Consequently, RBPs have the potential to bind to multiple elements in different 3′UTRs, leading to intricate, dynamic, and mostly unknown networks of RNA-protein interactions.
3′UTRs are also targeted by a class of post-transcriptional regulators known as microRNAs (miRNAs), which are short non-coding RNAs that bind to complementary sequences in the 3′UTRs of metazoans . Once bound, based on the degree of complementarity, miRNAs can induce either translational repression or mRNA degradation . MiRNAs canonically recognize targets in 3′UTRs via Watson-Crick base pairing, requiring complementarity with as few as six consecutive nucleotides between the 5′end of a mature miRNAs and the 3′UTR of a target transcript . However, recent evidence suggest that miRNAs do not require perfect complementarity with target 3′UTRs to induce functional translational repression, and non-canonical interactions are frequent . Because miRNA target elements are degenerate and small they are difficult to detect, thus a vast majority of biologically relevant miRNA targets are still unknown. Based on bioinformatic predictions of miRNA-binding sites in 3′UTRs, it has been proposed that each miRNA controls large networks of hundreds of mRNAs . However, recent analysis of the predictive performance of several of the most prominent prediction algorithms, such as TargetScan , PicTar  and DIANA-microT  report extremely high false negative rates [8, 13, 14]. While these algorithms are very useful for candidate gene approaches to identify miRNA targets, the extremely high error rates make high-throughput target detection challenging. Coupled with the absence of a publically available and comprehensive 3′UTR library, the field currently lacks tools to systematically study miRNA targets, which is the gold standard in miRNA biology.
Several genomic resources are currently available to systematically study gene expression and its regulation in humans. The human ORFeome for example, is a collection of over 12,000 human protein-coding genes cloned in modular vectors and optimized to study the dynamics of gene expression [15, 16]. The ORFeome has been used to characterize genome wide protein-protein interaction networks, leading to important discoveries relevant to human disease . HT resources such as this can significantly advance our understanding of gene functions in multicellular organisms. Unfortunately, such a standardized HT tool to detect and study regulatory elements in 3′UTRs are not available since 3′UTR sequences are not present in the ORFeome. Some individual 3′UTR clones are available commercially, but these products have sporadic coverage, are too expensive for HT studies, use only proprietary vectors and are not compatible with the ORFeome. Furthermore, endogenous full length 3′UTRs frequently undergo alternative processing in a tissue specific fashion , which limits the biological relevance of experiments that use truncated or partial 3′UTRs.
To overcome this limitation, a recent study used ~240,000 short RNA sequences containing all possible 9-base nucleotide permutations immobilized on microarrays to study the binding requirements of 205 human RBPs . Although this work and others highlights important binding properties of RBPs, they do not necessarily reflect biological settings, where accessory elements near binding sites that may cooperate with the RBPs targeting are not present.
Recently, our group experimented with the usage of a pilot human full length 3′UTR library to detect miRNA targets in 3′UTRs using a scalable dual-luciferase assay named Luminescent Identification of Functional Elements in 3′UTRs (3′LIFE) [8, 18]. Although we cloned and screened only ~300 query 3′UTRs, the proof of principle 3′LIFE screen was highly effective at the rapid and efficient discovery of many novel targets for two cancer relevant miRNAs, let-7c and miR-10b . This pilot screen demonstrated the value of such an unbiased HT approach, and supports the need for the development of a publically available genome wide 3′UTR library.
Furthermore, there is a critical need in the field for a high-quality and standardized human 3′UTR resource, which could be widely used in the community to study miRNAs and RBPs using full length 3′UTRs in unbiased and HT experiments.
To overcome these limitations, we have developed the first publically available and high-quality human 3′UTR clone library, sequenced verified and cloned in modular vectors amenable to various downstream analyses. This resource enables the systematic study of 3′UTR biology, can be used to efficiently detect miRNA and RBP targets at high resolution, and study mRNA localization and dynamics. In the context of disease states, this library allows the study of key disease alterations in post-transcriptional processing, such as disease-specific: 1) mRNA mislocalization, 2) alternative polyadenylation, 3) altered miRNA expression, 4) mutation of RNA binding protein elements in 3′UTRs, and 5) more generally, the contribution of post-transcriptional gene regulation to gene output in disease initiation and progression.
Results and discussion
The human 3′UTRome v1 clone collection (h3′UTRome v1) consists of 1,461 unique, cloned and sequence-validated human 3′UTRs from transcription factors, kinases and other regulatory genes (Fig. 1a). This collection is contained in modular Gateway® compatible Entry vectors, is amenable for large screens and is publically available to the community through the at the DNASU plasmid repository (https://dnasu.org/DNASU/Home.do).
Primer design and genomic PCR
As a first release, we targeted and designed genomic primer pairs encompassing the 3′UTR regions of 1,815 human protein coding genes using the human genome release 19 (Additional file 1: Table S1) (GRCh37/hg19 Feb. 2009) .
The forward primers used for the genomic PCR were designed to anneal within the last exon of the target gene, ending with the gene-specific STOP codon in frame with the rest of the transcript (Fig. 1b). This expedient allowed us to increase the melting temperature of each forward primer, since the G/C content drops considerably after the STOP codon. In addition, designing the forward primer within the open reading frame provides the 3′UTR with its natural gene-specific STOP codon at its 5′end, allowing convenient in-frame integration with the human ORFeome library, which instead lacks termination codons  (Fig. 1b). The melting temperatures of the primers ranged from 50 to 76 °C. Given this wide range of temperature we opted for a touchdown genomic PCR approach, starting at 66 °C and decreasing by 1 °C each cycle . The reverse primers were designed to target a genomic site 150 nt downstream of the annotated transcript, encompassing downstream elements that may play a role in mRNA 3′end formation (Fig. 1b). We added the Gateway® recombination elements attB2 (forward primers) and attB3 (reverse primers) to the 5′ends of the genomic primers, to facilitate the cloning into Gateway® compatible Entry vectors. A minimum of 200 ng of genomic DNA per reaction was required to obtain an enriched PCR product while minimizing non-specific amplicons, which is known to impact the recombinational cloning procedure. The complete pipeline used in this study is shown in Fig. 1c.
Gateway® recombinational cloning
The full understanding of gene expression must consider both transcriptional and post-transcriptional regulation, requiring attention to the transcriptional promoter, the Open reading frame (ORF) and the regulatory sites within the 3′UTR.
The human 3′UTRs in this collection were cloned into the pDONR P2r-P3 Gateway® Entry vector (Invitrogen) using BP recombinant cloning. This vector is part of the three-fragment Gateway® technology, which allows modular cloning of a given promoter, an ORFeome entry and correspondent 3′UTR to be assembled in order, into a single vector in the same reaction. This allows investigators to combine these 3′UTRs with different ORFs (which are already available in the ORFeome collection) to create both natural and novel regulatory contexts. Current protein expression vectors typically rely on viral 3′UTRs, such as the SV40 polyA, which often do not reflect natural translational levels or post-transcriptional regulation. In addition, the natural 3′UTR may contribute to proper localization and stability. This technology is also compatible with the 3′LIFE assay system and has been previously used to screen for functional miRNA targeting in 3′UTRs . Successfully cloned colonies were isolated and grown in LB and analyzed by colony PCR using primers specific to the pDONR P2r-P3 backbone. The PCR amplicons were analyzed by agarose gel electrophoresis and screened based on the expected lengths of the 3′UTRs (Fig. 1d). We observed an inverse correlation between the size of the inserted 3′UTR and the BP cloning success rate (Fig. 2a). A size bias during the BP cloning reaction has been previously reported , with a decreased efficiency for amplicons greater than 1,000 nt and in agreement with our observations . However 3′UTRs in the h3′UTRome v1 are enriched with longer 3′UTR isoforms and on average contain longer 3′UTRs than those within the human transcriptome (Fig. 2b). The nucleotide lengths of the human 3′UTR clones in this release span from 200 nt to 2,500 nt and have a median length of 1,159 nt, as opposed to the median length of 3′UTRs within the human genome, which is 1,040 nt (Fig. 2b, purple and red arrows).
The first pass of cloning produced a yield of 1,410 bacterial colonies with PCR products of the expected size. We performed a second pass on all 405 missed 3′UTRs and gained an additional 172 3′UTRs, a 12 % increase to the total number of size verified clones (Fig. 1c). The complete size-verified first release of the h3′UTRome v1 is shown in Fig. 1d.
A total of 1,582 size verified clones were subsequently sequenced using Sanger method using a custom primer anchored within the P2rP3 plasmid backbone. We used Perl scripts to perform BLAT alignments  using the Sanger trace files obtained during the sequencing. Our analysis revealed that out of the initially targeted 1,815 unique 3′UTRs, 1,461 were successfully sequence verified (~80 % success rate from genomic PCR to sequence verification).
3′UTRome library overview
The human 3′UTR clones contained in the h3′UTRome v1 are unbiased towards any particular regions of the genome and correspond to ~6-10 % of the total protein-coding genes present in each chromosome (Fig. 2c). The source of DNA used for the genomic PCR was GM12878, a lymphoblastoid cell line of female origin recommended as a Tier 1 cell line by the ENCODE project. Over 54 % of the 3′UTRs in the h3′UTRome v1 overlap with genes present in the hORFeome V8.1 (Fig. 2d) . We targeted 971 3′UTRs of genes already present in the ORFeome and successfully cloned 790 3′UTRs (Fig. 2d). For this first release, we targeted predominantly 3′UTRs of genes previously classified as transcription factors [22, 23], kinases , and RBPs  (Fig. 2e). We targeted the 3′UTRs of this class of genes because they have widespread regulatory functions and have corresponding ORFeome clones. The h3′UTRome v1 release includes 3′UTRs for 985 transcription factors, 171 Kinases and 156 RBPs (Fig. 2e).
The h3′UTRome v1 library is distributed by the DNA repository DNASU (https://dnasu.org/DNASU/Home.do), a public plasmid repository hosted at the Biodesign Institute at Arizona State University, which already distributes over 180,000 individual plasmids and full genome collections, including the human ORFeome . Users can either search for a given 3′UTR clone, a plate or order the complete dataset. Many researchers are not interested in HT screens nor have the resources for large screens in their departments, but want to detect miRNA targets, mutations, or truncation of regulatory elements in the 3′UTR of their gene of interest. These researchers will be able to accelerate their research significantly because they can now order the correct ORF, 3′UTR clones and the vectors they need for their analysis at reduced cost. To simplify the ordering procedure we have given a unique ID prefix ‘HSU’ to the human 3′UTRs available with this release.
3′LIFE validation screen
The 3′LIFE screen is a high throughput dual luciferase assay, previously shown to detect functional repression of test 3′UTRs by query miRNAs [8, 18]. The 3′LIFE screen utilizes Gateway® cloning technology and is fully compatible with the h3′UTRome v1. In order to demonstrate the usability and functionality of this library we have selected 87 human 3′UTRs from the h3′UTRome v1 library and screened for miRNA targets of two disease-relevant miRNAs: let-7c and miR-221 using the 3′LIFE assay [8, 18]. let-7c is a well-characterized tumor suppressor gene, is down-regulated in many cancers, and is known to target genes in the RAS pathway . Conversely, miR-221 is frequently overexpressed in breast cancer, hepatocellular carcinomas, glioblastoma and prostate cancer [28–31], and has been shown to target several tumor suppressor genes such as Kip-1 (p-27), CDKN1B, CDKN1C, PTEN, ARHI and PUMA [29, 32, 33]. In addition, miR-221 is known to be involved with muscle damage repair and atherosclerosis [34, 35]. One of the goals of this experiment was to use this 3′UTR library to rapidly identify bona fide miRNA targets from false targets predicted by miRNA targeting software. These programs, such as TargetScan , PicTar  and DIANA-microT  are known to have high false negative rates (~43 %) [8, 13, 14] and false positive rates (~66 %) [8, 36, 37], and cannot be used alone to definitively assign targets.
These 87 human 3′UTRs were enriched with let-7c and miR-221 predicted and validated targets from all three prediction softwares (9 predicted and 3 validated targets for let-7c and 10 predicted and 9 validated targets for miR-221) (Additional file 2: Table S2). For the let-7c screens, we also included two genes that contain validated miRNA targets identified in a previous screen . In addition, since miRNAs preferentially target genes within the same regulatory pathways , and let-7c was previously shown to target the RAS family of genes [8, 27], we were interested to test if let-7c could also target additional 25 members of this pathway (Additional file 2: Table S2), as defined by Gene Ontology  and KEGG databases .
We shuttled these 87 human 3′UTRs from the h3′UTRome v1 clone library into the 3′LIFE vector using LR recombination reactions. Using this custom library, we performed 435 fully automated transfections and dual luciferase experiments. The results of the screen are shown in Fig. 3. Using a cut-off for functionally repressed targets at a repression index of 0.8 and a p-value <0.05, we obtained 19 statistically significant hits for let-7c, and 13 for miR-221 (Fig. 3).
Our results validate 4 out of 9 of the let-7c targets predicted by prediction softwares [10–12]. Within the predicted hits, we detected all three previously validated targets (CDC25A, TRIM71 and BCL2L1) and an unvalidated, predicted target (RNF7) (Fig. 3 and Additional file 2: Table S2). Furthermore we detected an additional 10 novel and unpredicted targets for let-7c (Fig. 3 and Additional file 2: Table S2). We found that one of these novel targets PAK3, was predicted by the prediction algorithm miRanda , which takes into account non-canonical seed interactions. Of note, 3 targets within this group (MCM2, BUB1B and GMNN) were previously correlated indirectly with let-7 expression .
For miR-221, our results validated 4 out of 10 of the miR-221 targets predicted by the prediction software [10–12], and an additional 4 out of 9 of the targets previously validated by others (WEE1, ETS2, FMR1 and KIT) (Additional file 2: Table S2) [42–45]. Interestingly, we were unable to detect repression in 3′UTRs of 5 genes previously known to contain miR-221 responsive elements (CDKN1C, FOS, IRF2, ICAM1, PAK1) [46–50]. Upon further review, we found that repression of all five targeted elements was demonstrated using truncated sections of the 3′UTR. Thus, the observation that the 3′LIFE screen did not detect these targets could be caused by the inability of these elements to recruit miR-221 when expressed within their full length endogenous 3′UTRs, or by the presence of alternative polyadenylation events that cause the lost of these elements . Two targets of miR-221 called by the prediction software [10–12] were also not detected as hits in our assay (KHDRBS2 and RORB). We also discovered 9 novel and unpredicted targets for miR-221 not anticipated by major prediction software [10–12], or detected by others. (Additional file 2: Table S2). Within this group, FRAP1 was the only gene predicted by miRanda . Perfect complementarity within the seed region is considered the canonical indicator of miRNAs targeting. Interestingly, most of these novel targets do not always contain canonical seeds. Recent studies indicate that miRNAs are also capable of recognizing non-canonical elements in target mRNAs [8, 36, 51, 52], supporting our findings.
Taken together, these experiments validate 9 out of 18 bioinformatically predicted targets [10–12] (50 % false positive rate), which is in accordance with the false positive rates of prediction algorithms reported in previous studies [8, 13, 14, 36, 37]. In previous studies we used repression data from the 3′LIFE assay to identify and validate functional miRNA binding sites . With experimentally validated miRNA target sites, targeting signatures can be extrapolated to refine target predictions for specific miRNAs.
Interestingly, while the 3′LIFE assay is designed to detect repression of 3′UTRs by miRNAs, we detected several 3′UTRs that significantly enhanced the expression of the luciferase reporter gene in the presence of let-7c and miR-221 (Fig. 3). Perhaps these enhancements are caused by increase stability of a given 3′UTR due to direct or indirect interactions with the query miRNAs.
The ability to systematically screen large numbers of human 3′UTRs allowed in depth analysis of high-confidence target genes regulated by different miRNAs, and may reveal novel mechanisms that miRNAs use to regulate biological processes. For example, a gene ontology analysis of the let-7c top hits showed an enrichment for genes involved in cell cycle checkpoint regulation, while a similar analysis for miR-221 revealed a relationship with genes involved in negative regulation of muscle differentiation (Additional file 3: Table S3).
In addition, out of the 25 genes involved in the RAS pathway, our screen identified 7 genes directly targeted by let-7c (RhoB, PAK1, PAK3, BRAF, NFKBIA, BCL2L1, KIT), suggesting a role for let-7c in regulating this pathway.
3′UTRs contain powerful regulatory elements that are critical in various biological processes, yet remain poorly characterized because due to the absence of genomic tools that allow their systematic study. In this work we have prepared the first human 3′UTR clone collection named h3′UTRome v1, which is produced to a single high quality standard. This library is compatible with the cloning technology used to produce the human ORFeome, expanding the potential of well-established operating methods for high throughput experimentation. The h3′UTRome v1 library is sequence verified, and readily available to the community with simple online access to information through the DNASU repository , at a low cost to all scientists with minimal restriction. In order to demonstrate its utility, we performed a screen with 87 human 3′UTRs cherry picked from the h3′UTRome v1, and rapidly identified 27 miRNA targets for two disease-relevant miRNAs, let-7c and miR-221. Within this pool, we identified 18 novel targets for these two miRNAs, which were previously uncharacterized (67 %). In addition, we were able to eliminate 9 out of 18 bioinformatically predicted targets (50 % false positive), and rapidly associate miRNA activities to biological pathways using a rapid screening technology.
The h3′UTRome v1 can be easily used in similar HT experiments to systematically study RBP targeting in 3′UTRs, mRNA localization and the role of small ncRNAs in post transcriptional gene regulation.
DNA primer sequences were designed using custom Perl scripts using the annotated 3′UTR sequences in the Human Genome release 19 (Additional file 1: Table S1) (GRCh37/hg19 Feb. 2009) . The forward primers were anchored upstream of the last exon of each gene and included the gene specific endogenous STOP codons in frame with the ORFome library (Fig. 1). The reverse primers were designed to target sites 150 nt downstream of the longest annotated transcript, as per the RefSeq annotation, in order to include downstream 3′end processing elements (Fig. 1). Forward and reverse primers were fused to the attB2 (5′-GGGGACAGCTTTCTTGTACAAAGTGGAG-3′) and attB3 (5′-GGGGACAACTTTGTATAATAAAGTTG-3′) Gateway® sequences to allow modular cloning into pDONR P2rP3 Entry vectors. The full list of primers used is available as (Additional file 1: Table S1) and through DNASU (https://dnasu.org/DNASU) . The first release of the h3′UTRome V1 targeted a panel of 1,815 3′UTRs (Fig. 1) enriched for transcription factors, kinases, RNA binding protein and other regulatory genes. The length of the 3′UTRs cloned in this release ranges between 200 and 2,500 nt in length, which is larger than the average size of human 3′UTRs (Fig. 2).
We used the NA12878 DNA sample obtained from the NIGMS Human Genetic Cell Repository at the Coriell Institute for Medical Research (Camden, New Jersey). This genomic DNA was extracted from the GM12878, a B-lymphocyte cell line of a human female subject. Once received, the genomic DNA was diluted to a concentration of 200 ng/μl, aliquoted in 96-well PCR reactions, and stored at -80 °C until use.
Genomic touchdown PCR
The reactions were conducted using Platinum Taq polymerase (Invitrogen) in 96-well plates using 200 ng of genomic DNA per reaction. The reaction conditions were maintained as per the manufacturers protocol with changes to the annealing temperature of the reaction. The PCR conditions included 16 cycles of touchdown PCR, where the temperature of the annealing phase decreased by 1 °C per cycle, ending at a temperature of 50 °C. The reaction proceeded for 15 more cycles at an annealing temperature of 55 °C. The resulting PCR products were visualized on by electrophoresis on 96-well agarose gels and screened by size to determine successful amplicons.
Gateway® BP recombination reaction and transformation
Site-specific DNA recombination was used to clone the human 3′UTR PCR amplicons into the Gateway® Entry vector pDONR P2r-P3 (Invitrogen), using BP Clonase II Enzyme Mix kit (Invitrogen, Carlsbad, CA) following the manufacturer’s specifications. DH5α E. coli cells were transformed with 1 μl of the resultant reaction mixture, and screened the following day for successful recombinants using Lysogeny Broth (LB) agar plates with Kanamycin (Kan) antibiotic.
3′UTR isolation and size screening
In order to isolate single clonal populations, unique bacterial colonies for each 3′UTR clone were picked from the LB plates, and grown overnight in 96 deep-well plates containing LB (500 mL) with Kanamycin resistance (50 μg/mL) (total colonies picked = 1,824). The resultant bacterial growths were used as a template to perform colony PCR reactions using M13 DNA primer pairs. The amplicons were then analyzed in 96-well agarose gels and positive clones were initially screened based on their expected size (Fig. 1d). Up to three more colonies for genes that did not satisfy our quality control inspection were picked (total colonies picked = 753), and rescreened by repeating the bacterial colony PCR step. Bacterial colonies that passed the initial screen were re-arrayed and stored in glycerol stocks, while the primer pairs of the remaining genes were used in a second pass, starting at genomic PCR to capture any 3′UTR missed (Fig. 1c).
PCR analysis with M13 DNA primer pairs was performed for each positive 3′UTR clone, using overnight bacterial growths as a template and Phusion® Taq polymerase (New England Biolabs), as per manufacturers protocol. These PCR amplicons were then sent for sequencing at the DNA Lab, School of Life Sciences, Arizona State University, using the sequencing primer 1FP2rP3 seq (5′-GCATATGTTGTGTTTTACAGTATTATGTAG-3′) which binds ~100 nt upstream of the recombination element in the P2rP3 plasmid. 1,461 3′UTR clones successfully sequence verified and passed this step. The trace files for each 3′UTR clone successfully screened are available through DNAsu website . Using custom a BioPerl script with Blat integration  we mapped our sequencing results to 1,461 unique 3′UTRs in the human genome.
The 3′LIFE assay was performed as previously described . We re-arrayed bacterial colonies from a panel of human 3′UTRs from the h3′UTRome v1, and grew the plate over-night in a 96 deep-well format using 200 mL of LB in presence of kanamycin (50 μg/mL). We used 1 μl of the resultant overnight culture to perform the colony PCR with Phusion Taq polymerase (Invitrogen) as per manufacturers protocol. The amplicons from the PCR reaction were shuttled into the pLIFE-3′UTR vector (DNASU Plasmid ID: EvNO00601503) by LR recombination using LR Clonase enzyme (Invitrogen) as per manufacturers protocol. 1 μl of the resultant LR reaction mixture was transformed in DH5ɑ E.coli cells. The transformed cells were then plated on LB agar plates containing ampicillin (100 μg/ml), and incubated overnight at 37 °C. Single bacterial colonies were isolated and grown overnight in 500 mL of LB containing ampicillin (100 μg/mL). The resultant overnight bacterial growth was screened based on size using agarose gel electrophoresis. Bacterial colonies from wells passing the screen were frozen as glycerol stocks and also grown overnight for 96-well plasmid DNA extraction as previously described [8, 18]. In order to express let-7c miRNA we used the pLIFE-miR let-7c construct [8, 18]. The miRNA miR-221 was extracted from human genomic DNA derived from GM12878 cells using DNA primers containing Gateway® recombination elements (forward primer – 5′-GGGGACAGCTTTCTTGTACAAAGTGGAGTTTCAACATGATGTCATGATTAAATG-3′; reverse primer- 5′-GGGGACAACTTTGTATAATAAAGTTGCACCTTATCTCTGGTTTACTAGGCTG-3′). The amplified PCR amplicon was cloned into pLIFE-miR (DNASU Plasmid ID: EvNO00601504) vector using LR Clonase II enzyme as per manufactures protocol (Invitrogen). We designed the positive and negative controls for miR-221 targeting by introducing 22 nt long complementary sequences for the 3p arm (positive control) and 5p arm (negative control) arms of miR-221 into the SV40 3′UTR by site directed mutagenesis (Quikchange®, Invitrogen), as per manufacturer protocol. We used the let-7c positive control as previously described . Plasmid DNA was extracted as was previously described . The 3′LIFE assay was performed as previously described [8, 18]. In brief, 87 queried human 3′UTRs + 3 controls were transfected into HEK293T cells using the 96-well Shuttle nucleofection system (Lonza). Transfected cells were cultured for 72 h and then lysed, then used to perform the dual luciferase assay. The screen was performed five times (435 reactions), and the resulting data was analyzed as previously described [8, 18]. The top hits for each miRNA were distinguished by requiring a minimum repression index of 0.8 and a p-value < 0.05.
Luminescent Identification of Functional Elements in 3′UTRs
RNA Binding Proteins
Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136(2):215–33. doi:10.1016/j.cell.2009.01.002.
Jin P, Alisch RS, Warren ST. RNA and microRNAs in fragile X mental retardation. Nat Cell Biol. 2004;6(11):1048–53. doi:10.1038/ncb1104-1048.
Poy MN, Eliasson L, Krutzfeldt J, Kuwajima S, Ma X, Macdonald PE, et al. A pancreatic islet-specific microRNA regulates insulin secretion. Nature. 2004;432(7014):226–30. doi:10.1038/nature03076.
Calin GA, Sevignani C, Dumitru CD, Hyslop T, Noch E, Yendamuri S, et al. Human microRNA genes are frequently located at fragile sites and genomic regions involved in cancers. Proc Natl Acad Sci U S A. 2004;101(9):2999–3004. doi:10.1073/pnas.0307323101.
Ventura A, Jacks T. MicroRNAs and cancer: short RNAs go a long way. Cell. 2009;136(4):586–91. doi:10.1016/j.cell.2009.02.005.
Ray D, Kazan H, Cook KB, Weirauch MT, Najafabadi HS, Li X, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499(7457):172–7. doi:10.1038/nature12311.
Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116(2):281–97.
Wolter JM, Kotagama K, Pierre-Bez AC, Firago M, Mangone M. 3′LIFE: a functional assay to detect miRNA targets in high-throughput. Nucleic Acids Res. 2014;42(17):e132. doi:10.1093/nar/gku626.
Chen K, Rajewsky N. The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet. 2007;8(2):93–103. doi:10.1038/nrg1990.
Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120(1):15–20. doi:10.1016/j.cell.2004.12.035.
Lall S, Grun D, Krek A, Chen K, Wang YL, Dewey CN, et al. A genome-wide map of conserved microRNA targets in C. elegans. Curr Biol. 2006;16(5):460–71. doi:10.1016/j.cub.2006.01.050.
Paraskevopoulou MD, Georgakilas G, Kostoulas N, Vlachos IS, Vergoulis T, Reczko M, et al. DIANA-microT web server v5.0: service integration into miRNA functional analysis workflows. Nucleic Acids Res. 2013;41(Web Server issue):W169–73. doi:10.1093/nar/gkt393.
Easow G, Teleman AA, Cohen SM. Isolation of microRNA targets by miRNP immunopurification. RNA. 2007;13(8):1198–204. doi:10.1261/rna.563707.
Selbach M, Schwanhausser B, Thierfelder N, Fang Z, Khanin R, Rajewsky N. Widespread changes in protein synthesis induced by microRNAs. Nature. 2008;455(7209):58–63. doi:10.1038/nature07228.
Rual JF, Hirozane-Kishikawa T, Hao T, Bertin N, Li S, Dricot A, et al. Human ORFeome version 1.1: a platform for reverse proteomics. Genome Res. 2004;14(10B):2128–35. doi:10.1101/gr.2973604.
Yang X, Boehm JS, Yang X, Salehi-Ashtiani K, Hao T, Shen Y, et al. A public genome-scale lentiviral expression library of human ORFs. Nat Methods. 2011;8(8):659–61. doi:10.1038/nmeth.1638.
Blazie SM, Babb C, Wilky H, Rawls A, Park JG, Mangone M. Comparative RNA-Seq analysis reveals pervasive tissue-specific alternative polyadenylation in Caenorhabditis elegans intestine and muscles. BMC Biol. 2015;13:4. doi:10.1186/s12915-015-0116-6.
Wolter JM, Kotagama K, Babb CS, Mangone M. Detection of miRNA Targets in High-throughput Using the 3′LIFE Assay. J Vis Exp. 2015(99). doi:10.3791/52647.
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome Res. 2002;12(6):996–1006. doi:10.1101/gr.229102. Article published online before print in May 2002.
Marsischky G, LaBaer J. Many paths to many clones: a comparative look at high-throughput cloning methods. Genome Res. 2004;14(10B):2020–8. doi:10.1101/gr.2528804.
Kent WJ. BLAT--the BLAST-like alignment tool. Genome Res. 2002;12(4):656–64. doi:10.1101/gr.229202. Article published online before March 2002.
Schaefer U, Schmeier S, Bajic VB. TcoF-DB: dragon database for human transcription co-factors and transcription factor interacting proteins. Nucleic Acids Res. 2011;39(Database issue):D106–10. doi:10.1093/nar/gkq945.
Zhang HM, Chen H, Liu W, Liu H, Gong J, Wang H, et al. AnimalTFDB: a comprehensive animal transcription factor database. Nucleic Acids Res. 2012;40(Database issue):D144–9. doi:10.1093/nar/gkr965.
Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. The protein kinase complement of the human genome. Science. 2002;298(5600):1912–34. doi:10.1126/science.1075762.
Cook KB, Kazan H, Zuberi K, Morris Q, Hughes TR. RBPDB: a database of RNA-binding specificities. Nucleic Acids Res. 2011;39(Database issue):D301–8. doi:10.1093/nar/gkq1069.
Cormier CY, Mohr SE, Zuo D, Hu Y, Rolfs A, Kramer J, et al. Protein Structure Initiative Material Repository: an open shared public resource of structural genomics plasmids for the biological community. Nucleic Acids Res. 2010;38(Database issue):D743–9. doi:10.1093/nar/gkp999.
Johnson SM, Grosshans H, Shingara J, Byrom M, Jarvis R, Cheng A, et al. RAS is regulated by the let-7 microRNA family. Cell. 2005;120(5):635–47. doi:10.1016/j.cell.2005.01.014.
Stinson S, Lackner MR, Adai AT, Yu N, Kim HJ, O’Brien C, et al. TRPS1 targeting by miR-221/222 promotes the epithelial-to-mesenchymal transition in breast cancer. Sci Signal. 2011;4(177):ra41. doi:10.1126/scisignal.2001538.
Fornari F, Gramantieri L, Ferracin M, Veronese A, Sabbioni S, Calin GA, et al. MiR-221 controls CDKN1C/p57 and CDKN1B/p27 expression in human hepatocellular carcinoma. Oncogene. 2008;27(43):5651–61. doi:10.1038/onc.2008.178.
Ciafre SA, Galardi S, Mangiola A, Ferracin M, Liu CG, Sabatino G, et al. Extensive modulation of a set of microRNAs in primary glioblastoma. Biochem Biophys Res Commun. 2005;334(4):1351–8. doi:10.1016/j.bbrc.2005.07.030.
Galardi S, Mercatelli N, Giorda E, Massalini S, Frajese GV, Ciafre SA, et al. miR-221 and miR-222 expression affects the proliferation potential of human prostate carcinoma cell lines by targeting p27Kip1. J Biol Chem. 2007;282(32):23716–24. doi:10.1074/jbc.M701805200.
Garofalo M, Di Leva G, Romano G, Nuovo G, Suh SS, Ngankeu A, et al. miR-221&222 regulate TRAIL resistance and enhance tumorigenicity through PTEN and TIMP3 downregulation. Cancer Cell. 2009;16(6):498–509. doi:10.1016/j.ccr.2009.10.014.
le Sage C, Nagel R, Egan DA, Schrier M, Mesman E, Mangiola A, et al. Regulation of the p27(Kip1) tumor suppressor by miR-221 and miR-222 promotes cancer cell proliferation. EMBO J. 2007;26(15):3699–708. doi:10.1038/sj.emboj.7601790.
Liu X, Cheng Y, Yang J, Xu L, Zhang C. Cell-specific effects of miR-221/222 in vessels: molecular mechanism and therapeutic application. J Mol Cell Cardiol. 2012;52(1):245–55. doi:10.1016/j.yjmcc.2011.11.008.
Chistiakov DA, Sobenin IA, Orekhov AN, Bobryshev YV. Human miR-221/222 in Physiological and Atherosclerotic Vascular Remodeling. Biomed Res Int. 2015;2015:354517. doi:10.1155/2015/354517.
Chi SW, Hannon GJ, Darnell RB. An alternative mode of microRNA target recognition. Nat Struct Mol Biol. 2012;19(3):321–7. doi:10.1038/nsmb.2230.
Baek D, Villen J, Shin C, Camargo FD, Gygi SP, Bartel DP. The impact of microRNAs on protein output. Nature. 2008;455(7209):64–71. doi:10.1038/nature07242.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Gene Ontol Consortium Nat Genet. 2000;25(1):25–9. doi:10.1038/75556.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.
Betel D, Koppal A, Agius P, Sander C, Leslie C. Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 2010;11(8):R90. doi:10.1186/gb-2010-11-8-r90.
Johnson CD, Esquela-Kerscher A, Stefani G, Byrom M, Kelnar K, Ovcharenko D, et al. The let-7 microRNA represses cell proliferation pathways in human cells. Cancer Res. 2007;67(16):7713–22. doi:10.1158/0008-5472.CAN-07-1083.
Lupini L, Bassi C, Ferracin M, Bartonicek N, D’Abundo L, Zagatti B, et al. miR-221 affects multiple cancer pathways by modulating the level of hundreds messenger RNAs. Front Genet. 2013;4:64. doi:10.3389/fgene.2013.00064.
Wu YH, Hu TF, Chen YC, Tsai YN, Tsai YH, Cheng CC, et al. The manipulation of miRNA-gene regulatory networks by KSHV induces endothelial cell motility. Blood. 2011;118(10):2896–905. doi:10.1182/blood-2011-01-330589.
Zongaro S, Hukema R, D’Antoni S, Davidovic L, Barbry P, Catania MV, et al. The 3′ UTR of FMR1 mRNA is a target of miR-101, miR-129-5p and miR-221: implications for the molecular pathology of FXTAS at the synapse. Hum Mol Genet. 2013;22(10):1971–82. doi:10.1093/hmg/ddt044.
Godshalk SE, Paranjape T, Nallur S, Speed W, Chan E, Molinaro AM, et al. A Variant in a MicroRNA complementary site in the 3′ UTR of the KIT oncogene increases risk of acral melanoma. Oncogene. 2011;30(13):1542–50. doi:10.1038/onc.2010.536.
Togliatto G, Trombetta A, Dentelli P, Rosso A, Brizzi MF. MIR221/MIR222-driven post-transcriptional regulation of P27KIP1 and P57KIP2 is crucial for high-glucose- and AGE-mediated vascular cell damage. Diabetologia. 2011;54(7):1930–40. doi:10.1007/s00125-011-2125-5.
Ichimura A, Ruike Y, Terasawa K, Shimizu K, Tsujimoto G. MicroRNA-34a inhibits cell proliferation by repressing mitogen-activated protein kinase kinase 1 during megakaryocytic differentiation of K562 cells. Mol Pharmacol. 2010;77(6):1016–24. doi:10.1124/mol.109.063321.
Kneitz B, Krebs M, Kalogirou C, Schubert M, Joniau S, van Poppel H, et al. Survival in patients with high-risk prostate cancer is predicted by miR-221, which regulates proliferation, apoptosis, and invasion of prostate cancer cells by inhibiting IRF2 and SOCS3. Cancer Res. 2014;74(9):2591–603. doi:10.1158/0008-5472.CAN-13-1606.
Hu G, Gong AY, Liu J, Zhou R, Deng C, Chen XM. miR-221 suppresses ICAM-1 translation and regulates interferon-gamma-induced ICAM-1 expression in human cholangiocytes. Am J Physiol Gastrointest Liver Physiol. 2010;298(4):G542–50. doi:10.1152/ajpgi.00490.2009.
Zhang X, Mao H, Chen JY, Wen S, Li D, Ye M, et al. Increased expression of microRNA-221 inhibits PAK1 in endothelial progenitor cells and impairs its function via c-Raf/MEK/ERK pathway. Biochem Biophys Res Commun. 2013;431(3):404–8. doi:10.1016/j.bbrc.2012.12.157.
Lal A, Navarro F, Maher CA, Maliszewski LE, Yan N, O’Day E, et al. miR-24 Inhibits cell proliferation by targeting E2F2, MYC, and other cell-cycle genes via binding to “seedless” 3′UTR microRNA recognition elements. Mol Cell. 2009;35(5):610–25. doi:10.1016/j.molcel.2009.08.020.
Cevec M, Thibaudeau C, Plavec J. NMR structure of the let-7 miRNA interacting with the site LCS1 of lin-41 mRNA from Caenorhabditis elegans. Nucleic Acids Res. 2010;38(21):7814–21. doi:10.1093/nar/gkq640.
Seiler CY, Park JG, Sharma A, Hunter P, Surapaneni P, Sedillo C, et al. DNASU plasmid and PSI:Biology-Materials repositories: resources to accelerate biological research. Nucleic Acids Res. 2014;42(Database issue):D1253–60. doi:10.1093/nar/gkt1060.
Mi H, Muruganujan A, Thomas PD. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 2013;41(Database issue):D377–86. doi:10.1093/nar/gks1118.
We thank Victoria Godlove and Steve Blazie for discussions and reagents needed for the 3′UTR cloning effort. We thank Amanda Phomsavanah, Dasia Garcia and Christina Nguyen for help with the cloning of miR-221 and a portion of the clone library, and Thyu-Duyen Nguyen for suggestions and advice during the preparation of the manuscript. We thank Andrea Throop, Kevin Peasley and Hanna Johnson for sharing reagents, advice and help with the cloning effort. We thank Dr. Karen Anderson and her lab for help with the 3′LIFE screen. We thank Dr. Mitch Magee, Amit Sharma, Dr. Jin Park and Dr. Josh LaBaer for help in preparing the release of the library through DNASU.
The authors declare that they have no competing interests.
MM, KK and RM designed the experiments. KK and CB executed the experiments. KK and CB cloned most of the 3′UTRs described in the manuscript and assisted with the interpretation of the data. MM performed the bioinformatics analysis needed for the Sanger sequencing effort. KK and JW performed the 3′LIFE screen and analyzed the data. MM and KK led the analysis and interpretation of the data, assembled the figures and wrote the manuscript. All authors read and approved the final manuscript. This work was supported by the NIH grant 1R21CA179144, and the Arizona State University - Dublin City University Catalyst Fund.
List of primers used in the h3′UTRome v1. A unique RefSeq ID refers to each transcript. For each transcript listed the table displays its given alias, the forward and reverse primer sequences used for genomic PCR, the chromosome of origin and the length of the cloned 3′UTR. The Entrez and Ensembl gene IDs are listed where available. (PDF 391 kb)
List of 3′UTRs from the h3′UTRome v1 clone library used in the 3′LIFE assay. Each 3′UTR is referred to by a unique RefSeq ID and its Alias. The 3′UTRs were queried for predicted binding by miR-221 and let-7c using three prediction software; TargetScan, PicTar and DIANA tools [10–12]. Previously validated 3′UTR-miRNA interactions are listed with reference PMIDs. The direct involvement of each gene in the RAS pathway was established by using the GO  and KEGG  databases. The following column indicates if a given 3′UTR was among the top hits in the 3′LIFE screen for each miRNA. (PDF 37 kb)
Gene Ontology analysis of let-7c and miR-221 top hits. The top hits for let-7c and miR-221 were queried for enrichments in biological processes. Only results with p < 0.05 are shown. The resulting biological processes are sorted based on fold enrichment. (PDF 57 kb)
About this article
Cite this article
Kotagama, K., Babb, C.S., Wolter, J.M. et al. A human 3′UTR clone collection to study post-transcriptional gene regulation. BMC Genomics 16, 1036 (2015). https://doi.org/10.1186/s12864-015-2238-1
- 3′ untranslated region