Skip to main content

MirSNP, a database of polymorphisms altering miRNA target sites, identifies miRNA-related SNPs in GWAS SNPs and eQTLs

Abstract

Background

Numerous single nucleotide polymorphisms (SNPs) associated with complex diseases have been identified by genome-wide association studies (GWAS) and expression quantitative trait loci (eQTLs) studies. However, few of these SNPs have explicit biological functions. Recent studies indicated that the SNPs within the 3’UTR regions of susceptibility genes could affect complex traits/diseases by affecting the function of miRNAs. These 3’UTR SNPs are functional candidates and therefore of interest to GWAS and eQTL researchers.

Description

We developed a publicly available online database, MirSNP (http://cmbi.bjmu.edu.cn/mirsnp), which is a collection of human SNPs in predicted miRNA-mRNA binding sites. We identified 414,510 SNPs that might affect miRNA-mRNA binding. Annotations were added to these SNPs to predict whether a SNP within the target site would decrease/break or enhance/create an miRNA-mRNA binding site. By applying MirSNP database to three brain eQTL data sets, we identified four unreported SNPs (rs3087822, rs13042, rs1058381, and rs1058398), which might affect miRNA binding and thus affect the expression of their host genes in the brain. We also applied the MirSNP database to our GWAS for schizophrenia: seven predicted miRNA-related SNPs (p < 0.0001) were found in the schizophrenia GWAS. Our findings identified the possible functions of these SNP loci, and provide the basis for subsequent functional research.

Conclusion

MirSNP could identify the putative miRNA-related SNPs from GWAS and eQTLs researches and provide the direction for subsequent functional researches.

Background

MicroRNAs (miRNAs) are small non-coding RNA molecules of ~22 nucleotides that primarily mediate post-transcriptional gene silencing processes in animals [1, 2]. MiRNAs inactivate specific mRNAs and interfere with the translation of target proteins [3]. In mammals, miRNAs are predicted to control the activities of ~50% of all protein-coding genes [4]. As key post-transcriptional regulators, miRNAs have an important role in a wide range of biological processes, including cell proliferation, differentiation, apoptosis and metabolism [2, 3]. Evidence indicates that miRNAs are also involved in the pathogenesis of complex diseases, such as cancer and mental disorders [4, 5].

Complementarity to bases 2-8 of the miRNA (the seed site) is important in miRNA-mRNA binding [6, 7]. MiRNAs are key regulators of gene expression; therefore, SNPs in the seed sites of miRNA targets may create, as well as destroy, miRNA binding sites, and further affect phenotypes and disease susceptibility [8]. Identifying these seed-site SNPs could help in the further exploration of the molecular mechanism of gene dysregulation. In addition, genetic variants in miRNA genes may also have important roles by affecting miRNA maturation, which may affect disease susceptibility [8]. Certain polymorphisms in miRNA genes have been found to be associated with various complex diseases, including cancers, mental diseases, cardiomyopathy, and asthma (Additional file 1: Table S1).

GWAS and eQTLs are powerful methods for identifying genetic variants contributing to disease risk and gene expression. In a GWAS of schizophrenia (SCZ), Ripke et al. found the most significant SNP (p < 1.6 × 10-11) within an intron of a putative primary transcript for hsa-mir-137 and found four other SNPs achieving genome-wide significance that were located in predicted target sites of hsa-mir-137 [9]. It was estimated that more than 50% of the protein-coding genes are regulated by miRNAs, and each miRNA may regulate hundreds of potential targets [10, 11]. Taking into account large scale of biological significance shown by miRNAs, miRNA-related SNPs could be important variations leading to traits and diseases. Identifying allele-specific miRNA-mRNA interactions may indicate the functional roles of the SNPs from GWAS and eQTLs that presently lack obvious known function.

To help identifying putative miRNA-related SNPs from researchers’ own GWAS and cis-acting eQTLs data set, we have developed a freely available database, named “MirSNP”, which provides SNPs located in predicted miRNA target sites. This database has minor allele frequency (MAF) and linkage disequilibrium (LD) information for the SNPs and has annotations concerning their creative or disruptive effects on putative target sites. The MirSNP database comprises over 414,510 predicted miRNA-related SNPs, enabling users to identify potential miRNA-related SNPs from their own GWAS or eQTLs data. In this work, we applied MirSNP to our schizophrenia GWAS data and several brain eQTLs data as examples.

Construction and content

Data sources

To store the mRNA sequences, miRNA data and SNPs, a local Structured Query Language (SQL) database was built using MySQL software. Human miRNAs were downloaded from miRBase18.0 [12]. For SNP catalogs, four tables from dbSNP135 [13] were used (see URLs). To maximize the consistency between different databases, mRNA sequence files were obtained from NCBI (see URLs) rather than from the UCSC genome browser. In total, 42,733 mRNA sequences and 513,249 SNPs located in the 3’UTRs of mRNA were eligible for subsequent analysis.

Identifying polymorphisms in pre-miRNA genes

We identified SNPs either located in human pre-miRNA genes or in their adjacent upstream/downstream 200 bp regions by comparing the coordinates of both SNPs and related miRNAs (Figure 1). There were 12,846 polymorphisms, including 1,940 SNPs in pre-miRNA sequences. An SQL script was written to calculate the SNP density (Figure 2). The SNP density in pre-miRNA regions declined rapidly to 0.43 SNPs/kb if we considered only SNPs having a MAF above 0.01 in four populations (CEU, HCB, JPT, YRI).

Figure 1
figure 1

Workflow depicting analysis of variations in/near miRNA genes.

Figure 2
figure 2

Density of SNPs in human miRNA genes. (A) SNP density in human pre-miRNAs and flanking regions. Each bar represents the average level of all human miRNAs, error bars represent the standard deviation of the mean value. MiRNA and SNP information came from mirBASE18 and dbSNP135, respectively. Figure A shows the result of all SNPs in dbSNP135. (B) Figure B only shows SNPs with a calculated MAF > 0.01 in at least one population from four (CEU, HCB, JPT, YRI).

Identifying polymorphisms in miRNA target sites

The method of identifying polymorphisms in mRNAs affecting miRNA-mediated processes is shown in Figure 3. The information as to whether or not a SNP is located in the 3’UTR of an mRNA came from dbSNP135. Only SNPs located in mRNA 3’UTR areas were recorded in the local SQL database. Preforming sequence alignment between 20-bp DNA sequences surrounding 3’UTR SNPs and the corresponding mRNA sequences, variants were mapped onto their mRNA sequences. Subsequently, each SNP in our database had two to four mRNA sequence records corresponding to different alleles. Using the mRNA sequences of SNPs and miRNA mature sequences, we obtained the predicted target results using an miRNA target prediction algorithm, miRanda, which has highest sensitivity among eight tested algorithms [14].

Figure 3
figure 3

(A) Workflow depicting the analysis of variations in miRNA-mRNA binding sites. (B) An example of how we determinate a SNP in predicted mRNA target site.

Although there are examples that imperfect 7-nt seed site pairing can be functional, there is overwhelming evidence to support the hypothesis that Watson-Crick pairing to the miRNA seed site is the most important feature for miRNA prediction and function [6, 7]. Therefore, we adopted the 7-nt seed region in the miRNA as the major criterion in the miRanda algorithm [15]. In detail, to predict miRNA binding sites, we applied miRanda v3.3a with the default pairing score cutoff of “≥140” and “–strick” command, which only considered stringent 7-nt seed pairing (requiring an uninterrupted match of at least seven nucleotides from the 5’-end of the miRNA).

Additional information about MirSNP

A requirement for perfect 7-nt seed site pairing improves the reliability of miRNA target prediction, particularly when seed motif is conserved in the UTR regions of whole-genome alignments [16, 17]. We downloaded the conservative information of whole-genome alignments (phastCons 46way vertebrates from UCSC ftp site, see URLs) [18] and then added the average value of conservative scores of 7-nt seed motif to our database. The score of mirSVR methodology, a machine learning method for ranking miRNA target sites [19], were also added to the MirSNP database as annotation. However, the data of MirSNP are not identical to that of the mirSVR. Imperfect overlap may traced back to SNPs (the mirSVR methodology didn’t consider the impact of SNPs on miRNA-mRNA bindings), the use of different UTR database and miRNA information. Therefore, not all results in MirSNP should have the score of mirSVR.

SNPs become more important if they have a high frequency or are undergoing positive selection. Therefore, we have added MAF information from dbSNP135 (see URLs) for the SNPs into MirSNP. Based on this information, the results of MirSNP filtered by MAF could be displayed. Analysis of the MAF data revealed that there were 32,822 SNPs located in miRNA-mRNA binding sites with MAFs greater than 0.01 in the four populations. In addition, 122 SNPs in pre-miRNA genes had MAF data and the remaining 1,818 SNPs lacked MAF data.

LD information was obtained for each SNP from the HapMap project [20]. The Phase2 HapMap fileset (see URLs) was downloaded for the four populations and the linkage of the SNPs was calculated using a threshold of r2 greater than 0.8 using the PLINK software [21]. LD and MAF information of SNPs were stored into separate tables in download page (see URLs).

Database construction

All the useful data were stored in a local MySQL database. We used Django, a web application framework written in Python, to build a user-friendly online website (http://cmbi.bjmu.edu.cn/mirsnp).

Utility

The website

We obtained over one million records for 414,510 miRNA-related SNPs. These SNPs were classified into four groups, labeled as create, enhance, decrease or break. To display the records in the MirSNP database, we designed a user-friendly MirSNP web site (see URLs) to allowing searching for SNPs in putative miRNA target sites. On the frame “single search” (Figure 4A), the system allows users to search records by entering a RefGene name, mRNA id, SNP RSid or mature miRNA name. All identifiable RefGene names and mRNA ids for this search can be found by clicking the hyperlinks “Refgene name” and “mRNA id” on the page. After pressing the “Search” button, the results are presented on a new page (Figure 4B). The record in Figure 4B shows that the A-allele of rs56352346 may promote the binding of CYP4B1 gene and hsa-let-7a-2-3p and when rs56352346 is the C-allele, the mature miRNA and gene cannot combine (Figure 4B). The specific explanation of each column is provided in the help page. Additionally, on the “single search” frame, users can choose whether to display the binding alignment as well as whether to filter the output by MAF.

Figure 4
figure 4

(A) The “Single search” frame of the MirSNP web site. (B) An example of miRNA-related SNP search results.

We would like to recommend the frame named “Query disease & trait associated SNPs” (Figure 5). This frame permits the search of SNPs from GWAS and eQTL studies. The acceptable input is a list of SNP RSid in a text file. Here, users can query not only the submitted SNPs, but also their linked SNPs (r2 > 0.8). For example, searching for associated SNPs in our schizophrenia GWAS data (p < 0.0001) in this frame, four additional SNPs were identified compared to using the “batch search” frame that does not consider linked SNPs.

Figure 5
figure 5

The “Query disease & trait associated SNPs” frame of the MirSNP web site.

The use of MirSNP for brain eQTL data sets

Many studies have implicated the association between SNPs and the expression of their host genes. We speculate that it will be of great significance to combine our miRNA-related SNPs data and eQTL data. We used the MirSNP database for three human brain eQTL data sets, including human cortical samples from 193 individuals [22], cortex samples from 269 individuals [23] and samples obtained from 150 subjects [24]. SNPs located in miRNA-mRNA binding sites could affect the expression of their host genes; therefore, we only considered eQTLs that had a cis-effect on the host genes. Four putative miRNA-related SNPs (rs3087822, rs13042, rs1058381, rs1058398) were selected and they were statistically significant, genome-wide, in the three brain eQTL data (Table 1). Based on our in silico analysis, we hypothesize that these SNPs may affect the miRNA mechanism and thus affect the mRNA expression of their host genes in the brain. Further experiments are necessary to confirm our speculation concerning these SNPs.

Table 1 Brain eQTLs found in predicted miRNA-mRNA binding sites

The use of MirSNP for a schizophrenia GWAS data set

Our previous GWAS data [25], involving 746 SCZ cases and 1,599 healthy controls, identified a set of 7,705 SNPs having a statistical significance of p < 0.01. Here, we conducted a genome-wide analysis for these GWAS SNPs falling within computationally predicted miRNA targets. We combined putative miRNA-related SNPs and GWAS SNPs with SNP id as the key. To increase the range of combination, we used HapMap data and the software Plink to calculate r2 between pairwise SNPs. The GWAS data are from Chinese Han population; therefore, we chose 90 Asian individuals from the HapMap project for LD analyses. A subset of 4,997 SNPs in predicted miRNA target sites were in our GWAS analyses. Hence, we set 1.0 × 10-5 (0.05/4997) as the threshold of statistical significance. Three polymorphisms were identified (Table 2). The SNP that showed the strongest association with schizophrenia was found in the TBC1D15 gene (p = 4.0 × 10-6 in the Chinese Han population). The in silico analysis implied that three SNPs (rs17110432, rs11178988, and rs11178989) in 3’UTR area of TBC1D15 may affect the miRNA-mRNA binding process. However, further experiments are necessary.

Table 2 Putative miRNA-related SNPs associated with schizophrenia

We also overlapped the SNPs around microRNA genes in our GWAS data of schizophrenia. A subset of 108 SNPs around miRNA genes was identified. We found one SNP (rs10173558), which is only 5 bp away from the start site of “hsa-mir-1302-4”, having a statistical significance lower than 0.001. Yuan et al. reported that the highest expression level of miR-1302’ target genes was in the nervous system and the genes were enriched in both synapses and intracellular membrane-bounded organelles [26]. This finding implied a potential relevance of miR-1304 to psychiatric disorders.

Discussion

In recent years, increasing numbers of databases have been published to aid researchers to explore the impact of SNPs on miRNA targets [2733]. Some researchers, such as Richardson [31] and Ziebarth [32], have provided links between SNPs in miRNA target sites, cis-acting eQTLs and the results of GWAS. Previous works summarized the characteristics of miRNA-related SNPs and showed the potential of applying such databases in GWAS and eQTL researches. However, these databases may not be suitable for a single GWAS or eQTL data set. Some databases cannot perform batch search for numbers of SNPs and some cannot provide effective miRNA-related information for strongly- associated loci. MirSNP was developed to identify putative miRNA-related SNPs from single data sets of GWAS or eQTL, especially from newly published data sets. First, our analysis covered 513,249 known 3’UTR SNPs based on dbSNP135 and we used highly consistent data sources to avoid data loss while integrating different data. Furthermore, sequence alignments between surrounding DNA sequences of SNPs and the corresponding mRNA sequences were used to map variants into their mRNA sequences. Finally, our web site was designed to directly use GWAS or eQTL data sets in a batch query, particularly considering the linked SNPs in different populations.

The MirSNP database stores a large number of records of SNPs in predicted miRNA targets sites and we concentrated on providing a convenient search platform so that recent GWAS or eQTL results can be placed on this platform for batch retrieval. In MirSNP, all 3’UTR SNPs stored in dbSNP135 database (513,249) were analyzed. Compared with other databases, MirSNP obtained more results for both common and rare SNPs that might influence miRNA processes. In Table 3 and Figure 6, a comparison between MirSNP and three existing databases (PolymiRTS, Mirsnpscore and Patrocles) are presented. To compare the sensitivity of MirSNP prediction, we identified 13 validated miRNA-related SNPs by literature review: most of the SNPs coming from the table 2 in Sethupathy and Collins [34]. Of the 13 cases (Table 4), nine were identified using MirSNP. The SNPs that not found were either not located in the 3’UTR based on our records (rs34764978, rs13212041) or do not have a perfect 7-nt binding in the seed site (rs2735383, rs9341070). For the other three databases, from thirteen cases, no more than five validated SNPs were identified. Therefore, the MirSNP database could cover majority of validated miRNA-related SNPs.

Table 3 Comparison among four databases
Figure 6
figure 6

Comparison between MirSNP and three similar databases.

Table 4 Experimentally validated miRNA-related SNPs found in MirSNP

MiRNAs can downregulate gene expression by two posttranscriptional mechanisms: mRNA cleavage or translational repression. In animals, miRNA is thought to have a repressive effect that influences protein expression, not the mRNA levels. However, it has been estimated that over 80 percent of miRNAs acted to lower mRNA levels, which shows that mRNA destabilization is the primary action mode of miRNAs on target mRNAs [35]. In addition, studies have shown that SNPs in seed-sites region can significantly change the expression of the target mRNA and protein [3640]. eQTL analysis is an experimental method for exploring the relationship between SNPs and mRNA expression at a high throughput. Genetic variants that create or destroy an miRNA binding site may be the casual cis-acting eQTLs. The combination of our MirSNP database and eQTL data will provide possible explanations for the eQTLs. We have already identified four SNPs in prediected miRNA target sites that were proven to be brain eQTLs in three independent studies. Further experiments are needed to prove that these eQTLs affect gene expression through an miRNA mechanism. On the other hand, using GWAS, enormous numbers of associations between SNPs and diseases have been reported. There are many disease-associated SNPs that function by miRNA regulation. Overlapping miRNA-related SNPs and the existing GWAS data could identify possible biological mechanisms for these disease-associated variations and provide the in silico basis for further studies.

Our aim was to merge the MirSNP database with high throughput SNP experimental data. We identified many SNPs in predicted miRNA targets and indicated their potential functions based on sequence algorithms. Unfortunately, the expression information of miRNAs and mRNAs are not supplied in our database. We considered the possibility of an miRNA and mRNA combining, but under the complex mechanism of spatial and temporal expression, we had no idea if the two molecules would encounter each other in vivo. Combining MirSNP with additional databases containing expression information, such as miRGator [41], would improve the functionality of the database.

Conclusions

MirSNP is a database of SNPs in predicted miRNA target sites, based on information from dbSNP135 and mirBASE18. MirSNP is highly sensitive and covers most experiments confirmed SNPs that affect miRNA function. The results suggest that our prediction and data processing are full-scale. MirSNP may be combined with researchers’ own GWAS or eQTL positive data sets to identify the putative miRNA-related SNPs from traits/diseases associated variants. We aim to update the MirSNP database as new versions of mirBASE and dbSNP database become available.

Availability and requirements

MirSNP is publicly available on the internet at http://cmbi.bjmu.edu.cn/mirsnp.

URLs

MirSNP (http://cmbi.bjmu.edu.cn/mirsnp)

dbSNP135 (ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606/database/organism_data)

mRNA sequence file (ftp://ftp.ncbi.nih.gov//refseq/H_sapiens/mRNA_Prot/human.rna.fna.gz, released November 15, 2011)

HapMap fileset (http://pngu.mgh.harvard.edu/~purcell/plink/res.shtml)

mirBASE (http://www.mirbase.org/ftp.shtml)

UCSC (ftp://hgdownload.cse.ucsc.edu/goldenPath/hg19/phastCons46way/vertebrate/)

Abbreviations

3’UTR:

3’ untranslated region

CEU:

Samples of Utah residents with Northern and Western European ancestry from the CEPH collection

HCB:

Samples of Han Chinese in Beijing

JPT:

Samples of Japanese in Tokyo

YRI:

Samples of Yoruba people in Ibadan

eQTL:

Expression quantitative trait locus

GWAS:

Genome-wide association study

LD:

Linkage disequilibrium

MAF:

Minor allele frequency

miRNA:

microRNA

SCZ:

Schizophrenia

SNP:

Single nucleotide polymorphism

SQL:

Structured Query Language.

References

  1. He L, Hannon GJ: MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet. 2004, 5 (7): 522-531. 10.1038/nrg1379.

    Article  CAS  PubMed  Google Scholar 

  2. Mendell JT: MicroRNAs: critical regulators of development, cellular physiology and malignancy. Cell Cycle. 2005, 4 (9): 1179-1184. 10.4161/cc.4.9.2032.

    Article  CAS  PubMed  Google Scholar 

  3. Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004, 116 (2): 281-297. 10.1016/S0092-8674(04)00045-5.

    Article  CAS  PubMed  Google Scholar 

  4. Krol J, Loedige I, Filipowicz W: The widespread regulation of microRNA biogenesis, function and decay. Nat Rev Genet. 2010, 11 (9): 597-610.

    CAS  PubMed  Google Scholar 

  5. Gao FB: Posttranscriptional control of neuronal development by microRNA networks. Trends Neurosci. 2008, 31 (1): 20-26. 10.1016/j.tins.2007.10.004.

    Article  PubMed Central  PubMed  Google Scholar 

  6. Brennecke J, Stark A, Russell RB, Cohen SM: Principles of microRNA-target recognition. PLoS Biol. 2005, 3 (3): e85-10.1371/journal.pbio.0030085.

    Article  PubMed Central  PubMed  Google Scholar 

  7. Lai EC: Micro RNAs are complementary to 3’ UTR sequence motifs that mediate negative post-transcriptional regulation. Nat Genet. 2002, 30 (4): 363-364. 10.1038/ng865.

    Article  CAS  PubMed  Google Scholar 

  8. Ryan BM, Robles AI, Harris CC: Genetic variation in microRNA networks: the implications for cancer research. Nat Rev Cancer. 2010, 10 (6): 389-402. 10.1038/nrc2867.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  9. Ripke S, Sanders AR, Kendler KS, Levinson DF, Sklar P, Holmans PA, Lin DY, Duan J, Ophoff RA, Andreassen OA, et al: Genome-wide association study identifies five new schizophrenia loci. Nat Genet. 2011, 43 (10): 969-976. 10.1038/ng.940.

    Article  CAS  Google Scholar 

  10. Bartel DP: MicroRNAs: target recognition and regulatory functions. Cell. 2009, 136 (2): 215-233. 10.1016/j.cell.2009.01.002.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. Voinnet O: Origin, biogenesis, and activity of plant microRNAs. Cell. 2009, 136 (4): 669-687. 10.1016/j.cell.2009.01.046.

    Article  CAS  PubMed  Google Scholar 

  12. Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39 (Database issue): D152-D157.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K: dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001, 29 (1): 308-311. 10.1093/nar/29.1.308.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  14. Alexiou P, Maragkakis M, Papadopoulos GL, Reczko M, Hatzigeorgiou AG: Lost in translation: an assessment and perspective for computational microRNA target identification. Bioinformatics. 2009, 25 (23): 3049-3055. 10.1093/bioinformatics/btp565.

    Article  CAS  PubMed  Google Scholar 

  15. Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA.org resource: targets and expression. Nucleic Acids Res. 2008, 36 (Database issue): D149-D153.

    PubMed Central  CAS  PubMed  Google Scholar 

  16. Berezikov E: Evolution of microRNA diversity and regulation in animals. Nat Rev Genet. 2011, 12 (12): 846-860. 10.1038/nrg3079.

    Article  CAS  PubMed  Google Scholar 

  17. Lewis BP, Burge CB, Bartel DP: Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005, 120 (1): 15-20. 10.1016/j.cell.2004.12.035.

    Article  CAS  PubMed  Google Scholar 

  18. Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005, 15 (8): 1034-1050. 10.1101/gr.3715005.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  19. Betel D, Koppal A, Agius P, Sander C, Leslie C: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 2010, 11 (8): R90-10.1186/gb-2010-11-8-r90.

    Article  PubMed Central  PubMed  Google Scholar 

  20. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, Gibbs RA, Belmont JW, Boudreau A, Hardenbol P, Leal SM, et al: A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007, 449 (7164): 851-861. 10.1038/nature06258.

    Article  CAS  PubMed  Google Scholar 

  21. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81 (3): 559-575. 10.1086/519795.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. Myers AJ, Gibbs JR, Webster JA, Rohrer K, Zhao A, Marlowe L, Kaleem M, Leung D, Bryden L, Nath P, et al: A survey of genetic human cortical gene expression. Nat Genet. 2007, 39 (12): 1494-1499. 10.1038/ng.2007.16.

    Article  CAS  PubMed  Google Scholar 

  23. Colantuoni C, Lipska BK, Ye T, Hyde TM, Tao R, Leek JT, Colantuoni EA, Elkahloun AG, Herman MM, Weinberger DR, et al: Temporal dynamics and genetic control of transcription in the human prefrontal cortex. Nature. 2011, 478 (7370): 519-523. 10.1038/nature10524.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  24. Gibbs JR, van der Brug MP, Hernandez DG, Traynor BJ, Nalls MA, Lai SL, Arepalli S, Dillman A, Rafferty IP, Troncoso J, et al: Abundant quantitative trait loci exist for DNA methylation and gene expression in human brain. PLoS Genet. 2010, 6 (5): e1000952-10.1371/journal.pgen.1000952.

    Article  PubMed Central  PubMed  Google Scholar 

  25. Yue WH, Wang HF, Sun LD, Tang FL, Liu ZH, Zhang HX, Li WQ, Zhang YL, Zhang Y, Ma CC, et al: Genome-wide association study identifies a susceptibility locus for schizophrenia in Han Chinese at 11p11.2. Nat Genet. 2011, 43 (12): 1228-1231. 10.1038/ng.979.

    Article  CAS  PubMed  Google Scholar 

  26. Yuan Z, Sun X, Jiang D, Ding Y, Lu Z, Gong L, Liu H, Xie J: Origin and evolution of a placental-specific microRNA family in the human genome. BMC Evol Biol. 2010, 10: 346-10.1186/1471-2148-10-346.

    Article  PubMed Central  PubMed  Google Scholar 

  27. Barenboim M, Zoltick BJ, Guo Y, Weinberger DR: MicroSNiPer: a web tool for prediction of SNP effects on putative microRNA targets. Hum Mutat. 2010, 31 (11): 1223-1232. 10.1002/humu.21349.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  28. Bruno AE, Li L, Kalabus JL, Pan Y, Yu A, Hu Z: miRdSNP: a database of disease-associated SNPs and microRNA target sites on 3’UTRs of human genes. BMC Genomics. 2012, 13: 44-10.1186/1471-2164-13-44.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  29. Gong J, Tong Y, Zhang HM, Wang K, Hu T, Shan G, Sun J, Guo AY: Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum Mutat. 2012, 33 (1): 254-263. 10.1002/humu.21641.

    Article  CAS  PubMed  Google Scholar 

  30. Hiard S, Charlier C, Coppieters W, Georges M, Baurain D: Patrocles: a database of polymorphic miRNA-mediated gene regulation in vertebrates. Nucleic Acids Res. 2010, 38 (Database issue): D640-D651.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  31. Richardson K, Lai CQ, Parnell LD, Lee YC, Ordovas JM: A genome-wide survey for SNPs altering microRNA seed sites identifies functional candidates in GWAS. BMC Genomics. 2011, 12: 504-10.1186/1471-2164-12-504.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  32. Ziebarth JD, Bhattacharya A, Chen A, Cui Y: PolymiRTS Database 2.0: linking polymorphisms in microRNA target sites with human diseases and complex traits. Nucleic Acids Res. 2012, 40 (Database issue): D216-D221.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  33. Thomas LF, Saito T, Saetrom P: Inferring causative variants in microRNA target sites. Nucleic Acids Res. 2011, 39 (16): e109-10.1093/nar/gkr414.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  34. Sethupathy P, Collins FS: MicroRNA target site polymorphisms and human disease. Trends Genet. 2008, 24 (10): 489-497. 10.1016/j.tig.2008.07.004.

    Article  CAS  PubMed  Google Scholar 

  35. Guo H, Ingolia NT, Weissman JS, Bartel DP: Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature. 2010, 466 (7308): 835-840. 10.1038/nature09267.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  36. Yang L, Li Y, Cheng M, Huang D, Zheng J, Liu B, Ling X, Li Q, Zhang X, Ji W, et al: A functional polymorphism at microRNA-629-binding site in the 3’-untranslated region of NBS1 gene confers an increased risk of lung cancer in Southern and Eastern Chinese population. Carcinogenesis. 2012, 33 (2): 338-347. 10.1093/carcin/bgr272.

    Article  CAS  PubMed  Google Scholar 

  37. Liu Z, Wei S, Ma H, Zhao M, Myers JN, Weber RS, Sturgis EM, Wei Q: A functional variant at the miR-184 binding site in TNFAIP2 and risk of squamous cell carcinoma of the head and neck. Carcinogenesis. 2011, 32 (11): 1668-1674. 10.1093/carcin/bgr209.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. Wynendaele J, Bohnke A, Leucci E, Nielsen SJ, Lambertz I, Hammer S, Sbrzesny N, Kubitza D, Wolf A, Gradhand E, et al: An illegitimate microRNA target site within the 3’ UTR of MDM4 affects ovarian cancer progression and chemosensitivity. Cancer Res. 2010, 70 (23): 9641-9649. 10.1158/0008-5472.CAN-10-0527.

    Article  CAS  PubMed  Google Scholar 

  39. Saetrom P, Biesinger J, Li SM, Smith D, Thomas LF, Majzoub K, Rivas GE, Alluin J, Rossi JJ, Krontiris TG, et al: A risk variant in an miR-125b binding site in BMPR1B is associated with breast cancer pathogenesis. Cancer Res. 2009, 69 (18): 7459-7465. 10.1158/0008-5472.CAN-09-1201.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  40. Sun Q, Gu H, Zeng Y, Xia Y, Wang Y, Jing Y, Yang L, Wang B: Hsa-mir-27a genetic variant contributes to gastric cancer susceptibility through affecting miR-27a and target gene expression. Cancer Sci. 2010, 101 (10): 2241-2247. 10.1111/j.1349-7006.2010.01667.x.

    Article  CAS  PubMed  Google Scholar 

  41. Cho S, Jun Y, Lee S, Choi HS, Jung S, Jang Y, Park C, Kim S, Kim W: miRGator v2.0: an integrated system for functional investigation of microRNAs. Nucleic Acids Res. 2011, 39 (Database issue): D158-D162.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We extend our gratitude to all the subjects who participated in this study. This work was supported by grants from the National Natural Science Foundation of China (81071087, 81071088), the International Science & Technology Cooperation Program of China (2010DFB30820), the National High Technology Research and Development Program of China (2009AA022702).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Weihua Yue or Dai Zhang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

LCX designed and implemented the database, and performed all data processing. LM and LCX designed the web interface. YWH, WLF and ZFQ participated in the analysis of SCZ-associated SNPs and eQTLs. ZD and LTT planned and directed the project. LCX, LTT and YWH drafted the manuscript. All authors have read and approved the manuscript.

Electronic supplementary material

12864_2012_4764_MOESM1_ESM.xls

Additional file 1: Table S1: Disease-associated SNPs found within microRNA genes. We queried articles for all SNPs located in human pre-miRNA genes and their adjacent upstream/downstream 200 bp regions. 286 articles were found that were published before November, 2011. We reviewed these papers and selected 22 SNPs that were reported to be associated with diseases. aPubmed ids and SNP information are from dbSNP135. bmicroRNA positions are from mirBASE18. cThe MAF column reports the allele frequency of the minor allele from the four populations where it was the highest. (XLS 26 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Liu, C., Zhang, F., Li, T. et al. MirSNP, a database of polymorphisms altering miRNA target sites, identifies miRNA-related SNPs in GWAS SNPs and eQTLs. BMC Genomics 13, 661 (2012). https://doi.org/10.1186/1471-2164-13-661

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2164-13-661

Keywords