Genome-wide analysis of human hotspot intersected genes highlights the roles of meiotic recombination in evolution and disease
© Zhou et al.; licensee BioMed Central Ltd. 2013
Received: 20 September 2012
Accepted: 29 January 2013
Published: 31 January 2013
Meiotic recombination events are not randomly located, but rather cluster at hotspot regions. Recently, the fine-scale mapping of genome-wide human recombination hotspots was performed. Here, we systematically analyzed the evolutionary and disease-associated features of hotspots that overlapped with protein-coding genes.
In this study, we defined hotspot intersected genes as HI genes. We found that HI genes were prone to be located in the extracellular part and were functionally enriched in cell-to-cell communication. Tissue-specific genes and secreted protein encoding genes were overrepresented in HI genes, while housekeeping genes were underrepresented. Compared to slowly evolving housekeeping genes and random genes with lower recombination rates, HI genes evolved faster. The fact that brain and blood specific genes were overrepresented in HI genes indicates that they may be involved in the evolution of human intelligence and the immune system. We also found that genes related to disease were enriched in HI genes, especially genes with disease-associated chromosomal rearrangements. Hotspot sequence motifs were overrepresented in common sequences of HI genes and genes with disease-associated chromosomal rearrangements. We further listed repeat elements that were enriched both in hotspots and genes with disease-associated chromosomal rearrangements.
HI genes are evolving and may be involved in the generation of key features of human during evolution. Disease-associated genes may be by-products of meiotic recombination. In addition, hotspot sequence motifs and repeat elements showed the connection between meiotic recombination and genes with disease-associated chromosomal rearrangements at the sequence level. Our study will enable us to better understand the evolutionary and biological significance of human meiotic recombination.
KeywordsMeiotic recombination Recombination hotspots Evolution Disease Chromosomal rearrangements Repeat element
Meiotic homologous recombination is necessary for accurate chromosomal disjunction which is important in maintaining genome integrity. It also contributes to genetic diversity by generating new combinations of alleles, thereby providing raw materials for evolution [1, 2]. Errors in meiotic recombination can cause genome instability and diseases. For example, chromosome nondisjunction or reduced recombination may result in constitutive aneuploidy, which can lead to spontaneous abortion or congenital birth defects. In addition, non-allelic homologous recombination (NAHR) can cause chromosomal rearrangements, many of which have been associated with diseases . To ensure accurate homologous recombination during meiosis, the location and rate of recombination should be precisely regulated. Previous studies have found that meiotic recombination events are non-randomly located, but cluster at hotspots that exhibit elevated rates of recombination . Recent studies have constructed high-resolution fine-scale maps of recombination rates and hotspots across the human genome . In addition, hotspot-associated sequence motifs have also been reported in the human genome . The PR domain zinc finger protein 9 (PRDM9) is thought to be a transregulator of meiotic recombination hotspots in humans and mice, and the hotspot sequence motifs may serve as binding sites for this protein . However, the motifs of hotspot may also drive genome instability, as they are found in disease-causing repeat sequences and breakpoint regions . Since hotspots are not shared between humans and chimpanzees, hotspot sequences are thought to have evolved quickly .
In the present study, we focused on the consequences of human meiotic recombination hotspots, in regard to determining their effect on genome evolution, and evaluating their association with genome instability and diseases. In order to answer these questions at the genome and protein-coding gene levels, we first defined the human meiotic recombination hotspot intersected gene as the HI gene. Since HI genes directly overlapped with recombination hotspots, they may reflect the features and consequences of recombination. We first analyzed the functional enrichments of HI genes, and then studied the evolutionary features of HI genes by comparing them to slowly and quickly evolving genes. We also analyzed the association of HI genes with human diseases and sought to explore the sequence basis for the connections. These findings will enable us to better understand the evolutionary and biological significance of human meiotic recombination hotspots.
Results and discussion
General features of HI genes
There were 2,054 transcripts that intersected with 1,775 hotspots. In total, we obtained a list of 1,156 hotspot intersected genes, and named the dataset HI genes. The transcripts of HI genes were merged to 1,136 non-overlapping regions. The minimum, maximum, and median lengths of HI gene regions that were covered by hotspots are 7 bases, 120,000 bases, and 10,000 bases respectively. About 72% of HI gene regions covered at least one entire hotspot region, and about 97% of HI gene regions covered at least 1,000 bases with hotspot regions.
Functional annotation of HI genes
Previous study have shown that hotspot predictions are enriched at genes with particularly important roles in the central nervous system . Our results, based on fine-scale mapping of the recombination rate, also provided evidence that HI genes were preferentially located around ion channels and are involved in neuron development. Cell adhesion and extracellular components are important for the evolution of multicellular organisms . Since HI gene products were enriched in cell adhesion and the extracellular matrix, we hypothesized that HI genes may play roles in human evolution.
HI genes and human evolution
Comparison of evolutionary rates
The results of the evolutionary rates were consistent with previous findings demonstrating that human SNP (single-nucleotide polymorphism) variability and mutation rate are higher in regions of high recombination . The faulty repair of DSBs (double-strand breaks) that initiated recombination could explain the mutagenesis of recombination. However the dN/dS values of HI genes, MI genes, and CI genes were really close numerically (Table S2 in Additional file 1). This could be explained by the viewpoint of historical recombination rates. Humans and common chimpanzees share about 95 ~ 99% of their DNA (deoxyribonucleic acid), but hotspot locations are markedly different, and the sex averaged recombination rates of humans might be elevated relative to other primates . It seems that hotspots themselves were evolving, and some regions of hotspots might shift during evolution. The current hotspots in human were observed, but there may be historical hotspots in other regions. So the already evolved historical HI genes may also displayed relative elevated evolutionary rates.
Tandem duplicated genes are mainly generated by ‘unequal crossing over’, which results from homologous recombination between paralogous sequences . To further analyze the evolutionary consequences of recombination hotspots, we calculated the proportions of DGD (Duplicated genes)  transcripts in HI transcripts, MI transcripts, CI transcripts, HK transcripts, TiGER transcripts, SPD transcripts, and hg18rpa (RefSeq protein-coding genes located in autosomal chromosomes, based on human genome hg18) transcripts (Table S4 in Additional file 1). We found that HI transcripts had a higher percentage of DGD transcripts than HK transcripts (20.1% versus 9.1%, P < 0.0001), indicating that HK genes are very different from HI genes in evolution. However compared to HI transcripts, hg18rpa transcripts had a similar percentage of DGD transcripts (22.2%, P > 0.05), and both TiGER transcripts and SPD transcripts had a significantly higher percentage of DGD transcripts than HK transcripts.
As the whole protein-coding genes had a high percentage of duplicated genes, this hinted at the shift of hotspot regions during evolution. MI transcripts and CI transcripts had a lower percentage of DGD transcripts than HI transcripts; however TiGER transcripts and SPD transcripts had a higher percentage of DGD transcripts. These results were consistent with the evolutionary rates of HI genes, HK genes, TiGER genes, SPD genes, MI genes, and CI genes. Thus we concluded that genes located in recombination regions are currently evolving.
HI genes and human diseases
The OMIM dataset includes genes associated with a mixture of all kinds of human disorders, and MD genes were also extracted from OMIM genes but were filtered to include only those genes with strong evidence showing that at least one mutation in the particular gene is causative of disease. MD genes are reported to have a tendency to be ancient genes that are highly associated with duplication events during evolution . The enrichment of MD genes in HI genes further indicated that heritable diseases may be by-products of the evolutionary process.
The dbCRID dataset contains genes involved in chromosomal rearrangement in human diseases . Most part of dbCRID genes are those genes involved in reciprocal translocation (71.0% in dbCRID genes, 76.7% in HI-dbCRID common genes), and reciprocal translocation associated dbCRID genes were enriched in HI genes (FE = 3.09, P < 0.001). Interestingly, TICdb genes also had a higher value of fold enrichment than CGC genes. CGC genes are consisting of all mutated genes in human cancer . In terms of mutation type, most CGC genes are translocation associated genes (66.5% in CGC genes, 88.9% in HI-CGC common genes); translocation associated CGC genes were also enriched in HI genes (FE = 2.23, P < 0.001). The TICdb genes, which contain reciprocal translocation associated genes in human tumours, were also enriched in HI genes. Together, these results show that chromosomal rearrangements in human diseases are highly associated with recombination hotspots. For example, BCL2 is annotated as an HI gene and is located on 18q21.33. BCL2 was initially reported to be involved in t(14;18) translocation in lymphoma . Reed JC et al. discovered the oncogenic potential of BCL2 by gene transfer . Another example of an HI gene is RAD51, which is located on 14q24.1. RAD51 plays key roles in both mitotic and meiotic homologous recombination and in DNA double-strand break repair. A region of RAD51 is reported to be involved in reciprocal translocations in uterine leiomyomas .
There were only 29 genes found both in HI-MD (20.6%) and HI-InteCR common genes (31.5%), indicating that genes involved in heritable diseases and chromosomal rearrangements associated diseases are to some extent two different classes of genes. MD genes are disease-causing genes, but may not directly associate with recombination events. Thus, some Mendelian heritable diseases may be indirect consequences of recombination events during evolution. However, InteCR genes are involved in disease causing chromosomal rearrangements at the sequence level, and corresponding chromosomal rearrangements may be directly associated with recombination events. Both homologous recombination and non-homologous end joining are the main repair systems for handling DNA double-strand breaks . Actual crossover events that would result in genome rearrangements are found to be suppressed during the recombinational repair of double-strand breaks . Thus meiotic recombination does not aim to cause diseases, but we did observe that many diseases genes were enriched in HI genes. As HI genes were directly overlapped with meiotic recombination hotspot, we explained it by the fact that some diseases may be by-products of faulty recombination events in meiotic recombination hotspot regions.
Meiotic non-allelic homologous recombinations in the germ line can cause genome instability, and thus result in genomic disorders, but the associated protein-coding genes are not well annotated . The most frequent chromosomal rearrangement associated disease is cancer , and the most frequent type in cancer is translocation . About 90.8% CGC genes show somatic mutations (the corresponding percentage for HI-CGC common genes is 97.8%). HI genes were extracted from meiotic recombination hotspots, but chromosomal rearrangement associated genes in somatic disease were enriched in HI genes. Mitotic recombination has been reported to be associated with carcinogenesis . However, hotspots for mitotic recombination in human have not been fully studied. Our results suggest that there may be some connections between meiotic and mitotic recombination hotspots at the sequence level.
To further explore the common molecular features of recombination hotspots and genes involved in chromosomal rearrangements associated diseases, we analyzed the occurrences of hotspot motifs and repeat elements in HI gene regions and InteCR gene regions. There are three sequence motifs including CCTCCCT, CCCCACCCC and CCNCCNTNNCCNC associated with recombination hotspots . All these motifs had elevated proportions in InteCR gene regions, HI gene regions, and HI-InteCR common gene regions compared to hg18rpa gene regions (Table S5 in Additional file 1). The most significantly enriched motif was CCCCACCCC. That indicates the motif CCCCACCCC is mostly responsible for chromosomal rearrangements associated diseases. We found 130 repeat elements which were significantly overrepresented in hotspot regions compared to autosomal genome sequences (Table S6 in Additional file 1). Previously reported enriched repeat elements in hotspot sequences including THE1B, THE1A, CT-rich, and L2 are also in the list . These repeats may drive recombination regions to hotspots. To further search possible repeats that are responsible for chromosomal rearrangement associated diseases, we analyzed the occurrences of repeat elements in HI-InteCR gene regions. There were 51 repeat elements enriched in HI-InteCR gene regions compared to InteCR gene regions, and 37 of them were also enriched repeat elements in hotspot regions (Tables S7 and S8 in Additional file 1). These 37 common repeat elements may play a key role in both driving recombination hotspots and chromosomal rearrangement associated diseases. For example, MIRb is one of the enriched repeat elements. MIRb insertion was reported to be involved in paraganglioma and pheochromocytoma which are associated with germline mutation of the tumour suppressor genes .
Meiotic recombination is important for evolution and is also highly associated with genome instability. Since HI genes are overlapped with recombination hotspots, HI gene regions may also endure recombination at high frequency. We systematically analyzed the evolutionary and biomedical features of meiotic recombination hotspots at the protein-coding gene level. Thus, it enables us to better understand the consequences of recombination during evolution since the protein-coding genes are well annotated functionally. Our results illustrate the double sides of meiotic recombination.
The evolutionary importance of meiotic recombination
Our results demonstrate that HI genes are functionally enriched in cell-to-cell communication which is a key function for multicellular organisms. Compared to slowly evolved housekeeping genes, HI genes evolved more quickly. We also believe that hotspots themselves are changing during evolution. As most HI genes are old genes, it indicates that some old genes are still evolving and that provide materials for human evolution through recombination. HI genes may point out the status and future of human evolution as new genes and new functions are generated. We found that tissue-specific genes such as brain and blood specific genes were overrepresented in HI genes. Thus, human HI genes may also be important for human intelligence and the immune system.
The by-products of meiotic recombination
The purpose of meiotic recombination is not aim to cause diseases, but rather to repair DNA break points. However many disease-associated genes are enriched in HI genes. We first show that Mendelian heritable diseases may be indirect by-products of recombination events during evolution. We also find that chromosomal rearrangement associated diseases are highly associated with hotspot regions. Since many observed chromosomal rearrangement associated diseases are somatic cancers, which are mainly caused by mitotic recombination, it indicates that meiotic recombination may be safe but hotspot sequences may be susceptible to diseases. We also find that hotspot-associated motifs are also enriched in genes involved in chromosomal rearrangement associated diseases. We provide repeat elements that may both promote recombination hotspots and chromosomal rearrangement associated diseases.
Definition of HI gene and dataset collection
Summary of gene datasets
Protein-coding genes on autosomal chromosomes (based on human genome hg18 and RefSeq genes)
Recombination hotspots intersected genes
Recombination middle spots intersected genes
Recombination cold spots intersected genes
Genes encoding secreted proteins
Genes classified by different evolutionary age
Mendelian disorder associated genes in human
Mendelian Disease Genes (at least one mutation in the particular gene is causative of the disease)
Genes with mutations have been causally implicated in cancer
Reciprocal translocation associated genes in human tumours
Chromosomal rearrangement associated genes in human diseases
Integrated chromosomal rearrangement associated genes in human diseases (Combined of dbCRID, TICdb and CGC genes)
Functional annotation of HI genes
DAVID (The Database for Annotation, Visualization and Integrated Discovery, version 6.7) provided a comprehensive set of online functional annotation tools used to describe the biological meaning behind a large list of genes . RefSeq mRNA identifies of HI genes were submitted to DAVID to perform GO (Gene Ontology) enrichment analysis. The hg18rpa genes were set as the background for comparison. The lowest fold enrichment value was set to 1.5 and the false discovery rate (FDR) was set to 0.05. A total of 2,054 mRNA identifies of HI genes were mapped to 1,142 DAVID genes; and 30,320 mRNA identifies of hg18rpa genes were mapped to 17,626 DAVID genes. There are 833 DAVID HI genes observed in GO-CC (cellular component). Corresponding numbers for GO-MF (molecular function) and GO-BP (biological process) are 802 and 839.
For all enrichment analysis in this part, hg18rpa genes were set as background and statistical values were calculated by Fisher’s exact test. For comparison of evolutionary rates, gene datasets were first mapped to ensemble proteins. The dN, dS, and dN/dS values for human-mouse orthologs were directly obtained from Ensemble through BioMart . For the evolutionary comparison of HK genes, TiGER genes, and SPD genes, the common genes with HI genes were excluded in all paired datasets respectively. For example, while comparing the evolutionary rates of HK genes with HI genes, the common genes between HK genes and HI genes were removed to obtain a better representation of the unique features of each dataset. And for comparison of HI genes, MI genes and CI genes, only unique genes in each dataset were retained. The Wilcoxon rank sum non-parametric test was applied for statistical comparison . Duplicated genes were downloaded from the duplicated genes database (DGD) . The DGD genes are co-located to the same chromosome and are highly similar in sequences. Thus, DGD genes are groups of tandem repeated genes. The enrichment comparisons of duplicated genes were also performed by Fisher’s exact test. The non-redundant mRNA transcripts were counted to reflect the feature of gene duplication.
Disease association analysis
For calculation of the enrichments of disease-associated genes in HI genes, hg18rpa genes were set as background and statistical values were calculated by Fisher’s exact test. A total of 114 fragile sites that located in autosomal chromosomes were extracted from a published dataset and mapped to the hg18 genome . The resolutions of fragile site regions were range from 1E + 3 to1E + 4 kb. The InteCR dataset integrated chromosomal rearrangement associated disease genes in dbCRID, TICdb and CGC genes (only included genes involved in amplification, large deletion, gene conversion, and translocation). A total of 614 InteCR genes and 618 gene regions were obtained. The enrichments of hotspot motifs in InteCR gene regions, HI gene regions, and HI-InteCR regions were calculated by Fisher’s exact test (the hg18rpa gene regions were set as the background). Repeatmasker data, downloaded from UCSC, was used for the analysis of repeated sequences . The fisher’s exact test was used to evaluate the enrichments of repeat elements in datasets.
This work was supported by grants from 973 programs (2011CB944304, 2013CB911400), and the Chinese Natural Science Funds (81222006). It was also sponsored by Qing Lan Project.
- Coop G, Przeworski M: An evolutionary view of human recombination. Nat Rev Genet. 2007, 8: 23-34.View ArticlePubMed
- Webster MT, Hurst LD: Direct and indirect consequences of meiotic recombination: implications for genome evolution. Trends Genet. 2012, 28: 101-109. 10.1016/j.tig.2011.11.002.View ArticlePubMed
- Sasaki M, Lange J, Keeney S: Genome destabilization by homologous recombination in the germ line. Nat Rev Mol Cell Biol. 2010, 11: 182-195.PubMed CentralView ArticlePubMed
- Lichten M, Goldman AS: Meiotic recombination hotspots. Annu Rev Genet. 1995, 29: 423-444. 10.1146/annurev.ge.29.120195.002231.View ArticlePubMed
- Kong A, Thorleifsson G, Gudbjartsson DF, Masson G, Sigurdsson A, Jonasdottir A, Walters GB, Gylfason A, Kristinsson KT, Gudjonsson SA: Fine-scale recombination rate differences between sexes, populations and individuals. Nature. 2010, 467: 1099-1103. 10.1038/nature09525.View ArticlePubMed
- Myers S, Bottolo L, Freeman C, McVean G, Donnelly P: A fine-scale map of recombination rates and hotspots across the human genome. Science. 2005, 310: 321-324. 10.1126/science.1117196.View ArticlePubMed
- Berg IL, Neumann R, Lam KW, Sarbajna S, Odenthal-Hesse L, May CA, Jeffreys AJ: PRDM9 variation strongly influences recombination hot-spot activity and meiotic instability in humans. Nat Genet. 2010, 42: 859-863. 10.1038/ng.658.PubMed CentralView ArticlePubMed
- Myers S, Freeman C, Auton A, Donnelly P, McVean G: A common sequence motif associated with recombination hot spots and genome instability in humans. Nat Genet. 2008, 40: 1124-1129. 10.1038/ng.213.View ArticlePubMed
- Paigen K, Petkov P: Mammalian recombination hot spots: properties, control and evolution. Nat Rev Genet. 2010, 11: 221-233.PubMed CentralView ArticlePubMed
- Freudenberg J, Fu YH, Ptacek LJ: Enrichment of HapMap recombination hotspot predictions around human nervous system genes: evidence for positive selection?. Eur J Hum Genet. 2007, 15: 1071-1078. 10.1038/sj.ejhg.5201876.View ArticlePubMed
- Hynes RO, Zhao Q: The evolution of cell adhesion. J Cell Biol. 2000, 150: F89-96. 10.1083/jcb.150.2.F89.View ArticlePubMed
- Zhang L, Li WH: Mammalian housekeeping genes evolve more slowly than tissue-specific genes. Mol Biol Evol. 2004, 21: 236-239.View ArticlePubMed
- Winter EE, Goodstadt L, Ponting CP: Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. Genome Res. 2004, 14: 54-61.PubMed CentralView ArticlePubMed
- Lercher MJ, Hurst LD: Human SNP variability and mutation rate are higher in regions of high recombination. Trends Genet. 2002, 18: 337-340. 10.1016/S0168-9525(02)02669-0.View ArticlePubMed
- Hurles M: Gene duplication: the genomic trade in spare parts. PLoS Biol. 2004, 2: E206-10.1371/journal.pbio.0020206.PubMed CentralView ArticlePubMed
- Ouedraogo M, Bettembourg C, Bretaudeau A, Sallou O, Diot C, Demeure O, Lecerf F: The Duplicated Genes Database: Identification and Functional Annotation of Co-Localised Duplicated Genes across Genomes. PLoS One. 2012, 7: e50653-10.1371/journal.pone.0050653.PubMed CentralView ArticlePubMed
- Zhang YE, Landback P, Vibranovski MD, Long M: Accelerated recruitment of new brain development genes into the human genome. PLoS Biol. 2011, 9: e1001179-10.1371/journal.pbio.1001179.PubMed CentralView ArticlePubMed
- Long M, Betran E, Thornton K, Wang W: The origin of new genes: glimpses from the young and old. Nat Rev Genet. 2003, 4: 865-875.View ArticlePubMed
- Barreiro LB, Marioni JC, Blekhman R, Stephens M, Gilad Y: Functional comparison of innate immune signaling pathways in primates. PLoS Genet. 2010, 6: e1001249-10.1371/journal.pgen.1001249.PubMed CentralView ArticlePubMed
- Gibson KR: Evolution of human intelligence: the roles of brain size and mental construction. Brain Behav Evol. 2002, 59: 10-20. 10.1159/000063730.View ArticlePubMed
- Dickerson JE, Robertson DL: On the origins of Mendelian disease genes in man: the impact of gene duplication. Mol Biol Evol. 2012, 29: 61-69. 10.1093/molbev/msr111.PubMed CentralView ArticlePubMed
- Kong F, Zhu J, Wu J, Peng J, Wang Y, Wang Q, Fu S, Yuan LL, Li T: dbCRID: a database of chromosomal rearrangements in human diseases. Nucleic Acids Res. 2011, 39: D895-900. 10.1093/nar/gkq1038.PubMed CentralView ArticlePubMed
- Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR: A census of human cancer genes. Nat Rev Cancer. 2004, 4: 177-183. 10.1038/nrc1299.PubMed CentralView ArticlePubMed
- Pegoraro L, Palumbo A, Erikson J, Falda M, Giovanazzo B, Emanuel BS, Rovera G, Nowell PC, Croce CM: A 14;18 and an 8;14 chromosome translocation in a cell line derived from an acute B-cell leukemia. Proc Natl Acad Sci USA. 1984, 81: 7166-7170. 10.1073/pnas.81.22.7166.PubMed CentralView ArticlePubMed
- Reed JC, Cuddy M, Slabiak T, Croce CM, Nowell PC: Oncogenic potential of bcl-2 demonstrated by gene transfer. Nature. 1988, 336: 259-261. 10.1038/336259a0.View ArticlePubMed
- Date O, Katsura M, Ishida M, Yoshihara T, Kinomura A, Sueda T, Miyagawa K: Haploinsufficiency of RAD51B causes centrosome fragmentation and aneuploidy in human cells. Cancer Res. 2006, 66: 6018-6024. 10.1158/0008-5472.CAN-05-2803.View ArticlePubMed
- Helleday T, Lo J, van Gent DC, Engelward BP: DNA double-strand break repair: from mechanistic understanding to cancer treatment. DNA Repair (Amst). 2007, 6: 923-935. 10.1016/j.dnarep.2007.02.006.View Article
- Richardson C, Moynahan ME, Jasin M: Double-strand break repair by interchromosomal recombination: suppression of chromosomal translocations. Genes Dev. 1998, 12: 3831-3842. 10.1101/gad.12.24.3831.PubMed CentralView ArticlePubMed
- Sengstag C: The role of mitotic recombination in carcinogenesis. Crit Rev Toxicol. 1994, 24: 323-353. 10.3109/10408449409017922.View ArticlePubMed
- LaFave MC, Sekelsky J: Mitotic recombination: why? when? how? where?. PLoS Genet. 2009, 5: e1000411-10.1371/journal.pgen.1000411.PubMed CentralView ArticlePubMed
- Debacker K, Kooy RF: Fragile sites and human disease. Hum Mol Genet. 2007, 16 Spec No. 2: R150-158.View ArticlePubMed
- Bayley JP, Weiss MM, Grimbergen A, van Brussel BT, Hes FJ, Jansen JC, Verhoef S, Devilee P, Corssmit EP, Vriends AH: Molecular characterization of novel germline deletions affecting SDHD and SDHC in pheochromocytoma and paraganglioma patients. Endocr Relat Cancer. 2009, 16: 929-937. 10.1677/ERC-09-0084.View ArticlePubMed
- Goecks J, Nekrutenko A, Taylor J: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010, 11: R86-10.1186/gb-2010-11-8-r86.PubMed CentralView ArticlePubMed
- Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ: The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004, 32: D493-496. 10.1093/nar/gkh103.PubMed CentralView ArticlePubMed
- Pruitt KD, Tatusova T, Maglott DR: NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2005, 33: D501-504. 10.1093/nar/gki476.PubMed CentralView ArticlePubMed
- Chang CW, Cheng WC, Chen CR, Shu WY, Tsai ML, Huang CL, Hsu IC: Identification of human housekeeping genes and tissue-selective genes by microarray meta-analysis. PLoS One. 2011, 6: e22859-10.1371/journal.pone.0022859.PubMed CentralView ArticlePubMed
- Chen Y, Zhang Y, Yin Y, Gao G, Li S, Jiang Y, Gu X, Luo J: SPD–a web-based secreted protein database. Nucleic Acids Res. 2005, 33: D169-173. 10.1093/nar/gni168.PubMed CentralView ArticlePubMed
- Liu X, Yu X, Zack DJ, Zhu H, Qian J: TiGER: a database for tissue-specific gene expression and regulation. BMC Bioinformatics. 2008, 9: 271-10.1186/1471-2105-9-271.PubMed CentralView ArticlePubMed
- Hamosh A, Scott AF, Amberger J, Bocchini C, Valle D, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2002, 30: 52-55. 10.1093/nar/30.1.52.PubMed CentralView ArticlePubMed
- Novo FJ, de Mendibil IO, Vizmanos JL: TICdb: a collection of gene-mapped translocation breakpoints in cancer. BMC Genomics. 2007, 8: 33-10.1186/1471-2164-8-33.PubMed CentralView ArticlePubMed
- da Huang W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009, 4: 44-57.View ArticlePubMed
- Vilella AJ, Severin J, Ureta-Vidal A, Heng L, Durbin R, Birney E: EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates. Genome Res. 2009, 19: 327-335.PubMed CentralView ArticlePubMed
- Wilcoxon F: Individual comparisons by ranking methods. Biometrics. 1945, 1: 80-83. 10.2307/3001968.View Article
- Jurka J: Repbase update: a database and an electronic journal of repetitive elements. Trends Genet. 2000, 16: 418-420. 10.1016/S0168-9525(00)02093-X.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.