- Research article
- Open Access
Network-based SNP meta-analysis identifies joint and disjoint genetic features across common human diseases
- Matthias Arnold†1Email author,
- Mara L Hartsperger†1,
- Hansjörg Baurecht†2, 3,
- Elke Rodríguez2,
- Benedikt Wachinger1,
- Andre Franke4,
- Michael Kabesch5,
- Juliane Winkelmann6, 7, 8,
- Arne Pfeufer6, 8, 9,
- Marcel Romanos10,
- Thomas Illig11,
- Hans-Werner Mewes1, 12,
- Volker Stümpflen1 and
- Stephan Weidinger2Email author
© Arnold et al.; licensee BioMed Central Ltd. 2012
- Received: 29 May 2012
- Accepted: 12 September 2012
- Published: 18 September 2012
Genome-wide association studies (GWAS) have provided a large set of genetic loci influencing the risk for many common diseases. Association studies typically analyze one specific trait in single populations in an isolated fashion without taking into account the potential phenotypic and genetic correlation between traits. However, GWA data can be efficiently used to identify overlapping loci with analogous or contrasting effects on different diseases.
Here, we describe a new approach to systematically prioritize and interpret available GWA data. We focus on the analysis of joint and disjoint genetic determinants across diseases. Using network analysis, we show that variant-based approaches are superior to locus-based analyses. In addition, we provide a prioritization of disease loci based on network properties and discuss the roles of hub loci across several diseases. We demonstrate that, in general, agonistic associations appear to reflect current disease classifications, and present the potential use of effect sizes in refining and revising these agonistic signals. We further identify potential branching points in disease etiologies based on antagonistic variants and describe plausible small-scale models of the underlying molecular switches.
The observation that a surprisingly high fraction (>15%) of the SNPs considered in our study are associated both agonistically and antagonistically with related as well as unrelated disorders indicates that the molecular mechanisms influencing causes and progress of human diseases are in part interrelated. Genetic overlaps between two diseases also suggest the importance of the affected entities in the specific pathogenic pathways and should be investigated further.
- Genome-wide association study
- Genetic overlap
- Shared variant network
- Disease comorbidity
In the past years enormous progress has been made in the identification of complex trait susceptibility loci. The application of genome-wide association studies (GWAS) created a still growing set of genetic markers associated with increased risk for a multitude of different diseases. In contrast to few single loci exerting large effects on some phenotypes – mostly immune-related traits – the majority of traits was only associated with loci displaying small effects of odds ratios ranging from 1.1 to 1.5. Meta-analyses of several GWA studies further extended the set of known disease-related associations with even lower-effect variants. Despite the impressive progress in the field, for most traits only a small proportion of the total heritability is yet explained by known risk variants. A notable exception is type 1 diabetes (T1D) where validated risk loci explain a large proportion of the total heritability. In contrast, for most traits a considerably larger number of variants was reported to be associated, but typically these explain less than 50% of the total heritability.
Intriguingly, although published individual GWAS are usually carried out for one trait at a time, a significant overlap in the associations of several complex diseases becomes apparent. Besides effects on a specific phenotype, loci and single SNPs thus may also exert pleiotropic effects by contributing to a variety of traits. While it is not surprising that susceptibility genes for closely related traits should be shared, multi-functionality of a gene in phenotype presentation, i.e. pleiotropy, sensu stricto refers to seemingly unrelated and distinct traits. Loci or variants affecting several traits might have small effects on each specific trait, but may be of major biological interest while indicating shared or branching etiological mechanisms. In principle, the influence of such loci can be agonistic or antagonistic, i.e. involve concurrent similar or opposite effects of the same variant for different traits. So far, few studies attempted to study such loci in a systemic fashion and rather focused on shared risk variants in closely related traits like autoimmune diseases[8–10], heart diseases or cancer.
In order to identify shared or branching pathways of related as well as diverse (i.e. medically and phenotypically distinct) diseases, we performed a systematic comparative analysis of genetic commonalities and differences across traditionally defined traits using the available repository of GWAS results. In the context of network medicine, we utilized an approach based on the diseasome concept and investigated high-significance associations beyond conventional single-marker analysis in a hypothesis-free and comprehensive way. In former studies we found differing approaches of gene and locus assignment to association markers which partially led to controversial results (e.g.). We therefore developed a more sophisticated locus assignment method and evaluate its reliability by utilizing the information contained directly in the reported markers. For this variant-based approach we manually curated a high-quality data set to construct a network extending the knowledge on genetic overlaps between diseases as provided by GWA studies.
We defined the associated loci over the variant-based linkage disequilibrium (LD) measure r2 and, accordingly, expected the SLN and the SVN (Figure1B and C) to be of similar shape. However, when visually comparing the networks, significant differences in size and structure became apparent. Therefore, we performed further analyses to compare established property measures of the networks in detail to investigate potential reasons for this divergence.
Shared locus vs. shared association analysis
Antagonistically linked traits
Antagonistically Linked Traits (Loci)
AST (17q12); T1D (1p13, 16p11); VIT (1p13); T2D (2p21); RA (1p13); SLE (1p13); PS (19p13); MS (17q21)
CD (17q12); UC (17q12); BLC (17q12); RA (17q12); PS (5q31)
COR (9p21); T2D (9p21); GLA (9p21); IPF (5p15) ; TN (5p15)
AST (17q12); MS (20q13); CD (1p13)
RA (20q13); HCC (1p36); CD (17q21)
AST (17q12); CeD (1p36); SLE (1q23)
PN (5p15); IPF (5p15); TN (5p15)
GLI (5p15); LN (5p15)
CD (1p13); MEL (11q14)
DIABETES MELLITUS, TYPE 1
CD (1p13, 16p11); IBD (1p13)
DIABETES MELLITUS, TYPE 2
GLI (9p21); CD (2p21)
UC (1p36); CRC (3q26)
IDIOPATHIC PULMONARY FIBROSIS
GLI (5p15); LN (5p15)
AST (5q31); CD (19p13)
LUPUS ERYTHEMATOSUS, SYSTEMIC
CD (1p13); UC (1q23)
INFLAMMATORY BOWEL DISEASES
LIVER CIRRHOSIS, BILIARY
HEAD AND NECK NEOPLASMS
Agonistically linked traits
Agonistically Linked Traits (Loci)
UC (1p31, 1q32, 3p21, 5p13, 9p24, 9q32, 9q34, 10q24, 21q21, 21q22); IBD (1p31, 9q32, 10q22, 16q12, 20q13, 22q12); CeD (2q12, 18p11, 22q11); LEP (9q32, 13q14); MG (22q12); T1D (1q32, 18p11); SC (3p21); OVARIAN NEOPLASMS (8q24); AS (1p31); LL (2q37); HTG (2p23); AA (11q13); MS (10p15); AD (11q13)
CeD (1p36, 4q27, 6q23); SLE (2q32, 7q32, 8p23); T1D (2q33, 4p15, 21q22); BLC (7q32); SS (7q32); FL (6p21); UC (6q23); LBL (6p21); SCHIZOPHRENIA (6p21); VIT (21q22)
CD (2q12, 18p11, 22q11); RA (1p36, 4q27, 6q23); UC (2p16, 6q23); T1D (18p11); BLC (7p14); VIT (3q28); HYP (12q24); SLE (6p21); MG (6p21)
CD (1p31, 1q32, 3p21, 5p13, 9p24, 9q32, 9q34, 10q24, 21q21, 21q22); CeD (2p16, 6q23); AS (1p31, 2q11); IBD (1p31, 21q22); SC (3p21); PS (1p31); RA (6q23); T1D (1q32); SLE (7q32)
MCI (1p13, 1p32, 1q41, 2q33, 6p24, 9p21, 10q11, 19p13, 21q22); CAD (1p13, 3q22, 9p21); ICA (10q24); PD (10q24); AAA (9p21); HYP (10q24); HTG (11q23)
LUPUS ERYTHEMATOSUS, SYSTEMIC
RA (2q32, 7q32, 8p23); SS (2q32, 7q32); LN (6p21); BLC (7q32); MG (6p21); UC (7q32); CeD (6p21)
DIABETES MELLITUS, TYPE 1
RA (2q33, 4p15, 21q22); CD (2q33, 4p15, 21q22); CeD (18p11); AA (12q13); VIT (21q22); UC (1q32)
ICA (10q24); PD (10q24); COR (10q24); KIDNEY FAILURE, CHRONIC (16p12); CeD (12q24)
INFLAMMATORY BOWEL DISEASES
CD (1p31, 9q32, 10q22, 16q12, 20q13, 22q12); LEP (9q32); UC (1p31, 21q22); MG (22q12); AS (1p31)
LIVER CIRRHOSIS, BILIARY
RA (7q32); SLE (7q32); SS (7q32); MS (12p13); CeD (7p14)
CORONARY ARTERY DISEASE
COR (1p13, 3q22, 9p21); MCI (1p13, 9p21); AAA (9p21); T2D (2q36)
IBD (22q12); CD (22q12); SLE (6p21); CeD (6p21)
AORTIC ANEURYSM, ABDOMINAL
COR (9p21); CAD (9p21); MCI (9p21)
PD (10q24); HYP (10q24); COR (10q24)
SLE (6p21); COPD (15q25); PVD (15q25)
LBL (11q24); RA (6p21); LL (6p21)
COR (1p13, 1p32, 1q41, 2q33, 6p24, 9p21, 10q11, 19p13, 21q22); CAD (1p13, 9p21); AAA (9p21)
HYP (10q24); COR (10q24); ICA (10q24)
SLE (2q32, 7q32); RA (7q32); BLC (7q32)
UC (1p31, 2q11); IBD (1p31); CD (1p31)
CeD (3q28); RA (21q22); T1D (21q22)
T1D (12q13); CD (11q13)
UC (3q21); CD (3q21)
GLIOMA (20q13); CD (11q13)
DIABETES MELLITUS, TYPE 2
OBESITY (16q12); CAD (2q36)
HEAD AND NECK NEOPLASMS
GASTROINTESTINAL NEOPLASMS (10q23); ALCOHOLISM (12q24)
CD (2p23); COR (11q23)
CD (9q32, 13q14); IBD (9q32)
CD (2q27); FL (11q24)
LYMPHOMA, LARGE B-CELL, DIFFUSE
FL (6q21); RA (6q21)
CD (10p15); BLC (12p13)
PERIPHERAL VASCULAR DISEASES
COPD (15q25); LN (15q25)
COLORECTAL NEOPLASMS (8q24); ENDOMETRIAL NEOPLASMS (17q12)
UC (1p31); ARTHRITIS, PSORIATIC (6q21)
PULMONARY DISEASE, CHRONIC OBSTRUCTIVE
LN (15q25); PVD (15q25)
Topology of the SVN
Its degree distribution attributes the SVN a scale-free network, i.e. it approximates a power-law (P(k) ∼ k−γ; γ = 1.32; R2 = 0.69) (Additional file4: Figure S2). Interestingly, also when considering the two node types separately, disease nodes (γ = 0.97; R2 = 0.71) as well as locus nodes (γ = 2.98; R2 = 0.93) show scale-free degree distributions (Additional file4: Figure S2). The scale-free property classifies the network (and its two sets of node types, respectively) as structured, i.e. non-random. It has to be considered that the limited size of the SVN leads to inaccuracies in distribution fitting and thus reduces the explanatory value of this observation. However, as clinically related diseases (i.e. diseases which present similar symptoms) should present a higher genetic overlap than unrelated disorders, this finding meets expectations.
The variant-based SVN also shows no artificial character with regards to its topology. Both locus and disease node sets comprise hubs, here defined as nodes with a degree >3, which form the central elements in the network. As in each GWAS multiple markers are associated with a single disease, one would expect hubs to be constituted mostly of disease nodes. In line with that, 74% of the hubs in our network are disease nodes. The remaining 26% are loci hubs (seven gene loci and three intergenic loci). Several of these loci have been previously identified as influencing susceptibility to multiple diseases like the HLA region on chromosome 6, a cancer locus at chromosome 8q24, and a coronary artery disease locus at chromosome 9p21. Further hub loci are PTPN22, a known player across several autoimmunity disorders, and IL23R, which has been shown to direct inflammatory processes. In addition, we observed hubs which have not yet been described as predisposing to a whole group of diseases, such as TNPO3 which appears to predispose to various autoimmune diseases like systemic lupus erythematosus, systemic scleroderma, and rheumatoid arthritis[19–21], or TNFSF15, which shows associations with several inflammatory diseases[22–25]. As expected, in the majority of cases the traits linked to one hub can be assigned to the same disease group and, further, diseases which are not obviously related to other disorders linked to the respective hub are mostly associated with antagonistic signals. For instance, in a four-gene locus at chromosome 17q12 (GSDML/IKZF3/ORMDL3/ZPBP2, see Additional file2: Table S1), four autoimmune diseases are associated with the same risk allele that in turn has opposite effects on asthma[20, 25–27]. Thus, our results indicate that loci associated with several diseases have an effect specific to a certain disease group rather than effects on unrelated diseases, and that, if there is an effect on an unrelated disease, it can often be distinguished by the direction of the effect.
Disease clustering mirrors trait relatedness
To identify shared and branching mechanisms we split the SNP association data into agonistic and antagonistic variants. Since in most cases there is no solid and comprehensive basis of experimental data that would allow for a more sensitive classification, we suggest that the best available indication of distinct effects of a variant on two diseases is the signal itself being different. Therefore, we define a SNP to be agonistic if all disorders are associated with the same risk allele of the SNP. Conversely, we consider a SNP antagonistic if the associated risk alleles differ between diseases. Accordingly, in the analysis of genetic overlaps as a measure of trait similarity only agonistic variants were included.
Overall, the outcome of the clustering poses the question of the extent of the influence of phenotype classification and population stratification on GWAS results. Frequent comorbidities (also of seemingly unrelated diseases such as obesity and cancer), diagnostic difficulties in highly related diseases like Crohn’s disease (CD) and ulcerative colitis (UC), and structural (genetic) differences in population subgroups are known to complicate GWAS and impact their outcomes. With growing sample sizes in case–control studies, the potential of false positives produced by such phenomena also increases. As a response, manifold control procedures to handle these and other confounding factors have been developed which are widely used and well appreciated. The heterogeneity of the clusters we retrieved once more highlights the need for the development and application of such methods.
Odds ratio as potential indicator of primary effects
In the context of agonistic association overlap between related diseases, we used the odds ratios (ORs) reported with the SNPs to investigate their impact on the respective traits. In general, the highest ORs are reported for associations of autoimmune diseases to the HLA locus on chromosome 6. Associations with traits where few gene variants with strong effects are reported, e.g. rs6107516 in the prion protein PRNP associated with Creutzfeldt-Jakob disease (OR = 38.5) or rs2071348 in the hemoglobin gene cluster at 11p15.4 associated with beta-thalassemia (OR = 4.33), are exceptions from the majority of associations displaying small ORs[33, 34]. Based on the effect size of variants associated with more than one trait and the same risk allele, we identified three patterns.
First, we identified variants which are likely to present general agonistic risk factors for a group of related diseases or syndromes such as rs13015714 in an interleukine receptor gene cluster at chromosome 2q12.1. This SNP is associated with celiac disease (CeD) and Crohn’s disease (CD) with equal ORs (OR = 1.19)[24, 35]. In cases of frequent comorbidities, though, comparable ORs have limited informative value. The SNP rs9939609 in the FTO gene for instance is associated with T2D and obesity with nearly equal ORs (OR = 1.34 and OR = 1.32) and thus appears to link two coequal traits of the metabolic syndrome (we refer to the definition of the International Diabetes Foundation, 2006)[36, 37]. However, for SNPs in the FTO gene it has been shown that adjustment for body mass index results in the loss of significance (OR∼ 1.0) of the association with T2D.
Second, SNPs appeared in several cases to be primarily associated with one disease, which in turn represents a risk factor for another associated trait. For instance, rs2200733 on chromosome 4q25 is linked to atrial fibrillation with a higher OR (OR = 1.72) than to stroke (OR = 1.26)[39, 40]. Another example is rs964184 which is located proximal to the apolipoprotein gene cluster on chromosome 11q23 which is associated with hypertriglyceridemia with a markedly higher OR (OR = 3.28) than to coronary disease (OR = 1.13)[41, 42]. The lower effects of the markers on the hypothesized “secondary sequels” may be explained by the fact that these are caused by the primary diseases, but with less than 100% penetrance.
Third, we speculate that the OR might allow conclusions with respect to the evaluation of an association in cases where similar traits are linked to the same SNP with diverging effect sizes. For instance, CD and ulcerative colitis (UC) share multiple risk loci. The two diseases are strongly related in their etiology and pathology. Thus, a clinical distinction of both diseases is difficult if based only on few criteria and might lead to inaccurate case ascertainment leading to mixed associations. However, for several SNPs such as rs11209026, which is located in the IL23R gene, we found notably higher effects on CD (OR = 3.84) than on UC (OR = 1.74)[25, 43]. Conversely, rs3024505 which lies proximal to the IL10 gene shows a greater effect on UC (OR = 1.46) than on CD (OR = 1.12)[24, 44]. Interestingly, it has been shown that IL23 is selectively upregulated in CD while levels in UC patients are normal and IL10 expression appears to be higher in UC as compared to CD[24, 45]. Thus, the OR might – similar to the above examples – allow for identifying potentially misleading associations in closely related diseases which may result from diagnostic errors.
Identification of branching etiologies
We searched for evidence that antagonistic signals represent genetic indicators of branching points in the etiologies of two diseases or disease groups. For the assessment of potentially multifunctional variants we therefore focused on markers with inverse effects. We identified 44 such variants, which represent almost 4% of the original association data analyzed and about 25% of the SNPs associated with more than one disease. Of those 44 variants, about one fifth (n = 9) are located in the HLA region. SNP-markers in that region are known to differ in their ability to capture the classical HLA-alleles and therefore were not considered further for the present analysis.
For cases where the function of the harboring genes is known, we were able to identify conclusive models. For instance, rs2736100 in the telomerase reverse transcriptase (TERT) gene was reported to exert antagonistic effects in idiopatic pulmonary fibrosis (IPF) and testicular germ cell tumor (TGCT) and two other cancer traits[47–53]. Whereas telomerase activity is generally upregulated in tumors sustaining proliferation and potentiating mutagenesis and transformation of cancer cells, in IPF limited cell division due to decreased telomerase activity is thought to contribute to the phenomenon of high percentages of apoptotic cells in fibroblasts. Consistent with that observation, disturbed telomerase activity in TGCT is believed to form a distinct mechanism of cancerogenesis in this tumor type. This distinction from other cancer traits is believed to be based on the fact that testicular germ cells are the only adult cell type with high telomerase expression. Another example is the telomerase RNA component TERC, which is essential for TERT functioning. Opposite alleles of SNP rs10936599 are associated with CeD and colorectal cancer (CRC)[35, 57]. Jones et al. showed that rs2293607, a variant tagged by rs10936599, alone is sufficient to modulate TERC expression. While in CRC this leads to TERC overexpression and longer telomeres, the opposite might apply to CeD, which exhibits telomere reduction and genomic instability[58, 59]. The observation that both constituents of the telomerase complex contain independent antagonistic variants is an intriguing finding. It suggests parallel, autonomous evolution of two functionally interacting loci gone to fixation at a trade-off between early cell senescence or increased apoptosis rates (as in IPF and CeD) and oncogenesis.
A further example is rs1393350 in the tyrosinase (TYR) gene where the opposite alleles are linked to vitiligo and melanoma[60, 61], potentially mirroring the inverse correlation observed for the two traits. The phenomenon is based on the presentation of TYR (self-) antigens on the cell surface of melanocytes. It is hypothesized that in vitiligo the immune system is hypersensitive towards TYR antigens, which are overexpressed in melanoma cells. A possible explanation may be that opposite alleles differentially influence the antigenicity of the TYR protein, thereby conferring protection from melanoma but susceptibility to vitiligo through immune surveillance and vice versa.
In cases of functionally less or uncharacterized genes and their involvement in the associated diseases, our approach can still be used to suggest potential trait-specific effects. Antagonistic effects of rs12720356 (localized in the TYK2 gene) in CD and psoriasis, for instance, might point towards different patterns of cytokine signaling in these two diseases[24, 63, 64]. Likewise, rs12727642 and rs35675666, both located in the PARK7 gene and inversely associated with CeD and UC, could indicate differential effects of oxidative stress on each trait[25, 35].
Variant-based analysis of joint and disjoint genetic features
In this study, we identified overlapping genetic associations and their corresponding loci with analogous or contrasting effects on different diseases. We addressed the methodological challenges of the identification of the functional entities affected by GWAS-detected variants.
Associations formally implicate genomic regions which are captured via tagging SNPs representing haplotype blocks. By using the population-specific LD-based haplotype data provided by the HapMap project[65, 66] or, more recently, the 1000 genomes project, SNP arrays are constructed aiming at a high coverage of the total genome variation, but without considering biologically functional aspects. The advantage of GWAS as a method is its unbiased approach to identify genomic regions compromised in a disease; a major drawback is that the association of markers without knowledge of the causal variants and their effects does not allow for a straightforward biological interpretation.
As we show, the reliability of an automated assignment of LD-based loci to the trait-associated variants is strongly context-dependent. Especially in cases of high gene density or, conversely, in intergenic regions/gene deserts, resolving GWAS signals is not possible without further knowledge. Simplifications such as more basic locus assignment approaches which neglect the LD structure of the genome (e.g. classifying a SNP as affecting only the most proximal gene) may seem more intuitive, might facilitate analyses and could be useful to identify causal disease-gene associations. These correct associations of genes which are detected through significant enrichment of a harbored tagging variant in a patient cohort may not be discovered when incorporating LD data in cases where the LD block of the respective variant spans across several genes. However, such approaches disregard a basic principle defining the current GWAS paradigm, namely the use of LD information in the design of genotyping arrays to achieve the genome-wide coverage of common SNPs. Hence, it can be problematic to project the variant-based GWAS data on genes or loci. Accordingly, we decided to use variant-based methods and concentrated on strong gene candidates identified via the gene function of single-gene loci whenever suggesting potential biological effects of the considered variants.
In the analysis of genetic overlaps we followed the hypothesis that the effects of variants shared across several diseases correspond to the reported risk alleles. If the risk allele is the same in all associated diseases, we assume the effect to be the same, i.e. that there is a common underlying etiology. For closely related diseases a positive correlation is not surprising, e.g. a GWAS on psoriatic arthritis (PSA) will also detect agonistic variants such as rs33980500 that are also associated with psoriasis (PS)[68, 69]. Indeed, the vast majority of agonistic variants in our data set links groups of related diseases and thus may mark interesting target regions for therapeutic interventions. However, we also found a few agonistic signals connecting apparently unrelated diseases, e.g. rs6010620 which exerts susceptibility for both glioma and atopic dermatitis (AD)[50, 51, 70, 71]. If our hypothesis is correct, an endophenotype influencing both diseases may be present which has yet to be identified. For antagonistic SNPs, on the other hand, we describe plausible mechanisms that may render variants protective against one trait and predisposing to another, labeling the affected genes/loci as pleiotropic. If pleiotropic effects are as frequent as evolutionary modelers postulate[2, 72] and this effects can be identified by analyses based on GWAS, this might have great implications for the development and use of therapeutics because it would enable avoidance of potential side effects when targeting such loci. Already, there are more than 50 genotype/drug interactions known for which therapeutic dosing recommendations are available.
Our results present new starting points for studying the genetics of complex diseases. The observation that more than 15% of the SNPs considered in our study are associated both agonistically and antagonistically with related as well as unrelated disorders indicates that the molecular mechanisms influencing causes and progress of human diseases are in part interrelated. Genetic overlaps between two diseases also suggest the importance of the affected entities in the specific pathogenic pathways and should be investigated further. These may be secondary, such as genes involved in inflammatory responses related to T2D as well as cancer[30, 38]. The findings presented also demonstrate the need to clarify the relation of any phenotype linked to an associated marker. For directly interrelated diseases such as PS and PSA often PS patients without present arthritis or arthritis in the past are used as additional control group. Associations are then interpreted as PSA-specific if not as strongly associated with PS[74, 75]. Comparable procedures may proof useful in frequently co-occurring diseases genetically linked by agonistic variants. Nevertheless, the complex genetics of multifactorial diseases asks for a better understanding of the functions underlying common disorders. An improved characterization of the endophenotype, such as metabolite or protein concentrations, may enhance our understanding of identical pathomechanisms that link agonistic genetic loci to clinically distinct traits. Pleiotropic effects, on the other hand, that are harbored in the same locus may trigger different mechanisms interfering with the genetic or environmental background. The detailed examination of antagonistically associated loci may thus lead to first insight into the mechanism of the various types of pleiotropy in human diseases.
Association selection and curation
Construction of GWAS networks
For the construction of the locus-based data representation, we defined an associated locus as the whole genomic region captured by SNPs in strong LD, r2 ≥ 0.8, with the marker originally reported in a GWAS contained in our data set. The locus is then characterized as all genes located within this genomic region (referred to as “gene locus”) (Figure2). If the region contains no genes, the locus is assigned to its chromosomal location (referred to as “intergenic locus”). LD data and gene information were obtained with the SNAP tool. After locus assignment, our final data set consisted of 111 different traits linked via 1,120 SNPs to 508 gene loci and 226 intergenic loci.
Based on this list we constructed a bipartite graph consisting of two disjoint sets of nodes (Figure1A) representing the complete association data. The first node set corresponds to the traits, whereas the other set comprises the associated loci. Two nodes are connected by an edge if a variant within the respective locus is associated with the corresponding trait. By removal of isolated traits, i.e. traits which share no associated locus with another trait (n = 27) (Figure1A), and cutting out loci which are associated with only one trait (n = 577), we retrieved the SLN (Figure1B).
To obtain a variant-based representation of the data, we repeated the network generation on marker scale by utilizing the set of variants associated with more than one distinct trait. For this, we used the LD data to mutually assign the associated traits of sentinel SNPs in pairwise LD if not already present. In other words, each variant is, in addition to its own associated traits, assigned the traits associated with all correlated SNPs. This set consists of 175 SNPs located in 94 loci and associated with 55 diseases (Additional file2: Table S1). In the resulting bipartite SVN, a trait and a locus are linked if the locus contains a variant which comprises associations with this and at least one other trait. Here, the allele information was included in the graph visualization by coloring of the edges (Figure1C). Both the SLN and the SVN are provided as machine-readable files, see Additional files5 and6.
where N i is the number of neighbors of i and S(i, j) is the number of shared neighbors of nodes i and j (undefined if i and j do not share a neighbor) plus one if j is a neighbor of i.
for the linear transformation of the power-law functions, i.e. ln y = a + b ln x.
Determination of agonistic and antagonistic effects
For all variants associated with more than one trait, we manually extracted the risk alleles (OR > 1, independently of major or minor allele status) and odds ratios from the reporting studies. The alleles were mapped to the forward DNA strand according to dbSNP 131. The same procedure was applied to markers which were indirectly associated with a trait over LD. If for all traits the same associated risk allele (and corresponding allele, respectively) was reported, the SNP was classified as agonistic. If the risk alleles of a SNP were opposed in the associated diseases, the variant was classified as antagonistic.
We applied complete-linkage hierarchical clustering to identify groups of traits genetically overlapping with respect to agonistic signals. Normalization was performed using the linear PCC defined as where the input are the vectors of the variant-based agonistic overlap of two distinct diseases X and Y to all other diseases. Thus, disorders which are clustered together show a homogeneous association overlap pattern to all other diseases, while diseases which are not clearly assigned to a cluster present a more heterogeneous pattern relatively unique in the SNP data. For cluster definition, we used a Euclidian distance threshold of 1.71. This threshold was determined as the maximal distance at which the six traits not correlating with other diseases (Figure3) remain non-clustered.
Calculation of the CPMA statistic for autoimmune loci
We downloaded the dataset S1 from and extracted the information on autoimmune-linked SNPs contained in the SVN. We used the Z-scores given in the file to compute two-sided P-values for all seven GWAS. Using the CPMA code provided onhttp://www.cotsapaslab.info/index.php/software/cpma/ we calculated the CPMA P-values as described in.
This work was supported by the Helmholtz Alliance on Systems Biology (project CoReNe and the Joint Technology Platforms); the German Federal Ministry of Education and Research as part of the National Genome Research Network [NGFN 01GS 0429, NGFN 01GS 0818]; the German Federal Ministry of Education and Research in its MedSys initiative (project SysMBo) [FKZ: 0315494A]; the TUM Graduate School for Information Science in Health; the Christiane Kühne Center for Allergy Research and Education (http://www.ck-care.ch/); the Schleswig-Holstein Excellence Cluster “Inflammation at Interfaces”; and a Heisenberg professorship of the DFG [WE 2678/4-1 to S.W.]. The funders had no role in study design, data collection, interpretation and analysis, decision to publish, or preparation of the manuscript.
- Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA: Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009, 106 (23): 9362-9367.PubMed CentralView ArticlePubMedGoogle Scholar
- Stranger BE, Stahl EA, Raj T: Progress and promise of genome-wide association studies for human complex trait genetics. Genetics. 2011, 187 (2): 367-383.PubMed CentralView ArticlePubMedGoogle Scholar
- Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, et al: Finding the missing heritability of complex diseases. Nature. 2009, 461 (7265): 747-753.PubMed CentralView ArticlePubMedGoogle Scholar
- Wei Z, Wang K, Qu HQ, Zhang H, Bradfield J, Kim C, Frackleton E, Hou C, Glessner JT, Chiavacci R, et al: From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes. PLoS Genet. 2009, 5 (10): e1000678-PubMed CentralView ArticlePubMedGoogle Scholar
- So HC, Li MX, Sham PC: Uncovering the total heritability explained by All true susceptibility variants in a genome-wide association study. Genet Epidemiol. 2011, 35 (6): 447-456.PubMedGoogle Scholar
- Frazer KA, Murray SS, Schork NJ, Topol EJ: Human genetic variation and its contribution to complex traits. Nat Rev Genet. 2009, 10 (4): 241-251.View ArticlePubMedGoogle Scholar
- Wagner GP, Zhang J: The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat Rev Genet. 2011, 12 (3): 204-213.View ArticlePubMedGoogle Scholar
- Cotsapas C, Voight BF, Rossin E, Lage K, Neale BM, Wallace C, Abecasis GR, Barrett JC, Behrens T, Cho J, et al: Pervasive sharing of genetic effects in autoimmune disease. PLoS Genet. 2011, 7 (8): e1002254-PubMed CentralView ArticlePubMedGoogle Scholar
- Sirota M, Schaub MA, Batzoglou S, Robinson WH, Butte AJ: Autoimmune disease classification by inverse association with SNP alleles. PLoS Genet. 2009, 5 (12): e1000792-PubMed CentralView ArticlePubMedGoogle Scholar
- Zhernakova A, van Diemen CC, Wijmenga C: Detecting shared pathogenesis from the shared genetics of immune-related diseases. Nat Rev Genet. 2009, 10 (1): 43-55.View ArticlePubMedGoogle Scholar
- Harismendy O, Notani D, Song X, Rahim NG, Tanasa B, Heintzman N, Ren B, Fu XD, Topol EJ, Rosenfeld MG, et al: 9p21 DNA variants associated with coronary artery disease impair interferon-gamma signalling response. Nature. 2011, 470 (7333): 264-268.PubMed CentralView ArticlePubMedGoogle Scholar
- Meyer KB, Maia AT, O'Reilly M, Ghoussaini M, Prathalingam R, Porter-Gill P, Ambs S, Prokunina-Olsson L, Carroll J, Ponder BAJ: A functional variant at a prostate cancer predisposition locus at 8q24 is associated with PVT1 expression. PLoS Genet. 2011, 7 (7): e1002165-PubMed CentralView ArticlePubMedGoogle Scholar
- Barabasi AL, Gulbahce N, Loscalzo J: Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011, 12 (1): 56-68.PubMed CentralView ArticlePubMedGoogle Scholar
- Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabasi AL: The human disease network. Proc Natl Acad Sci U S A. 2007, 104 (21): 8685-8690.PubMed CentralView ArticlePubMedGoogle Scholar
- Barrenas F, Chavali S, Holme P, Mobini R, Benson M: Network properties of complex human disease genes identified through genome-wide association studies. PLoS One. 2009, 4 (11): e8090-PubMed CentralView ArticlePubMedGoogle Scholar
- Klein J, Sato A: The HLA system. Second of two parts. N Engl J Med. 2000, 343 (11): 782-786.View ArticlePubMedGoogle Scholar
- Gregersen PK: Gaining insight into PTPN22 and autoimmunity. Nat Genet. 2005, 37 (12): 1300-1302.View ArticlePubMedGoogle Scholar
- Di Meglio P, Di Cesare A, Laggner U, Chu CC, Napolitano L, Villanova F, Tosi I, Capon F, Trembath RC, Peris K, et al: The IL23R R381Q gene variant protects against immune-mediated diseases by impairing IL-23-induced Th17 effector response in humans. PLoS One. 2011, 6 (2): e17160-PubMed CentralView ArticlePubMedGoogle Scholar
- Chung SA, Taylor KE, Graham RR, Nititham J, Lee AT, Ortmann WA, Jacob CO, Alarcon-Riquelme ME, Tsao BP, Harley JB, et al: Differential genetic associations for systemic lupus erythematosus based on anti-dsDNA autoantibody production. PLoS Genet. 2011, 7 (3): e1001323-PubMed CentralView ArticlePubMedGoogle Scholar
- Stahl EA, Raychaudhuri S, Remmers EF, Xie G, Eyre S, Thomson BP, Li Y, Kurreeman FA, Zhernakova A, Hinks A, et al: Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet. 2010, 42 (6): 508-514.PubMed CentralView ArticlePubMedGoogle Scholar
- Radstake TR, Gorlova O, Rueda B, Martin JE, Alizadeh BZ, Palomino-Morales R, Coenen MJ, Vonk MC, Voskuyl AE, Schuerwegh AJ, et al: Genome-wide association study of systemic sclerosis identifies CD247 as a new susceptibility locus. Nat Genet. 2010, 42 (5): 426-429.PubMed CentralView ArticlePubMedGoogle Scholar
- Kugathasan S, Baldassano RN, Bradfield JP, Sleiman PM, Imielinski M, Guthery SL, Cucchiara S, Kim CE, Frackelton EC, Annaiah K, et al: Loci on 20q13 and 21q22 are associated with pediatric-onset inflammatory bowel disease. Nat Genet. 2008, 40 (10): 1211-1215.PubMed CentralView ArticlePubMedGoogle Scholar
- Barrett JC, Hansoul S, Nicolae DL, Cho JH, Duerr RH, Rioux JD, Brant SR, Silverberg MS, Taylor KD, Barmada MM, et al: Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat Genet. 2008, 40 (8): 955-962.PubMed CentralView ArticlePubMedGoogle Scholar
- Franke A, McGovern DP, Barrett JC, Wang K, Radford-Smith GL, Ahmad T, Lees CW, Balschun T, Lee J, Roberts R, et al: Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat Genet. 2010, 42 (12): 1118-1125.PubMed CentralView ArticlePubMedGoogle Scholar
- Anderson CA, Boucher G, Lees CW, Franke A, D'Amato M, Taylor KD, Lee JC, Goyette P, Imielinski M, Latiano A, et al: Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nat Genet. 2011, 43 (3): 246-252.PubMed CentralView ArticlePubMedGoogle Scholar
- Liu X, Invernizzi P, Lu Y, Kosoy R, Lu Y, Bianchi I, Podda M, Xu C, Xie G, Macciardi F, et al: Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis. Nat Genet. 2010, 42 (8): 658-660.PubMed CentralView ArticlePubMedGoogle Scholar
- Moffatt MF, Gut IG, Demenais F, Strachan DP, Bouzigon E, Heath S, von Mutius E, Farrall M, Lathrop M, Cookson WO, et al: A large-scale, consortium-based genomewide association study of asthma. N Engl J Med. 2010, 363 (13): 1211-1221.PubMed CentralView ArticlePubMedGoogle Scholar
- Hidalgo CA, Blumm N, Barabasi AL, Christakis NA: A dynamic network approach for the study of human phenotypes. PLoS Comput Biol. 2009, 5 (4): e1000353-PubMed CentralView ArticlePubMedGoogle Scholar
- Park J, Lee DS, Christakis NA, Barabasi AL: The impact of cellular networks on disease comorbidity. Mol Syst Biol. 2009, 5: 262-PubMed CentralView ArticlePubMedGoogle Scholar
- Basen-Engquist K, Chang M: Obesity and cancer risk: recent review and evidence. Curr Oncol Rep. 2011, 13 (1): 71-76.PubMed CentralView ArticlePubMedGoogle Scholar
- Guindi M, Riddell RH: Indeterminate colitis. J Clin Pathol. 2004, 57 (12): 1233-1244.PubMed CentralView ArticlePubMedGoogle Scholar
- Rodriguez-Murillo L, Greenberg DA: Genetic association analysis: a primer on how it works, its strengths and its weaknesses. Int J Androl. 2008, 31 (6): 546-556.View ArticlePubMedGoogle Scholar
- Mead S, Poulter M, Uphill J, Beck J, Whitfield J, Webb TE, Campbell T, Adamson G, Deriziotis P, Tabrizi SJ, et al: Genetic risk factors for variant Creutzfeldt-Jakob disease: a genome-wide association study. Lancet Neurol. 2009, 8 (1): 57-66.PubMed CentralView ArticlePubMedGoogle Scholar
- Nuinoon M, Makarasara W, Mushiroda T, Setianingsih I, Wahidiyat PA, Sripichai O, Kumasaka N, Takahashi A, Svasti S, Munkongdee T, et al: A genome-wide association identified the common genetic variants influence disease severity in beta0-thalassemia/hemoglobin E. Hum Genet. 2010, 127 (3): 303-314.View ArticlePubMedGoogle Scholar
- Dubois PC, Trynka G, Franke L, Hunt KA, Romanos J, Curtotti A, Zhernakova A, Heap GA, Adany R, Aromaa A, et al: Multiple common variants for celiac disease influencing immune gene expression. Nat Genet. 2010, 42 (4): 295-302.PubMed CentralView ArticlePubMedGoogle Scholar
- Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, Duncanson A, Kwiatkowski DP, McCarthy MI, Ouwehand WH, Samani NJ, et al: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007, 447 (7145): 661-678.View ArticleGoogle Scholar
- Frayling TM, Timpson NJ, Weedon MN, Zeggini E, Freathy RM, Lindgren CM, Perry JRB, Elliott KS, Lango H, Rayner NW, et al: A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity. Science. 2007, 316 (5826): 889-894.PubMed CentralView ArticlePubMedGoogle Scholar
- Renstrom F, Payne F, Nordstrom A, Brito EC, Rolandsson O, Hallmans G, Barroso I, Nordstrom P, Franks PW, Consortium G: Replication and extension of genome-wide association study results for obesity in 4923 adults from northern Sweden. Hum Mol Genet. 2009, 18 (8): 1489-1496.PubMed CentralView ArticlePubMedGoogle Scholar
- Gudbjartsson DF, Arnar DO, Helgadottir A, Gretarsdottir S, Holm H, Sigurdsson A, Jonasdottir A, Baker A, Thorleifsson G, Kristjansson K, et al: Variants conferring risk of atrial fibrillation on chromosome 4q25. Nature. 2007, 448 (7151): 353-357.View ArticlePubMedGoogle Scholar
- Gretarsdottir S, Thorleifsson G, Manolescu A, Styrkarsdottir U, Helgadottir A, Gschwendtner A, Kostulas K, Kuhlenbaumer G, Bevan S, Jonsdottir T, et al: Risk variants for atrial fibrillation on chromosome 4q25 associate with ischemic stroke. Ann Neurol. 2008, 64 (4): 402-409.View ArticlePubMedGoogle Scholar
- Johansen CT, Wang J, Lanktree MB, Cao H, McIntyre AD, Ban MR, Martins RA, Kennedy BA, Hassell RG, Visser ME, et al: Excess of rare variants in genes identified by genome-wide association study of hypertriglyceridemia. Nat Genet. 2010, 42 (8): 684-687.PubMed CentralView ArticlePubMedGoogle Scholar
- Schunkert H, Konig IR, Kathiresan S, Reilly MP, Assimes TL, Holm H, Preuss M, Stewart AF, Barbalic M, Gieger C, et al: Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat Genet. 2011, 43 (4): 333-338.PubMed CentralView ArticlePubMedGoogle Scholar
- Duerr RH, Taylor KD, Brant SR, Rioux JD, Silverberg MS, Daly MJ, Steinhart AH, Abraham C, Regueiro M, Griffiths A, et al: A genome-wide association study identifies IL23R as an inflammatory bowel disease gene. Science. 2006, 314 (5804): 1461-1463.PubMed CentralView ArticlePubMedGoogle Scholar
- Franke A, Balschun T, Karlsen TH, Sventoraityte J, Nikolaus S, Mayr G, Domingues FS, Albrecht M, Nothnagel M, Ellinghaus D, et al: Sequence variants in IL10, ARPC2 and multiple other loci contribute to ulcerative colitis susceptibility. Nat Genet. 2008, 40 (11): 1319-1323.View ArticlePubMedGoogle Scholar
- Madsen K: Combining T cells and IL-10: a new therapy for Crohn's disease?. Gastroenterology. 2002, 123 (6): 2140-2144.View ArticlePubMedGoogle Scholar
- de Bakker PI, McVean G, Sabeti PC, Miretti MM, Green T, Marchini J, Ke X, Monsuur AJ, Whittaker P, Delgado M, et al: A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nat Genet. 2006, 38 (10): 1166-1172.PubMed CentralView ArticlePubMedGoogle Scholar
- Landi MT, Chatterjee N, Yu K, Goldin LR, Goldstein AM, Rotunno M, Mirabello L, Jacobs K, Wheeler W, Yeager M, et al: A genome-wide association study of lung cancer identifies a region of chromosome 5p15 associated with risk for adenocarcinoma. Am J Hum Genet. 2009, 85 (5): 679-691.PubMed CentralView ArticlePubMedGoogle Scholar
- Miki D, Kubo M, Takahashi A, Yoon KA, Kim J, Lee GK, Zo JI, Lee JS, Hosono N, Morizono T, et al: Variation in TP63 is associated with lung adenocarcinoma susceptibility in Japanese and Korean populations. Nat Genet. 2010, 42 (10): 893-View ArticlePubMedGoogle Scholar
- Mushiroda T, Wattanapokayakit S, Takahashi A, Nukiwa T, Kudoh S, Ogura T, Taniguchi H, Kubo M, Kamatani N, Nakamura Y, et al: A genome-wide association study identifies an association of a common variant in TERT with susceptibility to idiopathic pulmonary fibrosis. J Med Genet. 2008, 45 (10): 654-656.View ArticlePubMedGoogle Scholar
- Sanson M, Hosking FJ, Shete S, Zelenika D, Dobbins SE, Ma Y, Enciso-Mora V, Idbaih A, Delattre JY, Hoang-Xuan K, et al: Chromosome 7p11.2 (EGFR) variation influences glioma risk. Hum Mol Genet. 2011, 20 (14): 2897-2904.PubMed CentralView ArticlePubMedGoogle Scholar
- Shete S, Hosking FJ, Robertson LB, Dobbins SE, Sanson M, Malmer B, Simon M, Marie Y, Boisselier B, Delattre JY, et al: Genome-wide association study identifies five susceptibility loci for glioma. Nat Genet. 2009, 41 (8): 899-904.PubMed CentralView ArticlePubMedGoogle Scholar
- Hsiung CA, Lan Q, Hong YC, Chen CJ, Hosgood HD, Chang IS, Chatterjee N, Brennan P, Wu C, Zheng W, et al: The 5p15.33 Locus is associated with risk of lung adenocarcinoma in never-smoking females in Asia. PLoS Genet. 2010, 6 (8): e1001051-PubMed CentralView ArticlePubMedGoogle Scholar
- Turnbull C, Rapley EA, Seal S, Pernet D, Renwick A, Hughes D, Ricketts M, Linger R, Nsengimana J, Deloukas P, et al: Variants near DMRT1, TERT and ATF7IP are associated with testicular germ cell cancer. Nat Genet. 2010, 42 (7): 604-607.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu Y, He K, Goldkorn A: Telomerase targeted therapy in cancer and cancer stem cells. Clin Adv Hematol Oncol. 2011, 9 (6): 442-455.PubMedGoogle Scholar
- Ramos C, Montano M, Garcia-Alvarez J, Ruiz V, Uhal BD, Selman M, Pardo A: Fibroblasts from idiopathic pulmonary fibrosis and normal lungs differ in growth rate, apoptosis, and tissue inhibitor of metalloproteinases expression. Am J Respir Cell Mol Biol. 2001, 24 (5): 591-598.View ArticlePubMedGoogle Scholar
- Schrader M, Burger AM, Muller M, Krause H, Straub B, Smith GL, Newlands ES, Miller K: Quantification of human telomerase reverse transcriptase mRNA in testicular germ cell tumors by quantitative fluorescence real-time RT-PCR. Oncol Rep. 2002, 9 (5): 1097-1105.PubMedGoogle Scholar
- Houlston RS, Cheadle J, Dobbins SE, Tenesa A, Jones AM, Howarth K, Spain SL, Broderick P, Domingo E, Farrington S, et al: Meta-analysis of three genome-wide association studies identifies susceptibility loci for colorectal cancer at 1q41, 3q26.2, 12q13.13 and 20q13.33. Nat Genet. 2010, 42 (11): 973-U989.View ArticlePubMedGoogle Scholar
- Jones AM, Beggs AD, Carvajal-Carmona L, Farrington S, Tenesa A, Walker M, Howarth K, Ballereau S, Hodgson SV, Zauber A, et al: TERC polymorphisms are associated both with susceptibility to colorectal cancer and with longer telomeres. Gut. 2011, 61 (2): 248-254.PubMed CentralView ArticlePubMedGoogle Scholar
- Cottliar A, Palumbo M, La Motta G, de Barrio S, Crivelli A, Viola M, Gomez JC, Slavutsky I: Telomere length study in celiac disease. Am J Gastroenterol. 2003, 98 (12): 2727-2731.View ArticlePubMedGoogle Scholar
- Bishop DT, Demenais F, Iles MM, Harland M, Taylor JC, Corda E, Randerson-Moor J, Aitken JF, Avril MF, Azizi E, et al: Genome-wide association study identifies three loci associated with melanoma risk. Nat Genet. 2009, 41 (8): 920-925.PubMed CentralView ArticlePubMedGoogle Scholar
- Jin Y, Birlea SA, Fain PR, Gowan K, Riccardi SL, Holland PJ, Mailloux CM, Sufit AJ, Hutton SM, Amadi-Myers A, et al: Variant of TYR and autoimmunity susceptibility loci in generalized vitiligo. N Engl J Med. 2010, 362 (18): 1686-1697.PubMed CentralView ArticlePubMedGoogle Scholar
- Spritz RA: The genetics of generalized vitiligo: autoimmune pathways and an inverse relationship with malignant melanoma. Genome Med. 2010, 2 (10): 78-PubMed CentralView ArticlePubMedGoogle Scholar
- Strange A, Capon F, Spencer CCA, Knight J, Weale ME, Allen MH, Barton A, Band G, Bellenguez C, Bergboer JGM, et al: A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1. Nat Genet. 2010, 42 (11): U985-U106.View ArticleGoogle Scholar
- Freeman AF, Holland SM: Clinical manifestations of hyper IgE syndromes. Dis Markers. 2010, 29 (3–4): 123-130.PubMed CentralView ArticlePubMedGoogle Scholar
- International HapMap C: A haplotype map of the human genome. Nature. 2005, 437 (7063): 1299-1320.View ArticleGoogle Scholar
- Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, Gibbs RA, Belmont JW, Boudreau A, Hardenbol P, International HapMap C, et al: A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007, 449 (7164): 851-861.View ArticlePubMedGoogle Scholar
- Genomes Project C: A map of human genome variation from population-scale sequencing. Nature. 2010, 467 (7319): 1061-1073.View ArticleGoogle Scholar
- Huffmeier U, Uebe S, Ekici AB, Bowes J, Giardina E, Korendowych E, Juneblad K, Apel M, McManus R, Ho P, et al: Common variants at TRAF3IP2 are associated with susceptibility to psoriatic arthritis and psoriasis. Nat Genet. 2010, 42 (11): 996-999.PubMed CentralView ArticlePubMedGoogle Scholar
- Ellinghaus E, Ellinghaus D, Stuart PE, Nair RP, Debrus S, Raelson JV, Belouchi M, Fournier H, Reinhard C, Ding J, et al: Genome-wide association study identifies a psoriasis susceptibility locus at TRAF3IP2. Nat Genet. 2010, 42 (11): 991-995.PubMed CentralView ArticlePubMedGoogle Scholar
- Sun LD, Xiao FL, Li Y, Zhou WM, Tang HY, Tang XF, Zhang H, Schaarschmidt H, Zuo XB, Foelster-Holst R, et al: Genome-wide association study identifies two new susceptibility loci for atopic dermatitis in the Chinese Han population. Nat Genet. 2011, 43 (7): 690-694.View ArticlePubMedGoogle Scholar
- Wrensch M, Jenkins RB, Chang JS, Yeh RF, Xiao Y, Decker PA, Ballman KV, Berger M, Buckner JC, Chang S, et al: Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility. Nat Genet. 2009, 41 (8): 905-908.PubMed CentralView ArticlePubMedGoogle Scholar
- Caspari E: A synopsis of contemporary evolutionary thinking. Evol Int J Org Evol. 1949, 3 (4): 377-View ArticleGoogle Scholar
- Klein TE, Chang JT, Cho MK, Easton KL, Fergerson R, Hewett M, Lin Z, Liu Y, Liu S, Oliver DE, et al: Integrating genotype and phenotype information: an overview of the PharmGKB project. Pharmacogenetics research network and knowledge base. Pharmacogenomics J. 2001, 1 (3): 167-170.View ArticlePubMedGoogle Scholar
- Bowes J, Barton A: The genetics of psoriatic arthritis: lessons from genome-wide association studies. Discov Med. 2010, 10 (52): 177-183.PubMedGoogle Scholar
- Liu Y, Helms C, Liao W, Zaba LC, Duan S, Gardner J, Wise C, Miner A, Malloy MJ, Pullinger CR, et al: A genome-wide association study of psoriasis and psoriatic arthritis identifies new disease loci. PLoS Genet. 2008, 4 (3): e1000041-PubMed CentralView ArticlePubMedGoogle Scholar
- Yu W, Gwinn M, Clyne M, Yesupriya A, Khoury MJ: A navigator for human genome epidemiology. Nat Genet. 2008, 40 (2): 124-125.View ArticlePubMedGoogle Scholar
- Barnickel T, Weston J, Collobert R, Mewes HW, Stumpflen V: Large scale application of neural network based semantic role labeling for automated relation extraction from biomedical texts. PLoS One. 2009, 4 (7): e6393-PubMed CentralView ArticlePubMedGoogle Scholar
- Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, de Bakker PIW: SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008, 24 (24): 2938-2939.PubMed CentralView ArticlePubMedGoogle Scholar
- Dong J, Horvath S: Understanding network concepts in modules. BMC Syst Biol. 2007, 1: 24-PubMed CentralView ArticlePubMedGoogle Scholar
- Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S: A human protein-protein interaction network: a resource for annotating the proteome. Cell. 2005, 122 (6): 957-968.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.