- Open Access
Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters
BMC Genomics volume 16, Article number: S5 (2015)
Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs.
We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the statistical significance (α < 0.00025) of the differences in TBP affinity values between the minor and ancestral alleles of 4 out of the 22 SNPs: rs200487063, rs201381696, rs34104384, and rs183433761. We also measured half-life (t1/2), Gibbs free energy change (ΔG), and the association and dissociation rate constants, ka and kd, of the TBP-DNA complex for these SNPs.
Validation of the 22 candidate SNP markers by proper clinical protocols appears to have a strong rationale and may advance postgenomic predictive preventive personalized medicine.
Metabolic syndrome, one of the main global challenges for modern health care , involves elevated risk of several interrelated disorders: obesity, ischemic heart disease, hypertension, insulin resistance, type II diabetes mellitus, and dyslipidemia. It is commonly believed that obesity is among the key risk factors of metabolic syndrome . Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders  (such as hypertension [6, 7]), cancer [4, 5], diabetes [8, 9], and damage to kidneys and liver . It has also been reported that obesity correlates with reproductive disorders in women [11–13], prostate diseases in men , and elevated risk of congenital anomalies in children .
The discovery  that the leptin gene LEP is "the obesity gene" (OB; i.e., LEP ≡ OB) shed light on the regulation of energy metabolism . Leptin is a peptide hormone with molecular mass ~16 kD . Its secretion by adipocytes is directly proportional to adipose-tissue weight. Recessive mutations in the leptin gene are strongly associated with obesity in mice and humans . Leptin circulates with blood and regulates food consumption and energy demand of the brain. With a decrease in fat weight, the plasma leptin level decreases to increase appetite and suppress energy consumption until the recovery of fat weight. When fat weight increases, the level of circulating leptin increases and suppresses appetite until a fat weight reduction . Initially, it was believed that leptin is produced mainly by adipocytes , but later it was found that leptin is also produced in such organs as the stomach , heart , and placenta . In addition to body weight control, leptin plays other physiological roles. It modulates vascular tone and blood pressure and enhances angiogenesis and calcification of vascular cells [22–24]. Numerous studies showed a direct correlation between blood leptin concentrations and blood pressure . Elevated blood pressure contributes to atherosclerosis . Recent studies on the development of obesity in male Wistar rats uncovered an association of diet-induced fatty liver (steatosis) with insulin/leptin resistance .
Recent years have witnessed an increasing number of women with metabolic imbalance during pregnancy, with a variety of consequences; the mechanism of this metabolic imbalance is still poorly understood . The molecular pathways linking obesity to the above disorders are unclear  mostly because of the large number of genes related to the nervous system, endocrine system, and metabolic system that regulate energy metabolism. The contribution of a single gene is estimated to be 1-6% , which is close to the statistical significance threshold with the current accuracy of these estimates. Postgenomic predictive preventive medicine  offers a way around this obstacle (poor understanding of the pathogenesis of obesity as a whole) by taking into account numerous SNP markers of various partially obesity-related complications in the above-mentioned or other disorders [31, 32].
Analysis of SNP markers, which show differences between an individual human genome and the reference human genome (hg19), as part of postgenomic preventive personalized medicine allows for effective treatment , improvement of treatment , and prevention of complications of treatment . Genome-wide SNP identification is the goal of the project 1000 Genomes . The dbSNP database  documents current results of this project ; thus, the reference human genome, hg19, which is thought to contain the ancestral versions of all SNPs, is constantly refined. It is available in the Ensembl database  via the Web service UCSC Genome Browser . Ensembl contains data on gene knockouts in animals, and this information is helpful, for example, for reconstruction of perturbation networks of various disorders and for development of therapeutic strategies .
Computer-based analysis of hundreds of millions of unannotated SNPs in 1000 Genomes  that are documented in the dbSNP database  may accelerate the search for biomedical SNP markers [41, 42]. For this purpose, all the identified SNPs were mapped onto whole-genome maps of genes [38, 39] and onto protein-binding sites in DNA that were predicted in silico [43, 44] and detected in vivo using chromatin immunoprecipitation (ChIP), interchromosomal contacts, nucleosomes, transcriptomes either in health , during infection , disease , or after treatment . On the basis of these data, many Web services such as TFBS , ACTIVITY , is-rSNP , RegulomeDB , rSNP-MAPPER , RAVEN , SELEX_DB , FunSeq2 , APEG , FeatureScan [58, 59], SNPChIPTools , SNP-MED , SNAP , FunciSNP , SPOT , rSNP_Guide [42, 65], and ChroMoS  facilitate the search for candidate SNP markers in terms of ranking of unannotated SNPs by their similarity to biomedical SNP markers in accordance with projections of these SNPs onto whole-genome maps. According to the Central Limit Theorem, the accuracy of this similarity-based search for candidate SNP markers should increase as the number, diversity, representativeness, and completeness of genome-wide maps increase .
Due to this mainstream approach, the most impressive progress has been achieved with SNPs located in protein-coding gene regions  because of the invariant types of disruption in both structure and function of the altered proteins regardless of the cellular conditions . On the other hand, the effects of SNPs that are located in regulatory regions of genes [38, 70] are still difficult to predict in silico . Most studies in this field deal with regulatory SNPs in binding sites of TATA-binding protein (TBP) in the region [-70; -20] upstream of the transcription start of various mRNAs encoded in the human genome [72, 73] (assembly of the preinitiation complex starts with binding of RNA polymerase to the anchoring TBP/DNA complex [74, 75]). For this reason, model animals with a null-mutation  or a knockdown of TBP  are always inviable.
Previously, we developed a computer-assisted method for estimation of significance (Fisher's Z-score) of the difference between ancestral and minor SNP variants in terms of their effects on gene expression . Later, we confirmed the predictions of this method by our independent experiments in vitro under both equilibrium  and nonequilibrium  conditions of the electrophoretic mobility shift assay (EMSA). Then, we verified these empirical findings using a wide range of real-time assays, such as surface plasmon resonance, stopped-flow (on a ProteOn™ XPR36 biosensor; Bio-Rad Lab., USA) , and fluorescence resonance energy transfer (on an SX20 spectrometer; Applied Photophysics, UK) . In addition, we utilized independent results of over 100 experiments by others [83–90]. On the basis of this comprehensive validation [79–90] of the method , we designed the Web service SNP_TATA_Comparator (http://beehive.bionet.nsc.ru/cgi-bin/mgs/tatascan/start.pl) [91, 92] for the researchers who would like to analyze certain changes in core promoters of human genes.
In the present work, we cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on biochemical markers of obesity. Near the TBP-binding sites of human gene promoters, we found 22 obesity-related candidate SNP markers, including rs201381696 (obesity), rs200487063 and rs34104384 (both: obesity-caused hypertention) in the human gene (LEP) of leptin and rs183433761 (obesity resistance during a high-fat diet) in the human gene (GCG) of glucagon. We characterized them empirically and in terms of quantitative estimates of the equilibrium dissociation constant (K0D). We then validated these estimates in terms of the apparent dissociation constant, K*D ≡ kd/ka, the ratio of the association rate constant (ka), and the dissociation rate constant (kd) of the TBP-DNA complex by means of EMSA under nonequilibrium conditions in vitro. We found a significant linear correlation (r = 0.99; α < 0.01) between K0D values predicted in silico and K*D values determined by measurements in vitro for the obesity-related candidate SNP markers rs200487063, rs201381696, rs34104384, and rs183433761, which may be useful for postgenomic predictive preventive personalized medicine.
The results of in silicoanalysis of the known and candidate SNP markers in the TBP-binding sites of the human gene promoters
We applied our Web service SNP_TATA_Comparator [91, 92] to 68 biomedical and candidate SNP markers in the TBP-binding sites of the human gene promoters taken from our recent review . Table S1 (Additional file 1) shows the results.
The human IL1Bgene
(interleukin 1β) promoter contains the biomedical SNP marker rs1143627. This SNP is associated with greater body fat in older men , Graves' disease , gastric cancer , hepatocellular carcinoma , non-small cell lung cancer , gastric ulcer and chronic gastritis , and major recurrent depression (99). It is a widely studied regulatory SNP marker in the TBP-binding sites of human gene promoters. Previously, we also studied this SNP in vitro by EMSA under both equilibrium  and nonequilibrium  conditions as well as in silico .
The human APOA1gene
(apolipoprotein A-I) has the -35A→C substitution relative to the start of transcript number 3 of this gene. This SNP is inside a proven TATA box (the canonical form of TBP-binding sites) . It is associated with obesity, fatty liver, and hematuria . Using this SNP marker of obesity, we validated the suitability of our Web service SNP_TATA_Comparator for analysis of obesity-related polygenic human diseases (see Methods).
The human NOS2gene
(inducible nitric oxide synthase 2) contains an SNP marker of epilepsy  and resistance to malaria . It is a -51T→C substitution relative to the start of transcript number 1  that causes NOS2 overexpression according to clinical research [100–102]. We have previously studied this SNP in depth using EMSA [79, 80] and our computer-based method .
In this work, an additional keyword search (hereinafter, see Methods) pointed to new data suggesting that NOS2 overexpression may be a biochemical marker of obesity . On the basis of this empirical observation , we propose the -51T→C substitution in the NOS2 gene promoter as a candidate SNP marker of obesity (Table S1, Additional file 1).
The human PGRgene
(progesterone receptor) contains the biomedical SNP marker rs10895068 causing de novo appearance of a spurious TBP-biding site along with an additional pathogenic transcription start site (TSS) at position +270 relative to the normal TSS for transcript number 2 of the same gene. This SNP is associated with endometrial cancer in obese women . It was previously analyzed by our computer-based method .
Here, using a keyword search, we found new clinical data showing that PGR overexpression is a biochemical marker of male breast cancer (T4 tumor) in obesity . With this in mind, we propose rs10895068 as a candidate SNP marker of breast cancer in obese men (Table S1, Additional file 1).
The human ESR2gene
(estrogen receptor β) contains the SNP marker (rs35036378) of a primary ESR2-deficient pT1 tumor that can transform into breast cancer without preventive treatment . An additional keyword search pointed to new clinical data showing that an ESR2 deficiency can reduce the risk of obesity after ovariectomy . According to these data, we propose the SNP s10895068 as a candidate marker of the low risk of obesity after ovariectomy (Table S1, Additional file 1).
The human HSD17B1gene
(placental 17-β-hydroxysteroid dehydrogenase) contains an SNP marker (rs201739205) of breast cancer . Nevertheless, using a keyword search, we found a new promising study suggesting that weight loss in HSD17B1-deficient obese postmenopausal women by means of the appropriate diet and/or exercise reduces the risk of obesity-related cancers . Thus, we propose rs201739205 as a possible predictive SNP marker of effectiveness of preventive anticancer treatment among obese postmenopausal women (Table S1, Additional file 1).
The human MBL2gene
(soluble mannose-binding lectin 2, synonym: protein C) contains the SNP marker (rs72661131) of variable immunodeficiency , preeclampsia , and stroke  caused by deficient expression of this gene as we have also predicted in silico  and confirmed under both equilibrium  and nonequilibrium  conditions in vitro. A keyword search for "MBL2," "deficiency," and "obesity" pointed to clinical findings that an MBL2 deficiency can increase the risk of obesity . Therefore, rs72661131 may be a candidate SNP marker of obesity (Table S1, Additional file 1).
Near this biomedical SNP marker, we found one more SNP: rs562962093 (unannotated), and then, predicted that it can impair the expression of MBL2, too. This is why we propose rs562962093 as a candidate SNP marker of obesity  (Table S1, Additional file 1).
The human F7gene
(coagulation factor VII; synonym: proconvertin): its promoter contains a biomedical SNP, namely, the -35A→C substitution relative to the start of transcript number 1 of this gene. This SNP is a marker of moderate bleeding caused by an F7 deficiency .
An additional keyword search pinpointed the clinical finding of a statistically significant correlation between the total F7 level in obese patients with type 2 diabetes mellitus and the risk of cardiovascular complications . Hence, we suggest the -35A→C substitution within the human F7 gene promoter as a candidate SNP marker of the low risk of cardiovascular complications in obese patients with type 2 diabetes mellitus (Table S1, Additional file 1).
Near this biomedical SNP marker, we found two unannotated SNPs, rs367732974 and rs549591993, that can cause overexpression of the F7 gene (Table S1, Additional file 1). On the basis of the correlation , we propose the two SNPs rs367732974 and rs549591993 as candidate markers of the high risk of cardiovascular complications in obese patients with type 2 diabetes mellitus (Table S1, Additional file 1).
The human F3gene
(coagulation factor F3, synonym: tissue factor) contains a known SNP marker (rs563763767) of obesity , myocardial infarction, and thromboembolism caused by F3 overexpression  (Table S1, Additional file 1).
The human HBB and HBDgenes
(β- and δ-chains of hemoglobin, respectively) contain the best-studied TBP-binding sites that are altered by a number of SNP markers of resistance to malaria and thalassemia (Cooley's anemia) , namely: rs34500389, rs33981098, rs33980857, rs34598529, rs33931746, rs397509430, and rs35518301. Previously, we have analyzed most of them in depth using our computer-based method [83, 91] as well as using EMSA under both equilibrium  and nonequilibrium  conditions in vitro.
Using a keyword search, we found new clinical data showing that a hemoglobin deficiency can serve as a biochemical marker of chronic inflammation in comorbidities of obesity. This finding allows us to propose rs34500389, rs33981098, rs33980857, rs34598529, rs33931746, rs397509430, and rs35518301 as candidate SNP markers of inflammatory complications in obesity (Table S1, Additional file 1).
Near these biomedical SNP markers, we found three unannotated SNPs: rs63750953, rs281864525, and rs34166473. Our analysis predicted that they can cause a hemoglobin deficiency. Consequently, we propose them as candidate SNP markers of inflammatory complications of obesity (Table S1, Additional file 1).
After that, we analyzed all unannotated SNPs in the [-70; -20] region (where all proven TBP-binding sites are located) in the only known promoter of the human LEP gene, which we selected due to Friedman's discovery that the LEP gene (encodes hormone leptin) is the "obesity gene": OB ≡ LEP  (Fig. S1, see Additional file 2). Table S1 (Additional file 1) shows three unannotated SNPs--rs201381696, rs200487063, and rs34104384--which can alter TBP's affinity for this promoter according to our prediction (see Additional file 2). The first one in the list can reduce the TBP-promoter affinity, whereas the two others can increase this affinity. Using a keyword search, we found literature data showing that the leptin deficiency is a biochemical marker of obesity , whereas leptin overexpression can serve as a marker of obesity-caused hypertension [6, 7]. This is why we propose three more candidate SNP markers (rs201381696 as well as rs200487063 and rs34104384) of obesity-caused hypertension (Table S1, Additional file 1).
Finally, we selected one more unannotated SNP (rs183433761) within the TBP-binding site of the promoter of the human GCG gene of glucagon because this SNP's association with obesity has not yet been examined. Similarly, we predicted a GCG deficiency and found (using a keyword search) that it is a biochemical marker of the resistance to obesity during a high-fat diet . Accordingly, we propose rs183433761 as a candidate SNP marker of obesity resistance during a high-fat diet (Table S1, Additional file 1).
We reviewed all of the above findings and selected four of our obesity-related candidate SNP markers--rs201381696, rs200487063, rs34104384, and rs183433761--for empirical biochemical verification using EMSA under nonequilibrium conditions in vitro because these SNPs have not yet been verified experimentally on their possible association with obesity.
The results of in vitroanalysis of the four selected candidate SNP markers
The primary experimental data from the in vitro analysis of the four selected candidate SNP markers are depicted in Figure 1. Table 1 shows these results for rs200487063, rs201381696, rs34104384, and rs183433761 in terms of the primary experimental ka and kd values that characterize the kinetics of TBP's binding to the ancestral and minor variants of the TBP-binding site in the only promoter of the LEP gene that is documented in the human reference genome hg19.
First, the above-mentioned results show that the variant -38a in the LEP gene promoter increases the rate of formation of the TBP-DNA complexes (ka) by 50%, the variant -30t fivefold, and variant -35g by 24-fold. The rate of decay (kd) for -38a increased by 20%, and for -35g by 14-fold, whereas for -30t, it decreased 4.4-fold. Affinity (K*D) of TBP for ODN containing -38a increased by 50%, and for the -30t variant it increased by 12%, whereas for the -35g variant, the affinity diminished 2.8-fold. Meanwhile, the minimal half-life of the complex (t1/2) of 9 min was observed for the -35g variant; this situation was caused by the increase in the rate of day (kd) 14-fold relative to the norm and by very low affinity. It should be noted that the variants -38a and -30t are located in the flanks of the TATAbox (the canonical form of a TBP-binding site), and only variant -35g affects the sequence of the TATAbox itself, thereby causing a massive decrease in affinity to 230 nM.
As shown in Table 1 the greatest increase in the rate of formation and decay of the TBP-TATA complex was observed for variant -35g (24-fold and 14-fold, respectively), which affects the sequence of the TATA-box and replaces its most conserved base "A" with "G" . This change reduces affinity (K*D) to a virtually nonspecific level (230 nM) and reduces half-life of the complexes more than sevenfold. This alteration is associated with obesity .
In contrast, variant -41g of the promoter of GCG lowers the rate of formation (ka) of the complex TBP-ODN 2.3-fold, whereas the rate of dissociation (kd) of the complex and its half-life decrease by 20%. The calculated KD value, characterizing the change in TBP-ODN affinity, decreased 2.25-fold.
At the same time, their ratio, kd/ka, which determines the apparent dissociation constant K*D and affinity TBP-ODN, decreased more than threefold in the case of -35g, which disrupts the TATA-like subsequence (by converting it to g-38cTg-35TAA) but increased 1.6-fold for -38a flanking the normal a-38cTA-35TAA pentanucleotide. We estimated the half-life of the TBP-ODN complexes (Table 1). Its values, 9-77 min, fit the time interval, 5-180 min (Figure 1), of nonequilibrium in vitro conditions for our measurement of the kinetics of TBP's binding to each ODN corresponding to our three possible obesity-related SNP markers. The minimal half-life, 9 min, was observed for the allele -35G which corresponds to minimal binding affinity and to a maximal dissociation rate of the complex TBP-TATA.
Thus, judging by the effects of rs200487063, rs201381696, rs34104384, and rs183433761, the constants ka and kd describe qualitatively different and independent characteristics. This observation is in agreement with the commonly accepted notion that K*D, ka, and kd independently characterize different features of the kinetics of intermolecular binding.
Figure 2 shows significant correlations between nonequilibrium in vitro measurements of K*D (Table 1) and equilibrium in silico estimates of K0D (Table S1, see Additional file 1). Assessment by three independent statistical tests is provided: Pearson's simple linear correlation (r = 0.99; α < 0.00025), Kendall's rank correlation (τ = 1; α < 0.005), and Goodman-Kruskal's generalized correlation (γ = 1; α < 0.005). These data are indicative of robustness of the correlation, and this robustness is consistent with our analyses of various SNP markers of monogenic diseases under equilibrium , nonequilibrium , and real-time  experimental in vitro conditions and with independent studies by other authors [74, 123, 124].
The different ranges of K0D and K*D reflect the differences in uncontrollable factors under equilibrium and nonequilibrium in vitro conditions, which do not influence the relative values of TBP affinity for different variants of the TBP-binding site (Figure 2). Because of this property of TBP-binding sites (Figure 2), which was proven empirically by many research groups [74, 123, 124], leptin production in a patient with the minor variant -35g of rs201381696 may be proportionately lower (in each tissue expressing the LEP gene, regardless of its tissue specificity) in comparison with a person carrying the variant WT. Consequently, the total plasma leptin level in this patient will be lowered: this is a clinically established risk factor of obesity . By the same token, the minor variants -38g of rs200487063 and -30t of rs34104384 may cause a proportional increase in leptin production (by any tissues) and a proportional increase in the overall plasma leptin level, which is a known risk factor of hypertension as a complication of obesity [6, 7]. This combined empirical and computer-based identification of rs200487063, rs201381696, rs34104384, and rs183433761 (never considered in association with obesity) as obesity-related candidate SNP markers is the main result of this work.
Here, we focused only on the TBP-binding site because it is the best-studied site in human gene promoters ; we detected known SNP manifestations: "susceptibility/resistance" to diseases. In comparison with our previous applications of the Web service SNP_TATA_Comparator [91, 92] to the research on monogenic diseases [78–92], here we found a number of different associations, such as complications of obesity (e.g., rs10895068, male breast cancer [T4 tumor] in obesity), a phenotypic trait (e.g., rs1143627, greater body fat in older men), consequences of surgery (e.g., rs35036378, reduced risk of obesity after ovariectomy), eating behavior (e.g., rs183433761, obesity resistance during a high-fat diet), and the preventive effect of a lifestyle (e.g., rs201739205, low risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women). These findings can extend the practical utility of our Web service because of the cross-validation of its output with a keyword search for available literature on an appropriate biochemical marker of the diseases under study.
In addition to the commonly used textual analysis of DNA sequences, we take into account quantitative values of the minor-groove width of B-helical DNA  because TBP binds to DNA via incorporation of side chains of two amino acid residues into DNA's minor groove . This additional criterion increases the accuracy of our prediction (Figure 2), in agreement with independent findings of some other researchers . This result means that the computer-based SNP analysis may be improved if various quantitative parameters of conformational and/or physicochemical features of B-helical DNA [128, 129] are taken into account in addition to the textual data.
Furthermore, population frequencies of the minor alleles of unannotated SNP can serve as another information source that may improve the predictions of candidate SNP markers of a disease. Accordingly, Table S1 (Additional file 1) characterizes all the SNPs under study in terms of the population frequencies taken from the "1000 Genome Browser" . As one can see, these parameters vary from sample to sample in a wide range (e.g., the population frequency rates of rs1143627 vary from 27% to 65%, see Table S1, Additional file 1). In addition, many biomedical SNP markers still had no values of population frequencies within the framework of the "1000 Genome Browser"  (e.g., rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473; see Table S1, Additional file 1). Moreover, many biomedical SNP markers were not still documented by the database dbSNP  (e.g., the substitutions -35A→C (APOA1), -51T→C (NOS2), -35A→C (F7); see Table S1, Additional file 1). At present, the above-mentioned population frequency values correspond to ethnic groups in regional subpopulations (for studies on human migration flows) rather than cohorts of patients with certain risk factors of diseases--overweight, smoking, and alcohol intake--to prioritize the candidate SNP markers according to the biomedical standards . This is why the type of biomedical standardization of whole-genome data available today may advance postgenomic predictive preventive personalized medicine .
Our in vitro assays (Table 1) and in silico predictions (Table S1, Additional file 1) all indicate that the greatest changes in the leptin level may be expected for the minor allele -35g, which disrupts the TATA-like TBP-binding site and decreases the TBP-DNA affinity threefold. In contrast, the minor alleles -30t and -38a flanking this site seem to enhance this affinity by 25% and 50%, respectively. We have already observed this pattern empirically in a previous study, namely, three SNP markers of β-thalassemia (rs34598529, rs33931746, and rs3393174) within the well-known TATA box decrease TBP's affinity for the human HBB gene promoter 10-fold, whereas another biomedical SNP marker, -27A→t (flanking this TATA box), yields only a 20% decrease . As for the smallest change in the binding affinity of TBP for the promoter variant -30t (rs34104384, LEP), one can see in Table S1 (Additional file 1) that the same or even smaller effects are observed for some biomedical SNP markers, such as rs35036378 (25%, ESR2), the -35A→c substitution (25%, APOA1), rs34500389 (17%, HBB), and the -35A→c substitution (15%, F7). In addition, one can see in Table 1 that the quantitative values of the Gibbs free energy change (ΔG) are an unreliable criterion for distinguishing between variants of ODNs binding to TBP. In contrast, the kinetic constant ka of the association rate of the TBP-ODN complex does it best (see Results). This finding means that the binding of TBP to a promoter is under kinetic control rather than under thermostatic one. Therefore, additional empirical measurements can enrich the preliminary bioinformatics predictions on the candidate SNP markers of diseases.
Finally, our combined in silico and in vitro data may serve as a good rationale for clinical researchers who wish to validate promising candidates for SNP markers. The definitive proof of an SNP as a clinical marker is demonstration of a significant difference in its frequency between patients and healthy people, with adjustments for various confounding factors such as the ethnic and gender composition of the regional subpopulation in question, lifestyle, living conditions, climate, environment, and expressivity and penetrance of the disorder under study .
Changes in the affinity of a transcription factor for a regulatory DNA sequence are commonly an appreciable but not a crucial factor of a disease; for this reason, diverse clinical manifestations of the interactions between genetic and environmental factors are typical of polygenic diseases . In the case of metabolic syndrome , obesity is a strong contributing factor  in addition to genetic predisposition. Prevalence of obesity among overweight Europeans exceeds 50% and results in elevated risk of respiratory failure, atherosclerosis, and heart failure . In this work, we identified 22 obesity-related candidate SNP markers. Their validation in accordance with proper biomedical standards may help to solve the global problem of treatment of metabolic syndrome  by means of the postgenomic predictive preventive personalized medicine .
Recombinant full-length human TBP (native amino acid sequence) was expressed in Escherichia coli BL21 (DE3) cells transformed with the pAR3038-TBP plasmid (the generous gift of Prof. B. Pugh, Pennsylvania State University) by a previously described method  with two modifications: the IPTG concentration was 1.0 instead of 0.1 mM, and the induction time was 3 instead of 1.5 h. For a more detailed description of our protocol for production and purification of human TBP, see .
Oligodeoxyribonucleotides (ODNs) 26 bp long were synthesized by the Biosynthesis Enterprise (Novosibirsk, Russia) and were purified by PAGE. The ODN sequences correspond to either the ancestral or minor allele of SNPs of the TBP-binding site in the promoters of human genes LEP and GCG that are analyzed here in vitro (Table 1). Labeled double-stranded ODNs were prepared by 32P labeling of both strands by means of T4 polynucleotide kinase (SibEnzyme, Novosibirsk) with subsequent annealing by heating to 95°C (at equimolar concentrations) and slow cooling (no less than 3 h) to room temperature. The duplexes were analyzed in 15% nondenaturing polyacrylamide gel (1 × Tris-borate-EDTA buffer) and isolated by electroelution. For a detailed description of our protocol for labeling of ODNs with 32P, see .
The association rate constant (ka) and dissociation rate constant (kd) were determined for the complexes of TBP with each 26-bp ODN corresponding to the 26-bp sequence of either the ancestral or minor variant of the human LEP gene promoter. Association kinetics experiments were performed at four ODN concentrations: 10, 20, 40, and 60 nM as shown in Figure 1(B). The experiments with TBP/ODN binding were performed at 25°C in binding buffer (20 mM 4-[2-hydroxyethyl]-1-piperazineethanesulfonic acid [HEPES]-KOH pH 7.6, 5 mM MgCl2, 70 mM KCl, 1 mM dithiothreitol [DTT], 100 μg/mL BSA, 0.01% NP-40, and 5% glycerol) at a fixed concentration (0.3 nM) of active TBP. The gels were dried and Imaging Screen-K (Kodak, Rochester, NY, USA) was exposed to these gels for analysis on a Molecular Imager PharosFX Plus phosphorimager (Bio-Rad, Herts, UK). The resulting autoradiographs were quantitated in the Quantity One 4.5.0 software (Bio-Rad) as shown in Figure 1(A). Using these data, we calculated the association rate constant (ka) and dissociation rate constant (kd) in the Graph-Pad Prism 5 software (http://graphpad-prism.software.informer.com/5.01) by global fitting of the data onto the association kinetics model. For a detailed description of our protocol for in vitro measurement of association and dissociation rate constants for TBP/ODN complexes, see .
We analyzed DNA sequences of the human gene promoters taken from the database Ensembl  using their locations within the human reference genome (hg19) taken from the database GENCODE , as shown in Fig. S2 (Additional file 3). For each DNA sequence, we assessed the maximal "-ln(K0D) ± δ" affinity of TBP for the [-70; -20] promoter region (where all the known sites are located) using our Web service [91, 92] as described in Additional file 4. For each case of predicted significant overexpression or downregulation of the human genes (as clinically relevant biochemical markers), we manually performed a keyword search in the NCBI databases  as described in detail elsewhere  and schematically shown in Fig. S3 (Additional file 5). Our heuristic interpretation of these predicted cases of significant overexpression and underexpression of the human genes is shown in the second rightmost column of Table S1 (Additional file 1): these are clinical data identified by our manual keyword search, with citations in the rightmost column.
Finally, using the software package STATISTICA (Statsoft™, Tulsa, USA), we estimated statistical significance of three correlations: Pearson's simple linear correlation (r), Kendall's rank correlation (τ), and Goodman-Kruskal's generalized (γ) correlation between the K0D values predicted in silico and the K*D values obtained by measurement in vitro as shown in Figure 2.
electrophoretic mobility shift assay
hemoglobin subunit β
hemoglobin subunit δ
association rate constant
dissociation rate constant
apparent dissociation constant
single nucleotide polymorphism
transcription start site
wild type (norm)
Gibbs free energy change.
Sandhofer A, Kaser S, Ritsch A, Laimer M, Engl J, Paulweber B, et al: Cholesteryl ester transfer protein in metabolic syndrome. Obesity (Silver Spring). 2006, 14: 812-8.
Hulthe J, Bokemark L, Wikstrand J, Fagerberg B: The metabolic syndrome, LDL particle size, and atherosclerosis: the atherosclerosis and insulin resistance (AIR) study. Arterioscler Thromb Vasc Biol. 2000, 20: 2140-7.
Lavie CJ, Milani RV, Ventura HO: Obesity and cardiovascular disease: risk factor, paradox, and impact of weight loss. J Am Coll Cardiol. 2009, 53: 1925-32.
Reeves GK, Pirie K, Beral V, Green J, Spencer E, Bull D, et al: Cancer incidence and mortality in relation to body mass index in the Million Women Study: cohort study. BMJ. 2007, 335: 1134-
Rodriguez C, Calle EE, Fakhrabadi-Shokoohi D, Jacobs EJ, Thun MJ: Body mass index, height, and the risk of ovarian cancer mortality in a prospective cohort of postmenopausal women. Cancer Epidemiol Biomarkers Prev. 2002, 11: 822-8.
Tumer N, Erdos B, Matheny M, Cudykier I, Scarpace PJ: Leptin antagonist reverses hypertension caused by leptin overexpression, but fails to normalize obesity-related hypertension. J Hypertens. 2007, 25: 2471-8.
Beltowski J: Central vs. peripheral leptin excess in the pathogenesis of obesity-associated hypertension. J Hypertens. 2008, 26: 827-828.
Chan JM, Rimm EB, Colditz GA, Stampfer MJ, Willett WC: Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care. 1994, 17: 961-9.
Akamine EH, Marcal AC, Camporez JP, Hoshida MS, Caperuto LC, Bevilacqua E, et al: Obesity induced by high-fat diet promotes insulin resistance in the ovary. J Endocrinol. 2010, 206: 65-74.
Matsunaga A, Sasaki J, Han H, Huang W, Kugi M, Koga T, et al: Compound heterozygosity for an apolipoprotein A1 gene promoter mutation and a structural nonsense mutation with apolipoprotein A1 deficiency. Arterioscler Thromb Vasc Biol. 1999, 19: 348-55.
Arroyo A, Laughlin GA, Morales AJ, Yen SS: Inappropriate gonadotropin secretion in polycystic ovary syndrome: influence of adiposity. J Clin Endocrinol Metab. 1997, 82: 3728-33.
Dravecka I, Lazurova I, Kraus V: Obesity is the major factor determining an insulin sensitivity and androgen production in women with anovulary cycles. Bratisl Lek Listy. 2003, 104: 393-9.
Barber TM, McCarthy MI, Wass JA, Franks S: Obesity and polycystic ovary syndrome. Clin Endocrinol (Oxf). 2006, 65: 137-45.
Frederickson R, Hagglund M, Olszewski PK, Stephansson O, Jacobsson JA, Olszewska AM, et al: The obesity gene, FTO, is of ancient origin, up-regulated during food deprivation and expressed in neurons of feeding-related nuclei of the brain. Endocrinology. 2008, 149: 2062-71.
Stothard KJ, Tennant PW, Bell R, Rankin J: Maternal overweight and obesity and the risk of congenital anomalies: a systematic review and meta-analysis. JAMA. 2009, 301: 636-50.
Zhang Y, Proenca R, Maffei M, Barone M, Leopold L, Friedman JM: Positional cloning of the mouse obese gene and its human homologue. Nature. 1994, 372: 425-32.
Farooqi S, O'Rahilly S: 20 years of leptin: human disorders of leptin action. J Endocrinol. 2014, 223: T63-70.
Zhang F, Basinski MB, Beals JM, Briggs SL, Churgay LM, Clawson DK, et al: Crystal structure of the obese protein leptin-E100. Nature. 1997, 387: 206-209.
Friedman JM: Leptin and the regulation of body weigh. Keio J Med. 2011, 60: 1-9.
Zhang Y, Proenca R, Maffei M, Barone M, Leopold L, Friedman JM: Positional cloning of the mouse obese gene and its human homologue. Nature. 1994, 372: 425-32.
Sobhani I, Bado A, Vissuzaine C, Buyse M, Kermorgant S, Laigneau JP, et al: Leptin secretion and leptin receptor in the human stomach. Gut. 2000, 47: 178-83.
Purdham DM, Zou MX, Rajapurohitam V, Karmazyn M: Rat heart is a site of leptin production and action. Am J Physiol Heart Circ Physiol. 2004, 287: H2877-84.
Masuzaki H, Ogawa Y, Sagawa N, Hosoda K, Matsumoto T, Mise H, et al: Nonadipose tissue production of leptin: leptin as a novel placenta-derived hormone in humans. Nat Med. 1997, 3: 1029-33.
Zeidan A, Broman J, Hellstrand P, Sward K: Cholesterol dependence of vascular ERK1/2 activation and growth in response to stretch: role of endothelin-1. Arterioscler Thromb Vasc Biol. 2003, 23: 1528-34.
Hall JE, da Silva AA, do Carmo JM, Dubinion J, Hamza S, Munusamy S, et al: Obesity-induced hypertension: role of sympathetic nervous system, leptin and melanocortins. J Biol Chem. 2010, 285: 17271-6.
Beltowski J: Leptin and atherosclerosis. Atherosclerosis. 2006, 189: 47-60.
Zhang L, Song H, Ge Y, Ji G, Yao Z: Temporal relationship between diet-induced steatosis and onset of insulin/leptin resistance in male Wistar rats. PLoS One. 2015, 10: e0117008-
Zampieri TT, Ramos-Lobo AM, Furigo IC, Pedroso AB, Buonfiglio DC, Donato J: SOCS3 deficiency in leptin receptor-expressing cells mitigates the development of pregnancy-induced metabolic changes. Mol Metab. 2015, 4: 237-45.
Nteeba J, Ganesan S, Keating AF: Impact of obesity on ovotoxicity induced by 7,12-dimethylbenz[a]anthracene in mice. Biol Reprod. 2014, 90: 68-
Trovato GM: Sustainable medical research by effective and comprehensive medical skills: overcoming the frontiers by predictive, preventive and personalized medicine. EPMA J. 2014, 5: 14-
Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, et al: The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014, 42: D1001-6.
Landrum MJ, Lee JM, Riley GR, Jang W, Rubinstein WS, Church DM, et al: ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 2014, 42: D980-5.
Al-Shakfa F, Dulucq S, Brukner I, Milacic I, Ansari M, Beaulieu P, et al: DNA variants in region for noncoding interfering transcript of dihydrofolate reductase gene and outcome in childhood acute lymphoblastic leukemia. Clin Cancer Res. 2009, 15: 6931-8.
Aprile G, Bonotto M, Ongaro E, Pozzo C, Giuliani F: Critical appraisal of ramucirumab (IMC-1121B) for cancer treatment: from benchside to clinical use. Drugs. 2013, 73: 2003-15.
Mallal S, Nolan D, Witt C, Masel G, Martin AM, Moore C, et al: Association between presence of HLA-B 5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir. Lancet. 2002, 359: 727-32.
Colonna V, Ayub Q, Chen Y, Pagani L, Luisi P, Pybus M, et al: Human genomic regions with exceptionally high levels of population differentiation identified from 911 whole-genome sequences. Genome Biol. 2014, 15: R88-
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, et al: dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001, 29: 308-11.
Zerbino DR, Wilder SP, Johnson N, Juettemann T, Flicek PR: The Ensembl regulatory build. Genome Biol. 2015, 16: 56-
Haeussler M, Raney BJ, Hinrichs AS, Clawson H, Zweig AS, Karolchik D, et al: Navigating protected genomics data with UCSC Genome Browser in a Box. Bioinformatics. 2015, 31: 764-6.
Kemmeren P, Sameith K, van de Pasch LA, Benschop JJ, Lenstra TL, Margaritis T, et al: Large-scale genetic perturbations reveal regulatory networks and an abundance of gene-specific repressors. Cell. 2014, 157: 740-52.
Vasiliev GV, Merkulov VM, Kobzev VF, Merkulova TI, Ponomarenko MP, Kolchanov NA: Point mutations within 663-666 bp of intron 6 of the human TDO2 gene, associated with a number of psychiatric disorders, damage the YY-1 transcription factor binding site. FEBS Lett. 1999, 462: 85-8.
Ponomarenko JV, Orlova GV, Merkulova TI, Fokin ON, Vasiliev GV, et al: rSNP_Guide: an integrated database-tools system for studying SNPs and site-directed mutations in transcription factor binding sites. Hum Mutat. 2002, 20: 239-48.
Deyneko IV, Weiss S, Leschner S: An integrative computational approach to effectively guide experimental identification of regulatory elements in promoters. BMC Bioinformatics. 2012, 13: 202-
Deyneko IV, Kel AE, Kel-Margoulis OV, Deineko EV, Wingender E, Weiss S: MatrixCatch--a novel tool for the recognition of composite regulatory elements in promoters. BMC Bioinformatics. 2013, 14: 241-
Ni Y, Hall AW, Battenhouse A, Iyer VR: Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data. BMC Genet. 2012, 13: 46-
Leschner S, Deyneko IV, Lienenklaus S, Wolf K, Bloecker H, Bumann D, et al: Identification of tumor-specific Salmonella Typhimurium promoters and their regulatory logic. Nucleic Acids Res. 2012, 40: 2984-94.
Hu J, Locasale JW, Bielas JH, O'Sullivan J, Sheahan K, Cantley LC, et al: Heterogeneity of tumor-induced gene expression changes in the human metabolic network. Nat Biotechnol. 2013, 31: 522-9.
Hein M, Graver S: Tumor cell response to bevacizumab single agent therapy in vitro. Cancer Cell Int. 2013, 13: 94-
Chen CY, Chang IS, Hsiung CA, Wasserman WW: On the identification of potential regulatory variants within genome wide association candidate SNP sets. BMC Med Genomics. 2014, 7: 34-
Ponomarenko JV, Furman DP, Frolov AS, Podkolodny NL, Orlova GV, Ponomarenko MP, et al: ACTIVITY: a database on DNA/RNA sites activity adapted to apply sequence-activity relationships from one system to another. Nucleic Acids Res. 2001, 29: 284-7.
Macintyre G, Bailey J, Haviv I, Kowalczyk A: is-rSNP: a novel technique for in silico regulatory SNP detection. Bioinformatics. 2010, 26: i524-30.
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, et al: Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012, 22: 1790-7.
Riva A: Large-scale computational identification of regulatory SNPs with rSNP-MAPPER. BMC Genomics. 2012, 13: S7-
Andersen MC, Engstrom PG, Lithwick S, Arenillas D, Eriksson P, Lenhard B, et al: In silico detection of sequence variations modifying transcriptional regulation. PLoS Comput Biol. 2008, 4: e5-
Ponomarenko JV, Orlova GV, Frolov AS, Gelfand MS, Ponomarenko MP: SELEX_DB: a database on in vitro selected oligomers adapted for recognizing natural sites and for analyzing both SNPs and site-directed mutagenesis data. Nucleic Acids Res. 2002, 30: 195-9.
Fu Y, Liu Z, Lou S, Bedford J, Mu XJ, Yip KY, et al: FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer. Genome Biol. 2014, 15: 480-
Chen CC, Xiao S, Xie D, Cao X, Song CX, Wang T, et al: Understanding variation in transcription factor binding by modeling transcription factor genome-epigenome interactions. PLoS Comput Biol. 2013, 9: e1003367-
Deyneko IV, Bredohl B, Wesely D, Kalybaeva YM, Kel AE, Blocker H, et al: FeatureScan: revealing property-dependent similarity of nucleotide sequences. Nucleic Acids Res. 2006, 34: W591-5.
Deyneko IV, Kalybaeva YM, Kel AE, Blocker H: Human-chimpanzee promoter comparisons: property-conserved evolution?. Genomics. 2010, 96: 129-33.
Bryzgalov LO, Antontseva EV, Matveeva MY, Shilov AG, Kashina EV, Mordvinov VA, et al: Detection of regulatory SNPs in human genome using ChIP-seq ENCODE data. PLoS One. 2013, 8: e78833-
Podkolodnyy NL, Afonnikov DA, Vaskin YuYu, Bryzgalov LO, Ivanisenko VA, Demenkov PS, et al: Program complex SNP-MED for analysis of single-nucleotide polymorphism (SNP) effects on the function of genes associated with socially significant diseases. Russ J Genet App Res. 2014, 4: 159-67.
Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, de Bakker PI: SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008, 24: 2938-9.
Coetzee SG, Rhie SK, Berman BP, Coetzee GA, Noushmehr H: FunciSNP: an R/bioconductor tool integrating functional non-coding data sets with genetic association studies to identify candidate regulatory SNPs. Nucleic Acids Res. 2012, 40: e139-
Saccone SF, Bolze R, Thomas P, Quan J, Mehta G, Deelman E, et al: SPOT: a web-based tool for using biological databases to prioritize SNPs after a genome-wide association study. Nucleic Acids Res. 2010, 38: W201-9.
Rasskazov DA, Antontseva EV, Bryzgalov LO, Matveeva MYu, Kashina EV, Ponomarenko PM, et al: rSNP_Guide-based evaluation of SNPs in promoters of the human APC and MLH1 genes associated with colon cancer. Russ J Genet App Res. 2014, 4: 245-53.
Barenboim M, Manke T: ChroMoS: an integrated web tool for SNP classification, prioritization and functional interpretation. Bioinformatics. 2013, 29: 2197-8.
Ponomarenko MP, Ponomarenko JV, Frolov AS, Podkolodnaya OA, Vorobyev DG, Kolchanov NA, et al: Oligonucleotide frequency matrices addressed to recognizing functional DNA sites. Bioinformatics. 1999, 15: 631-43.
Amberger J, Bocchini CA, Scott AF, Hamosh A: McKusick's Online Mendelian Inheritance in Man (OMIM). Nucleic Acids Res. 2009, 37: D793-6.
Mitsuyasu H, Izuhara K, Mao XQ, Gao PS, Arinobu Y, Enomoto T, et al: Ile50Val variant of IL4R-alpha upregulates IgE synthesis and associates with atopic asthma. Nat Genet. 1998, 19: 119-20.
Guo L, Du Y, Chang S, Zhang K, Wang J: rSNPBase: a database for curated regulatory SNPs. Nucleic Acids Res. 2014, 42: D1033-9.
Yoo SS, Jin C, Jung DK, Choi YY, Choi JE, Lee WK, et al: Putative functional variants of XRCC1 identified by RegulomeDB were not associated with lung cancer risk in a Korean population. Cancer Genet. 2015, 208: 19-24.
Savinkova LK, Ponomarenko MP, Ponomarenko PM, Drachkova IA, Lysova MV, Arshinova TV, et al: TATA box polymorphisms in human gene promoters and associated hereditary pathologies. Biochemistry (Mosc). 2009, 74: 117-29.
Ponomarenko M, Mironova V, Gunbin K, Savinkova L: Hogness Box. Brenner's Encyclopedia of Genetics. Edited by: Maloy S, Hughes K. 2013, San Diego: Academic Press, Elsevier Inc, 3: 491-4. 2nd
Pugh BF: Control of gene expression through regulation of the TATA-binding protein. Gene. 2000, 255: 1-14.
Ponomarenko M, Savinkova L, Kolchanov N: Initiation Factors. Brenner's Encyclopedia of Genetics. Edited by: Maloy S, Hughes K. 2013, San Diego: Academic Press, Elsevier Inc, 4: 83-5. 2nd
Martianov I, Viville S, Davidson I: RNA polymerase II transcription in murine cells lacking the TATA binding protein. Science. 2002, 298: 1036-9.
Muller F, Lakatos L, Dantonel J, Strahle U, Tora L: TBP is not universally required for zygotic RNA polymerase II transcription in zebrafish. Curr Biol. 2001, 11: 282-7.
Ponomarenko PM, Savinkova LK, Drachkova IA, Lysova MV, Arshinova TV, Ponomarenko MP, et al: A step-by-step model of TBP/TATA box binding allows predicting human hereditary diseases by single nucleotide polymorphism. Dokl Biochem Biophys. 2008, 419: 88-92.
Savinkova L, Drachkova I, Arshinova T, Ponomarenko P, Ponomarenko M, Kolchanov N: An experimental verification of the predicted effects of promoter TATA-box polymorphisms associated with human diseases on interactions between the TATA boxes and TATA-binding protein. PLoS One. 2013, 8: e54626-
Drachkova I, Savinkova L, Arshinova T, Ponomarenko M, Peltek S, Kolchanov N: The mechanism by which TATA-box polymorphisms associated with human hereditary diseases influence interactions with the TATA-binding protein. Hum Mutat. 2014, 35: 601-8.
Drachkova IA, Shekhovtsov SV, Peltek SE, Ponomarenko PM, Arshinova TV, Ponomarenko MP, et al: Surface plasmon resonance study of the interaction between the human TATA-box binding protein and the TATA element of the NOS2A gene promoter. Vavilov J Genet Breed. 2012, 16: 391-6.
Arkova OV, Kuznetsov NA, Fedorova OS, Kolchanov NA, Savinkova LK: Real-time interaction between TBP and the TATA box of the human triosephosphate isomerase gene promoter in the norm and pathology. Acta Naturae. 2014, 6: 36-40.
Ponomarenko PM, Ponomarenko MP, Drachkova IA, Lysova MV, Arshinova TV, Savinkova LK, et al: Prediction of the affinity of the TATA-binding protein to TATA boxes with single nucleotide polymorphisms. Mol Biol (Mosk). 2009, 43: 327-36.
Suslov VV, Ponomarenko PM, Efimov VM, Savinkova LK, Ponomarenko MP, Kolchanov NA: SNPs in the HIV-1 TATA box and the AIDS pandemic. J Bioinform Comput Biol. 2010, 8: 607-25.
Ponomarenko PM, Suslov VV, Savinkova LK, Ponomarenko MP, Kolchanov NA: A precise equation of equilibrium of four steps of TBP binding with the TATA box for prognosis of phenotypic manifestation of mutations. Biofizika. 2010, 55: 400-14.
Suslov VV, Ponomarenko PM, Ponomarenko MP, Drachkova IA, Arshinova TV, Savinkova LK, et al: TATA box polymorphisms in genes of commercial and laboratory animals and plants associated with selectively valuable traits. Russ J Genet. 2010, 46: 394-403.
Mironova VV, Ponomarenko PM, Ponomarenko MP, Kolchanov NA: Specific/nonspecific binding of TBP to promoter DNA of the auxin response factor genes in plants correlated with ARFs function on gene transcription (activator/repressor). Dokl Biochem Biophys. 2010, 433: 191-6.
Ponomarenko MP, Suslov VV, Gunbin KV, Ponomarenko PM, Vishnevsky OV, Kolchanov NA: Identification of the relationship between the variability of the expression of signaling pathway genes in the human brain and the affinity of TATA-binding protein to their promoters. Russ J Genet App Res. 2015.
Drachkova IA, Ponomarenko PM, Arshinova TV, Ponomarenko MP, Suslov VV, Savinkova LK, et al: In vitro examining the existing prognoses how TBP binds to TATA with SNP associated with human diseases. Health. 2011, 3: 577-83.
Ponomarenko PM, Ponomarenko MP: Sequence-based prediction of transcription up-regulation by auxin in plants. J Bioinform Comput Biol. 2015, 13: 1540009-
Rasskazov DA, Gunbin KV, Ponomarenko PM, Vishnevsky OV, Ponomarenko MP, Afonnikov DA: SNP_TATA_comparator: Web-service for comparison of SNPs within gene promoters associated with human diseases using the equilibrium equation of the TBP/TATA complex. Vavilov J Genet Breed. 2013, 17: 599-606.
Ponomarenko M, Rasskazov D, Arkova O, Ponomarenko P, Suslov V, Savinkova L, et al: How to use SNP_TATA_Comparator to find a significant change in gene expression caused by the regulatory SNP of this gene's promoter via a change in affinity of the TATA-binding protein for this promoter. Biomed Res Int. 2015, 2015: 359835-
Strandberg L, Mellstrom D, Ljunggren O, Grundberg E, Karlsson MK, Holmberg AH, et al: IL6 and IL1B polymorphisms are associated with fat mass in older men: the MrOS Study Sweden. Obesity (Silver Spring). 2008, 16: 710-3.
Hayashi F, Watanabe M, Nanba T, Inoue N, Akamizu T, Iwatani Y: Association of the -31C/T functional polymorphism in the interleukin-1beta gene with the intractability of Graves' disease and the proportion of T helper type 17 cells. Clin Exp Immunol. 2009, 158: 281-6.
El-Omar EM, Carrington M, Chow WH, McColl KE, Bream JH, Young HA, et al: Interleukin-1 polymorphisms associated with increased risk of gastric cancer. Nature. 2000, 404: 398-402.
Wang Y, Kato N, Hoshida Y, Yoshida H, Taniguchi H, Goto T, et al: Interleukin-1beta gene polymorphisms associated with hepatocellular carcinoma in hepatitis C virus infection. Hepatology. 2003, 37: 65-71.
Wu KS, Zhou X, Zheng F, Xu XQ, Lin YH, Yang J: Influence of interleukin-1 beta genetic polymorphism, smoking and alcohol drinking on the risk of non-small cell lung cancer. Clin Chim Acta. 2010, 411: 1441-6.
Martinez-Carrillo DN, Garza-Gonzalez E, Betancourt-Linares R, Monico-Manzano T, Antunez-Rivera C, Roman-Roman A, et al: Association of IL1B -511C/-31T haplotype and Helicobacter pylori vacA genotypes with gastric ulcer and chronic gastritis. BMC Gastroenterol. 2010, 10: 126-
Borkowska P, Kucia K, Rzezniczek S, Paul-Samojedny M, Kowalczyk M, Owczarek A, et al: Interleukin-1beta promoter (-31T/C and -511C/T) polymorphisms in major recurrent depression. J Mol Neurosci. 2011, 44: 12-6.
Gonzalez-Martinez JA, Moddel G, Ying Z, Prayson RA, Bingaman WE, Najm IM: Neuronal nitric oxide synthase expression in resected epileptic dysplastic neocortex. J Neurosurg. 2009, 110: 343-9.
Clark IA, Rockett KA, Burgner D: Genes, nitric oxide and malaria in African children. Trends Parasitol. 2003, 19: 335-7.
Burgner D, Rockett K, Ackerman H, Hull J, Usen S, Pinder M, Kwiatkowski DP: Haplotypic relationship between SNP and microsatellite markers at the NOS2A locus in two populations. Genes Immun. 2003, 4: 506-14.
Jeon MJ, Leem J, Ko MS, Jang JE, Park HS, Kim HS, et al: Mitochondrial dysfunction and activation of iNOS are responsible for the palmitate-induced decrease in adiponectin synthesis in 3T3L1 adipocytes. Exp Mol Med. 2012, 44: 562-70.
De Vivo I, Huggins GS, Hankinson SE, Lescault PJ, Boezen M, Colditz GA, et al: A functional polymorphism in the promoter of the progesterone receptor gene associated with endometrial cancer risk. Proc Natl Acad Sci USA. 2002, 99: 12263-8.
Liukkonen S, Saarto T, Maenpaa H, Sjostrom-Mattson J: Male breast cancer: a survey at the Helsinki University Central Hospital during 1981-2006. Acta Oncol. 2010, 49: 322-7.
Philips S, Richter A, Oesterreich S, Rae JM, Flockhart DA, Perumal NB, et al: Functional characterization of a genetic polymorphism in the promoter of the ESR2 gene. Horm Cancer. 2012, 3: 37-43.
Seidlova-Wuttke D, Nguyen BT, Wuttke W: Long-term effects of ovariectomy on osteoporosis and obesity in estrogen-receptor-β-deleted mice. Comp Med. 2012, 62: 8-13.
Peltoketo H, Piao Y, Mannermaa A, Ponder BA, Isomaa V, Poutanen M, et al: A point mutation in the putative TATA box, detected in nondiseased individuals and patients with hereditary breast cancer, decreases promoter activity of the 17 beta-hydroxysteroid dehydrogenase type 1 gene 2 (EDH17B2) in vitro. Genomics. 1994, 23: 250-2.
Campbell KL, Foster-Schubert KE, Makar KW, Kratz M, Hagman D, Schur EA, et al: Gene expression changes in adipose tissue with diet- and/or exercise-induced weight loss. Cancer Prev Res (Phila). 2013, 6: 217-31.
Boldt AB, Culpi L, Tsuneto LT, de Souza IR, Kun JF, Petzl-Erler ML: Diversity of the MBL2 gene in various Brazilian populations and the case of selection at the mannose-binding lectin locus. Hum Immunol. 2006, 67: 722-34.
Sziller I, Babula O, Hupuczi P, Nagy B, Rigo B, Szabo G, Papp Z, et al: Mannose-binding lectin (MBL) codon 54 gene polymorphism protects against development of pre-eclampsia, HELLP syndrome and pre-eclampsia-associated intrauterine growth restriction. Mol Hum Reprod. 2007, 13: 281-5.
Cervera A, Planas AM, Justicia C, Urra X, Jensenius JC, Torres F, et al: Genetically-defined deficiency of mannose-binding lectin is associated with protection after experimental stroke in mice and outcome in human stroke. PLoS One. 2010, 5: e8433-
Fernandez-Real JM, Straczkowski M, Vendrell J, Soriguer F, Perez Del Pulgar S, Gallart L, et al: Protection from inflammatory disease in insulin resistance: the role of mannan-binding lectin. Diabetologia. 2006, 49: 2402-11.
Kavlie A, Hiltunen L, Rasi V, Prydz H: Two novel mutations in the human coagulation factor VII promoter. Thromb Haemost. 2003, 90: 194-205.
El-Hagracy RS, Kamal GM, Sabry IM, Saad AA, Abou El Ezz NF, Nasr HA: Tissue factor, tissue factor pathway inhibitor and factor VII activity in cardiovascular complicated type 2 diabetes mellitus. Oman Med J. 2010, 25: 173-8.
Chu AJ: Tissue factor, blood coagulation, and beyond: an overview. Int J Inflam. 2011, 2011: 367284-
Arnaud E, Barbalat V, Nicaud V, Cambien F, Evans A, Morrison C, et al: Polymorphisms in the 5' regulatory region of the tissue factor gene and the risk of myocardial infarction and venous thromboembolism: the ECTIM and PATHROS studies. Etude Cas-Temoins de l'Infarctus du Myocarde. Paris Thrombosis case-control Study. Arterioscler Thromb Vasc Biol. 2000, 20: 892-8.
Martiney JA, Cerami A, Slater AF: Inhibition of hemozoin formation in Plasmodium falciparum trophozoite extracts by heme analogs: possible implication in the resistance to malaria conferred by the beta-thalassemia trait. Mol Med. 1996, 2: 236-46.
Altunoglu E, Muderrisoglu C, Erdenen F, Ulgen E, Ar MC: The impact of obesity and insulin resistance on iron and red blood cell parameters: a single center, cross-sectional study. Turk J Haematol. 2014, 31: 61-7.
Lee EB, Mattson MP: The neuropathology of obesity: insights from human disease. Acta Neuropathol. 2014, 127: 3-28.
Takagi Y, Kinoshita K, Ozaki N, Seino Y, Murata Y, Oshida Y, et al: Mice deficient in proglucagon-derived peptides exhibit glucose intolerance on a high-fat diet but are resistant to obesity. PLoS One. 2015, 10: e0138322-
Bucher P: Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J Mol Biol. 1990, 212: 563-78.
Stewart JJ, Stargell LA: The stability of the TFIIA-TBP-DNA complex dependent on the sequence of the TATAAA element. J Biol Chem. 2001, 276: 30078-84.
Mogno I, Vallania F, Mitra RD, Cohen BA: TATA is a modular component of synthetic promoters. Genome Res. 2010, 20: 1391-7.
Karas H, Knuppel R, Schulz W, Sklenar H, Wingender E: Combining structural analysis of DNA with search routines for the detection of transcription regulatory elements. Comput Appl Biosci. 1996, 12: 441-6.
Kim JL, Nikolov DB, Burley SK: Co-crystal structure of TBP recognizing the minor groove of a TATA element. Nature. 1993, 365: 520-7.
Deyneko IV, Kel AE, Bloecker H, Kauer G: Signal-theoretical DNA similarity measure revealing unexpected similarities of E. coli promoters. In Silico Biol. 2005, 5: 547-55.
Ponomarenko JV, Ponomarenko MP, Frolov AS, Vorobyev DG, Overton GC, Kolchanov NA: Conformational and physicochemical DNA features specific for transcription factor binding sites. Bioinformatics. 1999, 15: 654-68.
Ponomarenko MP, Ponomarenko JV, Frolov AS, Podkolodny NL, Savinkova LK, Kolchanov NA, et al: Identification of sequence-dependent DNA features correlating to activity of DNA sites interacting with proteins. Bioinformatics. 1999, 15: 687-703.
Sokolenko AA, Sandomirskii II, Savinkova LK: Interaction of yeast TATA-binding protein with short promotor segments. Mol Biol (Mosk). 1996, 30: 279-85.
Pybus M, Dall'Olio GM, Luisi P, Uzkudun M, Carreno-Torres A, Pavlidis P, et al: 1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans. Nucleic Acids Res. 2014, 42: D903-9.
Rhee HS, Pugh BF: Comprehensive genome-wide protein-DNA interactions detected at single nucleotide resolution. Cell. 2011, 147: 1408-19.
Holvoet P: Relations between metabolic syndrome, oxidative stress and inflammation and cardiovascular disease. Verh K Acad Geneeskd Belg. 2008, 70: 193-219.
Pugh BF: Purification of the human TATA-binding protein, TBP. Methods Mol Biol. 1995, 37: 359-67.
Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, et al: GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 2012, 22: 1760-74.
NCBI Resource Coordinators: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2015, 43: D6-17.
Missala I, Kassner U, Steinhagen-Thiessen E: A systematic literature review of the association of lipoprotein(a) and autoimmune diseases and atherosclerosis. Int J Rheumatol. 2012, 2012: 480784-
Waardenberg AJ, Basset SD, Bouveret R, Harvey RP: CompGO: an R package for comparing and visualizing Gene Ontology enrichment differences between DNA binding experiments. BMC Bioinformatics. 2015, 16: 275-
The authors are grateful to Victor V. Gulevich for English translation and to Nikolai A. Shevchuk for editing. This work was supported by grant 14-04-00485 from the Russian Foundation for Basic Research to LKS, OVA, TVA, and MPP. DAR and NAK acknowledge the support of the software development by the Russian Scientific Foundation, project 14-24-00123. Data compilation and analysis were supported by Projects VI.58.1.2 from the Russian State Budget for IAD.
Publication of this article has been funded by the Russian Science Foundation.
This article has been published as part of BMC Genomics Volume 16 Supplement 13, 2015: Selected articles from the 7th International Young Scientists School "Systems Biology and Bioinformatics" (SBB'2015): Genomics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/16/S13.
The authors declare that they have no competing interests.
MPP performed the sequence analysis and helped to draft the manuscript. OVA, IAD, and TVA carried out the in vitro experiments. DAR designed, developed, maintained, adapted, and tuned the software for both sequence and data analysis. PMP designed and performed the statistical analysis. LKS drafted the manuscript and designed and coordinated the in vitro experiments. NAK conceived of and supervised the study. All coauthors read and approved the final version of the manuscript.
Electronic supplementary material
Additional file 1: Table S1. Obesity-related known and candidate SNP markers altering affinity of the TATA-binding protein (TBP) for human gene promoters. (PDF 219 KB)
Additional file 2: . The obesity-related candidate SNP markers (in the human LEP gene promoter) predicted using SNP_TATA_Comparator . (A) The only promoter of the human LEP gene and the six unannotated SNPs (analyzed in this study) in the region [-70; -20] (double-headed arrow, ↔) where all proven TBP-binding sites are located. Single-headed arrow (→): transcription start site (TSS), box: TATA-like subsequence TATAA, ellipse: three possible obesity-related SNP markers predicted in this work. (B-D) The results produced by our Web service SNP_TATA_Comparator  for the three possible obesity-related SNP markers (rs200487063, rs201381696, and rs34104384) located in the human LEP gene promoter. The symbols are explained in the legend of Fig. S2 (see Methods; Additional file 3). (PNG 534 KB)
Additional file 3: . The result produced by SNP_TATA_Comparator  for a known SNP marker of obesity . Legend: Solid, dotted, and dashed arrows indicate queries for the gene list, list of transcripts of a certain gene, and DNA sequence of the promoter corresponding to the specified transcript of the gene in Ensembl  and GENCODE  editions of the reference human genome hg19, respectively. Dash-and-dot arrows: estimates of significance of the aberration of gene product abundance in patients with the minor allele (relative to the ancestral allele: reference human genome hg19) expressed as Fisher's Z-score. Two circles indicate the ancestral allele (-35A) and minor allele (-35c) of this SNP marker of obesity (-35A→c); this SNP causes underexpression the human APOA1 gene . (PNG 961 KB)
Additional file 4: Supplementary method. A quantitative estimate of binding affinity of TATA-binding protein (TBP) for a promoter of the human gene as a function of DNA sequence of this promoter. (PDF 162 KB)
Additional file 5: Figure S3. A flow chart of the keyword search for comorbidities of obesity where biochemical markers correspond to a change in expression of a given gene containing the SNP marker of interest. (PNG 333 KB)
About this article
Cite this article
Arkova, O.V., Ponomarenko, M.P., Rasskazov, D.A. et al. Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters. BMC Genomics 16, S5 (2015). https://doi.org/10.1186/1471-2164-16-S13-S5
- TATA-binding protein
- single nucleotide polymorphism
- TBP/promoter affinity change
- statistical significance
- candidate SNP marker
- prediction in silico
- verification in vitro