Many obesity-associated SNPs strongly associate with DNA methylation changes at proximal promoters and enhancers

Voisin, Sarah; Almén, Markus Sällman; Zheleznyakova, Galina Y.; Lundberg, Lina; Zarei, Sanaz; Castillo, Sandra; Eriksson, Fia Ence; Nilsson, Emil K.; Blüher, Matthias; Böttcher, Yvonne; Kovacs, Peter; Klovins, Janis; Rask-Andersen, Mathias; Schiöth, Helgi B.

doi:10.1186/s13073-015-0225-4

Research
Open access
Published: 08 October 2015

Many obesity-associated SNPs strongly associate with DNA methylation changes at proximal promoters and enhancers

Sarah Voisin¹,
Markus Sällman Almén^1,2,
Galina Y. Zheleznyakova¹,
Lina Lundberg¹,
Sanaz Zarei¹,
Sandra Castillo¹,
Fia Ence Eriksson¹,
Emil K. Nilsson¹,
Matthias Blüher³,
Yvonne Böttcher³,
Peter Kovacs³,
Janis Klovins⁴,
Mathias Rask-Andersen¹ &
…
Helgi B. Schiöth¹

Genome Medicine volume 7, Article number: 103 (2015) Cite this article

11k Accesses
97 Citations
8 Altmetric
Metrics details

Abstract

Background

The mechanisms by which genetic variants, such as single nucleotide polymorphisms (SNPs), identified in genome-wide association studies act to influence body mass remain unknown for most of these SNPs, which continue to puzzle the scientific community. Recent evidence points to the epigenetic and chromatin states of the genome as having important roles.

Methods

We genotyped 355 healthy young individuals for 52 known obesity-associated SNPs and obtained DNA methylation levels in their blood using the Illumina 450 K BeadChip. Associations between alleles and methylation at proximal cytosine residues were tested using a linear model adjusted for age, sex, weight category, and a proxy for blood cell type counts. For replication in other tissues, we used two open-access datasets (skin fibroblasts, n = 62; four brain regions, n = 121–133) and an additional dataset in subcutaneous and visceral fat (n = 149).

Results

We found that alleles at 28 of these obesity-associated SNPs associate with methylation levels at 107 proximal CpG sites. Out of 107 CpG sites, 38 are located in gene promoters, including genes strongly implicated in obesity (MIR148A, BDNF, PTPMT1, NR1H3, MGAT1, SCGB3A1, HOXC12, PMAIP1, PSIP1, RPS10-NUDT3, RPS10, SKOR1, MAP2K5, SIX5, AGRN, IMMP1L, ELP4, ITIH4, SEMA3G, POMC, ADCY3, SSPN, LGR4, TUFM, MIR4721, SULT1A1, SULT1A2, APOBR, CLN3, SPNS1, SH2B1, ATXN2L, and IL27). Interestingly, the associated SNPs are in known eQTLs for some of these genes. We also found that the 107 CpGs are enriched in enhancers in peripheral blood mononuclear cells. Finally, our results indicate that some of these associations are not blood-specific as we successfully replicated four associations in skin fibroblasts.

Conclusions

Our results strongly suggest that many obesity-associated SNPs are associated with proximal gene regulation, which was reflected by association of obesity risk allele genotypes with differential DNA methylation. This study highlights the importance of DNA methylation and other chromatin marks as a way to understand the molecular basis of genetic variants associated with human diseases and traits.

Background

Genome-wide association studies (GWASs) have identified a plethora of common genetic variants that are associated with obesity-associated traits (e.g., body mass index (BMI) [1–11], fat mass [12, 13], low lean body mass [14], blood lipid levels [15], waist circumference [13, 16], BMI-adjusted waist-to-hip ratio [17, 18]). Some of these single nucleotide polymorphisms (SNPs) are located near genes whose role in obesity is well established, such as MC4R [19]. However, most of these SNPs are located near genes whose role in obesity is still unclear, and the mechanisms through which they act remain unknown. Part of this lack of understanding may be due to a focus on the genes in closest proximity to these SNPs. Actually, these SNPs may regulate genes that are located quite far away, as recently demonstrated for genetic variants within FTO. In human brains, obesity-associated SNPs in FTO were found to be associated with expression of IRX3, a gene located more than half a million base pairs downstream of the body mass-associated genetic locus [20]. Another instance is the rs4537545 SNP previously associated with coronary heart disease [21] and located within IL6R: this SNP was recently found to be associated with blood mRNA levels of ATP8B2, a gene located 115 kb away [22]. Thus, obesity-associated SNPs might act through long-range interactions (for example, by disrupting enhancers) and potentially through epigenetic mechanisms.

The epigenome represents the pattern of chemical and structural modifications to DNA that are heritable through mitosis and/or meiosis, but that do not entail changes in DNA sequence. Epigenetic mechanisms encompass DNA methylation, histone modifications, and non-coding RNAs, and have the potential to modify gene expression. Recent attention has been drawn to the possible role of epigenetics in the pathogenesis of obesity [23, 24]. Moreover, while the epigenome is known to be modulated by the environment, this modulation can also be affected by genetic variants. Studies in brain [25–28], adipose tissue [29, 30], blood [26, 31, 32], lung [33], fibroblasts [34, 35], T cells [35], leukocytes [36], and lymphoblastoid cells [35, 37] have shown that the genome contains quantitative trait loci (QTLs) for DNA methylation, also called methylation QTLs (meQTLs). DNA methylation levels correlate with the presence of specific alleles at nearby SNPs, and meQTLs tend to locate outside of promoters, especially in intergenic regions. In a study conducted in adipose tissue [29], meQTLs overlapping metabolic disease loci were enriched in histone marks predictive of genetic enhancers. Interestingly, top associations from a GWAS of bipolar disorder were enriched in meQTLs [38], suggesting that this could be a powerful approach to better understand the molecular basis of candidate SNPs from GWASs.

In the present study, we tested associations between 52 SNPs that were previously identified in GWASs or meta-analyses to be associated with obesity traits, and proximal DNA methylation in whole blood of 355 healthy young individuals. We then tested the tissue specificity of the majority of these associations in four brain regions (n = 121–133), visceral adipose tissue (VAT; n = 149), subcutaneous adipose tissue (SAT; n = 149) and fibroblasts (n = 62). Finally, the genomic context of associated CpG sites was explored, using chromatin segmentation on publicly available histone marks from 11 tissues and long-range interactions from five cell lines.

Methods

Discovery study group

Ethics, consent and permissions

The discovery study group comprised two sub-groups of healthy young Caucasians from two different age ranges (Table 1). All participants and their guardians gave informed written consent and the study was approved by the local ethics committee in Uppsala, EPN, diary number 2011/446; this study was conducted in accordance with the principles of the Declaration of Helsinki. The first sub-group comprised 130 individuals aged 14–16 years who were recruited by visiting schools in Uppsala county and by post. Two 6-ml blood samples were drawn for genotyping and DNA methylation measurement, at any time during the day. The other sub-group comprised 225 individuals of white European descent aged 18–34 years, also recruited in Uppsala. Subjects were fasting (at least 10 h) when blood samples were taken for genotyping and DNA methylation measurement. For individuals aged under 18 years, we used Cole et al.’s definition to determine weight category [39]. For individuals aged 18 years and older, the following cutoffs were used: lean, BMI < 25; overweight, 25 ≤ BMI < 30; obese, BMI ≥ 30. We chose to use weight category instead of BMI since our cohort includes individuals aged under 18 years whose BMI scales differ from the BMI scales of individuals aged over 18 years.

Table 1 Description of the discovery samples

Full size table

Genotyping

We selected 52 SNPs that have been associated by GWASs or meta-analyses of GWASs with obesity-associated traits (BMI [1–10], BMI-adjusted waist-to-hip ratio [18], fat mass [12], low lean body mass [14], blood lipid levels [15] and waist circumference [16]) and the discovery study group was genotyped for these SNPs (Additional file 1). Genotyping of the 52 SNPs was carried out at the SNP technology platform at Uppsala University [40] using an Illumina Golden Gate Assay (Illumina Inc., San Diego, CA, USA). There were missing genotypes for 8 of the 52 tested SNPs, ranging from one individual to 52 individuals with missing genotypes (Additional file 1). Individuals with missing genotypes were removed from the analysis.

DNA methylation profiling

The genome-wide Illumina Infinium HumanMethylation450 BeadChip (Illumina), which allows interrogation of 485,512 CpG dinucleotides covering 25,953 genes, was applied to determine the methylation profile of genomic DNA isolated and purified from the peripheral whole blood. This chip has been shown to give a reliable and reproducible estimation of the methylation profile on a genomic scale [15]. First, bisulfite conversion of genomic DNA was performed using the EZ DNA Methylation-Gold™ Kit (Zymo Research) according to the manufacturer’s protocol. Briefly, 500 ng of DNA was sodium bisulfite-treated, denatured at 98 °C for 10 min, and bisulfite converted at 64 °C for 2.5 h. After conversion, samples were desulfonated and purified using column preparation. Approximately 200 ng of bisulfate-converted DNA was processed according to the Illumina Infinium Methylation Assay protocol. This assay is based on the conversion of unmethylated cytosine (C) nucleotides into uracil/thymine (T) nucleotides by the bisulfite treatment. The DNA was whole-genome amplified, enzymatically fragmented, precipitated, resuspended, and hybridized overnight at 48 °C to locus-specific oligonucleotide primers on the BeadChip. After hybridization, the C or T nucleotides were detected by single-base primer extension. The fluorescence signals corresponding to the C or T nucleotides were measured from the BeadChips using the Illumina iScan scanner. Phenotypes, genotypes, raw data, and processed DNA methylation data are available through the Gene Expression Omnibus (GEO) database [41] with accession number [GEO:GSE73103].

DNA methylation processing

All downstream data processing and statistical analyses were performed with the statistical software R [42] together with the minfi [43], ChAMP [44], sva [45], and MethylAid [46] packages of the Bioconductor project.

Background correction and adjustment of type I and type II probes

Fluorescence data were preprocessed using the GenomeStudio 2009.2 (Illumina) software. First, we background corrected the data using NOOB [47]. In the Illumina Infinium HumanMethylation450 BeadChip array, the probes come in two different designs, characterized by widely different DNA methylation distributions and dynamic range, which may bias downstream analyses. Therefore, we applied the BMIQ algorithm to adjust for the two different probe designs [48].

Removal of batch effects

The plates on which samples are run introduce a known batch effect that is important to correct for. We used the ComBat function to adjust directly for this batch effect [45].

Principal component analysis

We performed a principal component analysis (PCA) using the PCA function of the FactoMineR package [49], first calculating the covariance matrix between all samples using only the most variable autosomal CpG sites, measured in terms of their 95 % reference range: the range of methylation values observed in the central 95 % of the samples or, more precisely, the difference between the 97.5 and 2.5 % percentiles. Using a 95 % reference range of at least 0.20, 103,408 CpG sites were used in the covariance matrix calculation. Together, the two first principal components explain over 39 % of the total variance. Each subsequent vector does not add substantially to the variance explained: 285 vectors would be necessary to explain 95 % of the total variance.

Sample exclusion

We excluded from association analyses: (1) samples that were outliers in any one of the quality control plots generated by MethylAid [46] (rotated M versus U plot, overall sample-dependent control plot, bisulfite conversion control plot, overall sample-independent control plot and detection p value plot) using the default thresholds (0 samples); (2) samples that were outliers with respect to any one of the first eight principal components (corresponding to the approximate location of the elbow of the eigenvalue scree plot; six samples). After exclusion of samples, we were left with 349 samples: 128 from the first sub-group (29 % males; mean age ± standard deviation 15.3 ± 0.64 years) and 221 from the second sub-group (78 % males; mean age ± standard deviation 23.6 ± 3.3 years).

Probe exclusion

We removed probes with missing β values, probes having less than 75 % of samples with detection p value < 0.01, and probes located on the sex chromosomes. Using the annotation generated by Chen et al. [50], we also removed cross-reactive probes and probes containing SNPs with minor allele frequency > 1 % in European populations. In total, 397,615 probes were included in the analysis.

Choice of investigated CpGs

We selected the probes within 500 kb of each SNP. A total of 8485 probes were analyzed, with an average of 163 CpGs per SNP (Additional file 1).

Cell-type proportions

Because differences in cell-type proportions between DNA samples can confound association results [51], we adjusted our analyses using a surrogate for cell-type proportions derived from 43 differentially methylated CpG sites present on the HumanMethylation450 array that have the ability to discriminate between blood cell types [52]. As a surrogate for cell-type proportions, and to reduce the number of variables, we used the first two principal components associated with these 43 sites that together explain over 70 % of the total variance in methylation at these 43 CpG sites.

To verify that the first two principal components that we derived from the list of 43 differentially methylated CpG sites [52] can indeed serve as a surrogate for blood cell proportions, we tested for associations between the principal components and the methylation levels at all of our sites, adjusting our analyses for sex, age, weight category, and batch. We selected the top 10 % of the sites that showed the strongest associations (49,035 sites, all associated at levels p < 10⁻⁸) and extracted these sites in data sets of purified human leukocyte subtypes [53] [GEO:GSE39981]; 2564 sites were overlapping. A dendrogram representation of our top sites in this data set [53] reveals clear clustering of samples according to cell type, indicating a good ability for principal components to discriminate between samples with different cell compositions (Additional file 2).

Validation of methylation with bisulfite sequencing

The methylation levels of two of the associated CpG sites (cg15576492 and cg2204028, at position chr1:1015257–1015540) were validated using bisulfite sequencing. The sequences including target CpG sites were obtained from the University of California, Santa Cruz (UCSC) Genome Browser database. The sequences (bisulfite-converted DNA template) for the primers were forward (biotin labeled)-5′-ATGGATGTTGGTGTGAGTATT-3′ and reverse 5′-CCCTCTACACATCTAAACCCT-3′. Bisulfite sequencing primers were designed with Methyl Primer Express® v.1.0 (Applied Biosystems) so that the amplicons covered target CpG sites. These regions were PCR amplified in duplicate from bisulfite-treated DNA. Similar efficiency in PCR amplification for unmethylated and methylated fragments was controlled for using Human Methylated & Non-methylated DNA Set (Zymo Research). PCR reactions were performed in a final volume of 25 μl and contained 2.5 μl of bisulfite-treated DNA (10–15 ng/μl), 0.05 μl of each primer (100 pmol/μl), 1 μl DMSO, 0.5 μl of SYBR Green I (1:50,000; Invitrogen, Sweden) in TE buffer (pH 7.8), 0.25 μl of 25 mM dNTP mix (Fermentas), 2.5 μl 10× buffer, 4 μl of 25 mM MgCl₂, 1 U of Hot Start Taq DNA polymerase (Thermo Scientific). Cycling conditions were as follows: 10-min initial denaturation step at 95 °C, followed by 45 cycles of 95 °C for 20 s, 30 s at optimal annealing temperature of primers, 20–45 s at 72 °C, 5 min of final elongation at 72 °C. Fluorescence was measured after the elongation phase. Melting curve analysis consisted of 81 cycles of 10 s at 55 °C with increasing increments of 0.5 °C per cycle. Bio-Rad iQ5 version 2.0 software (Bio-Rad Laboratories) was used to process real-time PCR data.

Amplicons were purified using GeneJET PCR Purification Kit (Thermo Scientific).

DNA sequencing was performed using BigDye® Terminator v.3.1 Cycle Sequencing Kit (Applied Biosystems) on an ABI3730XL DNA Analyzer (Applied Biosystems) at Uppsala Genome Center. Cycle sequencing was as follows: 30 s initial denaturation step at 94 °C, followed by 35 cycles of 94 °C for 25 s, 50 °C for 15 s, 60 °C for 120 s. Each sample was sequenced twice and the two methylation levels were averaged. Amplification primers were used for sequencing. All samples were analyzed in duplicates on different plates and the mean methylation levels in percentage per sample were used for further analyses. Methylation levels of CpG sites for all amplicons were quantified using Epigenetic Sequencing Methylation analysis software [54]. The software was repeatedly used to determine the methylation profile of several genes [55, 56]. The software algorithm analyzes the methylation percentage of each CpG site in an amplicon without cloning stage.

Replication study groups

VAT and SAT

VAT and SAT samples were used to test specifically the association between alleles at rs1011731 and methylation at cg13446689. Paired samples of VAT and SAT from 149 Caucasian subjects (35 % male) who underwent open abdominal surgery were included in the study. This subset is part of a study group that had already been genotyped for rs1011731, described in detail elsewhere [57]. Thirty-two individuals were lean (aged 63 ± 11 years, BMI 22.1 ± 2.5 kg/m²), 22 were overweight (67 ± 12 years, BMI 27.1 ± 1.4 kg/m²) and 94 were obese (age 47 ± 13 years, BMI 48.1 ± 9.7 kg/m²); BMI was missing for one individual and 46 subjects had diabetes type 2. Patients with severe conditions, including generalized inflammation or end-stage malignant diseases, were excluded from the study. Samples of VAT and SAT were immediately frozen in liquid nitrogen after explantation. The study was approved by the ethics committee of the University of Leipzig and all subjects gave written informed consent.

Genomic DNA was extracted from frozen adipose tissue samples using GenElute™ Mammalian Genomic DNA Miniprep Kit (SIGMA-ALDRICH, USA). All samples were bisulfite converted using Qiagen Epitect Bisulfite Kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol and applied to whole bisulfitome amplification (EpiTect Whole Bisulfitome Kit, Qiagen, Hilden, Germany). Finally, all samples were purified using GenElute PCR Clean-up Kit (Sigma-Aldrich, USA). Methylation levels of cg13446689 were determined using a custom designed PyroMark CpG assay (Qiagen, Hilden, Germany). The sequences (bisulfite-converted DNA template) for the primers were forward (biotin labeled)-5′-AAGTGATGGGAGTTGTTGG-3′ and reverse 5′- ACCCCAAAACAATTCAAACAAACCATA-′3. Using the sequencing primer 5′-ACAATTCAAACAAACCATACTTA-3′ the following sequence was analyzed (5′- CACAAC[R]ACTAACTAATCTATAC[R]ACCTCAAACCAAAAACAACAACCAACAACTCC-3′). The pyrosequencing was run on a PyroMark Q24 (Qiagen, Hilden, Germany). All samples were analyzed in duplicates on different plates and the mean methylation levels in percentage per sample were used for further analyses. Water was used as a non-template control using the same PCR conditions.

Fibroblasts

Methylation, SNP genotyping, and gene expression data from primary skin fibroblasts from Caucasian individuals (n = 62) [34] were obtained from GEO (accession number [GEO:GSE53261]).

Brain regions (cerebellum, frontal cortex, caudal pons, and temporal cortex)

SNP genotyping data from four different brain regions (n = 121–133) [58] were obtained from dbGAP (accession number phs000249.v1.p1). All individuals were of Caucasian descent, but two individuals from the cerebellum study samples were of African and Asian descent, respectively. We removed these two individuals from our analysis. Methylation data were obtained from GEO (accession number [GEO:GSE15745]).

Annotation

Genes

The genomic positions of RefSeq genes were downloaded from the UCSC genome browser, and the location of each CpG site was determined as promoter (within 1500 bp of the transcription start site (TSS)), gene body, intergenic, or ambiguous (overlapping a promoter and a gene body).

Linkage disequilibrium

Linkage disequilibrium (LD) data were obtained from SNAP Proxy, using CEU as the “population panel” and the 1000 Genomes Pilot 1 as “SNP dataset” [59].

Chromatin states

ChromHMM [60] was applied for seven publicly available histone modifications (H3K4me1, H3K4me3, H3K9ac, H3K9me3, H3K27ac, H3K27me3, and H3K36me3) from 11 tissues: adipose nuclei (AN), pancreatic islets (PI), peripheral blood mononuclear primary cells (PBMC), skeletal muscle (SM), liver, brain angular gyrus (BrainAG), brain anterior caudate (BrainAC), brain cingulate gyrus (BrainCG), brain hippocampus (BrainHIPPO), brain inferior temporal lobe (BrainITL), and brain substantia nigra (BrainSN). Data were downloaded from NIH Roadmap Epigenomics Project Data Listings. An 18-state model was learned from all binarized data and was used to produce segmentations based on the most likely state assignment of the model. Then, each state was assigned to one of the following seven categories: enhancer, active TSS/poised TSS/flanking TSS, active transcription, quiescent, heterochromatin, Polycomb-repressed, ZNF genes/repeats.

Ubiquitous, tissue-specific, and cell-specific in vivo transcribed enhancers

Ubiquitous, tissue-specific (adipose tissue, blood, brain, liver, pancreas, and skeletal muscle) and cell type-specific (preadipocytes, fat cells, hepatocytes, and skeletal muscle cells) enhancers, as well as TSS–enhancer associations, as defined by CAGE tags in the FANTOM5 project, were downloaded from the Transcribed Enhancer Atlas website [61, 62].

Long-range interactions

We used publicly available chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) libraries to map long-range interactions in five different cell lines, with three different transcription factors [63] (Additional file 3). Data were downloaded from the WashU Epigenome Browser.

Expression QTLs

We used the following publicly available expression QTL (eQTL) browsers to see whether any of the associated SNPs or SNPs in strong linkage with them (r² > 0.8) were eQTLs for our genes of interest: the eQTL browser of the Genotype-Tissue Expression (GTEx) project [64], the eQTL Browser of the National Center for Biotechnology Information, the eQTL resources from the Gilad/Pritchard group [65], and the blood eQTL browser developed by Westra et al. [66].

Statistics

For statistical analysis, we used the log₂ ratio of the intensities of methylated probe and unmethylated probe, also called M value, which is more statistically valid for the differential analysis of methylation levels [67].

Linear model

We developed the following linear model for each CpG site k:

$$ {M}_k={a}_k+{b}_{kS}S+{b}_{kA}A+{b}_{kW}W+{b}_{kG}G+{b}_{kPC1}PC1+{b}_{kPC2}PC2+{\varepsilon}_k $$

where M_k is the M value of CpG site k, S is the dichotomized sex (female = 1 and male = 0), A is the age, W is the weight category (normal weight = 0, overweight = 1, obese = 2), G is the genotype at the investigated SNP (homozygotes for non-risk allele = 0, heterozygotes = 1, homozygotes for risk allele = 2), PC1 and PC2 are the first two principal components derived from the list of 43 differentially methylated CpG sites in blood cell types, and ε_k is the unexplained variability. We chose to use weight category instead of BMI since our study samples include individuals aged under 18 years whose BMI scales differ from the BMI scales of individuals aged over 18 years. Rare homozygous genotypes (count of less than 10) were combined with heterozygotes.

The coefficients b _kx summarize the association between methylation levels and the variables of interest. The p value for the SNP was determined using a likelihood ratio test, using the lrtest function of the lmtest package [68], and we report the effect size as the proportion R² of the CpG methylation variance that is explained by the SNP, among the variance not already explained by the covariates. To control the proportion of false positives, q values were calculated using the qvalue function of the qvalue package [69]. A SNP was considered significant if its q value was < 0.05.

Enrichment of associated CpGs in genomic regions, in vivo transcribed enhancers, and chromatin states

To test whether associated CpGs were enriched or underrepresented in different genomic regions (promoter, gene body, etc.), chromatin states (enhancer, TSS, heterochromatin, etc.) and in vivo transcribed enhancers, we used Fisher’s exact test. To control the proportion of false positives, q values were calculated using the qvalue function of the qvalue package [69]. Significance was considered at a q value < 0.05.

Number of long-range interactions

The distributions of the numbers of long-range interactions per CpG were skewed. Thus, to see whether associated CpGs had a higher or lower number of long-range interactions, we used Mann–Whitney U-test.

Power calculations

We used the pwr.f2.test function of the pwr package in R to determine the statistical power in the replication datasets (fibroblasts, brain and SAT/VAT).

Results

Obesity-associated SNPs associate with methylation at proximal CpGs in whole blood samples from healthy individuals

We tested associations between 52 obesity-associated SNPs and M values of all CpG sites 500 kb upstream and 500 kb downstream of each SNP in the blood of 355 individuals (Table 1), using a linear regression model adjusted for age, sex, blood cell type surrogate, and weight category (i.e., lean, overweight, or obese) instead of BMI since our study samples include individuals aged under 18 years whose BMI scales differ from the BMI scales of individuals aged over 18 years. In total, 8485 probes were tested, with an average of 163 probes per SNP (Additional file 1). Methylation levels at 107 CpGs associated with genotypes at 28 SNPs (likelihood ratio test, q value < 0.05; Additional file 4) and most of the associations were between SNPs and CpGs that are close to each other (50 % of the associations are between SNPs and CpGs that are within 40 kb of each other; Fig. 1). Also, the closer the SNP and CpG are, the stronger the statistical significance is (Fig. 1). One example of these SNP–CpG associations is depicted in Fig. 2. The rs713586 SNP explains 53.8 % of the total variance in methylation at cg01884057, with carriers of the risk allele (C) at rs713586 having higher methylation.

The two sub-groups that were pooled for the discovery analysis were of two different age ranges (see “Methods”), but they did not significantly differ in terms of global DNA methylation patterns, as shown by PCA (Additional file 5). To make sure that the two sub-groups were comparable and could effectively be combined for the discovery analysis, we tested the significance of the 107 CpGs separately in each. SNP effects were in the same directions for all 107 CpGs in the two separate sub-groups; 105 of the 107 CpGs were significant (raw p value < 0.05) in the first, while 86 of the 107 CpGs were significant (raw p value < 0.05) in the second. This suggests that our results are not driven by a specificity of one of the two sub-groups and that it was reasonable to pool them for the discovery analysis.

Genomic context of CpGs associated with obesity-associated SNPs

To understand the functional significance of the CpGs associated with alleles at obesity-associated SNPs, we analyzed their genomic location in relation to genes, chromatin states in 11 tissues, ubiquitous, tissue-specific, or cell-specific in vivo transcribed enhancers, and long-range interactions in five cell lines.

CpGs associated with obesity-associated SNPs are depleted in promoters and enriched in intergenic regions

Thirty-eight of the associated CpGs were located in gene promoters (MIR148A, BDNF, PTPMT1, NR1H3, MGAT1, SCGB3A1, HOXC12, PMAIP1, PSIP1, RPS10-NUDT3, RPS10, SKOR1, MAP2K5, SIX5, AGRN, IMMP1L, ELP4, ITIH4, SEMA3G, POMC, ADCY3, SSPN, LGR4, TUFM, MIR4721, SULT1A1, SULT1A2, APOBR, CLN3, SPNS1, SH2B1, ATXN2L, and IL27), including eight also located in a gene body (Additional file 4). Thus, associated CpGs were underrepresented in promoters (28 % of CpGs, Fisher’s exact test p value = 0.0097). In contrast, 31 associated CpGs were located in intergenic regions, which is more than expected by chance (30 % of CpGs, Fisher’s exact test p value = 0.0087; Fig. 3). This is consistent with previous studies on meQTLs [27, 30].

CpGs associated with obesity-associated SNPs are enriched in enhancers in PBMCs

The activity of functional genomic elements is associated with the state of the chromatin at these sites, such as histone modification patterns and access of transcription factors to DNA. The recently developed chromHMM tool allows interpreting chromatin states in a particular tissue or cell type by integrating histone marks and transcription factor binding data [60]. Using seven publicly available histone marks in 11 tissues relevant in the pathogenesis of obesity (AN, six brain regions, liver, PBMCs, PIs, and SM), we interpreted the chromatin states of all regions containing the tested CpGs (Additional file 6). Consistent with the enrichment of associated CpGs in intergenic regions (Fig. 3), associated CpGs were enriched in enhancers in PBMCs (Fisher’s exact test, q value = 0.0019) (Fig. 4).

Only one CpG associated with obesity-associated SNPs is located in in vivo transcribed enhancers

The enrichment of associated CpGs in enhancers is a prediction by chromHMM that relies on histone marks, but we wanted to test whether associated CpGs were also found in active enhancers, as defined by cap-analysis of gene expression (CAGE) in the FANTOM5 project [61]. cg04588972, whose methylation was lower in carriers of the risk allele at rs1878047, was in a ubiquitous enhancer showing long-range interactions with the TSS of KLK14, IGLON5, LRRC4B, and SYT3. However, associated CpGs were not underrepresented in ubiquitous, tissue-specific, or cell-specific enhancers (Fisher’s exact tests, all p values > 0.05). FANTOM5 uses very stringent criteria to detect active enhancers using whole transcriptome sequencing [61], thus possibly explaining why none of the associated CpGs were in active enhancers as defined by CAGE in the FANTOM5 project. Indeed, chromHMM predicted 33–266 times more active enhancers from the FANTOM5 project depending on the tissue, and there was little overlap between the two.

CpGs associated with obesity-associated SNPs show long-range interactions with promoters and other genomic regions

Following the enrichment of associated CpGs at enhancers, we mapped all tested CpGs to long-range interactions as defined by ChIA-PET libraries from five cell lines and three transcription factors (Additional file 3) [63]. Of the 107 associated CpGs, 103 (96 %) were located in regions with at least one long-range interaction with another genomic region, and 73 of the 107 associated CpGs (68 %) were located in regions with at least one long-range interaction with a gene promoter (Additional file 4). For instance, five CpGs negatively associated with rs2444217 and located in enhancers in brain, PBMCs, liver, PIs, and SM showed long-range interactions with the same five gene promoters (Additional file 4; Fig. 5). Also, 6 of the 15 CpGs associated with alleles at rs3934834 were found to interact with no less than 11 promoters, and were in enhancers in PIs (Additional files 4 and 6; Fig. 6). The graphic results for all significant SNPs can be found in Additional file 7. Associated CpGs were not enriched in long-range interactions across all five cell lines and three target transcription factors (Mann–Whitney U-test p value > 0.05; Fig. 7).

Associated CpGs are located in or show long-range interactions with the promoters of genes for which the corresponding SNPs are known eQTLs

We showed that some of the associated CpGs are located in gene promoters, and some are in regions showing putative long-range interactions with gene promoters. In order to make the link between SNP, methylation, and mRNA expression, we searched four eQTL databases (see “Methods”); we browsed all associated SNPs and SNPs in strong LD with them (with r² > 0.8), and retrieved the genes for which they were eQTLs (Additional file 8). We found that associated CpGs are located in or show long-range interactions with the promoters of genes for which the corresponding SNPs are known eQTLs. For instance, rs10838738 is a known eQTL for several genes in blood, including C1QTNF4, CELF1, and NUP160. Interestingly, rs10838738 associated with three CpGs showing long-range interactions with C1QTNF4 (Additional file 4), four CpGs showing long-range interactions with CELF1 (Additional file 4), including three in enhancers in PBMCs (Additional file 6), and one CpG showing long-range interactions with NUP160 (Additional file 4) that was in an enhancer in PBMCs (Additional file 4). Another example is rs713586, a known eQTL for ADCY3 in blood and monocytes. rs713586 associated with a CpG located in the promoter of ADCY3 (Additional file 4) that was also promoter-associated in PBMCs (Additional file 6).

Genome-scale measurements are validated by bisulfite sequencing

We validated one of the tested CpGs (cg15576492) by bisulfite sequencing, using DNA from 17 individuals from the discovery study group who were homozygous for rs3934834 (six A/A and 11 G/G). Our criteria for choosing this site were the following: 1) significant association with risk alleles; 2) strongest association with risk alleles; 3) located in a gene promoter and/or having long-range interactions with a gene promoter. The correlation between methylation assessed by Illumina and bisulfite sequencing was good (Pearson’s correlation coefficient r = 0.68, p value = 0.0025; Additional file 9). Methylation in A/A was higher than methylation in G/G, but the methylation difference did not reach statistical significance (p value > 0.05), which could be explained by reduced statistical power (17 individuals).

SNP–CpG associations might not be blood-specific

Four of the initial SNP–CpG associations in blood are replicated in skin fibroblasts

The open-access dataset of skin fibroblasts consists of DNA methylation data assessed with the Illumina HumanMethylation450 BeadChip and genotype data assessed with the Illumina Human1M-Duov3 DNA Analysis BeadChip. Thus, we had data to test 65 of the 107 significant SNP–CpG associations in skin fibroblasts (n = 62). Fourteen SNP–CpG associations had a raw p value < 0.05, and seven had a q value < 0.05, including four having a concordant effect sign with results obtained in blood (Additional file 10). Notably, genotypes at rs1011731 associated with methylation at cg13446689 (regression coefficient = 0.254, q value = 0.012).

The single SNP–methylation association tested in SAT and VAT was not significant

The SAT and VAT study group of 149 individuals (mostly overweight/obese) was used to test specifically the association between genotypes at rs1011731 and methylation at cg13446689, which was assessed by bisulfite sequencing. We chose to test this SNP–CpG pair because there was an association between cg13446689 and rs1011731 in both blood and fibroblasts, and because this study group had already been genotyped for rs1011731. There was no association between methylation at cg13446689 in VAT or SAT and genotypes at rs1011731 (p value > 0.05; Additional file 10).

The two SNP–methylation associations tested in cerebellum, frontal cortex, caudal pons, and temporal cortex were not significant

The open-access dataset of four brain regions consists of DNA methylation assayed using the Illumina HumanMethylation27 BeadChip, and genotype data assessed with the Illumina Human1M-Duov3 DNA Analysis BeadChip. Thus, we had data to test two of the 107 associated CpGs (cg05585544 and cg11385473). There was no association between genotypes at rs10838738 and methylation at cg05585544 in any of the four brain regions; there was no association between genotypes at rs652722 and methylation at cg11385473 in any of the four brain regions (p values > 0.05; Additional file 10).

Discussion

Our findings suggest that carriers of obesity-associated risk alleles display complex alterations of the gene regulatory landscape. We find that obesity-associated SNPs can be linked to DNA methylation levels in several proximal locations, which implies that they may affect multiple genes. These SNPs associated with proximal DNA methylation levels in whole blood of healthy individuals, but these associations might not be blood-specific. Interestingly, several obesity-associated SNPs associated with CpGs that were in the promoters of genes known to participate in the pathogenesis of obesity, or were located in regions that interact with such genes. In addition, associated CpGs were enriched in enhancers in blood, which highlights their potential in gene regulation.

It is well established that DNA methylation levels correlate with the presence of specific alleles at nearby SNPs [25–37], such as Grundberg et al. [29], who found that 28 % of CpGs were associated with SNPs within 100 kb in adipose tissue. If we restrict our analysis to 100 kb, we find 103 SNP–CpG associations at a q value < 0.05, corresponding to 27 unique SNPs (52 % of tested SNPs). It would be difficult to assess whether this percentage is particularly high, but the present study shows that obesity-associated SNPs discovered in GWASs may mediate their effect through alterations of the regulation of transcription. Indeed, the global results for each significant SNP display fascinating patterns. Several obesity-associated SNPs may affect “transcription factories”, clusters of gene promoters and their enhancers that interact in three-dimensional space and are brought together by DNA-binding proteins such as CTCF [70]. The most striking example is rs7498665 since the 12 CpGs associated with this SNP are located in ten distinct gene promoters. rs3888190, one of the top loci of the most recent BMI GWAS [11], is in perfect LD with rs7498665 (r² = 1) and is known to be an eQTL for five of these ten gene promoters (APOBR [71], SH2B1 [71], SULT1A2 [72], ATXN2L [11], and TUFM [11]). Another interesting example is rs10838738, which associated with three CpGs showing long-range interactions with C1QTNF4, four CpGs showing long-range interactions with CELF1, and one CpG showing long-range interactions with NUP160. rs10838738 is a known eQTL for these three genes in blood [64, 66]. Thus, our results suggest that the effect of obesity-associated SNPs may be mediated by multiple and quite distant genes, as illustrated by three of our investigated SNPs (rs3934834, rs2287019, and rs7498665) that associated with CpGs interacting with no less than 15 promoters (Additional files 4 and 7). This underlines the importance for a rational and inclusive selection process for candidate genes for GWAS hits rather than the common practice of only focusing on the closest gene.

At a more detailed level, patterns of DNA methylation at specific CpGs between carriers and non-carriers of risk alleles were consistent with previous studies. Alleles at rs713586 explained 54 % of the variance in methylation at cg01884057, with an increase of almost 10 % methylation for each risk allele. The very same pattern was also found in adipose tissue in another study [29]. More interestingly, some patterns of DNA methylation between carriers and non-carriers of risk alleles was consistent with what is known about these genes and obesity. For instance, MIR148A is upregulated during normal adipogenesis but downregulated in obese adipocytes [73], and its expression is regulated by DNA methylation at its CpG island [74]. Consistently, carriers of the risk allele at rs1055144 had higher methylation levels in the promoter of MIR148A. Also, carriers of the risk allele at rs10838738 had lower methylation in the promoter of PTPMT1, a gene that codes for a mitochondrial phosphatase whose inhibition lowers glucose concentration [75] and a suggested drug target for treatment of type II diabetes [76]. Last but not least, three of the associated CpGs were located within two of the numerous promoters of BDNF, which encodes a neurotrophin that plays several roles in regulating energy homeostasis [77]. It is suggested that BDNF is finely regulated by DNA methylation and histone modifications [78, 79], and differential BDNF transcripts are expressed at different time points and in different cellular compartments [79]. Carriers of the risk allele at rs10767664 had higher methylation in the pII promoter of BDNF, and lower methylation in the pVI promoter of BDNF. However, the roles of specific BDNF promoters in obesity remain unexplored. Also, the SNPs may affect other genes linked to obesity: NR1H3, a member of the liver X receptors that regulate cholesterol catabolism [80] and expressed during adipose tissue remodeling [81]; PACSIN3, a kinase that induces glucose uptake by adipocytes [82]; LGR4, a G protein-coupled receptor whose ablation potentiates the white-to-brown fat transition [83]; POMC, a peptide that decreases food intake and increases energy expenditure [84]; CLN3 and ITH4, two proteins positively associated with obesity [85, 86]; and the developmental genes HOTAIR and HOXC11, responsible for differential fat accumulation between upper and lower body, and under epigenetic control [85, 87].

Our study aimed at unraveling the molecular effects of body mass-associated genetic variants on chromatin structure, with a special focus on DNA methylation. We benefited from a large sample size for the discovery analysis (n = 355) and from the use of a large battery of open access datasets to map associated CpGs to meaningful genomic annotations, such as promoters, predicted and in vivo transcribed enhancers, and long-range interactions. In addition, we tested the tissue-specificity of 65 of the initial associations in skin fibroblasts, two of the initial associations in four brain regions, and one in SAT and VAT. It is possible that some of the associations discovered in blood are limited to this tissue, or that unmeasured environmental factors such as smoking, diet, physical activity, and tissue-specific molecular factors impacted DNA methylation at the measured CpG sites and confounded our results. It should be noted, however, that we could not test all of the 107 initial associations in the replication samples, and the sample sizes of the replication samples were smaller than the discovery study group. Analysis of statistical power (probability of detecting a “true” effect when it exists) suggests that we have a high probability of replicating our results in the VAT and SAT replication samples, where power was 95 %. In contrast, power was only 23–25 % for cg11385473 and 42–47 % for cg05585544 for the brain replication samples and 39 % on average for the skin fibroblast replication samples, which implies that we are likely unable to replicate our results due to too small sample groups for these conditions. Besides, pan-tissue SNP–CpG associations are consistent with a genome-wide study where genotype-dependent methylation differences between blood and brain were associated, making genetic influence on DNA methylation in blood relevant for other tissues [26]. Finally, it should be kept in mind that the probes of the methylation array used in this study (Illumina 450 k) are enriched in CpG islands, gene promoters, and gene regions; it is thus possible that we missed important CpGs linked to obesity-associated SNPs.

In the paradigm of genetics–epigenetics–environment relationships, it is still unknown whether obesity-associated SNPs directly cause differential DNA methylation at genes and enhancers that contribute to the pathogenesis of obesity, or if the observed differential methylation levels are merely a consequence of a modified gene regulation caused by the presence of risk alleles at obesity-associated SNPs. In a recent review on the function and information content of DNA methylation, DNA methylation is thought to have both an active and passive role in gene regulation, and it seems to be highly contextual [88]. In particular, it has been proposed that mutations within regulatory regions affect binding of transcription factors, which in turn influence DNA methylation [88]. If DNA methylation does not necessarily actively impact on gene regulation, it is at least an informative marker of the underlying regulatory activity. Therefore, the differential methylation observed in carriers of risk alleles at obesity-associated SNPs in our study likely reflects allele-specific effects on gene regulatory mechanisms.

Conclusions

In this study we report strong associations between obesity-associated SNPs discovered in GWASs and methylation levels at proximal CpG sites. The methylation sites associated with alleles at obesity-associated SNPs were enriched in enhancers in PBMCs, and some of these sites were located in the promoters of genes, or were located in regions showing long-range interactions with established roles in appetite regulation as well as regulation of body mass. We also found indications that some of these genotype–methylation associations exist in different tissues. This study has implications for understanding how obesity-associated variants mediate their effects. Further studies are needed to unravel the mechanisms that govern the interplay between genetic variants and the activity of functional DNA elements.

Abbreviations

AN:: adipose nuclei
BMI:: body mass index
BrainAC:: brain anterior caudate
BrainAG:: brain angular gyrus
BrainCG:: brain cingulate gyrus
BrainHIPPO:: brain hippocampus
BrainITL:: brain inferior temporal lobe
BrainSN:: brain substantia nigra
CAGE:: cap-analysis of gene expression
ChIA-PET:: chromatin interaction analysis by paired-end tag sequencing
eQTL:: expression quantitative trait locus
GEO:: Gene Expression Omnibus
GWAS:: genome-wide association study
LD:: linkage disequilibrium
meQTL:: methylation quantitative trait locus
PBMC:: peripheral blood mononuclear primary cell
PCA:: principal component analysis
PCR:: polymerase chain reaction
PI:: pancreatic islet
QTL:: quantitative trait locus
SAT:: subcutaneous adipose tissue
SM:: skeletal muscle
SNP:: single nucleotide polymorphism
TSS:: transcription start site
UCSC:: University of California, Santa Cruz
VAT:: visceral adipose tissue

References

Willer CJ, Speliotes EK, Loos RJF, Li S, Lindgren CM, Heid IM, et al. Six new loci associated with body mass index highlight a neuronal influence on body weight regulation. Nat Genet. 2009;41:25–34.
Article PubMed Central CAS PubMed Google Scholar
Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, et al. Association analyses of 249,796 individuals reveal eighteen new loci associated with body mass index. Nat Genet. 2011;42:937–48.
Article Google Scholar
Okada Y, Kubo M, Ohmiya H, Takahashi A, Kumasaka N, Hosono N, et al. Common variants at CDKAL1 and KLF9 are associated with body mass index in east Asian populations. Nat Genet. 2012;44:302–6.
Article CAS PubMed Google Scholar
Thorleifsson G, Walters GB, Gudbjartsson DF, Steinthorsdottir V, Sulem P, Helgadottir A, et al. Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity. Nat Genet. 2009;41:18–24.
Article CAS PubMed Google Scholar
Liu JZ, Medland SE, Wright MJ, Henders AK, Heath AC, Madden PAF, et al. Genome-wide association study of height and body mass index in Australian twin families. Twin Res Hum Genet. 2010;13:179–93.
Article PubMed Central PubMed Google Scholar
Johansson A, Marroni F, Hayward C, Franklin CS, Kirichenko AV, Jonasson I, et al. Linkage and genome-wide association analysis of obesity-related phenotypes: association of weight with the MGAT1 gene. Obesity (Silver Spring). 2010;18:803–8.
Article CAS PubMed Google Scholar
Cotsapas C, Speliotes EK, Hatoum IJ, Greenawalt DM, Dobrin R, Lum PY, et al. Common body mass index-associated variants confer risk of extreme obesity. Hum Mol Genet. 2009;18:3502–7.
Article PubMed Central CAS PubMed Google Scholar
Wen W, Cho Y-S, Zheng W, Dorajoo R, Kato N, Qi L, et al. Meta-analysis identifies common variants associated with body mass index in east Asians. Nat Genet. 2012;44:307–11.
Article PubMed Central CAS PubMed Google Scholar
Ng MCY, Hester JM, Wing MR, Li J, Xu J, Hicks PJ, et al. Genome-wide association of BMI in African Americans. Obesity. 2012;20:622–7.
Article PubMed Central CAS PubMed Google Scholar
Field SF, Howson JMM, Walker NM, Dunger DB, Todd JA. Analysis of the obesity gene FTO in 14,803 type 1 diabetes cases and controls. Diabetologia. 2007;50:2218–20.
Article PubMed Central CAS PubMed Google Scholar
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206.
Article PubMed Central CAS PubMed Google Scholar
Loos RJF, Lindgren CM, Li S, Wheeler E, Zhao JH, Prokopenko I, et al. Common variants near MC4R are associated with fat mass, weight and risk of obesity. Nat Genet. 2008;40:768–75.
Article PubMed Central CAS PubMed Google Scholar
Jamshidi Y, Snieder H, Ge D, Spector TD, O’Dell SD. The SH2B gene is associated with serum leptin and body fat in normal female twins. Obesity (Silver Spring). 2007;15:5–9.
Article PubMed Central CAS PubMed Google Scholar
Liu XG, Tan LJ, Lei SF, Liu YJ, Shen H, Wang L, et al. Genome-wide association and replication studies identified TRHR as an important gene for lean body mass. Am J Hum Genet. 2009;84:418–23.
Article PubMed Central CAS PubMed Google Scholar
Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature. 2010;466:707–13.
Article PubMed Central CAS PubMed Google Scholar
Heard-Costa NL, Zillikens MC, Monda KL, Johansson A, Harris TB, Fu M, et al. NRXN3 is a novel locus for waist circumference: a genome-wide association study from the CHARGE Consortium. PLoS Genet. 2009;5, e1000539.
Article PubMed Central PubMed Google Scholar
Shungin D, Winkler TW, Croteau-Chonka DC, Ferreira T, Locke AE, Magi R, et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature. 2015;518:187–96.
Article PubMed Central CAS PubMed Google Scholar
Heid IIM, Jackson AUA, Randall J, Winkler T. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat Genet. 2010;42:950.
Article Google Scholar
Hinney A, Vogel CIG, Hebebrand J. From monogenic to polygenic obesity: Recent advances. Eur Child Adolesc Psychiatry. 2010;19:297–310.
Article PubMed Central PubMed Google Scholar
Smemo S, Tena JJ, Kim KH, Gamazon ER, Sakabe NJ, Gomez-Marin C, et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature. 2014;507:371–5.
Article PubMed Central CAS PubMed Google Scholar
Elliott P, Chambers JC, Zhang W, Clarke R, Hopewell JC, Peden JF, et al. Genetic Loci associated with C-reactive protein levels and risk of coronary heart disease. JAMA. 2009;302:37–48.
Article PubMed Central CAS PubMed Google Scholar
Mansego ML, Milagro FI, Zulet MA, Martinez JA. SH2B1 CpG-SNP is associated with body weight reduction in obese subjects following a dietary restriction program. Ann Nutr Metab. 2014;66:1–9.
Article PubMed Google Scholar
Dick KJ, Nelson CP, Tsaprouni L, Sandling JK, Aïssi D, Wahl S, et al. DNA methylation and body-mass index: a genome-wide analysis. Lancet. 2014;6736:1–9.
Google Scholar
Almén MS, Nilsson EK, Jacobsson JA, Kalnina I, Klovins J, Fredriksson R, et al. Genome-wide analysis reveals DNA methylation markers that vary with both age and obesity. Gene. 2014;548:61–7.
Article PubMed Google Scholar
Quon G, Lippert C, Heckerman D, Listgarten J. Patterns of methylation heritability in a genome-wide analysis of four brain regions. Nucleic Acids Res. 2013;41:2095–104.
Article PubMed Central CAS PubMed Google Scholar
Davies MN, Volta M, Pidsley R, Lunnon K, Dixit A, Lovestone S, et al. Functional annotation of the human brain methylome identifies tissue-specific epigenetic variation across brain and blood. Genome Biol. 2012;13:R43.
Article PubMed Central CAS PubMed Google Scholar
Gibbs JR, van der Brug MP, Hernandez DG, Traynor BJ, Nalls MA, Lai SL, et al. Abundant quantitative trait loci exist for DNA methylation and gene expression in Human Brain. PLoS Genet. 2010;6:29.
Article Google Scholar
Zhang D, Cheng L, Badner JA, Chen C, Chen Q, Luo W, et al. Genetic control of individual differences in gene-specific methylation in human brain. Am J Hum Genet. 2010;86:411–9.
Article PubMed Central CAS PubMed Google Scholar
Grundberg E, Meduri E, Sandling JK, Hedman AK, Keildson S, Buil A, et al. Global analysis of dna methylation variation in adipose tissue from twins reveals links to disease-associated variants in distal regulatory elements. Am J Hum Genet. 2013;93:876–90.
Article PubMed Central CAS PubMed Google Scholar
Drong AW, Nicholson G, Hedman AK, Meduri E, Grundberg E, Small KS, et al. The presence of methylation quantitative trait loci indicates a direct genetic influence on the level of DNA methylation in adipose tissue. PLoS One. 2013;8, e55923.
Article PubMed Central CAS PubMed Google Scholar
Smith AK, Kilaru V, Kocak M, Almli LM, Mercer KB, Ressler KJ, et al. Methylation quantitative trait loci (meQTLs) are consistently detected across ancestry, developmental stage, and tissue type. BMC Genomics. 2014;15:145.
Article PubMed Central PubMed Google Scholar
Van Eijk K, de Jong S, Boks M, Langeveld T, Colas F, Veldink J, et al. Genetic analysis of DNA methylation and gene expression levels in whole blood of healthy human subjects. BMC Genomics. 2012;13:636.
Article PubMed Central PubMed Google Scholar
Shi J, Marconett CN, Duan J, Hyland PL, Li P, Wang Z, et al. Characterizing the genetic basis of methylome diversity in histologically normal human lung tissue. Nat Commun. 2014;5:3365.
PubMed Central PubMed Google Scholar
Wagner JR, Busche S, Ge B, Kwan T, Pastinen T, Blanchette M. The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts. Genome Biol. 2014;15:R37.
Article PubMed Central PubMed Google Scholar
Gutierrez-Arcelus M, Lappalainen T, Montgomery SB, Buil A, Ongen H, Yurovsky A, et al. Passive and active DNA methylation and the interplay with genetic variation in gene regulation. Elife. 2013;2, e00523.
PubMed Central PubMed Google Scholar
Gertz J, Varley KE, Reddy TE, Bowling KM, Pauli F, Parker SL, et al. Analysis of DNA methylation in a three-generation family reveals widespread genetic influence on epigenetic regulation. PLoS Genet. 2011;7:e1002228.
Article PubMed Central CAS PubMed Google Scholar
Bell J, Pai A, Pickrell J, Gaffney D, Pique-Regi R, Degner J, et al. DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome Biol. 2011;12:R10.
Article PubMed Central CAS PubMed Google Scholar
Gamazon ER, Badner JA, Cheng L, Zhang C, Zhang D, Cox NJ, et al. Enrichment of cis-regulatory gene expression SNPs and methylation quantitative trait loci among bipolar disorder susceptibility variants. Mol Psychiatry. 2012;18:340–6.
Article PubMed Central PubMed Google Scholar
Cole TJ, Bellizzi MC, Flegal KM, Dietz WH. Establishing a standard definition for child overweight and obesity worldwide: international survey. BMJ. 2000;320:1240–3.
Article PubMed Central CAS PubMed Google Scholar
SNP&SEQ Technology Platform. http://molmed.medsci.uu.se/SNP+SEQ+Technology+Platform/.
Gene Expression Omnibus. http://www.ncbi.nlm.nih.gov/geo/.
The R Project for Statistical Computing. https://www.r-project.org/.
Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, et al. Minfi: A flexible and comprehensive Bioconductor package for the analysis of Infinium DNA Methylation microarrays. Bioinformatics. 2014;10:1363–9.
Article Google Scholar
Morris TJ, Butcher LM, Feber A, Teschendorff AE, Chakravarthy AR, Wojdacz TK, et al. ChAMP: 450 k chip analysis methylation pipeline. Bioinformatics. 2014;30:428–30.
Article PubMed Central CAS PubMed Google Scholar
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The SVA package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–3.
Article PubMed Central CAS PubMed Google Scholar
Van Iterson M, Tobi EW, Slieker RC, den Hollander W, Luijk R, Slagboom PE, et al. MethylAid: Visual and interactive quality control of large Illumina 450 k data sets. Bioinformatics. 2014;30:3435–7.
Article PubMed Google Scholar
Triche TJ, Weisenberger DJ, Van Den Berg D, Laird PW, Siegmund KD. Low-level processing of Illumina Infinium DNA Methylation BeadArrays. Nucleic Acids Res. 2013;41:e90.
Article PubMed Central CAS PubMed Google Scholar
Teschendorff AE, Marabita F, Lechner M, Bartlett T, Tegner J, Gomez-Cabrero D, et al. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013;29:189–96.
Article PubMed Central CAS PubMed Google Scholar
Lê S, Josse J, Husson F. FactoMineR: An R package for multivariate analysis. J Stat Softw. 2008;25:1–18.
Article Google Scholar
Chen YA, Lemire M, Choufani S, Butcher DT, Grafodatskaya D, Zanke BW, et al. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203–9.
Article PubMed Central CAS PubMed Google Scholar
Liu Y, Aryee MJ, Padyukov L, Fallin MD, Hesselberg E, Runarsson A, et al. Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. Nat Biotechnol. 2013;31:142–7.
Article PubMed Central CAS PubMed Google Scholar
Koestler DC, Marsit CJ, Christensen BC, Accomando W, Langevin SM, Houseman EA, et al. Peripheral blood immune cell methylation profiles are associated with nonhematopoietic cancers. Cancer Epidemiol Biomarkers Prev. 2012;21:1293–302.
Article PubMed Central CAS PubMed Google Scholar
Accomando WP, Wiencke JK, Houseman EA, Butler RA, Zheng S, Nelson HH, et al. Decreased NK cells in patients with head and neck cancer determined in archival DNA. Clin Cancer Res. 2012;18:6147–54.
Article PubMed Central CAS PubMed Google Scholar
Lewin J, Schmitt AO, Adorján P, Hildmann T, Piepenbrock C. Quantitative DNA methylation analysis based on four-dye trace data from direct sequencing of PCR amplificates. Bioinformatics. 2004;20:3005–12.
Article CAS PubMed Google Scholar
Heberlein A, Muschler M, Frieling H, Behr M, Eberlein C, Wilhelm J, et al. Epigenetic down regulation of nerve growth factor during alcohol withdrawal. Addict Biol. 2013;18:508–10.
Article CAS PubMed Google Scholar
Domschke K, Tidow N, Schrempf M, Schwarte K, Klauke B, Reif A, et al. Epigenetic signature of panic disorder: A role of glutamate decarboxylase 1 (GAD1) DNA hypomethylation? Prog Neuropsychopharmacol Biol Psychiatry. 2013;46:189–96.
Article CAS PubMed Google Scholar
Schleinitz D, Klöting N, Lindgren CM, Breitfeld J, Dietrich A, Schön MR, et al. Fat depot-specific mRNA expression of novel loci associated with waist-hip ratio. Int J Obes (Lond). 2014;1:120–5.
Article Google Scholar
Gibbs JR, van der Brug MP, Hernandez DG, Traynor BJ, Nalls MA, Lai SL, et al. Abundant quantitative trait Loci exist for DNA methylation and gene expression in human brain. PLoS Genet. 2010;6:e1000952.
Article PubMed Central PubMed Google Scholar
Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O’Donnell CJ, De Bakker PIW. SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008;24:2938–9.
Article PubMed Central CAS PubMed Google Scholar
Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods. 2012;9:215–6.
Article PubMed Central CAS PubMed Google Scholar
Andersson R, Gebhard C, Miguel-Escalada I, Hoof I, Bornholdt J, Boyd M, et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014;507:455–61.
Article CAS PubMed Google Scholar
Forrest ARR, Kawaji H, Rehli M, Baillie JK, de Hoon MJL, Lassmann T, et al. A promoter-level mammalian expression atlas. Nature. 2014;507:462–70.
Article CAS PubMed Google Scholar
Hillmer AM, Yao F, Inaki K, Lee WH, Ariyaratne PN, Teo ASM, et al. Comprehensive long-span paired-end-tag mapping reveals characteristic patterns of structural variations in epithelial cancer genomes. Genome Res. 2011;21:665–75.
Article PubMed Central CAS PubMed Google Scholar
Consortium TG. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45:580–5.
Article Google Scholar
Veyrieras JB, Kudaravalli S, Kim SY, Dermitzakis ET, Gilad Y, Stephens M, et al. High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet. 2008;4:e1000214.
Article PubMed Central PubMed Google Scholar
Westra HJ, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. 2013;45:1238–43.
Article PubMed Central CAS PubMed Google Scholar
Du P, Zhang X, Huang C-C, Jafari N, Kibbe WA, Hou L, et al. Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis. BMC Bioinformatics. 2010;11:587.
Article PubMed Central CAS PubMed Google Scholar
Zeileis A, Hothorn T. Diagnostic checking in regression relationships. R news. 2002;2:7–10.
Google Scholar
storey j: qvalue: q-value estimation for false discovery rate control. 2015:r package version 2.0.0.https://www.bioconductor.org/packages/release/bioc/html/qvalue.html.
Ong C-T, Corces VG. CTCF: an architectural protein bridging genome topology and function. Nat Rev Genet. 2014;15:234–46.
Article CAS PubMed Google Scholar
Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, Zhu J, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–8.
Article CAS PubMed Google Scholar
Zhong H, Yang X, Kaplan LM, Molony C, Schadt EE. Integrating pathway analysis and genetics of gene expression for genome-wide association studies. Am J Hum Genet. 2010;86:581–91.
Article PubMed Central CAS PubMed Google Scholar
Xie H, Lim B, Lodish HF. MicroRNAs induced during adipogenesis that accelerate fat cell development are downregulated in obesity. Diabetes. 2009;58:1050–7.
Article PubMed Central CAS PubMed Google Scholar
Long XR, He Y, Huang C, Li J. MicroRNA-148a is silenced by hypermethylation and interacts with DNA methyltransferase 1 in hepatocellular carcinogenesis. Int J Oncol. 2014;45:1915–22.
Google Scholar
Nath AK, Ryu JH, Jin YN, Roberts LD, Dejam A, Gerszten RE, et al. PTPMT1 inhibition lowers glucose through succinate dehydrogenase phosphorylation. Cell Rep. 2015;10:694–701.
Article CAS Google Scholar
Boisclair Y, Tremblay ML. Firing up mitochondrial activities with PTPMT1. Mol Cell. 2005;19:291–2.
Article CAS PubMed Google Scholar
Marosi K, Mattson MP. BDNF mediates adaptive brain and body responses to energetic challenges. Trends Endocrinol Metab. 2014;25:89–98.
Article PubMed Central CAS PubMed Google Scholar
Martínez-Levy G, Cruz-Fuentes CS. Genetic and epigenetic regulation of the brain-derived neurotrophic factor in the central nervous system. Yale J Biol Med. 2014;87:173–86.
PubMed Central PubMed Google Scholar
Karpova NN. Role of BDNF epigenetics in activity-dependent neuronal plasticity. Neuropharmacology. 2014;76:709–18.
Article CAS PubMed Google Scholar
Cruz-Garcia L, Schlegel A. Lxr-driven enterocyte lipid droplet formation delays transport of ingested lipids. J Lipid Res. 2014;55:1944–58.
Article CAS PubMed Google Scholar
Laurencikiene J, Rydén M. Liver X receptors and fat cell metabolism. Int J Obes (Lond). 2012;36:1494–502.
Article CAS Google Scholar
Roach W, Plomann M. PACSIN3 overexpression increases adipocyte glucose transport through GLUT1. Biochem Biophys Res Commun. 2007;355:745–50.
Article PubMed Central CAS PubMed Google Scholar
Wang J, Liu R, Wang F, Hong J, Li X, Chen M, et al. Ablation of LGR4 promotes energy expenditure by driving white-to-brown fat switch. Nat Cell Biol. 2013;15:1455–63.
Article CAS PubMed Google Scholar
Kim JD, Leyva S, Diano S. Hormonal regulation of the hypothalamic melanocortin system. Front Physiol. 2014;5:480.
PubMed Central PubMed Google Scholar
Pinnick KE, Nicholson G, Manolopoulos KN, McQuaid SE, Valet P, Frayn KN, et al. Distinct developmental profile of lower-body adipose tissue defines resistance against obesity-associated metabolic complications. Diabetes. 2014;11:3785–97.
Article Google Scholar
Choi J-W, Liu H, Choi DK, Oh TS, Mukherjee R, Yun JW. Profiling of gender-specific rat plasma proteins associated with susceptibility or resistance to diet-induced obesity. J Proteomics. 2012;4:1386–400.
Article Google Scholar
Karpe F, Pinnick KE. Biology of upper-body and lower-body adipose tissue[mdash]link to whole-body phenotypes. Nat Rev Endocrinol. 2015;11:90–100.
Article CAS PubMed Google Scholar
Schubeler D. Function and information content of DNA methylation. Nature. 2015;517:321–6.
Article CAS PubMed Google Scholar
ENCODE target transcription factors and their corresponding antibodies.http://genome.ucsc.edu/ENCODE/antibodies.html.
http://genome.ucsc.edu/ENCODE/antibodies.html.

Download references

Acknowledgements

The methylation array was performed at the Genotyping Technology Platform, (http://www.genotyping.se), the sequencing was performed at the Uppsala Genome Center (http://www.igp.uu.se/facilities/genome_center), Uppsala, Sweden with support from Uppsala University and the Knut and Alice Wallenberg foundation and at the Uppsala Genome Centre. The studies were supported by the Swedish Research Council (VR, medicine) and the Novo Nordisk Foundation. M.R.A. is supported by the Swedish Brain Research foundation, the Fredrik O. Ingrid Thuring foundation and the Lars Hierta Memorial Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. This project was supported by grants from the Boehringer Ingelheim Foundation to P.K., from the IFB AdiposityDiseases (ADI-K50D; ADI-K7-39 and ADI-K7-45 to Y.B. and ADI-K60E to P.K.). IFB AdiposityDiseases is supported by the Federal Ministry of Education and Research (BMBF), Germany, FKZ: 01EO1001. This work was further supported by grants from the DDG and DDS to Y.B. and from the Collaborative Research Center funded by the German Research Foundation (CRC 1052; B01 and B03 to M.B. and P.K., respectively) and by the Kompetenznetz Adipositas (Competence network for Obesity) funded by the German Federal Ministry of Education and Research (German Obesity Biomaterial Bank; FKZ 01GI1128).

Author information

Authors and Affiliations

Department of Neuroscience, Functional Pharmacology, Uppsala University, Uppsala, Sweden
Sarah Voisin, Markus Sällman Almén, Galina Y. Zheleznyakova, Lina Lundberg, Sanaz Zarei, Sandra Castillo, Fia Ence Eriksson, Emil K. Nilsson, Mathias Rask-Andersen & Helgi B. Schiöth
Department of Medical Biochemistry and Microbiology, Uppsala University, SE-751 23, Uppsala, Sweden
Markus Sällman Almén
Medical Faculty, IFB Adiposity Diseases, University of Leipzig, Liebigstrasse 21, 04103, Leipzig, Germany
Matthias Blüher, Yvonne Böttcher & Peter Kovacs
Latvian Biomedical Research and Study Center, Ratsupites 1, Riga, LV-1067, Latvia
Janis Klovins

Authors

Sarah Voisin
View author publications
You can also search for this author in PubMed Google Scholar
Markus Sällman Almén
View author publications
You can also search for this author in PubMed Google Scholar
Galina Y. Zheleznyakova
View author publications
You can also search for this author in PubMed Google Scholar
Lina Lundberg
View author publications
You can also search for this author in PubMed Google Scholar
Sanaz Zarei
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Castillo
View author publications
You can also search for this author in PubMed Google Scholar
Fia Ence Eriksson
View author publications
You can also search for this author in PubMed Google Scholar
Emil K. Nilsson
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Blüher
View author publications
You can also search for this author in PubMed Google Scholar
Yvonne Böttcher
View author publications
You can also search for this author in PubMed Google Scholar
Peter Kovacs
View author publications
You can also search for this author in PubMed Google Scholar
Janis Klovins
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Rask-Andersen
View author publications
You can also search for this author in PubMed Google Scholar
Helgi B. Schiöth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sarah Voisin.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

SV participated in the design of the study, performed the statistical analysis and drafted the manuscript. MSA participated in the design of the study and helped to draft the manuscript. GYZ carried out the confirmative bisulfite sequencing analysis in blood. LL, SZ, SC, and FE recruited participants and collected blood samples. EN and JK carried out DNA preparation and contributed to genotyping. MRA participated in the design of the study. PK, MB, and YB carried out the genotyping and pyrosequencing in adipose tissue. HBS conceived the study, participated in its design and coordination, and helped to draft the manuscript. All authors read and approved the final manuscript.

Additional files

The following additional data are available with the online version of this paper. Additional file 1 is a is a table describing the 52 investigated SNPs. Additional file 2 is a heatmap showing the top CpG sites associated with blood cell type surrogates (principal components), evaluated in purified human leukocyte subtype methylation data sets. Additional file 3 is a table listing the cell lines and target transcription factors of the ChIA-PET libraries. Additional file 4 is a table describing the 107 associated CpGs found in blood and their annotation. Additional file 5 is a PCA showing the two discovery study samples on the first three principal components, using only the most variable autosomal CpG sites. Additional file 6 is a table showing the chromatin state at the genomic position of the 107 associated CpG, in 11 tissues. Additional file 7 is a figure showing the genomic context of the CpGs associated with the 28 significant SNPs. Additional file 8 is a table showing the eQTLs found in four eQTL databases for each of the 28 significant SNPs. Additional file 9 is a figure showing the correlations between Illumina 450 K and pyrosequence analysis of cg15576492. Additional file 10 is a table summarizing the replication of the 107 SNP-CpG associations found in blood.

Additional file 1:

Description of the 52 investigated SNPs. SNPs in bold are SNPs for which significant associations with DNA methylation were found. *Number in parenthesis = number of individuals with missing genotypes. (DOCX 108 kb)

Additional file 2:

Top CpG sites associated with blood cell type surrogates (principal components), evaluated in purified human leukocyte subtype methylation data sets. (TIFF 26367 kb)

Additional file 3:

Cell lines and target transcription factors of the ChIA-PET libraries. ^#Cell types are described in ENCODE project [89]. *Target transcription factors and their corresponding antibodies are described in the ENCODE project [90]. (DOCX 11 kb)

Additional file 4:

Description of the 107 associated CpGs found in blood and their annotation. *Coefficient of the linear model associated with the obesity-associated SNP: positive for increased methylation with presence of risk allele; coefficients are calculated using M values. (DOCX 38 kb)

Additional file 5:

Principal component analysis of the two study groups ( n = 355) on the first three principal components, using only the most variable autosomal CpG sites. Red dots are individuals from study sub-group 1 (n = 130), black dots are individuals from study sub-group 2 (n = 225). (TIFF 26367 kb)

Additional file 6:

Chromatin states at the genomic position of the 107 CpGs, in the 11 investigated tissues. AN adipose nuclei, BrainAC brain anterior caudate, BrainAG brain angular gyrus, BrainCG brain cingulate gyrus, BrainHIPPO brain hippocampus, BrainITL brain inferior temporal lobe, BrainSN brain substantia nigra, PBMC peripheral blood mononuclear primary cells, PI pancreatic islets, SM skeletal muscle. (DOCX 36 kb)

Additional file 7:

Genomic context of the CpGs associated with the significant SNPs. Each plot corresponds to a SNP for which associations with DNA methylation were found (28 plots in total). Genomic positions of RefSeq genes and the obesity-related SNP are displayed in the top panel. Within the two vertical red dotted lines, the linkage disequilibrium r² > 0.8. The positions of the tested CpGs are displayed. Long-range interactions as defined by ChIA-PET libraries from five cell lines using chromatin immunoprecipitation with antibodies targeting three transcription factors (Additional file 4) are displayed as arcs. For clarity of visualization, we chose to display only the long-range interactions of genomic regions containing associated CpGs. Two interacting genomic regions are represented by an arc that links them, and the thickness of the arc line is proportional to the strength of this interaction. The color of the arc corresponds to the target transcription factor and the shade of the color corresponds to the cell line: red for RNA polymerase II, blue for ERα, and green for CTCF. In the bottom panel, chromatin states in 11 tissues are displayed. Chromatin states were obtained using chromHMM prediction using data on seven histone marks (see “Methods”). The color of each band corresponds to a particular state. AN adipose nuclei, BrainAC brain anterior caudate, BrainAG brain angular gyrus, BrainCG brain cingulate gyrus, BrainHIPPO brain hippocampus, BrainITL brain inferior temporal lobe, BrainSN brain substantia nigra, PBMC peripheral blood mononuclear primary cells, PI pancreatic islets, SM skeletal muscle. (TIFF 9613 kb)

Additional file 8:

eQTLs found in four eQTL databases for each of the significant 28 SNPs. ¹Investigated tissues: subcutaneous adipose tissue, aorta artery, tibial artery, esophagus mucosa, esophagus muscularis, heart left ventricle, lung, skeletal muscle, tibial nerve, sun-exposed skin, lower leg, stomach, thyroid, whole blood. ²Investigated tissues/cell lines: lymphoblastoid cell line (LCL), liver, monocytes, fibroblasts, T cells, brain cortex. ³Investigated tissues/cell lines: lymphoblastoid cell line (LCL), liver, brain cerebellum, brain frontal cortex, brain temporal cortex, brain pons. (DOCX 25 kb)

Additional file 9:

Correlations between Illumina 450 K and pyrosequence analysis of cg15576492. Methylation at cg15576492, as determined by the Illumina 450 k Chip and expressed as β value, is plotted against methylation at cg15576492, as determined by pyrosequencing and expressed as β value (n = 17). (TIFF 12920 kb)

Additional file 10:

Replication of the 107 SNP-CpG associations found in blood. *Coefficient of the linear model associated with the obesity-associated SNP: positive for increased methylation with presence of risk allele; coefficients are calculated using M values. Coefficients with a hash symbol correspond to associations with raw p value < 0.05 and coefficients in bold correspond to associations with q value < 0.05. (DOCX 31 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Voisin, S., Almén, M.S., Zheleznyakova, G.Y. et al. Many obesity-associated SNPs strongly associate with DNA methylation changes at proximal promoters and enhancers. Genome Med 7, 103 (2015). https://doi.org/10.1186/s13073-015-0225-4

Download citation

Received: 22 July 2015
Accepted: 21 September 2015
Published: 08 October 2015
DOI: https://doi.org/10.1186/s13073-015-0225-4

Many obesity-associated SNPs strongly associate with DNA methylation changes at proximal promoters and enhancers

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Discovery study group

Ethics, consent and permissions

Genotyping

DNA methylation profiling

DNA methylation processing

Background correction and adjustment of type I and type II probes

Removal of batch effects

Principal component analysis

Sample exclusion

Probe exclusion

Choice of investigated CpGs

Cell-type proportions

Validation of methylation with bisulfite sequencing

Replication study groups

VAT and SAT

Fibroblasts

Brain regions (cerebellum, frontal cortex, caudal pons, and temporal cortex)

Annotation

Genes

Linkage disequilibrium

Chromatin states

Ubiquitous, tissue-specific, and cell-specific in vivo transcribed enhancers

Long-range interactions

Expression QTLs

Statistics

Linear model

Enrichment of associated CpGs in genomic regions, in vivo transcribed enhancers, and chromatin states

Number of long-range interactions

Power calculations

Results

Obesity-associated SNPs associate with methylation at proximal CpGs in whole blood samples from healthy individuals

Genomic context of CpGs associated with obesity-associated SNPs

CpGs associated with obesity-associated SNPs are depleted in promoters and enriched in intergenic regions

CpGs associated with obesity-associated SNPs are enriched in enhancers in PBMCs

Only one CpG associated with obesity-associated SNPs is located in in vivo transcribed enhancers

CpGs associated with obesity-associated SNPs show long-range interactions with promoters and other genomic regions

Associated CpGs are located in or show long-range interactions with the promoters of genes for which the corresponding SNPs are known eQTLs

Genome-scale measurements are validated by bisulfite sequencing

SNP–CpG associations might not be blood-specific

Four of the initial SNP–CpG associations in blood are replicated in skin fibroblasts

The single SNP–methylation association tested in SAT and VAT was not significant

The two SNP–methylation associations tested in cerebellum, frontal cortex, caudal pons, and temporal cortex were not significant

Discussion

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Genome Medicine

Contact us