An integrative systems genetics approach reveals potential causal genes and pathways related to obesity

Kogelman, Lisette J. A.; Zhernakova, Daria V.; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N.

doi:10.1186/s13073-015-0229-0

Research
Open access
Published: 20 October 2015

An integrative systems genetics approach reveals potential causal genes and pathways related to obesity

Lisette J. A. Kogelman¹,
Daria V. Zhernakova²,
Harm-Jan Westra^3,4,5,
Susanna Cirera¹,
Merete Fredholm¹,
Lude Franke² &
…
Haja N. Kadarmideen¹

Genome Medicine volume 7, Article number: 105 (2015) Cite this article

7889 Accesses
31 Citations
3 Altmetric
Metrics details

Abstract

Background

Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach.

Methods

Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data.

Results

Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B.

Conclusions

To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.

Background

Obesity is characterized by an excessive amount of body adipose tissue. Because adipose tissue has many endocrine functions, obesity is a very complex condition and is associated with several severe diseases, such as type 2 diabetes, metabolic syndrome, and several types of cancer. The prevalence of obesity is exponentially rising world-wide [1] and its enormous consequences for the quality of life and life expectancy have led to the need for a better understanding of the molecular pathology involved. To date, genome-wide association studies (GWAS) have identified many different loci associated with obesity and obesity-related phenotypes [2, 3]. However, they explain a limited amount of the 40–70 % predicted genetic variation [2] and provide limited insight into the biological pathways and molecular mechanisms involved.

Transcriptomic analysis can further elucidate the molecular mechanisms, because gene expression provides a link between genetic variations and their corresponding phenotypic alterations [4]. Commonly, transcriptomic data are used to gain biological insight by detecting differences in expression levels [5] between two conditions (i.e., healthy and diseased). Many studies have applied differential expression (DE) analysis to obesity using different tissues, and have detected many biologically relevant genes [6–10]. However, all these studies have used a microarray or quantitative polymerase chain reaction (qPCR) platform. High-throughput RNA sequencing (RNA-Seq) technology has demonstrated advantages beyond microarray technology [11]: the transcriptome is measured more fully and RNA-Seq facilitates the discovery of novel genes. In DE analysis, RNA-Seq data outperform microarray data in terms of the accuracy of measuring gene expression levels and, therefore, RNA-Seq can potentially detect more differentially expressed (DE) genes [12].

Systems genetics approaches via the integration, joint modeling, and analyses of various high-throughput -omics data that represent different levels of biological organization are becoming more popular in genetic studies [13–15]. The integration of genomic and transcriptomic data can be achieved using expression quantitative trait loci (eQTL) studies, whereby genetic variants that underpin differences in expression levels are mapped [14, 16]. It has been shown that eQTLs are highly heritable [17, 18] and have the potential to provide more biological insight into GWAS findings [16]. Several studies have investigated obesity and obesity-related diseases using an eQTL approach, but mainly using microarray expression data [19–21]. Moreover, integration of these data with those from other publically available databases, such as those containing protein–protein interactions (PPI), might also provide further insight into the biological mechanisms behind complex diseases [22, 23].

Here, we present the analysis of RNA-Seq data from subcutaneous adipose tissue from a porcine model specifically created to study obesity. The pig has similar metabolic, physiological, and genetic features to humans, with greater similarity than rodents have to humans, and has shown great potential as a medical model [24, 25]. Extensive discussions on why pigs are a better model are given in our paper describing the pig resource population [26]. We previously showed that the F2 population created for obesity studies demonstrates a high heritability for obesity-related phenotypes [26] and we have detected several obesity-related genes and pathways using network approaches on the genotype [27] and RNA-Seq data [28]. Furthermore, we previously created the Obesity Index (OI), an aggregate additive genetic value for obesity, by combining nine different obesity-related phenotypes [27]. Based on a selective expression profiling design, we selected 36 animals in three groups (lean, intermediate, and obese) for RNA-Seq, and showed that those animals have different metabolic features [28]. In this study, we performed an integrative systems genetics approach to identify causal genes and associated pathways for obesity. This was achieved by integrating information on genetic variants from GWAS (based on high-throughput genotype data) with information from co-expression networks (based on RNA-Seq data) and differential gene expression (based on RNA-Seq data), using an eQTL mapping approach [29, 30]. We also integrated biologically interesting co-expression modules with the known PPI networks to identify key transcriptional and other protein-coding factors underlying the causation of obesity. Thus, the integration of multi-omic biological datasets led to the detection of several obesity-related genes and molecular pathways.

Methods

The pig population

The animal resource used in this study was established using breeds that diverged with respect to obesity traits in the parental generation, resulting in a F2 population that was highly divergent with respect to obesity and obesity-related traits. All F2 animals were fed and housed equally. Animal care and maintenance have been conducted according to the Danish “Animal Maintenance Act” (Act 432 dated 09/06/2004) and biological samples were collected according to the Danish “Veterinary Procedures Act” (Act 433 dated 09/06/2004). For a detailed description of the pedigree and the phenotypes, see [26]. In this study, we only used the animals from the Duroc × Göttingen minipig intercross, which has been previously described [28]. In short, 279 F2 animals resulting from an intercross between the Gӧttingen minipig (predisposed for obesity) and Duroc production pig (selected for lean meat content) were intensively phenotyped (e.g., weight, conformation traits, dual-energy X-ray absorptiometry (DXA) scanning, and slaughter characteristics) and genotyped using the Illumina 60 K porcine SNP chip. We previously created and described the OI [27]—a response variable that is an aggregate genotypic value representing the degree of obesity based on the selection index theory [31]. The OI follows a normal distribution; therefore, animals for RNA-Seq of the subcutaneous adipose tissue were selected using a selective gene expression profiling approach [32, 33]. This approach was carefully chosen to ensure sufficient power to detect eQTLs with a relatively small number of animals. In total, we selected 36 animals across three groups: 12 lean, 12 intermediate, and 12 obese animals (Table 1). The family structure (maximizing the number of full siblings to three) and sex distribution was taken into account in the selection of the animals.

Table 1 Phenotypic characteristics of selected animals

Full size table

RNA sequencing

RNA-Seq was performed as previously described [28]. In short, total RNA was isolated from porcine subcutaneous adipose tissue using the RNeasy Lipid Tissue Mini kit (Qiagen, Hilden, Germany) following the manufacturer’s recommendations. The RNA quantity and quality were assessed by a Nanodrop ND-1000 spectrophotometer and the integrity of the isolated RNA was visually inspected by gel electrophoresis and by measuring the RNA quality indicator value on an Experion system (BioRad, Hemel Heampstead, UK) using the Eukaryote Total RNA StdSens kit (BioRad). Libraries were subsequently constructed using 400 ng total RNA and a TruSeq RNA Sample Prep (Illumina, San Diego, CA, USA) with Poly-A pull down rRNA depletion, following the manufacturer’s recommendations. Samples were sequenced on the HiSeq2500 platform, by dividing the 36 samples over four lanes and using 100 bp paired-end reads. Before alignment, the quality of the reads was checked and the adapters were detected using FastQC. The reads were mapped to the genome assembly SScrofa10.2.72 in STAR aligner using default parameters [34], whereby detected adapters were removed. On average, 20,390 protein-coding genes were detected among the mapped reads. Read counts were estimated using HTSeq [35]. Because transcripts with extremely low expression levels are less reliable [36], transcripts with expression levels equal to or fewer than five counts were removed from the dataset, resulting in 12,253 transcripts per sample. The between-sample bias was removed by estimating the library size factor using the estimateSizeFactor() function in DESeq [37]. Normalization was then performed using the voom() variance-stabilization function in the R-package Limma [38], and samples were corrected for sex and transformed to log₂-counts per million to approach normality.

Association of gene expression with degree of obesity

DE genes were detected using a linear model in the R-package Limma [38], taking into account OI as a continuous variable and sex as a factor. First, a linear model was fitted for each gene using the function lmFit() in Limma:

$$ {y}_{ij}={\beta}_{j,OI}O{I}_i+{\beta}_{j, sex}se{x}_i+{\varepsilon}_{ij} $$

where y _ij is the measured expression level of gene j for individual i, β _j,OI is the estimated regression coefficient from regressing the gene expression value of the i ^th individual in its Obesity Index (OI _i) for the j ^th gene, β _j,sex is the estimated regression coefficient of sex, and ε _ij is the error component. Genes were called as DE when the β _j,OIwas significantly different from 0. Moderated t-statistics (ratio of the log2-fold change to its standard error) and log-odds of differential expression by empirical Bayes moderation of the standard errors towards a common value were then calculated using the function eBayes() in Limma. While moderated t-test statistics account for differences in the variance of gene expression of a gene across replicates, the log-odds represents the odds of a gene being DE with an OI different from 0. The resulting P-values obtained from the moderated t-statistic were corrected for multiple testing using the Benjamini-Hochberg procedure [39]. Genes were identified as DE when the adjusted P-value false discovery rate [FDR]) was below 0.05. The corresponding gene symbols were reported using BioMart [40].

eQTL mapping

The integration of the SNP genotype and RNA-Seq data was obtained using an eQTL study approach [13, 29], following the method as described in Westra et al. [30]. The raw expression data obtained by RNA-Seq were quantile-normalized, log₂-normalized, and centered around zero on a gene level. Finally, a z-transformation was performed on the sample level. Genes were excluded when their expression was below five read counts for any of the samples, because lowly expressed genes potentially introduce a bias. Furthermore, previous studies have shown that the first expression principal components capture experimental variation such as technical and batch effects [41]. Therefore, several eQTL studies were performed on the complete dataset with different principal components removed. In total, four principal components were removed, because this resulted in the highest number of detected eQTLs. The SNPs were included in the genotype data when they had a call rate above 0.95, a minor allele frequency (MAF) above 0.05, and were in Hardy–Weinberg equilibrium (P > 1E⁻⁴). The SNPs were mapped onto the genome using genome assembly SScrofa10.2.74.

Both cis- and trans-eQTL mapping were performed: eQTLs were considered to be cis-acting when the distance between the gene and SNP was less than 1 Mb, and trans-acting when the distance was greater than 1 Mb or when the eQTL was located on another chromosome. The relatively large flanking distance of 1 Mb was chosen because haplotype blocks in pigs are larger than in humans and are even larger in F2 populations. Other eQTL pig studies have adapted an even larger flanking distance (e.g., 10 Mb) for assigning cis-eQTLs [19, 42]. To correct for multiple testing, we created a null distribution of P-values by permuting expression phenotypes relative to genotypes ten times, and then compared the real eQTL P-value distribution to the null distribution. eQTLs were only considered to be significant if the FDR was below 0.05. If several eQTLs were detected per gene, the strongest effect for each gene was presented.

The eQTL mapping was first performed on all SNPs and genes that exceeded the quality control thresholds (52,004 SNPs and 12,253 genes). Furthermore, the analyses were restricted to genes that were DE in this study (458 genes) and to SNPs that were significantly associated with the OI, according to our previously published analysis (366 SNPs) [27].

The detected eQTLs were further investigated according to their physical location in the genome and their effects. The distance from the expression SNP (eSNP) to the affected gene was calculated as the distance from the location of the SNP to the location of the transcription start site of the affected gene. The location and the effect of the eSNPs were assigned using the Variant Effect Predictor [43] with the SScrofa10.2.74 assembly, where the results were restricted to the most severe annotation of the SNP variant.

Integration of eQTLs with co-expression network analysis

We previously conducted a weighted gene co-expression network analysis (WGCNA) on the same RNA-Seq data, and recently published the method and results [28]. To integrate the results from the previous study with those of this study, we evaluated how many eQTLs in the WGCNA modules (clusters of highly interconnected genes) could be detected. We focused solely on the modules that were previously detected as potential biologically interesting: the Blue Module, the Brown Module, the Light-yellow Module, the Black Module, and the Green-yellow Module [28]. Because we are interested in potential causal genes within these modules, we extracted only those genes from the modules that were identified as cis-eQTLs.

Supervised gene co-expression network analysis and the integration of protein–protein interactions

All detected genes in the eQTL mapping and DE analysis were used for supervised WGCNA (sWGCNA). The network was constructed using the framework of Langfelder and Horvath [44], which is also described in Kogelman et al. [27]. Briefly, the adjacency matrix was created by calculating the Pearson’s correlations between the selected genes, and was raised by a power β to reach a scale-free topology index (R ²) of at least 0.90. The topological overlap measure (TOM), which assess the degree of shared neighbors between pairs of genes, was calculated based on the adjacency matrix and was used as input for the gene dendrogram (1-TOM). Following this step, modules were detected and assigned to a color as branches of the gene dendrogram using the DynamicTreeCut algorithm [45], using a minimum module size of 25 genes per module. The module eigengene, the first principal component of each module, represents the module’s expression and was used to detect biologically relevant modules. The Module–Trait Relationship (MTR) was calculated as the correlation between the module eigengene and traits of interest, and modules with a significant correlation >0.5 were selected for functional annotation. Genes in the module were retained when their intra-modular connectivity was >0.6, and when their intra-modular connectivity with other modules was >0.6. The module hub gene was detected as the gene in the module with the highest connectivity, or based on a high intra-modular connectivity (>0.8) and high correlation between the individual gene and the trait of interest (>0.6).

Potentially relevant biological modules were visualized in Cytoscape [46] and were integrated with the known PPI from IntAct [47], where corresponding Uniprot accession IDs were extracted using BioMart [40]. The networks of the selected module and PPI were merged and network analysis within Cytoscape was performed to gain insight into the network topology. Subsequently, the community clustering algorithm GLay [48] (in the ClusterMaker app) was used to detect clusters in the merged co-expression and PPI network.

Functional annotation

Over-represented Gene Ontology (GO) terms and KEGG pathways among the DE genes were detected using the software GoSeq [49], because GoSeq corrects for length bias in RNA-Seq data. First, a probability weighting function for all genes was calculated, based on a given set of biased data for gene length with the function nullp() in GoSeq. Second, a selection-unbiased testing was performed for GO or KEGG enrichment amongst the DE genes. The P-values were corrected for multiple testing using the Benjamini-Hochberg method and GO terms and KEGG pathways were detected as being significantly over-represented using a P-value of 0.05. The over-represented GO terms and KEGG pathways were separately detected for all DE genes, including the upregulated and downregulated DE genes.

Functional annotation analysis of the detected eQTLs was performed using GeneNetwork (http://www.genenetwork.nl), which detects over-represented GO terms, KEGG pathways, and associated phenotypes and tissues [50]. GeneNetwork is based on expression datasets from humans, mice, and rats, and predicts functions of genes against known pathways in various biological databases. The over-representation of GO terms and pathways was tested using the Mann–Whitney U test, and P-values were corrected for multiple testing using the Bonferroni correction.

The complete workflow used in this study is presented in Fig. 1.

Availability of supporting data

The RNA-Seq expression data in this publication have been deposited in NCBI’s Gene Expression Omnibus (GEO) and are available through accession number [GEO:GSE61271].

Results and discussion

Association of gene expression with degree of obesity

The detection of DE genes can lead to a better understanding of genetic and biological differences between two different conditions and to the detection of predictive biomarkers [51, 52]. In this study, we have used the level of obesity as a continuous variable, whereby we correct for the effect of sex. In total, we found 458 DE genes (FDR < 0.05), with a β _j,OI ranging from −0.42 to 0.48. All DE results are presented in Additional file 1.

The heatmap of the top 100 DE genes (Fig. 2) shows that mostly those from the lean and obese animals cluster together within each group. However, those from intermediate animals cluster with both the obese and lean groups. This is possibly due to the OI values of the intermediate group being borderline between the two groups. The genes partly clustered based on the direction of regulation (up or down). The top 10 DE genes include TAS1R3, CSGALNACT1, MAML3, ROM1, LRRC16B, EML5, RPS12 (all downregulated in obese animals), and ADAM9 (upregulated in obese animals). We discuss here only selected genes from this group. The TAS1R3 gene encodes a taste receptor, which has been associated with sweetness responsiveness in mice [53]. Taste receptors influence the perception of food [54], and therefore affect eating behavior [55], which might be associated with obesity [56, 57]. The CSGALNACT1 (chondroitin sulfate N-acetylgalactosaminyltransferase 1) gene plays a role in the process of glucuronidation: the addition of glucuronic acid to a substrate. This gene has been associated with Bell’s palsy (dysfunctioning of the facial nerve, resulting in facial paralysis) and Morquio Syndrome b (a metabolic disease, characterized by the inability to breakdown glycosaminoglycans). However, this gene also has an important function in the development of osteoarthritis [58], a bone remodeling disease that has been associated with obesity. The 40S ribosomal protein S12 is encoded by RPS12, which is associated with diabetic nephropathy in African Americans [59]. The final gene, ADAM9, encodes a transmembrane glycoprotein that functions in integrin binding and SH3 domain binding. The ADAM9 gene is involved in the formation of multinuclear cells and, consequently, with osteoclast fusion [60]; its downregulation might result in reduced bone mass. Bone mineral density is closely related to obesity: it has been suggested that the increased body load on joints resulting from obesity might stimulate the formation of bone mass, even though data from studies remain controversial [61]. However, body adipose tissue is also closely associated with the immune system and bone remodeling (decreased bone mineral density), by the secretion of adipocytes [62]. In a previous study, we identified several genes that might play a role in these associations [28]. The downregulation of these genes in obese animals might correspond to the decreased bone mass via the secretion of adipocytes, but does not correspond to an increased obesity-induced bone mass.

Out of the 458 DE genes, 249 were downregulated and 209 were upregulated in obese animals. Functional annotation analysis was performed for the downregulated and upregulated genes separately. Surprisingly, almost all over-represented pathways were the result of the presence of upregulated genes. The over-represented pathways were mostly associated with the immune system (e.g., Ribosome, P = 4.54E⁻¹² and leukocyte trans-endothelial migration, P = 4.32E⁻⁴), except for starch and sucrose metabolism (P = 6.67E⁻⁴) and osteoclast differentiation (P = 6.67E⁻⁴). However, osteoclasts are derived from macrophages and therefore are also related to the immune system [63], which was shown previously [28]. Within the GO terms, DE genes showed a striking relationship, mainly among the upregulated genes, with the immune system, (e.g., immune system process, P = 2.79E⁻²⁰ and immune response, P = 7.92E⁻¹⁸). The downregulated genes were mainly involved with functions related to ribosomes and the translational process (e.g., translational termination, P = 6.56E⁻¹⁷ and cytosolic ribosome, P = 1.33E⁻¹⁵), which cannot be linked directly to obesity itself.

The location and annotation of eQTLs

The eQTL mapping led to the detection of 1,070 eQTLs: 987 cis-eQTLs and 73 trans-eQTLs (FDR < 0.05). All detected eQTLs are presented in Additional file 2. SNPs used for eQTL mapping were filtered based on a high MAF (<0.05). To ensure that this did not affect results owing to the low number of animals, we investigated the MAF of the eSNPs. From those results (not presented), it was evident that most SNPs that were detected as eQTLs had a high MAF. More specifically, only 14 SNPs of the detected eQTLs had a MAF between 0.05 and 0.10. Based on those results and the statistical models used to estimate the eQTL effects, followed by appropriate significance testing, we believe that the results are reliable.

The detected eQTLs were annotated as cis when the distance between the eSNP and gene was within 1 Mb. This was confirmed by the results, which show that most of the cis-eQTLs were located close to the transcription start site, as expected, because this is the region where most transcriptional regulation occurs (Fig. 3a). Further investigation of the effect of the eQTLs (Fig. 3b) showed that most were located in intergenic (46 %) and intronic regions (37 %). In comparison with the complete set of SNPs that passed quality control, we found that the eQTLs were located more often in intronic regions (>8 %) and less often in intergenic regions (<10 %); the frequencies in all other regions were not altered. These percentages agree with the results of previous studies, which showed that disease-associated SNPs are mainly located within non-coding regions (~90 %) [64–66]. We found comparable frequencies of intergenic and intronic SNPs between cis-eQTLs and trans-eQTLs, where SNPs from cis-eQTLs were also located in exonic regions (3′ untranslated region [UTR], 5′UTR, mis-sense, and splice regions). These intergenic and intronic SNPs might be in linkage disequilibrium with a causative SNP, but might also have a regulatory function on the gene expression of the eQTL. Of the detected trans-eQTLs, 10 SNPs were located on a different chromosome than the target gene. We further investigated the distance between the SNP and target gene of the trans-eQTLs that were located on a similar chromosome, and found that a large part of the trans-eQTL would be detected as cis by increasing the window size by several megabases (Fig. 4a). Furthermore, we investigated the number of target genes per detected trans-eQTL and found that most SNPs were targeting up to five genes (Fig. 4b).

Functional annotation of eQTLs

Functional annotation analysis using all detected cis-eQTLs (987) revealed many metabolism-related GO terms and pathways (Table 2). We also found that several adipose tissues (e.g., adipocytes and subcutaneous adipose tissue) were highly significantly associated with the cis-eQTLs.

Table 2 GeneNetwork results over all cis-eQTLs

Full size table

In order to identify potential causal genes, we limited the number of eQTL association tests by confining the eQTL mapping to DE genes (458 genes) and, secondly, to obesity-associated SNPs resulting from the previously conducted GWAS on the OI (366 SNPs) [27]. Using the restriction of DE genes, we found a total of 36 eQTLs, among which GO terms and pathways related to cholesterol transport and other lipid process were represented, for example, protein–lipid complex (P = 2E⁻⁴), and lipid digestion, mobilization, and transport (P = 2E⁻⁵). Restriction to GWAS SNPs resulted in the detection of 24 eQTLs. Functional annotation showed that these genes were mainly expressed in adipose tissue (subcutaneous fat, abdominal; P = 4E⁻³) and associated phenotypes also showed a link with obesity-related characteristics (e.g., abnormal triglyceride level, P = 1E⁻³). Other GO terms and pathways were related to transcription (e.g., viral transcription, P = 3E⁻⁴), metabolism (tyrosine metabolism, P = 1E⁻³), or immunity (e.g., influenza infection, P = 2E⁻⁴). Of those 24 eQTLs, the expression of one target gene was significantly associated (P < 0.05) with the OI: C15orf26. However, this gene does not seem to have any previously discovered association with obesity or obesity-related diseases. Two other genes tended toward significance: RAB11A and USP36 (P < 0.1) RAB11A has been shown to be an element in the GLUT4 trafficking machinery [67] and has been associated with glucose metabolism [68]. To our knowledge, USP36 has not been previously associated with obesity or obesity-related diseases. Unfortunately, we did not identify any overlap of eQTLs between the restricted analyses of the DE subset and GWAS subset.

From the 36 eQTLs detected among the subset of DE genes, 35 were also present in the eQTLs using no restrictions. One of these eQTLs involves the ENPP1 (ectonucleotide pyrophosphatase/phosphodiesterase 1) gene (Fig. 5a) that encodes a type II transmembrane glycoprotein (P = 1.43E⁻⁴). This glycoprotein plays a role in the regulation of pyrophosphate levels, and several other functions are known: for example, bone mineralization and the regulation of purinergic signaling. More importantly, this gene is highly expressed in adipose tissue and has been associated with obesity, type-2 diabetes, and insulin resistance [69, 70] and also with eating behavior in a pig population [57]. Moreover, expression of this gene inhibits insulin signaling via reduced insulin-receptor tyrosine kinase activity [71]. The data here showed a large difference between the AA and GG phenotypes, whereby the AA animals were lean and GG animals (with an increased expression of ENPP1) were obese, which agrees with data in the literature. For example, the AA animals (n = 8) weighed 0.695 kg (standard deviation [SD] = 0.13 kg) at birth and showed a mean gain of 0.37 kg/day (SD = 0.10 kg/day), compared with GG animals (n = 10), which weighed 0.923 kg (SD = 0.12 kg) at birth and gained 0.535 kg/day (SD = 0.12 kg/day). At slaughter, the AA animals contained 24.88 mm of backfat (SD = 7.59 mm) whereas the GG animals had 39.25 mm of backfat (SD = 7.19 mm). However, we found no difference in the fasting glucose levels between the genotypes.

Another eQTL that has been associated with obesity and insulin resistance is CTSL (cathepsin L), a lysosomal cysteine proteinase (Fig. 5b, P = 5.55E⁻⁶). Several studies have investigated the role of CTSL and have shown, for example, that inhibition of CTSL results in limited adipogenesis or lipid accumulation [72] by reducing the levels of pivotal transcriptional mediators of adipogenesis. Moreover, the pharmacological inhibition of CTSL resulted in reduced body weight gain, and levels of CTSL were elevated in patients who were obese and diabetic [72]. Our results confirm these findings; for example, the AA animals (n = 11) weighed 0.93 kg (SD = 0.16 kg) at birth and showed a mean weight gain of 0.44 kg/day (SD = 0.10 kg/day), whereas the GG animals (n = 2) weighed 0.64 kg (SD = 0.15 kg) at birth and gained 0.32 kg/day (SD = 0.04 kg/day). Furthermore, the AA animals contained 2.73 kg (SD = 0.898 kg) of fat at dual-energy X-ray absorptiometry (DXA) scanning and 2.48 kg (SD = 1.30 kg) of leaf fat at slaughter, whereas the GG animals had 1.60 kg (SD = 0.694 kg) of fat at DXA scanning and 2.20 kg (SD = 1.63) of leaf fat at slaughter. The gene function prediction of GeneNetwork also showed a clear role of CTSL in the regulation of cholesterol and lipids, for example, regulation of plasma lipoprotein particle levels (P = 2.98E⁻¹⁶) and lipid storage (P = 3.80E⁻¹⁴).

Another detected eQTL was CIDE-C (cell death-inducing DFFA-like effector c), also called fat specific protein 27 (FSP27), which was more highly expressed in the TT genotype than the CC genotype (Fig. 5c, P = 2.23E⁻⁴). This gene promotes triglyceride (lipid droplet) formation and has a negative regulatory effect on adipocyte apoptosis [73, 74]. A CIDE-C knockout model in mice resulted in smaller lipid droplets [75]. Furthermore, CIDE-C is regulated by insulin via the Akt1/2-dependent and JNK2-dependent pathways in adipocytes [76]. Animals with the TT genotype (n = 8) were heavier (8.29 versus 17.91 kg at DXA scanning) and showed a higher mean daily gain (0.31 versus 0.51 kg/day) than CC animals (n = 8) and had a considerably higher amount of fat than in CC animals: 1.56 versus 3.20 kg estimated by DXA scanning and 2.01 versus 3.01 kg at slaughter (weight of leaf fat). These results differ from findings in other studies, where CIDE-C mRNA levels were lower in obese subjects and were negatively correlated with body mass index and percentage fat mass, but increased in obese patients after weight loss [77]. However, the GEO database also contains studies that show a lower CIDE-C expression for high weight gainers versus low weight gainers [GEO: GDS2319] and a higher expression in the adipose stem cells of morbidly obese individuals versus non-obese individuals [GEO: GDS5056]. GeneNetwork identified many adipose-related GO terms and pathways for CIDE-C using the predicted function, for example, the GO Biological Process triglyceride metabolic process (P = 1.29E⁻⁷⁶) and GO Cellular Component lipid particle (P = 1.27E⁻⁸⁸).

In addition to the cis-eQTLs, we detected 73 trans-eQTLs and the functional annotation of this group of genes using GeneNetwork resulted in the detection of (subcutaneous) adipose tissue as associated over-represented tissue (P = 6E⁻⁴). Furthermore, a wide variety of significant GO terms and pathways were detected, which were not all directly linked to obesity, for example, excitatory synapse (P = 10E⁻⁵). In general, trans-eQTLs provide a fundamental understanding of potential gene-to-gene regulatory architecture of complex traits and diseases and can also be used to predict transcription factor binding sites [78]. For the trans-eQTLs, genes and SNPs were restricted to DE genes and GWAS SNPs. Among the trans-eQTLs, only two genes overlapped between all trans-eQTLs and those among the DE genes (GFRα3 and MYH3), and only one gene overlapped between all trans-eQTLs and the trans-eQTLs among the GWAS-significant SNPs (ABHD12B). The GFRα3 gene encodes the artemin receptor, which is a neurotrophin with various functions, such as nerve regeneration and tumor-cell migration [79], although no direct link has previously been found between GFRα3 and obesity. Similarly, no direct link has been found between MYH3 (encoding myosin, heavy chain 3, skeletal muscle, embryonic) and obesity. Myosin converts chemical energy into mechanical energy via ATP hydrolysis. Growth characteristics in cattle and pigs have been associated with MYH3 expression, in addition to a difference in muscle growth between and lean and obese pigs, which suggests an association between MYH3 and adiposity. However, we observed no difference in obesity-related phenotypes according to MYH3 expression. The third gene, ABHD12B (α/β-hydrolase domain containing 12B), has also not been previously associated with obesity (Fig. 5d, P = 2.36E⁻⁹). However, it plays a role in lysophosphatidylserine (LPS) metabolism, and ABHD12B knockout mice have a deregulated accumulation of proinflammatory lipids [80]. Furthermore, it has been shown that LPS stimulates glucose transport in adipocytes [81]. In this study, animals with the AA genotype on ALGA0006476 (n = 12) were more obese than animals with the CC genotype (n = 8) and showed a mean daily gain of 0.47 kg/day (SD = 0.11 kg/day) compared to 0.38 kg/day (SD = 0.12 kg/day), and a weight of leaf fat at slaughter of 3.06 kg (SD = 1.36 kg) compared to 1.81 kg (SD = 1.23 kg). The expression of ABHD12B was higher in CC, suggesting that upregulation or activation of this gene results in leaner animals. To our knowledge, no other studies have shown significant effects for the expression of ABHD12B.

Integration of eQTL results with gene co-expression network analysis

Previously, we investigated the RNA-Seq data from this study using a gene co-expression network approach (WGCNA) [28] and we detected five modules that were potentially biologically associated with obesity-related characteristics. We hypothesized that modules containing more eQTL genes than expected by chance would pinpoint modules that are more likely to be causal for the trait under study. Therefore, we investigated how many eQTLs (out of the 987 detected cis-eQTLs) were present in each of the five modules detected using WGCNA. We found five eQTLs in the Green-yellow Module (47 genes), five eQTLs in the Brown Module (86 genes), two eQTLs in the Blue Module (69 genes), and one eQTL in the Black Module (36 genes), with no eQTLs in the Light-yellow Module. This represents 10.64, 5.81, 2.90, 2.78, and 0 % of the number of genes in that particular module, respectively, which is unfortunately not higher than expected by chance (hypergeometric test). The eQTL in the Black Module is a novel gene (uncharacterized protein), with no known orthologs.

The Green-yellow Module was strongly associated with obesity-related phenotypes, but in our previous study, the functional annotation did not identify a relationship with obesity. We have now found five eQTLs in this module: ALDH1L2, GGTA1, KRR1, ME3, and OPTN. The OPTN gene encodes optineurin, a protein that has been investigated intensively in the neuroscience field, and is associated with primary open-angle glaucoma and amyotrophic lateral sclerosis [82]. Notably, it also plays a role in adipogenesis, and modulates the developmental switch into brown preadipocytes [83]. GeneNetwork predicts the adipocytokine signaling pathway (P = 1.5E⁻⁷) as the most likely associated KEGG pathway. For the other genes, no obvious association with obesity or obesity-related phenotypes was found.

In the Brown Module, we found five eQTLs, representing four unique genes: ARF6, PMVK, MSRB2, and two eQTLs in RIN2. The ARF6 gene encodes a small GTP-binding protein that regulates vesicular trafficking actin cytoskeletal dynamics [84]. The PMVK gene encodes an enzyme that functions in the cholesterol biosynthesis pathway, which converts mevalonic acid-5P to mevalonic acid 5-pyrophosphate. Furthermore, it has been shown to be critical in the regulation of the secretion of insulin in pancreatic β cells [85]. The other genes have not been related to obesity or obesity-related phenotypes.

In our previous study, the Blue Module revealed a potential genetic association between obesity, the immune system, and bone remodeling (osteoporosis). Therefore, we would expect that the eQTLs in this module (LAT2 and IGSF6) have a more causal role in this genetic association. Both genes play a role in the immune system, but have not been previously shown to be directly related to obesity.

Supervised gene co-expression network and integration with protein–protein interactions

Both DE genes and cis-eQTL genes were used as input for sWGCNA, with the focus on potential causal genes for obesity. The Pearson’s correlations among 1,408 unique genes (987 cis-eQTLs and 458 DE genes) were calculated and raised to a power β, of three, reaching a scale-free topology index (R ²) of 0.93. Using the TOM, we detected eight modules of at least 25 genes per module, three of which showed strong significant correlations with the OI (MTR_OI) and other obesity-related traits: the Yellow _sWGCNA Module (MTR_OI = −0.74), Blue _sWGCNA Module (MTR_OI = −0.71), and Turquoise _sWGCNA Module (MTR_OI = 0.69). The functional annotation of those modules showed strongly significant GO terms and pathways for only the Turquoise _sWGCNA Module, which were related to obesity and the biological processes lipid localization (P _adj = 5.35E⁻¹²) and lipid transport (P _adj = 2.39E⁻¹⁰) were most significantly over-represented. Based solely on the connectivity of the genes in this module, ITGB2 (β2-Integrin) is the hub gene (highest connectivity) of this module; however, it is not included in the module after gene selection based on the gene-trait correlation (correlation of 0.59 with the OI). According to the gene network prediction of GeneNetwork, ITGB2 is co-expressed with many other genes, which all possess functions within the immune system (e.g., activation of immune response). This gene has also previously been associated with obesity: mutations in ITGB2 have been associated with obesity in mice [86], and fat oxidation and insulin metabolism were impaired in knockout mice [87]. The ITGB2 gene has also been associated with obesity in humans: a polymorphism in this gene is associated with obesity among Japanese individuals living in the US (westernized diet) [88]. Based on measures other than connectivity, such as intra-modular connectivity and gene-trait correlation, other important genes were detected within this module: NCKAP1L, S100A10, VSIG4, and CD68, which have all been associated with immune-related processes. The NCKAP1L gene is also strongly co-expressed with many other genes but is not expressed in adipose tissue, and functions in immune-related processes. S100A10 encodes the protein p11, which functions in cellular processes such as exocytosis and endocytosis, and has been associated with serotonergic signaling and, consequently, with depression [89]. VSIG4 encodes a B7 family-related protein and negatively regulates T cell activation and IL-2 production [90]. CD68 encodes a glycoprotein that is mainly expressed by monocytes and tissue macrophages [91] and binds to oxidized low-density lipoproteins on the cell surface, and might consequently play a role in atherosclerotic lesions [92]. We further analyzed and visualized this module in Cytoscape and merged the turquoise network with the known PPIs from IntAct. The resulting network consisted of 419 nodes and 3,015 edges (Fig. 6a). We calculated the network statistics and clustered the genes and proteins within Cytoscape (Fig. 6b). The largest cluster contained 137 proteins that interacted with a single gene: CALCOCO2 (calcium binding and coiled–coil domain 2). This gene is significantly upregulated in adipose tissue (P = 0.0003) according to HumanMine (http://humanmine.org) and has been shown to be negatively correlated with the level of triglycerides in muscle tissue in a mouse model for human obesity [93]. Two out of the four genes that were detected as potentially important or hub genes (VSIG4 and CD68) were located within second largest module. The third gene (NCKAP1L) formed a cluster together with five other genes. The fourth gene (S100A10) formed a cluster by itself, with ten other proteins.

Conclusion

In this study, we examined the transcriptome and genome of 36 lean, intermediate, and obese pigs using a variety of multi-omic systems genetics approaches, with the aim of detecting potential causal genes and regulatory networks for human obesity, using the pig as a model. We performed DE analysis, weighted gene co-expression network analyses, and integrative systems genetics analyses of obesity by integrating and jointly analyzing the genome and transcriptome using an eQTL approach. We also generated networks using the identified eQTLs to provide causal networks and to identify more biologically relevant causal genes.

We successfully identified many DE genes and the associated pathways showed several immune-related pathways and GO terms, mainly among the upregulated genes. Furthermore, we conducted eQTL mapping, a systems genetics approach, to detect which genetic variants affect the expression levels of obesity-related genes. We detected many cis-acting and trans-acting eQTLs, mostly located in intronic and intragenic regions, which were further analyzed by pathway analyses and we detected many different metabolic pathways using GeneNetwork. To limit our eQTL search to the most promising and potential causal genes, we focused on DE genes and SNPs that exceeded the genome-wide significance threshold in a GWAS with the OI scores. We evaluated how many of the detected eQTLs were present in clusters of highly interconnected genes detected in our previous study, because those clusters containing many eQTLs might represent a causal function of the module leading to obesity. The restriction of the data in these ways led to a subset of eQTLs that were further analyzed using GO and pathway analysis, which resulted in several adiposity-related terms and pathways. The detected trans-eQTLs could not be directly linked to obesity, but provided insights into complex trans-regulatory mechanisms. Finally, we performed a sWGCNA on all detected cis-eQTLs and identified several modules that highly correlated with obesity phenotypes. One of these modules showed a strong association with lipid pathways. However, integration with known PPIs from a publically available database did not provide further insight into important underlying mechanisms.

For years, DNA markers have been studied in association with complex traits, for example, obesity, which has led to the detection of several associated genes. Similarly, gene expression has been studied in detail to detect associated genes. However, the combination of DNA markers and gene expression data leads to a better understanding of the mechanisms behind the translation from DNA marker via transcription toward complex disease, and therefore targets a greater number of potentially causal genes. In this study, we detected several eQTLs that revealed genes that may cause obesity, due to the combined association of DNA marker and transcription with obesity. These genes have been previously associated with obesity-related traits, but have not all been associated with obesity. In this study, we identified, for example, the genes ENPP1, CTSL, CIDE-C, and ABHD12B as potential causal genes for obesity, and further validation (e.g., by qPCR in a large human population) and investigation of these genes might lead to biomarkers for obesity. However, in our study we selected the strongest eQTL effect per DNA marker/gene target, but owing to linkage disequilibrium the identified gene might not always be the true causal gene. Therefore, other strategies are needed to prove causality of our detected potentially causal genes, for which several integrative approaches are available, for example, as proposed by Schadt et al. [94].

In conclusion, this systems genetics study (integrating RNA-Sequencing transcriptomic and genomic information) revealed potential causal genes, and provided insight in the genetic and regulatory architecture of obesity pathways. Furthermore, several relevant GO terms and molecular pathways related to obesity, are presented here. To the best of our knowledge, this is the first study to report integrated transcriptomic and genomics data in a porcine model for obesity.

References

WHO. World Health Organization: Obesity and overweight, Fact sheet No. 311. 2012. Updated March 2013. Available at http://www.who.int/mediacentre/factsheets/fs311/en/.
Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet. 2010;42(11):937–48. doi:10.1038/ng.686.
Article PubMed Central CAS PubMed Google Scholar
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518(7538):197–206. doi:10.1038/nature14177.
Article PubMed Central CAS PubMed Google Scholar
Adams JU. Transcriptome: connecting the genome to gene function. Nat Educ. 2008;1(1):195.
Google Scholar
Oshlack A, Robinson M, Young M. From RNA-seq reads to differential expression results. Genome Biol. 2010;11:220. doi:10.1186/gb-2010-11-12-220.
Article PubMed Central CAS PubMed Google Scholar
Cirera S, Jensen MS, Elbrønd VS, Moesgaard SG, Christoffersen BØ, Kadarmideen HN, et al. Expression studies of six human obesity-related genes in seven tissues from divergent pig breeds. Anim Genet. 2014;45(1):59–66. doi:10.1111/age.12082.
Article CAS PubMed Google Scholar
Rodríguez-Acebes S, Palacios N, Botella-Carretero J, Olea N, Crespo L, Peromingo R, et al. Gene expression profiling of subcutaneous adipose tissue in morbid obesity using a focused microarray: distinct expression of cell-cycle- and differentiation-related genes. BMC Med Genomics. 2010;3(1):1–15. doi:10.1186/1755-8794-3-61.
Article Google Scholar
Lee YH, Nair S, Rousseau E, Allison DB, Page GP, Tataranni PA, et al. Microarray profiling of isolated abdominal subcutaneous adipocytes from obese vs non-obese Pima Indians: increased expression of inflammation-related genes. Diabetologia. 2005;48(9):1776–83. doi:10.1007/s00125-005-1867-3.
Article PubMed Central CAS PubMed Google Scholar
Marrades MP, Milagro FI, Martínez JA, Moreno-Aliaga MJ. Differential expression of aquaporin 7 in adipose tissue of lean and obese high fat consumers. Biochem Biophys Res Commun. 2006;339(3):785–9. doi:10.1016/j.bbrc.2005.11.080.
Article CAS PubMed Google Scholar
Baranova A, Collantes R, Gowder S, Elariny H, Schlauch K, Younoszai A, et al. Obesity-related differential gene expression in the visceral adipose tissue. Obes Surg. 2005;15(6):758–65. doi:10.1381/0960892054222876.
Article PubMed Google Scholar
Wang, Z, Gerstein M, and Snyder M, RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10(1):57–63. doi:10.1038/nrg2484.
Article PubMed Central CAS PubMed Google Scholar
Marioni J, Mason C, Mane S, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509–17. doi:10.1101/gr.079558.108.
Article PubMed Central CAS PubMed Google Scholar
Kadarmideen H, von Rohr P, Janss L. From genetical genomics to systems genetics: potential applications in quantitative genomics and animal breeding. Mamm Genome. 2006;17(6):548–64. doi:10.1007/s00335-005-0169-x.
Article PubMed Central CAS PubMed Google Scholar
Civelek M, Lusis AJ. Systems genetics approaches to understand complex traits. Nat Rev Genet. 2014;15(1):34–48. doi:10.1038/nrg3575.
Article PubMed Central CAS PubMed Google Scholar
Kadarmideen HN, Reverter A. Combined genetic, genomic and transcriptomic methods in the analysis of animal traits. CAB Reviews. 2007;2(042):16pp.
Article Google Scholar
Cookson W, Liang L, Abecasis G, Moffatt M, Lathrop M. Mapping complex disease traits with global gene expression. Nat Rev Genet. 2009;10(3):184–94. doi:10.1038/nrg2537.
Article PubMed Central CAS PubMed Google Scholar
Dixon AL, Liang L, Moffatt MF, Chen W, Heath S, Wong KCC, et al. A genome-wide association study of global gene expression. Nat Genet. 2007;39(10):1202–7. doi:10.1038/ng2109.
Article CAS PubMed Google Scholar
Visscher PM, Hill WG, Wray NR. Heritability in the genomics era--concepts and misconceptions. Nat Rev Genet. 2008;9(4):255–66. doi:10.1038/nrg2322.
Article CAS PubMed Google Scholar
Ponsuksili S, Murani E, Brand B, Schwerin M, Wimmers K. Integrating expression profiling and whole-genome association for dissection of fat traits in a porcine model. J Lipid Res. 2011;52(4):668–78. doi:10.1194/jlr.M013342.
Article PubMed Central CAS PubMed Google Scholar
Ghazalpour A, Doss S, Sheth S, Ingram-Drake L, Schadt E, Lusis A, et al. Genomic analysis of metabolic pathway gene expression in mice. Genome Biol. 2005;6(7):R59.
Article PubMed Central PubMed Google Scholar
Davis RC, van Nas A, Castellani LW, Zhao Y, Zhou Z, Wen P, et al. Systems genetics of susceptibility to obesity-induced diabetes in mice. Physiol Genomics. 2012;44(1):1–13. doi:10.1152/physiolgenomics.00003.2011.
Article PubMed Central CAS PubMed Google Scholar
Safari-Alighiarloo N, Taghizadeh M, Rezaei-Tavirani M, Goliaei B, Peyvandi AA. Protein-protein interaction networks (PPI) and complex diseases. Gastroenterol Hepatol Bed Bench. 2014;7(1):17–31.
PubMed Central PubMed Google Scholar
Sevimoglu T, Arga KY. The role of protein interaction networks in systems biomedicine. Comput Struct Biotechnol J. 2014;11(18):22–7. doi:10.1016/j.csbj.2014.08.008.
Article PubMed Central PubMed Google Scholar
Spurlock ME, Gabler NK. The development of porcine models of obesity and the metabolic syndrome. J Nutr. 2008;138(2):397–402.
CAS PubMed Google Scholar
Rocha D, Plastow G. Using commercial pigs in the search for genes behind human obesity. Nat Rev Genet. 2005;6(3). doi:10.1038/nrg1556-c1.
Kogelman LJA, Kadarmideen HN, Mark T, Karlskov-Mortensen P, Bruun CS, Cirera S, et al. An F2 pig resource population as a model for genetic studies of obesity and obesity-related diseases in humans: design and genetic parameters. Front Genet. 2013;4:29. doi:10.3389/fgene.2013.00029.
Article PubMed Central PubMed Google Scholar
Kogelman LJA, Pant SD, Fredholm M, Kadarmideen HN. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses. Front Genet. 2014;5:214. doi:10.3389/fgene.2014.00214.
Article PubMed Central PubMed Google Scholar
Kogelman LJA, Cirera S, Zhernakova D, Fredholm M, Franke L, Kadarmideen H. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model. BMC Med Genomics. 2014;7(1):57. doi:10.1186/1755-8794-7-57.
Article PubMed Central PubMed Google Scholar
Westra H-J, Franke L. From genome to function by studying eQTLs. Biochim Biophys Acta. 2014;1842(10):1896–902. doi:10.1038/ncb1623.
Article CAS PubMed Google Scholar
Westra H-J, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. 2013;45(10):1238–43. doi:10.1038/ng.2756.
Article PubMed Central CAS PubMed Google Scholar
Cameron ND. Selection indices and prediction of genetic merit in animal breeding. New York: CAB International; 1997.
Google Scholar
Nettleton D, Wang D. Selective transcriptional profiling for trait-based eQTL mapping. Anim Genet. 2006;37 Suppl 1:13–7. doi:10.1111/j.1365-2052.2006.01478.x.
Article CAS PubMed Google Scholar
Cardoso FF, Rosa GJ, Steibel JP, Ernst CW, Bates RO, Tempelman RJ. Selective transcriptional profiling and data analysis strategies for expression quantitative trait loci mapping in outbred F2 populations. Genetics. 2008;180:1679–90. doi:10.1534/genetics.108.090969.
Article PubMed Central PubMed Google Scholar
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. doi:10.1093/bioinformatics/bts635.
Article PubMed Central CAS PubMed Google Scholar
Anders S, Pyl PT, Huber W. HTSeq - A Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31(2):166–9. doi: 10.1093/bioinformatics/btu638.
Article PubMed Central PubMed Google Scholar
Łabaj PP, Leparc GG, Linggi BE, Markillie LM, Wiley HS, Kreil DP. Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling. Bioinformatics. 2011;27(13):i383–i91. doi:10.1093/bioinformatics/btr247.
Article PubMed Central PubMed Google Scholar
Anders S. Analysing RNA-Seq data with the DESeq Package. 2010. Available at http://www.bioconductor.org/help/course-materials/2011/BioC2011/LabStuff/DESeq.pdf. . Accessed 15 Oct 2015.
Smyth GK. Limma: Linear models for microarray data. Bioinformatics and computational biology solutions using R and Bioconductor. New York: Springer; 2005. p. 397–420.
Book Google Scholar
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57:289–300.
Google Scholar
Haider S, Ballester B, Smedley D, Zhang J, Rice P, Kasprzyk A. BioMart Central Portal—unified access to biological data. Nucleic Acids Res. 2009;37 suppl 2:W23–W7. doi:10.1093/nar/gkp265.
Article PubMed Central CAS PubMed Google Scholar
Fehrmann RS, Jansen RC, Veldink JH, Westra HJ, Arends D, Bonder MJ, et al. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet. 2011;7(8):e1002197. doi:10.1371/journal.pgen.1002197.
Article PubMed Central CAS PubMed Google Scholar
Ponsuksili S, Du Y, Murani E, Schwerin M, Wimmers K. Elucidating molecular networks that either affect or respond to plasma cortisol concentration in target tissues of liver and muscle. Genetics. 2012;192(3):1109–22. doi:10.1534/genetics.112.143081.
Article PubMed Central CAS PubMed Google Scholar
McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics. 2010;26(16):2069–70. doi:10.1093/bioinformatics/btq330.
Article PubMed Central CAS PubMed Google Scholar
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi:10.1186/1471-2105-9-559.
Article PubMed Central PubMed Google Scholar
Langfelder P, Zhang B, Horvath S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics. 2008;24(5):719–20. doi:10.1093/bioinformatics/btm563.
Article CAS PubMed Google Scholar
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. doi:10.1101/gr.1239303.
Article PubMed Central CAS PubMed Google Scholar
Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F et al. The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 2013. doi:10.1093/nar/gkt1115.
Su G, Kuchinsky A, Morris JH, States DJ, Meng F. GLay: community structure analysis of biological networks. Bioinformatics. 2010;26(24):3135–7. doi:10.1093/bioinformatics/btq596.
Article PubMed Central CAS PubMed Google Scholar
Young M, Wakefield M, Smyth G, Oshlack A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 2010;11(2):R14. doi:10.1186/gb-2010-11-2-r14.
Article PubMed Central PubMed Google Scholar
Fehrmann RSN, Karjalainen JM, Krajewska M, Westra H-J, Maloney D, Simeonov A, et al. Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat Genet. 2015;47(2):115–25. doi:10.1038/ng.3173.
Article CAS PubMed Google Scholar
Kadarmideen HN. Genomics to systems biology in animal and veterinary sciences: progress, lessons and opportunities. Livest Sci. 2014;166:232–48. doi:10.1016/j.livsci.2014.04.028.
Article Google Scholar
Kadarmideen HN, Watson-Haigh NS, Andronicos NM. Systems biology of ovine intestinal parasite resistance: disease gene modules and biomarkers. Mol BioSyst. 2011;7(1):235–46. doi:10.1039/c0mb00190b.
Article CAS PubMed Google Scholar
Max M, Shanker YG, Huang L, Rong M, Liu Z, Campagne F, et al. Tas1r3, encoding a new candidate taste receptor, is allelic to the sweet responsiveness locus Sac. Nat Genet. 2001;28(1):58–63. doi:10.1038/ng0501-58.
CAS PubMed Google Scholar
Bachmanov AA, Beauchamp GK. Taste receptor genes. Annu Rev Nutr. 2007;27:389–414. doi:10.1146/annurev.nutr.26.061505.111329.
Article PubMed Central CAS PubMed Google Scholar
Grimm ER, Steinle NI. Genetics of eating behavior: established and emerging concepts. Nutr Rev. 2011;69(1):52–60. doi:10.1111/j.1753-4887.2010.00361.x.
Article PubMed Central PubMed Google Scholar
Salbe AD, DelParigi A, Pratley RE, Drewnowski A, Tataranni PA. Taste preferences and body weight changes in an obesity-prone population. Am J Clin Nutr. 2004;79(3):372–8.
CAS PubMed Google Scholar
Do DN, Strathe AB, Ostersen T, Jensen J, Mark T, Kadarmideen HN. Genome-wide association study reveals genetic architecture of eating behavior in pigs and its implications for humans obesity by comparative mapping. PLoS One. 2013;8(8):e71509. doi:10.1371/journal.pone.0071509.
Article PubMed Central CAS PubMed Google Scholar
Zheng J, Wu C, Ma W, Zhang Y, Hou T, Xu H, et al. Abnormal expression of chondroitin sulphate N-acetylgalactosaminyltransferase 1 and Hapln-1 in cartilage with Kashin-Beck disease and primary osteoarthritis. Int Orthop. 2013;37(10):2051–9. doi:10.1007/s00264-013-1937-y.
Article PubMed Central PubMed Google Scholar
McDonough CW, Palmer ND, Hicks PJ, Roh BH, An SS, Cooke JN, et al. A genome-wide association study for diabetic nephropathy genes in African Americans. Kidney Int. 2011;79(5):563–72. doi:10.1038/ki.2010.467.
Article PubMed Central PubMed Google Scholar
Namba K, Nishio M, Mori K, Miyamoto N, Tsurudome M, Ito M, et al. Involvement of ADAM9 in multinucleated giant cell formation of blood monocytes. Cell Immunol. 2001;213(2):104–13. doi:10.1006/cimm.2001.1873.
Article CAS PubMed Google Scholar
Cao JJ. Effects of obesity on bone metabolism. J Orthop Surg Res. 2011;6:30. doi:10.1186/1749-799x-6-30.
Article PubMed Central PubMed Google Scholar
Gimble JM, Robinson CE, Wu X, Kelly KA. The function of adipocytes in the bone marrow stroma: an update. Bone. 1996;19(5):421–8. doi:10.1016/s8756-3282(96)00258-x.
Article CAS PubMed Google Scholar
Wu Y, Humphrey MB, Nakamura MC. Osteoclasts - the innate immune cells of the bone. Autoimmunity. 2008;41(3):183–94. doi:10.1080/08916930701693180.
Article PubMed Google Scholar
Pennisi E. Disease risk links to gene regulation. Science. 2011;332(6033):1031. doi:10.1126/science.332.6033.1031.
Article PubMed Google Scholar
Kumar V, Wijmenga C, Withoff S. From genome-wide association studies to disease mechanisms: celiac disease as a model for autoimmune diseases. Semin Immunopathol. 2012;34(4):567–80. doi:10.1007/s00281-012-0312-1.
Article PubMed Central CAS PubMed Google Scholar
Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci. 2009;106(23):9362–7. doi:10.1073/pnas.0903103106.
Article PubMed Central CAS PubMed Google Scholar
Schwenk RW, Eckel J. A novel method to monitor insulin-stimulated GTP-loading of Rab11a in cardiomyocytes. Cell Signal. 2007;19(4):825–30. doi:10.1016/j.cellsig.2006.10.008.
Article CAS PubMed Google Scholar
Capobianco V, Nardelli C, Ferrigno M, Iaffaldano L, Pilone V, Forestieri P, et al. miRNA and protein expression profiles of visceral adipose tissue reveal miR-141/YWHAG and miR-520e/RAB11A as two potential miRNA/protein target pairs associated with severe obesity. J Proteome Res. 2012;11(6):3358–69. doi:10.1021/pr300152z.
Article CAS PubMed Google Scholar
Pan W, Chandalia M, Abate N. New insights into the role of ENPP1 in insulin resistance. J Metabonomics Metab. 2012;1(1). doi:10.4172/jmbm.1000e103.
Meyre D, Bouatia-Naji N, Tounian A, Samson C, Lecoeur C, Vatin V, et al. Variants of ENPP1 are associated with childhood and adult obesity and increase the risk of glucose intolerance and type 2 diabetes. Nat Genet. 2005;37(8):863–7. doi:10.1038/ng1604.
Article PubMed Central CAS PubMed Google Scholar
Maddux BA, Goldfine ID. Membrane glycoprotein PC-1 inhibition of insulin receptor function occurs via direct interaction with the receptor alpha-subunit. Diabetes. 2000;49(1):13–9. doi:10.2337/diabetes.49.1.13.
Article CAS PubMed Google Scholar
Yang M, Zhang Y, Pan J, Sun J, Liu J, Libby P, et al. Cathepsin L activity controls adipogenesis and glucose tolerance. Nat Cell Biol. 2007;9(8):970–7. doi:10.1038/ncb1623.
Article CAS PubMed Google Scholar
Puri V, Konda S, Ranjit S, Aouadi M, Chawla A, Chouinard M, et al. Fat-specific protein 27, a novel lipid droplet protein that enhances triglyceride storage. J Biol Chem. 2007;282(47):34213–8. doi:10.1074/jbc.M707404200.
Article CAS PubMed Google Scholar
Puri V, Virbasius JV, Guilherme A, Czech MP. RNAi screens reveal novel metabolic regulators: RIP140, MAP4k4 and the lipid droplet associated fat specific protein (FSP) 27. Acta Physiol. 2008;192(1):103–15. doi:10.1111/j.1748-1716.2007.01786.x.
Article CAS Google Scholar
Toh SY, Gong J, Du G, Li JZ, Yang S, Ye J, et al. Up-regulation of mitochondrial activity and acquirement of brown adipose tissue-like property in the white adipose tissue of fsp27 deficient mice. PLoS One. 2008;3(8):e2890. doi:10.1371/journal.pone.0002890.
Article PubMed Central PubMed Google Scholar
Ito M, Nagasawa M, Omae N, Ide T, Akasaka Y, Murakami K. Differential regulation of CIDEA and CIDEC expression by insulin via Akt1/2- and JNK2-dependent pathways in human adipocytes. J Lipid Res. 2011;52(8):1450–60. doi:10.1194/jlr.M012427.
Article PubMed Central CAS PubMed Google Scholar
Moreno-Navarrete JM, Ortega F, Serrano M, Rodriguez-Hermosa JI, Ricart W, Mingrone G, et al. CIDEC/FSP27 and PLIN1 gene expression run in parallel to mitochondrial genes in human adipose tissue, both increasing after weight loss. Int J Obes (Lond). 2014;38(6):865–72. doi:10.1038/ijo.2013.171.
Article CAS Google Scholar
Rohr PV, Friberg M, Kadarmideen HN. Prediction of transcription factor binding sites using results from genetical genomics investigations. J Bioinforma Comput Biol. 2007;5:773–93.
Article Google Scholar
Thornton P, Hatcher JP, Robinson I, Sargent B, Franzén B, Martino G, et al. Artemin–GFRα3 interactions partially contribute to acute inflammatory hypersensitivity. Neurosci Lett. 2013;545:23–8. doi:10.1016/j.neulet.2013.04.007.
Article CAS PubMed Google Scholar
Blankman JL, Long JZ, Trauger SA, Siuzdak G, Cravatt BF. ABHD12 controls brain lysophosphatidylserine pathways that are deregulated in a murine model of the neurodegenerative disease PHARC. Proc Natl Acad Sci U S A. 2013;110(4):1500–5. doi:10.1073/pnas.1217121110.
Article PubMed Central CAS PubMed Google Scholar
Yea K, Kim J, Lim S, Kwon T, Park HS, Park KS, et al. Lysophosphatidylserine regulates blood glucose by enhancing glucose transport in myotubes and adipocytes. Biochem Biophys Res Commun. 2009;378(4):783–8. doi:10.1038/ncb1623.
Article CAS PubMed Google Scholar
Turturro S, Shen X, Shyam R, Yue B, Ying H. Effects of mutations and deletions in the human optineurin gene. Springer Plus. 2014;3(1):99. doi:10.1186/2193-1801-3-99.
Article PubMed Central PubMed Google Scholar
Sharma A, Huard C, Vernochet C, Ziemek D, Knowlton KM, Tyminski E, et al. Brown fat determination and development from muscle precursor cells by novel action of bone morphogenetic protein 6. PLoS One. 2014;9(3):e92608. doi:10.1371/journal.pone.0092608.
Article PubMed Central PubMed Google Scholar
Donaldson JG. Multiple roles for Arf6: sorting, structuring, and signaling at the plasma membrane. J Biol Chem. 2003;278(43):41573–6. doi:10.1074/jbc.R300026200.
Article CAS PubMed Google Scholar
Lawrence JTR, Birnbaum MJ. ADP-ribosylation factor 6 regulates insulin secretion through plasma membrane phosphatidylinositol 4,5-bisphosphate. Proc Natl Acad Sci. 2003;100(23):13320–5. doi:10.1073/pnas.2232129100.
Article PubMed Central CAS PubMed Google Scholar
Dong ZM, Gutierrez-Ramos J-C, Coxon A, Mayadas TN, Wagner DD. A new class of obesity genes encodes leukocyte adhesion receptors. Proc Natl Acad Sci. 1997;94(14):7526–30.
Article PubMed Central CAS PubMed Google Scholar
Babic AM, Wang HW, Lai MJ, Daniels TG, Felbinger TW, Burger PC, et al. ICAM-1 and beta 2 integrin deficiency impairs fat oxidation and insulin metabolism during fasting. Mol Med. 2004;10(7–12):72–9. doi:10.2119/2004-00038.Wagner.
PubMed Central CAS PubMed Google Scholar
Awaya T, Yokosaki Y, Yamane K, Usui H, Kohno N, Eboshida A. Gene-environment association of an ITGB2 sequence variant with obesity in ethnic Japanese. Obesity. 2008;16(6):1463–6. doi:10.1038/oby.2008.68.
Article CAS PubMed Google Scholar
Svenningsson P, Kim Y, Warner-Schmidt J, Oh Y-S, Greengard P. p11 and its role in depression and therapeutic responses to antidepressants. Nat Rev Neurosci. 2013;14(10):673–80. doi:10.1038/nrn3564.
Article PubMed Central CAS PubMed Google Scholar
Vogt L, Schmitz N, Kurrer MO, Bauer M, Hinton HI, Behnke S, et al. VSIG4, a B7 family–related protein, is a negative regulator of T cell activation. J Clin Invest. 2006;116(10):2817–26. doi:10.1172/JCI25673.
Article PubMed Central CAS PubMed Google Scholar
Holness C, Simmons D. Molecular cloning of CD68, a human macrophage marker related to lysosomal glycoproteins. Blood. 1993;81(6):1607–13.
CAS PubMed Google Scholar
Ramprasad MP, Terpstra V, Kondratenko N, Quehenberger O, Steinberg D. Cell surface expression of mouse macrosialin and human CD68 and their role as macrophage receptors for oxidized low density lipoprotein. Proc Natl Acad Sci U S A. 1996;93(25):14833–8.
Article PubMed Central CAS PubMed Google Scholar
Stewart TP, Kim HY, Saxton AM, Kim JH. Genetic and genomic analysis of hyperlipidemia, obesity and diabetes using (C57BL/6 J x TALLYHO/JngJ) F2 mice. BMC Genomics. 2010;11:713. doi:10.1186/1471-2164-11-713.
Article PubMed Central CAS PubMed Google Scholar
Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, GuhaThakurta D, et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005;37(7):710–7. doi:10.1038/ng1589.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

The project is supported by a grant (Nr. 0603-00457B) from the Danish Council for Strategic Research (BioChild Project: www.biochild.ku.dk) and from the EU-FP7 Marie Curie Actions – Career Integration Grant (CIG-293511), both granted to HNK, from a PhD stipend awarded to LJAK from the University of Copenhagen, and from the Danish Ministry of Science and Technology (the “UNIK Project for Food Fitness and Pharma for Health”) to MF.

Author information

Authors and Affiliations

Department of Veterinary Clinical and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Grønnegårdsvej 7, 1870, Frederiksberg C, Denmark
Lisette J. A. Kogelman, Susanna Cirera, Merete Fredholm & Haja N. Kadarmideen
Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Daria V. Zhernakova & Lude Franke
Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
Harm-Jan Westra
Partners Center for Personalized Genetic Medicine, Boston, MA, USA
Harm-Jan Westra
Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Harm-Jan Westra

Authors

Lisette J. A. Kogelman
View author publications
You can also search for this author in PubMed Google Scholar
Daria V. Zhernakova
View author publications
You can also search for this author in PubMed Google Scholar
Harm-Jan Westra
View author publications
You can also search for this author in PubMed Google Scholar
Susanna Cirera
View author publications
You can also search for this author in PubMed Google Scholar
Merete Fredholm
View author publications
You can also search for this author in PubMed Google Scholar
Lude Franke
View author publications
You can also search for this author in PubMed Google Scholar
Haja N. Kadarmideen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haja N. Kadarmideen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

HNK was the overall project leader who conceived this systems genetics study and supervised LJAK in the genetics, bioinformatics, and systems biology analyses. LJAK analyzed all the data and wrote the first draft of the manuscript. MF established the pig population used in the study, and collected and provided the phenotypic measurements and adipose tissue for RNA sequencing. SC performed the RNA isolation and quality control of RNA samples before RNA sequencing. The initial eQTL mapping was carried out at the University Medical Centre Groningen (The Netherlands) where DVZ and HJW helped LJAK with the analysis, under the supervision of LF. Later eQTL analyses were extended to include gene networks at the University of Copenhagen by LJAK under the supervision of HNK. All authors wrote, read, and approved the final version of the manuscript.

Additional files

Additional file 1:

Differentially expressed genes between the different obesity levels. (XLSX 59 kb)

Additional file 2:

Detected cis -eQTLs and trans -eQTLs in the complete dataset. (XLSX 90 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Kogelman, L.J., Zhernakova, D.V., Westra, HJ. et al. An integrative systems genetics approach reveals potential causal genes and pathways related to obesity. Genome Med 7, 105 (2015). https://doi.org/10.1186/s13073-015-0229-0

Download citation

Received: 27 May 2015
Accepted: 05 October 2015
Published: 20 October 2015
DOI: https://doi.org/10.1186/s13073-015-0229-0

An integrative systems genetics approach reveals potential causal genes and pathways related to obesity