Identification and MS-assisted interpretation of genetically influenced NMR signals in human plasma
© Raffler et al.; licensee BioMed Central Ltd. 2013
Received: 30 November 2012
Accepted: 15 February 2013
Published: 15 February 2013
Nuclear magnetic resonance spectroscopy (NMR) provides robust readouts of many metabolic parameters in one experiment. However, identification of clinically relevant markers in 1H NMR spectra is a major challenge. Association of NMR-derived quantities with genetic variants can uncover biologically relevant metabolic traits. Using NMR data of plasma samples from 1,757 individuals from the KORA study together with 655,658 genetic variants, we show that ratios between NMR intensities at two chemical shift positions can provide informative and robust biomarkers. We report seven loci of genetic association with NMR-derived traits (APOA1, CETP, CPS1, GCKR, FADS1, LIPC, PYROXD2) and characterize these traits biochemically using mass spectrometry. These ratios may now be used in clinical studies.
Identification of the genetic and environmental determinants of human metabolism is key to the understanding of complex disorders. Metabolic profiling of blood samples from cohort studies is a major tool in the discovery of new disease relevant biomarkers and the metabolic individuality in the general population. Two major techniques are generally used: mass spectrometry (MS) and nuclear magnetic resonance spectroscopy (NMR). At the basis, both platforms are complementary in many regards. For instance, MS is more sensitive and allows obtaining more detailed information while using small sample volume, but is also quite sensitive to changing experimental conditions, and sample preparation is more demanding. NMR-based measurements, on the other hand, require higher sample volumes and the interpretation of the raw spectra is not always straightforward. Still, the strongest advantages of NMR are simplicity of sample preparation and excellent reproducibility of quantitative metabolic readouts. It has previously been shown that the stability of NMR measurements is to some extend independent of the NMR platforms and laboratory practices [1, 2]. Therefore, NMR spectrometry is predestined for routine clinical applications as it generates datasets that are comparable between clinics and over time.
Previously, we and others have shown the potential of MS-based and NMR-based metabolomics in discovery studies such as the KORA study [3–7]. Using a genome-wide association approach with metabolite concentrations, so-called 'genetically influenced metabotypes' (GIM) have been discovered. These GIMs, which define the genetic basis of human metabolic individuality, have, in many cases, been linked to clinically relevant endpoints. For instance, polymorphisms in the fatty acid desaturase 1 (FADS1) locus are associated with Crohn's disease  and are furthermore suspected to play a role in cardiovascular disorders through association with cholesterol and triglycerides . As further examples, the glucokinase regulator (GCKR) gene is a risk locus for several diabetes-relevant traits , and genetic variants at the carbamoyl-phosphatase 1 (CPS1) locus are associated with risk factors for chronic kidney disease .
Up to now, GWAS using metabolic traits mainly focused on metabolites that were identified by the annotation of MS and NMR spectra using reference spectra in existing databases.
Here, we address the question of whether known clinically relevant GIMs can be derived from raw NMR spectra in blood plasma without prior annotation. We take a hypothesis-free approach to identify genetically influenced NMR features and investigate whether specific ratios between NMR intensities at two chemical shifts can provide informative and robust biomarkers. Finally, we interpret our findings using correlations to MS determined metabolite concentrations.
With the KORA F4 study cohort, we have unique access to a large number of samples of the general population, which are metabolically deeply characterized both using MS and NMR methods. In previous studies based on these data, we showed that using ratios between metabolite concentrations can strengthen genotype-metabotype associations. In such cases, the ratios often reflect educt-product pairs of enzymatic reactions . Based on this idea, we here conduct a genome-wide association study (GWAS) on the binned NMR spectra, in combination with testing ratios between pairs of intensity values at different chemical shift positions. We then confront our results with findings from previous GWAS with MS based metabolomics [4, 5] and NMR derived lipid subclasses . Finally, we interpret the results in the light of the existing MS determined metabolite concentrations and data originating from clinical biochemistry measurements.
The KORA study is an independent population-based survey from the general population living in the region of Augsburg, Germany. The KORA S4 study was conducted in 1999-2001 and comprises a total of 4,261 participants . Between 2006 and 2008, a total of 3,080 subjects participated in a follow-up examination, KORA F4, which is the basis for the results presented here. All participants gave their signed informed consent and the local ethics committee approved the studies.
Blood samples for metabolic analysis and DNA extraction were collected as part of the KORA F4 follow-up study. To avoid variation due to circadian rhythm, blood was drawn in the morning between 08:00 and 10:30 after overnight fasting. A part of the blood was drawn into serum gel tubes, gently inverted twice, and then allowed to rest for 30 min at room temperature (18-25°C) to obtain complete coagulation. The material was then centrifuged for 10 min (2,750 g at 15°C). Serum was divided into aliquots and kept for a maximum of 6 h at 4°C, after which it was frozen at -80°C until final analysis. Another part of the blood was drawn into ethylene diaminetetraacetic acid (EDTA) tubes, gently inverted twice, and left on the Sarstedt Universal mixer <5 min to avoid mechanical hemolysis, followed by centrifugation at 15°C for 10 min at 2,750 g. Thereafter, plasma was separated, divided into 200 mL aliquots and kept at 4°C, after which it was deep-frozen to -80°C. Within 2 weeks, plasma was stored in the gaseous phase of liquid nitrogen at -196°C.
For all individuals profiled from the KORA study, genome-wide single nucleotide polymorphism (SNP) data were already available. These data have been used and described extensively in the past in the context of several GWAS (for example, [4, 5]). Therefore, we summarize only the essential details here. For genotyping, 1,814 randomly selected participants of KORA F4 were included. These samples were genotyped using the Affymetrix Human SNP Array 6.0 (sample call rate >93%). Genotypes were determined using the Birdseed2 clustering algorithm. For quality assurance, the criteria of SNP call rate >95%, minor allele frequency >1%, and P(Hardy-Weinberg) >10-6 were applied as filters. In total, 655,658 autosomal SNPs satisfied these criteria.
Metabolic analyses were conducted using clinical biochemistry methods, two distinct MS-based platforms (targeted and non-targeted), and NMR spectrometry (lipid classes and binned spectral data). For the joint analysis, metabolomics and genotype data were available for a total of 1,757 individuals. All metabolite measurements have been reported before. We therefore only summarize here the points that are essential for the present study.
The following serum lipids were measured on fresh samples using the Dimension RxL (Dade Behring). TC was determined by cholesterol esterase method (CHOL Flex, Dade-Behring, CHOD-PAP method), HDL-C using the AHDL Flex (Dade-Behring, CHOD-PAP method after selective release of HDL-C), LDL-C using the ALDL Flex (Dade Behring, CHOD-PAP method after colorless usage of all non-LDL-C), and TG was measured using a TGL Flex (Dade Behring, enzymatic colorimetric test, GPO-PAP method).
The Biocrates AbsoluteIDQ p150 kit was used for absolute quantification of a defined set of serum metabolites. Sample analyses were done on an API 4000 Q TRAP LC/MS/MS system (Applied Biosystems) equipped with a Schimadzu Prominence LC20AD pump and a SIL-20AC autosampler. The complete analytical process was performed using the MetIQ software package, which is an integral part of the AbsoluteIDQ kit. In total, we detected 163 different metabolites. The metabolomics dataset contains 14 amino acids, hexose (H1), free carnitine (C0), 40 acylcarnitines (Cx:y), hydroxylacylcarnitines (C(OH)x:y), and dicarboxylacylcarnitines (Cx:y-DC), 15 sphingomyelins (SM Cx:y) and N-hydroxylacyloylsphingosylphosphocholine (SM(OH) Cx:y), 77 phosphatidylcholines (PC, aa = diacyl, ae = acyl-alkyl) and 15 lyso-phosphatidylcholines. Lipid side chain composition is abbreviated as Cx:y, where x denotes the number of carbons in the side chain and y the number of double bonds. For further details on this dataset and the coefficients of variation (CVs) of metabolite quantification see .
Metabolon, a commercial supplier of metabolic analyses, developed a platform that integrates the chemical analysis, including identification and relative quantification, data-reduction, and quality-assurance components of the process. The analytical platform incorporates two separate ultrahigh-performance liquid chromatography/tandem mass spectrometry (UHPLC/MS/MS2) injections and one gas chromatography/mass spectrometry (GC/MS) injection per sample. The UHPLC injections were optimized for basic and acidic species. The LC/MS portion of the platform was based on a Waters ACQUITY UPLC and a Thermo-Finnigan LTQ mass spectrometer, which consisted of an electrospray ionization (ESI) source and a linear ion-trap (LIT) mass analyzer. For GC/MS, the samples were analyzed on a Thermo-Finnigan Trace DSQ fast-scanning single-quadrupole mass spectrometer using electron impact ionization. A total of 295 serum metabolites were measured, spanning several relevant classes (amino acids, acylcarnitines, sphingomyelins, glycerophospholipids, carbohydrates, vitamins, lipids, nucleotides, peptides, xenobiotics, and steroids). The detection of the entire panel was carried out with 24 min of instrument analysis time (two injections at 12-min each), while maintaining low median process variability (<12% across all compounds). The resulting MS/MS2 data were searched against a standard library generated by Metabolon that included retention time, molecular mass to charge ratio (m/z), preferred adducts and in-source fragments as well as their associated MS/MS spectra for all molecules in the library. The library allowed for the identification of the experimentally detected molecules on the basis of a multiparameter match without the need for additional analyses. For further details on this dataset and the coefficients of variation (CVs) of the metabolite measurements see .
NMR-derived lipid classes
NMR spectra measurements were carried out at numares (formerly LipoFIT), Regensburg. The blood plasma spectra were recorded on a Bruker 600 MHz Avance IIplus spectrometer. The spectra were phased and baseline corrected. Since no reference compound was added, all spectra were horizontally aligned to the prominent lactate signals at 1.36 and 1.37 ppm. Based on a proprietary approach that uses complex deconvolution algorithms on the spectral regions between 0.6 and 1.5 ppm, a set of 15 lipoprotein subfractions was derived from the spectra. These fractions correspond to HDL (small = L1, medium = L2, large = L3, very large = L4), LDL (very small = L5, small = L6, medium = L7, large = L8, very large = L9), IDL (L10), VLDL (small = L11, large = L12), remnants (L13), and chylomicrons (small = L14, large = L15). For further details on this dataset see .
NMR binned data
For the purpose of this study, the spectra were limited to a range from 0 to 9 ppm and divided into 10,000 bins of equal width (0.001 ppm). Spectral binning is a standard procedure in NMR-based metabolomics studies that reduces the data complexity and compensates for slight jitter of the signals' chemical shifts . The intensity values were log10-transformed prior to analysis. To diminish the effect of outliers, for each bin individual intensity values more than three times the standard deviation away from the mean were excluded. Also, the spectral region affected by the water peak (δ = 4.6-5 ppm) was excluded from further analysis.
To test for associations between genetic polymorphisms and individual NMR signal bins, we created age- and gender-adjusted linear additive models using PLINK (Version 1.07) . Testing all possible ratios between NMR signals all over the spectrum would not be feasible due to the huge amount of computational time that would be needed. Therefore, we performed a simple feature selection. In most cases SNPs do not associate with only one NMR signal bin but with a number of adjacent bins. Thus, we performed a search for local minima on the pseudo-spectra (chemical shift vs. strength of association) resulting from the first GWAS run to pick the positions on the spectra which show the strongest associations. We then used the 500 best-scoring NMR signal positions to compute pair wise ratios (124,750 in total). As before, we used age- and gender-adjusted linear additive models in PLINK to test for associations.
We applied a conservative Bonferroni correction to control for false-positive error rates resulting from multiple testing. We corrected for tests on 655,658 SNPs and 133,350 NMR features at a nominal significance level of 5%, thus obtaining an adjusted P value of 0.05/655,658/133,350 = 5.72 × 10-13.
Spearman correlations between biochemically and MS determined metabolite concentrations and NMR signal intensities were calculated using the statistical analysis system R (Version 2.15.1). To avoid false-positive associations due to small sample sizes, only metabolic traits with at least 300 non-missing values were included. Furthermore, metabolite concentrations that were more than three times the standard deviation away from the mean were excluded.
Seven genetic associations identified
We adopted a two-stage approach to detect associations between genetic variants and NMR signals. In the first step, we performed a GWAS using the NMR intensity readouts of the 1H-NMR spectra from 1,757 plasma samples. To this end, we binned the spectral region ranging from 0 to 9 ppm at a high resolution of 0.001 ppm. The region surrounding the water peak (δ = 4.6-5 ppm) was discarded. NMR intensities were log10-scaled and extreme outliers were removed (see Methods). Usually, NMR-based GWAS either use a selection of chemical shifts or perform spectral annotation to reduce the NMR data to the underlying metabolite concentrations (for example, [6, 7, 17]). In contrast, our approach uses the signal intensities at almost all chemical shift positions of the NMR spectra.
In the second step, we selected the 500 bins in the NMR spectrum that exhibited the strongest signal of association. In cases where neighboring bins all displayed associations of comparable strength to the same single nucleotide polymorphism (SNP), the bin with the lowest P value was selected. In a second GWAS, we then tested all possible ratios between the intensities at these 500 chemical shift positions for associations with all genetic variants. To our knowledge, this is the first GWAS that uses a ratio-based, hypothesis-free approach with raw NMR spectral data as a phenotype. For both GWAS, age and gender were used as covariates. In the subsequent analysis, we considered only robust associations of frequent SNPs with minor allele frequencies (MAF) >10%.
Genetic associations with NMR intensities and ratios between NMR intensities.
Chemical shifts for ratios (ppm)
P value ratios
Overlap with NMR-derived lipid subclasses
Comparison with association data from previous studies on the same specimen.
Mass spectrometry Non-targeted, semi-quantitative
NMR Lipid subclasses
NMR Ratios between chemical shifts(this study)
Pc ae C34:2/Pc aa C32:2
3.286 ppm/1.370 ppm
rs1260326, LD r2 = 0.93
rs1260326, LD r2 = 0.93
P = 3.8x10-8
P = 4.9x10-32
P = 3.7x10-6
P = 2.8x10-15
Glycine/PC aa C38:2
3.599 ppm/2.475 ppm
(locus not reported)
P = 1.9x10-30
P = 9.8x10-21
P = 1.8x10-19
2.757 ppm/2.755 ppm
(locus not reported)
(locus not reported)
P = 0.00021
P = 2.9x10-94
PC aa C36:3/PC aa C36:4
2.801 ppm/2.017 ppm
rs174546, LD r2 = 1.0
P = 6.5x10-179
P = 1.2x10-80
P = 1.4x10-5
P = 1.1x10-94
PC aa C36:2/PC aa C38:1
4.162 ppm/4.082 ppm
rs964184, LD r2 = 0.61
rs964184, LD r2 = 0.61
P = 1.8x10-10
P = 4.3x10-7
P = 4.8x10-12
P = 1.8x10-14
1.068 ppm/1.029 ppm
rs1532085, LD r2 = 0.55
P = 9.7x10-8
P = 8.7x10-7
P = 5.3x10-11
P = 3.6x10-21
2.211 ppm/2.011 ppm
(locus not reported)
rs3764261, LD r2 = 1.0
P = 0.00039
P = 3.6x10-7
P = 1.1x10-18
Strong correlations of NMR signals to MS-based and further metabolite concentrations
In the case of correlation with lipid-related parameters, the correlation spectra generally show wider areas of the NMR spectrum corresponding to these metabolites (for example, PCs in Figure 2, triglyceride levels in Figure 3). In contrast, correlations with non-lipid metabolites are visible as individual, sharp peaks (for example, glycine in Figure 1 and glucose in Additional file 2). These peaks correlate, among others, with the concentrations of glucose, lactate, proline, and glycine with squared Spearman correlation coefficients (rs2) up to 0.61. In most cases, we can verify the validity of the correlations using experimental spectra of pure compounds from HMDB . For glycine, we conducted an additional spiking experiment to confirm that the NMR signal at δ = 3.599 ppm indeed is driven by glycine. However, since many metabolites are by nature interlinked in metabolic networks, some signal correlations do not reflect just a single compound but a mixture of co-regulated (and therefore auto-correlated) metabolites. Also, due to overlapping signals in NMR spectra, there may be multiple, non-related metabolites showing correlations at the same chemical shift position.
Additional file 3, Table S1 lists all metabolites that correlate with NMR signals at rs2 = 0.20 (or above) and the chemical shift where the correlation coefficient is highest. The corresponding correlation spectra are provided in Additional file 2.
Spearmancorrelations between NMR intensities and metabolic traits determined by MS and clinical biochemistry.
Chemical shift (ppm)
Chemical shifts for ratios (ppm)
Metabolite or Metabolite ratios
PC aa C38:5
PC aa C38:5/
PC aa C36:3
PC aa C36:1
In this study, we perform a GWAS with binned NMR spectra and ratios between pairs of chemical shifts in a hypothesis-free approach without prior annotation of NMR features. In total, we identify seven genetic loci. Four of these associations (LIPC, PYROXD2, GCKR, APOA1) only reach a genome-wide significance level when ratios between the signal intensities at two different chemical shift positions are tested. Five of the seven loci (LIPC, CETP, FADS1, GCKR, APOA1) are also known lipid risk loci, and all have previously been identified in GWAS with metabolic traits. There is thus no doubt about the functional role of these genetic loci in inducing a metabolic phenotype. Moreover, correlating NMR features to known metabolic phenotypes determined on other platforms can help in the interpretation of the observed genetic associations for various loci and the characterization of the associated NMR trait, as shown in this study for the CPS1, FADS1, and GCKR loci.
Given the fact that these loci have already been largely studied for their interest from a biomedical point of view, we focus here on aspects that highlight NMR-specific features of the associations. The features presented in this work may now be used in future studies with clinical disease endpoints or may potentially be established as biomarkers. In the following, we focus on four of the loci, each of them highlighting a different situation.
Singular NMR intensities (peaks) correlate with MS-based glycine measurements and provide comparable genetic associations (CPS1 locus)
The carbamoyl-phosphate synthase 1 (CPS1) controls the first step in the urea cycle. Klaus et al. report two mutations of the CPS1 gene that contribute to the onset of CPS1 deficiency, an inborn error of metabolism that causes hyperammonemia . In 2010, Illig et al. reported an association of SNP rs2216405 with glycine concentrations at a nominal P value of 2.59×10-26 . Glycine is metabolically related to carbamoyl phosphate, which in turn is the product of CPS1. Thus, Illig et al. presented a genetically influenced metabotype that might represent a mild form of CPS1 deficiency.
Here, we find an association of the same SNP with the NMR signals at δ = 3.599 ppm (P = 4.46×10-14). The correlation between MS determined glycine and the NMR signals at that chemical shift is rs2 = 0.22. Although this correlation is modest, the correlation spectrum for glycine shows a distinct peak at δ = 3.599 ppm (Figure 1). A reference spectrum for glycine taken from HMDB  shows a single peak at δ = 3.54 ppm. While the spectra used in this study were referenced to lactate and the sample pH was 7.4, the HMDB spectrum was referenced to 4,4-dimethyl-4-silapentane-1-sulfonic acid (DSS) and the sample pH was 7.0. Thus, the 0.06 ppm distance between the putative glycine peak in this study and the HMDB glycine peak can be explained by different experimental conditions. In a separate spiking experiment comparing two spectra with and without addition of glycine, we confirmed that the signal at δ = 3.599 ppm indeed corresponds with glycine.
Thus, in this case we find that information obtained from a GWAS with NMR data provides comparable results to what was obtained with MS, albeit the strength of the association observed with NMR data is weaker in this case. The NMR chemical shift δ = 3.599 ppm could potentially be used as a marker for mild forms of perturbations in the ammonia metabolism caused by genetic variations in the CPS1 locus.
Associations with ratios between NMR intensities indicate spectral regions which are representative of triple and quadruple fatty acid desaturation (FADS1 locus)
The fatty-acid desaturase 1 (FADS1) gene product catalyzes the desaturation reaction of 8,11,14-eicosatrienoyl-CoA to arachidonoyl-CoA (C20:3 → C20:4). Genetic variants in this locus have been linked to Crohn's disease  and risk factors for cardiovascular disorders, namely cholesterol and triglyceride levels . Recently, two independent studies showed that dietary intake of long-chain polyunsaturated fatty acids (for example, C20:3 and C20:4) modulates the association between genetic variation in FADS1 and serum lipid levels, and thereby potentially also modifies the risk of cardiovascular disease [20, 21]. In their previous GWAS, Gieger et al.  and Illig et al.  identified strong associations of SNP rs174547 (intronic region of FADS1) with a number of glycerophospholipids, many containing lipid side chains with C20:3 and C20:4 polyunsaturated omega-3 and omega-6 fatty acids (PUFAs). The authors observed an exceptionally large increase in the strength of the association when ratios between phospholipids containing PUFAs with <4 double bonds and ≥4 double bonds were testedin pairs . This observation can be explained by the fact that ratios between substrate: product pairs approximate the underlying enzymatic reaction rate and also because ratios between related metabolites reduce the overall variance that is observed between individuals with different levels of overall blood PUFA concentrations .
In our data, the correlation spectra for glycerophospholipid levels show distinct differences depending on whether the degree of saturation of the side chains is below four double bonds (precursor of substrates of FADS1 enzymatic reaction) or equal or above four double bonds (products downstream of FADS1) (Figure 2). When testing binned NMR intensities for association, SNP rs174547 displays associations at a wider range of chemical shifts, with the strongest signal at δ = 2.801 ppm (P = 3.96×10-35). Interestingly, when using ratios between the NMR intensities at δ = 2.801 ppm and 2.017 ppm, the strength of association increases by nearly 60 orders of magnitude (P = 1.1×10-94), similar to the MS-based case. The NMR ratio δ = 2.801 ppm/δ = 2.017 ppm is therefore a likely proxy for the ratio between three- and four-fold desaturated long chain fatty acids. As a metabolic marker, this ratio might be a readout for the efficacy of the FADS1 enzymatic reaction in the context of dietary intake of PUFAs.
A ratio between NMR intensities in the triglyceride and in the glucose range of the spectrum constitutes an integrated pleiotropic diabetes risk marker (GCKR locus)
In our study, SNP rs780094 (intronic region of GCKR) associates with NMR intensities at δ = 1.370 ppm. At this spectral position, we observe a high correlation between NMR signals and TG levels (rs2 = 0.61) (Figure 3). However, this association is not of genome-wide significance (P = 1.2×10-10). When testing ratios for association, the strength of association increases by five orders of magnitude, with a P value of 2.8×10-15 for the association of SNP rs780094 to the NMR ratio δ = 1.370 ppm/δ = 3.286 ppm. The correlation spectrum with glucose shows a distinct signal at δ = 3.286 ppm (rs2 = 0.28). This chemical shift is indeed located within a known glucose-related spectral region. Furthermore, this NMR ratio correlates with the triglyceride/glucose ratio (rs2 = 0.42). Interestingly, GCKR is a diabetes risk locus that inversely modulates triglyceride and fasting glucose levels [4, 23–26].
Thus, the combination of these two chemical shifts likely provides a combined measure of two main diabetes-risk readouts that have been independently associated with the GCKR locus before. We suggest that this ratio therefore may constitute an integrated biomarker for the pleiotropic biological processes related to perturbations in the GCKR pathway.
Ratios between neighboring NMR intensities act as a local baseline correction and strengthen the genetic association (PYROXD2 locus)
PYROXD2 is a probable pyridine nucleotide-disulphide oxidoreductase gene. Nicholson et al. recently found this locus to be associated with dimethylamine concentrations in plasma, also using NMR spectrometry . At the time when our study was conducted, metabolomics information was only accessible using NMR methods. Meanwhile, an association with an MS determined, biochemically non-identified metabolite has been found, thus replicating this GIM on a different metabolomics platform . However, a link of this metabotype to a phenotype of clinical relevance has not been reported so far.
Since PYROXD2 is a relatively uncharacterized genetic locus, combining the NMR ratio presented here with findings from other studies and platforms could be used to further investigate its biological function.
In this study, we identify seven ratios of NMR signal intensities at different chemical shift positions that associate with genetic loci at a genome-wide significance level. Compared to the genetic associations to signal intensities at individual chemical shift positions, the strength of association increased in the case of CEPT, GCKR, CPS1, and APOA1 by at least three orders of magnitude. In the case of FADS1 and PYROXD2 it even increased by 60 and 80 orders of magnitude (Table 1). There are different reasons for this increase in the strength of association: In the case of the FADS1 and GCKR loci, the signal ratios most likely represent the ratio between two distinct metabolites or metabolite classes, while in the case of PYROXD2, ratios may compensate for overlapping NMR signals originating from other metabolites, thereby acting as a local baseline correction.
More generally, we have shown that the use of signal ratios is a simple method to derive genetically validated bi-variate measures or biomarkers from NMR spectra. This approach could potentially be generalized to the multi-variate case, again using genetic association as a criterion for feature selection.
y: Hydroxylacylcarnitine (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
Cholesterylester transfer protein
Utah Residents with Northern and Western European Ancestry
Carbamoyl-phosphate synthetase 1
Coefficient of variation
y: Acylcarnitine (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
y-DC: Dicarboxylacylcarnitine (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
Ethylene diaminetetraacetic acid
Electro spray ionization
Delta-5 fatty acid desaturase 1
Free induction decay
Genetically influenced metabotype
Genome-wide association study
High-density lipoprotein cholesterol
Human Metabolome Database
Kooperative Gesundheitsforschung in der Region Augsburg (Cooperative Health Research in the Region of Augsburg)
Low-density lipoprotein cholesterol
Linear ion trap
Minor allele frequency
National Center for Biotechnology Information
Nuclar magnetic resonance spectroscopy
- PC aa Cx:
y: Diacyl phosphatidylcholine (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
- PC ae Cx:
y: Acyl-alkyl phosphatidylcholine(x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
Parts per million
Polyunsaturated fatty acid
Pyridine nucleotide-disulphide oxidoreductase domain 2
- SM(OH) Cx:
y: N-hydroxylacyloylsphingosylphosphocholine (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
- SM Cx:
y: Sphingomyelin (x = number of carbons in fatty acid side chain, y = number of double bonds in fatty acid side chain)
Single nucleotide polymorphism
Ultrahigh performance liquid chromatography
Very low-density lipoprotein.
The KORA research platform was initiated and financed by the Helmholtz Center Munich, German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria.
Computing resources were made available by the Leibnitz Supercomputing Centre of the Bavarian Academy of Science and Humanities and the DEISA Extreme Computing Initiative (Project MeMGenA).
AP is supported by the ENGAGE Exchange and Mobility Program (HEALTH-F4-2007-201413). JR is supported by DFG international research training group 'GRK 1563, Regulation and Evolution of Cellular Systems (RECESS)'. WR is supported by BMBF grant 03IS2061B (project Gani_Med). KS is supported by 'Biomedical Research Program' funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation.
- Ward JL, Baker JM, Miller SJ, Deborde C, Maucourt M, Biais B, Rolin D, Moing A, Moco S, Vervoort J, Lommen A, Schafer H, Humpfer E, Beale MH: An inter-laboratory comparison demonstrates that [H]-NMR metabolite fingerprinting is a robust technique for collaborative plant metabolomic data collection. Metabolomics. 2010, 6: 263-273.PubMed CentralView ArticlePubMedGoogle Scholar
- Viant MR, Bearden DW, Bundy JG, Burton IW, Collette TW, Ekman DR, Ezernieks V, Karakach TK, Lin CY, Rochfort S, de Ropp JS, Teng Q, Tjeerdema RS, Walter JA, Wu H: International NMR-based environmental metabolomics intercomparison exercise. Environ Sci Technol. 2009, 43: 219-225.View ArticlePubMedGoogle Scholar
- Gieger C, Geistlinger L, Altmaier E, Hrabe de Angelis M, Kronenberg F, Meitinger T, Mewes HW, Wichmann HE, Weinberger KM, Adamski J, Illig T, Suhre K: Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum. PLoS Genet. 2008, 4: e1000282-PubMed CentralView ArticlePubMedGoogle Scholar
- Illig T, Gieger C, Zhai G, Romisch-Margl W, Wang-Sattler R, Prehn C, Altmaier E, Kastenmuller G, Kato BS, Mewes HW, Meitinger T, de Angelis MH, Kronenberg F, Soranzo N, Wichmann HE, Spector TD, Adamski J, Suhre K: A genome-wide perspective of genetic variation in human metabolism. Nat Genet. 2010, 42: 137-141.PubMed CentralView ArticlePubMedGoogle Scholar
- Suhre K, Shin SY, Petersen AK, Mohney RP, Meredith D, Wagele B, Altmaier E, Deloukas P, Erdmann J, Grundberg E, Hammond CJ, de Angelis MH, Kastenmuller G, Kottgen A, Kronenberg F, Mangino M, Meisinger C, Meitinger T, Mewes HW, Milburn MV, Prehn C, Raffler J, Ried JS, Romisch-Margl W, Samani NJ, Small KS, Wichmann HE, Zhai G, Illig T, Spector TD: Human metabolic individuality in biomedical and pharmaceutical research. Nature. 2011, 477: 54-60.View ArticlePubMedGoogle Scholar
- Nicholson G, Rantalainen M, Li JV, Maher AD, Malmodin D, Ahmadi KR, Faber JH, Barrett A, Min JL, Rayner NW, Toft H, Krestyaninova M, Viksna J, Neogi SG, Dumas ME, Sarkans U, Donnelly P, Illig T, Adamski J, Suhre K, Allen M, Zondervan KT, Spector TD, Nicholson JK, Lindon JC, Baunsgaard D, Holmes E, McCarthy MI, Holmes CC: A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection. PLoS Genet. 2011, 7: e1002270-PubMed CentralView ArticlePubMedGoogle Scholar
- Kettunen J, Tukiainen T, Sarin AP, Ortega-Alonso A, Tikkanen E, Lyytikainen LP, Kangas AJ, Soininen P, Wurtz P, Silander K, Dick DM, Rose RJ, Savolainen MJ, Viikari J, Kahonen M, Lehtimaki T, Pietilainen KH, Inouye M, McCarthy MI, Jula A, Eriksson J, Raitakari OT, Salomaa V, Kaprio J, Jarvelin MR, Peltonen L, Perola M, Freimer NB, Ala-Korpela M, Palotie A, Ripatti S: Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nat Genet. 2012, 44: 269-276.PubMed CentralView ArticlePubMedGoogle Scholar
- Franke A, McGovern DP, Barrett JC, Wang K, Radford-Smith GL, Ahmad T, Lees CW, Balschun T, Lee J, Roberts R, Anderson CA, Bis JC, Bumpstead S, Ellinghaus D, Festen EM, Georges M, Green T, Haritunians T, Jostins L, Latiano A, Mathew CG, Montgomery GW, Prescott NJ, Raychaudhuri S, Rotter JI, Schumm P, Sharma Y, Simms LA, Taylor KD, Whiteman D: Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat Genet. 2010, 42: 1118-1125.PubMed CentralView ArticlePubMedGoogle Scholar
- Kathiresan S, Willer CJ, Peloso GM, Demissie S, Musunuru K, Schadt EE, Kaplan L, Bennett D, Li Y, Tanaka T, Voight BF, Bonnycastle LL, Jackson AU, Crawford G, Surti A, Guiducci C, Burtt NP, Parish S, Clarke R, Zelenika D, Kubalanza KA, Morken MA, Scott LJ, Stringham HM, Galan P, Swift AJ, Kuusisto J, Bergman RN, Sundvall J, Laakso M: Common variants at 30 loci contribute to polygenic dyslipidemia. Nat Genet. 2009, 41: 56-65.PubMed CentralView ArticlePubMedGoogle Scholar
- Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, Jackson AU, Wheeler E, Glazer NL, Bouatia-Naji N, Gloyn AL, Lindgren CM, Magi R, Morris AP, Randall J, Johnson T, Elliott P, Rybin D, Thorleifsson G, Steinthorsdottir V, Henneman P, Grallert H, Dehghan A, Hottenga JJ, Franklin CS, Navarro P, Song K, Goel A, Perry JR, Egan JM, Lajunen T: New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet. 2010, 42: 105-116.PubMed CentralView ArticlePubMedGoogle Scholar
- Kottgen A, Pattaro C, Boger CA, Fuchsberger C, Olden M, Glazer NL, Parsa A, Gao X, Yang Q, Smith AV, O'Connell JR, Li M, Schmidt H, Tanaka T, Isaacs A, Ketkar S, Hwang SJ, Johnson AD, Dehghan A, Teumer A, Pare G, Atkinson EJ, Zeller T, Lohman K, Cornelis MC, Probst-Hensch NM, Kronenberg F, Tonjes A, Hayward C, Aspelund T: New loci associated with kidney function and chronic kidney disease. Nat Genet. 2010, 42: 376-384.PubMed CentralView ArticlePubMedGoogle Scholar
- Petersen AK, Krumsiek J, Wagele B, Theis FJ, Wichmann HE, Gieger C, Suhre K: On the hypothesis-free testing of metabolite ratios in genome-wide and metabolome-wide association studies. BMC Bioinformatics. 2012, 13: 120-PubMed CentralView ArticlePubMedGoogle Scholar
- Petersen AK, Stark K, Musameh MD, Nelson CP, Romisch-Margl W, Kremer W, Raffler J, Krug S, Skurk T, Rist MJ, Daniel H, Hauner H, Adamski J, Tomaszewski M, Doring A, Peters A, Wichmann HE, Kaess BM, Kalbitzer HR, Huber F, Pfahlert V, Samani NJ, Kronenberg F, Dieplinger H, Illig T, Hengstenberg C, Suhre K, Gieger C, Kastenmuller G: Genetic associations with lipoprotein subfractions provide information on their biological nature. Hum Mol Genet. 2012, 21: 1433-1443.View ArticlePubMedGoogle Scholar
- Wichmann HE, Gieger C, Illig T: KORA-gen--resource for population genetics, controls and a broad spectrum of disease phenotypes. Gesundheitswesen. 2005, S26-30. Suppl 1
- Smolinska A, Blanchet L, Buydens LM, Wijmenga SS: NMR and pattern recognition methods in metabolomics: From data acquisition to biomarker discovery: A review. Anal Chim Acta. 2012, 750: 82-97.View ArticlePubMedGoogle Scholar
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81: 559-575.PubMed CentralView ArticlePubMedGoogle Scholar
- Suhre K, Wallaschofski H, Raffler J, Friedrich N, Haring R, Michael K, Wasner C, Krebs A, Kronenberg F, Chang D, Meisinger C, Wichmann HE, Hoffmann W, Volzke H, Volker U, Teumer A, Biffar R, Kocher T, Felix SB, Illig T, Kroemer HK, Gieger C, Romisch-Margl W, Nauck M: A genome-wide association study of metabolic traits in human urine. Nat Genet. 2011, 43: 565-569.View ArticlePubMedGoogle Scholar
- Wishart DS, Knox C, Guo AC, Eisner R, Young N, Gautam B, Hau DD, Psychogios N, Dong E, Bouatra S, Mandal R, Sinelnikov I, Xia J, Jia L, Cruz JA, Lim E, Sobsey CA, Shrivastava S, Huang P, Liu P, Fang L, Peng J, Fradette R, Cheng D, Tzur D, Clements M, Lewis A, De Souza A, Zuniga A, Dawe M: HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 2009, 37: D603-610.PubMed CentralView ArticlePubMedGoogle Scholar
- Klaus V, Vermeulen T, Minassian B, Israelian N, Engel K, Lund AM, Roebrock K, Christensen E, Haberle J: Highly variable clinical phenotype of carbamylphosphate synthetase 1 deficiency in one family: an effect of allelic variation in gene expression?. Clin Genet. 2009, 76: 263-269.View ArticlePubMedGoogle Scholar
- Dumont J, Huybrechts I, Spinneker A, Gottrand F, Grammatikaki E, Bevilacqua N, Vyncke K, Widhalm K, Kafatos A, Molnar D, Labayen I, Gonzalez-Gross M, Amouyel P, Moreno LA, Meirhaeghe A, Dallongeville J: FADS1 genetic variability interacts with dietary alpha-linolenic acid intake to affect serum non-HDL-cholesterol concentrations in European adolescents. J Nutr. 2011, 141: 1247-1253.View ArticlePubMedGoogle Scholar
- Lu Y, Feskens EJ, Dolle ME, Imholz S, Verschuren WM, Muller M, Boer JM: Dietary n-3 and n-6 polyunsaturated fatty acid intake interacts with FADS1 genetic variation to affect total and HDL-cholesterol concentrations in the Doetinchem Cohort Study. Am J Clin Nutr. 2010, 92: 258-265.View ArticlePubMedGoogle Scholar
- van Schaftingen E, Veiga-da-Cunha M, Niculescu L: The regulatory protein of glucokinase. Biochem Soc Trans. 1997, 25: 136-140.View ArticlePubMedGoogle Scholar
- Vaxillaire M, Cavalcanti-Proenca C, Dechaume A, Tichet J, Marre M, Balkau B, Froguel P: The common P446L polymorphism in GCKR inversely modulates fasting glucose and triglyceride levels and reduces type 2 diabetes risk in the DESIR prospective general French population. Diabetes. 2008, 57: 2253-2257.PubMed CentralView ArticlePubMedGoogle Scholar
- Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, Chen H, Roix JJ, Kathiresan S, Hirschhorn JN, Daly MJ, Hughes TE, Groop L, Altshuler D, Almgren P, Florez JC, Meyer J, Ardlie K, Bengtsson Bostrom K, Isomaa B, Lettre G, Lindblad U, Lyon HN, Melander O, Newton-Cheh C, Nilsson P, Orho-Melander M, Rastam L, Speliotes EK, Taskinen MR, Tuomi T: Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science. 2007, 316: 1331-1336.View ArticlePubMedGoogle Scholar
- Horvatovich K, Bokor S, Polgar N, Kisfali P, Hadarits F, Jaromi L, Csongei V, Repasy J, Molnar D, Melegh B: Functional glucokinase regulator gene variants have inverse effects on triglyceride and glucose levels, and decrease the risk of obesity in children. Diabetes Metab. 2011, 37: 432-439.View ArticlePubMedGoogle Scholar
- Kristiansson K, Perola M, Tikkanen E, Kettunen J, Surakka I, Havulinna AS, Stancakova A, Barnes C, Widen E, Kajantie E, Eriksson JG, Viikari J, Kahonen M, Lehtimaki T, Raitakari OT, Hartikainen AL, Ruokonen A, Pouta A, Jula A, Kangas AJ, Soininen P, Ala-Korpela M, Mannisto S, Jousilahti P, Bonnycastle LL, Jarvelin MR, Kuusisto J, Collins FS, Laakso M, Hurles ME: Genome-wide screen for metabolic syndrome susceptibility Loci reveals strong lipid gene contribution but no evidence for common genetic basis for clustering of metabolic syndrome traits. Circ Cardiovasc Genet. 2012, 5: 242-249.PubMed CentralView ArticlePubMedGoogle Scholar
- Krumsiek J, Suhre K, Evans AM, Mitchell MW, Mohney RP, Milburn MV, Wagele B, Romisch-Margl W, Illig T, Adamski J, Gieger C, Theis FJ, Kastenmuller G: Mining the unknown: a systems approach to metabolite identification combining genetic and metabolic information. PLoS Genet. 2012, 8: e1003005-PubMed CentralView ArticlePubMedGoogle Scholar
- Wei SW, Zhang J, Liu LY, Ye T, Gowda GAN, Tayyari F, Raftery D: Ratio analysis nuclear magnetic resonance spectroscopy for selective metabolite identification in complex samples. Analytical Chemistry. 2011, 83: 7616-7623.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.