Quantitative high-throughput metabolomics: a new era in epidemiology and genetics
© BioMed Central Ltd 2012
Published: 30 April 2012
Skip to main content
© BioMed Central Ltd 2012
Published: 30 April 2012
Metabolites in body fluids reflect multiple biochemical processes and pathways relevant to health and disease. Comprehensive approaches to gain insights into metabolic variation and diseases, such as metabolic phenotyping, have become increasingly popular over recent years [1–3]. These developments have been driven by mass spectrometry (MS) and proton nuclear magnetic resonance (NMR) spectroscopy as the two key experimental technologies. On the basis of findings from multiple disciplines, it has been envisaged that metabolic phenotyping will eventually lead to holistic risk assessment for various diseases .
Techniques that can quantify large numbers of metabolites, representing multiple metabolic pathways in systemic metabolism, are particularly relevant for the risk assessment of metabolic disorders, such as diabetes and vascular diseases. Currently, NMR-based applications are the only approaches that can offer fully automated and highly reproducible high-throughput experimentation in a very cost-effective manner [6, 7]. Although the variety of molecules measurable by MS cannot be matched by NMR, the per-sample costs using MS still tend to be high, and the quantitative throughput is limited. Recent MS applications, however, show appealing progress in both throughput and molecular identifications , suggesting that, in the near future, MS and NMR will most likely be used as complementary technologies in large-scale epidemiology. Combined with genome-wide and gene expression data at the population level, comprehensive metabolic information has started to trigger detailed systems-level findings [8, 9]. This novel line of multiomics research is anticipated to grow rapidly and to allow a more thorough molecular understanding of biochemical pathways and disease pathologies. Yet metabolomics will be truly useful in epidemiology or in genetic studies only if quantitative data on specific, identified metabolites are available.
Broadly speaking, epidemiological research and genome-wide association studies (GWASs) aim to discover associations in order to generate biological hypotheses. Conventional thinking holds that the number of people is the primary adjustable variable for increasing the statistical power to detect variants of a given effect in a GWAS. However, effect sizes depend on phenotypic definitions and will get stronger as one moves closer to the biochemical source. This implies that 'missing heritability' is not simply a reflection of what cannot be found by a common genetic variant association; it relates fundamentally to the biological and molecular rationale of the trait. We have recently demonstrated this in a GWAS on (only) 8,330 Finnish individuals for a range of 216 serum metabolic measures quantified by our NMR metabolomics platform . We identified 31 loci associated with one or more metabolic measures at a genome-wide significance level, including seven newly identified loci for low-molecular-weight metabolites and four for serum lipoprotein and lipid measures. Using the same Finnish individuals, we also performed an association analysis of the 95 genetic loci known to affect serum cholesterol and triglyceride levels . Our analysis included comprehensive data on lipids and lipoprotein subclasses, obtained via the NMR metabolomics platform, and four enzymatic lipid traits. For 30 of the 95 loci, we identified new metabolic or genetic associations. In the majority of the loci, the strongest association was to a more specific metabolite measure than the total lipids measured enzymatically. Interestingly, in four loci, the smallest high-density lipoprotein (HDL) measures showed effects opposite to the larger ones, a finding that indicates distinct metabolic characteristics for small and large HDL particles as also previously indicated by the gene co-expression findings in circulating leukocytes . Thus, the findings feature considerable diversity in association patterns for the loci originally identified through associations with enzymatic total lipid measures, and they reveal association profiles of far greater resolution than those from routine clinical lipid measures . Therefore, not unexpectedly, metabolic measures of pathway specificity (such as HDL subclasses) can provide far better insights into biological processes than common clinical measures representing merely a sum of multiple biochemical components (such as HDL cholesterol).
Increased etiological understanding and good biomarkers for disease prediction and prevention could facilitate clinical progress and translational medicine. However, in typical epidemiological studies, the findings, although statistically significant at a population level, reflect only weak relationships between metabolism and demographic or clinical measures, and therefore do not provide a sound basis for individual prediction models. Metabolic measures close to the underlying molecular pathways are needed to increase the accuracy of such modeling. Metabolomics approaches can intrinsically provide holistic molecular perspectives and thereby lead to better representations of disease progression, especially if we stop using univariate cut-offs for diagnostics and start handling disease risk as multivariate continuous dimensions. Combined with new bioinformatics schemes that include biological justification, comprehensive metabolic phenotyping has the potential to provide globally useful solicitous societal solutions with better individual well-being together with more efficiently spent health budgets.
It is clearly understood that typical clinical outcomes of metabolic diseases, like infarctions in coronary heart disease, occur as a result of life-long effects of multiple molecular pathways. If characteristic combinations of these pathways exist, and if they are distinct in relation to metabolic disorders, a pathway-specific identification and risk assessment of individuals might be possible at the population level (Figure 1). The potential translation from the current 'one model for all situations' to a new clinical practice incorporating pathways and metabolic phenotypes requires better understanding of the etiologies of metabolic diseases and their interplay. This invokes the need for quantitative metabolomics for the masses.
We acknowledge financial support from the Academy of Finland, the Responding to Public Health Challenges Research Program of the Academy of Finland, the Finnish Foundation for Cardiovascular Research, the Jenny and Antti Wihuri Foundation, the Finnish Diabetes Research Foundation, and the Strategic Research Funding from the University of Oulu.