- Open Access
Strain-level dissection of the contribution of the gut microbiome to human metabolic disease
Genome Medicine volume 8, Article number: 41 (2016)
The gut microbiota has been linked with metabolic diseases in humans, but demonstration of causality remains a challenge. The gut microbiota, as a complex microbial ecosystem, consists of hundreds of individual bacterial species, each of which contains many strains with high genetic diversity. Recent advances in genomic and metabolomic technologies are facilitating strain-level dissection of the contribution of the gut microbiome to metabolic diseases. Interventional studies and correlation analysis between variations in the microbiome and metabolome, captured by longitudinal sampling, can lead to the identification of specific bacterial strains that may contribute to human metabolic diseases via the production of bioactive metabolites. For example, high-quality draft genomes of prevalent gut bacterial strains can be assembled directly from metagenomic datasets using a canopy-based algorithm. Specific metabolites associated with a disease phenotype can be identified by nuclear magnetic resonance-based metabolomics of urine and other samples. Such multi-omics approaches can be employed to identify specific gut bacterial genomes that are not only correlated with detected metabolites but also encode the genes required for producing the precursors of those metabolites in the gut. Here, we argue that if a causative role can be demonstrated in follow-up mechanistic studies—for example, using gnotobiotic models—such functional strains have the potential to become biomarkers for diagnostics and targets for therapeutics.
Gut microbiome—a new paradigm for understanding metabolic diseases
Obesity and related metabolic diseases such as diabetes and cardiovascular disease represent a major public health threat to both developed countries, such as the United States, and rapidly developing countries, such as China and India [1–3]. China, for example, has more than one hundred million diabetic patients and nearly five hundred million people with pre-diabetes . Metabolic diseases alone could overwhelm the public health and medical systems in these countries unless something substantial happens in the prevention and treatment of these diseases in the next decade.
Human beings are superorganisms consisting of not only our own cells but also up to ten times more microbial cells, most of which are bacteria residing in the gut. The gut microbiota consists of hundreds of individual bacterial species, each of which contains many functionally different strains with significant genetic diversity. Studies of the contribution of the gut microbiome to the onset and progression of metabolic diseases, particularly adiposity and insulin resistance, the two hallmark characteristics of various metabolic diseases in their early stages, have resulted in a paradigm shift in understanding the root cause of human metabolic diseases in the last decade or so, and may bring new hope to countries devastated by such diseases . However, most of the evidence so far is associative in nature. Mechanistic studies, which are needed for demonstration of causality, are mostly attempted at a community level or taxon level higher than species, such as genus, family or even phylum . Bacterial species or other higher taxa are arbitrarily defined taxonomic units for clustering and categorizing strains, each of which consists of genetically identical cell populations. Since bacterial strains, equivalent to individual plants and animals, are the genetically defined, basic functional units of the gut ecosystem, dissecting the contribution of the gut microbiome to human metabolic diseases must be carried out at the strain level. Identifying and understanding all relevant strains in the gut microbiota that may have mechanistically contributed positively (detrimentally) or negatively (beneficially) to the onset and progression of metabolic diseases can lead to the discovery of new biomarkers of predictive and diagnostic value, as well as new targets for effective interventions in humans.
We argue that, unless we can identify specific functional strains of the gut microbiome and understand mechanistically how each individually or in combination contributes to the onset and progression of metabolic diseases, the translation of new microbiome findings to clinical practice for diagnosis and therapeutics will be rather limited. We discuss how high-quality draft genomes can be assembled directly from metagenomic datasets to provide strain-level genetic data that can be correlated with disease-relevant variations of metabolites in samples such as urine, as an example of systems-level discovery approaches for identifying specific functional bacterial strains that may play a causative role in human metabolic diseases. These strains can then be isolated into pure culture and confirmed mechanistically as having a causative role in metabolic diseases using gnotobiotic animal models. This approach may help to move the microbiome field from association at the community or high-taxon level towards causality at the strain level. Such genomic- and molecular-level studies can eventually lead to the discovery of biomarkers and drug targets in the gut microbiome for clinical applications.
Role of the gut microbiota in metabolic diseases
Excessive visceral fat deposition is a primary pathological condition underlying many forms of metabolic diseases. A seminal paper in 2004 reported that the gut microbiota might act as an environmental factor for regulating fat storage in the host . Subsequently, the results of several studies pointed to the involvement of the gut microbiota in fat accumulation . Germ-free mice are resistant to high-fat-diet-induced obesity . Lean germ-free mice accumulated 60 % more fat after being colonized with a normal gut microbiota despite a reduction in their food intake after the conventionalization. Transplantation of gut microbiota from obese mice or humans induced significantly higher fat accumulation in recipient mice than transplantation of gut microbiota from lean donors [8, 9]. Removal of gut microbiota by using cocktails of broad-spectrum antibiotics prevented fat accumulation even in genetically obese mice, such as ob/ob mice or Toll-like receptor 5 knockout mice [10, 11]. It was found that gut microbiota may promote fat accumulation by reducing the expression level of genes required for fatty acid oxidation, such as Fiaf (encoding fasting-induced adipose factor) in the gut, and by increasing the activity of genes needed for synthesizing new fat, such as Acc1 (encoding acetyl-CoA carboxylase 1) and Fas (encoding fatty acid synthase) in the liver . In 2015, a study showed that depletion of the gut microbiota by antibiotics or in germ-free mice increased browning of white adipose tissue and reduced obesity in the mice, possibly via eosinophil infiltration, enhanced type 2 cytokine signaling and M2 macrophage polarization . Thus, dysregulation of genes involved in host lipid metabolism may be an important mechanism by which the gut microbiome promotes excessive fat accumulation in obesity.
Insulin resistance, the other hallmark feature of metabolic diseases [13, 14], has been mechanistically linked to a low-grade, systemic, chronic inflammatory condition in mice and humans . The gut microbiota has also been associated with insulin resistance in mice and humans. Germ-free mice are insulin sensitive but can become insulin resistant after being conventionalized with gut microbiota, particularly from obese mice . In obese human volunteers, systemic insulin sensitivity was improved within 6 weeks after receiving gut microbiota transplantation from healthy donors . Thus, an obesity-associated gut microbiota may work as a virulence factor in driving insulin resistance.
Endotoxin, a proinflammatory form of lipopolysaccharide (LPS), was shown to be able to induce inflammation followed by both adiposity and insulin resistance when subcutaneously injected into mice fed on a low-calorie diet for several weeks . This was the first evidence that LPS, a microbial product from the gut microbiota, may be driving inflammation and contributing to fat accumulation and insulin resistance. These results indicated that some endotoxin producers in the gut microbiota may contribute to the proinflammatory condition and progression of insulin resistance in the host. Recent studies suggest a possible role for LPS in fatty liver disease  and obstructive sleep apnea —an indication that inflammation sustained by microbial products such as LPS may drive more forms of metabolic disorders. Thus, compelling evidence from mouse and human studies supports a pivotal role of the gut microbiota in the onset and progression of metabolic diseases. However, it has been a great challenge for the field to identify all relevant members of the gut microbiota that are associated with the development of metabolic diseases, and to demonstrate their causative contribution to pathophysiological changes critical for disease initiation and progression.
When dissecting and demonstrating the causative contribution of relevant members of the gut microbiome to human metabolic diseases, we should follow the logic of Koch’s postulates, which were established for identifying a causative agent in an infectious disease, but adapt them to the polymicrobial nature of the role of the gut microbiome in human chronic diseases. Firstly, we should do microbiome-wide association studies, in which all members of the gut microbiome that are positively or negatively correlated with disease phenotype(s) need to be identified. Secondly, the associated members should be isolated into individual pure cultures or strains. Individual strains or their combinations should be inoculated into germ-free animals to reproduce at least part of the disease phenotype(s). Thirdly, the molecular mechanisms underlying causation should be established, from colonization of the gut to development of the disease endpoints. After fulfilling these rigorous protocols, these strains would be accepted as causatively contributing to human metabolic diseases. They then have the potential to be new biomarkers and drug targets for clinical applications .
High-quality association studies are critical for the successful identification of potential key players of the gut microbiome in metabolic diseases, which can then be followed by rigorous molecular-level mechanistic studies as the ultimate evidence for causality. We argue that association studies at the strain level are pivotal for reducing spurious correlations and identifying “real targets” for mechanistic studies.
Bacterial species and strains in metabolic disease
Bacterial functions are strain-specific
The gut microbial ecosystem consists of bacterial populations as individual members, each of which has genetically identical cells derived from the same parent cell . Any two populations can be distinguished by at least one single nucleotide polymorphism, and they may have different adaptive functions in the ecosystem—for example, a point mutation in a drug resistance gene can make a mutant population survive a new round of antibiotic medication, while the wild-type may have been wiped out . Bacterial populations, which have been isolated in pure culture or detected by partial or complete sequencing of their genomes, are defined as strains . One strain is thus (at least partially) a known population in the gut ecosystem. In bacterial taxonomy, a “species” would contain individual strains, with up to 30 % difference in their genomic homology; that is, two strains in the same named bacterial species can be genetically more different than humans and mice, which have only about 10 % genomic difference . Genomic sequencing of many strains in the same named bacterial species has already revealed this huge genetic microdiversity. In all 17 sequenced strains of Escherichia coli, 2200 genes were conserved. However, pan-genome prediction indicates that E. coli species may contain a reservoir of more than 13,000 genes . Complete sequencing of 34 strains of Lactobacillus paracasei identified about 1800 orthologous genes (OGs) in its core genome, but 4300–4500 OGs in its pan-genome . Ecological functions in the gut microbiome would thus be population-dependent. Any attempts to dissect the contribution of the gut microbiome to human metabolic diseases starting with microbiome-wide association studies must recognize that the disease-relevant functions of the gut microbiota may well be strain-specific.
Potential bias in taxon-based analysis
Different structural patterns of the gut microbiota have been associated with metabolic diseases, such as the ratio between Firmicutes/Bacteroidetes, high gene count versus low gene count, or profiles of specific operational taxonomic units (OTUs) that are associated with progression of a particular disease phenotype [26–32]. Patterns of the gut microbiota associated with obesity and metabolic disorders have been sought at the individual OTU level (roughly at species level) up to phylum level in16S rRNA gene-sequencing-based analysis. However, species in the same taxon from genus up to phylum can show widely diverse relationships with a particular disease phenotype—some may be positively associated, some negatively, and others may not be associated at all [33, 34]. If a function is encoded in the “core genome” of a taxon, all members of that taxon should have that function. If the function is encoded in the pan-genome only, one or a limited number of members would have that function [35, 36]. It is thus a serious concern if we consider all species (OTUs) in a taxon as one group and seek associations at each taxonomic level, before we can be sure that all OTUs in the same taxon encode the same functions. However, we know that even within the same species, there is often high micro-diversity.
Recent developments in metagenomics have started to provide researchers with tools that can dissect the gut microbiome at the strain level [37–40]. For example, a recently developed canopy-based algorithm can be used to assemble high-quality draft genomes of predominant gut bacteria, based on the principle that if two genes are encoded in the same DNA molecule, their abundances across all the samples in which they can both be detected would be highly correlated to each other . Individual non-redundant genes obtained from metagenomic datasets of many fecal samples can be binned into co-abundance gene groups (CAGs) if their abundances are highly correlated with each other. Genes in each CAG are potentially originally encoded by the same DNA molecule. Assembly of high-quality reads mapped to all the genes in the same CAG can generate high-quality draft genomes. This algorithm allowed researchers to get direct access to the genome variations of predominant bacteria in the gut microbiome. Because each genome represents one single population, this means that strain-level, genome-centric analysis is possible with metagenomic datasets. However, as mentioned earlier, any such genome/strain-level studies need to be confirmed by downstream mechanistic studies, ideally with the strain containing the genome in pure culture, to establish a gnotobiotic model of metabolic disease.
Functional species and strains of the gut microbiota in metabolic diseases
In recent years, a number of functional species and strains have been identified in human metabolic diseases. Some of these may induce or aggravate the disease, while others may be protective.
We found one example of an obesity-inducing strain in a human gut opportunistic species, Enterobacter cloacae, which is known to cause bacteremia when translocated into the bloodstream of immune-compromised individuals . In a volunteer with 174.9 kg initial bodyweight, this species was found to comprise nearly 30 % of the total gut bacterial populations. After taking a dietary intervention aimed at modulating the gut microbiota, this species was almost non-detectable in the gut and the volunteer lost more than 50 kg of baseline bodyweight over 23 weeks, along with recovery of all parameters of metabolic syndrome. A strain named B29 was isolated from the volunteer’s baseline fecal sample and was confirmed to be a member of the overgrowing species of E. cloacae. When inoculated into the gut of germ-free C57/B6 mice fed on a high-fat diet, B29 induced fully developed obesity phenotypes, including inflammation, adiposity and insulin resistance. B29 colonization was also shown to be able to reduce the expression level of Fiaf in the ileum and promote the expression of Acc1 and Fas in the liver. B29-colonized mice fed on normal chow or germ-free control mice fed on a high-fat diet did not become obese. Only the combination of a high-fat diet and mono-association of B29 led to elevated endotoxin levels in the serum and systemic inflammation, and local inflammation in the liver and fat pads. This is the first reported example in which a single strain can induce fully developed obesity phenotypes in gnotobiotic mice. This strain was thus identified as an obesity-inducing “pathogen” by following the logic of Koch’s postulates.
Although a member of a bacterial species that can cause infectious diseases , E. cloacae B29 did not induce any notable septic symptoms even when directly injected into the bloodstream of specific-pathogen-free mice . Genomic sequencing of B29 did not lead to the discovery of known virulence genes apart from genes involved in the LPS biosynthetic pathway. B29 is thus a non-infectious strain of this pathogenic species. B29 reached a stunningly high population level in the gut of its morbidly obese human host—more than 30 % of the total gut bacterial populations. This indicates that this strain has the genetic capacity to outcompete other members of the gut microbiota and become the predominant population. Reaching such a high population level would differentiate it from other LPS endotoxin producers in the gut in that it could make a substantial contribution to inflammation and obesity phenotypes.
It is still not clear why this population can reach such a high level without evoking an acute host immune system response. The patient was reported to have had a serious infection at 4 months old and had received heavy antibiotic medication, and started to gain weight after that incidence. One possibility might be that this strain had colonized the host’s gut so early in life that the host’s immune system developed tolerance to its colonization in the gut. Thus, at least three genetically encoded functions might be needed for a gut bacterium to be a causal agent in obesity development: (1) a virulence factor that can induce inflammation—in this case, the best candidate is LPS endotoxin; (2) the capacity to grow to a high population level in the complex gut ecosystem; and (3) the capacity to evade host immune surveillance so that a high population level can not only be reached but also be maintained in the gut ecosystem. However, all these need to be mechanistically tested. The gnotobiotic model, in which B29 alone or in combination with other members of the gut microbiota can colonize the intestine, represents an ideal system for future elucidation of the molecular mechanism of causation, from colonization by particular members of the gut microbiome to the development of a non-communicable disease such as obesity.
Hopefully, the identification of B29 as a potential pathogenic strain for obesity-related disease from the E. cloacae species, which usually induces infectious diseases, will serve as a good example to encourage researchers in the microbiome field to focus on strain-level diversity when their primary interest is to understand not only the association but also the causative functions of gut bacteria in human chronic diseases [5, 42].
Potentially beneficial strains in obesity have also been identified, isolated and validated in animal models. A strain of Akkermansia muciniphila has been shown to have a protective effect against obesity in both humans and mice [44, 45]. A. muciniphila was found to be negatively associated with obesity and type 2 diabetes in rodents and humans. Administration of viable cells of the strain A. muciniphila MucT (ATCCBAA-835) protected high-fat-diet-fed mice from developing metabolic syndrome, possibly via increasing intestinal levels of endocannabinoids that control inflammation, gut barrier integrity and secretion of gut peptides, including the antimicrobial peptide RegIIIγ.
In an association study involving 416 twin pairs, the Christensenellaceae family showed increased abundance in individuals with low body mass index (BMI). After being transplanted to germ-free mice, Christensenella minuta (DSM22607), a strain of the only cultured member of the family Christensenellaceae, reduced weight gain and altered the microbiome of recipient mice. The strain has been reported to produce short-chain fatty acids, but it is not clear whether this function contributes to its protective effect . It is also not clear whether all the members of this family would have this protective function. For that, the genes encoding this beneficial function would need to be present in the core genome of all members of this family .
The discovery of E. cloacae B29 as a potential pathogenic strain for human obesity is not accidental. It built on prior evidence accumulated over many years in the field on LPS, inflammation and obesity in both animal studies and human epidemiological studies . However, such a path to discovery is of limited efficiency. The human microbiome field requires many new forms of technologies for the systematic discovery of most, if not all, the potential key players of the microbiome that might contribute to human chronic diseases.
Gut bacteria contribute to human metabolic phenotypes by producing and delivering bioactive metabolites into the host systemic circulation . Metagenomics can identify specific strains or populations that may have the genetic potential to produce such bioactive substances and to be involved in a disease phenotype. Whether a particular strain actually contributes to the disease needs to be confirmed with functional studies; that is, whether the bioactive metabolites were actually produced by these bacteria and transported into their hosts, and whether these metabolites were indeed responsible for the disease phenotype. Thus, one important strategy is to link a strain or genome with a particular metabolite involved in a disease process. An integrated metagenomics–metabolomics approach may well serve such needs for the field.
Approaches for dissecting the functional contribution of the gut microbiome to metabolic disease
Gut bacteria can produce various bioactive metabolites, which can enter the bloodstream of the host via the enterohepatic circulation or via a partially impaired gut barrier [48, 49]. One third of the small molecules in the bloodstream can be of gut bacterial origin . Some of the bioactive metabolites can be detrimental to host health, such as those with cytotoxicity, genotoxicity or immunotoxicity [51–55]. When these toxic metabolites enter the bloodstream, they can contribute to the onset and progression of many forms of chronic diseases such as autism, cancer and diabetes [17, 56–59]. Notably, as a detoxification mechanism, these toxic metabolites can be further transformed by host liver enzymes into water-soluble derivatives that are excreted in the urine [57, 60]. Thus, one important strategy for identifying the species or strains of the gut microbiota that may be involved in the production of specific toxic metabolites could be to correlate species- or strain-level variations of gut bacteria with variations of metabolites in the urine and in other types of samples (Fig. 1).
Integrating metagenomic and metabolomic approaches
In a proof-of-principle study, we collected urine and fecal samples from a four-generation, seven-member Chinese family over monthly intervals . This time-series approach for the collection of both fecal and urine samples can help to capture intra-individual and inter-individual variations in both gut bacterial populations and urine metabolites to allow their correlation, to determine the functions of specific strains of the gut microbiota. Population changes of predominant bacteria were assessed by DNA fingerprinting and sequencing. Urine metabolites were profiled using 1H nuclear magnetic resonance (NMR) spectroscopy-based metabonomics. Although we could only identify a limited number of predominant bacteria with the fingerprinting technology, we achieved sub-species-level resolution of the predominant populations because this approach allowed two DNA fragments with a single nucleotide difference in their sequences to be resolved into two bands. A multivariate statistical method was used to correlate changes in the urine and fecal samples. This analysis led to the identification of ten bacterial populations, each of which showed a correlation with at least one urine metabolite. Two bacterial populations were identified as different strains of the species Faecalibacterium prausnitzii. One strain had associations with two urine metabolites, while the other strain had eight associations with urine metabolites—six positive associations and two negative ones. As a non-targeted discovery approach, this method opened new avenues for determining the functions of individual members of the microbiota .
Since the publication of this integrated metagenomics and metabolomics methodology, next-generation, high-throughput sequencing has revolutionized microbiome research. Metagenomic sequencing of total fecal DNA samples now enables researchers to access genomic information from gut bacteria that would otherwise be inaccessible using traditional culture-based technologies [62, 63]. At first, this genomic information can be used to profile variations at the individual gene level. Many studies have focused on functionally relevant genes that might be associated with host health or disease phenotypes [64–67]. Such a gene-centric approach for metagenomic data-mining has generated many new insights into the role of the gut microbiome in human metabolic diseases; for example, volunteers with a high gene count in their microbiomes seem to be better at responding to the same dietary intervention for controlling obesity than those with a low gene count [28, 68]. However, if millions of genes are identified from a metagenomic dataset, it is not technically feasible to correlate their changes with urine metabolome changes. Eventually, we still need to identify the genomic sequences of the strains in the gut microbiome that correlate with specific metabolites or disease phenotypes in order to understand the ecological interactions among them and between them and their hosts.
With this aim, we conducted a clinical trial of a gut microbiota-targeted dietary intervention during which urine and fecal samples were collected so that an integrated metagenomics–metabolomics strategy could be used to dissect the contribution of the gut microbiome to human metabolic disease . Time-series sample collection in such a study design would increase the statistical power needed to correlate strain-level variations in the gut ecosystem with metabolites produced by gut bacteria and delivered into the host systemic circulation.
In this clinical trial, 17 morbidly obese children with a genetic defect called Prader–Willi syndrome were hospitalized for 3 months, and 21 children with simple obesity were hospitalized for 1 month, and both groups were placed on a diet based on whole grains, traditional Chinese medicinal foods and prebiotics. At baseline and at the end of each month, urine and fecal samples were collected. Both cohorts lost substantial amounts of their initial bodyweight and exhibited significantly improved glucose homeostasis, lipid profiles and liver function. Transplantation of the pre- and post-intervention gut microbiota from the same individual into germ-free mice showed that the pre-intervention microbiota induced inflammation in the gut and liver, and fat accumulation in adipocytes of the germ-free mice, whereas transplantation of the post-intervention microbiota did not induce these effects. 16S rRNA gene sequencing-based analysis also confirmed that the dietary intervention significantly modulated the gut microbiota structure of the volunteers, with concomitant improvement of metabolic phenotypes. To assess the contribution of the gut microbiome to childhood obesity in the two cohorts studied, we then used an integrated metagenomics–metabolomics approach to determine whether strain-level dissection could be achieved.
Metagenomic sequencing of 110 fecal DNA samples at 8 Gb each led to the identification of two million non-redundant genes. Using co-abundance analysis, 376 CAGs were obtained with more than 700 genes, indicating that they were bacterial genomes. Of these, 161 CAGs were selected for further analysis as they were shared by more than 20 % of the samples, and thus represented the predominant bacterial populations in these cohorts. From these 161 CAGs, 118 high-quality draft genomes were assembled, each of which could meet at least five of the six criteria for assessing the quality of Human Microbiome Project reference genomes obtained from sequencing of pure cultures.
After the dietary intervention, NMR-based metabolomic analysis of urine samples showed that the levels of four metabolites were significantly increased and the levels of nine metabolites were decreased. Interestingly, among the nine metabolites with decreased levels was trimethylamine-N-oxide (TMAO), a co-metabolite between host and gut bacteria, which can promote plaque formation and increase the risk for atherosclerosis. TMAO is transformed in the liver from a precursor called trimethylamine (TMA), which in turn is produced by some gut bacteria by fermenting dietary choline from animal fat such as phosphatidylcholine . To determine which gut bacteria can convert choline into TMA, we used Spearman correlation to test the association between the 118 high-quality draft genomes and the urine concentration of TMAO. Among the 31 genomes that were correlated with TMAO concentration in the urine, 13 were found to contain the genes encoding choline TMA-lyase and choline TMA-lyase-activating enzyme, the two genes required to convert choline to TMA. These genomes are members of Ruminococcus spp., Parabacteroides spp. and Bacteroides spp. The next step would be to isolate these bacteria and validate their functions for converting choline to TMA and their association with increased risk of atherosclerosis in gnotobiotic models.
The need for new integrative approaches
Since the publication of proof-of-principle studies to show the feasibility of using integrated metagenomics–metabolomics approaches for “functional metagenomics”, researchers have called for “a marriage between metagenomics and metabolomics”, not only in the human microbiome field but also in almost all other microbiome fields [71–76]. Such approaches are facilitating the identification of bacterial populations that are associated with functional effects in health and disease.
Integrated microbiome and metabolome analysis identified the genera Ruminococcus and Butyricicoccus as being associated with butyrate production, and distinguished elderly subjects in the community from those in long-term residential care . Two-week food exchanges in subjects from two populations, in which African–Americans were fed a high-fiber, low-fat African-style diet and rural Africans were fed a high-fat, low-fiber Western-style diet, resulted in changes at the specific genus level of the microbiota and associated changes in metabolites in urine and fecal matter known to affect cancer risk .
Chromatographic–mass spectrometric methods, such as ultra-performance liquid chromatography–mass spectrometry (UPLC–MS)-, LC–MS-, and gas chromatography–mass spectrometry (GC–MS)-based profiling techniques, have also been widely used to detect metabolites in urine, plasma, or other samples [79, 80].
New approaches for the integration of microbiome and metabolomic profiles are also being developed. For example, Noecker and colleagues introduced a comprehensive analytical framework to systematically link variations in metabolomic data with microbial community composition . Bouslimani and colleagues described the implementation of an approach to study the chemical make-up of the surface of human skin and to correlate this with specific skin microbes, using three-dimensional mapping of MS data and microbial 16S rRNA gene sequences . However, strain-level dissection is still a bottleneck for many association studies based on these various approaches. The integrated metagenomics–metabolomics strategy described earlier can identify high-quality draft genomes, which are not only associated with disease-relevant metabolites, but are also shown to encode the genes required for producing the precursors of those metabolites. These identified genomes represent good candidates for downstream isolation and mechanistic studies in gnotobiotic models. Yet this approach has its limitations. For example, the canopy-based algorithm can only reconstruct high-quality draft genomes of prevalent gut bacteria. Furthermore, the NMR-based metabolomics method is also rather limited in identifying disease-relevant urine metabolites. Therefore, more universally applicable approaches are needed to link specific strains or populations in the microbiome with specific metabolites to facilitate strain-level dissection of the contribution of the gut microbiome to human metabolic diseases.
Conclusions and future directions
Strain-level dissection of metagenomic datasets is crucial for conducting high-quality association studies as the first step for demonstrating a causative role for the gut microbiome in human metabolic diseases. However, many confounding factors may impair the quality of associative findings.
The genetic capacity of a functional microbial gene or pathway to contribute to a disease phenotype in the host does not necessarily lead to a causative interaction in the gut ecosystem. For example, the genomes of many bacterial strains in soil environments encode the pathway for converting choline to TMA . We can envision that colonization of germ-free animals with such strains may lead to the associated disease phenotype, but such results may be spurious because these strains are not normal members of the gut ecosystem. Only TMA-producing strains resident in the human gut may have the potential to contribute to atherosclerosis.
Our Prader–Willi syndrome study  showed that among the 31 bacterial genomes that were positively associated with urine TMAO concentration, only 13 encoded the functional genes required to convert choline to the precursor TMA. This means that more than half of the associations may not be relevant for this function. Isolating the strains corresponding to the 13 genomes, that were not only correlated with urine TMAO concentration but also harbored the functional genes, would be the next logical step to move to mechanistic studies to investigate a causative role for these strains in the development of the disease phenotype.
Thus, direct assembly of high-quality draft genomes from metagenomic datasets, covering samples with sufficient inter-individual and intra-individual variations in bacterial populations, may transform human microbiome studies from mainly cataloging and inventory, to functionally demonstrating causative links between specific species or strains of the gut microbiota and defined pathophysiological processes in the host. Correlating fluctuations of these bacterial genomes in the gut with disease-relevant metabolites in samples such as urine, serum or fecal water can facilitate not only the identification of potentially important bacteria, but also the formulation of hypotheses on how they may impact host metabolism and participate in the pathology of chronic diseases. Findings from such studies have the potential to identify key functional bacterial strains in the gut microbiota as new diagnostic biomarkers and interventional targets for metabolic diseases.
body mass index
co-abundance gene group
gas chromatography–mass spectrometry
liquid chromatography–mass spectrometry
nuclear magnetic resonance
operational taxonomic unit
ultra-performance liquid chromatography–mass spectrometry
Dietz WH. The response of the US Centers for Disease Control and Prevention to the obesity epidemic. Ann Rev Public Health. 2015;36:575–96.
Wang Y, Mi J, Shan XY, Wang QJ, Ge KY. Is China facing an obesity epidemic and the consequences? The trends in obesity and chronic disease in China. Int J Obes. 2007;31:177–88.
Misra A, Khurana L. Obesity and the metabolic syndrome in developing countries. J Clin Endocrinol Metab. 2008;93:S9–30.
Xu Y, Wang L, He J, Bi Y, Li M, Wang T, et al. Prevalence and control of diabetes in Chinese adults. JAMA. 2013;310:948–9.
Zhao L. The gut microbiota and obesity: from correlation to causality. Nat Rev Microbiol. 2013;11:639–47.
Backhed F, Ding H, Wang T, Hooper LV, Koh GY, Nagy A, et al. The gut microbiota as an environmental factor that regulates fat storage. Proc Natl Acad Sci U S A. 2004;101:15718–23.
Backhed F, Manchester JK, Semenkovich CF, Gordon JI. Mechanisms underlying the resistance to diet-induced obesity in germ-free mice. Proc Natl Acad Sci U S A. 2007;104:979–84.
Turnbaugh PJ, Backhed F, Fulton L, Gordon JI. Diet-induced obesity is linked to marked but reversible alterations in the mouse distal gut microbiome. Cell Host Microbe. 2008;3:213–23.
Ridaura VK, Faith JJ, Rey FE, Cheng J, Duncan AE, Kau AL, et al. Gut microbiota from twins discordant for obesity modulate metabolism in mice. Science. 2013;341:1241214.
Cani PD, Bibiloni R, Knauf C, Waget A, Neyrinck AM, Delzenne NM, et al. Changes in gut microbiota control metabolic endotoxemia-induced inflammation in high-fat diet-induced obesity and diabetes in mice. Diabetes. 2008;57:1470–81.
Vijay-Kumar M, Aitken JD, Carvalho FA, Cullender TC, Mwangi S, Srinivasan S, et al. Metabolic syndrome and altered gut microbiota in mice lacking Toll-like receptor 5. Science. 2010;328:228–31.
Suarez-Zamorano N, Fabbiano S, Chevalier C, Stojanović O, Colin DJ, Stevanović A, et al. Microbiota depletion promotes browning of white adipose tissue and reduces obesity. Nat Med. 2015;21:1497–501.
Bonora E, Zavaroni I, Alpi O, Pezzarossa A, Bruschi F, Dall'Aglio E, et al. Relationship between blood pressure and plasma insulin in non-obese and obese non-diabetic subjects. Diabetologia. 1987;30:719–23.
DeFronzo RA, Ferrannini E. Insulin resistance. A multifaceted syndrome responsible for NIDDM, obesity, hypertension, dyslipidemia, and atherosclerotic cardiovascular disease. Diabetes Care. 1991;14:173–94.
Hotamisligil GS. Inflammation and metabolic disorders. Nature. 2006;444:860–7.
Vrieze A, Van Nood E, Holleman F, Salojärvi J, Kootte RS, Bartelsman JF, et al. Transfer of intestinal microbiota from lean donors increases insulin sensitivity in individuals with metabolic syndrome. Gastroenterology. 2012;143:913–6.
Cani PD, Amar J, Iglesias MA, Poggi M, Knauf C, Bastelica D, et al. Metabolic endotoxemia initiates obesity and insulin resistance. Diabetes. 2007;56:1761–72.
Matsushita N, Osaka T, Haruta I, Ueshiba H, Yanagisawa N, Omori-Miyake M, et al. Effect of lipopolysaccharide on the progression of non-alcoholic fatty liver disease in high caloric diet-fed mice. Scand J Immunol. 2016;83:109–18.
Kheirandish-Gozal L, Peris E, Wang Y, Tamae Kakazu M, Khalyfa A, Carreras A, et al. Lipopolysaccharide-binding protein plasma levels in children: effects of obstructive sleep apnea and obesity. J Clin Endocrinol Metab. 2014;99:656–63.
Lea DE, Coulson CA. The distribution of the numbers of mutants in bacterial populations. J Genet. 1949;49:264–85.
Ley RE, Peterson DA, Gordon JI. Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell. 2006;124:837–48.
Madigan M, Martinko J, Stahl D, Clark D. Brock biology of microorganisms. 13th ed. New Jersey: Pearson Education; 2012.
Wayne LG. International Committee on Systematic Bacteriology: announcement of the report of the ad hoc Committee on Reconciliation of Approaches to Bacterial Systematics. Zentralbl Bakteriol Mikrobiol Hyg A. 1988;268:433–4.
Rasko DA, Rosovitz MJ, Myers GS, Mongodin EF, Fricke WF, Gajer P, et al. The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J Bacteriol. 2008;190:6881–93.
Smokvina T, Wels M, Polka J, Chervaux C, Brisse S, Boekhorst J, et al. Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity. PLoS One. 2013;8, e68731.
Ley RE, Bäckhed F, Turnbaugh P, Lozupone CA, Knight RD, Gordon JI. Obesity alters gut microbial ecology. Proc Natl Acad Sci U S A. 2005;102:11070–5.
Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, Gordon JI. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006;444:1027–31.
Le Chatelier E, Nielsen T, Qin J, Prifti E, Hildebrand F, Falony G, et al. Richness of human gut microbiome correlates with metabolic markers. Nature. 2013;500:541–6.
Wang J, Tang H, Zhang C, Zhao Y, Derrien M, Rocher E, et al. Modulation of gut microbiota during probiotic-mediated attenuation of metabolic syndrome in high fat diet-fed mice. ISME J. 2015;9:1–15.
Ussar S, Griffin NW, Bezy O, Fujisaka S, Vienberg S, Softic S, et al. Interactions between gut microbiota, host genetics and diet modulate the predisposition to obesity and metabolic syndrome. Cell Metab. 2015;22:516–30.
Kostic AD, Gevers D, Siljander H, Vatanen T, Hyötyläinen T, Hämäläinen AM, et al. The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. Cell Host Microbe. 2015;17:260–73.
Faith JJ, Colombel JF, Gordon JI. Identifying strains that contribute to complex diseases through the study of microbial inheritance. Proc Natl Acad Sci U S A. 2015;112:633–40.
Zhang C, Zhang M, Wang S, Han R, Cao Y, Hua W, et al. Interactions between gut microbiota, host genetics and diet relevant to development of metabolic syndromes in mice. ISME J. 2010;4:232–41.
Zhang C, Zhang M, Pang X, Zhao Y, Wang L, Zhao L, et al. Structural resilience of the gut microbiota in adult mice under high-fat dietary perturbations. ISME J. 2012;6:1848–57.
Tettelin H, Riley D, Cattuto C, Medini D. Comparative genomics: the bacterial pan-genome. Curr Opin Microbiol. 2008;11:472–7.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R. The microbial pan-genome. Curr Opin Genet Dev. 2005;15:589–94.
Luo C, Knight R, Siljander H, Knip M, Xavier R, Gevers D. ConStrains identifies microbial strains in metagenomic datasets. Nat Biotechnol. 2015;33:1045–52.
Cleary B, Brito IL, Huang K, Gevers D, Shea T, Young S, et al. Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning. Nat Biotechnol. 2015;33:1053–60.
Sahl JW, Schupp JM, Rasko DA, Colman RE, Foster JT, Keim P. Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data. Genome Med. 2015;7:52.
Sangwan N, Xia F, Gilbert JA. Recovering complete and draft population genomes from metagenome datasets. Microbiome. 2016;4:8.
Nielsen HB, Almeida M, Juncker AS, Rasmussen S, Li J, Sunagawa S, et al. Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes. Nat Biotechnol. 2014;32:822–8.
Fei N, Zhao L. An opportunistic pathogen isolated from the gut of an obese human causes obesity in germfree mice. ISME J. 2013;7:880–4.
Wisplinghoff H, Seifert H, Tallent SM, Bischoff T, Wenzel RP, Edmond MB. Nosocomial bloodstream infections in pediatric patients in United States hospitals: epidemiology, clinical features and susceptibilities. Pediatr Infect Dis J. 2003;22:686–91.
Dao MC, Everard A, Aron-Wisnewsky J, Sokolovska N, Prifti E, Verger EO, et al. Akkermansia muciniphila and improved metabolic health during a dietary intervention in obesity: relationship with gut microbiome richness and ecology. Gut. 2016;65:426–36.
Everard A, Belzer C, Geurts L, Ouwerkerk JP, Druart C, Bindels LB, et al. Cross-talk between Akkermansia muciniphila and intestinal epithelium controls diet-induced obesity. Proc Natl Acad Sci U S A. 2013;110:9066–71.
Morotomi M, Nagai F, Watanabe Y. Description of Christensenella minuta gen. nov., sp. nov., isolated from human faeces, which forms a distinct branch in the order Clostridiales, and proposal of Christensenellaceae fam. nov. Int J Syst Evol Microbiol. 2012;62:144–9.
Goodrich JK, Waters JL, Poole AC, Sutter JL, Koren O, Blekhman R, et al. Human genetics shape the gut microbiome. Cell. 2014;159:789–99.
Jia W, Li H, Zhao L, Nicholson JK. Gut microbiota: a potential new territory for drug targeting. Nat Rev Drug Discov. 2008;7:123–9.
Nicholson JK, Holmes E, Kinross J, Burcelin R, Gibson G, Jia W, et al. Host-gut microbiota metabolic interactions. Science. 2012;336:1262–7.
Wikoff WR, Anfora AT, Liu J, Schultz PG, Lesley SA, Peters EC, et al. Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proc Natl Acad Sci U S A. 2009;106:3698–703.
Glinghammar B, Venturi M, Rowland IR, Rafter JJ. Shift from a dairy product-rich to a dairy product-free diet: influence on cytotoxicity and genotoxicity of fecal water—potential risk factors for colon cancer. Am J Clin Nutr. 1997;66:1277–82.
Venturi M, Hambly RJ, Glinghammar B, Rafter JJ, Rowland IR. Genotoxic activity in human faecal water and the role of bile acids: a study using the alkaline comet assay. Carcinogenesis. 1997;18:2353–9.
Mirvish SS, Haorah J, Zhou L, Hartman M, Morris CR, Clapper ML. N-nitroso compounds in the gastrointestinal tract of rats and in the feces of mice with induced colitis or fed hot dogs or beef. Carcinogenesis. 2003;24:595–603.
Lambert JD, Hong J, Yang GY, Liao J, Yang CS. Inhibition of carcinogenesis by polyphenols: evidence from laboratory investigations. Am J Clin Nutr. 2005;81:284–91.
Cicerone C, Nenna R, Pontone S. Th17, intestinal microbiota and the abnormal immune response in the pathogenesis of celiac disease. Gastroenterol Hepatol Bed Bench. 2015;8:117–22.
Cryan JF, Dinan TG. Mind-altering microorganisms: the impact of the gut microbiota on brain and behaviour. Nat Rev Neurosci. 2012;13:701–12.
Russell WR, Gratz SW, Duncan SH, Holtrop G, Ince J, Scobbie L, et al. High-protein, reduced-carbohydrate weight-loss diets promote metabolite profiles likely to be detrimental to colonic health. Am J Clin Nutr. 2011;93:1062–72.
Taki K, Tsuruta Y, Niwa T. Indoxyl sulfate and atherosclerotic risk factors in hemodialysis patients. Am J Nephrol. 2007;27:30–5.
Meijers BK, Van Kerckhoven S, Verbeke K, Dehaen W, Vanrenterghem Y, Hoylaerts MF, et al. The uremic retention solute p-cresyl sulfate and markers of endothelial damage. Am J Kidney Dis. 2009;54:891–901.
Bain MA, Fornasini G, Evans AM. Trimethylamine: metabolic, pharmacokinetic and safety aspects. Curr Drug Metab. 2005;6:227–40.
Li M, Wang B, Zhang M, Rantalainen M, Wang S, Zhou H, et al. Symbiotic gut microbes modulate human metabolic phenotypes. Proc Natl Acad Sci U S A. 2008;105:2117–22.
Schloss PD, Handelsman J. Metagenomics for studying unculturable microorganisms: cutting the Gordian knot. Genome Biol. 2005;6:229.
Tringe SG, von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, et al. Comparative metagenomics of microbial communities. Science. 2005;308:554–7.
Qin J, Li Y, Cai Z, Li S, Zhu J, Zhang F, et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature. 2012;490:55–60.
Qin N, Yang F, Li A, Prifti E, Chen Y, Shao L, et al. Alterations of the human gut microbiome in liver cirrhosis. Nature. 2014;513:59–64.
Karlsson FH, Tremaroli V, Nookaew I, Bergström G, Behre CJ, Fagerberg B, et al. Gut metagenome in European women with normal, impaired and diabetic glucose control. Nature. 2013;498:99–103.
Forslund K, Hildebrand F, Nielsen T, Falony G, Le Chatelier E, Sunagawa S, et al. Disentangling type 2 diabetes and metformin treatment signatures in the human gut microbiota. Nature. 2015;528:262–6.
Cotillard A, Kennedy SP, Kong LC, Prifti E, Pons N, Le Chatelier E, et al. Dietary intervention impact on gut microbial gene richness. Nature. 2013;500:585–8.
Zhang C, Yin A, Li H, Wang R, Wu G, Shen J, et al. Dietary modulation of gut microbiota contributes to alleviation of both genetic and simple obesity in children. EBioMed. 2015;2:966–82.
Wang Z, Klipfell E, Bennett BJ, Koeth R, Levison BS, Dugar B, et al. Gut flora metabolism of phosphatidylcholine promotes cardiovascular disease. Nature. 2011;472:57–63.
Turnbaugh PJ, Gordon JI. An invitation to the marriage of metagenomics and metabolomics. Cell. 2008;134:708–13.
Gilbert JA, Meyer F, Antonopoulos D, Balaji P, Brown CT, Brown CT, et al. Meeting report: the terabase metagenomics workshop and the vision of an Earth microbiome project. Stand Genomic Sci. 2010;3:243–8.
Lu K, Abo RP, Schlieper KA, Graffam ME, Levine S, Wishnok JS, et al. Arsenic exposure perturbs the gut microbiome and its metabolic profile in mice: an integrated metagenomics and metabolomics analysis. Environ Health Perspect. 2014;122:284–91.
Kimes NE, Callaghan AV, Aktas DF, Smith WL, Sunner J, Golding B, et al. Metagenomic analysis and metabolite profiling of deep-sea sediments from the Gulf of Mexico following the Deepwater Horizon oil spill. Front Microbiol. 2013;4:50.
Karlsson FH, Nookaew I, Petranovic D, Nielsen J. Prospects for systems biology and modeling of the gut microbiome. Trends Biotechnol. 2011;29:251–8.
Jones OAH, Sdepanian S, Lofts S, Svendsen C, Spurgeon DJ, Maguire ML, et al. Metabolomic analysis of soil communities can be used for pollution assessment. Environ Toxicol Chem. 2014;33:61–4.
Claesson MJ, Jeffery IB, Conde S, Power SE, O'Connor EM, Cusack S, et al. Gut microbiota composition correlates with diet and health in the elderly. Nature. 2012;488:178–84.
O'Keefe SJ, Li JV, Lahti L, Ou J, Carbonero F, Mohammed K, et al. Fat, fibre and cancer risk in African Americans and rural Africans. Nat Commun. 2015;6:6342.
Martin FP, Dumas ME, Wang Y, Legido-Quigley C, Yap IK, Tang H, et al. A top-down systems biology view of microbiome-mammalian metabolic interactions in a mouse model. Mol Syst Biol. 2007;3:112.
Raman M, Ahmed I, Gillevet PM, Probert CS, Ratcliffe NM, Smith S, et al. Fecal microbiome and volatile organic compound metabolome in obese humans with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol. 2013;11:868–75.
Noecker C, Eng A, Srinivasan S, Theriot CM, Young VB, Jansson JK, et al. Metabolic model-based integration of microbiome taxonomic and metabolomic profiles elucidates mechanistic links between ecological and metabolic variation. mSystems. 2016;1, e00013.
Bouslimani A, Porto C, Rath CM, Wang M, Guo Y, Gonzalez A, et al. Molecular cartography of the human skin surface in 3D. Proc Natl Acad Sci U S A. 2015;112, E2120.
Craciun S, Balskus EP. Microbial conversion of choline to trimethylamine requires a glycyl radical enzyme. Proc Natl Acad Sci U S A. 2012;109:21307–12.
The authors are grateful for support from the National Natural Science Foundation of China (31330005 and 81401141).
The authors declare that they have no competing interests.
All authors made substantial contribution to conception and drafting of the manuscript. All authors revised and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Zhang, C., Zhao, L. Strain-level dissection of the contribution of the gut microbiome to human metabolic disease. Genome Med 8, 41 (2016). https://doi.org/10.1186/s13073-016-0304-1
- Bioactive Metabolite
- Metagenomic Dataset
- High Population Level
- Human Chronic Disease
- Functional Microbial Gene