Gut microbial determinants of clinically important improvement in patients with rheumatoid arthritis
Genome Medicine volume 13, Article number: 149 (2021)
Rapid advances in the past decade have shown that dysbiosis of the gut microbiome is a key hallmark of rheumatoid arthritis (RA). Yet, the relationship between the gut microbiome and clinical improvement in RA disease activity remains unclear. In this study, we explored the gut microbiome of patients with RA to identify features that are associated with, as well as predictive of, minimum clinically important improvement (MCII) in disease activity.
We conducted a retrospective, observational cohort study on patients diagnosed with RA between 1988 and 2014. Whole metagenome shotgun sequencing was performed on 64 stool samples, which were collected from 32 patients with RA at two separate time-points approximately 6–12 months apart. The Clinical Disease Activity Index (CDAI) of each patient was measured at both time-points to assess achievement of MCII; depending on this clinical status, patients were distinguished into two groups: MCII+ (who achieved MCII; n = 12) and MCII− (who did not achieve MCII; n = 20). Multiple linear regression models were used to identify microbial taxa and biochemical pathways associated with MCII while controlling for potentially confounding factors. Lastly, a deep-learning neural network was trained upon gut microbiome, clinical, and demographic data at baseline to classify patients according to MCII status, thereby enabling the prediction of whether a patient will achieve MCII at follow-up.
We found age to be the largest determinant of the overall compositional variance in the gut microbiome (R2 = 7.7%, P = 0.001, PERMANOVA). Interestingly, the next factor identified to explain the most variance in the gut microbiome was MCII status (R2 = 3.8%, P = 0.005). Additionally, by looking at patients’ baseline gut microbiome profiles, we observed significantly different microbiome traits between patients who eventually showed MCII and those who did not. Taxonomic features include alpha- and beta-diversity measures, as well as several microbial taxa, such as Coprococcus, Bilophila sp. 4_1_30, and Eubacterium sp. 3_1_31. Notably, patients who achieved clinical improvement had higher alpha-diversity in their gut microbiomes at both baseline and follow-up visits. Functional profiling identified fifteen biochemical pathways, most of which were involved in the biosynthesis of L-arginine, L-methionine, and tetrahydrofolate, to be differentially abundant between the MCII patient groups. Moreover, MCII+ and MCII− groups showed significantly different fold-changes (from baseline to follow-up) in eight microbial taxa and in seven biochemical pathways. These results could suggest that, depending on the clinical course, gut microbiomes not only start at different ecological states, but also are on separate trajectories. Finally, the neural network proved to be highly effective in predicting which patients will achieve MCII (balanced accuracy = 90.0%, leave-one-out cross-validation), demonstrating potential clinical utility of gut microbiome profiles.
Our findings confirm the presence of taxonomic and functional signatures of the gut microbiome associated with MCII in RA patients. Ultimately, modifying the gut microbiome to enhance clinical outcome may hold promise as a future treatment for RA.
Rheumatoid arthritis (RA) is a chronic autoimmune inflammatory disease characterized by symmetric polyarticular inflammation and destruction primarily of the synovial joints, as well as of other organ systems . The prognosis of RA has improved over recent decades in parallel with advancements in diagnosis and treatment, particularly the widespread use of biologic and targeted synthetic disease-modifying anti-rheumatic drugs (DMARDs) that enable many persons with RA to achieve low disease activity or clinical remission. However, the exact etiology and pathogenesis of RA are not yet fully understood . In this regard, population-based studies have provided promising evidence that genetic factors contribute to RA onset [3,4,5,6,7]; however, the low concordance rate of RA in monozygotic twins largely suggests the role of non-genetic, environmental factors influencing the incidence of RA . These non-genetic factors include smoking history , acute infections , and oral and gut microbiota .
During the past decade, the role of the gut microbiome in RA pathogenesis has been demonstrated by several experimental studies [11,12,13,14,15,16]. For example, Maeda et al. have shown increased sensitivity to arthritis (via auto-reactive T cell activation in the intestine) in germ-free SKG mice following fecal microbiota transplantation from early RA patients . In addition, another study reported that inflammatory arthritis was strongly attenuated in K/BxN mice under germ-free (GF) conditions; however, the introduction of segmented filamentous bacteria restored splenic auto-antibodies, serum auto-antibodies, and T-helper 17 (Th17) cells . Moreover, the role of gut microbiome in RA pathogenesis is further supported by the attenuation of arthritis in Il1rn−/− mice by Tobramycin antibiotic treatment, which led to the decrease in relative abundances of gut commensals, such as Helicobacter, Flexispira, Clostridium, and Dehalobacterium .
Cross-sectional, human gut microbiome studies have elucidated the potential role of gut microbiome “dysbiosis” in RA [13, 14, 17, 18]. A study by Chen et al. found lower gut microbial diversity and species richness among RA patients compared to healthy controls; interestingly, patients using methotrexate (MTX) and hydroxychloroquine (HCQ) were observed to have higher gut microbiome diversity and richness than patients not on these medications, possibly indicating partial restoration of normal gut microbiome features with these treatments . Additionally, patients with RA displayed significant improvement in disease activity after being provided with probiotics containing Bacillus coagulans  or Lactobacillus casei [20, 21], providing promising evidence towards probiotic therapies in RA treatment. Moreover, another study revealed significant associations between the relative abundance of gut microbial taxa (e.g., Euryarchaeota, Gammaproteobacteria, Erysipelotrichi, and Coriobacteriales) and the disease activity score on 28 joints (DAS28) . Lastly, to demonstrate the potential of targeting the gut microbiome to modulate host immune response and to treat arthritis, Marietta et al. have shown that the oral administration of Prevotella histicola, which is a human gut-derived commensal bacterium, in transgenic mice expressing RA-associated DQ8 genes can suppress collagen-induced arthritis via regulation of the mucosal immune system .
Certainly, there has been a vast array of recent animal-model studies, cross-sectional case-control studies, and clinical trials showing that a perturbed gut microbiome is a key hallmark of RA. Yet, despite this wide range of novel findings, the association of the gut microbiome with minimum clinically important improvement (MCII) in disease activity in RA patients has yet to be closely examined. The MCII represents the minimal meaningful change (reduction) in quantitative disease activity, and is relevant to patients in terms of improvement in disease symptoms and associated clinical parameters . Although the primary goal in RA management is to achieve and sustain clinical remission or, at least, low disease activity, the MCII in disease activity is also frequently used in clinical settings to evaluate the initial response to treatments. For this, there exists a variety of measurements to quantify RA disease activity, including the Disease Activity Score on 28-joints (DAS28), the Simplified Disease Activity Index (SDAI), and the Clinical Disease Activity Index (CDAI) [25, 26]. Among these quantitative indices, the CDAI is one of the most straightforward to use, as it is designed as a simple numerical addition of four components (clinician evaluator global assessment, patient global assessment, 28-swollen joint count, and 28-tender joint count), and does not require acute-phase reactant laboratory tests for its calculation .
As medicine evolves towards becoming a big data-centric and bioinformatics-driven discipline [27,28,29], one of the most promising translational opportunities with gut microbiome datasets arises from their predictive capabilities. In particular, through integrating key biological features (e.g., taxa, functions, genes) of the microbiome with cutting-edge, machine-learning approaches, large-scale data from gut microbiomes are positioned to inform various health and wellness applications and to guide or complement clinical practice. To this point, the gut microbiome has been demonstrated in recent years to facilitate detection of disease [30,31,32,33,34]; classification of disease subtypes and progression stages [35,36,37]; prediction of clinical outcomes and treatment efficacy [38,39,40,41,42]; personalized nutrition by prediction of postprandial glycemic response [43,44,45]; and estimation of chronological age . Notably, in a recent study, by applying a random-forest machine-learning model to stool metagenomic data from treatment-naive, new-onset RA patients, Artacho et al. found that the gut microbiome can aid in the prediction of response to oral administration of methotrexate . Taken together, these examples highlight the potential value of translating microbiome data into new prognostic tools for all areas of precision medicine.
In this study, by investigating the association of gut microbiome profiles from RA patients with MCII and with other patient factors, we demonstrate a computational approach for utilizing gut microbiome information to identify which patients are likely to show clinical improvement independent of baseline clinical features. To this end, we collect shotgun stool metagenomes from a pilot cohort of 32 patients with RA at two separate time-points (i.e., baseline and follow-up) approximately 6–12 months apart. First, we examine the association of gut microbiome with MCII in RA disease activity. Our results show that the status of whether clinical improvement is achieved (or not) is a significant factor contributing to the variance in gut microbiome taxonomic composition. Next, for each time-point, we examine microbiome properties (alpha- and beta-diversity, microbial taxa, and biochemical pathways) that differentiate patients who eventually show clinical improvement from those who do not. Afterwards, we identify taxonomic and functional features whose magnitude of and/or direction of change (from baseline to follow-up) varies differently between these two patient groups. Finally, we train a deep-learning neural network model on baseline microbiome, clinical, and demographic data to assess how well we can predict whether MCII in disease activity is attained. Encouragingly, we find that the neural network achieves a 90.0% balanced accuracy in leave-one-out cross-validation, with a compelling accuracy in those who showed clinical improvement (12 correctly predicted among 12 total). Overall, our study offers novel insights into how gut microbial signatures are connected to the trajectory of disease activity in RA, and provides proof-of-concept evidence that accurately forecasting MCII from a stool sample may be possible.
Patient enrollment, eligibility criteria, and sample collection
The study population consisted of consecutive patients with RA attending the outpatient practice of the Division of Rheumatology at Mayo Clinic in Rochester, Minnesota. Eligibility required patients to be adults 18 years of age or older with a clinical diagnosis of RA by a rheumatologist on the basis of the American College of Rheumatology/European League Against Rheumatism 2010 revised classification criteria for RA . Patients were excluded if they did not comprehend English; were unable to provide written informed consent; or were members of a vulnerable population (e.g., incarcerated subjects). On the other hand, patients were eligible irrespective of use of any particular medication.
From 86 patients fulfilling the eligibility criteria, stool samples were collected from patients who had two outpatient visits approximately 6–12 months apart; whose clinical data (to assess CDAI and MCII) and demographic information were fully available at both clinical visits; who were not in clinical remission at both visits. In all, this study includes 32 participants, of whom 65.6% (21 of 32) were female.
For whole metagenome shotgun sequencing, stool samples were stored in our ongoing Mayo Clinic Rheumatology Biobank. This biorepository was created for long-term storage of diverse biological samples (e.g., serum, plasma, stool, white blood cells) from de-identified RA patients for use in research. Clinical and demographic data, including the numbers of tender and swollen joints, patient and evaluator global assessments, C-reactive protein (CRP, mg/L), smoking status, and titers for rheumatoid factor (RF, IU/mL) and anti-cyclic citrullinated peptide antibodies (ACPA), were collected from the electronic medical records. All patients provided written informed consent. The study was approved by the Mayo Clinic Institutional Review Board (no. 14-000616).
Determination of minimum clinically important improvement (MCII) in RA disease activity
The CDAI of each patient was measured at two time-points. By taking into account the swollen joint count (of 28 joints), tender joint count (of 28 joints), and the global assessments of disease activity (scored 0–10 on a visual analog scale) by both patient and clinician, the CDAI is scored on a scale ranging from 0 to 76 points . The level of disease activity can be interpreted as low (2.9 ≤ CDAI ≤ 10), moderate (10 < CDAI ≤ 22), or high (22 < CDAI), while CDAI ≤ 2.8 indicates the state of remission . A decrease in CDAI of at least 1 for patients with low disease activity; of at least 6 for patients with moderate disease activity; and of at least 12 for patients with high disease activity between two consecutive visits is considered as MCII in RA disease activity . Based upon these criteria, the study participants can be partitioned into two groups: (i) patients who showed clinical improvement (MCII+) and (ii) patients who did not show clinical improvement (MCII−) at follow-up visit.
Stool sample collection, DNA extraction, and shotgun metagenome sequencing
Stool samples from patients with rheumatoid arthritis were stored in their house-hold freezers (−20 °C) prior to shipment on dry ice to the Medical Genome Facility Research Core at Mayo Clinic (Rochester, MN). Once received, the samples were stored at −80 °C until DNA extraction. DNA extraction from stool samples was conducted as follows: Aliquots were created from parent stool samples using a tissue punch, and the resulting child samples were then mixed with reagents from the Qiagen Power Fecal Kit. This included adding 60 uL of reagent C1 and the contents of a power bead tube (garnet beads and power bead solution). These were then vigorously vortexed to bring the sample punch into solution and centrifuged at 18,000× g for 15 min. From there, the samples were added into a mixture of magnetic beads using a JANUS liquid handler. The samples were then run through a Chemagic MSM1 according to the manufacturer’s protocol. After DNA extraction, paired-end libraries were prepared using 500 ng genomic DNA according to the manufacturer’s instructions for the NEBNext Ultra library prep kit (New England BioLabs). The concentration and size distribution of the completed libraries were determined using an Agilent Bioanalyzer DNA 1000 chip (Santa Clara, CA) and Qubit fluorometry (Invitrogen, Carlsbad, CA). Libraries were sequenced at 23–70 million reads per sample following Illumina’s standard protocol using the Illumina cBot and HiSeq 3000/4000 PE Cluster Kit. The flow cells were sequenced as 150 × 2 paired-end reads on an Illumina HiSeq 4000 using the HiSeq 3000/4000 sequencing kit and HiSeq Control Software HD 22.214.171.124. Base-calling was performed using Illumina’s RTA version 2.7.7.
Quality filtration of sequenced reads
Sequenced reads were processed with the KneadData v0.5.1 quality-control pipeline (http://huttenhower.sph.harvard.edu/kneaddata), which uses Trimmomatic v0.36  and Bowtie2 v2.3.2  for removal of low-quality read bases and human reads, respectively. Trimmomatic v0.36 was run with parameters SLIDINGWINDOW:4:30, and Phred quality scores were thresholded at “< 30.” Illumina adapter sequences were removed, and trimmed non-human reads shorter than 60 bp in nucleotide length were discarded. Potential human contamination was filtered by removing reads that aligned to the human genome (reference genome hg19).
Taxonomic and functional profiling of stool metagenomes
Taxonomic profiling was performed using the MetaPhlAn2 v2.7.8  phylogenetic clade identification pipeline with default parameters. Briefly, MetaPhlAn2 classifies metagenomic reads to taxonomies based on a database (mpa_v20_m200) of clade-specific marker genes derived from ~ 17,000 microbial genomes (corresponding to ~ 13,500 bacterial and archaeal, ~ 3500 viral, and ~ 110 eukaryotic species). Microbes of viral origin and those that were labeled as either unclassified or unknown were excluded from further analyses. Afterwards, microbiome profiles were normalized using total sum-scaling (TSS) normalization to get the relative abundances (i.e., proportions) of microbial taxonomic ranks.
Functional profiling of annotated MetaCyc biochemical pathways of stool metagenomes was quantified using the HUMAnN v2.8 pipeline  with default parameters and with the UniRef90 EC-filtered database integrated into the pipeline. Similarly to the case with taxonomic ranks, MetaCyc pathways unmapped or unintegrated onto the UniRef90 EC-filtered database were discarded from further analyses, and relative abundances of the remaining MetaCyc pathways were calculated using TSS normalization.
Permutational multivariate analysis of variance based upon taxonomic composition of microbial communities
Bray-Curtis distance matrices based on arcsine, square-root transformed relative abundances of microbial taxa (phylum to species) in stool metagenomes (collected at both clinical visits) were generated using the R “vegan” package v2.5.6. A permutational multivariate analysis of variance (PERMANOVA)  was performed on the distance matrix using the “adonis” function. P values for the test statistic (pseudo-F) were based on 999 permutations to assess the contribution of clinical and demographic characteristics (age group [age < 64 years; age ≥ 64 years], sex [male; female], smoking status [smoker; non-smoker], use of conventional synthetic disease-modifying anti-rheumatic drugs [csDMARDs], use of biologic disease-modifying anti-rheumatic drugs [bDMARDs], use of prednisone, and MCII patient group [MCII+; MCII−]) to the total variance in gut microbial community composition (of note, categorical age group was used due to the uneven and skewed distribution of continuous age). Intra-subject longitudinal variation was accounted for by constraining permutations to within visits using the “strata” argument. Both marginal (i.e., univariate analysis) and adjusted (i.e., multivariate analysis controlling for multiple covariates simultaneously) models were used to evaluate percent variance and significance of associations between gut microbiome composition and patient factors.
Comparisons of alpha- and beta-diversity between MCII patient groups
Overall ecology of gut microbiomes was evaluated by calculating alpha-diversity (Fisher’s Index and richness) and beta-diversity (Bray-Curtis distance between all sample-pairs) based upon untransformed relative abundances of microbial species in each stool metagenome using the R “vegan” package v2.5.6. Multiple linear regression models (MLRMs) were then constructed using the R “stats” package v3.6.3 to determine the alpha-diversity indices that were significantly different between MCII+ and MCII− groups. MLRMs were adjusted for clinical and demographic characteristics that explained significant proportions of the variance in gut microbial community composition. Mann-Whitney U test was used to evaluate the statistical significance of the difference in beta-diversity between the patient groups.
Identification of differentially abundant microbial taxa and biochemical pathways between MCII patient groups
To identify differentially abundant microbial taxa and biochemical pathways between MCII+ and MCII− groups (at either baseline or follow-up), MLRMs were constructed for arcsine, square-root transformed relative abundance of each taxon and pathway. All MLRMs were designed to model the relationship between a taxon/pathway and MCII patient group, while adjusting for clinical and demographic characteristics found to be significantly associated with gut microbiome compositional variance according to the aforementioned PERMANOVA analysis. Taxa and pathways were considered as differentially abundant between the two MCII patient groups if both of the following conditions were met: (i) the corresponding regression coefficient for the patient group was significant (P < 0.05) and (ii) detected in at least a third of all samples in order to avoid spurious associations based upon rarely seen events.
Quantification of fold-change in gut microbial taxa and biochemical pathways
Microbial taxa and biochemical pathways detected in at least a third of all samples were considered for the calculation of fold-change (log2(FC)) from baseline to follow-up visit. As log2(FC) cannot be calculated if a taxon/pathway is absent (i.e., relative abundance = 0) at either of the visits, a small pseudo-count (1.0 × 10−5) was added to both the numerator and denominator when calculating fold-changes. Then, MLRMs were designed for each taxon and pathway to identify any significant differences (P < 0.05) in log2(FC) of relative abundances between the two MCII patient groups. All MLRMs were adjusted for clinical and demographic characteristics found to be significantly associated with gut microbiome compositional variance according to the aforementioned PERMANOVA analysis.
Construction of neural networks for predicting MCII and CDAI
Two separate multi-layer (deep) feedforward artificial neural networks with stochastic gradient descent using back-propagation, which were provided by the Python version of the “H2O” package v126.96.36.199, were constructed to meet the following two objectives (i.e., output layer): (i) classify a patient as MCII+ or MCII− from all baseline gut microbiome (relative abundances of 176 taxonomic ranks and of 262 MetaCyc pathways), clinical (CDAI, use of medications [bDMARDs, csDMARDs, and prednisone], HAQ, pain, and CRP), and demographic data (age, sex, and smoking status). In other words, predict whether a patient will achieve MCII based upon all identifiable baseline features. This model’s predictive performance was evaluated by leave-one-out cross-validation on all baseline profiles. Furthermore, Gedeon’s technique , which uses a score function based upon hidden neuron activations, is implemented on the training set to calculate variable importance; and (ii) predict CDAI using the aforementioned microbiome, clinical (except for CDAI), and demographic data as input predictor variables for the neural network. Predictive performance of this second model was evaluated by a leave-one-patient-out cross-validation method. More specifically, in each cross-validation loop, both samples from the same patient were allocated as the internal validation set, while all remaining samples were used as the internal training set for constructing the neural network to predict CDAI scores of the allocated two samples. For both objectives, the default input parameters were used for model-training except for the following: Epochs = “10,000” and Random seed = “1234.” See http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/deep-learning.html for all parameters of the neural network and their default values. Data curation and model implementation was performed in Python v3.6.4 on individual cloud instances utilizing Amazon Web Services (AWS).
MCII prediction with other machine-learning classifiers
Three different machine-learning models (logistic regression, random forest, and support-vector machines) from the “scikit-learn” package v0.24.1 were trained with the aforementioned baseline stool metagenome samples to create a classifier for predicting MCII status. The predictive performance of each classifier was evaluated by leave-one-out cross-validation. Of note, default options were used for the model training except for the following: logistic regression classifier, max_iter = “1000”; random forest, random_state = “1.”
From a total of 86 patients with RA whose blood and/or stool samples were stored in our ongoing biobank, we identified 51 patients who had at least two available stool samples collected at least 6 to 12 months apart (102 total samples). From these 51 patients, we found 36 patients (72 samples) who had fully available clinical data and demographic information at both clinical visits, thereby leading to the exclusion of 15 patients (30 samples). We excluded an additional 4 patients (8 samples) from further analysis because they were in clinical remission at both clinical visits. Hence, this retrospective, observational cohort study includes 32 participants (64 samples), of whom 65.6% (21 of 32) were female.
At the time of baseline stool sample collection, the patients had established disease with a mean age of 64.9 years (s.d. = 11.0), and a mean disease duration of 8.2 years (s.d. = 8.2). A summary of the patient enrollment, eligibility criteria, and sample collection protocol is provided in the “Methods” section. At baseline, all patients were on treatment with biologic disease-modifying anti-rheumatic drugs (bDMARDs, 46.9%), conventional synthetic disease-modifying anti-rheumatic drugs (csDMARDs, 87.5%), or prednisone (46.9%). For any medication, no association was found between its use (at either baseline or follow-up visit) and MCII in RA disease activity (Fig. 1), showing the critical need for more effective predictors of clinical improvement in RA. Baseline and follow-up visits were separated by a mean duration of 9.5 months (s.d. = 3.6 months), which was numerically longer for patients who attained MCII than for patients who did not attain MCII though not statistically significant (median 363 vs. 252 days, respectively; P = 0.08, Mann-Whitney U test). At all instances of stool sample collection, disease activity of patients varied from remission to high disease activity, with a mean CDAI of 16.3 (s.d. = 13.7) and 13.6 (s.d. = 11.6) at baseline and follow-up, respectively.
In total, 12 of the 32 (37.5%) total study participants achieved MCII in RA disease activity at their follow-up visit. The average change in CDAI for these 12 patients was –16.7 (s.d. = 12.8) units, which was, as expected, significantly different from the average change in CDAI of 5.7 (s.d. = 8.9) units for the remaining 20 of 32 (62.5%) patients who did not show improvement in RA disease activity (P = 6.9 × 10–6, Mann-Whitney U test). We used Fisher’s exact test to identify significant differences in categorical variables (e.g., age group, sex, smoking status, medication use, presence of rheumatoid factor or anti-cyclic citrullinated peptides antibodies), and Mann-Whitney U test to identify significant differences in continuous clinical measurements (CDAI, health assessment questionnaire [HAQ], swollen joint count [SJC], tender joint count [TJC], C-reactive protein [CRP], patient’s and physician’s health status assessment) between two patient groups: MCII+ (i.e., patients who showed MCII in disease activity based upon the change in CDAI from baseline to follow-up visit) and MCII– (i.e., patients who did not show MCII) (Table 1). At baseline, we found a significant association between MCII patient group (i.e., MCII+ and MCII–) and CDAI (P = 0.03). At follow-up visit, we found the following factors to be significantly associated with MCII patient group: CDAI (P = 1.9 × 10–3), change in CDAI from baseline (P = 6.9 × 10–6), pain (VAS) (P = 2.8 × 10–3), TJC (P = 0.01), patient global evaluation of disease activity (pt_vas) (P = 6.3 × 10–3), and provider global evaluation of disease activity (md_vas) (P = 0.01). No significant difference in age between the two patient groups was observed at either baseline or follow-up (P = 0.09).
MCII patient group explains significant variance in gut microbial community composition
We performed a PERMANOVA analysis to evaluate the patient characteristics that contribute to the variance in gut microbial communities of patients with RA (Methods). Using univariate (marginal) models, as well as multivariate (adjusted) models that jointly take into consideration all measurable factors, we considered MCII patient group, age group, sex, smoking status, baseline CDAI, and medication use (for csDMARDs, bDMARDs, and prednisone). Of note, we assume that the resulting percent variance explained by each variable in the adjusted model is statistically independent of other variables.
We found that MCII patient group explained 3.8% of the total variance in gut microbial communities (P = 0.002, PERMANOVA; Table 2 and Fig. 2a), after controlling for age group, CDAI, sex, smoking status, use of bDMARDs, csDMARDs, and prednisone, and intra-subject longitudinal variation. The adjusted model also showed that age group, use of csDMARDs, sex, and smoking status significantly explained 7.7%, 3.1%, 2.9%, and 2.7% of the total variance, respectively (Table 2 and Fig. 2b–e), indicating partial dependence of gut microbiome profiles on these other factors; however, CDAI (P = 0.056, PERMANOVA; Fig. 2f), treatment with bDMARDs (P = 0.280, PERMANOVA; Fig. 2g) and with prednisone (P = 0.284, PERMANOVA; Fig. 2h) were not found to have any significant association with gut microbial community composition (Table 2). Taking into account these observations, we additionally controlled for age group, use of csDMARDs, sex, and smoking status in subsequent analyses for investigating the differences in gut microbiome profiles between patients of the MCII+ and MCII− groups.
Features of baseline gut microbiomes significantly differ between MCII+ and MCII− patient groups
At baseline, we observed Bacteroidetes and Firmicutes as the most abundant phyla based upon relative abundances (Additional file 1: Figure S1a); Bacteroidales and Clostridiales as the most abundant orders (Additional file 1: Figure S1b); and Bacteroidaceae as the most abundant family (Additional file 1: Figure S1c). We next investigated the baseline gut microbiomes of all 32 patients to identify differences in ecological diversities (e.g., alpha-/beta-diversity) or in individual taxonomic and functional features between the two MCII patient groups. In effect, by knowing—albeit retrospectively—the clinical outcomes in advance, we have asked: on the basis of gut microbiome information, can differences at baseline not only provide hypotheses that connect gut microbiome to clinical improvement, but also reveal biomarkers predictive of the clinical course?
We found higher species-level alpha-diversity, that is, Fisher’s Index (P = 0.004, MLRM; and Fig. 3a) and richness (P = 0.007, MLRM; and Fig. 3b), and higher beta-diversity, that is, Bray-Curtis distances between all pairs of samples (P = 0.002, Mann-Whitney U test; and Fig. 3c) in the MCII+ group compared to the MCII− group. In addition, we sought to identify microbial taxa and microbiome-derived annotated MetaCyc biochemical pathways that were differentially abundant between the two MCII patient groups at baseline. Our analysis uncovered the following six microbial taxa as higher in the MCII+ group: Negativicutes (class), Selenomonadales (order), Prevotellaceae (family), Coprococcus (genus), Bacteroides sp. 3_1_19 (species), and Bilophila sp. 4_1_30 (species), whereas Eubacterium sp. 3_1_31 (species) was found to be higher in the MCII− group (P < 0.05; and Fig. 3d). Moreover, we found fifteen MetaCyc pathways that were differentially abundant between MCII+ and MCII− groups at baseline (P < 0.05, MLRM; and Fig. 3e). Six of these pathways, which include multiple ones for tetrahydrofolate biosynthesis and L-methionine biosynthesis, were significantly more abundant in patients of the MCII+ group than in those of the MCII– group; in contrast, the remaining nine pathways, the majority of which being for L-arginine and L-ornithine biosynthesis, and L-rhamnose degradation, were more abundant in patients of the MCII group. Taken together, our results show that gut microbiomes of the two diverging patient groups start at different ecological states even before reaching their clinical endpoints.
As was in the case at baseline, we observed a significant difference in species-level Fisher’s Index (P = 0.037, MLRM; Additional file 1: Figure S2a) between the two MCII patient groups at follow-up visit. However, richness (P = 0.094, MLRM) and Bray-Curtis distances between all sample-pairs (P = 0.310, Mann-Whitney U test) did not show significant differences (Additional file 1: Figure S2b–c). Thirteen microbial clades, including Negativicutes, Bifidobacteriales, and Selenomonadales (order); Bifidobacteriaceae, Prevotellaceae, and Oscillospiraceae (family); Bifidobacterium and Veillonella (genus); and Clostridium leptum and Roseburia inulinvorans (species), were found to significantly differ between the MCII+ and MCII− groups at follow-up visit (P < 0.05, MLRM; Additional file 1: Figure S2d). Lastly, MetaCyc pathway-level analysis at follow-up visit showed that only “Superpathway of Polyamine Biosynthesis II” was differentially abundant between the two patient groups (P < 0.011, MLRM; Additional file 1: Figure S2e).
Gut microbiome taxa and functions show significant differences in fold-change from baseline to follow-up between MCII patient groups
We examined the longitudinal variation in relative abundances (i.e., fold-change from baseline to follow-up) of microbial taxa and of biochemical pathways. From this, we sought to identify differences in how the gut microbiome changes in association with clinical outcomes (i.e., showing clinical improvement or not). First, we found that patients of the MCII+ and MCII− groups showed significant fold-change differences in the following eight microbial taxa (P < 0.05, MLRM; Fig. 4a, Additional file 1: Figure S3a): (i) Gammaproteobacteria (class), Oscillibacter (genus), Veillonella (genus), and Bacteroides vulgatus (species) were higher in the MCII+ group. This result suggests that these four taxa increased in relative abundance more highly and/or frequently in the MCII+ group compared to the MCII− group; and (ii) Coprococcus (genus), Ruminococcus (genus), Anaerotruncus colihominis (species), and Oscillibacter sp. KLE_1728 (species) were higher in the MCII− group. In other words, these four taxa increased in relative abundance more highly and/or frequently in the MCII− group than in the MCII+ group.
In the MCII+ group, the relative abundances of four taxa (Gammaproteobacteria, Oscillibacter, Veillonella, and Bacteroides) increased from baseline to follow-up (median log2(fold-change) ≥ 0.1), whereas four taxa (Coprococcus, Ruminococcus, Anaerotruncus colihominis, and Oscillibacter sp. KLE_1728) decreased in abundance (median log2(fold-change) ≤ − 0.1) (Fig. 4a, Additional file 1: Figure S3a). In the MCII− group, the relative abundances of three taxa (Coprococcus, Ruminococcus, and Anaerotruncus colihominis) increased from baseline to follow-up (median log2(fold-change) ≥ 0.1), while two taxa (Gammaproteobacteria and Oscillibacter) decreased in abundance (median log2(fold-change) ≤ − 0.1). Strikingly, these observations imply that the changes in relative abundances (from baseline to follow-up) of Gammaproteobacteria, Coprococcus, Oscillibacter, Ruminococcus, and Anaerotruncus colihominis in the MCII+ group and those in the MCII− group generally diverged in opposite directions.
Next, we identified seven biochemical pathways as having significantly different fold-changes between the two MCII patient groups (P < 0.05, MLRM; Fig. 4b, Additional file 1: Figure S3b): (i) four pathways, including those involving sugar metabolism (e.g., rhamnose degradation, a heptose derivative biosynthesis, GDP-mannose biosynthesis), had higher fold-changes in the MCII+ group; and (ii) three pathways (“Superpathway of Aromatic Amino Acid Biosynthesis”, “Chorismate Biosynthesis from 3-dehydroquinate”, and “myo-, chiro- and scyllo-inositol Degradation”) had higher fold-changes in the MCII− group.
As seen for microbial taxa, changes in relative abundance of five of these seven biochemical pathways were in opposite directions in the two patient groups: ADP-L-glycero- and beta-D-manno-heptose Biosynthesis, and Lipid IVA biosynthesis (Fig. 4b, Pathway A and C, respectively) generally increased in the MCII+ group, but decreased in the MCII– group; myo-, chiro- and scyllo-inositol Degradation (Fig. 4b, Pathway E), Chorismate Biosynthesis from 3-dehydroquinate (Fig. 4b, Pathway F), and Superpathway of Aromatic Amino Acid Biosynthesis (Fig. 4b, Pathway G) generally decreased in the MCII+ group, but increased in the MCII− group. Although it is yet uncertain why the relative abundances of these particular microbial taxa and biochemical pathways increase (or decrease) in one patient group but decrease (or increase) in the other, such analyses into the changes of distinct gut microbiome features, and how these changes are relevant to clinical improvement, can shed new light on additional insights not provided by cross-sectional datasets.
Gut microbiome is a predictive marker for clinical improvement and clinical disease activity in patients with RA
Having the capability to reliably predict whether a patient will show clinical improvement—independent of prior treatment and clinical course—would address what has been a steep challenge in the clinical practice of RA. As described above, we identified differences in baseline gut microbiome properties between MCII+ and MCII− patient groups. As an extension of these findings, we next turned to the question of how accurately baseline gut microbiome profiles and clinical and demographic data, combined with a machine-learning approach, can predict MCII class for a particular patient or group of patients; this essentially enables us to forecast whether a patient will have a good prognosis, that is, achieving MCII or not. To this end, we used a neural network classification model that incorporates baseline microbiome, clinical, and demographic data as the input variables to classify patients into one of the two MCII patient groups (Fig. 5a; Methods). The neural network model was able to distinguish the two groups with reasonably high prediction accuracy in leave-one-out cross-validation: a balanced accuracy (i.e., average of the proportions of MCII+ and MCII− samples that were correctly classified) of 90.0%, as the classification accuracy for the MCII+ and MCII− group was 100.0% (12 of 12) and 80.0% (16 of 20), respectively (Fig. 5b). Encouragingly, we were able to correctly predict MCII in all twelve patients who did indeed show clinical improvement. Furthermore, the deep-learning neural network provided the best classification performance when compared to logistic regression, support vector machines, and random forests (Methods; Additional file 1: Figure S4), thereby proving its utility over other machine-learning classifiers.
Next, by finding which input features were the most informative in the classification process (Methods), we rank-ordered all features based upon their scaled importance as determined by the neural network. We found that the top-ranked features were mainly composed of taxonomic and functional components from gut microbiome data (Fig. 5c). Of note, the top five important features were the Sucrose Degradation III pathway, Parabacteroides sp. D25 (species), Roseburia (genus), Fatty Acid and beta-oxidation II pathway, and Biotin Biosynthesis I pathway. Interestingly, data from clinical and demographic characteristics were ranked much lower: the highest ranked non-microbiome feature was related to the use of csDMARDs, which was ranked 78th (out of 448) in regard to feature importance, followed by sex (female), which was ranked 87th. Hence, microbiome features were deemed to be more important to the neural network classifier in predicting the likelihood of clinical improvement.
Surprisingly, the highest ranked (gut microbiome) features of importance were not differentially abundant between the two MCII groups (P > 0.05, MLRM; Additional file 1: Figure S5). This seemingly counterintuitive result implies that the most important features to the neural network are not necessarily required to have significant associations with the target variable, that is, MCII status; rather, a nonlinear combination and complex arrangement of the highly ranked features may assert strong predictive power. Alternatively, weakly associated features in unison were actually regarded as important for the supervised classification task.
Having shown that gut microbiome data can be used to predict whether (or not) a patient will show MCII, we developed another neural network model to evaluate how well the aforementioned predictor variables can predict CDAI (Fig. 5d; Methods). The direct prediction of a clinical disease activity score using the gut microbiome has yet to be performed in any chronic disease, although a previous study by Tedjo et al. used a Random Forests classifier with operational taxonomic units (OTUs) of the gut microbiome in Crohn’s Disease to differentiate between active disease and remission . By using a leave-one-patient-out cross-validation scheme, wherein predictions in each cross-validation loop were made on both samples from a single left-out patient (Methods), we found that our neural network achieved a moderate, yet significant, correlation between observed (actual) and predicted CDAI (Spearman’s ρ = 0.37, P = 0.003; Fig. 5e). Interestingly, the predicted CDAI fits a lower slope compared to the slope of an exact match between observed and predicted values. CDAI beyond ~ 15 were under-predicted, whereas CDAI below ~ 15 were over-predicted; this threshold could possibly indicate a breakpoint at which our model exhibits different relationships between the response and predictor variables. In summary, the gut microbiome shows promise as a non-invasive screening tool for predicting clinical improvement and perhaps also for monitoring RA disease activity.
To the best of our knowledge, this is the first study to date that uses shotgun metagenomic sequencing of stool to investigate the ties between the gut microbiome and MCII in RA disease activity independent of the initial measurement of conditions or prior treatment. This study addresses the following key questions: What are the distinct microbes and functions that define gut ecologies in patients who achieve MCII compared to patients who do not? Are these specific gut microbiome “signatures” predictive of MCII? Or in other words, how well does the gut microbiome forecast the trajectory of RA disease activity irrespective of prior clinical course? To this end, we compared the baseline gut microbiome compositions between RA patients who eventually showed improvement in disease activity and those who did not. First, we found that the status of MCII is significantly associated with the variation in gut microbiome community composition. Next, a more detailed examination of baseline gut microbiomes allowed us to identify higher levels of alpha-diversity (which is often associated with good health) and beta-diversity in the MCII+ group (i.e., patients who achieved clinical improvement) than in the MCII− group (i.e., patients who did not achieve clinical improvement). Additionally, we identified several microbial taxa and microbiome-derived MetaCyc biochemical pathways as differentially abundant between the two MCII patient groups. Furthermore, we observed several taxa and pathways as having significant differences in fold-change (from baseline to follow-up) between the two patient groups. Lastly, we demonstrate that the integration of gut microbiome and machine-learning technology could theoretically be an avenue for the prediction of disease course in RA. More specifically, by incorporating baseline microbiome, clinical, and demographic data into a deep-learning neural network, we were able to effectively classify patients into their MCII+ or MCII group, thereby allowing us to forecast MCII in patients with RA. With further development, such prognostic biomarkers could identify patients who will achieve MCII with a given therapy earlier on, thereby sparing them the expense and risk of other therapies that are less likely to be effective. Conversely, such tools can detect patients whose disease symptoms are less likely to improve, and perhaps allow clinicians to target and monitor them more closely. In all, our proof-of-concept study targets a significant unmet medical need in RA, and demonstrates the utility of the gut microbiome for the precision medicine era.
We identified several microbial taxa at baseline, including Coprococcus, Bilophila sp. 4_1_30, and Prevotellaceae, to have significantly different relative abundances between the MCII+ and MCII patient groups, even after controlling for demographic and clinical confounders. Coprococcus was found to be relatively higher in the MCII+ group compared to the MCII group. Microorganisms of this genus are known to produce butyrate, which is known for its anti-inflammatory effects [57,58,59,60,61,62,63]. For example, a study in mice showed that butyrate can suppress inflammation by inhibiting histone deacetylases (HDACs) in bone marrow cells . Previously, the administration of an HDAC inhibitor in vivo was found to promote the production and suppressive function of Foxp3+ regulatory T (Treg) cells . The anti-inflammatory effect of butyrate was also shown in Staphylococcus aureus cell-stimulated human monocytes, to which adding butyrate led to a reduction and increase of proinflammatory cytokine IL-12 and anti-inflammatory cytokine IL-10, respectively . In addition, Bilophila sp. 4_1_30 was found to be higher in patients of the MCII+ group. The role of Bilophila species in inflammatory or auto-immune diseases is not yet fully understood. A couple of studies have shown the positive association of Bilophila species (in particular B. wadsworthia) with pro-inflammatory immune responses [65, 66], while another study has shown that Bilophila species have negative associations with LPS-induced, TNFɑ-mediated immune responses in whole blood peripheral blood mononuclear cells . Lastly, Prevotellaceae was also found to have greater abundance in the MCII+ group. Some species in this family are known for their pro-inflammatory effects [14, 68]; therefore, this observation possibly suggests that host immune responses to Prevotellaceae are specific to particular species and/or strains .
At baseline, 26 of the total 32 patients were on antifolate drugs (methotrexate and/or sulfasalazine). In particular, methotrexate is a folate pathway antagonist known to competitively inhibit dihydrofolate reductase (DHFR), which participates in tetrahydrofolate (THF) biosynthesis . Interestingly, in our study, microbial biochemical pathways involved in tetrahydrofolate biosynthesis at baseline were found to be more abundant in patients of the MCII+ group (Fig. 3e). Although it is yet unclear as to why THF biosynthesis pathways were more abundant in the gut of RA patients who eventually obtained clinical improvement, the elevated presence of these pathways may be possibly linked to a protective role in patient outcome.
In addition to baseline differences in microbial taxa between the MCII+ and MCII− groups, we observed differences in the abundances of fifteen biochemical pathways at baseline. Ten of these differentially abundant pathways are involved in the biosynthesis of amino acids, such as arginine, methionine, and ornithine. All four pathways involved in methionine biosynthesis were found to be more abundant in the MCII+ group. Interestingly, dietary supplementation with high levels of methionine has been shown to attenuate arthritis severity in arthritic rats, and also to increase levels of serum Insulin-like Growth Factor-1 (IGF-I) , and to this point, IGF-I was previously found to be significantly lower in female patients with RA than in controls . Alternatively, all four arginine biosynthesis pathways were of lower abundance in the MCII+ group. A recently published study has shown that restriction of arginine improves outcome in multiple murine arthritis models by controlling the metabolism and formation of multi-nuclear giant cells .
Patients of the MCII+ and MCII− groups exhibited significantly different fold-changes from baseline to follow-up visit in eight microbial taxa, including Bacteroides vulgatus, Coprococcus, and Ruminococcus, and in seven MetaCyc biochemical pathways, including L-rhamnose degradation I, GDP-mannose Biosynthesis, and Superpathway of Aromatic Amino Acid Biosynthesis (Fig. 4). These differences in fold-changes of microbiome features (taxa/pathways) are likely effects of a complex combination of a number of factors, which could possibly include the use of certain medications. Indeed, several studies have shown that pharmaceutical drugs can be a modulator of gut microbiome composition and metabolic activity [74,75,76]. In this regard, a recently published study demonstrated that treatment with methotrexate (MTX) in RA patients induced compositional changes in members of the gut microbiota, such as Bacteroidetes, Lachnoclostridium, Collinsella aerofaciens, Dielma fastidiosa, and Prevotella copri, alongside the reduction in multiple immune cell types, which include activated T cells, IFN-γ+ T cells, myeloid cells, and B cells . Along these lines, the use of csDMARDs (which includes methotrexate) was found to be significantly associated with gut microbiome composition in our PERMANOVA analysis. Collectively, our results could implicate various aspects of the gut microbiome with improvement in chronic, debilitating symptoms in RA, raising the interesting possibility of intervening on these markers, e.g., introducing specific desirable bacterial strains into the gut or targeting microbial metabolic pathways as a basis for therapeutic intervention.
Several limitations should be acknowledged when interpreting our results. First and foremost, the relatively small sample size used in our study limits the generalization of the findings to a broader range of RA conditions. It was beyond the scope of this retrospective, observational cohort study to restrict the time of follow-up between clinical visits, leading to variability in the duration of follow-up. While this study is the first to associate gut microbiome signatures with MCII in RA, we do note that our results were derived from a pilot cohort of 32 patients; therefore, conducting more analyses and validation on larger cohorts with pre-specified clinical endpoints is the crucial next step to strengthen and confirm our findings. Second, our results could be influenced by confounders inherent to our cohort of patients. We do acknowledge that there may be geographical/cultural biases in our results, since the patients included in this study are mostly from the midwest region of the United States. Our statistical methods to identify associations between the gut microbiome and MCII were controlled for age, sex, smoking status, follow-up duration, and medication use. However, dietary habits were not assessed, which is a variable well known to influence the composition of the gut microbiome [78, 79]. Importantly, we were not able to statistically control for patient BMI, as current height and weight were found to be missing in several patient records. Of note, obesity is not only strongly tied to gut microbiome [80,81,82], but also known as a prognostic factor in RA. More specifically, patients with obesity have been found to be less likely to respond to disease-modifying therapy . How much BMI plays a role in shaping the current results will be addressed in our future studies. Third, we lose most of the significant (P < 0.05) “hits” found using the MLRMs after Benjamini-Hochberg correction, which could be attributed to a number of factors: (i) lack of strong separation (in gut microbiome) between two study groups having the same disease diagnosis; (ii) comparatively small sample sizes; and (iii) controlling for several potentially confounding factors simultaneously. Fourth, as is often the case in retrospective cohort studies, we cannot completely eliminate the possibility of patient selection bias. For example, patients may not elect to return for a follow-up visit depending on a certain disease severity. Additionally, among the patients whose clinical samples were available in our biobank, some clinical/demographic data were incomplete for both time-points. Such reasons result in exclusion of these patients from our study, and therefore may bias the type of patients who were analyzed. Fifth, all descriptions of annotated biochemical pathways of the gut microbiome allude to functional potential, that is, functional possibilities derived from genetic content. We did not employ transcriptomics or proteomics technologies to assess enzyme abundances; metabolomics to detect small-molecules, or cellular assays to determine metabolic flux. However, these are all promising methods that we can later use to obtain much richer insight into how microbial metabolism affects RA disease course. Sixth, clearly our study cannot provide causal mechanisms underlying the associations between the gut microbiome and MCII in RA disease activity. However, a closer investigation on particular microbial taxa or microbiome-derived pathways identified in our study may provide a promising launchpad for future studies delving into specifically how alterations in the gut microbiome influence RA-associated changes in human physiology or in systemic, chronic inflammation. Seventh, all predictions regarding the MCII patient group and CDAI were performed in cross-validation on the original discovery cohort. It remains to be seen how well the robustness of our prediction models will hold up when demonstrated on an independent validation cohort once available. Finally, although we found that the gut microbiome is surprisingly predictive of MCII, our study is limited by the fact that we collected stool samples and assessed patients’ disease activity at only two time-points. It could be possible that associations between gut microbiome and MCII may not persist past the second visit. Surely, future studies extending this current work will need to entail having larger cohorts, patients with new-onset RA, and several longitudinal sample collections, while considering more potentially confounding factors (e.g., geography, race/ethnicity, diet, and lifestyle).
Several aspects of the gut microbiome are associated with future prognosis in RA, providing motivation for further studies on the effect of intestinal microflora and various patient factors on autoimmune response and clinical course. Additionally, shotgun metagenomic sequencing of microbial communities in stool samples can serve as an effective and reliable predictor of whether patients with RA will achieve clinically important improvement in disease activity. Ultimately, we expect our work to be one cornerstone for a suite of new, omics data-based clinical tools to aid in early detection, diagnosis, prognosis, and treatment in RA [84, 85]. Looking ahead, possible solutions to treat chronic auto-immune or inflammatory diseases could well involve modifying the gut microbiome to an ecological state primed to enhance clinical outcome.
Availability of data and materials
Sequencing data for stool metagenomes used in this study have been deposited at NCBI’s Sequence Read Archive (SRA) data repository (BioProject number PRJNA598446  and PRJNA687957 ) and can be downloaded without any restrictions. The deposited sequences include .fastq files for 64 stool metagenomes collected from 32 patients with rheumatoid arthritis. Human reads were identified and removed prior to data upload.
Biologic disease-modifying anti-rheumatic drugs
Clinical disease activity index
Conventional synthetic disease-modifying anti-rheumatic drugs
Disease Activity Score
Health Assessment Questionnaire Disability Index
Minimum clinically important improvement
Permutational multivariate analysis of variance
Simplified disease activity index
Swollen joint count
Tender joint count
Smolen JS, Aletaha D, Barton A, Burmester GR, Emery P, Firestein GS, et al. Rheumatoid arthritis. Nature Reviews Disease Primers. Nat Publishing Group. 2018;4:1–23.
Sparks JA. Rheumatoid Arthritis. Ann Int Med. 2019;170:ITC1.
MacGregor AJ, Snieder H, Rigby AS, Koskenvuo M, Kaprio J, Aho K, et al. Characterizing the quantitative genetic contribution to rheumatoid arthritis using data from twins. Arthritis Rheum. 2000;43:30–7.
Silman AJ, MacGregor AJ, Thomson W, Holligan S, Carthy D, Farhan A, et al. Twin concordance rates for rheumatoid arthritis: results from a nationwide study. Br J Rheumatol. 1993;32:903–7.
Aho K, Koskenvuo M, Tuominen J, Kaprio J. Occurrence of rheumatoid arthritis in a nationwide series of twins. J Rheumatol. 1986;13:899–902.
Zhernakova A, Stahl EA, Trynka G, Raychaudhuri S, Festen EA, Franke L, et al. Meta-analysis of genome-wide association studies in celiac disease and rheumatoid arthritis identifies fourteen non-HLA shared loci. PLoS Genet. 2011;7:e1002004.
Stahl EA, Raychaudhuri S, Remmers EF, Xie G, Eyre S, Thomson BP, et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet. 2010;42:508–14.
Stolt P, Bengtsson C, Nordmark B, Lindblad S, Lundberg I, Klareskog L, et al. Quantification of the influence of cigarette smoking on rheumatoid arthritis: results from a population based case-control study, using incident cases. Ann Rheum Dis. 2003;62:835–41.
Edwards CJ, Goswami R, Goswami P, Syddall H, Dennison EM, Arden NK, et al. Growth and infectious exposure during infancy and the risk of rheumatoid factor in adult life. Ann Rheum Dis. 2006;65:401–4.
Zhang X, Zhang D, Jia H, Feng Q, Wang D, Liang D, et al. The oral and gut microbiomes are perturbed in rheumatoid arthritis and partly normalized after treatment. Nat Med. 2015;21:895–905.
Maeda Y, Takeda K. Host–microbiota interactions in rheumatoid arthritis. Exp Mol Med. 2019;51:1–6.
Wu H-J, Ivanov II, Darce J, Hattori K, Shima T, Umesaki Y, et al. Gut-residing segmented filamentous bacteria drive autoimmune arthritis via T helper 17 cells. Immunity. 2010;32:815–27.
Chen J, Wright K, Davis JM, Jeraldo P, Marietta EV, Murray J, et al. An expansion of rare lineage intestinal microbes characterizes rheumatoid arthritis. Genome Med. 2016;8:43.
Scher JU, Sczesnak A, Longman RS, Segata N, Ubeda C, Bielski C, et al. Expansion of intestinal Prevotella copri correlates with enhanced susceptibility to arthritis. Elife. 2013;2:e01202.
Maeda Y, Kurakawa T, Umemoto E, Motooka D, Ito Y, Gotoh K, et al. Dysbiosis Contributes to Arthritis Development via Activation of Autoreactive T Cells in the Intestine. Arthritis Rheumatol. 2016;68:2646–61.
Rogier R, Ederveen THA, Boekhorst J, Wopereis H, Scher JU, Manasson J, et al. Aberrant intestinal microbiota due to IL-1 receptor antagonist deficiency promotes IL-17- and TLR4-dependent arthritis. Microbiome. 2017;5:63.
Vaahtovuo J, Munukka E, Korkeamäki M, Luukkainen R, Toivanen P. Fecal microbiota in early rheumatoid arthritis. J Rheumatol. 2008;35:1500–5.
Liu X, Zou Q, Zeng B, Fang Y, Wei H. Analysis of fecal Lactobacillus community structure in patients with early rheumatoid arthritis. Curr Microbiol. 2013;67:170–6.
Mandel DR, Eichas K, Holmes J. Bacillus coagulans: a viable adjunct therapy for relieving symptoms of rheumatoid arthritis according to a randomized, controlled trial. BMC Complement Altern Med. 2010;10:1.
Vaghef-Mehrabany E, Alipour B, Homayouni-Rad A, Sharif S-K, Asghari-Jafarabadi M, Zavvari S. Probiotic supplementation improves inflammatory status in patients with rheumatoid arthritis. Nutrition. 2014;30:430–5.
So J-S, Kwon H-K, Lee C-G, Yi H-J, Park J-A, Lim S-Y, et al. Lactobacillus casei suppresses experimental arthritis by down-regulating T helper 1 effector functions. Mol Immunol. 2008;45:2690–9.
Picchianti-Diamanti A, Panebianco C, Salemi S, Sorgi ML, Di Rosa R, Tropea A, et al. Analysis of Gut Microbiota in Rheumatoid Arthritis Patients: Disease-Related Dysbiosis and Modifications Induced by Etanercept. Int J Mol Sci. 2018;19:2938.
Marietta EV, Murray JA, Luckey DH, Jeraldo PR, Lamba A, Patel R, et al. Suppression of Inflammatory Arthritis by Human Gut-Derived Prevotella histicola in Humanized Mice. Arthritis Rheumatol. 2016;68:2878–88.
Curtis JR, Yang S, Chen L, Pope JE, Keystone EC, Haraoui B, et al. Determining the Minimally Important Difference in the Clinical Disease Activity Index for Improvement and Worsening in Early Rheumatoid Arthritis Patients. Arthritis Care Res. 2015;67:1345–53.
Curtis JR, Churchill M, Kivitz A, Samad A, Gauer L, Gervitz L, et al. A Randomized Trial Comparing Disease Activity Measures for the Assessment and Prediction of Response in Rheumatoid Arthritis Patients Initiating Certolizumab Pegol. Arthritis Rheumatol. 2015;67:3104–12.
Anderson J, Caplan L, Yazdany J, Robbins ML, Neogi T, Michaud K, et al. Rheumatoid arthritis disease activity measures: American College of Rheumatology recommendations for use in clinical practice. Arthritis Care Res. 2012;64:640–7.
Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25:44–56.
Winslow RL, Trayanova N, Geman D, Miller MI. Computational medicine: translating models to clinical care. Sci Transl Med. 2012;4:158rv11.
Shameer K, Badgeley MA, Miotto R, Glicksberg BS, Morgan JW, Dudley JT. Translational bioinformatics in the era of real-time biomedical, health care and wellness data streams. Brief Bioinform. 2017;18:105–24.
Giloteaux L, Goodrich JK, Walters WA, Levine SM, Ley RE, Hanson MR. Reduced diversity and altered composition of the gut microbiome in individuals with myalgic encephalomyelitis/chronic fatigue syndrome. Microbiome. 2016;4:30.
Baxter NT, Ruffin MT 4th, Rogers MAM, Schloss PD. Microbiota-based model improves the sensitivity of fecal immunochemical test for detecting colonic lesions. Genome Med. 2016;8:37.
Saulnier DM, Riehle K, Mistretta T-A, Diaz M-A, Mandal D, Raza S, et al. Gastrointestinal microbiome signatures of pediatric patients with irritable bowel syndrome. Gastroenterology. 2011;141:1782–91.
Olivares M, Walker AW, Capilla A, Benítez-Páez A, Palau F, Parkhill J, et al. Gut microbiota trajectory in early life may predict development of celiac disease. Microbiome. 2018;6:36.
Gupta VK, Kim M, Bakshi U, Cunningham KY, Davis JM 3rd, Lazaridis KN, et al. A predictive index for health status using species-level gut microbiome profiling. Nat Commun. 2020;11:4635.
Schirmer M, Denson L, Vlamakis H, Franzosa EA, Thomas S, Gotman NM, et al. Compositional and Temporal Changes in the Gut Microbiome of Pediatric Ulcerative Colitis Patients Are Linked to Disease Course. Cell Host Microbe. 2018;24:600–10.e4.
Liu H, Chen X, Hu X, Niu H, Tian R, Wang H, et al. Alterations in the gut microbiome and metabolism with coronary artery disease severity. Microbiome. 2019;7:68.
Takewaki D, Suda W, Sato W, Takayasu L, Kumar N, Kimura K, et al. Alterations of the gut ecological and functional microenvironment in different stages of multiple sclerosis. Proc Natl Acad Sci U S A. 2020;117:22402–12.
Ananthakrishnan AN, Luo C, Yajnik V, Khalili H, Garber JJ, Stevens BW, et al. Gut Microbiome Function Predicts Response to Anti-integrin Biologic Therapy in Inflammatory Bowel Diseases. Cell Host Microbe. 2017;21:603–10.e3.
Heshiki Y, Vazquez-Uribe R, Li J, Ni Y, Quainoo S, Imamovic L, et al. Predictable modulation of cancer treatment outcomes by the gut microbiota. Microbiome. 2020;8:28.
Metwaly A, Dunkel A, Waldschmitt N, Raj ACD, Lagkouvardos I, Corraliza AM, et al. Integrated microbiota and metabolite profiles link Crohn’s disease to sulfur metabolism. Nat Commun. 2020;11:4322.
Zhou Y, Xu ZZ, He Y, Yang Y, Liu L, Lin Q, et al. Gut Microbiota Offers Universal Biomarkers across Ethnicity in Inflammatory Bowel Disease Diagnosis and Infliximab Response Prediction. mSystems. 2018;3:e00188–17.
Khanna S, Montassier E, Schmidt B, Patel R, Knights D, Pardi DS, et al. Gut microbiome predictors of treatment response and recurrence in primary Clostridium difficile infection. Aliment Pharmacol Ther. 2016;44:715–27.
Zeevi D, Korem T, Zmora N, Israeli D, Rothschild D, Weinberger A, et al. Personalized Nutrition by Prediction of Glycemic Responses. Cell. 2015;163:1079–94.
Korem T, Zeevi D, Zmora N, Weissbrod O, Bar N, Lotan-Pompan M, et al. Bread Affects Clinical Parameters and Induces Gut Microbiome-Associated Personal Glycemic Responses. Cell Metab. 2017;25:1243–53.
Suez J, Shapiro H, Elinav E. Role of the microbiome in the normal and aberrant glycemic response. Clin Nutr Exp. 2016;6:59–73.
Huang S, Haiminen N, Carrieri A-P, Hu R, Jiang L, Parida L, et al. Human Skin, Oral, and Gut Microbiomes Predict Chronological Age. mSystems. 2020;5:e00630–19.
Artacho A, Isaac S, Nayak R, Flor-Duro A, Alexander M, Koo I, et al. The Pre-treatment Gut Microbiome is Associated with Lack of Response to Methotrexate in New Onset Rheumatoid Arthritis. Arthritis Rheumatol. 2020;10:41622.
Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham CO 3rd, et al. 2010 Rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Arthritis Rheum. 2010;62:2569–81.
Canhão H, Rodrigues AM, Gregório MJ, Dias SS, Melo Gomes JA, Santos MJ, et al. Common Evaluations of Disease Activity in Rheumatoid Arthritis Reach Discordant Classifications across Different Populations. Front Med. 2018;5:40.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Truong DT, Franzosa EA, Tickle TL, Scholz M, Weingart G, Pasolli E, et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods. 2015;12:902–3.
Franzosa EA, McIver LJ, Rahnavard G, Thompson LR, Schirmer M, Weingart G, et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods. 2018;15:962–8.
McArdle BH, Anderson MJ. Fitting multivariate models to community data: a comment on distance-based redundancy analysis. Ecology. 2001;82:290–7.
Gedeon TD. Data mining of inputs: Analysing magnitude and functional measures. Int J Neural Syst. 1997;8:209–18.
Tedjo DI, Smolinska A, Savelkoul PH, Masclee AA, van Schooten FJ, Pierik MJ, et al. The fecal microbiota as a biomarker for disease activity in Crohn’s disease. Sci Rep. 2016;6:35216.
Louis P, Flint HJ. Diversity, metabolism and microbial ecology of butyrate-producing bacteria from the human large intestine. FEMS Microbiol Lett. 2009;294:1–8.
Kim DS, Da Som K, Kwon J-E, Lee SH, Kim EK, Ryu J-G, et al. Attenuation of Rheumatoid Inflammation by Sodium Butyrate Through Reciprocal Targeting of HDAC2 in Osteoclasts and HDAC8 in T Cells. Front Immunol. 2018;9:1525.
Säemann MD, Böhmig GA, Osterreicher CH, Burtscher H, Parolini O, Diakos C, et al. Anti-inflammatory effects of sodium butyrate on human monocytes: potent inhibition of IL-12 and up-regulation of IL-10 production. FASEB J. 2000;14:2380–2.
Cleophas MCP, Ratter JM, Bekkering S, Quintin J, Schraa K, Stroes ES, et al. Effects of oral butyrate supplementation on inflammatory potential of circulating peripheral blood mononuclear cells in healthy and obese males. Sci Rep. 2019;9:775.
Segain JP, de la Blétière DR, Bourreille A, Leray V, Gervois N, Rosales C, et al. Butyrate inhibits inflammatory responses through NFkappaB inhibition: implications for Crohn’s disease. Gut. 2000;47:397–403.
Liu T, Li J, Liu Y, Xiao N, Suo H, Xie K, et al. Short-chain fatty acids suppress lipopolysaccharide-induced production of nitric oxide and proinflammatory cytokines through inhibition of NF-κB pathway in RAW264.7 cells. Inflammation. 2012;35:1676–84.
Park J-S, Lee E-J, Lee J-C, Kim W-K, Kim H-S. Anti-inflammatory effects of short chain fatty acids in IFN-γ-stimulated RAW 264.7 murine macrophage cells: Involvement of NF-κB and ERK signaling pathways. Int Immunopharmacol. 2007;7:70–7.
Tao R, de Zoeten EF, Ozkaynak E, Chen C, Wang L, Porrett PM, et al. Deacetylase inhibition promotes the generation and function of regulatory T cells. Nat Med. 2007;13:1299–307.
Natividad JM, Lamas B, Pham HP, Michel M-L, Rainteau D, Bridonneau C, et al. Bilophila wadsworthia aggravates high fat diet induced metabolic dysfunctions in mice. Nat Commun. 2018;9:2802.
Devkota S, Wang Y, Musch MW, Leone V, Fehlner-Peach H, Nadimpalli A, et al. Dietary-fat-induced taurocholic acid promotes pathobiont expansion and colitis in Il10−/− mice. Nature. 2012;487:104–8.
Schirmer M, Smeekens SP, Vlamakis H, Jaeger M, Oosting M, Franzosa EA, et al. Linking the Human Gut Microbiome to Inflammatory Cytokine Production Capacity. Cell. 2016;167:1897.
Iljazovic A, Roy U, Gálvez EJC, Lesker TR, Zhao B, Gronow A, et al. Perturbation of the gut microbiome by Prevotella spp. enhances host susceptibility to mucosal inflammation. Mucosal Immunol. 2020;13:s41385-020-0296-4.
Bodkhe R, Balakrishnan B, Taneja V. The role of microbiome in rheumatoid arthritis treatment. Ther Adv Musculoskelet Dis. 2019;11:1759720X19844632.
Visentin M, Zhao R, Goldman ID. The antifolates. Hematol Oncol Clin North Am. 2012;26:629–48.
Li M, Zhai L, Wei W. High-Methionine Diet Attenuates Severity of Arthritis and Modulates IGF-I Related Gene Expressions in an Adjuvant Arthritis Rats Model. Mediat Inflammation. 2016;2016:1–6.
Matsumoto T, Tsurumoto T. Inappropriate serum levels of IGF-I and IGFBP-3 in patients with rheumatoid arthritis. Rheumatology. 2002;41:352–3.
Brunner JS, Vulliard L, Hofmann M, Kieler M, Lercher A, Vogel A, et al. Environmental arginine controls multinuclear giant cell metabolism and formation. Nat Commun. 2020;11:431.
Maier L, Pruteanu M, Kuhn M, Zeller G, Telzerow A, Anderson EE, et al. Extensive impact of non-antibiotic drugs on human gut bacteria. Nature. 2018;555:623–8.
Rogers MAM, Aronoff DM. The influence of non-steroidal anti-inflammatory drugs on the gut microbiome. Clin Microbiol Infect. 2016;22:178.e1–9.
Vich Vila A, Collij V, Sanna S, Sinha T, Imhann F, Bourgonje AR, et al. Impact of commonly used drugs on the composition and metabolic function of the gut microbiota. Nat Commun. 2020;11:362.
Nayak RR, Alexander M, Deshpande I, Stapleton-Gray K, Rimal B, Patterson AD, et al. Methotrexate impacts conserved pathways in diverse human gut bacteria leading to decreased host immune activation. Cell Host Microbe. 2021;29:362–77.e11.
Zmora N, Suez J, Elinav E. You are what you eat: diet, health and the gut microbiota. Nat Rev Gastroenterol Hepatol. 2019;16:35–56.
Singh RK, Chang H-W, Yan D, Lee KM, Ucmak D, Wong K, et al. Influence of diet on the gut microbiome and implications for human health. J Transl Med. 2017;15:73.
Zhao L. The gut microbiota and obesity: from correlation to causality. Nat Rev Microbiol. 2013;11:639–47.
Cani PD. Gut microbiota and obesity: lessons from the microbiome. Brief Funct Genomics. 2013;12:381–7.
Haro C, Rangel-Zúñiga OA, Alcalá-Díaz JF, Gómez-Delgado F, Pérez-Martínez P, Delgado-Lista J, et al. Intestinal Microbiota Is Influenced by Gender and Body Mass Index. PLoS One. 2016;11:e0154090.
Liu Y, Hazlewood GS, Kaplan GG, Eksteen B, Barnabe C. Impact of Obesity on Remission and Disease Activity in Rheumatoid Arthritis: A Systematic Review and Meta-Analysis. Arthritis Care Res. 2017;69:157–65.
Hur B, Gupta VK, Huang H, Wright KA, Warrington KJ, Taneja V, et al. Plasma metabolomic profiling in patients with rheumatoid arthritis identifies biochemical features predictive of quantitative disease activity. Arthritis Res Ther. 2021;23:164.
Mucke J, Sewerin P, Schneider M. Rheumatology in 2049: the age of all data. Ann Rheum Dis. 2021;80:825–7.
Gupta VK, Kim M, Bakshi U, Cunningham KY, Davis III JM, Lazaridis KN, et al. BioProject PRJNA598446: A Predictive Index for Health Status Using Species-level Gut Microbiome Profiling. NCBI Sequence Read Archive (SRA). 2020. https://www.ncbi.nlm.nih.gov/bioproject/PRJNA598446.
Gupta VK, Cunningham KY, Bakshi U, Hur B, Huang H, Warrington KJ, et al. BioProject PRJNA687957: Gut Microbial Determinants of Clinically Important Improvement in Patients with Rheumatoid Arthritis. NCBI Sequence Read Archive (SRA). 2020. https://www.ncbi.nlm.nih.gov/bioproject/PRJNA687957.
First and foremost, we thank our dear patients who volunteered for this study. We also thank the Mayo Clinic Division of Rheumatology study coordinators (Jennifer Sletten and Kathleen McCarthy-Fruin) and the Mayo Clinic Medical Genome Facility staff members (Julie Lau, Jeffrey Meyer, and Bruce Eckloff) for making this work possible.
This work was supported in part by the Mayo Clinic Center for Individualized Medicine (to V.K.G., K.Y.C., U.B., B.H., and J.S.), and Mark E. and Mary A. Davis to Mayo Clinic Center for Individualized Medicine (J.M.D., J.S.).
Ethics approval and consent to participate
Biospecimen collection and study design were approved by the Mayo Clinic Institutional Review Board (#14-000616). Written informed consent was obtained from all participants in the study, and the study design complies with the Declaration of Helsinki.
Consent for publication
J.M.D. has a research grant from Pfizer. The remaining authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Stacked bar-plots showing the distribution of relative abundances of taxonomic ranks detected in baseline gut microbiomes. Figure S2. Differences in gut microbiome features between MCII patient groups at follow-up visit. Figure S3. Microbial taxa and biochemical pathways whose change in relative abundance from baseline to follow-up vary differently between MCII patient groups. Figure S4. Performance evaluation of three different classifiers to predict MCII status. Figure S5. Relative abundances of the top 10 highest-ranked features in the deep-learning neural network model.
About this article
Cite this article
Gupta, V.K., Cunningham, K.Y., Hur, B. et al. Gut microbial determinants of clinically important improvement in patients with rheumatoid arthritis. Genome Med 13, 149 (2021). https://doi.org/10.1186/s13073-021-00957-0