Skip to main content


Microbiomic differences in tumor and paired-normal tissue in head and neck squamous cell carcinomas

Article metrics



While the role of the gut microbiome in inflammation and colorectal cancers has received much recent attention, there are few data to support an association between the oral microbiome and head and neck squamous cell carcinomas. Prior investigations have been limited to comparisons of microbiota obtained from surface swabs of the oral cavity. This study aims to identify microbiomic differences in paired tumor and non-tumor tissue samples in a large group of 121 patients with head and neck squamous cell carcinomas and correlate these differences with clinical-pathologic features.


Total DNA was extracted from paired normal and tumor resection specimens from 169 patients; 242 samples from 121 patients were included in the final analysis. Microbiomic content of each sample was determined using 16S rDNA amplicon sequencing. Bioinformatic analysis was performed using QIIME algorithms. F-testing on cluster strength, Wilcoxon signed-rank testing on differential relative abundances of paired tumor-normal samples, and Wilcoxon rank-sum testing on the association of T-stage with relative abundances were conducted in R.


We observed no significant difference in measures of alpha diversity between tumor and normal tissue (Shannon index: p = 0.13, phylogenetic diversity: p = 0.42). Similarly, although we observed statistically significantly differences in both weighted (p = 0.01) and unweighted (p = 0.04) Unifrac distances between tissue types, the tumor/normal grouping explained only a small proportion of the overall variation in the samples (weighted R2 = 0.01, unweighted R2 < 0.01).

Notably, however, when comparing the relative abundances of individual taxa between matched pairs of tumor and normal tissue, we observed that Actinomyces and its parent taxa up to the phylum level were significantly depleted in tumor relative to normal tissue (q < 0.01), while Parvimonas was increased in tumor relative to normal tissue (q = 0.01). These differences were more pronounced among patients with more extensive disease as measured by higher T-stage.


Matched pairs analysis of individual tumor-normal pairs revealed significant differences in relative abundance of specific taxa, namely in the genus Actinomyces. These differences were more pronounced among patients with higher T-stage. Our observations suggest further experiments to interrogate potential novel mechanisms relevant to carcinogenesis associated with alterations of the oral microbiome that may have consequences for the human host.


Interactions between microbes and carcinogenesis within the host have been studied for decades. The best example is in the context of a single microorganism: Helicobacter pylori and its proven link with gastric cancer [1]. At the other end of the spectrum, and more recently, Fusobacterium nucleatum was described in the gut of those with advanced colorectal cancer [2, 3]. Subsequent functional studies demonstrated F. nucleatum to be capable of both upregulating inflammatory and oncogenic pathways in colon cancer cell lines [4] and inducing adenomas in mice [5]. The bacterial microbiome, defined as the total collection of bacteria that inhabit any environmental niche, has been increasingly recognized as an active participant in human body functions and proposed to be an organ in its own right. At a basic level, we have long understood that the microbiome serves to maintain homeostasis. Dysbiosis, or disruption of the normal flora, can result in pathogenic overgrowth of organisms including Clostridium and Candida in the gastrointestinal and vaginal tracts, respectively [68]. Similarly, the oral microbiome has long been studied in the context of dental caries: ingestion of excessive carbohydrates promotes overgrowth of acidogenic and acidophilic microbes, inducing a local drop in pH, demineralization of enamel, and subsequent breakdown of tooth [9, 10].

While investigation of the association between microbial dysbiosis and colorectal cancer is progressing at a rapid pace, the study of the bacterial microbiome in other areas of the gastrointestinal tract has lagged behind. In particular, head and neck squamous cell carcinomas (HNSCC), which account for more than half a million cancer cases annually around the world [11, 12], has received relatively little attention. This may be due to the fact that HNSCC is a heterogeneous disease entity, encompassing a variety of cancers from different disease sites, and develops from the mucosal linings of the upper aerodigestive tract, comprising: (1) the nasal cavity and paranasal sinuses; (2) the nasopharynx; (3) the oral cavity and oropharynx; and (4) the hypopharynx, larynx, and trachea [13]. Additionally, major risk factors such as smoking, alcohol consumption, and human papillomavirus infection have already been elucidated [13, 14].

However, recent studies have identified microbiomic shifts in the oral cavity associated with cigarette smoking, and in the gastrointestinal tract associated with alcohol consumption [15, 16]. Similar to the pathogenesis of dental caries, it is feasible that the microbiome helps transduce an environmental exposure into a carcinogenic effect. As there are few effective systemic therapies in HNSCC, and toxicity of local treatment is often significant due to the vital structures involved, identification of a microbial pathway to disease may offer new insights into targeted therapies and primary prevention.

Prior work investigating the microbiome of head and neck cancer, including the largest cohort that was previously reported by our group, provided descriptive evidence of the “in”vironment of the head and neck at higher-order taxa and suggested that microbial variation correlates with clinical outcomes and gene methylation status [17]. Smaller studies that have used superficial sampling of oral cavity cancers by means of oral swabs observed differences in taxonomic abundance between normal and tumor surfaces primarily at the phylum level [18, 19]. However, bacteria in the head and neck are clearly not limited to the mucosal surface, but in fact populate deep tissue [17, 20, 21]. These findings are supported by our prior pilot study as well as a recent investigation composed of 29 patients with exclusively laryngeal cancer that demonstrated phylum and genus-level changes in tumor relative to normal tissue [22]. However, the significance of the findings from these studies is unclear. Furthermore, the microbiome content of cancerous mucosal tissue compared to adjacent histologically normal tissue has not been examined outside of the setting of laryngeal cancer.

With increasing evidence that a rich community of bacteria exists within head and neck tissues and may contribute to carcinogenesis, we now seek to identify microbiomic differences between tumor and histologically normal tissue in a large cohort of patients with HNSCC of the oral cavity, oropharynx, hypopharynx, and larynx. In this study, we report on the largest human tissue microbiome study in HNSCC patients, with 16S ribosomal DNA (rDNA) amplicon sequencing of paired normal-tumor tissue samples from 121 unrelated participnts. With these data, we correlate whole microbiome communities of head and neck tissue with clinical outcome measures of HNSCC, in order to test the hypothesis that microbiomes either alter or have been altered by both the presence and extent of HNSCC.


Patient cohort and sample collection

From 2003 to 2014, consecutive HNSCC patients were enrolled into a tissue biorepository collection. The tissue banking protocol was designed specifically to maintain sterility for downstream microbiome analysis. All tissues banked were required to be collected from the oral cavity, oropharynx, hypopharynx, or larynx. In this registry, 169 individuals had available paired adjacent normal and tumor tissue. A total of 30–50 mg each of paired tumor and normal tissue, approximately 2 cm from the tumor edge, were sterilely collected in the operating room, classified via pathology review, flash frozen, and stored at –80 °C. Relevant clinicopathologic features were collected prospectively at the time of diagnosis. Tumor node metastases staging was determined for each primary tumor based on American Joint Committee on Cancer guidelines [23]. Missing data were filled in via retrospective chart review; individuals without available data were noted as such in Table 1.

Table 1 Demographics and clinical characteristics of patientsa

DNA extraction

Total DNA was extracted with modifications from a previously described protocol [17]. Bead homogenization of tissues was performed with a TissueLyser II (Qiagen, Venlo, The Netherlands). Also added was a yeast cell wall lysis step using the Masterpure Yeast DNA Purification kit (Epicentre, Madison, WI, USA) [24]. All beads, tubes, and non-enzymatic reagents were treated with ultraviolet light for at least 30 min prior to use [25]. Reagent controls were confirmed by 16S polymerase chain reaction (PCR) to be absent of contaminating bacteria.

16S rRNA gene sequencing

PCR of the V1–V4 hypervariable regions of the 16S rRNA gene was performed with previously published primers [17]. PCR was performed under the following conditions: 95 °C for 5 min, followed by 32 cycles of 95 °C for 1 min, 55 °C for 40 s, 70 °C for 80 s, and an extension of 72 °C for 10 min. PCR products were electrophoresed on a 1% agarose gel, purified using a Zymoclean DNA Gel Recovery kit (Zymo, Orange, CA, USA), and cloned into a StrataClone pSC vector (Agilent, Santa Clara, CA, USA) [17]. From an initial 169 pairs of patient samples, a total of 318 tissue samples from 159 distinct patients had positive 16S rDNA PCR product recovery. Ninety-five colonies were picked per tissue sample. Plasmid inserts were PCR amplified using standard T3/T7 primers, then Sanger sequenced (ABI3730xl, Life Technologies, Carlsbad, CA, USA).

Bioinformatic analysis

Reads were filtered for quality, trimmed, and compiled using a custom python script. Depth of coverage was set at 60 sequences or higher based on leveling off of the Shannon diversity index at 60 reads. Due to this cutoff, a total of 242 tissue samples from 121 distinct patients were included in the final analysis. Subsampled open-reference operational taxonomic unit (OTU) picking [26] against Greengenes (version 13.8) [27, 28] at 97% similarity threshold using UCLUST [29], alignment with PyNAST [30], phylogenetic tree construction using FastTree (version 2.1.3) [31], and subsequent computation of alpha (Shannon diversity index, phylogenetic diversity) [32, 33] and beta diversity measures (weighted and unweighted Unifrac distances) [34, 35] was performed using QIIME (version 1.9.1) [36].


Student’s t-tests and likelihood ratio tests were used to compare continuous and categorical demographics/clinical factors, respectively, between patient samples included in the final analysis and those excluded due to insufficient reads. Student’s t-tests were used to compare Shannon index and phylogenetic diversity between tumor and non-tumor samples at a sequencing depth of 60 with ten iterations per sample. Distance matrices of the tumor and non-tumor samples were compared using the Adonis statistical method [37]. This method is similar to non-parametric analysis of variance (ANOVA) and relies on F-tests based on sequential sums of squares derived from 1000 permutations on the weighted and unweighted UniFrac distance matrices, with the null hypothesis that there is no difference in community structure between groups. To compare relative abundances of taxa between matched tumor-normal pairs, we used the non-parametric two-sided Wilcoxon signed-rank test. To compare relative abundances of taxa between samples of different T-stages, we used the Wilcoxon rank-sum.

All analyses were conducted in JMP Pro 12 (SAS Institute Inc., Cary, NC, USA) or R version 3.2.2. All statistical tests were two-sided, with a p value < 0.05 or false discovery rate (FDR) adjusted q < 0.05 considered statistically significant. All graphs were created using the R package lattice [38]. The cladogram was created using GraPhlAn on Galaxy [39, 40].


HNSCC microbiomes are similar on a phylum-level to those in previous studies of human oral flora

We analyzed sterilely collected, paired fresh-frozen normal-tumor samples from 121 patients with HNSCC. These patients were not significantly different on any demographic or clinical factors when compared to the 38 patients excluded based on low read count (Table 1). The taxonomic composition of our HNSCC samples is similar to that identified in our previous pilot study of HNSCC [17], as well as with data from previously published studies on the human oral microbiome [4143]. Firmicutes is the predominant phylum, followed by Bacteroidetes and Proteobacteria, then by Fusobacteria and Actinobacteria, in both tumor and adjacent normal samples from HNSCC patients as well as in prior studies (Fig. 1). Phyla falling under 0.1% relative abundance in our dataset were not included in this analysis.

Fig. 1

Relative abundances of major phyla in the human oral microbiome. Bar plot of relative abundances of major phyla in the oral microbiome observed in this study and three previously published series. There were similar relative abundances of the most common phyla among tumor (orange) and adjacent normal (blue) tissue from this study. Additionally, these abundances were similar to previously published series describing the oral microbiome

HNSCC tumor and paired-normal tissue are not significantly different on measures of alpha or beta diversity

The average number of reads for the 242 patient samples in the final analysis was 83 ± 11 and did not differ between tumor (84 ± 13) and normal (83 ± 7) samples (p = 0.48). The average read length was 745 ± 117. To determine whether overall mean diversity was different in tumor and adjacent normal tissue of HNSCC patients, we compared two measures of alpha diversity: Shannon index (H) which measures the evenness and richness of a population; and phylogenetic diversity (PD) which takes the phylogenetic relationship between taxa into account. We found no significant difference in measures of alpha diversity between tumor (H = mean 3.72 ± standard error 0.78, PD = 6.42 ± 1.88) and normal (H = 3.87 ± 0.74, PD = 6.62 ± 1.96) tissue (H: p = 0.13, PD: p = 0.42).

To test whether overall bacterial taxa composition was different between tumor and normal tissue, we used principal coordinates analysis (PCoA) on weighted and unweighted Unifrac distances. We found that, although statistically significantly different on both weighted (p = 0.012) and unweighted (p = 0.042) measures, the tumor/normal grouping explained only a small proportion of the overall variation in the samples (Fig. 2a, Additional file 1: Figure S1A, B). This difference was also similarly significant (weighted p = 0.001, unweighted p = 0.001) but non-explanatory when comparing PCoAs of samples by whether they were from the oral cavity/oropharynx or the hypopharynx/larynx (Fig. 2b).

Fig. 2

PCoA plots of weighted and unweighted UniFrac distances of tumor and normal samples. Overall oral microbiomic diversity of patient samples as represented by PCoA of weighted and unweighted UniFrac distances. In panel (a), each point represents a single tumor (orange square) or adjacent normal (blue circle) sample, with plus sign and ellipses (orange solid line = tumor, blue dashed line = normal) representing the fitted mean and 68% confidence interval of each group, respectively. Adonis testing revealed statistically significant clustering based on the tumor/normal grouping (weighted p = 0.012, unweighted p = 0.042), but this clustering only explained a small proportion of the overall variation among samples (weighted R2 = 0.010, unweighted R2 = 0.006). In panel (b), each point represents a single oral cavity/oropharyngeal (magenta) or hypopharyngeal/laryngeal (green) sample, with plus sign and ellipses (magenta solid line = oral cavity/oropharynx, green dashed line = hypopharynx/larynx) representing the fitted mean and 68% confidence interval of each group respectively. The different shapes provided by the legend delineate smaller sub-categories of each location. Adonis testing revealed statistically significant clustering of oral cavity/oropharyngeal samples relative to hypopharyngeal/laryngeal samples (weighted p = 0.001, unweighted p = 0.001), but this clustering only explained a small proportion of the overall variation among samples (weighted R2 = 0.018, unweighted R2 = 0.014)

Relative abundance of specific taxa differs between tumor and paired normal tissue

Next, we compared the relative abundances of 372 individual taxa between matched pairs of tumor and adjacent normal tissue, finding differences in ten genera, 12 families, eight orders, five classes, and three phyla by Wilcoxon signed-rank testing (Additional file 2: Figure S2). Only 2/10 genera were significant after adjusting for FDR: Actinomyces and Parvimonas. The genus Actinomyces, along with its parent family Actinomycetaceae, order Actinomycetales, class Actinobacteria, and phylum Actinobacteria, was depleted in tumor compared to matched normal tissue. In contrast, the genus Parvimonas, along with its parent family Tissierellaceae, was increased in tumor compared to normal tissue (Fig. 3).

Fig. 3

Significant taxa by Wilcoxon signed-rank in paired tumor and normal tissue. Box plots representing relative abundances of taxa observed to be significantly different between tumor (orange) and adjacent normal (blue) samples by paired Wilcoxon signed-rank testing after correction for FDR. Dark vertical lines represent the median, with the box representing the first (Q1) and third (Q3) quartiles, and the outer fences 1.5 × interquartile range. Outliers are not plotted. Values are reported as median (Q1–Q3), with q values representing significance of Wilcoxon signed-rank comparing tumor and normal relative abundances for each taxon after FDR correction. Taxa names are colored based on the group in which they are overrepresented

After identifying taxa that were significantly different between tumor and paired normal tissues, we performed a stratified analysis to investigate the relationship between tumor stage and the relative abundances of these taxa. We observed that samples from low-stage (T0–2) patients had significantly increased relative abundance of the genus Actinomyces compared to samples from high-stage (T3–4) patients (median 3.3% versus 1.2%, p = 0.005). The parent taxa of the genus Actinomyces were also significantly relatively increased in low-stage patients compared to higher stages, up to the phylum level. In contrast, the genus Parvimonas was significantly relatively decreased in samples from low-stage patients compared to high-stage patients (median 0.0% versus 1.1%, p = 0.023). The relationship between these taxa and T-stage remained consistent when stratifying by tumor versus paired-normal tissue (Fig. 4a). This difference was statistically significant in the normal group (phylum Actinobacteria p = 0.002, genus Actinomyces p = 0.023, genus Parvimonas p = 0.033), but only approached significance in the tumor group (phylum Actinobacteria p = 0.067, genus Actinomyces p = 0.052, genus Parvimonas p = 0.247).

Fig. 4

Relative abundances of differentially represented taxa stratified by T-stage. Box plots representing relative abundances of phylum Actinobacteria (top), genus Actinomyces (middle), and genus Parvimonas (bottom) stratified by T-stage. Dark horizontal lines represent the median, with the box representing the first (Q1) and third (Q3) quartiles, and the outer fences 1.5 × interquartile range. Outliers are not plotted. a Samples stratified by type (normal = blue, tumor = orange), with darker colors representing higher T-stage. In both normal and tumor samples, the relative abundances of Actinobacteria and Actinomyces decrease, while Parvimonas increases, with increasing T-stage. b Samples stratified by location (oral cavity/oropharynx = magenta, hypopharynx/larynx = green), with darker colors representing higher T-stage. In the oral cavity/oropharynx, the relative abundances of Actinobacteria and Actinomyces decrease, while Parvimonas increases, with increasing T-stage. In hypopharyngeal/laryngeal samples, only Actinobacteria is decreased with increasing T-stage

As T-stage was significantly associated with tissue location (oral cavity/oropharynx versus hypopharynx/larynx), we proceeded to stratify samples based on tissue location (Fig. 4b). We observed that relative abundances of the phylum Actinobacteria, genus Actinomyces, and genus Parvimonas were consistently lower at hypopharyngeal/laryngeal locations relative to the oral cavity/oropharynx. However, when analyzing oral cavity/oropharynx samples alone, Actinobacteria and Actinomyces approached significance in low-stage patients relative to high-stage patients (p = 0.100, p = 0.192) and Parvimonas remained significantly relatively decreased among low-stage patients compared to high-stage patients (p = 0.006). When analyzing hypopharyngeal/laryngeal samples alone, Actinobacteria remained significantly relatively increased in low-stage patients (p = 0.031), while Actinomyces and Parvimonas were not significantly different between low-stage and high-stage groups (p = 0.645, p = 0.790).


In this study, we sought to describe the oral microbiome of individuals with HNSCC and to compare the local microbiome of their tumors with neighboring normal tissue. We hypothesized that tumor tissue would have a microbiome unique from that of adjacent normal tissue and be more pronounced in higher stage disease. The simple comparison of tumor versus adjacent normal tissue did not reveal major shifts in overall diversity (Shannon index or phylogenetic diversity) or in microbiomic content. However, matched pairs analysis of individual tumor-normal pairs revealed significant differences in relative abundance of specific taxa, namely the genera Actinomyces and Parvimonas. These differences were more pronounced in patients with a higher T-stage.

The phylum-level oral microbiome of individuals in our study was similar to those reported previously. Dewhirst et al. reported on the Human Oral Microbiome Database, which consisted of 633 Sanger-sequenced oral 16 s rRNA gene libraries from various head and neck sites of patients of various states of health and disease [41]. Ahn et al. analyzed oral washes from 20 individuals (ten with malignant or premalignant oral lesions, ten healthy controls) using both 16 s rRNA pyrosequencing and a custom DNA microarray [42]. Segata et al. found in their study of over 200 healthy adults that the adult digestive tract microbiome differed according to location of sampling; group 1 (G1) sites (buccal mucosa, keratinized gingiva, and hard palate) had increased relative abundance of Firmicutes and decreased relative abundance of other phyla as compared to group 2 (G2) sites (saliva, tongue, tonsils, and throat) [43]. The phyla-level composition of our study population was most similar to Segata et al.’s G2 series, despite the fact that they used next-generation sequencing (NGS) instead of Sanger sequencing, used swabs instead of surgically excised tissue, and had healthy controls instead of patients with HNSCC. This was not surprising given that the majority of our patient tissues were from tongue and tonsil locations (Table 1), as in Segata et al.’s G2 series [43].

We did not observe any differences in overall diversity of tumor and adjacent normal tissue samples from HNSCC patients. On analysis of alpha diversity measures, we found that the Shannon diversity index of our samples was similar to previously reported measures [44, 45]. Median phylogenetic diversity of our samples was lower than the median described by Takeshita et al. in their study of over 2000 healthy Japanese individuals [46]. This suggests that our study, which uses Sanger sequencing and thus has fewer reads, may under-predict the true phylogenetic diversity of patient samples. However, this difference may also be due to differences in patient disease status, ethnicity, diet, and/or sample type (saliva versus tissue).

Although our samples did cluster into statistically significant normal and tumor groups based on weighted and unweighted UniFrac distances, this grouping only explained a small proportion of the overall variation seen in our samples (Fig. 2). This is unsurprising considering the relative proximity (2 cm) of these two categories of tissue. In fact, the relative histological similarity of adjacent “normal” tissue to neighboring tumor tissue was first described in oral epithelia, and given the name “field cancerization” [47]. Coined to designate large areas of premalignant tissue with altered histology adjacent to malignant tumor tissue, field cancerization may also apply to the resident microbiome. These data imply that more similarities than differences exist between the overall oral microbiomes of tumor and adjacent normal tissues from the same patient, consistent with what has been described previously in a smaller series [18].

Despite similarities on the community level, we observed differences between matched pairs of tumor and normal samples on the individual taxon level. Relative abundances of the genus Actinomyces, along with its parent taxa up to the phylum level, were significantly decreased in tumor as compared with normal samples (Fig. 4). Schmidt et al. also described a decrease in the relative abundance of 11 OTUs from the phylum Actinobacteria in swabs of tumor sites as compared to contralateral normal mucosa in 13 individuals with HNSCC [18]. Similarly, Gong et al. observed decreased levels of Actinobacteria in 27 patients with laryngeal carcinoma compared to 28 participants with vocal cord polyps [19].

Members of the genus Actinomyces are human commensals in the oropharynx, gastrointestinal, and female genital tracts, but can rarely cause subacute to chronic infections in the setting of mucosal disruption [48]. While neither this investigation nor the abovementioned studies can establish the nature or timeline of the relationship between depletion of Actinomyces and malignancy, it is possible that Actinomyces spp. exert a protective effect through the secretion protease-inhibitors that inhibit tumorigenesis [49]. Alternatively, Actinomyces spp. could be out-competed by faster-growing oral commensals at the relatively acidic, hypoxic, and glucose-starved tumor microenvironment [50]. This hypothesis would be most consistent with our observation that relative abundances of Actinomyces and Actinobacteria were not only decreased in tumor compared to adjacent normal tissue, but more so in higher T-stage samples (Fig. 4, top and middle). The effect of T-stage was more pronounced in adjacent normal tissue than in tumor, suggesting that Actinomyces depletion may precede tumor invasion. Importantly, node positivity was not associated with Actinomyces relative abundance, indicating that it may not have a role in the tumor’s metastatic potential.

Although a statistically significant difference in the relative abundance of genus Parvimonas was observed between tumor and adjacent normal tissue, the absolute difference was small and may not be clinically relevant. In addition, although the increase in genus Fusobacterium and its parent taxa up to the phylum level in tumor samples was not statistically significant after correction for multiple comparisons, this finding is consistent with previous reports [18] and may be important in the context of what is known about Fusobacterium and colorectal cancer [2, 3, 5].

This investigation represents the largest study of the microbiome of patients with HSNCC to date, with 121 matched tumor and adjacent normal samples. Moreover, a non-parametric matched pairs analysis was conducted, in contrast to prior studies, which allowed us to control for demographics, clinical characteristics, lifestyle factors, and inherent inter-individual microbiomic variability when comparing tumor and normal samples. In contrast to prior investigations that used swabs or oral rinses, we used surgically excised, histologically verified, tumor and adjacent normal tissue. This offers the capacity to directly sample the tumor microenvironment and compare it to the microenvironment of adjacent normal tissue.

At the time of protocol initiation, NGS was less widely available relative to Sanger sequencing. While this is a limitation of our investigation due to the relatively low read counts in our study, there is evidence to suggest that useful comparisons can be made at this sequencing depth [51]. Prior studies have demonstrated that low numbers of reads can accurately characterize communities at the phylum level and be used to uncover large-scale differences between communities through analysis of beta-diversity metrics [34, 52, 53]. Other studies have demonstrated that while Sanger sequencing (at a depth of 50 reads per sample) will miss rare species, it can capture most of the microbial diversity and accurately characterize abundances of predominant taxa [54, 55].

We acknowledge that the power to detect statistically significant differences in relative abundances is limited by the low read counts offered by Sanger sequencing. As such, the likelihood of false negatives in this study is quite high. However, despite low read counts, significant differences were observed, some consistent up to the phylum level, reflecting the large effect size of these differences.


We conclude that the microbiomes of HNSCC tumor microenvironments are largely similar in overall diversity and bacterial composition to that of histologically normal adjacent tissue. However, we detected decreases in the genus Actinomyces and its parent taxa up to the phylum level and found that this decrease was more pronounced in higher T-stage samples. Further investigation is needed to validate these findings in a large series using NGS methods and to determine the biological relevance of this observed difference.



Shannon index


Head and neck squamous cell carcinoma


Phylogenetic diversity


  1. 1.

    Parsonnet J, Friedman GD, Vandersteen DP, Chang Y, Vogelman JH, Orentreich N, et al. Helicobacter pylori infection and the risk of gastric carcinoma. N Engl J Med. 1991;325:1127–31.

  2. 2.

    Castellarin M, Warren RL, Freeman JD, Dreolini L, Krzywinski M, Strauss J, et al. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res. 2012;22:299–306.

  3. 3.

    Kostic AD, Gevers D, Pedamallu CS, Michaud M, Duke F, Earl AM, et al. Genomic analysis identifies association of Fusobacterium with colorectal carcinoma. Genome Res. 2012;22:292–8.

  4. 4.

    Rubinstein MR, Wang X, Liu W, Hao Y, Cai G, Han YW. Fusobacterium nucleatum promotes colorectal carcinogenesis by modulating E-Cadherin/??-Catenin signaling via its FadA adhesin. Cell Host Microbe. 2013;14:195–206.

  5. 5.

    Kostic AD, Chun E, Robertson L, Glickman JN, Gallini CA, Michaud M, et al. Fusobacterium nucleatum potentiates intestinal tumorigenesis and modulates the tumor-immune microenvironment. Cell Host Microbe. 2013;14:207–15.

  6. 6.

    Consortium THMP. Structure, function and diversity of the healthy human microbiome. Nature. 2013;486:207–14.

  7. 7.

    Fujimura KE, Slusher NA, Cabana MD, Lynch SV. Role of the gut microbiota in defining human health. Expert Rev Anti Infect Ther. 2010;8:435–54.

  8. 8.

    Ma B, Forney LJ, Ravel J. Vaginal microbiome: rethinking health and disease. Annu Rev Microbiol. 2012;66:371–89.

  9. 9.

    Peterson SN, Snesrud E, Liu J, Ong AC, Kilian M, Schork NJ, Bretz W. The dental plaque microbiome in health and disease. PLoS One. 2013;8:e58487.

  10. 10.

    Struzycka I. The oral microbiome in dental caries. Pol J Microbiol. 2014;63:127–35.

  11. 11.

    Chaturvedi AK, Anderson WF, Lortet-Tieulent J, Paula Curado M, Ferlay J, Franceschi S, et al. Worldwide trends in incidence rates for oral cavity and oropharyngeal cancers. J Clin Oncol. 2013;31:4550–9.

  12. 12.

    Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2011;61:69–90.

  13. 13.

    Sturgis EM, Wei Q, Spitz MR. Descriptive epidemiology and risk factors for head and neck cancer. Semin Oncol. 2004;31:726–33.

  14. 14.

    Sturgis EM, Ang KK. The epidemic of HPV-associated oropharyngeal cancer is here: is it time to change our treatment paradigms? J Natl Compr Canc Netw. 2011;9:665–73.

  15. 15.

    Wu J, Peters BA, Dominianni C, Zhang Y, Pei Z, Yang L, et al. Cigarette smoking and the oral microbiome in a large study of American adults. ISME J. 2016;10:2435–46.

  16. 16.

    Leclercq S, Matamoros S, Cani PD, Neyrinck AM, Jamar F, Stärkel P, et al. Intestinal permeability, gut-bacterial dysbiosis, and behavioral markers of alcohol-dependence severity. Proc Natl Acad Sci U S A. 2014;111:E4485–93.

  17. 17.

    Bebek G, Bennett KL, Funchain P, Campbell R, Seth R, Scharpf J, et al. Microbiomic subprofiles and MDR1 promoter methylation in head and neck squamous cell carcinoma. Hum Mol Genet. 2012;21:1557–65.

  18. 18.

    Schmidt BL, Kuczynski J, Bhattacharya A, Huey B, Corby PM, Queiroz ELS, et al. Changes in abundance of oral microbiota associated with oral cancer. PLoS One. 2014;9:e98741.

  19. 19.

    Gong H, Shi Y, Zhou X, Wu C, Cao P, Xu C, et al. Microbiota in the throat and risk factors for laryngeal carcinoma. Appl Environ Microbiol. 2014;80:7356–63.

  20. 20.

    Hooper SJ, Crean SJ, Fardy MJ, Lewis MA, Spratt DA, Wade WG, et al. A molecular analysis of the bacteria present within oral squamous cell carcinoma. J Med Microbiol. 2007;56:1651–9.

  21. 21.

    Hooper SJ, Crean SJ, Lewis MAO, Spratt DA, Wade WG, Wilson MJ. Viable bacteria present within oral squamous cell carcinoma tissue. J Clin Microbiol. 2006;44:1719–25.

  22. 22.

    Gong H-L, Shi Y, Zhou L, Wu C-P, Cao P-Y, Tao L, et al. The composition of microbiome in larynx and the throat biodiversity between laryngeal squamous cell carcinoma patients and control population. PLoS One. 2013;8:e66476.

  23. 23.

    Edge SB, Compton CC. The American Joint Committee on Cancer: the 7th edition of the AJCC cancer staging manual and the future of TNM. Ann Surg Oncol. 2010;17:1471–4.

  24. 24.

    Dauphin LA, Stephens KW, Eufinger SC, Bowen MD. Comparison of five commercial DNA extraction kits for the recovery of Yersinia pestis DNA from bacterial suspensions and spiked environmental samples. J Appl Microbiol. 2010;108:163–72.

  25. 25.

    Tamariz J, Voynarovska K, Prinz M, Caragine T. The application of ultraviolet irradiation to exogenous sources of DNA in plasticware and water for the amplification of low copy number DNA. J Forensic Sci. 2006;51:790–4.

  26. 26.

    Rideout JR, He Y, Navas-Molina JA, Walters WA, Ursell LK, Gibbons SM, et al. Subsampled open-reference clustering creates consistent, comprehensive OTU definitions and scales to billions of sequences. PeerJ. 2014;2:e545.

  27. 27.

    DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72:5069–72.

  28. 28.

    McDonald D, Price MN, Goodrich J, Nawrocki EP, DeSantis TZ, Probst A, et al. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. ISME J. 2012;6:610–8.

  29. 29.

    Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–1.

  30. 30.

    Caporaso JG, Bittinger K, Bushman FD, DeSantis TZ, Andersen GL, Knight R. PyNAST: a flexible tool for aligning sequences to a template alignment. Bioinformatics. 2010;26:266–7.

  31. 31.

    Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490.

  32. 32.

    Hill TCJ, Walsh KA, Harris JA, Moffett BF. Using ecological diversity measures with bacterial communities. FEMS Microbiol Ecol. 2003;43:1–11.

  33. 33.

    Barker GM. Phylogenetic diversity: a quantitative framework for measurement of priority and achievement in biodiversity conservation. Biol J Linn Soc. 2008;76:165–94.

  34. 34.

    Lozupone C, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol. 2005;71:8228–35.

  35. 35.

    Lozupone CA, Hamady M, Kelley ST, Knight R. Quantitative and qualitative beta diversity measures lead to different insights into factors that structure microbial communities. Appl Environ Microbiol. 2007;73:1576–85.

  36. 36.

    Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7:335–6.

  37. 37.

    Anderson MJ. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 2001;26:32–46.

  38. 38.

    Sarkar D. SpringerLink (Online service). Lattice multivariate data visualization with R. Use R! J Stat Software. 2008;25(ii):1-265.

  39. 39.

    Asnicar F, Weingart G, Tickle TL, Huttenhower C, Segata N. Compact graphical representation of phylogenetic data and metadata with GraPhlAn. PeerJ. 2015;3:e1029.

  40. 40.

    Blankenberg D, Von Kuster G, Coraor N, Ananda G, Lazarus R, Mangan M, et al. Galaxy: a web-based genome analysis tool for experimentalists. Curr Protoc Mol Biol. 2010;Chapter 19:Unit 19.10.1–21

  41. 41.

    Dewhirst FE, Chen T, Izard J, Paster BJ, Tanner ACR, Yu WH, et al. The human oral microbiome. J Bacteriol. 2010;192:5002–17.

  42. 42.

    Ahn J, Yang L, Paster BJ, Ganly I, Morris L, Pei Z, et al. Oral microbiome profiles: 16 s rRNA pyrosequencing and microarray assay comparison. PLoS One. 2011;6:e22788.

  43. 43.

    Segata N, Haake SK, Mannon P, Lemon KP, Waldron L, Gevers D, et al. Composition of the adult digestive tract bacterial microbiome based on seven mouth surfaces, tonsils, throat and stool samples. Genome Biol. 2012;13:R42.

  44. 44.

    Bik EM, Long CD, Armitage GC, Loomer P, Emerson J, Mongodin EF, et al. Bacterial diversity in the oral cavity of 10 healthy individuals. ISME J. 2010;4:962–74.

  45. 45.

    Pushalkar S, Ji X, Li Y, Estilo C, Yegnanarayana R, Singh B, et al. Comparison of oral microbiota in tumor and non-tumor tissues of patients with oral squamous cell carcinoma. BMC Microbiol. 2012;12:144.

  46. 46.

    Takeshita T, Kageyama S, Furuta M, Tsuboi H, Takeuchi K, Shibata Y, et al. Bacterial diversity in saliva and oral health-related conditions: the Hisayama Study. Sci Rep. 2016;6:22164.

  47. 47.

    Slaughter DP, Southwick HW, Smejkal W. Field cancerization in oral stratified squamous epithelium; clinical implications of multicentric origin. Cancer. 1953;6:963–8.

  48. 48.

    Wong VK, Turmezei TD. Actinomycosis. Britiish Med J. 2011;343:1–7.

  49. 49.

    Hozumi M, Ogawa M, Sugimura T, Takeuchi T, Umezawa H. Inhibition of tumorigenesis in mouse skin by leupeptin, a protease inhibitor from actinomycetes. Cancer Res. 1972;32:1725–28.

  50. 50.

    Schulze A, Harris AL. How cancer metabolism is tuned for proliferation and vulnerable to disruption. Nature. 2012;491:364–73.

  51. 51.

    Hamady M, Knight R. Microbial community profiling for human microbiome projects: Tools, techniques, and challenges. Genome Res. 2009;19:1141–52.

  52. 52.

    Ley RE, Lozupone CA, Hamady M, Knight R, Gordon JI. Worlds within worlds: evolution of the vertebrate gut microbiota. Nat Rev Microbiol. 2008;6:776–88.

  53. 53.

    Ley RE, Hamady M, Lozupone C, Turnbaugh PJ, Ramey RR, Bircher JS, et al. Evolution of mammals and their gut microbes. Science. 2008;320:1647–51.

  54. 54.

    McKenna P, Hoffmann C, Minkah N, Aye PP, Lackner A, Liu Z, et al. The macaque gut microbiome in health, lentiviral infection, and chronic enterocolitis. PLoS Pathog. 2008;4:e20.

  55. 55.

    Smith BC, McAndrew T, Chen Z, Harari A, Barris DM, Viswanathan S, et al. The cervical microbiome over 7 years and a comparison of methodologies for its characterization. PLoS One. 2012;7:e40425.

Download references


The authors thank Nicole Fowler and Christine Fleckenstein for their excellent research coordination efforts in maintaining the HNSCC biospecimen bank and Emily Nizialek for her comments during the earlier stages of this work.


This work was supported, in part, by NIH grants (R01DE21544 to CE and NCATS KL2TR000440 to GB).

Availability of data and materials

The datasets generated and/or analyzed during the current study are available in the European Nucleotide Archive (ENA), accession numbers LT674587-LT699699.

Authors’ contributions

BB and WL obtained the surgical samples. PF, JA, CP, and HZ carried out the DNA extractions, molecular genetic studies, and participated in the sequence alignment. HW, PF, FN, and GB participated in the design of the study and performed the statistical analysis. HW and PF drafted the manuscript. HW, PF, and JA drafted figures for the manuscript. CE conceived of the study. CE and BB participated in its design and coordination, helped to draft and critically revised the manuscript. All authors read and approved the final manuscript.

Authors’ information

PF was an Ambrose Monell Cancer Genomic Medicine Fellow. HW and HZ are mentees under the auspices of the Doris Duke Clinical Research Mentorship Award (to CE). CE is the Sondra J. and Stephen R. Hardis Endowed Chair in Cancer Genomic Medicine at the Cleveland Clinic, and an ACS Clinical Research Professor.

Competing interests

The author(s) declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Human tissue and data for this study were collected under Cleveland Clinic Institutional Review Board-approved tissue biorepository protocol where written informed consent was obtained from adult research participants, in alignment with the Helsinki Declaration.

Author information

Correspondence to Charis Eng.

Additional files

Additional file 1: Figure S1.

Overall oral microbiomic diversity of patient samples as represented by PCoA of (A) weighted and (B) unweighted UniFrac distances. Each point represents a single tumor (orange) or normal (blue) sample, with connecting lines delineating a tumor/normal pair from the same patient. (TIFF 1177 kb)

Additional file 2: Figure S2.

Cladogram depicting phylogenetic relationship of taxa identified as significantly different (p < 0.05) by Wilcoxon signed-rank testing in tumor relative to adjacent histologically normal tissue prior to correction for FDR. Each concentric ring of nodes represents a taxonomic rank, starting with kingdom at the very center. Moving outwards, the rings represent phylum, class, order, family, and genus. Nodes highlighted in orange are increased in tumor relative to normal samples. Nodes highlighted in blue are increased in normal relative to tumor samples. (TIFF 2213 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, H., Funchain, P., Bebek, G. et al. Microbiomic differences in tumor and paired-normal tissue in head and neck squamous cell carcinomas. Genome Med 9, 14 (2017) doi:10.1186/s13073-017-0405-5

Download citation


  • Head and neck squamous cell carcinoma (HNSCC)
  • Bacteria
  • Microbiome