Salivary biomarker development using genomic, proteomic and metabolomic approaches
© BioMed Central Ltd 2012
Published: 30 October 2012
Skip to main content
© BioMed Central Ltd 2012
Published: 30 October 2012
The use of saliva as a diagnostic sample provides a non-invasive, cost-efficient method of sample collection for disease screening without the need for highly trained professionals. Saliva collection is far more practical and safe compared with invasive methods of sample collection, because of the infection risk from contaminated needles during, for example, blood sampling. Furthermore, the use of saliva could increase the availability of accurate diagnostics for remote and impoverished regions. However, the development of salivary diagnostics has required technical innovation to allow stabilization and detection of analytes in the complex molecular mixture that is saliva. The recent development of cost-effective room temperature analyte stabilization methods, nucleic acid pre-amplification techniques and direct saliva transcriptomic analysis have allowed accurate detection and quantification of transcripts found in saliva. Novel protein stabilization methods have also facilitated improved proteomic analyses. Although candidate biomarkers have been discovered using epigenetic, transcriptomic, proteomic and metabolomic approaches, transcriptomic analyses have so far achieved the most progress in terms of sensitivity and specificity, and progress towards clinical implementation. Here, we review recent developments in salivary diagnostics that have been accomplished using genomic, transcriptomic, proteomic and metabolomic approaches.
The discovery that saliva contains molecular profiles that reflect systemic diseases has opened the doors to a new non-invasive diagnostic methodology: 'salivary diagnostics'. Using saliva in diagnosis is not only practical and non-invasive, but at times is also proving to be more accurate than available alternatives. These advantages might facilitate early detection of many diseases and improve clinical management.
Saliva comprises the secretions of the parotid, submandibular and sublingual glands, hundreds of minor salivary glands and gingival crevice fluid. Saliva functions include food digestion, bolus formation, lubrication and taste facilitation and immune-related functions mediated by secreted antimicrobial peptides  and immunoglobulins . However, these components constitute only a small part of saliva. Many molecular species have now been found, but their functions in the oral cavity remain largely unknown.
Cell-free saliva (CFS) has been found to contain over a thousand proteins that are thought to be involved in a wide range of biological functions , as well as mRNA and microRNA (miRNA) transcripts [4–6], and metabolites [7, 8]. Detecting changes in the salivary concentrations of these molecules has allowed the detection of oral and systemic diseases. Recent developments in genomic, proteomic and metabolomic approaches have facilitated sensitive and high-throughput analysis of saliva and are proving increasingly useful for diagnostics. In this article, we review recent advances in the identification of salivary biomarkers using these approaches, and the implications for diagnostic applications.
Early studies of salivary diagnostics for breast cancer assessed the use of proteins, including c-erbB-2 (human epidermal growth factor receptor 2, a receptor tyrosine kinase involved in cell growth), VEGF (vascular endothelial growth factor, a signal protein involved in vasculogenesis and angiogenesis), EGF (epidermal growth factor, which promotes cell growth, division and differentiation), and CEA (carcinoembryonic antigen, a glycoprotein tumor marker involved in cell adhesion) [9–13]. However, these studies investigated the presence of known serum biomarkers in saliva, rather than the potential existence of unknown biomarkers. Subsequently, de novo transcriptomic, proteomic and metabolomic approaches [5–8, 14–17] have allowed the discovery and validation of salivary biomarker profiles for breast, oral, pancreatic and lung cancers, and periodontitis.
Stringent testing of biomarker performance, including verification and validation, is necessary before clinical application. Although there have been dramatic advances in biotechnology and biomarker research in the last decade, there have been relatively few successful clinical trials of biomarkers, compared with therapeutic research. Therapeutic research utilizes randomized placebo-controlled blinded clinical trials with internationally agreed standards in the evaluation of definitive clinical endpoints and outcomes. Analogous standards for biomarker research did not exist until 2008 with the proposal of prospective specimen collection and retrospective blinded evaluation (PRoBE) study design . PRoBE incorporates prospective specimen collection from the target population, collected in a blinded fashion with no knowledge about patient outcomes. After outcome has been determined, patients and control subjects are selected randomly and their specimens tested. We expect that the application of PRoBE will greatly facilitate the transfer of salivary diagnostics to the clinic.
Promising oral, and head and neck cancer salivary biomarkers discovered using epigenomics, transcriptomics, proteomics and metabolomics
Candidate from previous study, Q-MSP analysis (HNSCC) 
DAPK, DCC, MINT-31, TIMP-3, p16, MGMT, CCNA1
Candidate gene selection based on a previous study, Q-MSP discovery and validation (HNSCC) 
MINT31, CCNA1, DCC, DAPK, p16
MINT31, CCNA1, p16
Candidate from previous study, Q-MSP discovery and validation (HNSCC) 
Candidate from previous study, Q-MSP assessment (HNSCC) 
Discovery by methylation array (OSCC) 
GABRB3_E42_F, IL11_P11_R, INSR_P1063_R, NOTCH3_E403_F, NTRK3_E131_F, PXN_P308_F
ADCYAP1_P455_R, CEBPA_P706_F, EPHA5_E158_R, FGF3_E198_R, HLF_E192_F, IL11_P11_R, INSR_P1063_R, NOTCH3_E403_F
ERBB4_P255_F, IL11_P11_R, PTCH2_P37_F, TMEFF1_P234_F, TNFSF10_E53_F, TWIST1_P44_R
Discovery with HumanMethylation27 DNA assay, validation with Q-MSP 
Microarray discovery and qPCR validation (OSCC) 
IL8, IL1B, OAZ1, SAT
qRTPCR and ELISA validation of previously reported candidates 
IL-8, SAT, H3F3A, S100P
IL-8, IL-1B, SAT, OAZ1
IL-8, IL-1B, SAT, DUSP1
IL-8, IL-1B, S100P, OAZ1
IL-8, SAT, OAZ1, S100P
Discovery and validation by RT-preamp-qPCR (OSCC) 
Candidate gene selection based on previous study, qRT-PCR quantification 
Discovery by C4 RP-LC and capillary reversed-phase LC with quadruple time-of-flight MS and validation by ELISA and immunoblotting (OSCC) 
M2BP, profilin, CD59, MRP14, catalase
ELISA assessment and qPCR confirmation 
Combination of proteomic/transcriptomics
Reproducibility study of validated biomarkers using ELISA and qRT-PCR (OSCC) 
Proteins: IL-1B, IL-8 and M2BP
mRNAs: IL-8, IL-1B, SAT1, S100P
0.89/0.78/0.86, 0.67/0.96/0.85, 0.82/0.84/0.88 for OSCC total/T1-T2/T3-T4 respectively
ULC/Q-TOF-MS (OSCC) 
Valine, lactic acid
Discovery by CE-TOF-MS-based metabolomics 
Taurine, piperidine and a peak at 120.0801 m/z
Promising salivary biomarkers for other diseases discovered using transcriptomics, proteomics and metabolomics
Affymetrix array discovery and qRT-PCR validation (pancreatic cancer) 
KRAS, MBD3L2, ACRV1, DPM1
Microarray discovery and qRT-PCR verification and pre-validation (lung cancer) 
CCNI, EGFR, FGF19, FRS2, GREB1
ELISA assessment (periodontal disease) 
Multiplex protein array discovery. Markers for distinguishing high and low responders to treatment for gingivitis (periodontal disease) 
NR/NR/0.81 (HR) and 0.72 (LR)
Two-dimensional gel electrophoresis and LC-MS-MS (lung cancer) 
Calprotectin, AZGP1, haptoglobin hp2
Combination proteomic/transcriptomic approaches
Discovery by 2D-DIGE and RT-PCR/Affymetrix, validation by qRT-PCR (breast cancer) 
mRNAs: CSTA, TPT1, IGF2BP1, GRM1, GRIK1, H6PD, MDM4, S100A8
Discovery by CE-TOF-MS-based metabolomics (pancreatic cancer) 
Leucine with isoleucine, tryptophan, valine, glutamic acid, phenylalanine, glutamine, aspartic acid
Discovery by CE-TOF-MS-based metabolomics (breast cancer) 
Discovery by SERS (lung cancer) 
Unidentified peak wavelengths; 822, 884, 909, 925, 1009, 1,077, 1,369, 1,393, 1,721 cm-1
Salivary genetic and epigenetic analysis provides insight into the presence of invading pathogens, and aberrant gene transcription profiles that directly reflect pathological genetic processes pertaining to, for example, cancers. This section discusses the most recent advances in genetic and epigenetic biomarker research. Table 1 gives details of promising genetic and epigenetic salivary biomarkers reported for oral, and head and neck cancers.
The salivary genome consists of DNAs representing the genome of the individual, oral microbiota and infecting DNA viruses. The quality and yield of DNA that can be obtained from saliva is relatively good compared with blood and urine, can be used for genotyping, amplification or sequencing , and can be stored long-term without significant degradation . As such, salivary DNA is a robust analyte for diagnostics, but is limited to reflecting presence or absence of specific genes, alteration to sequences (mutation) and methylation status, and cannot provide information on upregulation and downregulation of gene expression.
The potential for salivary diagnosis of Epstein-Barr virus (EBV)-associated nasopharyngeal carcinoma (NPC) using quantitative PCR (qPCR) was reported in 2011 . This method achieved a detection rate of 80%, significantly less sensitive than serological methods (>90%). Also results were somewhat inconsistent with previous reports, as viral load was significantly higher in saliva post-therapy compared with pre-therapy. The authors suggested that this might have been due to decreased salivary flow rates caused by radiotherapy. However, higher levels of EBV DNA in T3/T4 stage NPC compared with T1/T2 stage NPC were reported, which is in agreement with other studies .
Aberrant methylation of tumor suppressor genes is common in cancer cells, and in oral squamous cell carcinoma (OSCC) hypermethylation has been linked with several cancer-related alterations of dysplastic oral epithelium . Promoter hypermethylation has been reported in premalignant OSCC lesions and in head and neck squamous cell carcinoma (HNSCC), showing potential as a biomarker for early detection of primary and relapsing OSCC or HNSCC [23–28] (Table 1). Furthermore, methylation array analysis of DNA extracts from preoperative saliva, postoperative saliva, and tissue samples from patients with OSCC and from saliva from normal individuals showed significant differences in methylation patterns between preoperative and postoperative OSCC and between preoperative OSCC and control saliva samples. Gene panels were constructed that had low-to-moderate sensitivity and good-to-excellent specificity for OSCC  (Table 1).
Quantitative methylation specific PCR (Q-MSP) has shown promise for the development of biomarkers for HNSCC. Carvalho et al.  studied tumor-suppressor gene hypermethylation in salivary rinses from pre-treatment HNSCC patients using Q-MSP. More than half of the study population was found to have methylation of at least one of the selected genes and had significantly reduced overall survival (Table 1). Early epigenetic attempts have been limited to approaches utilizing candidate genes or cell-culture-based discovery, with validation using well-characterized pathology specimens from homogeneous cohorts. Guerrero-Preston et al.  sought to overcome these limitations by using a new study design. To identify differentially methylated genes that could distinguish OSCC or HNSCC tumors from normal tissue, they used clinically defined samples from populations with different risk profiles, high-density promoter methylation platforms, publicly available expression arrays, two-stage design ('discovery screen' and 'prevalence screen') and Q-MSP in a phase I biomarker development trial. This approach aims to improve phase I trials, so that only robust biomarkers are assessed in phase II trials. The discovery screen found that promoter methylation in KIF1A, HOXA9, NID2 and EDNRB genes had a moderate-to-substantial correlation with clinical diagnosis. The prevalence screen showed a near perfect correlation between HOXA9 and NID2 promoter hypermethylation and histological diagnosis. The combined analysis of these two genes resulted in 94% sensitivity, 97% specificity and an area under the receiver operating characteristic curve (AUC) of 0.97 in tissue samples. However, the performance was far lower using saliva samples, with very low sensitivity  (Table 1).
The Human Oral Microbe Identification Microarray (HOMIM) has recently been developed comprising an oligonucleotide microarray based on 16S rRNA, and has allowed profiling and monitoring of changes in the oral microbiota . Alterations in the oral bacterial profile have been found to correlate with several diseases, including pancreatic, oral and lung cancer , colonic neoplasia and extracolonic malignancy , cardiovascular and cerebrovascular disease [34–36], and preterm birth , and these findings are being explored as potential salivary diagnostic approaches.
mRNAs and miRNAs are secreted from cells into the extracellular milieu and can be found in biofluids that are distant to the cellular sources [6, 38, 39]. In a diseased state, transcription of specific mRNAs and miRNAs is altered. Although the validity of salivary RNAs for the development of biomarkers with diagnostic potential initially received some criticism , this approach is now widely accepted. However, the precise sources of salivary RNAs and other molecules remain unclear (for further discussion see Box 1).
Standard procedures for salivary mRNA analysis previously required low temperatures and time-consuming and costly mRNA isolation, which impeded clinical application. Now, simple methods of stabilizing mRNA in saliva samples have been developed, allowing room temperature storage without stabilizing agents, and sample analysis without further processing: so-called 'direct-saliva-transcriptomic-analysis' . However, this approach still involves centrifugation. An alternative method has been described , but was based on the use of a costly stabilizing agent. As such, neither method is completely practical for all applications. Salivary and plasma mRNAs can be surprisingly stable. The mechanism that confers such stability is currently unknown; however, AU-rich element (ARE) sequence motifs in the 3' UTR regions of transcripts can recruit ARE-binding proteins, thereby controlling stability .
Microarray technology has allowed high-throughput saliva analysis and is the current gold standard for identifying saliva transcripts. The salivary transcriptome is profiled using microarrays and validated with qPCR. However, due to the low concentration of some biomarkers and occasional small sample volume, further innovations in technology have been required. Hu et al.  has overcome these limitations by two developments: a universal mRNA-amplification method for microarray discovery, and a multiplex pre-amplification method for qPCR validation. Moreover, the multiplex pre-amplification method allows the simultaneous detection of many transcripts, providing cost-effective screening and quantitative measurement using a relatively small amount of pre-amplified product. In the following sections, we discuss advances in transcriptomic analysis of mRNAs and miRNAs using saliva samples.
Characterization of mRNA profiles in body fluids provides insights into gene transcription in normal and disease states. There have been several advances in transcriptomic salivary mRNA biomarker characterization in the last 4 years. As discussed in this section, salivary mRNA biomarkers have been pursued for a number of diseases, including OSCC, primary Sjögren's syndrome, pancreatic cancer and ovarian cancer, and they have proved to be sensitive, specific and accurate.
Transcriptomic biomarkers for primary Sjögren's syndrome have undergone preclinical validation , and mRNA biomarkers for OSCC previously discovered in a US cohort  have been found to detect OSCC in a cohort of different ethnicity . Discovery and validation of transcriptomic salivary biomarkers has been performed for breast cancer using Affymetrix HG-U133-Plus-2.0 array discovery and preclinical validation in an independent cohort using RT-qPCR . Applying PRoBE design , Zhang et al.  identified salivary mRNA biomarkers for pancreatic cancer with excellent sensitivity and specificity. Importantly, these biomarkers could also distinguish pancreatic cancer from chronic pancreatitis with excellent sensitivity and specificity (both 96.7%). Also, the ovarian cancer salivary transcriptome profile was discovered in a clinical case controlled study using Affymetrix HG-U133-Plus-2.0 array and then validated with qPCR by Lee et al.  (Table 2).
Recently, Spielmann et al.  used massively parallel sequencing to determine nucleotide sequences for each salivary RNA. The salivary transcriptome of unstimulated CFS and whole saliva (WS) from healthy human subjects was profiled: 25% of the sequences from CFS corresponded with the human genome and approximately 30% corresponded with the human oral microbiome. In CFS and WS, more than 4,000 genes and non-coding functional sequences of human and microbial origin combined were detected.
miRNAs are short (19 to 25 nucleotides) RNA transcripts associated with post-transcriptional regulation by the RNA-induced silencing complex [49, 50]. These RNAs have been well characterized and found to play roles in cell growth, differentiation, apoptosis, pathogen-host interactions and stress responses and immune function, and are found in saliva [51–53]. Just as with salivary mRNA, salivary miRNA is surprisingly stable, and this was recently reported to be due to it being harbored in exosomes  (also see Box 1).
miRNAs are differentially expressed in several cancer cell types compared with normal cells, with observed differences that range from ten to over a hundredfold. Changes in mRNA levels in cancer cells are comparatively small, and miRNA has been found to be more useful for characterizing solid tumor types than mRNA. miRNA has been shown to accurately distinguish poorly differentiated tumors, whereas mRNA profiling on the same samples produced highly inaccurate results [52, 55, 56]. Therefore, miRNA cancer biomarkers are potentially very powerful, and if miRNA-based approaches also reveal similarly differential profiles using saliva samples they may become very useful in salivary diagnostics.
Park et al.  found significantly reduced levels of miRNAs miR-125a and miR-200a (known tumor suppressors) in the saliva of oral cancer patients compared with controls. Recent work has also shown that salivary miR-31 (implicated in tumorigenesis) was significantly elevated in all stages of oral cancer, and that salivary miR-31 was more abundant than blood miR-31, indicating the oral tumor origin of this biomarker .
The use of surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) mass spectrometry (MS), which allows rapid, high-throughput detection of proteins in minute sample volumes without pre-processing, has been reported for several diseases . Recently, analysis of saliva for protein biomarker discovery has mainly been performed using two-dimensional difference gel electrophoresis (2D-DIGE) coupled with MS (which can identify around 300 proteins in a sample ), and liquid chromatograpy-MS (LC-MS) based techniques (which can identify more than 1,050 proteins in a sample; reviewed in ). Thus, liquid chromatographic separation appears to resolve protein species more precisely than gel electrophoresis methods. Recently, a multiplex protein array was also employed, providing high-throughput analysis ; however, this method requires some prior knowledge of likely analytes.
Despite these advances, the discovery and validation of salivary protein biomarkers provide challenges. Proteins generally have short half-lives, although there can be large differences in observed stability. Both the nature of peptides and the milieu of the oral cavity make salivary proteins vulnerable to degradation. Thus, protein-based salivary diagnostics require immediate processing and/or analysis of saliva samples, or the use of freezers and costly protease inhibitors. In a research setting, these requirements can be easily met. However, in clinical settings protein stabilization without freezers and specialized machinery may be required. Our laboratory has developed protein stabilization methods that allow storage of saliva samples at room temperature, without the need for expensive reagents and equipment [62, 63].
Several different classes of salivary protein biomarkers have been reported. Endothelin-1, a vasoconstrictor, was reported as a potential biomarker for OSCC development in patients with oral lichen planus . Interleukins IL-8 and IL-1β, glycoprotein M2BP (thought to be involved in natural killer cell activity enhancement, tumor suppression, interleukin production and cell aggregation), profilin (actin-binding protein, important for temporal and spatial control of actin microfilament growth), CD59 (complement regulatory protein), MRP14 (a protein involved in regulating myeloid cell function) and catalase (catalyses the conversion of hydrogen peroxide to water and oxygen) have been reported as salivary biomarkers for oral cancer [15, 65–69], and immunoglobulins have long been described as salivary biomarkers for HIV infection [70, 71].
The metabolome is the complement of small-molecule metabolites. Similar to the transcriptome and proteome, the metabolome changes continually and any single profile is a snapshot reflecting gene and protein expression. Metabolomic investigations can generate quantitative data for metabolites in order to elucidate metabolic dynamics related to disease state and drug exposure .
Using capillary electrophoresis time-of-flight MS, Sugimoto et al.  identified metabolites for detecting oral, breast and pancreatic cancer, and periodontal disease. Multiple logistic regression models yielded AUCs of 0.865 for oral cancer, using a profile consisting of three oral-cancer-specific metabolites: taurine, piperidine and a peak with a mass-to-charge ratio (m/z) of 120.0801. A breast cancer profile consisting of taurine and lysine achieved an AUC of 0.973. A model consisting of a pancreatic-cancer-specific metabolite signature comprising leucine with isoleucine, tryptophan, valine, glutamic acid, phenylalanine, glutamine and aspartic acid achieved an AUC of 0.993. Periodontal diseases achieved an AUC of 0.969 using a panel of only two markers, but the specific markers were not reported  (Tables 1 and 2). Using ultraperformance LC with quadrupole/time-of-flight MS and multivariate statistical analysis, Wei et al.  demonstrated that a salivary metabolic profile comprising valine, lactic acid and phenylalanine could discriminate OSCC from controls and oral leukoplakia with good-to-excellent sensitivity, specificity and accuracy (Table 1). However, a limitation of metabolomics compared with genomics, transcriptomics and proteomics is the inability at times to specifically identify the differentially expressed metabolites, as exemplified by these findings [7, 8, 17] (Tables 1 and 2).
Early-stage disease detection is imperative for successful therapy for most cancers, and the ability to quickly identify suspicious lesions will also greatly reduce hospital burdens. In the future, salivary diagnostics is expected to facilitate rapid, easily accessible and non-invasive clinical diagnosis, thus allowing more cases of disease to be detected at early stages and decreasing mortality caused by oral and systemic cancers, and infectious diseases.
Since the 1990s salivary diagnostics have been developed for oral diseases, allowing monitoring of periodontal disease and assessment of caries risk [73–75]. In recent years there have been many reports of potential salivary biomarkers for periodontal diseases found by candidate approaches, and conventional proteomics has been found to be highly accurate for detecting periodontitis as well as oral, head and neck cancers.
Several candidate approaches have elucidated biomarkers for periodontitis and responses to therapy. Gursoy et al.  compared the concentration of five specific salivary proteins, and the presence of five oral pathogens in patients with advanced periodontal disease and healthy controls. They found an association of IL-1β and oral pathogens with periodontitis. Using enzyme-linked immunosorbent assay (ELISA), patients with chronic periodontitis were found to exhibit significantly lower IL-17 and significantly higher IL-18 saliva concentrations compared with healthy individuals .
In a response to therapy study, salivary biomarkers for periodontitis were assessed in adults with chronic periodontitis, in which 33 participants received oral hygiene instructions alone and 35 participants had conventional periodontal treatment comprising scaling and root planning combined with oral hygiene instructions. Saliva samples collected at intervals were analyzed for several proteins, and it was found that the levels of IL-1β, macrophage inflammatory protein 1α (MIP-1α, a chemokine produced by macrophages after bacterial endotoxin stimulation, which activates granulocytes), matrix metalloproteinase-8 (MMP-8, a collagen cleaving enzyme found in connective tissue and involved in degradation of extracellular matrix in both normal and disease processes) and osteoprotegerin (OPG, a decoy receptor that reduces the production of osteoclasts) detected in saliva reflected disease severity and response to therapy and may have potential use in monitoring periodontal disease .
Nomura et al.  found that a combination of alanine aminotransferase (a transaminase that catalyses two steps of the alanine cycle) level and Porphyromonas gingivalis ratio could indicate periodontal disease progression with 40% sensitivity and 96% specificity. Lee et al.  analyzed saliva samples using a multiplex protein array for selected biomarkers implicated in host defense, inflammation, tissue destruction and angiogenesis, and measured subgingival bacterial proportions by checkerboard DNA-DNA hybridization. The authors devised a system to discriminate 'high' and 'low' responders to treatment for gingivitis. Salivary IL-6 and IL-8 levels provided the best distinction between high and low responders (Table 2).
Proteomic and microbial studies have been used to successfully identify biomarkers for periodontal disease. Choi et al.  searched for potential protein biomarkers for periodontitis in gingival crevicular fluid using LC-tandem MS (LC-MS-MS). Azurocidin (an antibiotic protein of azurophil granules with chemotactic activity) was identified as upregulated in gingival crevicular fluid, but not in saliva. ELISA was then used to verify upregulation of azurocidin, identifying the latter as a candidate biomarker for inflammatory periodontal disease . Using SELDI-TOF MS, Hart et al.  characterized the oral flora and salivary proteome in caries-free and caries-active children, and found that a combination model of microbial-proteomic data was far more accurate than microbial or proteomic models alone. Their best model had a 6% test error, with >92% sensitivity, and >95% specificity.
Al-Sabbagh et al.  investigated salivary levels of bone-remodeling-related molecules in chronic periodontitis patients using ELISA. MIP-1α levels were found to be 18-fold higher in periodontitis patients than normal and clinical measures correlated significantly with MIP-1α levels, and achieved excellent sensitivity, specificity and accuracy (Table 2). In a study of dysplastic oral leukoplakia in relation to tobacco habits and periodontitis, Sharma et al.  found that increasing IL-6 levels correlated with increasing severity of dysplasia.
Discovery and validation of salivary biomarkers for OSCC followed, and has also been confirmed across ethnic backgrounds . Many salivary biomarkers, identified using candidate as well as 'omic approaches (including metabolomic, transcriptomic (miRNA and mRNA), proteomic, epigenomic and microbial) have been described for oral, and head and neck cancers [4, 6–8, 22, 23, 25–29, 46, 57, 64, 65, 67, 83].
A comparison of salivary biomarkers for OSCCs and HNSCCs can be seen in Table 1. Several single marker candidates for OSCC have been reported, including EDNRB gene (which encodes endothelin receptor type B, a G-protein-coupled receptor that activates a phosphatidylinositol-calcium second messenger system) hypermethylation [25, 26], miRNAs miR-200a, miR-125a and miR-31 [6, 57], and IL-8 protein , but only IL-8 has been shown to be a sensitive, specific and accurate single marker for OSCC and could prove to be a cost-efficient analyte for OSCC detection.
Several biomarker panels have also been investigated for OSCC detection. An mRNA panel has been validated with excellent sensitivity, specificity and accuracy and could be considered 'second place' to IL-8 . Interestingly, proteomic-transcriptomic combination panels have achieved good sensitivity, specificity and accuracy, but are lagging behind the transcriptomic panel in these performance measures . A five-protein panel achieved higher sensitivity, lower specificity and similar accuracy to IL-8 alone . Metabolomic markers for OSCC have also achieved good accuracy, but have not been validated [7, 8]. Many epigenetic salivary biomarker panels have been reported and some have achieved 100% specificity, but accompanied by low sensitivity (62% ; Table 1), and have not been as stringently tested as proteomic and transcriptomic biomarkers.
More recently, advances in biotechnology, genomics and proteomics have extended the range of salivary diagnostics to systemic disease monitoring. Possibly the most attractive attribute of salivary diagnostics is its implication in systemic disease diagnosis, due to the invasive nature and in some cases poor accuracy of current standard diagnostic methods .
Available tests for systemic cancers include invasive and expensive procedures (such as magnetic resonance imaging, biopsy, X-ray, computed tomography scan, exfoliative cytology, positron emission tomography scan and endoscopy). This, combined with significantly worse prognosis associated with late diagnosis, makes the prospect of salivary diagnostics particularly valuable in oncology. Available diagnostic methodologies for other systemic diseases, such as infections, have similar problems. In this section, we will discuss progress in the use of salivary biomarker-based approaches for diagnosis of pancreatic cancer, lung cancer and breast cancer and finally other systemic diseases.
A significant milestone in salivary diagnostics was reached by Zhang et al. , with a prospective sample collection and retrospective double-blinded validation study that showed that a salivary transcriptome profile could detect early stage resectable pancreatic cancer. The use of the salivary transcriptome for detecting pancreatic cancer was also found to outperform currently used blood-based tests in terms of sensitivity and specificity, but did not allow sensitive detection of pancreatitis, unlike some blood-based tests. Farrell et al.  showed that variations in oral microbiota could possibly be used to detect pancreatic cancer and pancreatitis. Also, a metabolomic approach  has identified pancreatic-cancer-specific salivary metabolomic biomarkers that could distinguish pancreatic cancer from oral and breast cancer and normal controls. Although both the transcriptomic and the metabolomic biomarkers achieved excellent accuracy for this disease, only the transcriptomic panel has been validated (Table 2).
Currently, lung cancer may be detected with chest X-ray and computed tomography, and diagnosis is confirmed via biopsy. Thus, lung cancer detection is limited by stage of disease, and more than 75% of cases are diagnosed in the late stages, significantly reducing survival rate. Lung cancer is often asymptomatic or presents with non-specific symptoms, so there is considerable need for non-invasive approaches for earlier detection of disease [59, 84].
Using 2D-DIGE and MS, Xiao et al.  performed proteomic analysis of saliva samples from lung cancer patients. Sixteen candidate biomarkers were discovered and further verified. Three candidate markers (calprotectin, a calcium and zinc binding protein, zinc-α-2-glycoprotein, a lipolysis stimulating protein, and haptoglobin hp2, a protein that binds free plasma hemoglobin) achieved good sensitivity and excellent specificity and accuracy (Table 2).
Zhang et al.  profiled the salivary lung cancer transcriptome and discovered and pre-validated a panel of seven highly discriminatory mRNAs: BRAF (which encodes the protein B-Raf, and is involved in directing cell growth), CCNI (which encodes the protein cyclin-I, and binds activated cyclin-dependant kinase 5), EGFR (epidermal growth factor receptor), FGF19 (fibroblast growth factor 19, involved in many biological processes, including morphogenesis and cell growth, and tumor growth and invasion), FRS2 (fibroblast growth factor receptor substrate 2, an adapter protein that links fibroblast growth factor and nerve growth factor receptors to downstream signaling pathways), GREB1 (growth regulation by estrogen in breast cancer 1, an early response gene in the estrogen receptor pathway), and LZTS1 (leucine zipper putative tumor suppressor 1, a putative tumor suppressor gene). A panel consisting of five of these markers differentiated lung cancer from normal subjects with 93.75% sensitivity and 82.81% specificity (Table 2).
Surface-enhanced Raman spectroscopy (SERS) was recently employed by Li et al.  to identify biomarkers for lung cancer. SERS revealed nine peaks (assigned to amino acids and nucleic acid bases) that could distinguish samples from patients with lung cancer and controls with 86% accuracy, 94% sensitivity and 81% specificity. Thus, for lung cancer, both transcriptomic and proteomic approaches are proving highly useful for biomarker development, and metabolomic approaches are also showing promise (Table 2).
Today, breast cancer detection employs physical examination and imaging techniques. Emerging technologies such as molecular analysis of nipple fluid aspirate and ductal lavage samples  provide improved accuracy and potentially earlier diagnosis, but are invasive and limited to high-risk patients. Several earlier studies indicated the potential for salivary proteomic detection of breast cancer and its potential use in monitoring response to treatment, and the use of single markers, such as salivary protein c-erbB-2, has also been investigated [9–12].
Cao et al.  investigated the salivary proteome of breast cancer patients compared with healthy individuals. Saliva samples were analyzed with isobaric tags for relative and absolute quantitation (iTRAQ) technology coupled with LC-MS-MS. Of 464 identified proteins, 9 were associated with breast cancer and had a 1.5-fold upregulation or downregulation. Recently, preclinical validation of transcriptomic and proteomic salivary biomarkers with diagnostic power for breast cancer or ovarian cancer has also been reported [14, 47]. However, the metabolomic approach to breast cancer detection has shown better results than the combination proteomic-transcriptomic approach in terms of accuracy (Table 2).
Advances in the use of salivary biomarkers for detecting autoimmune diseases, systemic microbial infections and diabetes have been described recently [2, 45, 70, 87]. Menezes et al.  investigated hepatitis C virus (HCV) loads in saliva and serum samples from HCV-infected individuals using qPCR, and found that HCV RNA could be consistently detected in saliva of HCV-infected individuals, although at significantly lower levels than in serum.
Border et al.  performed label-free differential protein expression analysis using multidimensional LC-MS-MS to characterize the proteome of saliva collected from edentulous type 2 diabetic patients compared with non-diabetic controls. Several proteins, PLUNC (palate, lung and nasal epithelium carcinoma-associated protein, a secretory protein expressed in the upper airways and nasopharyngeal regions), GAPDH (glyceraldehyde-3-phosphate dehydrogenase, an enzyme that catalyses the sixth step of glycolysis), SAA (serum amyloid A, an adipolipoprotein associated with high density lipoproteins in plasma), serotransferrin (an iron-binding transport protein), PIP (prolactin-induced protein, an aspartyl protease that inhibits T-lymphocyte programmed cell death), A2MG (α2-macroglobulin, a broad protease inhibitor), and cystatin S and cystatin SN (protein inhibitors of cysteine proteinases), were found to have diagnostic potential for type 2 diabetes but require further testing to determine clinical value.
Transcriptomic and proteomic approaches to biomarker development have so far had the most progress towards clinical testing. Salivary biomarkers for a range of cancers, both local and distant to the oral cavity, have been characterized and validated at the pre-clinical level. Certainly, salivary biomarkers for oral diseases such as OSCC are likely to become clinically implemented before those for systemic cancers, as they have been well described and validated and often originate locally to the oral cavity where the biospecimen is collected, lending legitimacy to such biomarkers.
The transfer of scientific knowledge of salivary biomarkers to clinical applications is a challenging process that has rarely resulted in clinical implementation. Understanding the sources of salivary biomarkers (for example, exosomes; Box 1) will be a significant objective in the development of salivary diagnostics, as this will provide the rationale for the use of salivary biomarkers for systemic diseases. Our group and others continue to investigate the mechanisms by which salivary biomarkers appear in saliva, and to develop technologies for detecting biomarkers with high sensitivity and specificity in pursuit of compliance at the FDA level.
Accumulating evidence suggests that some salivary biomarkers could derive from systemic sources. Gao et al.  showed that altered salivary gland tissue production of transcription factors might account for a large proportion of upregulated and downregulated gene expression in the saliva of tumor-bearing mice, and that tumor-derived ectopic nerve growth factor could induce production of the transcription factor Egr-1 in salivary glands. Therefore, the salivary transcriptomic profile might be composed of transcripts originating in distant diseased tissues and transcripts originating in salivary glands, and altered expression levels of these transcripts might be caused by transcription factors originating in distant tissues.
Exosomes might be involved in transport of systemic biomarkers to saliva or transport of effectors of salivary gland transcriptional machinery from blood to the salivary glands. These membrane bound microvesicles of 30 to 100 nm in diameter and endocytic origin  are secreted by many cell types in diseased and non-diseased states. Exosomes contain protein, RNA and DNA, and have been found in most body fluids, including saliva [92–95]. Using in vitro methods, studies have shown that exosomes secreted from one cell can be taken up by other cells and can influence the receiving cell's transcriptional signature [92, 96, 97]. Intercellular communication via exosomes can also occur between normal cells and cancer cells and between cancer cells and distant sites. This has been suggested as a mechanism by which cancer cells prepare distant sites for receiving metastatic cells [98, 99]. Exosomes protect their contents from enzymatic degradation, suggesting a role in intercellular transport of molecules [6, 97, 100]. Similarities between serum and saliva biomarker profiles and the co-existence of exosomes in saliva and serum  also suggest the involvement of exosomes in transport of salivary biomarkers. Furthermore, a recent in vitro investigation showed that tumor-derived exosomes can interact with salivary gland cells and activate their transcriptional machinery .
area under the receiver operating characteristic curve
enzyme-linked immunosorbent assay
hepatitis C virus
head and neck squamous cell carcinoma
Human Oral Microbe Identification Microarray
liquid chromatography-mass spectrometry
liquid chromatography-tandem mass spectrometry
oral squamous cell carcinoma
polymerase chain reaction
prospective specimen collection and retrospective blinded evaluation
reverse transcription PCR
reverse transcription quantitative PCR
surface-enhanced laser desorption/ionization time-of-flight mass spectrometry
surface-enhanced Raman spectroscopy
two-dimensional difference gel electrophoresis.