- Open Access
The potential of circulating tumor DNA methylation analysis for the early detection and management of ovarian cancer
Genome Medicinevolume 9, Article number: 116 (2017)
Despite a myriad of attempts in the last three decades to diagnose ovarian cancer (OC) earlier, this clinical aim still remains a significant challenge. Aberrant methylation patterns of linked CpGs analyzed in DNA fragments shed by cancers into the bloodstream (i.e. cell-free DNA) can provide highly specific signals indicating cancer presence.
We analyzed 699 cancerous and non-cancerous tissues using a methylation array or reduced representation bisulfite sequencing to discover the most specific OC methylation patterns. A three-DNA-methylation-serum-marker panel was developed using targeted ultra-high coverage bisulfite sequencing in 151 women and validated in 250 women with various conditions, particularly in those associated with high CA125 levels (endometriosis and other benign pelvic masses), serial samples from 25 patients undergoing neoadjuvant chemotherapy, and a nested case control study of 172 UKCTOCS control arm participants which included serum samples up to two years before OC diagnosis.
The cell-free DNA amount and average fragment size in the serum samples was up to ten times higher than average published values (based on samples that were immediately processed) due to leakage of DNA from white blood cells owing to delayed time to serum separation. Despite this, the marker panel discriminated high grade serous OC patients from healthy women or patients with a benign pelvic mass with specificity/sensitivity of 90.7% (95% confidence interval [CI] = 84.3–94.8%) and 41.4% (95% CI = 24.1–60.9%), respectively. Levels of all three markers plummeted after exposure to chemotherapy and correctly identified 78% and 86% responders and non-responders (Fisher’s exact test, p = 0.04), respectively, which was superior to a CA125 cut-off of 35 IU/mL (20% and 75%). 57.9% (95% CI 34.0–78.9%) of women who developed OC within two years of sample collection were identified with a specificity of 88.1% (95% CI = 77.3–94.3%). Sensitivity and specificity improved further when specifically analyzing CA125 negative samples only (63.6% and 87.5%, respectively).
Our data suggest that DNA methylation patterns in cell-free DNA have the potential to detect a proportion of OCs up to two years in advance of diagnosis and may potentially guide personalized treatment. The prospective use of novel collection vials, which stabilize blood cells and reduce background DNA contamination in serum/plasma samples, will facilitate clinical implementation of liquid biopsy analyses.
Three-quarters of ovarian cancers (OC) are diagnosed when the tumor has spread into the abdomen and long-term survival rates of these women are low (10–30%) .
High-grade serous (HGS) OC accounts for 70–80% of OC deaths and the survival figures have not changed significantly over the last few decades . Early diagnosis and personalized treatment still remain the biggest unmet needs in combating this devastating disease .
A number of OC biomarkers have been studied in the past. Among these, CA125, which was discovered more than 30 years ago , is still the “gold standard,” despite a modest positive predictive value when interpreted using a defined cut-off of 35 IU/mL , which has also been used as a reference standard in our work. Recently, the 35 most promising OC biomarkers were evaluated in the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. The markers were tested in samples taken up to six months before OC diagnosis from 118 women and 951 age-matched controls and at a fixed specificity of 95%, CA125 sensitivity out performed all 35 markers . However, the performance of CA125 dropped dramatically when samples taken > 6 months before diagnosis were evaluated . Recently, we demonstrated that the performance of the Risk of Ovarian Cancer Algorithm (ROCA), based on CA125 serial profile, demonstrates superior performance characteristics during screening [6, 7]. CA125 kinetics are also increasingly being explored in women undergoing neoadjuvant chemotherapy (NACT) for predicting disease response and outcome [8,9,10,11]. Both require serial blood sampling, which, in the case of differential diagnosis, is never available in patients presenting clinically.
The vast majority of protein-based tumor markers are produced not only by cancerous but also non-neoplastic normal cells; CA125 is produced by mesothelial cells (i.e. peritoneum and pleura) and hence benign or inflammatory processes can result in aberrant elevations of serum CA125.
Recently, DNA-based markers, shed from tumor cells, have shown great promise in monitoring treatment response and predicting prognosis [12,13,14,15,16]. However, efforts to characterize the cancer genome have shown that only a few genes are frequently mutated in most cancers and that the location of the genetic mutation site differs across individuals with specific tumor types. Hence, the detection of somatic mutations is limited to patients that harbor a predefined set of mutations. The necessity of prior knowledge regarding specific genomic composition of an individual’s tumor tissue is one of the limiting factors when using these “liquid biopsy” approaches for early detection or differential diagnosis of a pelvic mass. Current sequencing technology allows for the detection of a mutant allele fraction of 0.1% (which is one mutant molecule in a background of 1000 wild-type molecules) [12, 17].
The development of a cell-free DNA based test for the early detection of cancer poses two major challenges: (1) low abundance of cancer-DNA in the blood; and (2) high levels of “background DNA” (shed from white blood cells [WBC] ) in serum samples that are separated from blood cells after significant time intervals. This is an issue with most currently available population cohort biobanks which could be used for the validation of potential screening markers using samples that have been banked years in advance of diagnosis.
Alteration of DNA methylation (DNAme) is: (1) an early event in cancer development [19,20,21,22]; (2) more frequently observed than somatic mutations; and (3) centered around specific regions, i.e. CpG islands . Together with its chemical and biological stability, the detection of aberrant DNAme patterns in serum or plasma provide a novel strategy for cancer diagnosis as evidenced by several proof of principle studies [24,25,26,27,28,29,30,31,32,33,34]. DNAme analysis allows for the detection of specific patterns (i.e. full methylation of all linked 7–16 CpGs in a region of 120–150 bp) as opposed to single point mutations (e.g. in the TP53 gene) which is key to improving both the performance characteristics of the test and the detection limit of the assay. Plasma SEPT9 methylation analyses—currently the only cell-free DNA assay which is available for cancer screening in the clinical setting—demonstrates a specificity of 79% and a sensitivity of 68% for detection of colon cancers . The clinical potential of serum/plasma-based cell-free DNA analysis is further exemplified by maternal plasma cell-free DNA testing for fetal trisomy which has already been integrated into clinical practice and demonstrates a higher sensitivity and a lower false-positive rate compared to imaging-based techniques .
We have employed two different epigenome-wide approaches to identify the most promising DNAme-based markers that discriminate OC vs benign pelvic conditions, developed serum tests using the discovered markers, and validated their performance in relation to the serum OC marker CA125.
Patients and sample collection
We analyzed tissue samples from a total of 699 volunteers and 648 serum samples from a total of 598 volunteers (the 25 patients who underwent NACT provided three serial samples) in seven independent sets (Fig. 1).
OC samples [36, 37], WBC samples , and Fallopian Tube samples  have previously been described. Ten benign pelvic tumors (two endometriosis-ovarian cysts, one fibroma, two papillary serous cystadenomas, one mucinous cystadenoma, two serous cystadenomas, one mucinous cystadeonoma with Brenner tumor, and one dermoid cyst), 96 endometrial samples  (Haukeland University Hospital, Bergen, 52 patients with primary and metastatic samples equaling 87, eight benign endometrial (all hyperplasia) and one cell line), and 170 samples (38 colon [COAD controls], 50 liver [LIHC controls], 75 lung [LUSC and LUAD controls], seven rectum [READ controls]) from the publicly available The Cancer Genome Atlas (TCGA) repository were analyzed.
Reduced representation bisulfite sequencing (RRBS) set:
Eleven prospectively collected invasive epithelial OC samples (HGS, n = 8; low grade serous, n = 1; endometrioid, n = 1; mucinous, n = 1; mean age = 54.7 years), one benign tumor (papillary serous cystadenoma; age = 86 years), 18 non-neoplastic tissue samples (breast, n = 7 and adnexal, n = 11; mean age = 60.2 years), two non-neoplastic endometrial tissues (mean age = 68 years), and 23 WBC samples (breast cancer patients, n = 10 and OC patients, n = 13 [11 of which match corresponding OC tissue samples, one matches corresponding normal endometrial sample, and one matches normal ovarian sample]; mean age = 57.8 years) were assessed by RRBS. All samples were collected prospectively at the University College London Hospital in London and the Charles University Hospital in Prague.
For serum sets 1–3 and the NACT serum set, women attending the University College London Hospital in London and the Charles University Hospital in Prague were invited, a written consent obtained, and 20–40 mL blood taken (VACUETTE® Z Serum Sep Clot Activator tubes, Cat. 455071, Greiner Bio One International GmbH), centrifuged at 3000 rpm for 10 min with serum stored at – 80 °C.
Serum set 1:
Serum samples from the following volunteers were collected (at the time of diagnosis, before treatment):
healthy volunteers (n = 19, mean age = 41.1 years);
women with benign pelvic masses (n = 22, mean age = 41.3 years) with the following histologies: endometriosis (n = 6), fibroids (n = 5), hydrosalpinx (n = 1), serous cystadenoma (n = 5), and mucinous cystadenoma (n = 5);
patients with OCs (n = 18, mean age = 62.2 years): endometrioid (n = 2) and clear cell (n = 1) and HGS (n = 15) OCs; 10 and 8 women had a stage I/II and stage III/IV ovarian cancer, respectively.
Serum set 2:
Serum samples from the following volunteers were collected (at the time of diagnosis, before treatment):
healthy volunteers (n = 20, mean age = 42.8 years);
women with benign pelvic masses (n = 34, mean age = 40.0 years) with the following histologies: endometriosis (n = 7), fibroids (n = 8), pelvic inflammatory disease or pelvic abscess (n = 9), serous cystadenoma (n = 5), and mucinous cystadenoma (n = 5);
patients with borderline ovarian tumors (n = 11, mean age = 47.3 years): mucinous (n = 6) and serous (n = 5) borderline tumors;
patients with ovarian cancers (n = 27, mean age = 62.9 years): endometrioid (n = 3), clear cell (n = 3), mucinous (n = 2) and HGS (n = 19) OCs; 10 and 17 women had a stage I/II and stage III/IV OC, respectively.
Serum set 3:
Serum samples from the following volunteers were collected (at the time of diagnosis, before treatment):
healthy volunteers (n = 21, mean age = 50.8 years);
women with benign pelvic masses (n = 119, mean age = 41.4 years) with the following histologies: endometriosis (n = 21), fibroids (n = 21), pelvic inflammatory disease or pelvic abscess (n = 7), serous cystadenoma (n = 20), mucinous cystadenoma (n = 20), and dermoid cysts (n = 30);
patients with borderline ovarian tumors (n = 27, mean age = 57.1 years): mucinous (n = 7) and serous (n = 20) borderline tumor;
patients with non-epithelial tumors (n = 5, mean age = 55.8 years): granulosa cell tumors;
patients with non-OCs (n = 37, mean age = 58.3 years): cervical (n = 10), endometrial (n = 20), and colorectal (n = 7) cancers;
Patients with OCs (n = 41, mean age = 59.6 years): endometrioid (n = 3) and clear cell (n = 5), mucinous (n = 4) and HGS (n = 29) OCS; 16 and 25 women had a stage I/II and stage III/IV OC, respectively.
CA125 analysis was performed using the CA125 Cobas immunoassay and platform (Roche Diagnostics, Burgess Hill, UK) by staff who had no access to clinical or DNAme data.
Patients (n = 25) at the Gynaecological Oncology Centre in Prague deemed not to be suitable for upfront surgery were recruited. The average age was 62.8 years. HGS OCs were the most prevalent histology (n = 23) and the remaining two patients had clear cell OCs. Eighteen and seven patients presented with a stage IIIC and IV OC, respectively. Twenty-four patients received Carboplatin-Paclitaxel combination chemotherapy and one patient received Carboplatin only. All but two patients had interval debulking surgery. Among the 23 patients, 14 had no residual disease, five had macroscopic residual disease, and four had microscopic residual disease (i.e. tumor reaches the edge of at least one of the resected specimens, according to TNM classification). Twelve patients were deemed to be platinum-sensitive (no recurrence within six months after successful completion of neoadjuvant and adjuvant chemotherapy and interval debulking surgery) and eight patients were deemed to be platinum-refractory (n = 2, no response to chemotherapy or progression on chemotherapy) or platinum-resistant (n = 6, recurrence within six months after successful completion of neoadjuvant and adjuvant chemotherapy and interval debulking surgery). For five patients, no data were available on platinum-sensitivity.
All serum samples were collected prospectively at the Charles University Hospital in Prague. Each patient provided three samples at the following time-points:
at the time of histological diagnosis, before chemotherapy;
three weeks after the first cycle of chemotherapy (immediately before the second cycle);
three weeks after the second cycle of chemotherapy (immediately before the third cycle).
CA125 analysis was performed using the CA125 Cobas immunoassay and platform (Roche Diagnostics, Burgess Hill, UK).
Among the 202,546 women, 101,359 women were randomized into the control arm of UKCTOCS (ClinicalTrial.gov registration, NCT00058032) between 2001 and 2005 [6, 7, 39]. Forty-three women developed an invasive epithelial OC within 2 years of serum sample donation and had at least 4 mL of non-hemolyzed serum available. Twenty-six, two, two, one, five, and seven women developed a HGS, mucinous, endometrioid, clear cell, carcinosarcoma, and a carcinoma not otherwise specified, respectively. The average age at sample donation was 63.9 years. Among the 43 women, 19 women were diagnosed within one year and 24 women were diagnosed 1–2 years after sample donation. Thirteen and 30 women were diagnosed with a stage I/II and stage III/IV cancer, respectively. For each of the 43 cases, three women who did not develop any cancer within the first five years after recruitment were matched with respect to age at recruitment, center, and month of recruitment (controls, n = 129) (see Additional file 1: Figure S1).
Blood samples from all UKCTOCS volunteers were spun down for serum separation after being couriered at room temperature to the central laboratory and were aliquoted and stored in liquid nitrogen vapor phase until they were thawed for this study. Only 1 mL of serum per UKCTOCS volunteer was available for cell-free DNA analysis. CA125 analysis was performed using the CA125 Cobas immunoassay and platform (Roche Diagnostics, Burgess Hill, UK). The study was approved by the local research ethics committees: UCL/UCLH Biobank for Studying Health & Disease NC09.13). All patients provided written consent for samples to be used in ethically approved secondary studies.
Isolation and bisulfite modification of DNA
DNA was isolated from tissue and serum samples at GATC Biotech (Konstanz, Germany). Tissue DNA was quantified using NanoDrop and Qubit (both Thermo Fisher Scientific, USA); the size was assessed by agarose gel electrophoresis. Serum DNA was quantified using the Fragment Analyzer and the High Sensitivity Large Fragment Analysis Kit (AATI, USA). DNA was bisulfite converted at GATC Biotech.
DNAme analysis in tissue
Genome-wide methylation analysis was performed either by the Illumina Infinium Human Methylation 450 K beadchip array (Illumina Inc., USA, WG-314-1003) as previously described [37, 38] or using RRBS at GATC Biotech. For the 450 K methylation data, we developed a pipeline in order to select the most promising cancer-specific differentially methylated regions (DMRs) that are most likely to fulfil the strict specificity criteria of a serum-based test (Additional file 2).
For RRBS, DNA was digested by the restriction endonuclease MspI that is specific for the CpG-containing motif CCGG; a size selection of the library provides an enhanced coverage for the CpG-rich regions including CpG islands, promoters, and enhancer elements [40, 41]. The digested DNA was adapter ligated, bisulfite-modified, and polymerase chain reaction (PCR)-amplified. The libraries were sequenced on Illumina’s HiSeq 2500 with 50 bp or 100 bp paired-end mode. Using Genedata Expressionist® for Genomic Profiling v9.1, we have established a bioinformatics pipeline for the detection of cancer-specific DMRs. The most promising DMRs have been taken forward for the development and validation of serum-based clinical assays (Additional file 2).
Targeted ultra-high coverage bisulfite sequencing of serum DNA
Targeted bisulfite sequencing libraries were prepared at GATC Biotech. In brief, bisulfite modification was performed with 1 mL serum equivalent. Modified DNA was used to test up to three different markers using a two-step PCR approach. Ultra-high coverage sequencing was performed on Illumina’s MiSeq or HiSeq 2500 with 75 bp or 125 bp paired-end mode (Additional file 2).
For DMR discovery, the data analysis pipelines are described within the respective sections in the Additional file 2. In brief, Genedata Expressionist® for Genomic Profiling was used to map reads to human genome version hg19, identify regions with tumor-specific methylation patterns, quantify the occurrence of those patterns, and calculate relative pattern frequencies per sample. Pattern frequencies were calculated as number of reads containing the pattern divided by total reads covering the pattern region. To find tumor-specific methylation patterns, we first determined the methylation pattern frequencies of all observed patterns in relevant genomic regions in different tissues. The algorithm that we developed scans the whole genome and identifies regions that contain at least ten aligned paired-end reads. These read bundles are split into smaller regions of interest which contain at least four CpGs in a stretch of, at most, 150 bp. For each region and tissue/sample, the absolute frequency (number of supporting reads) for all observed methylation patterns was determined (Fig. 2a). This led to tens of millions of patterns per tissue/sample. The patterns were filtered in a multi-step procedure to identify the methylation patterns which specifically occur in tumor samples. In order to increase sensitivity and specificity of our pattern discovery procedure, we pooled reads from different tumor or WBC samples, respectively, and scored patterns based on over-representation within tumor tissue. The results were summarized in the specificity score Sp, which reflects the cancer specificity of the patterns. After applying a cut-off of Sp ≥ 10, 2.6 million patterns for OC remained and were further filtered according to the various criteria demonstrated in Fig. 2b (and Additional file 2).
For the filtered unique cancer-specific patterns for OC identified in the Array (n = 19) and RRBS (n = 45) approach, respectively, bisulfite sequencing primers have been designed and technically validated, eventually leading to 31 candidate markers (Additional file 3: Table S1). Quantification and analysis of DNAme patterns were performed by staff who, at the time of analysis, had no access to CA125 data or clinical information. Furthermore, staff who performed the CA125 analyses had no access to clinical information at the time of the analyses. Only samples with valid values in the respective DNAme marker (no samples were excluded based on coverage) and CA125 assays (all samples had a valid value) were included in the calculation of the respective sensitivities and specificities. The 95% CI intervals for sensitivity and specificity have been calculated according to the efficient-score method . Differences in pattern frequencies or coverage have been analyzed using the Mann–Whitney U test.
The samples, techniques, and purpose of the three phases—marker discovery, assay development, and test validation—are summarized in Fig. 1.
DNA methylation marker discovery in tissue
We have used two independent epigenome-wide approaches in order to discover DMRs which have the potential to diagnose OC with high sensitivity and specificity. First, Illumina Infinium Human Methylation450 BeadChip Array (450 K) technology was used to interrogate the methylation status of ~ 485,000 genomic sites in 218 OC  and 438 control samples (Fig. 1 and Additional file 2). A set of 19 high scoring and ranking DMRs were selected for targeted-BS-based serum assay development. Additional file 1: Figure S2 shows an example of a selected top DMR (reaction #228). Second, based on RRBS, we developed a method for methylation quantification that takes advantage of the sequencing-specific information (i.e. the individual methylation status of all sequenced CpGs on every single DNA molecule) to overcome the challenges of using serum-based samples (i.e. relatively few tumor derived molecules in a large background of non-tumor DNA). To achieve sufficient specificity in this setting, our algorithm selects markers that are combinations of four or more CpGs on a single molecule, which show tumor specific methylation. While “background methylation” might be observed on each of these CpGs and also in WBC DNA, for example, it is much less likely that such background methylation of all measured CpGs will be observed in a single DNA molecule derived from WBCs. The analysis of single molecules also enabled us to select patterns that were not observed in any of the WBC samples analyzed, i.e. had 100% specificity in our discovery tissue sample set. Further, to achieve sufficient sensitivity in a liquid biopsy test, we restricted our markers to CpG patterns within a 150-bp window, which would allow for good PCR amplification as well as the increased likelihood of obtaining DNA fragments containing all required CpGs in apoptotic or necrotic, respectively, circulating tumor DNA (ctDNA). Finally, we generalized our algorithm so that it would also detect methylation patterns that are hypomethylated in tumors or heterogeneously methylated, respectively. This approach, together with some additional selection criteria described in Fig. 2 and Additional file 2, led to 45 marker candidates that could be utilized for the development of DNAme assays suitable for liquid biopsies testing.
Further analysis of all patterns occurring within the marker regions revealed that, while the selected, fully methylated patterns were generally more specific, truncated versions of these fully methylated patterns within the same regions (i.e. overlapping patterns including other, more, or less CpGs, respectively) showed very similar pattern frequencies in the samples analyzed (Additional file 1: Figure S3). Patterns from these regions containing one or more unmethylated CpGs were generally less specific. Heterogeneously methylated patterns in other regions were also detected (not shown), but have been filtered out in subsequent steps shown in Fig. 2.
Serum DNAme assay establishment
We used ultra-deep BS sequencing (Fig. 2c) to develop serum assays for the candidate regions in 59 serum samples from Set 1 (Fig. 1 and Additional file 1: Figure S4 and Additional file 2). Based on sensitivity and specificity (assessed by area under the receiver operating characteristic curve [AUC]), nine markers have been selected for further validation in Set 2 (n = 92; Additional file 1: Figure S5). In Sets 1 and 2 combined, the specificity and sensitivity of the top four candidate markers referred to regions #141, #144, #204, and #228 (#228 was only analyzed in Set 2) to discriminate HGS OC from healthy women or those with a benign pelvic mass was 95.7%/42.4%, 93.5%/48.5%, 100%/25.0%, and 100%/36.8%, respectively (pattern frequency thresholds were set at 0.0008, 0.0001, 0.0001, and 0.0001, respectively). Interestingly, region #144 has already been defined as a promising cell-free DNA marker for cancer, particularly in OC [43, 44]. For three (i.e. #144, #204, and #228) of these four regions, CpGs were analyzed on the 450 k methylation array; using these data we demonstrated that aberrant methylation can already be detected in early stage cancers (i.e. stage I and II; Additional file 1: Figure S6). Due to limited serum volume in our validation sets, we chose a combination of three markers. The combination of regions #141, #204, and #228 (at least one of these regions with a pattern frequency above the aforementioned threshold) resulted in a 98.1% specificity and a 63.2% sensitivity. These regions are linked to genes COL23A1, C2CD4D, and WNT6, respectively.
Clinical validation of the serum DNAme assay
We validated the combination of the three markers in Set 3 (Fig. 3a–c) alongside the CA125 serum marker (Fig. 3d). The average coverage (i.e. DNA strand reads by the sequencer for each sample and region) is > 500,000 (Additional file 1: Figure S7). Applying the above indicated cut-off thresholds for the three DNAme markers and 35 IU/mL for serum CA125 led to specificities of 90.7% and 87.1% and sensitivities of 41.4% and 82.8%, respectively (Table 1). Due to the fact that reaction #228 was only analyzed in Set 2, we combined Set 2 and Set 3 in order to redefine the thresholds. Whereas for #141 the threshold of 0.0008 remained unchanged, for #204 and #228 we further lowered the pattern frequency threshold to 0.00003 and 0.00001, respectively, leading to specificity and sensitivity of 91.8% and 58.3%, respectively (Table 1). Among these 48 HGS cancers (i.e. the most aggressive cancers), 6/11 (54.5%) stage I/II and 22/37 (59.5%) stage III/IV cancers were serum DNAme-positive. Importantly, there was no overlap between the DNAme-positive and CA125-false positive controls (Table 2).
Serum DNAme to predict response to platinum-based NACT
In order to further assess the cancer specificity and the dynamics of our three candidate markers in individual patients, we recruited 25 OC patients who received carboplatin-based NACT. Compared with the pre-treatment sample, all three DNAme markers decreased substantially and to a larger extent compared to CA125 after one and two cycles (Fig. 4a–d and Additional file 1: Figures S8–S10). Whereas CA125 dynamics were not a strong discriminator between chemotherapy-responders and non-responders (Table 3), serum DNAme dynamics (i.e. serum DNAme as defined in Sets 2 and 3, before chemotherapy compared to after two cycles) correctly identified 78% and 86% of responders and non-responders (Fisher’s exact test, p = 0.04) overall and 78% and 100% of responders and non-responders among those women who were left without residual disease after interval debulking surgery (Fisher’s exact test, p = 0.007) (Table 3).
Serum DNAme for early diagnosis of OC
In order to judge whether our marker panel is, in fact, capable of diagnosing OC early, samples predating OC diagnosis by up to two years (cases) and matched controls were used from the control (no screening) arm of the UKCTOCS cohort. The median time from venepuncture to serum separation was 21.97 h (interquartile range [IQR] = 19.91–24.34 h) for this sample set. As expected, both the amount of DNA/mL serum as well as the average DNA fragment size were substantially higher in UKCTOCS samples compared with the other samples used in this study (Fig. 5a and b). This is potentially due to DNA from WBCs leaking into the serum during the sample transport time, particularly during the warmer months of the year (Additional file 1: Figure S11). Nevertheless, a small proportion (on average, 19.9%) of the DNA consisted of smaller (50–250 bp) fragments representing DNA from apoptotic cells (including DNA from cancer cells) (Additional file 1: Figure S12). The “contaminating” majority of high-quality DNA not only dilutes the cancer signal but also skews the target sequence amplification towards WBC DNA. In order to adjust for these factors, we made an a priori decision to reduce the threshold for the three regions by a factor of 3 and split the analyses in samples above (high) and below (low) the median amount of DNA (Table 4). The three DNAme-marker panel was able to identify cases with a specificity of 88% and a sensitivity of 58%, when specifically assessed in samples with a DNA concentration lower than the median ng/uL value, and importantly which predated cancer diagnosis by up to two years (Table 4B and Additional file 3: Table S2). The sensitivity of the panel improved from 58% to 64% when exclusively assessing CA125-negative (<35 IU/mL) samples. As previously observed in the Set 3 analysis there is no overlap between CA125 and DNAme false-positive controls (Table 5). When directly comparing the performance of CA125 (applying a cut-off of 35 IU/mL) with the DNAme panel specifically in the “low” DNA samples, the DNAme panel had higher sensitivity (57.9% vs 42.1%) but lower specificity (88.1% vs 95.5%) compared to that of CA125 for the early detection of OC (Additional file 3: Table S3).
We identified cancer-specific DNAme patterns in tissue and developed serum assays which we validated in various settings. Our findings suggested that cell-free DNA has the potential to detect ovarian cancer up to two years in advance of clinical diagnosis. Nevertheless, a cell-free DNA based test will have to overcome several hurdles before clinical implementation.
In healthy individuals, cell-free DNA is present at concentrations in the range of 0–100 ng/mL and an average of 30 ng/mL . DNA derived from tumor cells is shorter than that from non-malignant cells in the plasma of cancer patients . Our overarching goal was to develop DNAme-based markers for early OC detection. In order to realize this aim, samples must be derived from large population-based screening studies, such as the UKCTOCS trial, that have samples banked years ahead of diagnosis. Serum samples from ~ 100,000 women need to be collected to accrue sufficient OC numbers (i.e. a range of 40–50). Within the UKCTOCS, which involved collection and banking of serum samples from over 200,000 women, whole blood samples were couriered to the central laboratory within 2–48 h. Prospectively collected blood samples were spun down 2–12 hours after collection to mimic the UKCTOCS setting. The UKCTOCS, and to a lesser extent the other prospectively collected sets, contained higher than average amounts of cell-free DNA and fragments were longer on average. Both factors reflect the leakage of WBC DNA into serum. In order to compensate for this, we aimed for an extremely high coverage but noted that four of the UKCTOCS samples had at least one of the three markers with a coverage 100,000; we had made an upfront decision not to exclude any of the samples based on lower coverage but note that this is one of the limitations of this study. Despite these complicating factors the three-DNAme marker panel outperformed CA125 using a 35 IU/mL cut-off in detecting OC early in the group of women who had a DNA concentration lower than the median ng/uL value.
In order to provide further functional proof that the newly developed serum DNAme marker panel is cancer-specific and able to indicate the presence of active OC, despite competing with high levels of background WBC DNA inherent within the trial samples analyzed, we demonstrated that our serum DNAme-dynamics correctly identified 7/9 and 6/7 Carboplatin responders and non-responders, respectively.
As we did not observe an overlap between false-positive CA125 and false-positive DNAme samples, it is probable that DNAme false positivity is largely triggered by technical artefacts as a result of extremely low thresholds down to a pattern frequency of 0.000003 (i.e. three cancer patterns in the background of 1,000,000 DNA fragments with a non-cancer pattern). Of note is that for the serum sets, which have been prospectively collected within EpiFemCare, there was a substantial age difference between women who presented with benign pelvic masses and women who presented with OC. This age skew was completely intentional as our main purpose was to benchmark DNAme markers against CA125 false-positive controls and to assess whether CA125 false-positive controls are also DNAme-false positive. The main sources of false positivity are endometriosis, pelvic inflammatory disease, and fibroids—all conditions which are substantially more prevalent (or occur exclusively) in premenopausal (i.e. younger women), whereas OC is far more prevalent in older women. False CA125-positivity can usually be explained by a CA125-producing benign condition .
At the UKCTOCS prevalence screen , the ROCA identified elevated/intermediate risk in 0.93% of women, of whom 0.9% (42/4642) were diagnosed after repeat CA125 testing, ultrasound, additional imaging, and clinical assessment. Applying the three-marker DNAme test, with a conservative (i.e. excessive background DNA will not be an issue in prospective samples) estimate of specificity and sensitivity of 90% and 60%, respectively, as a second line test to ROCA-elevated women at risk could substantially decrease the time to diagnosis in at least half the women with OC.
OC is a low prevalence disease (i.e. lifetime risk in the general female population is 1–2 per 100 women ). The consequence of a positive screening test is an operation under general anesthesia (i.e. laparoscopic or open) to remove one or both ovaries/Fallopian Tubes. Hence, a high specificity of the screening test is of the utmost importance because the positive predictive value strongly depends on the prevalence of the disease and the specificity of the test . Using a highly sensitive marker panel, able to detect > 80% of stage I/II OCS (i.e. a combination of CA125 and HE4 ), to pre-screen the entire population in order to narrow down the group of women who have a high likelihood of OC followed by the cell-free DNA test is a highly promising strategy to achieve a stage shift with at least 50% of cancers (instead of 25% in the absence of screening) diagnosed in stage I/II.
In addition to the use of serum and high levels of contaminating normal DNA from blood cells, the current work has some further limitations. First, the number of samples (specifically when considering only the low-DNA samples) in the UKCTOCS cohort was limited. Second, we were unable to assess whether the panel is specific for OC or whether it may additionally detect other cancer types. In serum set 3, we also analyzed serum samples from 37 patients with non-OCs (ten, 20, and seven with cervical, endometrial, and colorectal cancers, respectively). Two of ten (20%), 5/20 (25%), and 1/7 (14%) were deemed positive based on the final three-marker panels. This may indicate that our panel also detects other cancers arising from the Mullerian tract (i.e. cervical and endometrial cancers). In order to further elaborate on the aforementioned, we assessed the TCGA data. Whereas there was no CpG site on the 450 k methylation array for region #141, for regions #204 and #228 there were two (cg15015892 and cg05021743) and one (cg22344703) CpGs, respectively, represented on the Illumina array. Also, other cancers could potentially be identified using these markers (Additional file 1: Figure S13). Third, we did not directly compare the methylation levels in the primary tumor and the matched serum samples. As we have shown (Additional file 1: Figure S6), methylation levels in the primary cancers are relatively homogenous across different stages of OCs. Hence, any differences in methylation levels detected in the serum reflect conditions such as cancer cell turnover, release of cell-free DNA via lymph-vessels into the bloodstream, and half-life in the circulation—all factors which cannot be assessed by directly measuring DNAme in the cancer.
Our method of defining tumor-specific methylation patterns and quantifying the molecules exhibiting such patterns, instead of determining methylation levels, shows promising results regarding its applicability in liquid biopsy testing. While, in this study, the most promising tumor markers were all fully methylated, the method, per se, is not biased towards hyper- or uniformly methylated patterns and, as such, is also applicable to situations where the markers of interest show hypo- or heterogenous methylation, respectively.
Overall, our study provides a proof of principle that serum DNAme markers have the potential to detect OC within two years in advance of diagnosis and may therefore be able to guide personalized OC treatment. The recent advance of purpose-made blood collection tubes that stabilize cell-free DNA and prevent leakage of DNA from blood cells  will facilitate clinical implementation of DNAme pattern detection in cell-free DNA as a clinical tool in cancer medicine. In addition, recent evidence demonstrates that using DNAme patterns will allow for tissue-of-origin mapping in circulating cell-free DNA [51, 52] which supports the view that a DNAme marker panel is likely to cover a number of tumor-entities.
Overall, and for the first time, our study suggests that serum DNAme markers have the potential to diagnose OCs up to two years in advance of current diagnosis and may potentially enable individualized OC treatment. The recent advance of purposed blood tubes will facilitate clinical implementation of DNAme pattern detection of cell-free DNA as a clinical tool in cancer medicine.
Area under the curve
Benign pelvic mass
High grade serous
Receiver operating characteristic
White blood cell
Hennessy BT, Coleman RL, Markman M. Ovarian cancer. Lancet. 2009;374(9698):1371–82.
Bowtell DD, Bohm S, Ahmed AA, Aspuria PJ, Bast Jr RC, Beral V, et al. Rethinking ovarian cancer II: reducing mortality from high-grade serous ovarian cancer. Nat Rev Cancer. 2015;15(11):668–79.
Bast Jr RC, Klug TL, St JE, Jenison E, Niloff JM, Lazarus H, et al. A radioimmunoassay using a monoclonal antibody to monitor the course of epithelial ovarian cancer. N Engl J Med. 1983;309(15):883–7.
Buys SS, Partridge E, Black A, Johnson CC, Lamerato L, Isaacs C, et al. Effect of screening on ovarian cancer mortality: the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Randomized Controlled Trial. JAMA. 2011;305(22):2295–303.
Cramer DW, Bast Jr RC, Berg CD, Diamandis EP, Godwin AK, Hartge P, et al. Ovarian cancer biomarker performance in prostate, lung, colorectal, and ovarian cancer screening trial specimens. Cancer Prev Res (Phila). 2011;4(3):365–74.
Jacobs IJ, Menon U, Ryan A, Gentry-Maharaj A, Burnell M, Kalsi JK, et al. Ovarian cancer screening and mortality in the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS): a randomised controlled trial. Lancet. 2016;387(10022):945–56.
Menon U, Ryan A, Kalsi J, Gentry-Maharaj A, Dawnay A, Habib M, et al. Risk algorithm using serial biomarker measurements doubles the number of screen-detected cancers compared with a single-threshold rule in the United Kingdom Collaborative Trial of Ovarian Cancer Screening. J Clin Oncol. 2015;33(18):2062–71.
Vallius T, Hynninen J, Auranen A, Carpen O, Matomaki J, Oksa S, et al. Serum HE4 and CA125 as predictors of response and outcome during neoadjuvant chemotherapy of advanced high-grade serous ovarian cancer. Tumour Biol. 2014;35(12):12389–95.
Le T, Hopkins L, Faught W, Fung-Kee-Fung M. The lack of significance of Ca125 response in epithelial ovarian cancer patients treated with neoadjuvant chemotherapy and delayed primary surgical debulking. Gynecol Oncol. 2007;105(3):712–5.
Pelissier A, Bonneau C, Chereau E, DE LA Motte RT, Fourchotte V, Darai E, et al. Dynamic analysis of CA125 decline during neoadjuvant chemotherapy in patients with epithelial ovarian cancer as a predictor for platinum sensitivity. Anticancer Res. 2016;36(4):1865–71.
Pelissier A, Bonneau C, Chereau E, DE LA Motte RT, Fourchotte V, Darai E, et al. CA125 kinetic parameters predict optimal cytoreduction in patients with advanced epithelial ovarian cancer treated with neoadjuvant chemotherapy. Gynecol Oncol. 2014;135(3):542–6.
Dawson SJ, Tsui DW, Murtaza M, Biggs H, Rueda OM, Chin SF, et al. Analysis of circulating tumor DNA to monitor metastatic breast cancer. N Engl J Med. 2013;368(13):1199–209.
Murtaza M, Dawson SJ, Tsui DW, Gale D, Forshew T, Piskorz AM, et al. Non-invasive analysis of acquired resistance to cancer therapy by sequencing of plasma DNA. Nature. 2013;497(7447):108–12.
Wang Y, Springer S, Mulvey CL, Silliman N, Schaefer J, Sausen M, et al. Detection of somatic mutations and HPV in the saliva and plasma of patients with head and neck squamous cell carcinomas. Sci Transl Med. 2015;7(293):293ra104.
Siravegna G, Mussolin B, Buscarino M, Corti G, Cassingena A, Crisafulli G, et al. Clonal evolution and resistance to EGFR blockade in the blood of colorectal cancer patients. Nat Med. 2015;21(7):827.
Bettegowda C, Sausen M, Leary RJ, Kinde I, Wang Y, Agrawal N, et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci Transl Med. 2014;6(224):224ra224.
Lanman RB, Mortimer SA, Zill OA, Sebisanovic D, Lopez R, Blau S, et al. Analytical and clinical validation of a digital sequencing panel for quantitative, highly accurate evaluation of cell-free circulating tumor DNA. PLoS One. 2015;10(10):e0140712.
Sun K, Jiang P, Chan KC, Wong J, Cheng YK, Liang RH, et al. Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments. Proc Natl Acad Sci U S A. 2015;112(40):E5503–12.
Bartlett TE, Chindera K, McDermott J, Breeze CE, Cooke WR, Jones A, et al. Epigenetic reprogramming of fallopian tube fimbriae in BRCA mutation carriers defines early ovarian cancer evolution. Nat Commun. 2016;7:11620.
Teschendorff AE, Jones A, Fiegl H, Sargent A, Zhuang JJ, Kitchener HC, et al. Epigenetic variability in cells of normal cytology is associated with the risk of future morphological transformation. Genome Med. 2012;4(3):24.
Teschendorff AE, Widschwendter M. Differential variability improves the identification of cancer risk markers in DNA methylation studies profiling precursor cancer lesions. Bioinformatics. 2012;28(11):1487–94.
Teschendorff AE, Gao Y, Jones A, Ruebner M, Beckmann MW, Wachter DL, et al. DNA methylation outliers in normal breast tissue identify field defects that are enriched in cancer. Nat Commun. 2016;7:10478.
Baylin SB, Jones PA. A decade of exploring the cancer epigenome - biological and translational implications. Nat Rev Cancer. 2011;11(10):726–34.
Campan M, Moffitt M, Houshdaran S, Shen H, Widschwendter M, Daxenbichler G, et al. Genome-scale screen for DNA methylation-based detection markers for ovarian cancer. PLoS One. 2011;6(12):e28141.
deVos T, Tetzner R, Model F, Weiss G, Schuster M, Distler J, et al. Circulating methylated SEPT9 DNA in plasma is a biomarker for colorectal cancer. Clin Chem. 2009;55(7):1337–46.
Fackler MJ, Lopez BZ, Umbricht C, Teo WW, Cho S, Zhang Z, et al. Novel methylated biomarkers and a robust assay to detect circulating tumor DNA in metastatic breast cancer. Cancer Res. 2014;74(8):2160–70.
Fiegl H, Millinger S, Mueller-Holzner E, Marth C, Ensinger C, Berger A, et al. Circulating tumor-specific DNA: a marker for monitoring efficacy of adjuvant therapy in cancer patients. Cancer Res. 2005;65(4):1141–5.
Fiegl H, Jones A, Hauser-Kronberger C, Hutarew G, Reitsamer R, Jones RL, et al. Methylated NEUROD1 promoter is a marker for chemosensitivity in breast cancer. Clin Cancer Res. 2008;14(11):3494–502.
Muller HM, Widschwendter A, Fiegl H, Ivarsson L, Goebel G, Perkmann E, et al. DNA methylation in serum of breast cancer patients: an independent prognostic marker. Cancer Res. 2003;63(22):7641–5.
Muller HM, Fiegl H, Widschwendter A, Widschwendter M. Prognostic DNA methylation marker in serum of cancer patients. Ann N Y Acad Sci. 2004;1022:44–9.
Potter NT, Hurban P, White MN, Whitlock KD, Lofton-Day CE, Tetzner R, et al. Validation of a real-time PCR-based qualitative assay for the detection of methylated SEPT9 DNA in human plasma. Clin Chem. 2014;60(9):1183–91.
Shivapurkar N, Gazdar AF. DNA methylation based biomarkers in non-invasive cancer screening. Curr Mol Med. 2010;10(2):123–32.
Warton K, Mahon KL, Samimi G. Methylated circulating tumor DNA in blood: power in cancer prognosis and response. Endocr Relat Cancer. 2016;23(3):R157–71.
Wittenberger T, Sleigh S, Reisel D, Zikan M, Wahl B, Alunni-Fabbroni M, et al. DNA methylation markers for early detection of women's cancer: promise and challenges. Epigenomics. 2014;6(3):311–27.
Norton ME, Jacobsson B, Swamy GK, Laurent LC, Ranzini AC, Brar H, et al. Cell-free DNA analysis for noninvasive examination of trisomy. N Engl J Med. 2015;372(17):1589–97.
Bartlett TE, Jones A, Goode EL, Fridley BL, Cunningham JM, Berns EM, et al. Intra-gene DNA methylation variability is a clinically independent prognostic marker in women’s cancers. PLoS One. 2015;10(12):e0143178.
Teschendorff AE, Lee SH, Jones A, Fiegl H, Kalwa M, Wagner W, et al. HOTAIR and its surrogate DNA methylation signature indicate carboplatin resistance in ovarian cancer. Genome Med. 2015;7(1):108.
Teschendorff AE, Yang Z, Wong A, Pipinikas CP, Jiao Y, Jones A, et al. Correlation of smoking-associated DNA methylation changes in buccal cells with DNA methylation changes in epithelial cancer. JAMA Oncol. 2015;1(4):476–85.
Menon U, Gentry-Maharaj A, Hallett R, Ryan A, Burnell M, Sharma A, et al. Sensitivity and specificity of multimodal and ultrasound screening for ovarian cancer, and stage distribution of detected cancers: results of the prevalence screen of the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS). Lancet Oncol. 2009;10(4):327–40.
Gu H, Smith ZD, Bock C, Boyle P, Gnirke A, Meissner A. Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling. Nat Protoc. 2011;6(4):468–81.
Lee YK, Jin S, Duan S, Lim YC, Ng DP, Lin XM, et al. Improved reduced representation bisulfite sequencing for epigenomic profiling of clinical samples. Biol Proced Online. 2014;16(1):1.
Newcombe RG. Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat Med. 1998;17(8):857–72.
Sanchez-Vega F, Gotea V, Petrykowska HM, Margolin G, Krivak TC, DeLoia JA, et al. Recurrent patterns of DNA methylation in the ZNF154, CASP8, and VHL promoters across a wide spectrum of human solid epithelial tumors and cancer cell lines. Epigenetics. 2013;8(12):1355–72.
Margolin G, Petrykowska HM, Jameel N, Bell DW, Young AC, Elnitski L. Robust detection of DNA hypermethylation of ZNF154 as a pan-cancer locus with in silico modeling for blood-based diagnostic development. J Mol Diagn. 2016;18(2):283–98.
Gormally E, Caboux E, Vineis P, Hainaut P. Circulating free DNA in plasma or serum as biomarker of carcinogenesis: practical aspects and biological significance. Mutat Res. 2007;635(2–3):105–17.
Jiang P, Lo YM. The long and short of circulating cell-free DNA and the ins and outs of molecular diagnostics. Trends Genet. 2016;32(6):360–71.
Hirsch M, Duffy J, Davis CJ, Nieves PM, Khan KS. Diagnostic accuracy of cancer antigen 125 for endometriosis: a systematic review and meta-analysis. BJOG. 2016;123:1761–8.
Pearce CL, Stram DO, Ness RB, Stram DA, Roman LD, Templeman C, et al. Population distribution of lifetime risk of ovarian cancer in the United States. Cancer Epidemiol Biomarkers Prev. 2015;24(4):671–6.
Simmons AR, Clarke CH, Badgwell DB, Lu Z, Sokoll LJ, Lu KH, et al. Validation of a biomarker panel and longitudinal biomarker performance for early detection of ovarian cancer. Int J Gynecol Cancer. 2016;26(6):1070–7.
Kang Q, Henry NL, Paoletti C, Jiang H, Vats P, Chinnaiyan AM, et al. Comparative analysis of circulating tumor DNA stability in KEDTA, Streck, and Cell Save blood collection tubes. Clin Biochem. 2016;49:1354–60.
Guo S, Diep D, Plongthongkum N, Fung HL, Zhang K, Zhang K. Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA. Nat Genet. 2017;49(4):635–42.
Lehmann-Werman R, Neiman D, Zemmour H, Moss J, Magenheim J, Vaknin-Dembinsky A, et al. Identification of tissue-specific cell death using methylation patterns of circulating DNA. Proc Natl Acad Sci U S A. 2016;113(13):E1826–1834.
We thank Aleksandra Gentry-Maharaj, Marina Koßmann-Plasa, Verena Schindler, Sascha Vugrinec, Julia Bottlang, Eleni Pachygiannaki, Nadine Mall, Efrain Manilla-Pérez, and Jan Henkel for excellent technical assistance. Further, thanks go to Kapil Jhalani, Yadhu Kumar, and Fabian Ripp for their assistance in data analysis. We are also grateful to Markus Schmitt, Marcus Benz, Kerstin Stangier, and Kerstin Rönsch for their support and fruitful discussions. We thank all the trial participants for their generosity in donating samples for secondary studies and all the staff involved in the UKCTOCS trial.
This work was funded by the European Union’s Seventh Framework Programme (FP7/2007-2013) under grant agreement number 305428 (Project EpiFemCare). MW also received support from The Eve Appeal (https://eveappeal.org.uk/). Part of this research was undertaken at UCLH/UCL, which received a proportion of its funding from the Department of Health NIHR Biomedical Research Centres funding scheme.
UKCTOCS was funded by the Medical Research Council (G9901012 and G0801228), Cancer Research UK (C1479/A2884), and the Department of Health, with additional support from The Eve Appeal.
Availability of data and materials
The RRBS data are made available in the European Genome-Phenome Archive (Accession number: EGAS00001002609) in order to allow for controlled access.
Ethics approval and consent to participate
Samples were collected prospectively at the University College London Hospital in London and at the Charles University Hospital in Prague and the Department of Gynaecology and Obstetrics. The study was approved by the local research ethics committees: UCL/UCLH Biobank for Studying Health & Disease NC09.13) and the ethics committee of the General University Hospital, Prague approval No.: 22/13 GRANT – 7. RP – EPI-FEM-CARE. All patients provided written informed consent. The UKCTOCS study was approved by the local research ethics committees (UCL/UCLH Biobank for Studying Health & Disease NC09.13) and was approved as part of trial approval by the UK North West Multicentre Research Ethics Committees (North West MREC 00/8/34). All patients provided written informed consent and all studies were conducted in accordance with the Declaration of Helsinki.
Consent for publication
HL is an employee and shareholder of Genedata AG. BW is affiliated with a private company. UM has stock ownership in and research funding from Abcodia Pvt Ltd which has an interest in cancer biomarkers. The remaining authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Design of the nested case-control study based on the UKCTOCS Cohort. Figure S2. DMR discovery with Illumina 450 K methylation arrays. Figure S3. Pattern counts for informative regions. Figure S4. Pattern frequencies for the different regions analyzed in serum set 1 samples. Figure S5. Pattern frequencies for the different regions analyzed in serum set 2 samples. Figure S6. DNA methylation for regions #144, #204, and #228 according to OC stages. Figure S7. Coverage (number of reads) for the three different regions analyzed in serum set 3 samples. Figure S8. CA125 levels measured in NACT serum set samples. Figure S9. Pattern frequencies for the top three reactions measured in NACT serum set samples. Figure S10. Coverage (number of reads) for the top three reactions measured in NACT serum set samples. Figure S11. Average DNA amount extracted correlates with average UK temperature. Figure S12 The fraction (%) of small fragment (50–250 bp) DNA in the serum DNA preparation for 171 UKCTOCS samples analyzed in the study. Figure S13. Box plots comparing the average beta values for 450 k array probes within regions #204 and #228 between each normal (N), cancer (C) group, and white blood cell (WBC) data for OC and other 19 TCGA cancer types. (DOCX 3024 kb)
Supplementary Material and Methods: Additional details of samples sets, methods, and analyses. (DOCX 75 kb)
Coordinates and primers to amplify the identified target region. Table S2. Performance of the serum DNA methylation marker panel in a population-based cohort to diagnose OC early. Table S3. Direct comparison of the performance of the CA125 and the three-marker DNAme panel to predict OC development in the UKCTOCS cohort. (DOCX 29 kb)