Skip to main content

A one-year genomic investigation of Escherichia coli epidemiology and nosocomial spread at a large US healthcare network



Extra-intestinal pathogenic Escherichia coli (ExPEC) are a leading cause of bloodstream and urinary tract infections worldwide. Over the last two decades, increased rates of antibiotic resistance in E. coli have been reported, further complicating treatment. Worryingly, specific lineages expressing extended-spectrum β-lactamases (ESBLs) and fluoroquinolone resistance have proliferated and are now considered a serious threat. Obtaining contemporary information on the epidemiology and prevalence of these circulating lineages is critical for containing their spread globally and within the clinic.


Whole-genome sequencing (WGS), phylogenetic analysis, and antibiotic susceptibility testing were performed for a complete set of 2075 E. coli clinical isolates collected from 1776 patients at a large tertiary healthcare network in the USA between October 2019 and September 2020.


The isolates represented two main phylogenetic groups, B2 and D, with six lineages accounting for 53% of strains: ST-69, ST-73, ST-95, ST-131, ST-127, and ST-1193. Twenty-seven percent of the primary isolates were multidrug resistant (MDR) and 5% carried an ESBL gene. Importantly, 74% of the ESBL-E.coli were co-resistant to fluoroquinolones and mostly belonged to pandemic ST-131 and emerging ST-1193. SNP-based detection of possible outbreaks identified 95 potential transmission clusters totaling 258 isolates (12% of the whole population) from ≥ 2 patients. While the proportion of MDR isolates was enriched in the set of putative transmission isolates compared to sporadic infections (35 vs 27%, p = 0.007), a large fraction (61%) of the predicted outbreaks (including the largest cluster grouping isolates from 12 patients) were caused by the transmission of non-MDR clones.


By coupling in-depth genomic characterization with a complete sampling of clinical isolates for a full year, this study provides a rare and contemporary survey on the epidemiology and spread of E. coli in a large US healthcare network. While surveillance and infection control efforts often focus on ESBL and MDR lineages, our findings reveal that non-MDR isolates represent a large burden of infections, including those of predicted nosocomial origins. This increased awareness is key for implementing effective WGS-based surveillance as a routine technology for infection control.


Extra-intestinal pathogenic Escherichia coli (ExPEC) are a leading cause of healthcare-associated urinary tract and bloodstream infections [1, 2]. Diseases caused by multidrug-resistant (MDR) strains are associated with poor patient outcomes, including high morbidity and mortality, and higher healthcare costs [3,4,5]. In recent years, resistance to commonly prescribed antibiotics has increased in E. coli infections in the USA [e.g., 1.2 to 25% prevalence of fluoroquinolone resistance in the past 15 years [6, 7]] and internationally [1, 3, 8, 9]. Importantly, resistance to 3rd- and 4th-generation cephalosporins, due to the acquisition and horizontal spread of extended-spectrum β-lactamase (ESBL) genes, has increased in both healthcare and community settings [10]. This alarming rise prompted the US Centers for Disease Control and Prevention to identify the ESBL-producing E. coli as a serious threat and urging increased surveillance efforts [11].

Previous molecular studies have separated E. coli into phylogenetic groups, including A, B1, B2, C, D, E, and F, with ExPEC (and consequently the specialized uropathogenic [UPEC] pathotype) largely belonging to phylogroups B2 and D [12, 13]. Multilocus sequence typing (MLST) provides further characterization of E. coli lineages and has led to the identification of specific, globally distributed sequence types (STs). For example, the ST-131 ExPEC lineage is widely distributed and associated with the emergence of fluoroquinolone resistance and frequent carriage of plasmid-bound ESBL genes [12, 14,15,16,17]. Besides resistances, recent studies suggest that the acquisition of virulence-associated genes also plays an integral role in the success and global emergence of ST-131 and other ExPEC lineages. These include a plethora of both structural (e.g., fimbriae, pili, curli, flagella) and secreted (e.g., toxins, iron-acquisition systems) virulence factors often enriched in non-MDR, UPEC lineages (e.g., ST-73, 95, and 127) [18,19,20].

The recent positioning of whole-genome sequencing (WGS) as a near-routine technology is creating a revolution in infection control and allows for targeted interventions to reduce the burden of healthcare-associated infections (HAIs). Such effort requires an understanding of the frequency of nosocomial transmission caused not only by MDR epidemic clones, but also by the more ubiquitous non-MDR lineages. While the latter are responsible for most E. coli infections, very few genome-based studies have examined their role in nosocomial transmission. Instead, most investigations have been performed on small cohorts, often limited to ESBL-producing isolates, which likely underrepresents the extent of E. coli nosocomial transmission events [21].

Here, we retrospectively genome-sequenced and analyzed a complete set of 2075 E. coli clinical isolates collected from 1776 patients over a 12-month period from a large military healthcare network in the Northeast United States. Genome-based detection of possible outbreak clusters revealed extensive roles for non-MDR lineages in suspected nosocomial transmissions, while in-depth phylogenetic, genotypic, and phenotypic characterization revealed a detailed picture of the epidemiology, population structure, and prevalence of resistances in E. coli in this region.


Isolation and phenotypic characterization of E. coli collection

A total of 2075 E. coli isolates (including serial isolates from the same patient) cultured from all clinical specimens of 1776 patients receiving care in the National Capitol Medical Region healthcare network between October 2019 and September 2020 were collected. Of note, no stool isolates were collected as these samples are not routinely sent for culture in the microbiology lab of this hospital and are instead analyzed by molecular and/or antigen diagnostic procedures. Antibiotic susceptibility testing (AST) was performed in a College of American Pathologists (CAP)-certified laboratory using the BD Phoenix (panel NMIC/ID304; BD Diagnostics), which encompasses 18 antibiotics from 11 different antibiotic classes. Where necessary, MICs were determined in triplicate using broth microdilution using Clinical and Laboratory Standards Institute (CLSI) guidelines [22]. Breakpoints were interpreted using CLSI guidelines (2018), with cefazolin MICs interpreted using breakpoints for complicated UTI/systemic infection [22]. Isolates with breakpoints interpreted as I or R were designated non-susceptible. To accurately calculate the prevalence of resistances in the population, a subset of 1828 primary isolates (first isolate of each ST per patient) was specifically used (Table 1).

Table 1 Prevalence of MDR and key resistances in E. coli isolates

Whole-genome sequencing

DNA extraction and WGS were performed as previously described [23]. In brief, genomes were generated for all 2075 isolates using an Illumina MiSeq platform with a 2×300 nt paired-end protocol or a NextSeq-500 platform with a 2×150 nt paired-end protocol. Libraries were prepared using the Kapa HyperPlus kit (Roche Diagnostics) and quantified using the Kapa library quantification kit Illumina/Bio-Rad iCycler (Roche Diagnostics) on a CFX96 real-time cycler (Bio-Rad). De novo assemblies were obtained using Newbler v2.7 (Roche Diagnostics). Minimum thresholds for contig size and coverage were set at 200 bp and 49.5+, respectively. Assembled sequences were annotated using Prokka v1.14.6 [24].

Bioinformatic analysis

Species identification was determined using Kraken2 (v2.0.8-β) [25] and E. coli phylogenetic groups were identified using EzClermont v0.6.3 [26, 27]. In silico ST detection was identified for all isolates using the Achtman MLST scheme through software [28]. This tool uses the PubMLST website [29] developed by Keith Jolley and sited at the University of Oxford. Novel ST was assigned using the MLST sequence archive at EnteroBase [30]. Serotyping and fimH typing were performed using the TORMES pipeline v1.3.0 [31] with the SerotypeFinder O-typing database [32] and FimTyper [33], respectively. Antimicrobial resistance genetic determinants were annotated using AMRFinderPlus [34] and ARIBA [35]. Plasmid replicons [36] and virulence-associated genes (based on the E. coli virulence-associated gene databases EcVGDB and VFDB [37, 38]) were identified using ABRicate [39].

Phylogenetic analysis

For the phylogeny of the diverse set of 123 ESBL-E. coli, the annotated [Prokka v1.14.6 [24]] assemblies were used as input for Roary v3.13.0 [40] and a SNP-based alignment of 2698 core genes was generated. For the phylogeny of the clonal set of 275 ST-131 E. coli, SNP calling was performed with Snippy v.4.4.5 [41] using error correction [Pilon v1.23 [42]] and the annotated genome of ST-131 E. coli EC958 (accession no. GCA_000285655.3) as a reference. For both approaches, recombination was filtered from the alignments using Gubbins v2.4.1 [43] and a maximum-likelihood tree was generated with RAxML v8.2.12 [44] using the GTR+G (50 parsimony, 50 random) model and 100 random bootstrap replicates. Trees were imported in iTOL v.5.5 [45] for visualization with metadata.

Finally, for the ST-131 phylogenetic analysis, clade designations (A, B, and C) were generally characterized by the carriage of type 1 fimbriae adhesion fimH alleles (fimH41, fimH22, and fimH30, respectively) and subclades C0, C1, and C2 based on SNP typing of genetic markers gyrA, parC, and ybbW genetic markers, as previously described [16, 46]. The G273A SNP in ybbW (subclade C2-specific allele) was identified using an individual gene alignment produced by Roary v3.13.0 [40].

Nosocomial transmission analysis

Detection of clusters of transmission was performed in two stages. First, cgMLST allele assignment and minimum spanning tree generation were performed with SeqSphere+ [47] using the E. coli cgMLST scheme developed by Zhou et al. [30]. The distance matrix from SeqSphere+ consisted of the pairwise allelic differences between all 2075 E. coli isolates. Using a threshold of ≤10 allelic differences, a level previously identified as indicative of potential E. coli transmission [48], 105 putative clusters of transmission were identified and comprised isolates from 2 distinct patients or more (Additional file 1: Table S1). Clusters of serial isolates from single patients were removed. Second, to further investigate these putative clusters, an internal reference genome (first isolate temporally) was picked and whole-genome SNP analysis was individually performed for the 105 clusters. Using a 17 SNP cutoff, a threshold previously identified between patient pairs sharing strong epidemiological links [9], 95 of the 105 original clusters were confirmed and were further analyzed in this report. To determine the prevalence of MDR isolates in the clusters, primary MDR cluster isolates (n=228) were used (serial isolates from the same patient and same MDR or non-MDR designation were removed) (Table 1).


Isolate collection and population structure

Between October 2019 and September 2020, a set of 2075 E. coli were collected from all 1776 patients who received care within the National Capitol Region healthcare network (located on the East coast of the USA) (Additional file 1: Table S1). While obtained from 21 facilities, the majority (59%) of the isolates originated from a single, large tertiary care hospital that also served as the central microbiology hub for the remaining 20 facilities. This sampling represents >99% of all E. coli cultured from clinical specimens at the central microbiology laboratory during this 1-year period. Isolates were primarily obtained from urine (93%), followed by bloodstream infections (2%), wound infections (2%), and perirectal swabs (1%). A small number of isolates were cultured from fluid (.07%), tissue (.04%), and respiratory (.01%) cultures (Additional file 1: Table S1).

WGS and cgMLST analysis revealed a diverse population that resolved into 5 main E. coli phylogenetic groups (Fig. 1A, Additional file 1: Table S1), with B2 and D the most represented (71% and 13%, respectively). Molecular typing by in silico MLST indicated the population was composed of 247 STs with 53% belonging to 6 known, globally prevalent STs. These include the epidemic lineages ST-131 (n = 275), ST-73 (n = 224), ST-95 (n = 215), ST-127 (n = 138), and the emerging ST-1193 (n = 112) all within phylogroup B2 [49]. Epidemic lineage ST-69 (n = 142) was the sole exception, belonging to phylogroup D. Notably, 133 (53%) STs were each found in isolate(s) from single patients and only 52 of 1776 patients carried strains with multiple STs (Additional file 1: Table S1).

Fig. 1
figure 1

Population structure of a complete collection of E. coli clinical isolates for a 1-year period at a US hospital. A cgMLST-based minimum spanning tree of 2075 E. coli isolates. Isolates belonging to the main phylogenetic groups observed in this study are circled and labeled. The dominant STs are shaded in light gray and the proportion of MDR (red) and non-MDR (gray) isolates within specific STs is indicated by pie charts. B Pie charts indicate the prevalence of MDR (red) primary isolates (27%) was similar to the prevalence of MDR isolates in primary isolates predicted to be part of clusters of transmission (35%) (Table 1)

Diversity of antibiotic susceptibility profiles

Comprehensive AST was performed on all isolates using 18 antibiotics from 11 different classes (Fig. 2, Additional file 2: Table S2). For an accurate determination of the prevalence of resistances in this E. coli population, removal of serial isolates (same ST per patient) resulted in a collection of 1828 primary isolates (Table 1). From these, the highest prevalence of non-susceptibility was to ampicillin (41%), followed by tetracycline (23%), trimethoprim/sulfamethoxazole (20%), and fluoroquinolones (15% to ciprofloxacin). In contrast, all isolates were susceptible to amikacin, 5% of E. coli were non-susceptible to third- and fourth-generation cephalosporins and <1% (n = 14) showed non-susceptibility to a carbapenem. Of the latter, 6 were resistant to imipenem only (MIC = 2), 5 were resistant to ertapenem only (MIC > 0.5 ml/l), 3 were resistant to ertapenem and imipenem or meropenem, and none carried a carbapenemase (Fig. 2, Table 1 and Additional file 2: Table S2).

Fig. 2
figure 2

Comprehensive phenotypic antibiotic susceptibility testing of all E. coli isolates to 18 antibiotics from 11 classes tested in this study. Breakpoints were interpreted using CLSI guidelines and S (susceptible), I (intermediate), and R (resistant) classifications are labeled for each antibiotic/isolate: red, yellow, and gray, respectively. Interpretations are mapped onto the MST from Fig. 1. AMK amikacin, GEN gentamicin, TOB tobramycin, AMP ampicillin, AMC amoxicillin-clavulanic acid, TZP piperacillin-tazobactam, CFZ cefazolin, FEP cefepime, CAZ ceftazidime, CRO ceftriaxone, ETP ertapenem, IPM imipenem, MEM meropenem, ATM aztreonam, CIP ciprofloxacin, LVX levofloxacin, SXT trimethoprim-sulfamethoxazole, TET tetracycline

Distinct lineages of E. coli were enriched for phenotypic resistance to various classes of antibiotics: (i) 50% of ST-12 were non-susceptible to amoxicillin/clavulanate (vs. 13% across all isolates, p < 0.001 by Fisher exact test), (ii) ST-131 accounted for 59% of isolates non-susceptible to gentamicin and tobramycin (vs. 6% for all, p < 0.001), and (iii) ST-131 and ST-1193 alone represented 72% of all isolates with resistance to the fluoroquinolones (Fig. 2, Additional file 2: Table S2).

Overall, 27% of the primary isolates were classified as multidrug resistant (MDR) as defined by Magiorkas et al. (i.e., non-susceptible to at least one agent in ≥3 antibiotic categories) [50] (Fig. 1A and Additional file 2: Table S2), though the prevalence of MDR varied significantly among the distinct, most frequent E. coli lineages. For example, while lineages ST-131, ST-1193, and ST-69 were significantly enriched in MDR isolates (64%, 55%, and 43%, respectively, with p-values < 0.01 by Fisher exact test), lineages ST-73, ST-127, and ST-95 largely comprised non-MDR isolates (21%, 13%, and 7%, respectively, with p-values < 0.03) (Fig. 1A).

Genomic characterization of ESBL-carrying E. coli

During the study period, 123 ESBL-producing E. coli were identified from 90 unique patients and all were classified as MDR (Additional file 3: Table S3). Interestingly, 22% were cultured from non-urinary sites, a significant divergence from the overall population (7%, p<0.05). Phylogenomic analysis of all ESBL-E. coli isolates indicated ESBL producers were diverse and belonged to 26 STs, including prevalent lineages [ST-131 (from 36 patients), ST-1193 (7 patients), ST-69 (5 patients)], less common lineages in our dataset [ST-38 (11 patients), ST-10 (4 patients)], and rarer ESBL-carrying lineages [ST-44 [51], ST-256, and ST-636 [52] each represented by 2 patients each]. As a result, an overrepresentation of ST-131 and ST-1193, which have fluoroquinolone resistance rates of 52% and 100%, respectively, 74% of ESBL-producers were non-susceptible to fluoroquinolones (compared to 17% overall, p < 0.01) (Fig. 3). The most represented ESBL genes were blaCTX-M-15 (59%) and blaCTX-M-27 (22%). Furthermore, blaCTX-M-14 was carried by 14% of the isolates including eight ST-38 isolates from 5 patients without an identified plasmid replicon. While carriage on a plasmid with an unknown replicon cannot be ruled out, chromosomal carriage of blaCTX-M-14 has previously been described for strains of this ST collected from Mongolian birds [53]. blaCTX-M-55 was observed in 3 isolates and blaCTX-M-24 and blaTEM-19 were observed once in distinct lineages (ST-354 and ST-131, respectively) (Fig. 3). Notably, the first description of ST-1193 harboring a blaCTX-M-64 allele was observed in a singular isolate (Fig. 3). Nine plasmid replicon types regularly associated with ESBL carriage [54, 55] were identified with varying prevalence, from ≥10 to 76% (Fig. 3).

Fig. 3
figure 3

Core genome phylogeny of all ESBL-carrying E. coli isolates in our dataset (n=123). Patient numbers are listed to identify serial isolates. Isolation source and phylogroups are color coded, indicated by the corresponding legends. Fluoroquinolone (FLQ) (ciprofloxacin and/or levofloxacin) non-susceptibility is indicated by a closed orange square, the presences of unique ESBL alleles are shown in a closed blue square, and plasmid replicon families identified with prevalence ≥10% are indicated by a gray closed square. Two novel ESBL-producing STs were characterized: ST-12869 and ST-12736

Outbreak detection reveals the role of non-MDR E. coli in nosocomial transmission

Prediction of possible clusters of transmission was performed in two steps: cgMLST followed by SNP analysis (Table 1). This filtering stringently confirmed 95 clusters (from 105 identified by cgMLST) comprising 258 isolates from 227 patients (Table 1). A total of 26 STs were represented and 61% of the clusters (58 out of 95) were caused by a non-MDR clone (Fig. 4A, B and Additional file 1: Table S1). At the isolate level, the proportion of primary MDR (35%) isolates from potential outbreaks clusters was slightly increased compared to primary non-cluster isolates (27%, p = 0.007 by Fisher exact test) while the proportion of ESBL producers remained comparable (6%) (Fig. 1B) (Table 1). At the lineage level, the largest number of outbreak clusters involved ST-131 (with 8/14 clusters caused by a MDR clone) and ST-73 (with 9/15 clusters caused by a non-MDR clone) (Fig. 4B).

Fig. 4
figure 4

Potential clusters of transmission were defined as groups of isolates from ≥2 patients with ≤17 SNP differences. A Stacked histograms showing the number of MDR and non-MDR cluster isolates according to their ST and B the number of distinct outbreak clusters per ST. Clusters grouping either MDR or non-MDR isolates are shown in red and gray, respectively. Hybrid clusters (i.e., grouping both MDR and non-MDR isolates) are shown in yellow. STs associated with a single cluster were grouped into others and represent ST-244, ST-394, ST-62, ST-372, ST-404, ST-421, ST-428, ST-538, ST-607, ST-1431, ST-1597, and ST-7887. C Analysis of outbreak clusters (identified on the y-axis with the corresponding ST) involving ≥3 patients (n = 24) and ordered temporally. The legend describes novel patients (filled circles), serial isolates (open circles), MDR isolates (red-filled circles), non-MDR (black-filled circles), and outbreaks consisting of all ESBL-E. coli isolates (dashed red line)

While the majority of clusters (78 out of 95) were composed of only two patients (an amount of transmission that routine surveillance cannot influence), the remaining outbreaks involved 3 to 12 patients (Fig. 4C). Temporally, these clusters extended up to 11 months, and lineage ST-131 was once again the most represented, with 5 distinct outbreak clones including two (clusters e and m) that were ESBL-producers (Fig. 4C). The largest predicted outbreak involved 12 patients (cluster n) and was caused by a ST-73 clone that was largely non-MDR and cultured primarily from urine (Fig. 4C). The only exception was MDR isolate 836616 which was distinct by only 12 SNPs from non-MDR isolate 822264 from another patient in this cluster and uniquely acquired resistance genes blaTEM-1, sul2, aph(3)-lb, and aph(6)-ld (Additional file 3: Table S3).

Convergence of resistance and virulence determinants in ST-131 E. coli

Considering the role played by ST-131 in both outbreak and sporadic infections, a detailed genetic analysis of the resistance and virulence genes found in these US isolates was performed. A maximum-likelihood core SNP-based phylogeny of all E. coli ST-131 genomes (n=275) in our dataset resulted in the 3 dominant ST-131 clades: clade A (n = 59, 21%), clade B (n = 29, 11%), and clade C (n = 181, 66%) (Fig. 5). Ninety-three percent of clade A isolates carried fimH41, 84% of clade B carried fimH22, all subclade B0 isolates carried fimH27, and 95% of clade C isolates carried the fimH30 variant. Of note, 19 isolates had non-typeable fimH alleles or a divergent allele designation (Additional file 1: Table S1).

Fig. 5
figure 5

Core genome SNP-based phylogeny of all ST-131 E. coli isolates in our dataset (n=275). Labels for clades A, B, B0, C0, C2, and C1 are indicated and are colored purple, green, light green, light blue, blue, and dark blue, respectively. Metadata are represented as rings from inner to outer: variations in the fimH gene, presence of point mutations in gyrA and parC (filled yellow square), fluoroquinolone (ciprofloxacin and/or levofloxacin) non-susceptibility (closed orange square), presence of ESBL gene (closed blue square), and multidrug-resistant isolate (red closed square). Non-typeable fimH alleles due to truncation or missing gene were grouped with other rare variants identified (Additional file 1: Table S1)

Th predominant clade C isolates were further classified into subclades C0 (n = 38), C1 (n = 101), and C2 (n = 42) (Fig. 5). Unlike clade A, B, and C0 isolates, which were largely (94%) fluoroquinolone susceptible, 100% of clades C1 and C2 isolates carried double gyrA and parC mutations associated with high-level resistance (Fig. 5) [56]. Furthermore, clade C2 was enriched for ESBL-producing isolates (69% of C2 isolates were ESBL compared to only 9% in other clades) and all carried blaCTX-M-15. Interestingly, 74% of clade C0 isolates were characterized as MDR (Fig. 5) despite being susceptible to fluoroquinolone and cephalosporin antibiotics. This was largely due to a higher prevalence of resistance to aminoglycosides (68% vs. 30% in all), folate pathway inhibitors (68% vs. 38%), and tetracycline (61% vs. 35%) in comparison to other clades (Additional file 2: Table S2).

In addition to the enrichment of resistance genes, isolates in lineage ST-131 frequently (>80% and p < 0.01) carried virulence-associated genes previously identified and associated with ExPEC E. coli [57, 58], including the aerobactin locus (iucC and iutA otherwise found in ~34% of all E. coli), a secreted autotransporter toxin (sat, 25% in the whole population), and an IrgA-like adhesin (iha, 26% in other E. coli) (Additional file 4: Table S4).

Accumulation of virulence genes in non-MDR ST-73 lineage

Together with ST-131, isolates belonging to ST-73 played a prominent role in both sporadic infections and possible cases of nosocomial transmission. However, unlike ST-131, no enrichment of antimicrobial resistance determinants was observed within this lineage, and ST-73 isolates remained largely susceptible to aminoglycosides, cephalosporins, and fluoroquinolones (Fig. 2, Additional file 2: Table S2). In contrast, ST-73 isolates contained significantly (p < 0.01) more virulence-associated genes (between 239 and 314) than ST-131 E. coli (between 201 and 309) [13] (Additional file 4: Table S4). Specifically, ST-73 isolates were enriched (>70% vs. <25% in the population as a whole) in uropathogenicity-associated virulence factors involved in invasion and colonization (pic, hek), cell lysis (hlyA), and adhesion and penetration (foc/sfa and cnf) [13, 57, 59] (Additional file 4: Table S4). When compared to the ST-131 population, ST-73 isolates were significantly enriched in a distinct set of virulence genes most likely contributing to the epidemiological success of the lineage (Additional file 4: Table S4).


A significant strength of this study lies in the >99% collection of all E. coli isolates from clinical samples over a recent (2019–2020) 12-month period in a network of US military healthcare facilities. Together with comprehensive AST and WGS, this dataset offered a unique opportunity to describe (i) the continued success and emergence of high-risk ExPEC and UPEC lineages, (ii) the regional prevalence of phenotypic resistances (and associated, acquired antibiotic resistance determinants), and (iii) the respective burden of ESBL, MDR, and non-MDR clones in infections of likely nosocomial origin.

Unlike the UK [60, 61], Canada [18], and other regions of the world [62, 63], recent genomic surveillance data on circulating E. coli lineages and resistances in the USA is limited. At a global scale, our analysis of this set of US isolates is consistent with previous epidemiological studies demonstrating the predominance (>50% of cases) of ST-69, 73, 95, 127, and 131 pandemic ExPEC lineages [16, 17]. E. coli is the world’s leading cause of UTIs, and this is reflected in our collection, where 93% of isolates were from urine samples. The distribution of major lineages observed globally and here also mirrors genomic epidemiology studies of community-acquired (CA)-UTIs across Canada (2012–2015) and from UPEC isolates collected at a Northern California university in 1999–2000 and again in 2016–2017 [18, 19]. However, in contrast to these studies, our collection revealed the emergence of ST-1193 fluoroquinolone-resistant E. coli as one of the most prevalent lineages currently circulating in this region of the USA.

Over the past 20 years in the USA, fluoroquinolones have replaced trimethoprim-sulfamethoxazole as the treatment of choice for uncomplicated UTIs [64]. In our collection, fluoroquinolone non-susceptible isolates largely belonged to only two lineages, ST-131 and ST-1193 (72% between both lineages). For ST-131, numerous studies have described the rapid, global emergence and dominance of subclones with acquired fluoroquinolone resistance mutations (subclade C1/H30-R) and a high prevalence of ESBL enzymes (C2/H30-Rx) [16, 17]. In this study, we show that the prevalence of C1 and C2 in the USA (both as an aggregate [52% of ST-131 isolates] and separately with 37% and 15%, respectively) is comparable to estimates from a recent report of a longitudinal collection of E. coli (albeit of bloodstream isolates) from the last two decades in Norway [62]. Interestingly, these are also similar to earlier US estimates [collection of 261 isolates from 2010 to 2012 [16]] suggesting the ST-131 population structure has remained relatively stable over the last decade and the overall prevalence of this lineage appears to have plateaued. In contrast, lineage ST-1193, which is the only other known clone driving the spread of fluoroquinolone-resistant E. coli globally [65,66,67,68], appears to be surging. For example, though the first worldwide cases of ST-1193 only appeared in 2011 [68, 69], a recent US-based multicenter surveillance study of 6349 clinical E. coli showed that the fraction of fluoroquinolone-resistant ST-1193 increased from 18 to 25% between 2016 and 2017 [67]. In our study of isolates from 2019 to 2020, the fraction was 31%, suggesting the rapid rise of ST-1193 is still ongoing. At the molecular level, all ST-1193 in this collection carried three characteristic, non-synonymous mutations resulting in high-level fluoroquinolone resistance; ParC (S80I) and GyrA (D87N and S83L) acquired via homologous recombination from a single transfer event at the origins of that lineage [70]. In addition, a fourth substitution in ParE (L416F) previously described in ST-1193 lineage was found in all isolates [65].

In our collection, 5% of primary isolates were ESBL-producers and, of particular concern for treatment regiments, a subset of 74% were co-resistant to the fluoroquinolones. These rates were comparable to the prevalence of resistances observed in a large (>1.5 million isolates), multicenter study of community-onset UTI in the USA over the last decade (6.4% ESBL-producers and 21% fluoroquinolone non-susceptible) [71]. In contrast, another nationwide US study focused on HAIs during a similar timeframe reported substantially higher rates of resistance to fluoroquinolone (35%) and extended-spectrum cephalosporins (17%) [72]. Globally, the rate of ESBL-E. coli varies considerably from >40% in regions such as South America, Southeast Asia, India, and China to ~5 to 20% in Europe, Australia, Canada, and the USA [73]. Furthermore, prevalent lineages carrying ESBLs also vary globally (i.e., ST-648 and ST-410 are underrepresented in our study yet are the most prevalent lineages circulating in intensive care units in Vietnam [74]). Importantly, ST-1193 was the third most frequent source of ESBL-producers in our collection, with 8% (n = 7) carrying one of the variously represented alleles (blaCTX-M-15, blaCTX-M-27, blaCTX-M-55, and first report of blaCTX-M-64 carriage), suggesting multiple introductions. In contrast, ESBL-producers composed 69% of isolates within subclade C2 of ST-131 lineage and all carried the same blaCTX-M-15, most likely harbored on an IncF-type plasmid as previously characterized [75]. Finally, while other countries including France [76], Japan [77], and Germany [78] have seen an increase in the recently defined subclade C1-M27 ST-131 [77] clinical isolates carrying blaCTX-M-27, we see a low prevalence of subclade C1 blaCTX-M-27 carrying isolates in this study.

While surveillance and infection control efforts are often and understandably (i.e., increased morbidity, mortality, and financial costs) focused on ESBL and MDR E.coli lineages, the global burden of colonization/infection with non-MDR strains (e.g., global lineages ST-73, ST-95, and ST-127) remains invariably higher [5, 62]. In fact, in this cohort of 1776 patients, 3 out of 4 individuals were diagnosed with a non-MDR isolate (representing a diversity of E. coli lineages, most of which have yet to be explored). When focusing on patients where in-depth comparative genomics suggested nosocomial origin was likely (n = 227), a slight increase in the fraction of MDR cases is observed, but the majority (2 out of 3 patients) were still due to a non-MDR clone. In fact, ExPEC pandemic lineage ST-73 was largely comprised of non-MDR isolates and was both one of the most frequent sources of potential clusters of transmission and responsible for the largest predicted outbreak involving 12 patients. While the possibility of transmission happening outside the hospital (e.g., shared long-term facilities or elderly care home) cannot be excluded, these findings highlight the importance of surveilling E. coli isolates with diverse susceptibility profiles as investigations that focus on MDR only are likely to underestimate ongoing outbreaks in the patient population.

To our knowledge, just a single study has performed similar genome-based detection (albeit using a different methodology) of nosocomial transmission on a complete collection of clinical E. coli isolates [9]. That study examined stool samples from 97 inpatients over a 6-month period at a single UK hospital. Similar to our findings, the two largest clusters identified spanned the entirety of the study period and were caused by the nosocomial spread of non-MDR isolates. Interestingly, these clones were identified as ST-7095 (7 patients, 29 isolates) and ST-635 (4 patients, 18 isolates) [9], two lineages comprised within phylogroup A that were not detected in our sampling. Whether the epidemic success of these non-MDR lineages simply stems from their overall abundance or could result from the acquisition of virulence/colonization factors (as observed here for ST-73) remains to be fully characterized.


By capturing all clinical isolates for a full year, this study provides a rare and contemporary survey of the genomic landscape of MDR and non-MDR E. coli lineages in a large healthcare network in the Northeast US. While pandemic ST-131 and expanding ST-1193 lineages (both characterized by high rates of co-resistance to fluoroquinolones and extended-spectrum cephalosporins) warrant particular surveillance, our findings also indicate that non-MDR lineages play a significant role in nosocomial transmission. With WGS developing as a near-routine technology in infection control, such improved understanding of the epidemiology of hospital-acquired pathogens is critical for maximum effectiveness at reducing infections and healthcare-associated costs.

Availability of data and materials

Both genomic assemblies and raw sequencing data of all isolates analyzed in this study are publicly available in the NCBI database under the BioProject number PRJNA809394 [79].



Extended-spectrum β-lactamase


Multidrug resistant


Multilocus sequence typing


Sequence type


Core genome multilocus sequence typing


National Center for Biotechnology Information


  1. Vihta KD, Stoesser N, Llewelyn MJ, Quan TP, Davies T, Fawcett NJ, et al. Trends over time in Escherichia coli bloodstream infections, urinary tract infections, and antibiotic susceptibilities in Oxfordshire, UK, 1998–2016: a study of electronic health records. Lancet Infect Dis. 2018;18(10):1138–49.

    Article  Google Scholar 

  2. Flores-Mireles AL, Walker JN, Caparon M, Hultgren SJ. Urinary tract infections: epidemiology, mechanisms of infection and treatment options. Nat Rev Microbiol. 2015;13(5):269–84.

    Article  CAS  Google Scholar 

  3. Critchley IA, Cotroneo N, Pucci MJ, Mendes R. The burden of antimicrobial resistance among urinary tract isolates of Escherichia coli in the United States in 2017. PLoS One. 2019;14(12):e0220265.

    Article  CAS  Google Scholar 

  4. Zhen X, Lundborg CS, Sun X, Hu X, Dong H. Economic burden of antibiotic resistance in ESKAPE organisms: a systematic review. Antimicrob Resist Infect Control. 2019;8:137.

    Article  Google Scholar 

  5. Naylor NR, Pouwels KB, Hope R, Green N, Henderson KL, Knight GM, et al. The health and cost burden of antibiotic resistant and susceptible Escherichia coli bacteraemia in the English hospital setting: a national retrospective cohort study. PLoS One. 2019;14(9):e0221944.

    Article  CAS  Google Scholar 

  6. Sader HS, Castanheira M, Flamm RK, Jones RN. Antimicrobial activities of ceftazidime-avibactam and comparator agents against Gram-negative organisms isolated from patients with urinary tract infections in U.S. medical centers, 2012 to 2014. Antimicrob Agents Chemother. 2016;60(7):4355–60.

    Article  CAS  Google Scholar 

  7. Karlowsky JA, Kelly LJ, Thornsberry C, Jones ME, Sahm DF. Trends in antimicrobial resistance among urinary tract infection isolates of Escherichia coli from female outpatients in the United States. Antimicrob Agents Chemother. 2002;46(8):2540–5.

    Article  CAS  Google Scholar 

  8. Sawatwong P, Sapchookul P, Whistler T, Gregory CJ, Sangwichian O, Makprasert S, et al. High burden of extended-spectrum β-lactamase–producing Escherichia coli and Klebsiella pneumoniae bacteremia in older adults: a seven-year study in two rural Thai provinces. Am J Trop Med Hyg. 2019;100(4):943–51.

    Article  CAS  Google Scholar 

  9. Ludden C, Coll F, Gouliouris T, Restif O, Blane B, Blackwell GA, et al. Defining nosocomial transmission of Escherichia coli and antimicrobial resistance genes: a genomic surveillance study. Lancet. Microbe. 2021;0(0). 2021;2(9):e472-80.

  10. Raphael E, Glymour MM, Chambers HF. Trends in prevalence of extended-spectrum beta-lactamase-producing Escherichia coli isolated from patients with community- and healthcare-associated bacteriuria: results from 2014 to 2020 in an urban safety-net healthcare system. Antimicrob Resist Infect Control. 2021;10(1):118.

  11. Centers for Disease Control and Prevention (U.S.). Antibiotic resistance threats in the United States, 2019 [Internet]. Centers for Disease Control and Prevention (U.S.); 2019. Available from: [Cited 2021 May 24].

  12. Denamur E, Clermont O, Bonacorsi S, Gordon D. The population genetics of pathogenic Escherichia coli. Nat Rev Microbiol. 2021;19(1):37-45.

  13. Biggel M, Xavier BB, Johnson JR, Nielsen KL, Frimodt-Møller N, Matheeussen V, et al. Horizontally acquired papGII-containing pathogenicity islands underlie the emergence of invasive uropathogenic Escherichia coli lineages. Nat Commun. 2020;11(1):5968.

    Article  CAS  Google Scholar 

  14. Kaper JB, Nataro JP, Mobley HLT. Pathogenic Escherichia coli. Nat Rev Microbiol. 2004;2(2):123–40.

    Article  CAS  Google Scholar 

  15. Pitout JDD, DeVinney R. Escherichia coli ST131: a multidrug-resistant clone primed for global domination. F1000Research. 2017;6:F1000 Faculty Rev-195.

    Article  Google Scholar 

  16. Price LB, Johnson JR, Aziz M, Clabots C, Johnston B, Tchesnokova V, et al. The epidemic of extended-spectrum-β-lactamase-producing Escherichia coli ST131 is driven by a single highly pathogenic subclone, H30-Rx. mBio. 2013;4(6): e00377-13.

  17. Johnson JR, Tchesnokova V, Johnston B, Clabots C, Roberts PL, Billig M, et al. Abrupt emergence of a single dominant multidrug-resistant strain of Escherichia coli. J Infect Dis. 2013;207(6):919–28.

    Article  CAS  Google Scholar 

  18. Fibke CD, Croxen MA, Geum HM, Glass M, Wong E, Avery BP, et al. Genomic epidemiology of major extraintestinal pathogenic Escherichia coli lineages causing urinary tract infections in young women across Canada. Open Forum. Infect Dis Ther. 2019;6(11):ofz431.

    Google Scholar 

  19. Yamaji R, Rubin J, Thys E, Friedman CR, Riley LW. Persistent pandemic lineages of uropathogenic Escherichia coli in a college community from 1999 to 2017. Diekema DJ, editor. J Clin Microbiol. 2018;56(4):e01834–17.

    Article  CAS  Google Scholar 

  20. Kallonen T, Brodrick HJ, Harris SR, Corander J, Brown NM, Martin V, et al. Systematic longitudinal survey of invasive Escherichia coli in England demonstrates a stable population structure only transiently disturbed by the emergence of ST131. Genome Res. 2017;27(8):1437–49.

    Article  CAS  Google Scholar 

  21. Duval A, Obadia T, Boëlle PY, Fleury E, Herrmann JL, Guillemot D, et al. Close proximity interactions support transmission of ESBL-K. pneumoniae but not ESBL-E. coli in healthcare settings. PLoS Comput Biol. 2019;15(5):e1006496.

    Article  CAS  Google Scholar 

  22. CLSI. Methods for Dilution Antimicrobial Susceptibility Tests for Bacteria That Grow Aerobically. 11th ed. CLSI standard M07. Wayne, PA: Clinical and Laboratory Standards Institute; 2018.

  23. Galac MR, Snesrud E, Lebreton F, Stam J, Julius M, Ong AC, et al. A diverse panel of clinical Acinetobacter baumannii for research and development. Antimicrob Agents Chemother. 2020;64(10):e00840–20 /aac/64/10/AAC.00840-20.atom.

    Article  CAS  Google Scholar 

  24. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.

    Article  CAS  Google Scholar 

  25. Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20(1):257.

    Article  CAS  Google Scholar 

  26. Waters NR, Abram F, Brennan F, Holmes A, Pritchard L. Easy phylotyping of Escherichia coli via the EzClermont web app and command-line tool. Access Microbiol. 2020;2(9):acmi000143.

  27. Clermont O, Christenson JK, Denamur E, Gordon DM. The Clermont Escherichia coli phylo-typing method revisited: improvement of specificity and detection of new phylo-groups. Environ Microbiol Rep. 2013;5(1):58–65.

    Article  CAS  Google Scholar 

  28. Seeman T. mlst - scan contig files against traditional PubMLST typing schemes. Available from: [Accessed 2020 Jan 4].

  29. Jolley KA, Bray J, Maiden MC. Open-access bacterial population genomics: BIGSdb software, the website and their applications. Wellcome Open Res. 2018;(3):124 Accessed 4 January 2020.

  30. Zhou Z, Alikhan NF, Mohamed K, Fan Y, the Agama Study Group, Achtman M. The EnteroBase user’s guide, with case studies on Salmonella transmissions, Yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res. 2020;30(1):138–52.

    Article  CAS  Google Scholar 

  31. Quijada NM, Rodríguez-Lázaro D, Eiros JM, Hernández M. TORMES: an automated pipeline for whole bacterial genome analysis. Bioinformatics. 2019;35(21):4207–12.

    Article  CAS  Google Scholar 

  32. Joensen KG, Tetzschner AMM, Iguchi A, Aarestrup FM, Scheutz F. Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data. J Clin Microbiol. 2015;53(8):2410–26.

    Article  CAS  Google Scholar 

  33. Roer L, Tchesnokova V, Allesøe R, Muradova M, Chattopadhyay S, Ahrenfeldt J, et al. Development of a web tool for Escherichia coli subtyping based on fimH alleles. J Clin Microbiol. 2017;55(8):2538–43.

  34. Feldgarden M, Brover V, Haft DH, Prasad AB, Slotta DJ, Tolstoy I, et al. Validating the AMRFinder tool and resistance gene database by using antimicrobial resistance genotype-phenotype correlations in a collection of isolates. Antimicrob Agents Chemother. 2019;63(11):e00483–19.

    Article  CAS  Google Scholar 

  35. Hunt M, Mather AE, Sánchez-Busó L, Page AJ, Parkhill J, Keane JA, et al. ARIBA: rapid antimicrobial resistance genotyping directly from sequencing reads. Microb. Genomics. 2017;3(10):e000131.

  36. Carattoli A, Zankari E, García-Fernández A, Voldby Larsen M, Lund O, Villa L, et al. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother. 2014;58(7):3895–903.

    Article  Google Scholar 

  37. Chen L, Zheng D, Liu B, Yang J, Jin Q. VFDB 2016: hierarchical and refined dataset for big data analysis—10 years on. Nucleic Acids Res. 2016;44(D1):D694–7.

    Article  CAS  Google Scholar 

  38. Biggel, M. EcVGDB. Zenodo [Internet]. 2020; Available from:

  39. Seemann T. ABRicate - mass screening of contigs for antimicrobial resistance or virulence genes. Available from: [Accessed 2022 Feb 23].

  40. Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31(22):3691–3.

    Article  CAS  Google Scholar 

  41. Seeman T. snippy - rapid haploid variant calling and core genome alignment. Available from: [Accessed 2019 Sep 25].

  42. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9(11):e112963.

    Article  Google Scholar 

  43. Croucher NJ, Page AJ, Connor TR, Delaney AJ, Keane JA, Bentley SD, et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res. 2015;43(3):e15.

    Article  Google Scholar 

  44. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.

    Article  CAS  Google Scholar 

  45. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–9.

    Article  CAS  Google Scholar 

  46. Stoesser N, Sheppard AE, Pankhurst L, De Maio N, Moore CE, Sebra R, et al. Evolutionary history of the global emergence of the Escherichia coli epidemic clone ST131. mBio. 2016;7(2):e02162.

  47. Jünemann S, Sedlazeck FJ, Prior K, Albersmeier A, John U, Kalinowski J, et al. Updating benchtop sequencing performance comparison. Nat Biotechnol. 2013;31(4):294–6.

    Article  Google Scholar 

  48. Muloi DM, Wee BA, McClean DMH, Ward MJ, Pankhurst L, Phan H, et al. Population genomics of Escherichia coli in livestock-keeping households across a rapidly developing urban landscape. Nat Microbiol. 2022;7(4):581–9.

    Article  CAS  Google Scholar 

  49. Johnson JR, Johnston BD, Porter SB, Clabots C, Bender TL, Thuras P, et al. Rapid emergence, subsidence, and molecular detection of Escherichia coli sequence type 1193-fimH64, a new disseminated multidrug-resistant commensal and extraintestinal pathogen. J Clin Microbiol. 2019;57(5):e01664–18.

    Article  CAS  Google Scholar 

  50. Magiorakos AP, Srinivasan A, Carey RB, Carmeli Y, Falagas ME, Giske CG, et al. Multidrug-resistant, extensively drug-resistant and pandrug-resistant bacteria: an international expert proposal for interim standard definitions for acquired resistance. Clin Microbiol Infect. 2012;18(3):268–81.

    Article  CAS  Google Scholar 

  51. Mshana SE, Falgenhauer L, Mirambo MM, Mushi MF, Moremi N, Julius R, et al. Predictors of blaCTX-M-15 in varieties of Escherichia coli genotypes from humans in community settings in Mwanza, Tanzania. BMC Infect Dis. 2016;16(1):187.

    Article  Google Scholar 

  52. Day MJ, Hopkins KL, Wareham DW, Toleman MA, Elviss N, Randall L, et al. Extended-spectrum β-lactamase-producing Escherichia coli in human-derived and foodchain-derived samples from England, Wales, and Scotland: an epidemiological surveillance and typing study. Lancet Infect Dis. 2019;19(12):1325–35.

    Article  CAS  Google Scholar 

  53. Guenther S, Semmler T, Stubbe A, Stubbe M, Wieler LH, Schaufler K. Chromosomally encoded ESBL genes in Escherichia coli of ST38 from Mongolian wild birds. J Antimicrob Chemother. 2017;72(5):1310–3.

    Article  CAS  Google Scholar 

  54. Kondratyeva K, Salmon-Divon M, Navon-Venezia S. Meta-analysis of pandemic Escherichia coli ST131 plasmidome proves restricted plasmid-clade associations. Sci Rep. 2020;10(1):36.

    Article  CAS  Google Scholar 

  55. Xia L, Liu Y, Xia S, Kudinha T, Xiao SN, Zhong NS, et al. Prevalence of ST1193 clone and IncI1/ST16 plasmid in E-coli isolates carrying blaCTX-M-55 gene from urinary tract infections patients in China. Sci Rep. 2017;7(1):44866.

    Article  CAS  Google Scholar 

  56. Ruiz J. Mechanisms of resistance to quinolones: target alterations, decreased accumulation and DNA gyrase protection. J Antimicrob Chemother. 2003;51(5):1109–17.

    Article  CAS  Google Scholar 

  57. Sarowska J, Futoma-Koloch B, Jama-Kmiecik A, Frej-Madrzak M, Ksiazczyk M, Bugla-Ploskonska G, et al. Virulence factors, prevalence and potential transmission of extraintestinal pathogenic Escherichia coli isolated from different sources: recent reports. Gut Pathog. 2019;11(1):10.

    Article  Google Scholar 

  58. Ben Zakour NL, Alsheikh-Hussain AS, Ashcroft MM, Khanh Nhu NT, Roberts LW, Stanton-Cook M, et al. Sequential acquisition of virulence and fluoroquinolone resistance has shaped the evolution of Escherichia coli ST131. mBio. 2016;7(2):e00347–16.

    Article  CAS  Google Scholar 

  59. Jahandeh N, Ranjbar R, Behzadi P, Behzadi E. Uropathogenic Escherichia coli virulence genes: invaluable approaches for designing DNA microarray probes. Cent Eur. J Urol. 2015;68(4):452-458.

  60. Brodrick HJ, Raven KE, Kallonen T, Jamrozy D, Blane B, Brown NM, et al. Longitudinal genomic surveillance of multidrug-resistant Escherichia coli carriage in a long-term care facility in the United Kingdom. Genome Med. 2017;9(1):70.

    Article  Google Scholar 

  61. Lipworth S, Vihta KD, Chau KK, Kavanagh J, Davies T, George S, et al. Ten years of population-level genomic Escherichia coli and Klebsiella pneumoniae serotype surveillance informs vaccine development for invasive infections. Clin Infect Dis. 2021;73(12):2276–82.

    Article  CAS  Google Scholar 

  62. Gladstone RA, McNally A, Pöntinen AK, Tonkin-Hill G, Lees JA, Skytén K, et al. Emergence and dissemination of antimicrobial resistance in Escherichia coli causing bloodstream infections in Norway in 2002–17: a nationwide, longitudinal, microbial population genomic study. Lancet Microbe. 2021;2(7):e331–41.

    Article  CAS  Google Scholar 

  63. Mahazu S, Sato W, Ayibieke A, Prah I, Hayashi T, Suzuki T, et al. Insights and genetic features of extended-spectrum beta-lactamase producing Escherichia coli isolates from two hospitals in Ghana. Sci Rep. 2022;12(1):1843.

    Article  CAS  Google Scholar 

  64. Kallen AJ, Welch HG, Sirovich BE. Current antibiotic therapy for isolated urinary tract infections in women. Arch Intern Med. 2006;166(6):635.

    Article  Google Scholar 

  65. Johnson TJ, Elnekave E, Miller EA, Munoz-Aguayo J, Flores Figueroa C, Johnston B, et al. Phylogenomic analysis of extraintestinal pathogenic Escherichia coli sequence type 1193, an emerging multidrug-resistant clonal group. Antimicrob Agents Chemother. 2019;63(1):e01913–8.

    Article  CAS  Google Scholar 

  66. Valenza G, Werner M, Eisenberger D, Nickel S, Lehner-Reindl V, Höller C, et al. First report of the new emerging global clone ST1193 among clinical isolates of extended-spectrum β-lactamase (ESBL)-producing Escherichia coli from Germany. J Glob Antimicrob Resist. 2019;17:305–8.

    Article  Google Scholar 

  67. Tchesnokova VL, Rechkina E, Larson L, Ferrier K, Weaver JL, Schroeder DW, et al. Rapid and extensive expansion in the United States of a new multidrug-resistant Escherichia coli clonal group, sequence type 1193. Clin Infect Dis. 2019;68(2):334–7.

    Article  CAS  Google Scholar 

  68. Wu J, Lan F, Lu Y, He Q, Li B. Molecular characteristics of ST1193 clone among phylogenetic group B2 non-ST131 fluoroquinolone-resistant Escherichia coli. Front Microbiol. 2017;8:2294.

    Article  Google Scholar 

  69. Nguyen Q, Nguyen TTN, Pham P, Chau V, Nguyen LPH, Nguyen TD, et al. Genomic insights into the circulation of pandemic fluoroquinolone-resistant extra-intestinal pathogenic Escherichia coli ST1193 in Vietnam. Microb. Genomics. 2021;7(12):000733.

  70. Tchesnokova V, Radey M, Chattopadhyay S, Larson L, Weaver JL, Kisiela D, et al. Pandemic fluoroquinolone resistant Escherichia coli clone ST1193 emerged via simultaneous homologous recombinations in 11 gene loci. Proc Natl Acad Sci. 2019;116(29):14740–8.

    Article  CAS  Google Scholar 

  71. Kaye KS, Gupta V, Mulgirigama A, Joshi AV, Scangarella-Oman NE, Yu K, et al. Antimicrobial resistance trends in urine Escherichia coli isolates from adult and adolescent females in the United States from 2011 to 2019: rising ESBL strains and impact on patient management. Clin Infect Dis. 2021;73(11):1992–9.

    Article  CAS  Google Scholar 

  72. Kourtis AP, Sheriff EA, Weiner-Lastinger LM, Elmore K, Preston LE, Dudeck M, et al. Antibiotic multidrug resistance of Escherichia coli causing device- and procedure-related infections in the United States reported to the National Healthcare Safety Network, 2013–2017. Clin Infect Dis. 2021;73(11):e4552-59.

  73. Murray CJ, Ikuta KS, Sharara F, Swetschinski L, Robles Aguilar G, Gray A, et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet. 2022;399(10325):629–55.

    Article  CAS  Google Scholar 

  74. Roberts LW, Hoi LT, Khokhar FA, Hoa NT, Giang TV, Bui C, et al. Genomic characterisation of multidrug-resistant Escherichia coli, Klebsiella pneumoniae, and Acinetobacter baumannii in two intensive care units in Hanoi, Viet Nam: a prospective observational cohort study. Lancet Microbe. 2022;3(11):e857-e866.

  75. Bevan ER, Jones AM, Hawkey PM. Global epidemiology of CTX-M β-lactamases: temporal and geographical shifts in genotype. J Antimicrob Chemother. 2017;72(8):2145–55.

    Article  CAS  Google Scholar 

  76. Birgy A, Bidet P, Levy C, Sobral E, Cohen R, Bonacorsi S. CTX-M-27-producing Escherichia coli of sequence type 131 and clade C1-M27, France. Emerg. Infect Dis. 2017; 23(5):885.

  77. Matsumura Y, Pitout JDD, Gomi R, Matsuda T, Noguchi T, Yamamoto M, et al. Global Escherichia coli sequence type 131 clade with blaCTX-M-27 gene. Emerg Infect Dis. 2016;22(11):1900–7.

    Article  CAS  Google Scholar 

  78. Ghosh H, Doijad S, Falgenhauer L, Fritzenwanker M, Imirzalioglu C, Chakraborty T. blaCTX-M-27 –encoding Escherichia coli sequence type 131 lineage C1-M27 clone in clinical isolates. Germany Emerg Infect Dis. 2017;23(10):1754–6.

    Article  CAS  Google Scholar 

  79. Mills EG, Martin MJ, et al. A one-year genomic investigation of Escherichia coli epidemiology and nosocomial spread at a large U.S. healthcare network. Natl Cent Biotechnol Inf. 2022; Available from:

Download references


The authors are thankful to all the staff of the MRSN and the clinical microbiology laboratory of the WRNMMC. The manuscript has been reviewed by the Walter Reed Army Institute of Research and there is no objection to its presentation. The opinions or assertions contained herein are the private views of the authors and are not to be construed as official or reflecting the views of the Department of the Army or the Department of Defense.


This study was funded by the Armed Forces Health Surveillance Division (AFHSD), Global Emerging Infections Surveillance (GEIS) Branch ProMIS ID P001_20_WR (to P. Mc Gann).

Author information

Authors and Affiliations



P.T.M. designed the research; P.T.M. E.G.M., M.J.M., T.L.L., R.M., B.W.C., C.H., A.C.O, L.N.P, J.A.R-M, G.G.B, S.B.P., and Y.I.K performed the research. E.G.M., M.J.M., J.W.B, P.T.M., and F.L. analyzed the research; E.G.M., M.J.M., and F.L. wrote the paper with input from all authors. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Patrick T. Mc Gann or Francois Lebreton.

Ethics declarations

Ethics approval and consent to participate

The isolates and clinical information were collected as part of the public health surveillance activities of the MRSN, as determined by the WRAIR branch director and Human Subjects Protection Branch (HSPB) which granted ethical approval. Informed patient consent was waived as samples were taken under a hospital surveillance framework for routine sampling. The research conformed to the principles of the Helsinki Declaration.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Basic Isolate Metadata.

Additional file 2: Table S2.

Antibiotic Susceptibility Testing.

Additional file 3: Table S3.

AMR Genetic Characteristics.

Additional file 4: Table S4.

Virulence Associated Genes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Mills, E.G., Martin, M.J., Luo, T.L. et al. A one-year genomic investigation of Escherichia coli epidemiology and nosocomial spread at a large US healthcare network. Genome Med 14, 147 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Escherichia coli
  • Genomic epidemiology
  • ST-131
  • Antibiotic resistance
  • Nosocomial