Skip to main content

The identification of novel immunogenic antigens as potential Shigella vaccine components



Shigella is a major diarrheal pathogen for which there is presently no vaccine. Whole genome sequencing provides the ability to predict and derive novel antigens for use as vaccines. Here, we aimed to identify novel immunogenic Shigella antigens that could serve as Shigella vaccine candidates, either alone, or when conjugated to Shigella O-antigen.


Using a reverse vaccinology approach, where genomic analysis informed the Shigella immunome via an antigen microarray, we aimed to identify novel immunogenic Shigella antigens. A core genome analysis of Shigella species, pathogenic and non-pathogenic Escherichia coli, led to the selection of 234 predicted immunogenic Shigella antigens. These antigens were expressed and probed with acute and convalescent serum from microbiologically confirmed Shigella infections.


Several Shigella antigens displayed IgG and IgA seroconversion, with no difference in sero-reactivity across by sex or age. IgG sero-reactivity to key Shigella antigens was observed at birth, indicating transplacental antibody transfer. Six antigens (FepA, EmrK, FhuA, MdtA, NlpB, and CjrA) were identified in in vivo testing as capable of producing binding IgG and complement-mediated bactericidal antibody.


These findings provide six novel immunogenic Shigella proteins that could serve as candidate vaccine antigens, species-specific carrier proteins, or targeted adjuvants.


Shigella is the causative agent of shigellosis, a severe acute gastrointestinal infection that frequently presents as bloody diarrhea, fever, and severe abdominal pain [1]. In 2016, Shigella was estimated to cause > 250 million cases and > 200,000 deaths globally [2]. Higher income countries experience Shigella infections among travelers, aging populations, deployed military personnel, and men who have sex with men (MSM) [2, 3]. However, the preponderance of the Shigella disease burden is in children aged under 5 years residing in low-middle income countries (LMICs). Infection in this vulnerable group can also result in significant long-term consequences such as severe stunting and impaired cognitive development [2, 4]. The global epidemiology of Shigella is worsened by the emergence and spread of multi- and extensively drug resistant (MDR and XDR) variants, making infections increasingly difficult to treat [5]. The principal method of Shigella prevention has been improvements in water, sanitation, and hygiene (WASH) [6]. However, due to the low infectious dose, the standard of WASH required to break transmission is difficult to attain in many LMICs [7]. Furthermore, recent application of molecular techniques to identify Shigella infections found a severe underestimation of the global Shigella burden [8, 9], highlighting the need for new low-cost prevention techniques.

There is currently no licensed vaccine against Shigella [10]. However, studies in animals and controlled human infection models (CHIMs) have shown that protection through immunization is feasible [11, 12]. Natural disease epidemiology in humans and non-human primate infection studies show complete protection from re-infection with a homologous Shigella species. Long-term homologous protection has been attributed to serotype-specific systemic (serum IgG) and mucosal (IgA) antibody responses [12, 13]. The most immunodominant target of the Shigella IgG and IgA response is the O-antigen component of lipopolysaccharide (LPS) [14, 15]. LPS/O-antigen-specific antibodies elicit protection through antibody-mediated opsonization, phagocytosis, and intracellular cytotoxicity [13]. However, antibodies against Shigella O-antigen are highly specific for the infecting species only [11, 13], and do not provide protection against heterologous Shigella species. Since the Shigella genus consists of four species and > 50 serotypes, a lack of cross-protection against heterologous species and serotypes poses a major challenge for vaccine development [16]. This challenge may be overcome by a vaccine that elicits either a broadly reactive immune response or numerous species-specific responses against the globally dominant Shigella species (i.e., S. flexneri and S. sonnei) [17].

The primary strategy of developing an efficacious Shigella vaccine has been to elicit antibody responses targeting Shigella O-antigen [10]. Live-attenuated vaccines with genetically attenuated Shigella [18,19,20], or the expression of Shigella O-antigen on live-attenuated vectors [21], have been shown to induce good antibody responses against O-antigen. Multivalent killed-vaccines also induce high titers of serum IgG and mucosal IgA targeting Shigella O-antigens and have shown protection in early clinical development [22, 23]. Recombinant forms of Shigella O-antigen have also been pursued as vaccine candidates [10], with a Shigella O-antigen conjugated to a carrier protein [24,25,26], which engages T cell help and produces a longer lasting antibody response to the polysaccharide antigen [27]. Various immunogenic proteins, such as the toxins from other pathogens, have been used as carrier proteins [28, 29]. However, protein antigens from Shigella have not been evaluated as a carrier protein for Shigella O-antigen.

Whole genome sequencing provided the ability to predict and derive novel antigens for use as vaccines, and this approach ultimately gave rise to the meningococcal B vaccine [30,31,32]. Further technological advances in immunology and protein engineering to study the interaction of pathogens with the immune system can aid in reverse engineering of protective immunogens [33,34,35]. Here, we aimed to identify novel immunogenic Shigella antigens that could serve as Shigella vaccine candidates, either alone, or when conjugated to Shigella O-antigen. Therefore, we conducted immunogen prediction using bioinformatic analysis, then created a protein microarray of predicted immunogenic Shigella antigens. These expressed antigens were screened for immunogenicity using polyclonal antibodies from patients who recovered from confirmed Shigella infections, to identify a novel set of proteins which may facilitate the development of novel Shigella vaccines.



Human serum samples for the purposes of this investigation were collected from an observational study of children with diarrheal disease and a cohort study of healthy infants, both conducted in Ho Chi Minh City (HCMC), Vietnam [36, 37]. Both studies were approved by the institutional review boards of collaborating institutions HCMC and the Oxford Tropical Research Ethics Committee (OxTREC No. 1045-13) in the UK. Written informed consent by a legal guardian was a prerequisite for enrolment into the studies.

Serum samples

Paired acute-convalescent serum samples were collected as a component of a prospective, observational, multicenter, cross-sectional study conducted in HCMC, Vietnam. The clinical and microbiological data from this study has been published previously [37]. For this investigation, we utilized acute serum samples collected from microbiologically confirmed Shigella (n = 33) or Salmonella (n = 24) cases when they first presented at hospital with acute diarrheal disease (patients bled prior to diagnostic testing) (Table 1). Convalescent (follow-up) serum samples collected at a follow-up visit, 4 weeks (± 1 week) after being enrolled in the study. Umbilical cord blood was collected from a large prospective birth cohort study, where healthy pregnant mothers who visited Hung Vuong Obstetrics Hospital in HCMC, Vietnam, were recruited prior to birth and cord blood sampled after delivery [36, 38]. Serum extracted from maternal blood during pregnancy has previously been subjected to ELISA to measure S. sonnei O-antigen IgG [39]; the cord blood samples screened here were from mothers with high (n = 45) and low (n = 40) S. sonnei O-antigen IgG titers (Table 1).

Table 1 Summary of genomes compared, serum samples tested, Shigella antigens studied, and in vivo immunogenicity testing conducted in present “reverse vaccinology 2.0” study

Bioinformatic analysis

The complete chromosomal sequences of 10 Shigella and 47 Escherichia coli (E. coli) were retrieved from GenBank (accessed in July 2014 using an in-house script described in Supplementary file 1). The incorporated Shigella sequences included 2 S. boydii, 2 S. dysenteriae, 4 S. flexneri, and 2 S. sonnei. The collection of E. coli genomes used included both pathogenic (n = 32) and non-pathogenic variants (n = 15). A complete list of E. coli and Shigella genome sequences utilized in the present study is indicated in Additional File 2 Table S1. The CMG-Biotools (Comparative Microbial Genomics) workbench was used to identify the core genome of all sequences [40]. Protein sequences were extracted based on published annotations for all genomes. To identify conserved and unique regions of the genomes, we performed pairwise reciprocal BLASTP on all extracted protein sequences. Sequences were clustered into one orthologous group using the criteria of alignment of at least 50% similarity matches and alignment length of at least 50% of the longest sequence in the comparison.

By comparing the genomes, several protein subsets of interest were considered: (1) sequences present in all S. sonnei and absent from all E. coli, (2) sequences present in all S. flexneri and absent from all E. coli, and (3) sequences present in all S. sonnei and S. flexneri and absent from all E. coli. The output identified from each subset was manually curated by performing a BLASTN search of their sequences against the NCBI database. Since Shigella is phylogenetically nested within the E. coli species, they show a very low level of divergence in chromosomal genetic makeup. Hence, the number of proteins that fulfilled the above criteria (i.e., in groups 1 to 3) was limited (all are shown in Additional file 3 Table S2) and was not sufficient to develop downstream immunogenic assays.

We additionally included a subset of potentially immunogenic proteins present in both S. sonnei and S. flexneri genomes, notwithstanding their presence in the examined E. coli genomes. In brief, the annotation and protein sequences of each orthologous group were retrieved from the input Shigella genomes (using an in-house script described in Supplementary File 2). Sequences associated with mobile genetic elements (IS elements, transposases, and prophages, excluding pathogenicity islands) were manually checked and removed. Other proteins predicted to not be targets of antibody, by annotation of cellular function and location, were further excluded. These include proteins in toxin-antitoxin systems, bacterial conjugation, plasmid inheritance, genome replication, transcription or protein expression, cellular metabolism, and other cytoplasmic proteins of unspecified function. The retaining subset mostly consists of predicted outer membrane, secreted, periplasmic, and cell wall proteins (Additional file 5 Table S3). Unannotated (hypothetical) proteins were subjected to characterization in Pfam [41], the transmembrane domain prediction server TMHMM [42], and the signal peptide prediction server SignalP4.1 [43] (using default Gram-negative prokaryote settings). Proteins which may show potential immunogenicity (i.e., due to their location on the outer membrane, cell wall, possessing transmembrane domain(s), or possessing a signal peptide) were retained. Unannotated proteins identified by Pfam as bacteriophage-related were discarded.

Protein microarray

Proteins selected through the bioinformatics pipeline are shown in Additional file 5 Table S3. These targets were expressed, printed, and probed as described previously for other protein microarray projects [44,45,46]. Briefly, the corresponding coding sequences from selected Shigella proteins were amplified, cloned into a pXT7 vector, and expressed using a high-throughput in vitro transcription/translation (IVTT) E. coli system (BiotechRabbit, GmbH). Controls lacking DNA were included to account for background reactivity with E. coli, where IVTT was conducted without plasmid DNA. Expressed Shigella antigens from IVTT reactions were printed onto nitrocellulose-coated glass GraceBio slides using an Omni Grid 100 microarray printer (Genomic Solutions). LPS from Shigella (Sigma) was also printed on the microarray slides to act as positive control. Slides (with E. coli lysate (McLab) at a final concentration of 1 mg/ml) were probed with human serum (diluted 1:200), followed by biotin-conjugated secondary antibodies specific for human IgM, IgG, and IgA (Jackson ImmunoResearch). Binding antibody was detected using streptavidin-conjugated SureLightH P-3 (Columbia Biosciences), measured using Perkin Elmer ScanArray Express HT microarray scanner. Spot intensities were quantified using the ScanArray software.

Data analysis

Fold-over-control (FOC) normalizations were conducted to reduce assay to assay variation by dividing the mean spot intensities for each antigen by the intensity for the no DNA control IVTT. Positive serum reactivity to an antigen was defined as a FOC > 2 (i.e., > 2-fold increase in the mean intensity over the background control). Log2-transformed FOC values from paired acute and convalescent samples were compared using a Bayes regularized t test adapted from Cyber-T for protein arrays [47,48,49]. p values were subjected to Benjamini and Hochberg (BH) correction to control for false discovery rate [50]. Data were graphed using the R statistical software ( and packages “Superheat,” “ggplot2,” “rgl,” and “fmsb.”

Protein immunization

His-tagged variants of selected proteins (NmpC, FepA, HtrB, EmrK, NlpB, FhuA, CjrA, and MdtA) (Additional file 6 Table S4) were successfully expressed in a BacPowerTM E. coli protein expression system and purified using nickel affinity chromatography (GenScript Limited, Hong Kong). Four months old, male New Zealand rabbits (n = 2 per protein) were immunized with 0.2 mg of the successfully expressed and purified protein, and serum drawn at 1-week post-immunization of the third dose. The immunogenicity of each protein was assayed by testing the pre-immune and post-immune rabbit sera for sero-positivity using indirect enzyme-linked immunosorbent assay (ELISA) and immunoblot. For ELISA, plates were coated with protein at 4 μg/ml, blocked, incubated with sera (at 1 mg/ml IgG concentration), and detected using anti-rabbit IgG Fc-HRP secondary antibody. For western blots, 50 ng/well of purified proteins was run on SDS-PAGE, transferred to nitrocellulose membrane, blocked, probed with pre-immune and post-immune rabbit sera, and detected with goat anti-rabbit IgG-IRDye800cw secondary antibody.

Serum bactericidal assay

Purified serum antibody from immunized rabbits was tested for serum bactericidal activity (SBA) against S. flexneri 2a (strain EG 0478), S. sonnei (strain DE 1404, containing a cat chloramphenicol resistance gene on the virulence plasmid), and S. Typhimurium (strain ATCC 14028) using a previously described SBA protocol [51, 52]. Heat-inactivated sera were serially diluted from 50 to 0.07 μg/ml, then combined with bacteria (250 CFU/well) and 5 μl of baby rabbit complement and incubated at 37 °C for 90 min. Viable bacterial cells were estimated at time 0 min (T0) and at 90 min post-incubation (T90) by plating on nutrient agar plates. Bactericidal activity was calculated as a ratio of CFU at T90 over T0, from which SBA titers were estimated at 50% bactericidal activity. Convalescent immune serum from a confirmed Shigella-infected patient was used as a positive control [38]. All serum samples were tested in triplicate and the SBA titers averaged. The SBA assays with S. sonnei DE 1404 were performed with and without the supplementation of 10 μg/ml chloramphenicol. S. sonnei has the propensity to lose the virulence plasmid and O-antigen culture during culture, and the addition of 10 μg/ml chloramphenicol was to ensure the maintenance of plasmid and O-antigen during the SBA assay via the added cat gene. These data were compared to assess potential killing differences between plasmid+ and plasmid− organisms.


Bioinformatic analysis identifies potential immunogenic Shigella core antigens

Genomic comparison of Shigella and E. coli core genomes was conducted with the aim of selecting both species-specific and species cross-reactive Shigella proteins common to the most globally dominant species, S. flexneri and S. sonnei. Protein sequences were extracted from the annotated chromosomes of various Shigella species (n = 10), pathogenic (n = 32) and non-pathogenic (n = 15) E. coli (Table 1 and Additional File 2 Table S1), and a list of potentially immunogenic antigens was selected using bioinformatic comparison, a list of potentially immunogenic antigens was selected (Table 2 and Additional File 5 Table S3). Our analysis was restricted to chromosomal proteins to identify novel immunogenic targets, as several proteins on Shigella virulence plasmids have already been extensively studied for their immunogenicity [53]. The analysis identified 22 S. flexneri-specific, 8 S. sonnei-specific, and 2 Shigella-specific proteins (IpaH3.1, IpaH4.5). Another 100 potentially immunogenic orthologs, from both S. sonnei and S. flexneri, were further included to expand the downstream immunogenic assays. Shigella LPS (O-antigen) was included as a positive control. This resulted in a total of 235 proteins that were expressed in vitro, and successfully printed on an antigen microarray for downstream analysis (Additional File 5 Table S3).

Table 2 Description of the protein subsets of interests studied during chromosomal genome comparison between S. sonnei, S. flexneri, and pathogenic and non-pathogenic E. coli

Antigen microarray reveals broad seroconversion following Shigella infections

The Shigella antigen microarray allowed us to assess the IgM, IgA, and IgG responses against the selected antigens following symptomatic Shigella infections. The antigen microarray was probed for sero-reactivity with pairs of acute and early convalescent (i.e., 3- to 4-week follow-up) sera from microbiologically confirmed Shigella-infected diarrheal patients (n = 34) (Table 1). The present study used samples from patients infected with the two current globally dominant Shigella species, S. flexneri (n = 2) and S. sonnei (n = 32).

Shigella infection led to widespread seroconversion in all measured antibody isotypes (IgG, IgA, and IgM) among individuals and across multiple tested antigens, as observed by the increase in sero-reactivity (i.e., measured as fold change over control greater than 2, FOC > 2) from acute to early convalescence (Fig. 1). IgG sero-reactivity (i.e., FOC > 2) analysis showed that at least one individual (3% of all patients) during acute and convalescent phase reacted to a maximum of 35 and 166 antigens, respectively (Fig. 1a). Additionally, 50% of the individuals (i.e., 17 patients) in acute and early convalescent phase sero-reacted (IgG) to 2 and 7 antigens, respectively (Fig. 1a).

Fig. 1
figure 1

Sero-reactivity of Shigella antigens following Shigella infections. a IgG, b IgA, and c IgM reactivity was assessed in gastrointestinal disease patients during acute Shigella infection and at follow-up. IgG, IgA, and IgM responses (Log2 FOC) were graphically represented as heat maps, where Shigella antigens were ordered from bottom to top by increasing average responses and Shigella-infected patients were ordered from left to right by increasing average responses

The IgA responses were markedly higher than the IgG responses, with at least one individual generating a detectable IgA response to 209 and 227 antigens during acute disease and convalescence, respectively (Fig. 1b). We also observed that > 50% of cases in acute and convalescence produced a reactive IgA response to 3 and 44 antigens, respectively. IgM against the antigens also markedly increased between acute Shigella infections and convalescence (with 214 to 230 reactive antigens detected in at least 1 patient and 9 to 38 reactive antigens in > 50% of patients) (Fig. 1c).

We additionally observed that antibody responses were on average higher in patients with inflammatory (i.e., bloody diarrhea) as compared to non-inflammatory (i.e., watery diarrhea) disease (as can be seen in the scatter plots comparing mean IgG, IgA, and IgM responses at early convalescence, Additional File 7 Fig. S1). As a control for the assay, the antigen array was probed with paired acute and convalescent serum (n = 24) from diarrheal patients infected with an alternative genus of diarrheal pathogen, Salmonella. Notably, there were no significant increases in IgG, IgA, or IgM responses between the acute and convalescence in Salmonella-infected diarrheal cases, indicating that the antibody reactivity observed with the serum from the Shigella-infected patients was specific to Shigella (Additional File 8 Fig. S2).

Shigella core antigen microarray identifies novel immunogenic antigens

As has been previously observed, the highest antibody responses at early convalescence following Shigella infections were against the Shigella LPS O-antigen. Additionally, we selected a subset of 12 immunogenic protein antigens using data generated by the antigen array microarray results; the criteria for this selection were the smallest p value (Benjamini-Hochberg corrected Cyber-T test) from comparison between acute and convalescent antibody responses, and the highest mean antibody responses (i.e., Log2(FOC) values) at early convalescence, etc. The twelve selected antigens were NmpC (SF_nmpC) and FepA (SF_fepA) from S. flexneri, and HtrB (SSON_htrB), EmrK (SSON_emrK), NlpB (SSON_nlpB), FhuA (SSON_fhuA), CjrA (SSON_cjrA), MdtA (SSON_mdtA), SbmA (SSON_sbmA), MviN (SSON_mviN), PldA (SSON_pldA), and 3803 (SSON_3803) from S. sonnei. Sero-reactivity between acute and early convalescence was compared using the Benjamini-Hochberg corrected Cyber-T test. For all 12 antigens, we observed a statistically significant (p < 0.05) increase in mean sero-reactivity between acute to early convalescence in all three antibody isotypes (Fig. 2). Furthermore, > 50% of the Shigella-infected cases at convalescence (or follow-up) had positive IgG, IgA, and IgM (i.e., FOC > 2) responses to all selected antigens, with the exception of SSON_mdtA and SSON_fhuA (Fig. 3).

Fig. 2
figure 2

Sero-reactivity of twelve highly reactive Shigella antigens. Microarrays with Shigella antigens were probed for IgG, IgA, and IgM responses (Log2-transformed fold-over-control (FOC)) with acute and convalescent (or follow-up) sera from gastrointestinal patients with laboratory-confirmed Shigella infections. Mean IgG (a), IgA (b), and IgM (c) responses were compared between acute and follow-up samples from Shigella-infected patients, with p values (represented in gray drop-down pin heads) calculated using the Benjamini-Hochberg corrected Cyber-T test. Error bars (black) represent 95% confidence interval around the mean. The dashed horizontal line (red) indicates the position where p value = 0.05

Fig. 3
figure 3

Reactivity of cord blood from mothers with high or low IgG titers to Shigella antigens. a, b IgG reactivity of cord blood to all tested Shigella antigens. In the heat maps, Shigella antigens were ordered from bottom to top by increasing average IgG reactivity and cord blood samples were ordered from left to right by increasing average IgG reactivity. c IgG reactivity to the top twelve highly reactive Shigella antigens was compared between cord blood from high and low IgG titer mothers using the Benjamini-Hochberg corrected Cyber-T test (p values represented in gray drop-down pin heads). Error bars (black) represent 95% confidence interval around the mean. The red dashed horizontal line is at p value = 0.05

Shigella poses the greatest health burden in children under the age of 2 years; therefore, an effective vaccine against Shigella needs to induce a protective immune response in young children. Hence, we assessed the antibody responses to the 12 selected antigens in children aged < 2 years (n = 8) in comparison to those aged > 2 years (n = 26). At early convalescence, there were no significant differences in both the mean antibody (IgG and IgA) sero-reactivity (Additional File 9 Fig. S3A and B) or the percentage of cases that showed positive antibody (IgG and IgA) responses (Fig. S3), when compared between age groups. These data demonstrate that younger children generate antibody responses to the 12 selected antigens. Additionally, we also compared sero-reactivity between male and females; all selected antigens elicited similar antibody responses (IgG and IgA) at early convalescence in both female (n = 20) and male (n = 14) patients with diarrhea (Fig. S3C and D).

Transplacentally transferred antibodies

A potential mechanism for protecting young children and neonates is the prenatal vaccination of pregnant mothers [54]. Prenatal vaccines require the mother to mount a protective IgG response that can be efficiently transferred transplacentally to the unborn fetus [54]. Therefore, we measured the transplacental transfer of IgG as an indirect assessment of whether the selected immunogenic antigens could serve as potential prenatal vaccine candidates. We compared the IgG sero-reactivity at birth (cord blood serum) between infants from mothers with high (n = 45) and low (n = 40) antibody titers (to Shigella O-antigen) (Fig. 3). In general, infants from mothers with high antibody titers demonstrated greater IgG sero-reactivity against tested Shigella antigens in comparison to infants born to mothers with low IgG titers (Fig. 3). Specifically, with the exception of antigen SSON_mdtA, IgG sero-reactivity for the remaining 11 selected antigens was significantly greater (p < 0.05) in infants born to mothers with high antibody titers, than those with low antibody titers (Fig. 3c). These data show that 11/12 selected Shigella antigens are capable of generating IgG antibodies that can successfully transfer across the placenta in a concentration-dependent manner and may serve as good candidates for prenatal Shigella vaccines.

Immunogenic testing of Shigella antigens

We then proceeded to conduct preclinical testing of the top sero-reactive antigens. A BacPowerTM E. coli protein expression system was used to express SF_nmpC, SF_fepA, and SSON_htrB, SSON_emrK, SSON_nlpB, SSON_fhuA, SSON_cjrA, SSON_mdtA, SSON_sbmA, SSON_mviN, SSON_pldA, and SSON_3803. We were unsuccessful in expressing SSON_sbmA, SSON_mviN, SSON_3803, and SSON_pldA using this system (Additional File 6 Table S4). Therefore, we proceeded to test the remaining eight antigens (i.e., SF_nmpC, SF_fepA, and SSON_htrB, SSON_emrK, SSON_nlpB, SSON_fhuA, SSON_cjrA, and SSON_mdtA) in rabbits to ensure they could induce an antibody response when immunized as recombinant antigens. Rabbits were immunized separately with the eight antigens, and the serum was screened to measure binding IgG antibodies. All eight of the purified antigens induced robust IgG responses in vivo, which was detectable by immunoblot and ELISA (Fig. 4). We next tested the immunized rabbit sera for the potential to induce bacterial killing in an antibody-dependent complement-mediated serum bactericidal assay. We measured the killing potential of the purified rabbit serum against S. flexneri, S. sonnei, and Salmonella Typhimurium to ensure that the serum bactericidal activity (SBA) induced by the immunized antigens was Shigella specific. The SBA assay was unable to detect bactericidal activity below 50 μg/ml against S. sonnei (Table 1 and Additional File 10 Table S5) whether grown with or without chloramphenicol, which was used to maintain the virulence plasmid and O-antigen cluster in S. Sonnei due to the plasmid containing a cat resistance gene. However, six of the eight Shigella immunogens (i.e., SF_fepA, SSON_cjrA, SSON_emrK, SSON_fhuA, SSON_mdtA, and SSON_nlpB) induced antibody responses with strong serum bactericidal activity against S. flexneri (i.e., 50% SBA titer < 50 μg/ml) (Table 1); the lowest 50% SBA titer was induced by the immunogen SSON_cjrA (Table 3). Our results show that the antigen SSON_cjrA was the most potent inducer of bactericidal antibodies against Shigella and the best antigen to take forward into future testing as a potential vaccine antigen.

Fig. 4
figure 4

Immunogenicity testing of selected Shigella proteins in vivo. Rabbits were immunized with top reactive Shigella proteins, fepA, nmpC, cjrA, emrK, fhuA, htrB, mdtA, and nlpB. Positive antibody responses to respective Shigella antigen or immunogen in the immunized rabbits were tested using a western blot and b ELISA

Table 3 Immunization with highly reactive Shigella proteins elicits protective antibody responses. Rabbits were immunized with 0.2 mg of Shigella proteins, and sera (at 1 week post-immunization) were tested in vitro for serum bactericidal activity (SBA) against Shigella flexneri, Shigella sonnei, and Shigella Typhimurium. SBA titers presented below are estimated at 50% bactericidal activity


In the current study, we exploited reverse vaccinology 2.0 to integrate both comparative genomics and human immuno-proteome analysis to identify novel immunogenic chromosomal Shigella proteins. Genomic comparison and bioinformatic analysis of 57 Shigella and E. coli genomes allowed us to narrow down to 235 predicted immunogenic antigens. The predicted antigens were then expressed and printed onto a microarray, probed with a panel of sera from Shigella-infected individuals, to narrow the selection to 12 highly sero-reactive antigens. We confirmed that antibody responses to these 12 antigens were similar across sex (i.e., males and females) and two age groups (i.e., < 2 years and > 2 years). Using cord blood samples, we additionally observed that IgG responses to 11 of these 12 antigens could be transmitted transplacentally, hence suggesting the possible application of these antigens as prenatal vaccine candidates. Among the 12 antigens, 8 were successfully expressed as recombinant proteins. Six of these antigens were both immunogenic in animal models and generated functionally protective antibody responses against Shigella. These were SF_fepA, SSON_cjrA, SSON_emrK, SSON_fhuA, SSON_mdtA, and SSON_nlpB, with SSON_cjrA being the most immunogenic in terms of eliciting antibody-mediated bactericidal responses. Such bactericidal effects against S. sonnei were not observed in vitro, probably due to the protection against complement-mediated killing afforded by its high molecular weight capsule [55]. However, the immunogenicity of these antigens suggests that during S. sonnei infections, the capsule can be modulated to expose these functional proteins in vivo. This highlights the complex pathogenesis of S. sonnei and the difficulty in developing a suitable vaccine candidate.

The biological functions of these six Shigella immunogenic proteins have been characterized previously. FepA and FhuA serve as outer membrane proteins which bind and transport siderophores (ferric enterobactin and ferrichrome, respectively) [56, 57]. CjrA shares substantial homology to Pseudomonas aeruginosa PhuW and potentially acts to sequester iron from heme, the most common iron source in mammals [58]. EmrK and MdtA are subunits of the multidrug efflux pump EmrKY and MdtABC, respectively [59, 60], which contribute to resistance to bile salt and antimicrobials in E. coli. In addition, EmrKY has been shown to confer Shigella survival in infected macrophages, facilitating its invasive pathogenesis in the human host [59]. NlpB forms part of the outer membrane protein (OMP) assembly complex, which assembles and inserts beta-barrel proteins into the outer membrane [61, 62]. The immunogenicity of these proteins points to their potentially high expression during Shigella infections, concurring with the survival strategies of pathogenic bacteria. Particularly, within-host iron is key to bacterial replication, and the ability to sequester and transport host iron is pivotal to the pathogenesis of Klebsiella pneumoniae [63] and Staphylococcus aureus [64].

None of the Shigella antigens (i.e., FepA, CjrA, EmrK, FhuA, MdtA, and NlpB) identified as immunogenic in the present study has been previously characterized for immunogenicity either in the context of natural Shigella infections or vaccination. Five of these antigens (i.e., FepA, CjrA, EmrK, FhuA, MdtA, and NlpB) are conserved in pathogenic E. coli, but they have not been tested for immunogenicity following pathogenic E. coli infection either. However, the presence of the genetic cluster cjrABC-senB has been previously linked to uropathogenic E. coli [65]. The immunogenicity of these six proteins may be predictable, since they are either surface exposed outer membrane or periplasmic proteins. Outer membrane particles of Shigella (Generalized Modules of Membrane Antigens—GMMA) have historically been thought to be highly virulent and immunomodulatory and are currently being developed as a vaccine immunogen [66]. GMMA-based Shigella vaccine, 1790GAHB, was shown to be immunogenic in human clinical trials [67]. Immunogenicity of GMMA vesicles has been attributed to the presence of LPS. However, proteomic analysis of the GMMA detected FepA and FhuA [68]. Based on our findings, it is plausible that other Shigella antigens, such as those identified in our current study, are may be partly responsible for the immunogenic properties of GMMA.

Development of a vaccine against Shigella faces many challenges, including the ability to protect against multiple Shigella species and to raise sustained mucosal immunity [10]. Fortunately, all the antigens, with the exception of CjrA, are conserved between S. flexneri and S. sonnei. Therefore, these antigens could be used to create a vaccine that protects against the two globally dominant Shigella species, which accounted for almost 90% of all Shigella cases in the Global Enteric Multicenter study (GEMS) [17]. Furthermore, in addition to strong IgG responses, all six antigens raised significant IgA responses, which is the dominant immunoglobulin at the mucosa. IgA-mediated protection has been explained by both preventing Shigella infection of host cells and downregulating inflammation and intestinal tissue pathology at infected sites [69, 70]. Although the current study measured IgA in serum, it has been previously shown that Shigella-specific serum IgA positively correlates with mucosal IgA in the stool [14]. Additionally, we observed that IgG specific to the six immunogenic Shigella antigens were capable of transplacental transfer, indicating that the antigens could additionally serve as prenatal vaccine candidates to protect neonates.


Shigella infections cause over a quarter of a billion gastrointestinal infection cases globally per annum [2]. Despite the high public health burden, there is currently no licensed vaccine available to prevent Shigella diseases. At present, LPS is a key antigen for the development of a vaccine against Shigella [10, 71]. Here, we identified six novel immunogenic Shigella proteins that could serve as additional vaccine candidates or could be conjugated to O-antigens to provide some cross-protection. Future Shigella challenge studies in animal models or human controlled infection models are needed to test the potency of these identified six antigens as vaccine candidates alone or as new generation glycoconjugates.

Availability of data and materials

All data are presented in the manuscript (in the supplementary information), and raw data from gene selection from comparative genomics, signals from antigen array, and serum bactericidal data are freely available at [72].



Men who have sex with men


Low-middle income countries

MDR and XDR:

Multi- and extensively drug resistant


Water, sanitation, and hygiene


Controlled human infection models




Ho Chi Minh City


In vitro transcription/translation




Benjamini and Hochberg correction


Enzyme-linked immunosorbent assay


Serum bactericidal activity


Outer membrane protein


Generalized Modules of Membrane Antigens


Global Enteric Multicenter study


  1. Kotloff KL, Riddle MS, Platts-Mills JA, Pavlinac P, Zaidi AKM. Shigellosis. Lancet. 2018;391(10122):801–12.

    Article  PubMed  Google Scholar 

  2. Khalil IA, Troeger C, Blacker BF, Rao PC, Brown A, Atherly DE, et al. Morbidity and mortality due to shigella and enterotoxigenic Escherichia coli diarrhoea: the Global Burden of Disease Study 1990-2016. Lancet Infect Dis. 2018;18(11):1229–40.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Olson S, Hall A, Riddle MS, Porter CK. Travelers’ diarrhea: update on the incidence, etiology and risk in military and similar populations - 1990-2005 versus 2005-2015, does a decade make a difference? Tropical Diseases Travel Med Vaccines. 2019;5:1.

    Article  Google Scholar 

  4. Anderson JD, Bagamian KH, Muhib F, Amaya MP, Laytner LA, Wierzba T, et al. Burden of enterotoxigenic Escherichia coli and shigella non-fatal diarrhoeal infections in 79 low-income and lower middle-income countries: a modelling analysis. Lancet Glob Health. 2019;7(3):e321–e30.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Baker S, Thomson N, Weill FX, Holt KE. Genomic insights into the emergence and spread of antimicrobial-resistant bacterial pathogens. Science. 2018;360(6390):733–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Mokomane M, Kasvosve I, de Melo E, Pernica JM, Goldfarb DM. The global problem of childhood diarrhoeal diseases: emerging strategies in prevention and management. Ther Adv Infect Dis. 2018;5(1):29–43.

    PubMed  Google Scholar 

  7. KOTHARY MH, BABU US. Infective dose of foodborne pathogens in volunteers: a review. J Food Saf. 2001;21(1):49–68.

    Article  Google Scholar 

  8. Lindsay B, Ochieng JB, Ikumapayi UN, Toure A, Ahmed D, Li S, et al. Quantitative PCR for detection of Shigella improves ascertainment of Shigella burden in children with moderate-to-severe diarrhea in low-income countries. J Clin Microbiol. 2013;51(6):1740–6.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Platts-Mills JA, Liu J, Rogawski ET, Kabir F, Lertsethtakarn P, Siguas M, et al. Use of quantitative molecular diagnostic methods to assess the aetiology, burden, and clinical characteristics of diarrhoea in children in low-resource settings: a reanalysis of the MAL-ED cohort study. Lancet Glob Health. 2018;6(12):e1309–e18.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Mani S, Wierzba T, Walker RI. Status of vaccine research and development for Shigella. Vaccine. 2016;34(26):2887–94.

    Article  CAS  PubMed  Google Scholar 

  11. Cohen D, Green MS, Block C, Rouach T, Ofek I. Serum antibodies to lipopolysaccharide and natural immunity to shigellosis in an Israeli military population. J Infect Dis. 1988;157(5):1068–71.

    Article  CAS  PubMed  Google Scholar 

  12. Cohen D, Green MS, Block C, Slepon R, Ofek I. Prospective study of the association between serum antibodies to lipopolysaccharide O antigen and the attack rate of shigellosis. J Clin Microbiol. 1991;29(2):386–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Cohen D, Meron-Sudai S, Bialik A, Asato V, Goren S, Ariel-Cohen O, et al. Serum IgG antibodies to Shigella lipopolysaccharide antigens - a correlate of protection against shigellosis. Hum Vaccin Immunother. 2019;15(6):1401–8.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Simon JK, Maciel M Jr, Weld ED, Wahid R, Pasetti MF, Picking WL, et al. Antigen-specific IgA B memory cell responses to Shigella antigens elicited in volunteers immunized with live attenuated Shigella flexneri 2a oral vaccine candidates. Clin Immunol. 2011;139(2):185–92.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Ndungo E, Randall A, Hazen TH, Kania DA, Trappl-Kimmons K, Liang X, et al. A novel Shigella proteome microarray discriminates targets of human antibody reactivity following oral vaccination and experimental challenge. mSphere. 2018;3(4):e00260-18.

  16. Levine MM, Kotloff KL, Barry EM, Pasetti MF, Sztein MB. Clinical trials of Shigella vaccines: two steps forward and one step back on a long, hard road. Nat Rev Microbiol. 2007;5(7):540–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Livio S, Strockbine NA, Panchalingam S, Tennant SM, Barry EM, Marohn ME, et al. Shigella isolates from the global enteric multicenter study inform vaccine development. Clin Infect Dis. 2014;59(7):933–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Venkatesan MM, Ranallo RT. Live-attenuated Shigella vaccines. Expert Rev Vaccines. 2006;5(5):669–86.

    Article  CAS  PubMed  Google Scholar 

  19. Orr N, Katz DE, Atsmon J, Radu P, Yavzori M, Halperin T, et al. Community-based safety, immunogenicity, and transmissibility study of the Shigella sonnei WRSS1 vaccine in Israeli volunteers. Infect Immun. 2005;73(12):8027–32.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Kotloff KL, Simon JK, Pasetti MF, Sztein MB, Wooden SL, Livio S, et al. Safety and immunogenicity of CVD 1208S, a live, oral DeltaguaBA Deltasen Deltaset Shigella flexneri 2a vaccine grown on animal-free media. Hum Vaccin. 2007;3(6):268–75.

    Article  CAS  PubMed  Google Scholar 

  21. Dharmasena MN, Hanisch BW, Wai TT, Kopecko DJ. Stable expression of Shigella sonnei form I O-polysaccharide genes recombineered into the chromosome of live Salmonella oral vaccine vector Ty21a. Int J Med Microbiol. 2013;303(3):105–13.

    Article  CAS  PubMed  Google Scholar 

  22. Kaminski RW, Wu M, Turbyfill KR, Clarkson K, Tai B, Bourgeois AL, et al. Development and preclinical evaluation of a trivalent, formalin-inactivated Shigella whole-cell vaccine. Clin Vaccine Immunol. 2014;21(3):366–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. McKenzie R, Walker RI, Nabors GS, Van De Verg LL, Carpenter C, Gomes G, et al. Safety and immunogenicity of an oral, inactivated, whole-cell vaccine for Shigella sonnei: preclinical studies and a phase I trial. Vaccine. 2006;24(18):3735–45.

    Article  CAS  PubMed  Google Scholar 

  24. Pozsgay V, Chu C, Pannell L, Wolfe J, Robbins JB, Schneerson R. Protein conjugates of synthetic saccharides elicit higher levels of serum IgG lipopolysaccharide antibodies in mice than do those of the O-specific polysaccharide from Shigella dysenteriae type 1. Proc Natl Acad Sci U S A. 1999;96(9):5194–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Passwell JH, Ashkenazi S, Banet-Levi Y, Ramon-Saraf R, Farzam N, Lerner-Geva L, et al. Age-related efficacy of Shigella O-specific polysaccharide conjugates in 1-4-year-old Israeli children. Vaccine. 2010;28(10):2231–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Farzam N, Ramon-Saraf R, Banet-Levi Y, Lerner-Geva L, Ashkenazi S, Kubler-Kielb J, et al. Vaccination with Shigella flexneri 2a conjugate induces type 2a and cross-reactive type 6 antibodies in humans but not in mice. Vaccine. 2017;35(37):4990–6.

    Article  CAS  PubMed  Google Scholar 

  27. Rappuoli R. Glycoconjugate vaccines: principles and mechanisms. Sci Transl Med. 2018;10(456):eaat4615.

  28. Passwell JH, Harlev E, Ashkenazi S, Chu C, Miron D, Ramon R, et al. Safety and immunogenicity of improved Shigella O-specific polysaccharide-protein conjugate vaccines in adults in Israel. Infect Immun. 2001;69(3):1351–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Robbins JB, Kubler-Kielb J, Vinogradov E, Mocca C, Pozsgay V, Shiloach J, et al. Synthesis, characterization, and immunogenicity in mice of Shigella sonnei O-specific oligosaccharide-core-protein conjugates. Proc Natl Acad Sci U S A. 2009;106(19):7974–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Serruto D, Bottomley MJ, Ram S, Giuliani MM, Rappuoli R. The new multicomponent vaccine against meningococcal serogroup B, 4CMenB: immunological, functional and structural characterization of the antigens. Vaccine. 2012;30(Suppl 2):B87–97.

    Article  CAS  PubMed  Google Scholar 

  31. O'Ryan M, Stoddard J, Toneatto D, Wassil J, Dull PM. A multi-component meningococcal serogroup B vaccine (4CMenB): the clinical development program. Drugs. 2014;74(1):15–30.

    Article  CAS  PubMed  Google Scholar 

  32. Sette A, Rappuoli R. Reverse vaccinology: developing vaccines in the era of genomics. Immunity. 2010;33(4):530–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Haynes BF, Kelsoe G, Harrison SC, Kepler TB. B-cell-lineage immunogen design in vaccine development with HIV-1 as a case study. Nat Biotechnol. 2012;30(5):423–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Dormitzer PR, Grandi G, Rappuoli R. Structural vaccinology starts to deliver. Nat Rev Microbiol. 2012;10(12):807–13.

    Article  CAS  PubMed  Google Scholar 

  35. Rappuoli R, Bottomley MJ, D'Oro U, Finco O, De Gregorio E. Reverse vaccinology 2.0: human immunology instructs vaccine antigen design. J Exp Med. 2016;213(4):469–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Anders KL, Thompson CN, Thuy NT, Nguyet NM, Tu le TP, Dung TT, et al. The epidemiology and aetiology of diarrhoeal disease in infancy in southern Vietnam: a birth cohort study. Int J Infect Dis 2015;35:3–10.

  37. Duong VT, Tuyen HT, Minh PV, Campbell J, Phuc HL, Nhu TDH, et al. No clinical benefit of empirical antimicrobial therapy for pediatric diarrhea in a high-usage, high-resistance setting. Clin Infect Dis. 2017; cix844.

  38. Thompson CN, Anders KL, Nhi le TQ, Tuyen HT, Van Minh P, Tu le TP, et al. A cohort study to define the age-specific incidence and risk factors of Shigella diarrhoeal infections in Vietnamese children: a study protocol BMC Public Health 2014;14:1289.

  39. Thompson CN, Le TP, Anders KL, Nguyen TH, Lu LV, Nguyen VV, et al. The transfer and decay of maternal antibody against Shigella sonnei in a longitudinal cohort of Vietnamese infants. Vaccine. 2016;34(6):783–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Vesth T, Lagesen K, Acar O, Ussery D. CMG-biotools, a free workbench for basic comparative microbial genomics. PLoS One. 2013;8(4):e60120.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, et al. The Pfam protein families database. Nucleic Acids Res. 2008;36(Database issue):D281–8.

    CAS  PubMed  Google Scholar 

  42. Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305(3):567–80.

    Article  CAS  PubMed  Google Scholar 

  43. Nielsen H. Predicting secretory proteins with SignalP. Methods Mol Biol. 2017;1611:59–73.

    Article  CAS  PubMed  Google Scholar 

  44. Davies DH, Liang X, Hernandez JE, Randall A, Hirst S, Mu Y, et al. Profiling the humoral immune response to infection by using proteome microarrays: high-throughput vaccine and diagnostic antigen discovery. Proc Natl Acad Sci U S A. 2005;102(3):547–52.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Chakraborty S, Randall A, Vickers TJ, Molina D, Harro CD, DeNearing B, et al. Human experimental challenge with enterotoxigenic Escherichia coli elicits immune responses to canonical and novel antigens relevant to vaccine development. J Infect Dis. 2018;218(9):1436–46.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Liang L, Juarez S, Nga TV, Dunstan S, Nakajima-Sasaki R, Davies DH, et al. Immune profiling with a Salmonella Typhi antigen microarray identifies new diagnostic biomarkers of human typhoid. Sci Rep. 2013;3:1043.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  47. Kayala MA, Baldi P. Cyber-T web server: differential analysis of high-throughput data. Nucleic Acids Res. 2012;40(Web Server issue):W553–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Baldi P, Hatfield GW. DNA microarrays and gene expression: from experiments to data analysis and modeling: Cambridge University Press; 2002.

    Book  Google Scholar 

  49. Baldi P, Brunak SR. Bioinformatics: the machine learning approach, 2nd Edition. Bach F, editor: MIT Press; 2001.

  50. Y. B, Y. H. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol 1995;57(1):289–300.

  51. Rondini S, Micoli F, Lanzilao L, Gavini M, Alfini R, Brandt C, et al. Design of glycoconjugate vaccines against invasive African Salmonella enterica serovar Typhimurium. Infect Immun. 2015;83(3):996–1007.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. de Alwis R, Tu LTP, Quynh NLT, Thompson CN, Anders KL, Van Thuy NT, et al. The role of maternally acquired antibody in providing protective immunity against nontyphoidal Salmonella in urban Vietnamese infants: a birth cohort study. J Infect Dis. 2019;219(2):295-304.

  53. Mattock E, Blocker AJ. How do the virulence factors of Shigella work together to cause disease? Front Cell Infect Microbiol. 2017;7:64.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  54. Fouda GG, Martinez DR, Swamy GK, Permar SR. The impact of IgG transplacental transfer on early life immunity. Immunohorizons. 2018;2(1):14–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Caboni M, Pedron T, Rossi O, Goulding D, Pickard D, Citiulo F, et al. An O antigen capsule modulates bacterial pathogenesis in Shigella sonnei. PLoS Pathog. 2015;11(3):e1004749.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  56. Wei Y, Murphy ER. Shigella iron acquisition systems and their regulation. Front Cell Infect Microbiol. 2016;6:18.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Braun V, Braun M. Iron transport and signaling in Escherichia coli. FEBS Lett. 2002;529(1):78–85.

    Article  CAS  PubMed  Google Scholar 

  58. Smajs D, Weinstock GM. The iron- and temperature-regulated cjrBC genes of Shigella and enteroinvasive Escherichia coli strains code for colicin Js uptake. J Bacteriol. 2001;183(13):3958–66.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Pasqua M, Grossi M, Scinicariello S, Aussel L, Barras F, Colonna B, et al. The MFS efflux pump EmrKY contributes to the survival of Shigella within macrophages. Sci Rep. 2019;9(1):2906.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  60. Nagakubo S, Nishino K, Hirata T, Yamaguchi A. The putative response regulator BaeR stimulates multidrug resistance of Escherichia coli via a novel multidrug exporter system. MdtABC J Bacteriol. 2002;184(15):4161–7.

    Article  CAS  PubMed  Google Scholar 

  61. Kuntumalla S, Zhang Q, Braisted JC, Fleischmann RD, Peterson SN, Donohue-Rolfe A, et al. In vivo versus in vitro protein abundance analysis of Shigella dysenteriae type 1 reveals changes in the expression of proteins involved in virulence, stress and energy metabolism. BMC Microbiol. 2011;11:147.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Wu T, Malinverni J, Ruiz N, Kim S, Silhavy TJ, Kahne D. Identification of a multicomponent complex required for outer membrane biogenesis in Escherichia coli. Cell. 2005;121(2):235–45.

    Article  CAS  PubMed  Google Scholar 

  63. Holt KE, Wertheim H, Zadoks RN, Baker S, Whitehouse CA, Dance D, et al. Genomic analysis of diversity, population structure, virulence, and antimicrobial resistance in Klebsiella pneumoniae, an urgent threat to public health. Proc Natl Acad Sci U S A. 2015;112(27):E3574–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Skaar EP, Humayun M, Bae T, DeBord KL, Schneewind O. Iron-source preference of Staphylococcus aureus infections. Science. 2004;305(5690):1626–8.

    Article  CAS  PubMed  Google Scholar 

  65. Mao BH, Chang YF, Scaria J, Chang CC, Chou LW, Tien N, et al. Identification of Escherichia coli genes associated with urinary tract infections. J Clin Microbiol. 2012;50(2):449–56.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Ellis TN, Kuehn MJ. Virulence and immunomodulatory roles of bacterial outer membrane vesicles. Microbiol Mol Biol Rev. 2010;74(1):81–94.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Launay O, Ndiaye AGW, Conti V, Loulergue P, Scire AS, Landre AM, et al. Booster vaccination with GVGH Shigella sonnei 1790GAHB GMMA vaccine compared to single vaccination in unvaccinated healthy European adults: results from a phase 1 clinical trial. Front Immunol. 2019;10:335.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Maggiore L, Yu L, Omasits U, Rossi O, Dougan G, Thomson NR, et al. Quantitative proteomic analysis of Shigella flexneri and Shigella sonnei Generalized Modules for Membrane Antigens (GMMA) reveals highly pure preparations. Int J Med Microbiol. 2016;306(2):99–108.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Boullier S, Tanguy M, Kadaoui KA, Caubet C, Sansonetti P, Corthesy B, et al. Secretory IgA-mediated neutralization of Shigella flexneri prevents intestinal tissue destruction by down-regulating inflammatory circuits. J Immunol. 2009;183(9):5879–85.

    Article  CAS  PubMed  Google Scholar 

  70. Mathias A, Longet S, Corthesy B. Agglutinating secretory IgA preserves intestinal epithelial cell integrity during apical infection by Shigella flexneri. Infect Immun. 2013;81(8):3027–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Barel LA, Mulard LA. Classical and novel strategies to develop a Shigella glycoconjugate vaccine: from concept to efficacy in human. Hum Vaccin Immunother. 2019;15(6):1338–56.

    Article  PubMed  PubMed Central  Google Scholar 

  72. de Alwis R, and Baker S. The identification of novel immunogenic antigens as potential Shigella vaccine components (version 1). Zenodo. (2020).

Download references


The authors would like to acknowledge and are grateful for all the patient participants, infants, and their families that made this study possible. We also acknowledge the study nurses at Hung Vuong Hospital where the participants were recruited. We thank Corinne N. Thompson whose Shigella O-antigen characterization data of maternal sera were utilized in this study.


This work was supported by a Wellcome senior research fellowship to Stephen Baker to (215515/Z/19/Z). DTP is funded as leadership fellow through the Oak Foundation. The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Author information

Authors and Affiliations



Conceptualization: SB. Formal analysis: RdA, LL, EW, HCT, and DHD. Provided samples: VTD. Methodology: RdA, LL, OT, EW, HCT, TNHT, VTD, DDH, and PLF. Writing original draft: RdA and SB. Review and editing: RdA, EW, HCT, DHD, and SB. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Stephen Baker.

Ethics declarations

Ethics approval and consent to participate

Human serum samples for the purposes of this investigation were collected from an observational study of children with diarrheal disease and a cohort study of healthy infants, both conducted in Ho Chi Minh City (HCMC), Vietnam. The observational study that collected acute and convalescent blood samples from pediatric diarrheal patients was approved by the institutional review boards (IRB) of three local hospitals in HCMC and the Oxford Tropical Research Ethics Committee (OxTREC No. 1045-13) in the UK. Written informed consent by a legal guardian was obtained prior to enrolment into this observational study. The pediatric cohort study from which the current investigation accessed umbilical cord blood was approved by the local IRB of four local hospitals in HCMC and OxTREC (No. 02-09). Written informed consent was obtained from pregnant mothers prior to enrolment into the cohort. This study conforms to the ethical guidelines of the Helsinki Declaration. All animal work for this study was conducted by GenScript Biotech Corporation under the auditing, monitoring, and approval of GenScript’s Institutional Animal Care and Use Committee (IACUC).

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplementary file 1.

In-house script to retrieve complete Shigella and Escherichia coli chromosomal sequences from GenBank (accessed July 2014).

Additional file 2: Table S1.

List of Shigella and E. coli strains used for the bioinformatics genome comparison.

Additional file 3: Table S2.

Identified protein within subsets of interests during chromosomal genome comparison between S. sonnei, S. flexneri, pathogenic and non-pathogenic E. coli.

Additional file 4: Supplementary file 2.

Script used to retrieve annotation and protein sequences from the input Shigella genomes.

Additional file 5: Table S3.

List of Shigella proteins and antigens that were successfully tested on the Immunoassay protein microarray.

Additional file 6: Table S4.

The recombinant expression of highly reactive Shigella proteins for in vivo immunogen testing.

Additional file 7: Figure S1.

No statistically significant seroconversion to Shigella antigens detected in diarrheal patients during Salmonella infections.

Additional file 8: Figure S2.

Antibody responses to natural Shigella infections are influenced by severity of disease.

Additional file 9: Figure S3.

Convalescent antibody responses to top reactive Shigella antigens by age and sex.

Additional file 10: Table S5.

Comparison of SBA titre from Rabbits immunized with Shigella candidate proteins using Shigella strain DE1404 grown with and without chloramphenicol (50mg/L).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

de Alwis, R., Liang, L., Taghavian, O. et al. The identification of novel immunogenic antigens as potential Shigella vaccine components. Genome Med 13, 8 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: