Abstract
Bacteriophages play crucial roles in microbial ecosystems and have potential for biotechnological applications. However, our understanding of culturable phages remains limited. In this study, we characterized six novel Escherichia coli phages isolated from pig farm wastewater and urban sewage, using comprehensive genomic, morphological, and host-range analysis. Using multiple comparative approaches, including gene-sharing network analysis, average nucleotide identity (ANI), and nucleotide intergenomic similarity (NIS), we demonstrated that five of these phages form a distinct group within the subfamily Stephanstirmvirinae, potentially representing a novel genus, for which we propose the name "Wecvirus”. We further propose that these phages, each of which exhibits a unique host range pattern, should be classified in two distinct species within the proposed genus. This host specificity is reflected in differences in the amino acid sequences of tail fibers, which are crucial for infection. The remaining phage, which was not classified as a wecvirus, exhibited characteristics that challenged the current classification criteria, highlighting the need for more-flexible taxonomic approaches. These findings expand our understanding of phage diversity within the subfamily Stephanstirmvirinae and contribute to the evolving phage taxonomy framework.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Bacteriophages (phages) are viruses that infect bacteria and are the most abundant biological entities on Earth [1]. They play vital roles in shaping microbial communities, driving bacterial evolution, and maintaining the ecological balance in various environments [2,3,4,5]. The diversity and abundance of phages significantly influence global biogeochemical cycles and hold promise for biotechnological applications, including phage therapy and biocontrol [6,7,8].
Phage classification is challenging because of the rapid evolution and extensive genetic diversity of phages [9]. The International Committee on Taxonomy of Viruses (ICTV) has continually updated its classification system for viruses, including phages, to accommodate new discoveries and genomic data. In 2022, the ICTV substantially revised the classification of members of the former order "Caudovirales", abandoning the morphology-based classification in favor of a system based on genetic relationships [10].
The subfamily Stephanstirmvirinae, a relatively new taxonomic group within the class Caudoviricetes, consists of two genera: Justusliebigvirus and Phapecoctavirus. Phages in this subfamily primarily infect gammaproteobacteria, including Escherichia coli, Klebsiella pneumoniae, and members of the Pseudomonadales [11,12,13,14,15,16,17]. Interestingly, Campylobacter jejuni, an epsilonproteobacterium with a distinct outer membrane structure, has also been reported to be a host for members of the Stephanstirmvirinae, suggesting that phages in this subfamily may possess diverse host recognition mechanisms [18]. These bacterial hosts are medically significant because of their rapid development of antimicrobial resistance, and phages infecting these bacteria are considered promising candidates for combating drug-resistant pathogens [6, 7, 19]. These phages have been isolated from diverse environments, including water samples (sewage and freshwater), feces (avian and human), and compost, suggesting their ability to infect bacterial strains from different sources [11,12,13, 18]. Phages belonging to the genera Justusliebigvirus and Phapecoctavirus have been reported to show lytic activity against drug-resistant bacteria, human pathogens, and plant pathogens, leading to ongoing research into their potential as novel antimicrobial agents [13,14,15, 17]. The diversity of host recognition mechanisms in these phages indicates their potential to adapt to different bacterial strains. This characteristic, combined with their demonstrated activity against various pathogenic bacteria, makes these phages promising candidates for therapeutic applications. Understanding these host-phage interactions is crucial for developing effective phage-based treatments against emerging drug-resistant bacteria.
As of August 29, 2024, eight publications on “Phapecoctavirus” and one on “Justusliebigvirus” have been indexed in PubMed [11,12,13,14,15,16,17,18, 20]. While these studies have established basic characteristics, such as double-stranded DNA genomes (~ 150 kbp) and GC content (~ 39%), many critical aspects remain unexplored. In particular, the mechanisms underlying host specificity, including the role of structural proteins, such as tail fibers, warrant further investigation to understand their evolutionary adaptations and therapeutic potential.
In this study, we present a comprehensive characterization of six phage isolates: four previously isolated from pig farm wastewater (ΦWec179, ΦWec181, ΦWec186, and ΦWec187) and two from urban sewage (ΦWec188 and ΦWec190) [21]. Our analysis included a phylogenetic analysis based on genome sequences, morphological features observed using transmission electron microscopy (TEM), and host range evaluation. This study expands our understanding of phage diversity within the subfamily Stephanstirmvirinae and contributes significantly to the broader field of phage taxonomy and evolution.
Materials and methods
Reagents and bacteria
Luria-Bertani medium was used for bacterial and phage cultures. The concentration of soft agar was 0.5%. The E. coli strain TK001 used in this study was isolated from the feces of a mouse with dextran sodium sulfate (DSS)-induced colitis and was used as the host strain for phage screening [21]. The animal experiments for generating mice with DSS-induced colitis used for the isolation of TK001 were reviewed and approved by the Waseda University Academic Research Ethics Committee (approval number 2020-A009).
Phage preparation
Four E. coli phages (ΦWec179, 181, 186, and 187) were isolated from swine farm effluent, and two phages (ΦWec188 and 190) were isolated from municipal sewage. These six phages were propagated using E. coli TK001 as the host, while phages T4 and T7 were propagated using MG1655 as the host, all using the plate lysate method. Briefly, an overnight bacterial culture (100 µL) and phage suspension (104 PFU/mL, 100 µL) were added to soft agar (0.5%) and overlaid onto agar plates. After overnight incubation (37°C), 5 mL of sodium chloride–magnesium sulfate (SM) buffer was added to the plates, followed by orbital shaking (approximately 250 rpm, 2–3 h). The supernatant was collected in 15-mL tubes and centrifuged at 7,300 g for 15 min at 4 °C in a TOMY MX-305 centrifuge with an AR500-03 rotor to separate the solid and liquid components. The liquid component was transferred to new 15-mL tubes, followed by the addition of 1 mL of chloroform. After vortexing and settling, the preparation was used as the phage stock. Phages T4 and T7 were obtained from the Biological Resource Center of the National Institute of Technology and Evaluation in Kisarazu, Japan (NBRC 20004 and 20005, respectively).
Host range analysis
The ability of the phages to infect various bacterial strains was evaluated as follows: Each phage (approximately 107 PFU/mL) was serially diluted tenfold up to a 107 dilution in SM buffer, 1 µL of which was spotted onto a bacterial lawn. The efficiency of plating (EoP) was calculated by dividing the titer against each bacterial strain by the titer against the reference host (TK001 for ΦWec179, 181, 186, 188, and 190; MG1655 for T4 and T7). The bacterial strains tested included 33 laboratory and clinical isolates of E. coli, four different serotypes of Salmonella enterica (NBRC3163, 3313, 13245, and 15335), and one strain of E. fergusonii (NBRC102419), details of which are provided in Supplementary Table S1. Serotype determination of E. coli strains was performed using SerotypeFinder 2.0 [22].
TEM
Phages (109−1010 PFU/mL) were purified via polyethylene glycol precipitation and loaded onto copper grids using a support film (cat. no. 649; Nissin EM Co. Ltd., Tokyo, Japan). After washing three times with distilled water, the samples were stained with fourfold-diluted EM Stainer (cat. no. 336; Nissin EM Co., Ltd., Tokyo, Japan) for 1 min. The excess liquid was removed with filter paper, and the grids were air-dried before observation using a JEM-1010 transmission electron microscope (JEOL, Tokyo, Japan) at 100 kV.
Taxonomic analysis based on whole genome sequences
Previously published genome sequences (LC739533-LC739537 and LC739539) were used [21]. Gene-sharing network analysis was performed using vConTACT2 with reference genome sequences of 3,503 phages (ProkaryoticViralRefSeq201) and visualized using Cytoscape v3.10.1 [23, 24]. Taxonomic information that was not updated within vConTACT2 was updated manually using a taxonomy browser (https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi). ANI analysis was performed using pyani with the "anib" parameter [25]. NIS was calculated using VIRIDIC [26]. The phage genomes used for the ANI and NIS comparisons were obtained using the Genbank_get_genomes_by_taxon.py script included in the Pyani package. Comparative genome analysis was performed using Genomematcher in blastp mode [27]. Synteny visualization between representative phages and type strains from related genera was performed using Clinker v0.0.31[28].
Genome annotation
Phage annotations were performed using Pharokka to complement previous annotations performed using RAST and DFAST [29]. A phylogenetic tree was constructed by the maximum-likelihood method with 1,000 bootstrap replicates, using MEGA 11 [30]. Provisional naming conventions were used to facilitate comparison of the tail-fiber-protein-coding genes. Based on the number of amino acid residues encoded, the genes were designated as follows: TfpL (943 or 958 residues), TfpM1 (890 or 891 residues), TfpM2 (681 residues), TfpS1 (353 residues), TfpS2 (235 residues), and TfpS3 (157 residues). These designations are used for discussion purposes in this report and are not proposed as official gene names.
Results
Morphological analysis
The TEM images showed that all of the phages were morphologically identical to myoviruses (Fig. 1), and all of their capsids and tails exhibited similar morphology.
Host range analysis
Host range analysis with multiple Enterobacteriaceae species revealed that these phages showed similar patterns, all infecting E. coli TK001 and the E. coli clinical isolates SUTL1, ESBL946, and ESBL991 (Fig. 2 and Supplementary Fig. S1). The ΦWec188 and 190 phages could also infect ESBL1054 and S. enterica subsp. enterica serovar Pullorum. Furthermore, ΦWec187 could infect all of the strains that were susceptible to the other phages plus ESBL1054 and the E. coli laboratory strains MG1655 and TOP10F', and it showed lytic activity (without plaque formation) against E. fergusonii. The control phages T4 and T7 infected three laboratory strains (MG1655, TOP10F', and BL21) and E. fergusonii, with T7 also infecting the clinical isolate ESBL1054. Serotype determination of E. coli strains was performed, but no clear correlation was observed between phage infectivity patterns and bacterial serotypes, suggesting that host specificity is determined by factors other than serotype-specific determinants (Supplementary Table S1).
Heat map of the host range of the isolated phages. Host range analysis was performed by spot assays with serial dilutions of each phage on various bacterial strains. The EoP (efficiency of plating) was calculated by dividing the titer against each bacterial strain by the titer against the reference host strain (TK001 for ΦWec179, 181, 186, 188, and 190; MG1655 for T4 and T7). White indicates no reduction in infectivity (EoP = 1), and progressively darker cyan shades represent decreasing infectivity levels, with dark cyan signifying complete loss of infectivity (not detected, ND). Purple shades indicate enhanced infectivity (EoP > 1). "pseud" indicates that plaques were not visible but lysis was observed in the bacterial lawn. "ND" indicates that no lysis or plaque formation occurred even at the highest phage concentration tested
Taxonomic analysis based on whole genome sequences
Our isolates possess a typical myovirus genome organization with genes arranged in functional clusters: early genes involved in DNA replication and recombination, followed by structural genes encoding capsid and tail proteins, and late genes for lysis functions. All of the genomes contain more than 10 tRNA genes and terminate with direct terminal repeats, like other Stephanstirmvirinae members.
BLASTn analysis revealed that among the six phages characterized, ΦWec179, 181, and 186 showed the highest BLAST ANI (identity × coverage) to Escherichia phage vB_EcoM_DE16 (accession no. OP595145.1) of the genus Phapecoctavirus within the subfamily Stephanstirmvirinae, with a BLAST ANI of 55.32% (80.18% identity × 69% coverage). ΦWec187, which showed notably lower GC content (37.5%) than the other phage isolates (42.2–42.3%), was most closely related to Escherichia phage AV123 (accession no. OR352953.1) of the genus Justusliebigvirus with a BLAST ANI of 94.17% (98.09% identity × 96% coverage). ΦWec188 and 190 showed the highest BLAST ANI to Escherichia phage Mt1B1_P17 (accession no. NC_052662.1) of the genus Phapecoctavirus, with a BLAST ANI of 54.36% (77.66% identity × 70% coverage) (Supplementary Table S2). The genomic characteristics of the phages are summarized in Supplementary Figure S2.
To investigate the relationships between these phage isolates more thoroughly, we performed a gene-sharing network analysis based on shared protein clusters between viral genomes using vContact2, including all known members of the genera Justusliebigvirus and Phapecoctavirus (Fig. 3A, Supplementary Table S3). ΦWec179, 181, 186, 188, and 190 formed their own unique cluster (VC315_1) with ΦWec179, 181, and 186 clustering closely together, while ΦWec188 and 190 formed a separate group within the same cluster (Fig. 3B, Supplementary Table S3). ΦWec187 belonged to VC315_0, which contained all members of the genus Justusliebigvirus of the subfamily Stephanstirmvirinae (Fig. 3B, Supplementary Table S3). VC315_2 comprised all members of the genus Phapecoctavirus of the subfamily Stephanstirmvirinae. This clustering pattern demonstrates that all known members of each established genus grouped together, while our isolates formed distinct clusters.
Gene-sharing network analysis of phage genomes. (A) Network visualization of gene-sharing relationships among 3,509 phage genomes supplemented with 15 additional Stephanstirmvirinae members (10 Phapecoctavirus and five Justusliebigvirus phages) not included in the original reference dataset to ensure complete representation of these genera. The boxed area indicates the VC315 cluster. (B) Enlarged view of the VC315 cluster showing subclusters: VC315_0 (green node) contains ΦWec187 and reference phages belonging to Justusliebigvirus, VC315_1 (orange nodes) consists of ΦWec179, 181, 186, 188, and 190, VC315_2 (light blue node) contains reference phages belonging to Phapecoctavirus, and gray nodes indicate other reference phages.
To evaluate the similarity between all NCBI-registered Stephanstirmvirinae subfamily phages (as of July 24, 2024) and ΦWec179, 181, 186, 187, 188, and 190, we calculated the nucleotide intergenomic similarity (NIS) using VIRIDIC and average nucleotide identity (ANI) using pyani (Fig. 4, Supplementary Table S4). ΦWec187 showed NIS values below 95.5% when compared to the registered Stephanstirmvirinae phages, whereas its ANI values were between 0.7% and 94%. Notably, Campylobacter phage A145 (MG065645.1), Campylobacter phage A110a (MG065689.1), Campylobacter phage A150 (MG065639.1), Campylobacter phage C3 (MG065656.1), Escherichia phage vB_Eco_PATM (OX090893.1), and Escherichia phage Paula (LR865361.1) showed NIS values between 95% and 95.5% with ΦWec187, and their ANI values were approximately 90%. However, some phages with ANI values above 90% had NIS values below 95%, whereas others with ANI values below 90% (approximately 89%) had NIS values above 95%. ΦWec179, 181, 186, 188, and 190 showed NIS values below 66% and ANI values between 20 and 50% when compared to Stephanstirmvirinae members registered in NCBI. Furthermore, both NIS and ANI values for ΦWec179, 181, 186, 188, and 190 exceeded 70%; they formed two distinct groups – the ΦWec179, 181, and 186 and ΦWec188 and 190 groups – when a 90% threshold was applied.
Similarity analysis of phage isolates of the subfamily Stephanstirmvirinae. NIS, genome length, and aligned genome fractions were determined using VIRIDIC. The phage genome information for comparison was obtained from registered Stephanstirmvirinae members using the genbank_get_genomes_by_taxon.py script included in the pyani package. Phage names are provided alongside accession numbers for clarity. Green shading indicates Justusliebigvirus members, and blue shading indicates Phapecoctavirus members.
Comparative genomic analysis visually confirmed the high similarity suggested by NIS and ANI values within the ΦWec179, 181, and 186 and ΦWec188 and 190 groups (Supplementary Fig. S2A and B). Comparison between ΦWec179 and ΦWec188 revealed some similarity, although it was less pronounced than that within groups (Supplementary Fig. S2C). Comparison between representative strains ΦWec179 (from the ΦWec179, 181, and 186 group), ΦWec188 (from the ΦWec188, 190 group), and ΦWec187 showed negligible similarity.
To further investigate the evolutionary relationships between our isolates and members of the established Stephanstirmvirinae genera, we performed comparative genomic analysis with type strains belonging to the genera Phapecoctavirus (phAPEC8) and Justusliebigvirus (phi92). This analysis revealed limited synteny between our isolates and the reference strains (Fig. 5). The comparative analysis identified conserved genes suitable for phylogenetic reconstruction, including the large terminase subunit, major capsid protein, and tail fiber genes.
Comparative genomic analysis of representative phages with type strains from related genera. Comparative genome visualization of phiWec188, phiWec179, phAPEC8 (Phapecoctavirus type strain), and phi92 (Justusliebigvirus type strain). Arrows represent ORFs, and connecting ribbons indicate homologous regions with shading intensity reflecting sequence similarity. Genome length and GC content are shown on the right.
Phylogenetic analysis of conserved structural and tail fiber proteins
To investigate evolutionary relationships and host-interaction mechanisms, we conducted phylogenetic analysis of both conserved structural genes (major capsid protein and terminase large subunit) and tail fiber genes. Tail fiber genes were selected for analysis because they are crucial determinants of phage-host interactions through bacterial surface receptor recognition [31,32,33]. Six types of genes encoding tail fiber proteins were identified based on similarity to sequences in the VOG database, using the results of Pharokka annotation: TfpL (long tail fiber protein; 943 or 958 amino acids [aa]), TfpM1 (major tail fiber protein 1; 890 or 891 aa), TfpM2 (major tail fiber protein 2; 681 aa), TfpS1 (short tail fiber protein 1; 353 aa), TfpS2 (short tail fiber protein 2; 235 aa), and TfpS3 (short tail fiber protein 3; 157 aa). These genes were present in all phages except for the justusliebigviruses, including ΦWec187, which lacked the gene encoding TfpM1.tusli
Molecular phylogenetic analysis revealed distinct clustering patterns across different protein families. For structural genes (major capsid protein and terminase large subunit) and most tail fiber genes (TfpM2, TfpS2, TfpS3), our isolates (ΦWec179, 181, 186, 188, 190), Phapecoctavirus members, and Justusliebigvirus members formed clearly separated clades (Fig. 6A, B, E, G, H). However, gene-specific variations were observed: TfpS1 showed mixed clustering, with Phapecoctavirus forming two separate groups while our isolates and Justusliebigvirus clustered together (Fig. 6F). Notably, TfpL and TfpM1 revealed internal structure within the ΦWec179, 181, 186, 188, and 190 group, with ΦWec179, 181, and 186 forming one subclade and ΦWec188 and 190 forming another, (Fig. 6C and D). TfpM1 was absent in all Justusliebigvirus members, indicating genus-specific differences in tail fiber composition (Fig. 6E).
Molecular phylogenetic trees based on the amino acid sequences of tail fiber genes. Trees were constructed using the ML method with 1000 bootstrap replicates. Blue indicates Phapecoctavirus members, green indicates Justusliebigvirus members, and yellow indicates our isolates (ΦWec179, 181, 186, 188, 190). (A) Major head (capsid) protein. (B) Terminase large subunit (Escherichia phage alia excluded due to gene spanning contig boundaries). (C) TfpL. (D) TfpM1 (absent in Justusliebigvirus members; Escherichia phage anhysbys excluded due to gene spanning contig boundaries). (E) TfpM2. (F) TfpS1. (G) TfpS2. (H) TfpS3
Furthermore, while each gene encoding a tail fiber protein (TfpL, TfpM1, TfpM2, TfpS1, TfpS2, and TfpS3) was completely conserved within the ΦWec179 group and the ΦWec188 group, slight differences were observed between the groups, with variations concentrated in the C-terminal regions. Notably, although these genes were perfectly conserved within the ΦWec188 group, their order was not conserved. Additionally, the arrangement of conserved tail fiber genes (TfpL, TfpM2, TfpS1, TfpS2, and TfpS3) in ΦWec187 matched that in ΦWec188.
Discussion
In this study, we characterized six novel E. coli phages, using genomic, morphological, and host range analysis. Our findings revealed that five of these phages (ΦWec179, ΦWec181, ΦWec186, ΦWec188, and ΦWec190) represent a new genus within the subfamily Stephanstirmvirinae, for which we propose the name "Wecvirus”. These phages form two distinct species, based on their genomic similarities and host range patterns. The remaining phage, ΦWec187, shows characteristics of a Justusliebigvirus, although it cannot be classified accurately under the current classification criteria.
Morphological analysis showed that all the phage isolates have a myovirus morphotype, consistent with that observed for other members of the subfamily Stephanstirmvirinae [12,13,14, 16]. Host range patterns revealed distinct infection profiles among the isolated phages. From the perspective of host range testing, all of the environmental phages analyzed in this study, including T4 (class Caudoviricetes, family Straboviridae, subfamily Tevenvirinae, genus Tequatrovirus) and T7 (class Caudoviricetes, family Autotranscriptaviridae, subfamily Studiervirinae, genus Teseptimavirus), which are phylogenetically distinct from the wecviruses, infected approximately 13% of the host strains, confirming the narrow host range typical of Enterobacteriaceae-infecting phages [34]. The differential ability to infect laboratory strains versus clinical isolates suggests an adaptation to specific host populations. Notably, the broader host range of ΦWec187 compared to those of other isolates indicates potential differences in host recognition mechanisms. Our findings align with those of previous studies on Stephanstirmvirinae members, where similar variations in host specificity were reported. For instance, Nicolas et al. reported that Phapecoctavirus phages infected approximately 24% of the strains in a panel of 32 E. coli strains, primarily avian pathogenic E. coli (APEC) strains, whereas "Wecvirus" isolates demonstrated a narrower host range against different clinical isolates [12]. Similarly, Markuskovál et al. observed that the Justusliebigvirus phage phi92 had a 23% infection rate against uropathogenic E. coli [17]. These differences in host specificity possibly reflect the diverse origins of the tested bacterial strains rather than inherent differences in host range breadth.
Taxonomic analysis revealed significant evolutionary relationships among members of the subfamily Stephanstirmvirinae. Phages ΦWec179, ΦWec181, ΦWec186, ΦWec188, and ΦWec190 formed a distinct cluster and showed similarity values below the 70% genus-level threshold in both NIS and ANI analysis when compared with all known members. This discovery suggests that the current sampling has not yet captured the full range of phage diversity within this subfamily. The limited synteny observed between "Wecvirus" and established genera (phAPEC8 and phi92) in comparative genomic analysis further supports the proposal of "Wecvirus" as a distinct genus within the subfamily Stephanstirmvirinae.
Among these five phages, NIS and ANI analysis revealed two distinct clusters with high levels of internal similarity (> 95%). Between-group similarity values of 80–88% indicated that these clusters represent separate species, despite their close evolutionary relationship. The consistently higher NIS values (60–65%) compared to the corresponding ANI values (20–50%) between these phages and reference phages reflect fundamental differences in how these metrics are calculated. These patterns were consistent with BLASTn results: ΦWec187 showed high similarity to known Justusliebigvirus members (BLAST ANI: 94.17%), while "Wecvirus" phages (ΦWec179-190) showed low similarity to known Phapecoctavirus members (BLAST ANI < 70%), supporting their classification as a distinct genus. NIS, which performs direct whole-genome comparisons, appears to capture conserved regions and overall genome structure more effectively. Conversely, ANI values, which represent average similarities across fragmented sequences [35], may better evaluate local similarities but do not completely account for phage-specific genomic features, such as extensive rearrangements or large-scale insertions/deletions.
We observed discrepancies between the NIS and ANI values of ΦWec187, making classification of this phage under the current taxonomic criteria challenging. The unique characteristics of this phage highlight the complexity of current phage classification systems and warrant further investigation. We propose that ΦWec187 should be temporarily classified as “unclassified Stephanstirmvirinae” and investigated in the future.
Phylogenetic analysis of structural genes provided additional support for our taxonomic proposals. Both major capsid protein and terminase large subunit phylogenies clearly separated our isolates from the established genera Phapecoctavirus and Justusliebigvirus, demonstrating that the distinctiveness observed in whole-genome analysis extends to conserved structural components. These results confirm that the proposed genus "Wecvirus" represents a genuine evolutionary lineage rather than a product of extensive horizontal gene transfer or recombination events that might affect only variable genomic regions.
Our findings regarding tail fiber diversity in the "Wecvirus" phages provide insights into the relationship between phage structure and host specificity within the subfamily Stephanstirmvirinae. Interestingly, the type strains of the two established genera in this subfamily exhibit notably broad host ranges. Phage phi92 (Justusliebigvirus) demonstrates remarkable versatility, which can be attributed to its four distinct types of tail fibers/tailspikes that enable attachment to various bacterial surface structures, allowing infection of both E. coli and Salmonella strains [36]. Similarly, phAPEC8 (Phapecoctavirus) shows broad infectivity against various APEC strains [37]. In contrast, our "Wecvirus" isolates demonstrated narrower host ranges (approximately 13% infectivity), which is more typical of Enterobacteriaceae-infecting phages.
The species-specific differences in tail fiber composition we observed in the wecviruses likely represent a more specialized evolutionary strategy for host adaptation, perhaps not as extensive as the "multivalent adsorption apparatus" described for phi92 [36]. This structural difference in tail fiber complexity may explain why "Wecvirus" members exhibit more-restricted host ranges than other Stephanstirmvirinae members, suggesting that tail fiber diversity is a key determinant of host range breadth within this subfamily.
Tail fibers are composed of multiple proteins that play crucial roles in host recognition [38, 39]. For example, the proximal and distal tail fibers of phage T4 have distinct protein compositions [40]. The multiple tail fiber genes identified in this study possibly constitute different parts of the tail structure and may enable the recognition of different host molecules [41].
Detailed analysis of tail fiber proteins provides insights into phage-host interactions and adaptation mechanisms. The differences in host ranges correlate with group-specific characteristics of tail fiber composition, particularly in the C-terminal regions, which are crucial for receptor recognition [42,43,44,45]. While whole-genome comparisons showed clear grouping patterns, phylogenetic analysis based on tail fiber gene sequences revealed more-complex relationships, particularly regarding ΦWec187, suggesting mosaic evolution via horizontal gene transfer rather than simple linear evolution [9, 46].
Our previous study demonstrated that all five phages target the R1 core lipopolysaccharide (LPS), with both groups specifically recognizing galactose transferred by WaaT [47]. The C-terminal sequence variations in tail fibers suggest the acquisition of additional host recognition capabilities while maintaining this core LPS-binding function. In particular, the ΦWec179-186 group appears to utilize flagella as a secondary receptor, while the ΦWec188-190 group requires the inner membrane protein, YhaH, indicating distinct evolutionary trajectories in host adaptation.
Subtle differences in host range were observed, even within the groups. For example, ΦWec179, 181, and 186, despite belonging to the same group and showing complete conservation of tail fiber protein sequences, exhibited slight variations in infection efficiency against certain strains. These observations suggest that, while the tail structure largely determines the host range, host utilization efficiency may involve factors beyond phage adsorption, such as replication efficiency within host cells. For instance, the T4 collar and whiskers regulate long tail fiber retraction under unfavorable conditions [48], and bacterial phage defense systems can reduce efficiency of plating (EoP) [49]. Future research should focus on the functional analysis of receptor-binding proteins and investigation of interactions with the host cell surface. Additionally, functional analysis of genes whose functions are currently unknown may contribute to our understanding of host specificity.
Based on the results of this study and per the ICTV naming conventions (Rules 3.21, 3.22) [50], we propose the following taxonomic classifications:
-
1.
Establishment of a new genus, “Wecvirus”, within the subfamily Stephanstirmvirinae
-
2.
Designation of a new species, “Wecvirus wec179”, comprising ΦWec179, ΦWec181, and ΦWec186
-
3.
Designation of a new species, “Wecvirus wec188”, comprising ΦWec188 and ΦWec190
These names follow the ICTV guidelines [50], with the genus name consisting of a single word with the suffix “-virus” and the species names comprising two words: the genus name followed by the isolate number. The new genus name is based on the representative isolate designation (Wec). The members of the proposed genus "Wecvirus" are genetically distinct from known Stephanstirmvirinae subfamily members (ANI and NIS < 70%), and the members of each of its proposed species (“Wecvirus wec179” and “Wecvirus wec188”) display unique host ranges. The establishment of this new genus and its constituent species will enhance our understanding of phage diversity and refine the taxonomic system within the subfamily Stephanstirmvirinae. Future research is expected to identify additional phages belonging to this new genus and elucidate the ecological role of its members through functional analysis of receptor-binding proteins and investigation of host-phage interactions.
Data availability
All data supporting the findings of this study are included within the manuscript and its supplementary information files.
References
Musheigan (2020) Are There 1031 Virus Particles on Earth, or More, or Fewer? J Bacteriol 202:e00052–e00020. https://doi.org/10.1128/jb.00052-20
Scanlan PD, Buckling A (2012) Co-evolution with lytic phage selects for the mucoid phenotype of Pseudomonas fluorescens SBW25. ISME J 6:1148–1158. https://doi.org/10.1038/ismej.2011.174
Sausset R, Petit MA, Paepe M, De (2020) New insights into intestinal phages. Mucosal Immunol 13:205–215. https://doi.org/10.1038/s41385-019-0250-5
Tanji Y, Hattori K, Suzuki K, Miyanaga K (2008) Spontaneous deletion of a 209-kilobase-pair fragment from the Escherichia coli genome occurs with acquisition of resistance to an assortment of infectious phages. Appl Environ Microbiol 74:4256–4263. https://doi.org/10.1128/AEM.00243-08
Synnott AJ, Kuang Y, Kurimoto M et al (2009) Isolation from sewage influent and characterization of novel Staphylococcus aureus bacteriophages with wide host ranges and potent lytic capabilities. Appl Environ Microbiol 75:4483–4490. https://doi.org/10.1128/AEM.02641-08
Schooley RT, Biswas B, Gill JJ et al (2017) Development and use of personalized bacteriophage-based therapeutic cocktails to treat a patient with a disseminated resistant Acinetobacter baumannii infection. Antimicrob Agents Chemother 61:e00954–e00917. https://doi.org/10.1128/AAC.00954-17
Jault P, Leclerc T, Jennes S et al (2019) Efficacy and tolerability of a cocktail of bacteriophages to treat burn wounds infected by Pseudomonas aeruginosa (PhagoBurn): a randomised, controlled, double-blind phase 1/2 trial. Lancet Infect Dis 19:35–45. https://doi.org/10.1016/S1473-3099(18)30482-1
Fujimoto K, Kimura Y, Shimohigoshi M et al (2020) Metagenome Data on Intestinal Phage-Bacteria Associations Aids the Development of Phage Therapy against Pathobionts. Cell Host Microbe 28:380–389e9. https://doi.org/10.1016/j.chom.2020.06.005
Casjens SR, Thuman-Commike PA (2011) Evolution of mosaically related tailed bacteriophage genomes seen through the lens of phage P22 virion assembly. Virology 411:393–415. https://doi.org/10.1016/j.virol.2010.12.046
Turner D, Shkoporov AN, Lood C et al (2023) Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the ICTV bacterial viruses subcommittee. Arch Virol 168:1–9. https://doi.org/10.1007/s00705-022-05694-2
Montso PK, Kropinski AM, Mokoena F et al (2023) Comparative genomics and proteomics analysis of phages infecting multi-drug resistant Escherichia coli O177 isolated from cattle faeces. Sci Rep 13:1–13. https://doi.org/10.1038/s41598-023-48788-w
Nicolas M, Trotereau A, Culot A et al (2023) Isolation and Characterization of a Novel Phage Collection against Avian-Pathogenic Escherichia coli. Microbiol Spectr 11. https://doi.org/10.1128/spectrum.04296-22
Wintachai P, Surachat K, Chaimaha G et al (2022) Isolation and Characterization of a Phapecoctavirus Infecting Multidrug-Resistant Acinetobacter baumannii in A549 Alveolar Epithelial Cells. Viruses 14
Silva EC, Quinde CA, Cieza B et al (2024) Molecular Characterization and Genome Mechanical Features of Two Newly Isolated Polyvalent Bacteriophages Infecting Pseudomonas syringae pv. garcae. Genes (Basel) 15. https://doi.org/10.3390/genes15010113
Khan T, Haider A, Rahman S et al (2024) Complete genome sequence of Escherichia phage iGC_PHA_EC001. Microbiol Resour Announc 13:21–24. https://doi.org/10.1128/mra.00842-23
Smith KR, Bumunang EW, Schlechte J et al (2023) The Isolation and Characterization of Bacteriophages Infecting Avian Pathogenic Escherichia coli O1, O2 and O78 Strains. Viruses 15:1–22. https://doi.org/10.3390/v15102095
Markusková B, Elnwrani S, Andrezál M et al (2024) Characterization of bacteriophages infecting multidrug-resistant uropathogenic Escherichia coli strains. Arch Virol 169. https://doi.org/10.1007/s00705-024-06063-x
Khalifeh A, Kraberger S, Dziewulska D et al (2021) Complete Genome Sequence of a Phapecoctavirus Isolated from a Pigeon Cloacal Swab Sample. Microbiol Resour Announc 10:2020–2022. https://doi.org/10.1128/mra.01471-20
WHO publishes list of bacteria for which new antibiotics are urgently needed. In: World Heal. Olivia Lawe Davies, Organ (2017) https://www.who.int/news/item/27-02-2017-who-publishes-list-of-bacteria-for-which-new-antibiotics-are-urgently-needed. Accessed 2 Sep 2022
Korf IHE, Meier-Kolthoff JP, Adriaenssens EM et al (2019) Still something to discover: Novel insights into Escherichia coli phage diversity and taxonomy. Viruses 11:1–29. https://doi.org/10.3390/v11050454
Kaneko T, Osaka T, Tsuneda S (2023) Tailoring Effective Phage Cocktails for Long-Term Lysis of Escherichia coli Based on Physiological Properties of Constituent Phages. PHAGE 4:128–135. https://doi.org/10.1089/phage.2023.0016
Joensen KG, Tetzschner AMM, Iguchi A et al (2015) Rapid and Easy In Silico Serotyping of Escherichia coli Isolates by Use of Whole-Genome Sequencing Data. J Clin Microbiol 53:2410–2426. https://doi.org/10.1128/jcm.00008-15
Bolduc B, Jang H, Bin, Doulcier G et al (2017) vConTACT: an iVirus tool to classify double-stranded DNA viruses that infect Archaea and Bacteria. 1–26. https://doi.org/10.7717/peerj.3243
Paul S, Andrew M, Owen O et al (1971) Cytoscape: A Software Environment for Integrated Models. Genome Res 13:426. https://doi.org/10.1101/gr.1239303.metabolite
Pritchard L, Glover RH, Humphris S et al (2016) Genomics and taxonomy in diagnostics for food security: Soft-rotting enterobacterial plant pathogens. Anal Methods 8:12–24. https://doi.org/10.1039/c5ay02550h
Moraru C, Varsani A, Kropinski AM (2020) VIRIDIC — A Novel Tool to Calculate the Intergenomic Similarities of. Viruses 12:1268
Ohtsubo Y, Ikeda-Ohtsubo W, Nagata Y, Tsuda M (2008) GenomeMatcher: A graphical user interface for DNA sequence comparison. BMC Bioinformatics 9:1–9. https://doi.org/10.1186/1471-2105-9-376
Gilchrist CLM, Chooi Y-H (2021) clinker & clustermap.js: automatic generation of gene cluster comparison figures. Bioinformatics 37:2473–2475. https://doi.org/10.1093/bioinformatics/btab007
Bouras G, Nepal R, Houtak G et al (2023) Pharokka: a fast scalable bacteriophage annotation tool. Bioinformatics 39:1–4. https://doi.org/10.1093/bioinformatics/btac776
Tamura K, Stecher G, Kumar S (2021) MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol 38:3022–3027. https://doi.org/10.1093/molbev/msab120
Russell J, Bikard D (2019) Learning from Antibodies: Phage Host-Range Engineering. Cell Host Microbe 26:445–446. https://doi.org/10.1016/j.chom.2019.09.010
Yao G, Le T, Korn AM et al (2023) Phage Milagro: a platform for engineering a broad host range virulent phage for Burkholderia. J Virol 97:e0085023. https://doi.org/10.1128/jvi.00850-23
Le S, He X, Tan Y et al (2013) Mapping the tail fiber as the receptor binding protein responsible for differential host specificity of Pseudomonas aeruginosa bacteriophages PaP1 and JG004. PLoS ONE 8:e68562. https://doi.org/10.1371/journal.pone.0068562
Mattila S, Ruotsalainen P, Jalasvuori M (2015) On-Demand Isolation of Bacteriophages Against Drug-Resistant Bacteria for Personalized Phage Therapy Bacteria Strains and Culturing. Front Microbiol 6:1–7. https://doi.org/10.3389/fmicb.2015.01271
Goris J, Konstantinidis KT, Klappenbach JA et al (2007) DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol 57:81–91. https://doi.org/10.1099/ijs.0.64483-0
Schwarzer D, Buettner FFR, Browning C et al (2012) A Multivalent Adsorption Apparatus Explains the Broad Host Range of Phage phi92: a Comprehensive Genomic and Structural Analysis. J Virol 86:10384–10398. https://doi.org/10.1128/jvi.00801-12
Tsonos J, Adriaenssens EM, Klumpp J et al (2012) Complete Genome Sequence of the Novel Escherichia coli Phage phAPEC8. J Virol 86:13117–13118. https://doi.org/10.1128/jvi.02374-12
Nobrega FL, Vlot M, de Jonge PA et al (2018) Targeting mechanisms of tailed bacteriophages. Nat Rev Microbiol 16:760–773. https://doi.org/10.1038/s41579-018-0070-8
Ando H, Lemire S, Diana PP, Lu KT (2016) Engineering Modular Viral Scaffolds for Targeted Bacterial Population Editing. Cell Syst 1:187–196. https://doi.org/10.1016/j.cels.2015.08.013
Islam MZ, Fokine A, Mahalingam M et al (2019) Molecular Anatomy of the Receptor Binding Module of a Bacteriophage Long Tail Fiber. PLoS Pathog 15:1–21. https://doi.org/10.1371/journal.ppat.1008193
Takeuchi I, Osada K, Azam AH et al (2016) The Presence of Two Receptor-Binding Proteins Contributes to the Wide Host Range of Staphylococcal Twort-Like Phages. Appl Environ Microbiol 82:5763–5774. https://doi.org/10.1128/AEM.01385-16.Editor
Washizaki A, Yonesaki T, Otsuka Y (2016) Characterization of the interactions between Escherichia coli receptors, LPS and OmpC, and bacteriophage T4 long tail fibers. Microbiologyopen 5:1003–1015. https://doi.org/10.1002/mbo3.384
Chen P, Sun H, Ren H et al (2020) LamB, OmpC, and the Core Lipopolysaccharide of Escherichia coli K-12 Function as Receptors of Bacteriophage Bp7. J Virol 94:1–14. https://doi.org/10.1128/jvi.00325-20
Steven AC, Trus BL, Maizel JV et al (1988) Molecular substructure of a viral receptor-recognition protein. The gp17 tail-fiber of bacteriophage T7. J Mol Biol 200:351–365. https://doi.org/10.1016/0022-2836(88)90246-X
Degroux S, Effantin G, Linares R et al (2023) Deciphering Bacteriophage T5 Host Recognition Mechanism and Infection Trigger. J Virol 97. https://doi.org/10.1128/jvi.01584-22
Belcaid M, Bergeron A, Poisson G (2010) Mosaic graphs and comparative genomics in phage communities. J Comput Biol 17:1315–1326. https://doi.org/10.1089/cmb.2010.0108
Kaneko T, Osaka T, Inagaki M, Tsuneda S (2024) From phenotype to receptor: validating physiological clustering of Escherichia coli phages through comprehensive receptor analysis. J Virol 99:e01061–25. https://doi.org/10.1101/2024.11.29.626071
Fokine A, Zhang Z, Kanamaru S et al (2014) The Molecular Architecture of the Bacteriophage T4 Neck. J Mol Biol 23:1–7. https://doi.org/10.1016/j.coph.2007.10.002.Taste
Vassallo CN, Doering CR, Littlehale ML et al (2022) A functional selection reveals previously undetected anti-phage defence systems in the E. coli pangenome. Nat Microbiol 7:1568–1579. https://doi.org/10.1038/s41564-022-01219-4
Statutes | ICTV (2024) https://ictv.global/about/statutes. Accessed 30 Aug
Acknowledgements
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors. The initial English translation of this manuscript was facilitated using the DeepL Translator (DeepL GmbH), ChatGPT-3.5 (OpenAI), Claude 3.5 Sonnet (Anthropic), and Gemini 1.5 Flash (Google DeepMind). We thank Editage (www.editage.com) for professional English language editing. We carefully reviewed and verified the accuracy of the content at each stage of translation and editing.
Author information
Authors and Affiliations
Contributions
Tomoyoshi Kaneko: conceptualization, data curation, formal analysis, writing – original draft, investigation. Jumpei Uchiyama: supervision, writing – reviewing and editing. Osaka Toshifumi: supervision, writing – reviewing and editing.
Satoshi Tsuneda: supervision, writing – reviewing and editing
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest associated with this article.
Additional information
Handling editor Johannes Wittmann
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic Supplementary Material
Below is the link to the electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kaneko, T., Uchiyama, J., Osaka, T. et al. Novel Escherichia coli phages representing a distinct genus within the subfamily Stephanstirmvirinae: genome and host range characteristics. Arch Virol 171, 18 (2026). https://doi.org/10.1007/s00705-025-06469-1
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1007/s00705-025-06469-1








