Introduction

The type IV secretion systems (T4SSs) are a diverse superfamily of translocation systems found in many species of bacteria1,2. Functionally, T4SSs are classified mainly as conjugation machines or effector translocators3. The conjugation systems mediate the transfer of mobile genetic elements (MGEs) and their cargoes of antibiotic resistance genes and virulence determinants among bacteria4. The effector translocators instead deliver effector proteins and other macromolecules to eukaryotic cells to aid in the establishment of bacterial pathogenic or symbiotic interactions5. All T4SSs are assembled from a minimum set of signature subunits, which are homologs or orthologs of the 11 VirB and VirD4 components that build the prototypical T4SS used by Agrobacterium tumefaciens to deliver oncogenic T-DNA to plants6. T4SSs designated as minimized are composed only of VirB/VirD4-like subunits. Other T4SSs classified as expanded have acquired novel domains among the VirB-like subunits or appropriated as many as two dozen system-specific components of distinct ancestries presumably for specialized functions. The best-characterized of the expanded systems are the Dot/Icm effector translocator functioning in Legionella pneumophila, the Cag effector system in Helicobacter pylori, and the Tra conjugation machine encoded by IncF plasmids2.

The T4SSs are organized as cell-envelope-spanning nanomachines with distinct architectural features that include an outer-membrane core complex (OMCC), an inner membrane complex (IMC), a connecting periplasmic stalk or cylinder, and two or three hexameric ATPases positioned at the cytoplasmic face of the IMC1,7. Conjugation machines and some effector translocator systems also elaborate extracellular pili that mediate attachment to abiotic and biotic surfaces and contribute to the formation of robust biofilms8,9,10. Although the IMCs are refractory to purification for structural analyses, OMCCs associated with two minimized (Xanthomonas citri VirB/VirD4, pKM101-encoded Tra) and two expanded (L. pneumophila Dot/Icm, H. pylori Cag) systems have been solved at or near-atomic resolutions11,12,13,14,15,16 OMCCs of minimized systems are composed only of homologs or orthologs of the A. tumefaciens VirB7, VirB9, and VirB10 subunits7,17, whereas those of expanded systems typically have greater subunit complexity. Among the structurally characterized OMCCs, those of minimized systems exhibit uniform 14-fold symmetry, but those of expanded Dot/Icm and Cag systems consist of substructures with different symmetries2,13,14,15,16. The symmetry mismatches in these latter systems appear to be accommodated through different structural motifs, but how machine asymmetries contribute to T4SSF function is not defined.

The T4SSF mediates high-frequency transfer of IncF plasmids and accounts for the prevalence of these multi-drug resistance plasmids in environmental and clinical settings. The T4SSF also is the only T4SS shown to date to produce pili capable of dynamic extension and retraction9,18. T4SSF nanomachines were recently visualized at a resolution of ~2.3 nm in the native context of the bacterial cell envelope by in situ cryoelectron tomography (CryoET)19. These in situ images identified certain structural features of the T4SSF, such as the OMCCF attached to the OM, a central cylinder spanning the periplasm to the inner membrane (IM), and an inner membrane complex (IMC) dominated at its cytoplasmic interface by the hexameric VirB4-like ATPase TraC. The OMCCF, with an overall width of ~250 Å and a 13-fold-symmetrical outer-lobed ring and an inner ring of undefined symmetry, appears to form a pore across the OM for substrate transfer and also serves as a basal platform for the dynamic F pilus. These features strongly indicate that the OMCCF plays a central role in regulating transitions between the distinct functional states of the T4SSF.

In this work, we report the structure of the OMCCF solved at ~3.0 Å resolution by single-particle cryolectron microscopy (CryoEM). The near-atomic models reveal that the OMCCF is composed of only three subunits, VirB7-like TraV, VirB9-like TraK, and VirB10-like TraB, reminiscent of the minimized systems. Remarkably, the OMCCF is configured as two distinct substructures, an outer ring complex (ORC) with 13-fold symmetry and a central cone (CC) with 17-fold symmetry, and thus exhibits the property of symmetry mismatch shared by the expanded systems. VirB7-like TraV and VirB10-like TraB accommodate the symmetry mismatch by binding both substructures, a property designated here as an intraprotein symmetry alteration. We propose a model in which the stable binding of one domain of TraV and TraB to one substructure and dynamic binding of a second domain to subsets of sites on the mismatched substructure impart conformational flexibility to the OMCCF required for activation of the T4SSF for F plasmid transfer or F pilus production.

Results

Overall architecture of the T4SSF OMCC

We determined an asymmetric structure (C1 reconstruction) of the F-encoded OMCC (OMCCF) at 4.22 Å resolution from 70,700 purified particles (Fig. 1a). The particles were purified from E. coli harboring the IncFV plasmid pED208 engineered to carry a Strep-tag sequence at the 3′ end of traB. This strain stably produces TraB-Str and elaborates fully functional T4SS machines, as evidenced by wild-type levels of F plasmid transfer and F pilus production (Supplementary Fig. S1a). We used sequential Strep-tag affinity pull-down and size exclusion chromatography to purify complexes (Supplementary Fig. S1b, c). Three dominant species detected in SDS-polyacrylamide gels were identified as TraV, TraK, and TraB by LC-MS/MS mass spectrometry (Supplementary Fig. S1d). Purified material also contained minor amounts of the VirB/VirD4 homologs and several F-specific Tra proteins, but these were not detected in the high-resolution OMCCF structure. Data were collected and processed as illustrated in Supplementary Fig. S2 flowcharts.

Fig. 1: Cryo-EM structure of the F-encoded OMCC.
figure 1

a Asymmetric reconstruction (C1) of the OMCCF at 4.22 Å (Supplementary Fig. 2). The C1 structure shows two distinct subassemblies, an outer ring complex (ORC) with 13-fold symmetry and a central cone (CC) with 17-fold symmetry. b A high-resolution map synthesized from the symmetrized ORC and CC maps. ORC is resolved to 3.31 Å with 13-fold symmetry imposed and the CC is at 2.95 Å resolution with 17-fold symmetry imposed (Supplementary Fig. 2). c The atomic models built from (b). d A central slice of the atomic model (c) with different sizes labeled. Subunit colors: TraK (Green), TraB (Blue), TraV (Pink). Structures of the C1, C13, and C17 maps have been deposited with the EMD accession numbers EMD-24768, EMD-24769, and EMD-24770 and atomic models of the C13 and C17 structures have been deposited with the PDB accession numbers 7SPB and 7SPC.

The C1-reconstructed complex is composed of two distinct substructures, the outer ring complex (ORC) with 13-fold symmetry and the central cone (CC) with 17-fold symmetry (Fig. 1a). To enhance the resolution, we performed both C13 and C17 symmetry reconstructions. In the 13-fold symmetry-imposed reconstruction (3.31 Å resolution), the ORC is well-resolved (Supplementary Fig. S3a, c). The CC is blurred due to the application of the wrong symmetry, but the cone shape was readily evident by filtering to a 10 Å resolution. By imposing 17-fold symmetry, we resolved the CC at a resolution of 2.95 Å (Supplementary Fig. S3b, c). A blurred density was evident at the top of the CC, which likely consists of outer-membrane (OM) lipids or detergent as this region of the OMCC is known to embed into the OM17.

In the C13/C17 map, TraK and TraV are the major constituents of the ORC and TraB dominates the CC (Fig. 1b). Models of these subunits were built into the map, resulting in structural definition of the entire OMCCF (Fig. 1b, c, Table 1). In side-view, the ORC is divisible into upper and lower tiers with outer diameters of 265 Å and 210 Å, respectively (Fig. 1d). In its overall width and 13-lobed architecture, the upper tier of the ORC matches the in situ OMCCF structure of the T4SSF visualized in the bacterial cell envelope by CryoET19. The CC, which was not well-resolved in the in situ structure, has a chamber of 125 Å at its widest point that tapers to a 30 Å pore at the top and to 112 Å at the bottom.

Table 1 Cryo-EM data collection, refinement and validation statistics.

The entirety of the OMCCF isolated from the pED208-carrying strain (wild-type, WT) is composed of TraV, TraK, and TraB. Importantly, the ORC and CC substructures of all assigned classes of OMCCF particles uniformly displayed 13- and 17-fold symmetries prior to imposing symmetry. We asked whether a strain engineered to produce TraV, TraK, and TraB in the absence of other Tra components also elaborates the OMCCF. Indeed, we were able to isolate OMCCF particles from the TraV/K/B-producing strain, establishing that other F-encoded Tra proteins are not essential for its assembly. However, in contrast to the OMCCF particles obtained from the WT strain, those from the TraV/K/B-producing strain were highly heterogeneous in size and symmetry. ORC substructures were grouped into 4 distinct classes with symmetries ranging from 11- to 14-fold, although the 13-fold symmetry class was the most abundantly populated (Supplementary Fig. S4). Reconstruction of the class with 13-fold symmetry imposed resulted in a structure of the ORC at a higher resolution of 2.97 Å than the 3.31 Å resolution achieved for the ORC from the WT strain (Supplementary Fig. S4). For the 13-fold symmetrical ORC class, we further refined the CC and obtained classes with similar numbers of particles exhibiting symmetries of 16-fold or 17-fold. The C16 and C17 CC structures were solved at resolutions of 3.87 Å and 3.54 Å, respectively (Supplementary Fig. S4). As discussed further below, constituents of the OMCCF’s obtained from the WT and TraV/K/B-producing strains align well. However, the observed heterogeneity in OMCCF particles isolated from the TraV/K/B-producing strain strongly indicates that one or more other pED208-encoded Tra subunits play important roles in stabilizing the C13:C17 symmetry of the OMCCF, even though the stabilizing factor(s) was not detected in the OMCCF map.

Structure of the outer ring complex

The ORC is 90 Å in height and 48 Å in thickness, and presents as a distinct upper-lobed ring and a lower-continuous ring (Fig. 2a). It consists of 26 copies of TraK and 26 copies of the C-terminal (CT) domain (residues V150-N204) of TraV (Figs. 1, 2a). Reminiscent of other VirB9 homologs, TraK has two globular domains separated by a flexible linker (Fig. 2b, c). The N-terminal (NT) domain (residues Q25-E120) adopts a β-sandwich fold made of five-stranded and four-stranded β-sheets, and the C-terminal (CT) domain (residues Y136-N238) is comprised of 4 β-strands and one α-helix (α1) that extends away from the β-sandwich (Fig. 2d–f). In the structural unit of the ORC (Fig. S5a), TraK assembles as a dimer with the two TraKNT domains situated side by side at an angle of 13.85° to each other. The two TraKCT domains are flipped 180° relative to each other, and the CT dimer sits laterally on top of the NT dimer (Supplementary Fig. S5a). The TraKCT domains are separated from each other by TraV (see below), although an electrostatic interaction between R213 of TraK subunit 1 (S1) and E131 of subunit 2 (S2) likely participates in stabilization of the TraKCT dimer (Fig. 2e).

Fig. 2: Structure of the outer ring complex (ORC).
figure 2

a Overview (upper) and a central cut view (bottom) of the 3.31 Å ORC from the pED208-carrying (WT) strain reconstructed with 13-fold symmetry imposed. b, c Outer and inner views of the heterodimer structural unit including two TraK monomers and two C-terminal domains of TraV (TraVCT). Different subunits are denoted S1 and S2. The two N-terminal domains of TraK (TraKNT) are arranged side-by-side, and the two C-terminal domains (TraKCT) are flipped 180° relative to each other. The two TraVCT domains pack antiparallel to each other. A flexible linker connecting the TraKNT-S2 and TraKCT-S2 domains is visible as a blurred density at low (10 Å) resolution. Two TraB linker segments corresponding to residues R176-186 form specific contacts with TraKNT domains. d,e,f Atomic models of the (d) entire structural unit, (e) the upper TraKCT/TraVCT complex showing residues whose interactions are predicted to stabilize the ORC, and (f) the lower TraKNT dimer with associated TraB linker segments. Colors for all panels: TraK-S1 and TraV-S2(dark and light green), TraV-S1 and TraV-S2 (dark and light pink), TraBR176-S186 linkers (blue).

The two copies of TraK are therefore configured quite differently in the structural unit. In TraK-S1, the NT and CT domains are juxtaposed, but in TraK-S2 these domains are separated by a distance of ~43 Å (Fig. 2d). These structural differences are afforded by TraK’s flexible linker (residues E120-Q134) (Supplementary Fig. S5a). In TraK-S1, the linker is folded and readily detectable in the high-resolution map, but in TraK-S2 it is visible as an elongated density only when the map was filtered to 10 Å resolution (Fig. 2b, c). In the assembled ORC, 13 structural units are arranged in a ring, with the TraKNT domains forming a lower-continuous ring with 26-fold symmetry and the TraKCT domains extending vertically to generate the 13-fold symmetrical, upper-lobed ring (Fig. 2a and Supplementary S4). Although the upper-lobed and lower rings of the ORC exhibit a C13:C26 symmetry mismatch, this symmetry mismatch is readily accounted for by the unique architecture of the TraK dimer in the structural unit. Because only 13 structural units build the entire ORC, we will refer to the ORC as having an overall C13 symmetry, except when specifically discussing the 26 copies of the TraKNT domains.

The C-terminal domain of TraV (TraVCT; residues V150-N204) is composed of two antiparallel β-strands connected by a loop (Fig. 2b–e). An interaction between residues W175 and H183 within the intervening loop is expected to stabilize this subdomain. In the structural unit, one TraVCT domain is situated above and antiparallel to a second. The two TraVCT domains are sandwiched between the two TraKCT domains, configured so that the intervening loops of each TraVCT domain bend inward to form a groove. This groove is occupied by an α-helix situated N-proximally to the CT domain of TraK-S2 (Fig. 2e). This TraK–TraV interaction surface, plus the TraKCT-S1 R213–S2 E131 interaction mentioned above, is predicted to stabilize the 3D architecture of the TraK–TraVCT structural unit. In the assembled ORC, the upper and lower TraVCT domains of one structural unit interact with equivalently-positioned TraVCT domains in the flanking structural units, resulting in a two-stranded ‘ORC belt’ that fixes the TraKCT lobes laterally within the global structure (Fig. 2 and Supplementary Fig. S5b).

In the lower ring of the ORC, we identified additional densities associated with the inner surfaces of each of the 26 TraKNT domains (Fig. 2c, f). By modeling, we confirmed that these densities correspond to residues R176-S186 of TraB, which is part of a long linker (residues K194-Q143) projecting from the C-terminal β-barrel domain through the base of the OMCCF to the inner membrane. As discussed further below, this finding is important in view of further evidence that 17 copies of the TraBβ-barrel domain assemble as the CC. This TraB linker therefore bridges the mismatch between the 26-fold symmetrical lower ring of the ORC and 17-fold symmetrical CC.

The features described above for the ORC substructure of the OMCCF obtained from the pED208-carrying strain are also evident for the class of C13-symmetrized ORC substructures from the TraV/K/B-producing strain. Structural units from the two strain sources exhibit the same overall architecture, and the TraK and TraVCT constituents also superimpose very well (Supplementary Fig. S6a). The C13-symmetrized ORCs from the WT and TraV/K/B-producing strains can therefore be considered equivalent, although other classes of ORCs with distinct symmetries also assemble in this strain (compare Supplementary Figs. S3, S4).

We tested the functional importance of intersubunit contacts predicted to stabilize the ORC or accommodate the ORC-CC symmetry mismatch (Supplementary Fig. S7). First, we deleted traV, traK, and traB from pED208 and confirmed with complementation tests that each mutation is nonpolar on downstream gene expression. As expected from previous studies of other T4SSs, the ΔtraV and ΔtraB mutant strains did not produce functional T4SSF’s, as evidenced by a lack of detectable F pilus production or F plasmid transfer. Although the ΔtraK mutant also failed to produce F pili, strikingly we found that this mutant retained donor proficiency at a level (10−6 Tc’/D) that is well above the threshold of detection (<10−8 Tc’s/D) (Supplementary Fig. S7a,b). We generated additional mutations in traK and traV to test the functional importance of predicted interfacial contacts or stabilizing domains, including (i) an Asp substitution for TraK.R213, (ii) Ala substitutions for TraV.W175 and TraV.H183, and (iii) deletion of the entire TraVCT domain (Δ146–204). All of these mutations phenocopied the ΔtraK mutations in suppressing plasmid transfer by >5–6 orders of magnitude (Supplementary Fig. S7b). These mutations also disrupted F pilus production, as evidenced by an absence of detectable TraA pilin in the extracellular fraction, resistance of the host strain to infection by bacteriophage M13, or lack of F-pilus-mediated aggregation (Supplementary Fig. S7a). Together, these findings support a conclusion that the ORC is essential for production of F pili, but functions mainly to enhance stability or activity of the translocation channel. Remarkably, the CC is capable of mediating DNA transfer across the OM in the complete absence of the ORC.

Next, we deleted the TraBR176-S186 linker residues shown to interact with TraKNT domains comprising the lower ring of the ORC. A Δ179–185 mutation reduced DNA transfer by ~5 orders of magnitude, and host cells were phage resistant and nonaggregative, indicative of a disruption in F pilus production or stability (Supplementary Fig. S7c, d). By contrast, deletion of residues 186–193, which connect the TraBβ-barrel and TraBR176-S186 motifs, had no discernible effects on DNA transfer efficiencies or F pilus production. These finding confirm the functional importance of the TraBR176-S186–TraKNT interaction in accommodating the ORC/CC symmetry mismatch. Furthermore, the flexible linker (residues 187–193) that was not detected in the CryoEM map, but connects the two TraB domains involved in mismatch accommodation, does not contribute to OMCCF function.

Structure of the central cone

The CC was resolved at 2.95 Å resolution after imposing a 17-fold symmetry on particles purified from pED208-carrying cells (Supplementary Fig. S3b, c). The CC has an outer diameter of ~168 Å and an inner chamber of ~125 Å at its widest point (Fig. 1d). It is composed primarily of 17 β-barrel domains of TraB, a structural motif that is highly conserved among all VirB10 homologs or orthologs (Fig. 3b). An α-helical subdomain termed the antennae projection (AP), which also is highly conserved, extends from the top of each β-barrel. The 17 APs taper in to form a 30 Å pore that is presumed to span the OM. The terminal flexible loops connecting the AP helices could not be completely modeled. At the bottom of the β-barrel, a C-terminal α-helix (residues L398-Q413) projects inward and forms a slight constriction at the base of the central cone.

Fig. 3: Structure of central cone (CC).
figure 3

a Overview of the 2.95 Å CC from the pED208-carrying (WT) strain reconstructed with 17-fold symmetry imposed. The CC is composed of domains of TraB (blue) and TraV (pink). b The atomic model of the β-barrel and AP domains of TraB (residues K194-Q413) in rainbow color, with the N-terminal domain of TraV (TraVNT, residues C18-K55) (pink). An AP flexible loop (residues T332-I346) cannot be built in the model. c, d Interaction between the TraVNT domain and three adjacent TraBβ-barrel domains visualized in densities and models, respectively. The red box in (c) shows a piece of lipid density connected to TraV C18.

N-terminal domains of TraV (residues C18-A53) also comprise part of the CC by binding laterally along the surfaces of three TraBβ-barrel domains (Fig. 3). In the assembled cone, adjacent TraVNT domains bind overlapping sets of three TraB β-barrels, forming a lattice predicted to stabilize the CC. Residues C18-N42 bind the TraB β-barrels, and adjacent residues Q43-K55 form an α-helix that protrudes away from the TraBβ-barrel surface toward the ORC. The N-terminal Cys18 residue is predicted to be lipid-modified, as shown for other VirB7 homologs20,21. Correspondingly, we detected an extra density associated with Cys18 residues that probably corresponds to lipid (Fig. 3c, d). This lipid modification is optimally positioned to promote insertion and stabilization of the AP across the OM.

For OMCCF particles purified from the TraV/K/B-producing strain, the CC substructures associated with ORC classes displaying 11-, 12-, and 14-fold symmetries could not be resolved. However, CCs associated with the C13-symmetrized particles were well-resolved and exhibited either 16- or 17-fold symmetries (Supplementary Fig. S4b). Interestingly, the tertiary structures of TraVNT and TraBβ-barrel domains comprising the C16- and C17-symmetrical CCs align well with each other, as well as with the equivalent structures comprising the C17-symmetrical CC from the pED208-carrying strain (Supplementary Fig. S6b). These findings reinforce the notion that the CC is conformationally flexible as a result of sparse interfacial contacts connecting the TraBβ-barrel domains, even to the extent that CCs with different numbers of TraBβ-barrel domains can assemble in vivo.

We tested the functional importance of the TraVNT–TraBβ-barrel interaction by generating a series of deletions of the TraVNT domain without disrupting the N-terminal Cys18 residue for OM anchoring (Supplementary Fig. S7e, f). Deletion of the entire TraVNT domain (Δ20–55), as well as smaller N-terminal (Δ20–50, Δ20–45, Δ20–40) or C-terminal (Δ35–55, Δ40–55) deletions strongly impaired T4SSF function. However, removal of the peripherally-docked α-helix (Δ48–55) had no discernible effect. Several contacts distributed along the length of the TraVNT domain exclusive of the α-helix thus appear to be necessary for binding the CC. We had envisioned that the large linker (residues 55–150) connecting the TraVCT and TraVNT domains would impart conformational flexibility between the ORC and CC of functional importance. Remarkably, however, a Δ54–155 mutation had no effect on either DNA transfer or F pilus production (Supplementary Fig. S7e, f).

TraB and TraV accommodate the ORC/CC symmetry mismatch

Reconstruction of the asymmetrical structural unit of the OMCCF supplied further insights into how TraB and TraV accommodate the C13:C17 mismatch between the ORC and CC substructures. In the case of TraB, the β-barrel domain is the major element of the CC and the R176-S186 linker segment binds TraKNT domains constituting the lower ring of the ORC (Fig. 4a, b). We identified several important features of the TraBR176-S186–TraKNT interaction (Fig. 4b and Supplementary S8). By examination of the C13 structure, all sites on the 26 TraKNT domains appear to be occupied by TraBR176-S186. However, the C13 structure represents an averaged structure derived from analyzing thousands of OMCCF particles, suggesting that the 17 TraBR176-S186 segments linked to 17 TraBβ-barrel components of the CC dock randomly or dynamically with the ORC. In fact, on closer examination of the original densities in the C13 map of TraBR176-S186 segments bound to each of the two TraKNT domains in a structural unit, we found that the TraBR176-S186 densities invariably are stronger when associated with TraKNT S2 than with TraKNT S1 (Supplementary Fig. S8a). This suggests that TraBR176-S186 segments bind one of the TraKNT monomers preferentially or with higher affinity over the second in the assembled ORC. Indeed, the periodicity of TraBR176-S186 binding was clearly evident when we superimposed the asymmetrically reconstructed C1 map (Fig. 1a) onto the C13-symmetrized map (Fig. 3a). Only 17 of the 26 sites available on TraKNT domains are occupied by strong TraBR176-S186 densities (Fig. 4b, gray densities, blue linkers). Of the remaining 9 available sites, 7 are unoccupied (right image, magenta linkers) and 2 have very weak densities (magenta linkers denoted with asterisks). This generates an alternating pattern of strong and weaker or nonexistent TraBR176-S186 densities among the 26 TraKNT domains (Fig. 4b right, e.g., sites 1, 3, 5, 7, 9, 11, 13,…). In the assembled OMCCF, therefore, 13 of the 17 TraBR176-S186 segments stably occupy sites on the 13 TraK-S2 subunits, while the remaining 4 copies of the TraB segments bind subsets of the sites available on the 13 TraK-S1 subunits.

Fig. 4: TraV and TraB accommodate the C13:C17 symmetry mismatch.
figure 4

a An asymmetric unit of the OMCCF. Thirteen TraK/TraVCT units comprise the C13 ORC, while the 17 TraBβ-barrel/TraVNT units assemble as the C17 CC. TraV and TraB bridge the ORC and CC substructures through flexible loops (red dash lines). Subunit domains and colors are shown as described in Fig. 2 legend. Residues shown denote boundaries of the different domains. A short segment (residues R90-G101) in the TraV S2 flexible linker forms a traceable contact with the TraKCT S2 domain. Two TraBR176-S186 linker segments are shown to denote detectable contacts with both S1 and S2 domains of the TraKNT dimer. b Left: Segment of TraB involved in the intraprotein symmetry alteration. Residues K194-G203 are part of the TraBβ-barrel domain, 17 copies of which form the CC. Residues S186-K194 bridge the symmetry mismatch, but can be deleted without affecting T4SSF function. Residues R176-S186 bind a subset of the 26 copies of TraKNT domains comprising the ORC. Middle and Right: Schematic views depicting the TraB symmetry alteration from side and top, respectively. Color-coding is as shown at left. Densities (gray) are extracted from the C1 reconstruction, which shows 17 strong densities (blue sites), 7 empty densities (magenta sites), and 2 weak densities (* at sites #12 and 16, magenta). Numbers refer to the 26 TraKNT domains and 17 connections to the TraBβ-barrel domains. c Left: TraV monomer (Rainbow color) displays an intraprotein symmetry alteration. A subset of the 26 TraVNT (residues C18-K55) bind 17 available sites in the CC. The flexible loop (residues K55-V150) bridges the symmetry mismatch, but can be deleted without affecting T4SSF function. Twenty-six copies of the TraVCT domain (residues V150-N204) comprise the ORC belt. Residues R90-G101 in the flexible loop associated with TraV S2 bind the 13 copies of TraKCT-S2 domains. Right: Schematic view depicting the TraV intraprotein symmetry alteration viewed from top. Densities associated with the traceable R90-G101 segment from TraV S2 are shown. The 4 magenta-colored TraVNT domains (randomly selected) have no traceable paths to TraVCT domains. The 13 TraVNT S2 domains (blue) are connected via the flexible loop to the TraVCT-S2 domains (rainbow). TraBβ-barrels comprise the CC (gray, inner ring). TraVNT-S1 domains connected to the TraVCT-S1 domains (gray, outer ring) have not been identified.

In the asymmetrical structural unit, two TraVCT domains embed into the TraKCT dimer, and long flexible linkers (residues K55-V150) connect to TraVNT domains bound along the surfaces of TraBβ-barrel domains in the CC (Fig. 4a). In the assembled OMCCF, 26 copies of TraVCT domains comprise the ORC belt, but at most 17 associated NT domains occupy sites on the TraBβ-barrel domains. We further examined how TraV accommodates the ORC/CC symmetry mismatch by attempting to trace the flexible linkers joining the N- and C-terminal domains. Although the majority of this linker is not traceable, residues R90-G101 were found to associate with the TraKCT-S2 domain (Fig. 4a, c). These densities are oriented so that residue R90 is proximal to residue K55, which comprises the C-terminus of the TraVNT domain associated with the TraBβ-barrel domains. We visualized only 13 TraVR90-G101 densities, and each of these extends from the 13 TraVCT-S2 domains comprising the upper strand of the ORC belt. These findings support a proposal that the 13 TraVNT-S2 domains connected to the traceable R90-G101 densities occupy 13 of the 17 available sites in the CC.

To reconcile this model with the observation that all 17 sites appear to be occupied in the C17-symmetrized map (Fig. 1), we postulated that the 13 TraVNT-S2 domains bind dynamically to 13 of the available 17 sites on the CC. Indeed, when we compared density intensities of the TraVNT domains to those of TraBβ-barrel domains in the C17 map, we found that the TraV densities are weaker than the TraB densities, suggestive of subsaturated occupancy of available sites. There also are no variations in the TraVNT density intensities among the 17 sites on the CC in the C1 reconstruction (Fig. 1a), further indicating that, unlike the TraBR176-S186 segments, the TraVNT-S2 domains bind with comparable affinities to all available sites on TraBβ-barrel domains (Fig. 4c right, blue NT domains). We were unable to detect the 13 TraVNT domains and linkers associated with TraVCT-S1 domains comprising the lower strand of the belt, and therefore it remains possible that 4 TraVNT-S1 domains occupy the remaining sites on the CC. The TraVNT-S1 domains additionally or alternatively might assemble at the periphery of the ORC to anchor this substructure to the OM through N-terminal lipidation or perhaps recruit other Tra proteins of functional importance to the OMCCF. In line with this latter possibility, when the cryoEM structure of the OMCCF was superimposed onto the structure of the T4SSF generated previously by in situ cryoET19, the periphery of the ORC closely associates with the OM, possibly through connections mediated by the acylated N-termini of the TraVNT-S1 domains (Supplementary Fig. S9a). These peripheral connections, and those contributed by the lipid-modified TraVNT-S2 domains associated with the CC, anchor the OMCCF at the periplasmic face of the OM. This optimally positions the AP pore, which is the dominant hydrophobic motif of the OMCCF, for insertion into the OM (Supplementary Fig. S9a).

Discussion

The F plasmid-encoded T4SS is a dynamic bacterial nanomachine that transitions between a quiescent complex and activated states mediating F pilus assembly or plasmid transmission2,19,22. The near-atomic resolution structures presented here advance an understanding of how the OMCCF orchestrates the structural transitions associated with these different functional states. Remarkably, the OMMCF is composed of ORC and CC substructures exhibiting C13 and C17 symmetries, respectively, and TraV and TraB accommodate the symmetry mismatch by binding both elements through a mechanism designated as an intraprotein symmetry alteration. The TraVNT and TraBβ-barrel domains stably comprise major parts of the ORC and CC substructures, but our data suggest that the TraVCT and TraBR176-S186 domains bind variably or dynamically to the mismatched substructures. These results support the following dynamic model for the action of the OMCCF. The T4SSF is initially assembled in a quiescent state, but is activated by unknown signals to elaborate F pili or by signals such as plasmid docking at the channel entrance and target cell binding for substrate transfer23,24. Accompanying machine activation, TraVNT and TraB R176-S186 domains are triggered to dynamically bind subsets of available target sites in the CC and ORC. Such reiterative binding to adjacent sites might promote lateral or ratcheting movement of the CC relative to the ORC to power cycles of TraA pilin insertion and deinsertion during F pilus extension and retraction. Alternatively, dynamic binding might induce vertical insertion and deinsertion of the CC into and from the OM for reiterative rounds of substrate passage. Further analyses of T4SSF’s in different states are clearly needed to test these ideas and define the nature of structural transitions associated with machine activation. By solving the OMCCF structure likely in its quiescent state, and by defining the importance of key structural elements for the myriad of T4SSF functions, our present findings have laid the groundwork for such future investigations.

The OMCCF particles purified from the pED208-carrying strain displayed a uniform C13:C17 architecture and most OMCCF particles recovered from the TraV/K/B-producing strain also displayed this symmetry mismatch. From the TraV/K/B-producing strain, however, OMCCF complexes with different ORC/CC symmetries were also abundantly represented. When produced in the absence of other pED208-encoded factors, TraV, TraK, and TraB can thus pack in variations of the 26:26:17 stoichiometries observed for WT OMCCF complexes, supporting our proposal that other pED208-encoded factors stabilize the C13:C17 architecture of the WT complexes. Other VirB-like subunits, such as those comprising the IMC or central channel, might act as stabilizing elements. It is also noteworthy that F plasmids encode a variable number of T4SSF-specific subunits of which TraF, TraW, TraH, TraU, TrbB, and TrbC are universally conserved. Prior studies have shown that these factors are required for WT function, localize in the periplasm, and form various interaction networks with each other and TraV, TraK or TraB, together supporting proposals that an F-specific complex(es) binds and regulates the activity of the OMCCF22,25,26,27. While we did not detect F-specific subunits in the solved OMCCF structure, a recent in situ cryoET structural analyses of T4SSFs assembled within the bacterial envelope revealed the presence of densities possibly contributed by F-specific factors at the junction of the OMCCF with the OM (see below). Such factors might associate transiently with the OMCCF to activate the machine in response to signal sensing.

Our mutational analyses confirmed that prominent structural features, including the TraV NT and CT domains and the TraB β-barrel and R176-S186 domains, are essential for plasmid transfer and F pilus production. Our further finding that the flexible intervening linkers TraV54–155 and TraB187–193 connecting the ORC to the CC are completely functionally dispensable establishes that the two domains of both proteins whose densities clearly contribute to both of the symmetry mismatched substructures can be directly connected. This argues against the possible involvement of truncated proteins or distinct subunit copy numbers in accommodating the symmetry mismatch. Additionally, the flexible linkers do not contribute to the conformational flexibility in the OMCCF of proposed importance for machine activation. Our dynamic occupancy model postulates instead that the requisite structural transitions are induced by reiterative binding of the TraVNT and TraBR176-S186 domains to sites on the symmetry mismatched substructures. The functional essentiality of the TraVNT and TraBR176-S186 domains and observations that densities of these domains are weaker than those of their TraBβ-barrel and TraKNT binding targets in the symmetrized maps together support this model (Fig. 4 and Supplementary Fig. S8).

The notion that the T4SSF is activated by movement of the CC relative to the ORC needs to be reconciled, however, with the striking discovery that the CC can support a low level of conjugative DNA transfer in mutants lacking TraK or the TraVCT domain and therefore the entire ORC (Supplementary Fig. S7). The CC might retain a residual capacity to undergo the necessary conformational transitions, e.g., insert into and from the OM, even in the absence of the ORC to mediate some intercellular substrate transfer. Interestingly, the ORC-minus mutant machines fail to elaborate F pili, which raises the possibility that the proposed movements of the CC relative to the ORC, driven by dynamic binding of TraV and TraB to mismatched substructures, exclusively drive F pilus biogenesis and not substrate transfer. ΔtraK and other ORC mutations can be grouped among a class of previously-described uncoupling mutations that selectively disrupt pilus production without blocking assembly of functional translocation channels28,29,30. Such Tra+, Pil uncoupling mutations have been isolated in various T4SS subunits, including the TraK-like VirB9 component of the A. tumefaciens VirB/VirD4 T4SS29. These genetic findings establish a precedent for the selective contributions of VirB9 subunits, which invariably are configured as peripheral scaffolds of OMCCs2, for pilus production.

The WT OMCCF structure provides unprecedented detail of a T4SSF subassembly in a closed or quiescent state, as deduced by the absence of an associated F pilus or target cell contact. Deciphering the nature of structural changes accompanying machine activation remains a formidable challenge using single-particle CryoEM approaches. However, the recent visualization of T4SSF nanomachines in their native cellular context by in situ CryoET has enabled comparisons of quiescent and pilus-bound structures19. These studies have generated important insights into the nature of conformational changes, particularly at the OMCCF–OM junction, associated with pilus biogenesis. In the quiescent channel, the OMCCF–OM junction is characterized by invagination of the OM and the absence of discernible OM-spanning densities, consistent with a closed state. By contrast, in the F pilus-bound structure, distinct disc and plug densities are evident at the distal end of the OMCCF and within the OM. Such densities might be contributed the F-specific Tra components shown to be required for machine activation. Additionally, in the quiescent channel, a ~6 nm central cylinder extends from the IM to the base of the OMCCF, where it then widens to ~9 nm. In the pilus-generating machine, this cylinder does not widen at the base of the OMCCF, but instead projects up through the entire OMCCF to the OM, where it constricts to ~4 nm. The cylinder, whose composition is presently unknown19, is therefore a dynamic structure that imparts conformational changes in the OMCCF required for F pilus production. These findings strongly indicate that the OMCCF is subject to active remodeling when the T4SSF initiates pilus production.

At this time, OMCCs from five functionally-distinct T4SSs have been solved at or near atomic resolution, allowing for detailed comparisons (Supplementary Fig. S9b)11,12,13,14,15,16. Although the OMCCF resembles the equivalent complexes from the pKM101 Tra and X. citri VirB/VirD4 minimized systems in its three-subunit composition, its symmetry mismatch and requirements for other F-specific Tra subunits for machine function support classification of the T4SSF as a member of the expanded systems. The OMCCF is also distinct from those of the H. pylori Cag and L. pneumophila Dot/Icm T4SSs, however, in its smaller size, reduced compositional complexity, and types of symmetry mismatches (Supplementary Fig. S9b). With respect to the latter property, we identified two symmetry mismatches in the OMCCF. The first is the C13:C26 mismatch between the upper-lobed and lower ring of the ORC, which is readily accounted for by the unique architecture of the TraK dimer. In the Cag and Dot/Icm systems, C- and N-terminal domains of the VirB9 homologs (CagX and DotH) also comprise parts of upper and lower substructures, respectively termed the outer-membrane cap (OMC) and periplasmic ring (PR). However, the OMC and PR substructures are compositionally more complex than the ORC, and they also display symmetry mismatches that are distinct from the C13:C26 symmetries exhibited by the upper and lower ORC rings (Supplementary Fig. S9b). A recent study also has shown that VirB9-like DotH accommodates the OMC:PR symmetry mismatch in the Dot/Icm system not through unique configurations of DotH dimers or by intraprotein symmetry alterations, but rather through interactions of distinct numbers of DotH subunits with the two mismatched substructures16.

The second symmetry mismatch in the OMCCF is between the C13 ORC and C17 CC. The corresponding region of the Cag system is uniformly 14-fold symmetric and composed of 14 copies each of VirB9-like CagXNT and VirB10-like CagYβ-barrel domains, plus a number of other system-specific subunits (Supplementary Fig. S9b)13,15. In the Dot/Icm system, the outer portion of the OMC and a central region designated as the central dome are composed of VirB9-like DotHNT and VirB10-like DotGβ-barrel domains, respectively, but the OMC/dome exhibits a C13:C16 symmetry mismatch14,16. As shown here for TraV, the DotC lipoprotein also bridges both complexes, suggesting that a TraV-like intraprotein symmetry alteration might accommodate this mismatch. Finally, it is interesting to note that the TraBR176-S186–TraKNT interaction detected in the OMCCF complex is also conserved in the Cag and Dot/Icm systems (Supplementary Fig. S8b). In both systems, N-terminal domains of the VirB9-like subunits comprising the PR substructures assemble as dimers, and linker segments of the VirB10-like subunits extending from the β-barrel domains form specific contacts with these VirB9NT dimers. It is enticing to suggest that the dynamic occupancy model proposed here for the TraBR176-S186 linker segment also serves to accommodate the OMC/dome:PR asymmetries in the Cag and Dot/Icm OMCCs.

In summary, the OMCCF appears to represent a hybrid of the equivalent substructures associated with minimized and expanded T4SSs. It is also structurally distinct from other OM-bound substructures of other bacterial nanomachines dedicated to export of macromolecules. Recent studies have identified symmetry mismatches among a growing list of such nanomachines, including members of the type II (T2SS), type III (T3SS), and type VI (T6SS) secretion superfamily, the flagellar motor, and F1Fo-ATP synthase31,32,33,34,35,36. Although symmetry mismatches are thought to impart conformational flexibility, the mechanisms by which mismatches are accommodated and their biological importance have not been fully defined. The mechanisms by which TraV and TraB bridge the ORC and CC substructures thus potentially informs an understanding of how symmetry mismatches are accommodated in other bacterial nanomachines. The OMCCF is also considerably compositionally and structurally simpler than these other bacterial nanomachines, making it an ideal complex for future investigations exploring the structural bases and biological consequences of intrinsic symmetry mismatch. From a broader medical perspective, structural definition of the OMCCF and of other OMCCs associated with T4SSs paves the way for design of small molecule inhibitors of these nanomachines of potential value for therapeutic intervention of multi-drug resistance proliferation and virulence.

Materials and methods

Bacterial strains and growth conditions

Bacterial strains used in this study are listed in Supplementary Table S1. E. coli strains were grown at 37 °C with shaking in Lysogeny Broth (LB) unless otherwise indicated. Plasmids were maintained by selection with the following antibiotics with concentrations in parentheses: carbenicillin (100 μg ml−1), spectinomycin (100 μg ml−1), chloramphenicol (20 μg ml−1), tetracycline (20 μg ml−1), kanamycin (50 μg ml−1).

Construction of pED208-traB Strep4 and deletions of traV, traK, and traB

The traB gene along with flanking ~600 bp upstream and downstream sequences was cloned in pBAD24. This construct was further used to incorporate a double twin-Strep tag (Strep4) epitope at the 3′ end of traB by inverse PCR using the oligonucleotides listed in Supplementary Table S1. Next, traBStrep4 was amplified and incorporated in pED208ΔtraB::FRT-Kanr-FRT by λ red-gam induced homologous recombination to excise the FRT-Kanr-FRT cassette and yield pED208-traBStrep437,38. Incorporation of the 4xStrep tag at the end of traB on pED208 was confirmed by sequencing. pED208-traBStrep4 produced stable TraBStrep4 and was phenotypically identical to wild-type pED208 with respect to its roles in DNA transfer and F pilus production (Supplementary Fig. S1). E. coli strain HME45(pED208) was used for generating ΔtraB, ΔtraK and ΔtraV mutants37,38,39. A FRT-Kanr-FRT cassette from pKD1340 was amplified with primers listed in Supplementary Table S1 to carry flanking ~50 base-pairs (bps) sequences, homologous to the upstream and downstream regions of the gene targeted for deletion, at the 5′ and 3′ end respectively. The λ red-gam system in HME45 was induced by growth at 42 °C. Next, the FRT-Kanr-FRT amplicon was introduced by electroporation, and transformants were selected by plating on LB agar plates supplemented with kanamycin (50 μg/ml). Complementing plasmids were then introduced into respective HME45 strains carrying pED208ΔtraB (or ΔtraK or ΔtraV)::FRT-Kanr-FRT to transfer the mutant pED208 plasmids into MC4100 cells carrying pCP2040 expressing the Flp recombinase. Transconjugants were streaked on LB agar and grown at 42 °C overnight to induce recombinase expression for excision of the FRT-Kanr-FRT cassette. Growth at 42 °C also cured pCP20. Individual colonies were screened for Chls and Kans to identify strains with pED208ΔtraB, ΔtraK or ΔtraV plasmids and lacking pCP20. The tra mutations were further confirmed by PCR amplification across the deletion junction and sequencing of the PCR fragment.

Other plasmids

Plasmid pKBV expressing traK, traB, and traV was constructed by PCR amplification of the corresponding gene cluster from pED208-traBStrep4 and insertion into pET-15(b) behind the IPTG-inducible T7 promoter. Plasmids expressing traV, traK, and traB were generated by PCR amplification of the respective genes and insertion of the amplified gene into pBAD24 downstream of the arabinose-inducible pBAD promoter. These plasmids were used as templates to construct mutant alleles by inverse PCR using oligonucleotides listed in Supplementary Table S1. Mutant alleles were verified by sequencing across the entire genes.

Purification of the OMCCF

E. coli MC4100(pED208-traBStrep4) was grown overnight at 37 °C in LB broth supplemented with appropriate antibiotics. The culture was diluted 1:200 in fresh LB broth without antibiotics and grown at 37 °C with shaking to an OD600 of 1.0. E. coli BL21(DE3, pKBV) was grown overnight at 37 °C in LB broth supplemented with appropriate antibiotics and then sub-cultured (1:200) in fresh LB broth without antibiotics (1:200) at 37 °C until an OD600 of 0.6 was reached. IPTG at a final concentration of 0.1 mM was added and the cells grown at 37°C with shaking for 3 hr. MC4100(pED208-traBStrep4) and BL21(DE3, pKBV) cells were harvested and resuspended in ice cold buffer A [50 mM Tris-HCl (pH=8), DNase I (1 μg ml−1), lysozyme (200 µg ml−1) and EDTA-free protease inhibitor (Thermo Fisher Scientific)]. Cells were lysed on ice using Emulsiflex, then 1 mM each (final concentrations) of EDTA and DTT were added. The cell lysate was centrifuged at 25,000 × g for 45 min to remove cell debris, and the supernatant was centrifuged at 95,000 × g for 1 h to pellet the membrane fraction. This fraction was solubilized in ice cold buffer B [50 mM Tris-HCl (pH = 8), 200 mM NaCl, 1 mM DTT, 1 mM EDTA, 0.5% DDM, 0.75% DM-NPG, 0.1 % digitonin, EDTA-free protease inhibitor] for 2 h at 4 °C. The suspension was clarified by centrifugation at 25,000 × g for 30 min. The supernatant was loaded onto a 1 ml StrepTrap HP (GE Healthcare) column equilibrated with buffer C [50 mM Tris-HCl (pH = 8), 200 mM NaCl, 1 mM DTT, 1 mM EDTA, 0.06% DM-NPG, 0.1 % digitonin]. Following the wash with equilibration buffer, the purified F-T4SS core complex was eluted in the equivalent wash buffer supplemented with 10 mM D-desthiobiotin and EDTA-free protease inhibitor. The peak fractions were further pooled and loaded onto a Superdex 200 10/300 column (GE Healthcare) equilibrated in elution buffer described above, but without any D-desthiobiotin. Peak fractions containing OMCCF particles were collected and used for further analysis.

LC/MS/MS analysis

Purified complexes were denatured by boiling in Laemmli’s buffer, electrophoresed a distance of 1 cm into an SDS-(12%) polyacrylamide gel, and stained with Coomassie blue. After thorough destaining, the stained area of each lane was excised, dehydrated in acetonitrile, washed extensively and subjected to thiol reduction by Tris (2-carboxyethyl) phosphine hydrochloride (TCEP) and alkylation with iodoacetamide. Samples were digested overnight with trypsin endoproteinase. An aliquot of the tryptic digest (in 2 % acetonitrile/0.1% formic acid in water) was analyzed by LC/MS/MS on an Orbitrap FusionTM TribridTM mass spectrometer (Thermo ScientificTM) interfaced with a Dionex UltiMate 3000 Binary RSLCnano System. The raw data files were processed using Thermo ScientificTM Proteome DiscovererTM software version 1.4, spectra were searched against the NCBI bacterial database using the Mascot search engine v2.3.02(Matrix Science).

Conjugation assay

Conjugation assays were carried out by growth of overnight cultures of donor and recipient strains, dilution (1:50) in fresh antibiotic-free LB broth, and further incubation for 3 h at 37 °C with shaking. Arabinose (0.2% final concentration) was added to induce pBAD-expressed tra alleles as needed, and cells were grown at 37 °C for another 1.5 h. Equivalent numbers of donor and recipient cells were mixed and the mating mix was incubated at 37 °C for 1.5 h without shaking. Mating mixtures were then serially diluted and plated onto LB agar containing antibiotics selective for transconjugants (Tc’s) and donors (D). The frequency of plasmid transfer is reported as Tc’s/D. Donors that failed to transfer the plasmid in broth matings were further subjected to 5 h filter matings. Donor and recipient cultures were mixed in a 1:1 ratio on nitrocellulose filter discs, which were then incubated on an LB agar plate for 5 h at 37 °C. The discs were suspended in LB broth, and the liquid cultures were serially diluted and plated on LB agar containing antibiotics selective for Tc’s and donors. All matings were performed at least three times in triplicate, and results are reported for a single representative experiment with Tc/D frequencies for each replicate represented as single dots, mean frequencies by bar heights, and standard deviations by error bars.

Phage sensitivity assay

Overnight cultures (50 µl) were spread on LB agar plates containing appropriate antibiotics and arabinose (0.2% final concentration) as necessary for induction of pBAD-expressed tra alleles. Cultures were dried, 2 μl of bacteriophage M13 (1011 pfu/ml) was spotted, and plates were incubated overnight at 37 °C. Plates were examined for M13-induced formation of turbid plaques.

Detection of extracellular TraA pilin

F pili released in the extracellular milieu were recovered and detected by immunostaining as follows. Overnight cultures were diluted (1:200) in fresh LB without antibiotics and when cultures reached an OD600 = 0.6, arabinose (0.2%, final concentration) was added as necessary for induction of pBAD-expressed tra alleles. Following growth for another 2 h, cells were vortexed vigorously and centrifuged at 11,200 × g for 3 min at room temperature. The supernatant was centrifuged twice more to eliminate bacterial cells. The supernatant was then mixed 9:1 with 10X PEG buffer (2% PEG8000, 0.55 M NaCl final concentrations), and the mixture was incubated for 30 min at room temperature and then centrifuged at 18,200 × g for 30 min. The supernatant was removed carefully and the pellet was resuspended in PBS. Samples were subjected to SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and immunostaining of proteins transferred to nitrocellulose membranes using primary antibodies against pED208-encoded TraA pilin (kind gift from Laura Frost) or the β subunit of E. coli RNA polymerase (RNA pol) as a control for cell lysis. Anti-TraA antibodies are available upon request; anti-RNA pol antibodies are commercially available from Biolegends.

CryoEM sample preparation and data collection

Aliquots (4.0 μl) of purified protein complexes were deposited on glow-discharged ultrathin carbon film on Lacey 300 mesh Au grids. The deposited samples were allowed to stay on grids for 30 sec in the FEI Vitrobot chamber maintained at 23 °C with 100% humidity. Grids were blotted for 4 sec using a blot force of −11 and rapidly plunged in liquid ethane for vitrification. The data were collected using an FEI Titan Krios transmission electron microscope, operated at 300 kV and equipped with Gatan GIF Quantum energy filter. The micrographs were acquired with EPU software (FEI), recorded on a K2 Summit direct electron detector (Gatan) operated in counting mode with a physical pixel size of 1.07 Å per pixel. The detector was placed at the end of a GIF Quantum energy filter operated in zero-energy-loss mode with a slit width of 20 eV. The total exposure time was 7 sec and intermediate frames were recorded every 0.2 sec giving an accumulated dose of ~40 eÅ2 and a total of 35 frames per image. The defocus range used was from −0.8 μm to −2.8 μm.

CryoEM data processing

Cryosparc v.3.1.0 was used to process the data, including motion correction, CTF correction, particle picking, 2D classification, initial model building, 3D classification, and final refinements41. In general, we first used a small set of micrographs to pick particles and build a reliable low resolution map, which was used to create 2D projection images as templates for particle picking of the whole dataset (Fig. S2, flowchart 1). After having the templates, we followed the procedure illustrated in Fig. S2, flowchart 2 to process the dataset to final structures.

Generate 2D template for particle picking

Initially 7,000 particles were picked from 200 micrographs using cryoSparc’s blob particle picking procedure with setting the blob diameter to 500 Å41. We performed 2D classification for the 7,000 particles, A subset of particles were selected and used as templates to re-pick particles from the 200 micrographs. After another 2D classification, 1,800 particles were selected for ab initio reconstruction. Three ab-initio C1 symmetry models were built, two models show 13-fold symmetry (1,600 particles). With these particles, we built a 13-fold initial model, and then refined it to an 8 Å map. Based on this map, 25 uniformly distributed 2D projection images were created as templates for subsequently particle picking.

CryoEM 3D reconstructions

For the OMCCF from the pED208-carrying strain, a total of 323,000 particles were extracted from 8,500 micrographs through cryoSparc’s template particle picking procedure. We performed 2D classifications and 150,000 particles contributing to the best 2D classes were selected. C1 symmetry ab initio 3D model building was performed to generate 5 classes. After removing classes that showed broken outer rings, 124,000 particles with complete (non-broken) outer ring density were selected. C13 symmetry was then imposed for refinement. Duplicating this better resolved ORC map as 5 initial models, we performed C1 heterogeneous refinement for the 124k data. The best classes of 70,700 particles that exhibited uniform density for both the ORC and CC were selected. With these particles, we further performed C1, C13, C17 refinements. The corresponding maps were resolved at resolutions of 4.28 Å, 3.31 Å, 2.95 Å, respectively. By applying all possible sorting strategies, we determined that the OMCCF purified from the pED208-carrying strain contains only a single symmetry configuration: 13-fold for the ORC, 17-fold for CC.

For the OMCC from TraV/K/B-producing strain, a total of 321,000 particles were extracted from 5,500 micrographs using the 2D template images created from analyses of OMCC particles from the pED208-carrying strain. An initial 2D classification showed 14-fold and 13-fold symmetries in the top views of the 2D classes. 240,000 particles contributing to the best 2D classes were selected and used for ab initio 3D model building with 5 classes. Four types of symmetries were identified, ranging from 14-to 11-fold. Then symmetrized model refinements were performed for each subdataset. Using these better resolved ORC structures with different symmetry numbers of 14, 13, 12, or 11 as initial models, multiple rounds of heterogeneous refinement were performed to further sort out datasets into different symmetries sub-datasets. The sub-datasets with 14-fold, 13-fold, 12-fold, 11-fold symmetries contain 34k, 148k, 38k, 20k particles, respectively. After symmetrized refinements for each symmetry, resolutions of 5.28 Å, 2.97 Å, 4.46 Å, 6.9 Å were achieved for the 14-, 13-, 12-, and 11-fold complexes, respectively. We performed C1 refinement for the C13 ORC particles. To further resolve the CC of the C13 ORC structure, ab initio C1 refinement was performed with 5 classes. One subdataset (65k particles) with high quality ORC and CC structures was subjected to heterogenous refinements with 3 classes using the corresponding ab initio C1 model. The CCs of one class exhibited 17-fold symmetry (22k particles), of a second class exhibited 16-fold symmetry (26k particles), and of a third exhibited an ambiguous symmetry (17k particles). The first two classes were further refined with 17- and 16-fold symmetry imposed and the maps were used as two models for heterogeneous refinement. The final C17 (26k particles) and C16 (22k particles) reconstructions have resolutions of 3.56 Å and 3.9 Å respectively.

Model building and refinement

Maps were initially sharpened using the Autosharpen option in Phenix v.18.242. The sequences of the structural proteins were first submitted to the SWISS-MODEL SERVER43 (https://swissmodel.expasy.org), from which initial models were produced for the purpose of initiating 3D coordinates of the amino acids of each molecule. These models were then fitted into the density maps using “Fit in Map” option in UCSF ChimeraX v.1.244. Manual refinement of the model to the density was then performed using Coot v.0.9.4.145. At this stage, fit to density was prioritized, matching secondary structure and large, bulky side chains in the density, providing an initial coarse grain model for subsequent optimization. After coarsely fitting the models in Coot, real-space refinement was performed with ‘phenix.real_space_refine’ and subsequently adjusted again manually in Coot. This process was repeated in order to maximize fit to density, minimize Ramachandran angle outliers and eliminate steric clashes. Once individual protein subunits were refined, an asymmetric unit was then created and optimized with the same iterative refinement procedure. A full complex with NCS symmetry was then created in Chimera, followed by a final round of Phenix real-space refinement. Final model statistics were calculated in Phenix using the “Comprehensive Validation” tool in Phenix and reported in Table 1. Map-model visualization was performed in Coot and UCSF Chimera.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.