Nucleoid-associated proteins shape the global protein occupancy and transcriptional landscape of a clinical isolate ofVibrio cholerae

preprint OA: closed CC-BY-NC-ND-4.0
📄 Open PDF Full text JSON View at publisher
Full text 115,433 characters · extracted from oa-pdf · 13 sections · click to expand

Abstract

Vibrio cholerae, the causative agent of the diarrheal disease cholera, poses an ongoing health threat due to its wide repertoire of horizontally acquired elements (HAEs) and virulence factors. New clinical isolates of the bacterium with improved fitness abilities, often associated with HAEs, frequently emerge. The appropriate control and expression of such genetic elements is critical for the bacteria to thrive in the different environmental niches it occupies. H-NS, the histone-like nucleoid structuring protein, is the best studied xenogeneic silencer of HAEs in gamma-proteobacteria. Although H-NS and other highly abundant nucleoid-associated proteins (NAPs) have been shown to play important roles in regulating HAEs and virulence in model bacteria, we still lack a comprehensive understanding of how different NAPs modulate transcription in V. cholerae. By obtaining genome-wide measurements of protein occupancy and active transcription in a clinical isolate of V. cholerae, harboring recently discovered HAEs encoding for phage defense systems, we show that a lack of H-NS causes a robust increase in the expression of genes found in many HAEs. We further found that TsrA, a protein with partial homology to H-NS, regulates virulence genes primarily through modulation of H-NS activity. We also identified a few sites that are affected by TsrA independently of H-NS, suggesting TsrA may act with diverse regulatory mechanisms. Our results demonstrate how the combinatorial activity of NAPs is employed by a clinical isolate of an important pathogen to regulate recently discovered HAEs. Importance: New strains of the bacterial pathogen Vibrio cholerae, bearing novel horizontally acquired elements (HAEs), frequently emerge. HAEs provide beneficial traits to the bacterium, such as antibiotic resistance and defense against invading .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint bacteriophages. Xenogeneic silencers are proteins that help bacteria harness new HAEs and silence those HAEs until they are needed. H-NS is the best-studied xenogeneic silencer; it is one of the nucleoid-associated proteins (NAPs) in gamma- proteobacteria and is responsible for the proper regulation of HAEs within the bacterial transcriptional network. We studied the effects of H-NS and other NAPs on the HAEs of a clinical isolate of V. cholerae. Importantly, we found that H-NS partners with a small and poorly characterized protein, TsrA, to help domesticate new HAEs involved in bacterial survival and in causing disease. Proper understanding of the regulatory state in emerging isolates of V. cholerae will provide improved therapies against new isolates of the pathogen.

Introduction

The disease cholera, caused by the bacterium Vibrio cholerae, occurs following the ingestion of contaminated food or water and the subsequent colonization of the small intestine by the bacterium, causing severe diarrhea induced by bacterially- produced toxins [1]. Worldwide, there are between 1.3 and 4 million annual cholera cases and an estimated 21,000 and 143,000 deaths [2,3]. V. cholerae is documented to rely on multiple horizontally acquired elements (HAEs) during its life cycle in both the aquatic reservoir and in human hosts [1]. HAEs provide an opportunity for bacteria to acquire beneficial traits, such as virulence factors, new metabolic properties, and antibiotic resistance via the integration of foreign DNA [4]. Both of the major V. cholerae virulence factors, cholera toxin and the toxin co-regulated pilus (TcpA), are encoded by genes from two distinct HAEs, the filamentous phage CTXΦ [5] and the Vibrio pathogenicity island-1 (VPI-1), respectively [6]. The presence of these two HAEs is a defining feature of all epidemic V. cholerae strains. More recently, it has been discovered that some V. cholerae isolates rely on an additional HAE, the phage- inducible chromosomal island-like element (PLE), to protect against infection by the highly prevalent lytic phage ICP1 [7–11]. PLE is a viral satellite, a mobile DNA element that is activated upon infection by a given phage, specifically by ICP1 for PLE, whereupon a suite of effectors deleterious to the infecting phage and a few to the host bacterium are expressed [12–14]. HAEs like PLE exemplify the trade-offs of acquiring foreign DNA for bacteria. While the contents of the PLEs provide protection against ICP1, they often do so through the expression of gene products that are toxic to the host cell (other phage-defense systems found on the HAEs have been recently .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint investigated in V. cholerae [13,15–18]). Thus, erroneous transcriptional activation of HAEs may be detrimental to the host cell for different reasons, such as a waste of resources, expression of disrupted regulatory networks, or cytotoxicity, and likely would preclude stable maintenance of such HAEs unless they could be silenced under most conditions [19]. Many bacteria encode xenogeneic silencers to repress newly acquired DNA, minimizing potential harm and aiding in the domestication of new genomic elements (xenogeneic silencers and their roles in bacterial evolution are reviewed in ref. [19]). In Gram-negative bacteria, one of the most widespread and best-studied xenogeneic silencers is H-NS [20]. H-NS is a highly abundant protein that preferentially binds and oligomerizes by forming either linear or bridged filaments on AT-rich regions (a common characteristic of HAEs [19–22]) and subsequently silences gene expression by interfering with RNA polymerase binding or promoting Rho-dependent termination via stalling or backtracking of RNA polymerase [21,23,24]. H-NS is an example of a highly abundant DNA binding protein with loose sequence specificity (a class often referred to as nucleoid-associated proteins, or NAPs, reviewed in ref. [21]) that can modulate gene expression. The model gamma-proteobacterium Escherichia coli has been reported to have about a dozen NAPs that change the structure of the chromatin and can act as either transcriptional activators or repressors, thus altering the entire transcriptional network in response to different conditions [21,25]. NAPs may have overlapping functions to differentially regulate diverse targets. For instance, in E. coli, H- NS is known to form homodimers as well as heterodimers with its paralog StpA to mediate full repression at certain loci [26,27]. H-NS can also act antagonistically with other NAPs; for example, in E. coli, several transcriptional units are repressed by H-NS but activated by the NAP integration host factor (IHF), a highly abundant NAP that both dramatically bends DNA and alters chromosomal supercoiling [21]. Counter-regulated H-NS/IHF targets include the adhesion-related operons csgDEFG and fimB (data from [28]). Additionally, Hha, another E. coli NAP, was shown to support H-NS/DNA bridged filaments despite Hha lacking a predicted DNA-binding domain [23]. While the role that NAPs play in the transcriptional regulatory landscape of E. coli is well established (although ongoing discoveries continue to be made on the regulatory mechanisms of NAPs in that system and other enterobacteria [23,24,27,29–32]), less is known about the mechanisms and effects of xenogeneic silencers and other NAPs in other .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint organisms, such as V. cholerae. The transcriptional regulation of many of the V. cholerae HAEs has been investigated, particularly those directly involved in virulence [33–35]. Previous studies of V. cholerae El Tor (responsible for the ongoing 7th pandemic [36,37]) have shown that H-NS affects the expression of about 18% of the genome in a growth-phase-dependent manner [38]. Many H-NS targets are involved in motility, chemotaxis, biofilm development, and virulence, including the cholera toxin-encoding genes and the toxin- coregulated pilus TcpA [35,38,39]. H-NS is not the only nucleoid-associated protein implicated in regulating HAEs in V. cholerae. For example, IHF in V. cholerae has been reported to regulate the expression of virulence genes on HAEs [33] and possibly motility [40] and has been shown to be important for the conjugative transfer [41] of SXT, an integrative and conjugative element present in many V. cholerae strains [42,43]. Recent transcriptomic studies have demonstrated that TsrA, a protein with weak amino acid homology to H-NS, is involved in transcriptional regulation [44,45]. However, based on previous computational modeling, TsrA does not have a predicted DNA binding domain [44]. While previous studies measured the separate effects of hns and tsrA deletions on gene expression [44,45] and identified a strong overlap in the regulons of H-NS and TsrA, the combinatorial effects of H-NS and TsrA on global protein occupancy and gene expression have not yet been investigated. Additionally, the V. cholerae strains used in previous studies of hns and tsrA deletion did not harbor the PLE and SXT elements commonly present in clinical isolates from the current pandemic. The transcriptional program of the phage satellite PLE has been previously interrogated by RNA sequencing during infection by ICP1 phage, revealing that PLE is transcriptionally activated upon ICP1 infection [46]. However, the factors responsible for silencing PLE in the absence of phage infection remain unidentified. The transcription of some V. cholerae SXT elements has been similarly interrogated, primarily through the lens of element transfer induced by antibiotics or phage infection, but not focusing on the regulatory factors responsible for silencing transcription of the SXT element [47,48]. Thus, we still lack a comprehensive understanding of the regulatory programs involving NAPs of the diverse V. cholerae virulence genes and HAEs shaping the bacterium’s behavior and evolution. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint While the pattern of protein occupancy on the genome (from both transcription factors – TFs – and NAPs) has a profound impact on bacterial gene regulation, the large number of distinct regulatory proteins makes it impractical to separately measure the contributions of all relevant factors across biological conditions using protein-specific

Methods

such as chromatin-immunoprecipitation followed by sequencing (ChIP-seq). The IPOD-HR (in vivo protein occupancy display at high resolution) method provides an alternative approach, allowing protein-agnostic measurement of the genome-wide accessibility of bacterial chromatin (Figure 1A) [27,49,50]. An additional parallel step during IPOD-HR involves ChIP-seq of RNA polymerase-bound DNA regions, which provides information on RNA polymerase (RNAP) occupancy, a proxy for active transcription [50], and allows for the distinction of transcribed vs. silenced genomic contexts. The IPOD-HR workflow has been previously applied to E. coli and Bacillus subtilis to investigate their transcriptional programs and the effects that certain regulatory factors (especially NAPs) have on genome-wide protein occupancy, chromatin accessibility, and transcription [27]. In order to comprehensively determine the roles of key NAPs in regulating the HAEs of V. cholerae, we employed IPOD-HR together with RNA sequencing on the V. cholerae clinical isolate KDS1, as well as Δhns, ΔtsrA, ΔihfA (IhfA subunit of IHF), and ΔtsrAΔhns deletion mutants derived from the same parental strain. Consistent with previous findings, we observed that the absence of H-NS altered protein occupancy and gene expression in many genomic regions important for host colonization and virulence, including multiple HAEs. We also show that the deletion of H-NS leads to the de- repression of some portions of the phage satellite PLE, suggesting that phage infection perturbs H-NS occupancy, thus contributing to the activation of the PLE program. Analysis of the ΔtsrAΔhns double knockout strain revealed that H-NS drives the main repression of targets and that TsrA generally acts in an H-NS-dependent manner; however, we also identified some sites where TsrA exerts H-NS independent regulatory control. Given the importance of the SOS response in inducing mobile genetic elements, such as prophages and antibiotic resistance genes [51–53], we also explored the effect of DNA damage on protein occupancy profiles and gene expression by performing IPOD-HR on V. cholerae treated with the DNA damaging agent mitomycin C (MMC). We observed a bimodal response in HAEs following MMC treatment: MMC induces increased transcription of some HAEs but increased silencing of others, likely .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint due to substantial rearrangements of the chromatin state at those loci. Our findings both reaffirm prior knowledge regarding the importance of NAPs (especially H-NS) for regulating HAEs in a clinical isolate of V. cholerae and reveal the combinatorial regulation between NAPs and other local regulators.

Results

IPOD-HR identifies large repressive protein occupancy regions in addition to local protein binding in Vibrio cholerae In order to identify locations of potentially repressive NAP binding genome-wide in V. cholerae, we applied IPOD-HR (in vivo protein occupancy display at high resolution) to V. cholerae KDS1 cells grown to exponential phase in lysogeny broth (LB). KDS1 is a clinical isolate representative of the V. cholerae O1 serogroup El Tor biotype, isolated in Bangladesh in 2011 [54], and will be referred to simply as V. cholerae hereafter. As detailed in [49,50], IPOD-HR provides a snapshot of total protein occupancy along the genome, along with paired RNA polymerase occupancy data (Figure 1A). Throughout the text below, we refer to the RNAP ChIP-seq subtracted protein signal as IPOD-HR, and the total protein occupancy before RNAP subtraction as IPOD. Application of IPOD-HR to V. cholerae revealed trends that are consistent with those identified in other model organisms [27,49,50]. The V. cholerae genome shows a diverse protein occupancy pattern, including local regions of protein occupancy consistent with transcription factor binding sites and large transcriptionally silent regions consistent with extended regions of NAP occupancy referred to as extended protein occupancy domains (EPODs), which have been observed in other organisms [27,49,50] (Figure 1B). An example of the information provided by IPOD-HR at a single locus can be seen in Figure 1C, where VC0596 (encoding dksA, a transcription factor [55]) appears highly expressed based on high RNA polymerase (RNAP) occupancy throughout the open reading frame, whereas the adjacent gene VC0597 (sfsA, annotated as “sugar fermentation stimulation protein homolog” in Uniprot) lacks RNAP occupancy and appears to be bound by a local regulator that represses VC0597 and/or activates transcription of VC0596. An example of a large EPOD is seen in Figure 1D, where a region of high protein occupancy extends about 8 kilobases over the region of .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint VCA0780-VCA0785; RNA polymerase binding is generally reduced over the same region. Of the genes encompassed by this EPOD, VCA0785 encodes for CdgC, which acts as a c-di-GMP synthase/phosphodiesterase and is known to regulate biofilm formation and motility, as well as virulence factor expression and rugose colony morphology [56,57]. The rest of the ORFs in this EPOD have not been characterized and have only automatically inferred annotations [58,59]. As was shown in the application of IPOD-HR in E. coli [27,49,50], EPODs tend to be transcriptionally silent, as is seen for the VCA0780-VCA0785 EPOD under the growth condition considered here. Previous application of IPOD-HR to B. subtilis revealed that some EPODs, such as those representing occupancy by the major NAP Rok, show negative enrichment scores following IPOD interphase extraction, in contrast to what was observed for the major NAPs in E. coli [27]. The most likely explanation is Rok-DNA complexes partitioning away from the interphase layer due to the properties of the Rok protein, and thus being depleted during the phenol-chloroform extraction rather than enriched, resulting in a negative ‘enrichment’ score. In considering protein occupancy across the V. cholerae chromosome, we observed regions of strong, sustained negative IPOD occupancy (before and after RNA polymerase ChIP-seq subtraction) similar to those observed in Rok-occupied regions in B. subtilis; an example is shown in Figure 1E. Importantly, these large negative occupancy regions appear to be transcriptionally silent, which is consistent with repressive NAP occupancy, and indicates that the observed negative signal is not due to the RNAP ChIP-seq subtraction used in the IPOD-HR method, or due to unbound DNA, which has been accounted for through comparison with the input sample. We refer to regions of depleted IPOD signal and low expression as negative EPODs (nEPODS) (Figure 1B). In the case of the example region shown in Figure 1E, we see a 2 kilobase nEPOD encompassing VC0516 and partially VC0515 on the large chromosome of V. cholerae. The region is a part of the HAE Vibrio seventh pandemic island-II (VSP-II) [60], where the transcriptionally silent VC0516 corresponds to the phage-like integrase of the VSP-II [61]. We hypothesize that, as in the B. subtilis Rok case noted above, the negative occupancy signal represents a protein or a combination of proteins that produce this behavior due to their properties in the IPOD-HR protocol. As detailed in the Supplementary Text, given the high overlap between known H-NS bound regions and nEPODs, we hypothesize that .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint many of the nEPODs in V. cholerae may represent H-NS binding. In E. coli, EPODs have been observed to form mainly on AT-rich DNA, and to regulate many horizontally acquired genes, prophages, and mobile genetic elements [27,49,50], consistent with previously established regulatory roles of xenogeneic silencers [19–22]. To characterize the properties of EPODs and nEPODs in V. cholerae, we examined the AT richness of EPODs, as well as performed gene set enrichment analysis to identify gene ontology (GO) terms occurring frequently in EPODs. In contrast to E. coli [27,49,50], EPODs in V. cholerae are significantly higher in GC content compared with a background corresponding to the rest of the genome, whereas nEPODs are significantly higher in AT content (Figure 1F). These results suggest that EPODs and nEPODs in V. cholerae likely correspond to occupancy from different DNA binding proteins (as previously observed in B. subtilis). Both EPODs and nEPODs, however, are associated with transcriptional silencing represented by a lack of RNAP occupancy in those regions. In order to identify the pathways primarily affected by EPODs in V. cholerae, we utilized the iPAGE method for gene set enrichment analysis [62], which identifies significant correspondences between gene ontology (GO) terms and the EPOD status (via mutual information) (Figure 1G). Primarily metabolism- oriented GO terms, such as “histidine catabolic process to glutamate and formamide” and “tryptophan biosynthetic process”, are over-represented in positive EPODs, and terms such as “release from viral latency” and “pilus” are enriched in negative EPODs (nEPODs), suggestive of HAE-related processes appearing in nEPODs. Collectively, the enriched GO term categories in V. cholerae EPODs and nEPODs match with the categories enriched in EPODs in E. coli, covering a spectrum of specialized metabolic functions and horizontally acquired elements. The distinction in categories between EPODs and nEPODs likely reflects the division of regulatory labor between different NAPs in V. cholerae, as was observed in B. subtilis. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Figure 1. Overview of the protein occupancy landscape in wild type Vibrio cholerae. A) Experimental diagram of the IPOD-HR workflow: crosslinked protein-DNA complexes are divided into three samples: “Input” represents the baseline total fragmented DNA, which is then subjected to extraction of the protein-DNA complexes enriched in the interphase layer (IPOD) .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint and, in parallel, to chromatin immunoprecipitation (ChIP) with an anti-RNA polymerase antibody [63]. DNA from the three samples is purified and sequenced. DNA is depicted in light blue, RNA polymerase (RNAP) is depicted in green with two lobes, the rest of the proteins in the samples are shown as x-shaped cartoons. B) Circos [64] plots of the V. cholerae KDS1 genome that include IPOD-HR traces of rz-scores, RNA polymerase ChIP-seq rz-scores, and locations of EPODs and nEPODs (data tracks are rolling medians over 512 bp windows). C) An example of likely local regulator binding in wild type V. cholerae; shown are typical binding patterns of DNA-binding regulatory proteins (through the midst of VC0597) and of highly transcribed regions (VC0596). IPOD rz-scores are based on the log2 ratios of extracted/input DNA. IPOD-HR rz-scores are RNAP ChIP subtracted IPOD rz-scores (see Materials and Methods). RNA polymerase ChIP-seq rz-scores are based on the log2 ratio of extracted RNAP ChIP vs input DNA. The VC numbers for genes from the C6706 strain of V. cholerae have been mapped to the KDS1 strain used in this study. D) Example of a transcriptionally silent extended protein occupancy domain (EPOD) in wild type V.cholerae, showing high protein occupancy and low RNA polymerase ChIP-seq signal across two divergent operons (spanning VCA0780-VCA0785). For this panel and the rest of the manuscript, the plotted rz-scores are median rz-scores over 512 bp windows unless otherwise noted. E) Example of a negative EPOD (nEPOD) in V. cholerae, showing both negative IPOD enrichment scores and depleted RNA polymerase ChIP-seq signal across VC0516 and partially on VC0515. F) Distribution of AT content of EPOD regions in the wild type V. cholerae KDS1, where negative EPODs (nEPODs) are higher in AT content than the background, whereas positive EPODs have higher GC content. “Background” is composed of all genomic regions that do not fall into the other indicated categories. Significance calling was performed with 1,000-sample permutation tests (described in detail in Materials and Methods) of differences in medians of AT content between the background and the EPODs or nEPODs; in both cases, a significant difference was observed (p-value=0.001). G) GO term enrichment analysis of EPODs and nEPODs. In the iPAGE plots in this manuscript, the color of the heatmap indicates the degree of enrichment or depletion of a given GO term (see color scale) among the genes in each GO term (row), in the EPODs or nEPODs under consideration vs. the rest of the genome. Nucleoid-associated protein IHF in Vibrio cholerae is involved in global iron regulation Due to the many transcriptional regulators in a bacterial cell, the IPOD-HR

Method

allows us to identify regulatory and protein occupancy effects along the genome upon the deletion of a NAP from both direct and indirect binding of effects of the NAP. The integration host factor (IHF) is one of the NAPs that acts as a dual regulator, broadly affecting gene expression [21]. Thus, we profiled the transcriptional network of .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint V. cholerae lacking the IhfA subunit, which dimerizes with the IhfB subunit to make functional IHF. With IPOD-HR we identified (Table S1) distinct loci at which protein occupancy is lost in the absence of IHF compared to wild type (Figure 2A). One such example occurs in the region upstream of VC0143, a hypothetical protein predicted to be essential due to the lack of transposon insertions in whole-genome transposon library screens [65]. We observe a decrease in protein occupancy coupled with an increase in RNA polymerase occupancy in the VC0143 promoter region upon ihfA deletion (Figure 2A). The intergenic region between VC0142 and VC0143 has been reported to encode a small RNA [66,67]. It was also shown that this region contains binding sites for the ferric uptake regulator (Fur) based on ChIP-seq data [66] and for the virulence regulator ToxT based on in vitro DNA pull-down of purified ToxT [68]. Thus, previous studies suggest that the expression of the VC0142/VC0143 sRNA may be regulated by multiple factors, including: Fur [66], which is known to respond to intracellular iron levels; ToxT [68], the master virulence regulator; and the nucleoid- associated protein IHF either directly or indirectly (based on our data). In order to trace the path from ihfA deletion to VC0142/VC0143 sRNA expression, we considered how the Fur regulon is, as well as iron homeostasis-related genes, are affected by the absence of IHF. RNA-sequencing (RNA-seq) showed that many genes involved in iron ion homeostasis and/or known to be in the Fur regulon are upregulated in the absence of IHF (Figure 2B). Our RNA-seq data also corroborated the findings from IPOD-HR that in the absence of IHF, the sRNA in the VC0142/VC0143 intergenic region as well as the hypothetical protein VC0143 are both significantly upregulated (log2 Fold Change (FC) and q-value of 3.94/1.01e-09 and 5.22/3.21e-14 for the small RNA and VC0143, respectively). iPAGE GO term enrichment analysis shows that the “iron ion transport” and “siderophore-biosynthesis process” are some of the enriched GO terms for highly expressed genes, consistent with the above findings that many iron ion homeostasis genes are upregulated in the absence of the alpha subunit of IHF (Figure 2C, Supplementary Figure 1A). Given the upregulation of iron homeostasis-associated genes in the absence of IhfA, we assessed Fur (VC2106) expression and observed a log2FC of -1.22 (q- value=0.005) in the ΔihfA strain compared to wild type, suggesting that Fur expression is decreased upon loss of functional IHF, providing a ready mechanism connecting loss of IHF to the widespread changes in the regulation of iron uptake that we observed. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Interestingly, Fur is also downregulated in the knockout strains mentioned below (ΔtsrAΔhns (L2FC=-2.71/q-value=1.08e-21), ΔtsrA (L2FC=-1.54/q-value=6.83e-07) and Δhns (L2FC=-1.69/q-value=2.60e-06)), but we did not observe a large global effect on the expression of iron homeostasis genes as observed in the ΔihfA, suggesting that IHF may be playing a distinct role in regulating the genes involved in iron-homeostasis rather than acting only through Fur. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Figure 2. Protein occupancy and RNA-sequencing in V. cholerae lacking IHF A) Protein occupancy (total and RNA polymerase) in the promoter region upstream of VC0143 and the VC0142/VC0143 intergenic sRNA for WT and ΔihfA cells during exponential growth in .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint LB media. The sRNA is shown in green and VC0143 is shown in gray because the annotation pipeline utilized only annotated genes coding for more than 90 amino acids (see Materials and Methods), and thus manual annotation was required. B) Volcano plot of differentially expressed genes in the strains lacking IHF (ΔihfA) relative to wild type cells. Dark colored points above the gray points represent genes that passed the significance threshold of q-value less than or equal to 0.1. The fur (VC2106) gene is denoted in blue, Fur regulon genes obtained from the ChIP-seq results found in ref. [66] (with q-value less than or equal to 0.1) are represented in orange, and a combination of GO terms that had iron in their names are represented in purple. Marginal density plots follow the same color coding as the points in the volcano plot. VC numbers (mapped from C6706 strain to KDS1 V. cholerae) or KDS1 numbers (that did not map with VC numbers) are only shown for highly (log2FC greater or equal to 3.5 or -3.5) differentially expressed genes, or the VC0142/VC0143 sRNA and the fur gene, to avoid crowding. VC0143 is manually annotated and shown as a triangle shape. C) GO term enrichment analysis of differentially expressed genes in the ΔihfA strain (relative to WT) from RNA-sequencing via iPAGE. Wald statistics from the ΔihfA RNA-seq are utilized as the input for iPAGE, detailed in Materials and Methods. The colors on the heatmap represent the GO terms that are highly expressed in ΔihfA (right) or repressed (left). The absence of H-NS results in increased RNA polymerase occupancy across horizontally acquired elements in V. cholerae In order to investigate the global regulatory effects of the xenogeneic silencer H- NS, as well as the putative H-NS-associated regulatory factor TsrA [69] (the regulon of which overlaps with that of H-NS [44,45]), we applied a combination of IPOD, RNA polymerase ChIP-seq, and RNA-seq to unravel changes in protein occupancy and gene expression in the absence of H-NS, TsrA or both. We compared the results to the effects of deleting ihfA (described above), and to a deletion of the enterobactin receptor gene vctA [70], which is not expected to have substantial effects under the growth conditions considered here, but serves as a control for any effects of constructing the deletions themselves. To quantify the effects of the NAP deletions indicated here on regions of likely NAP-mediated silencing, we calculated the changes in occupancy (relative to WT) for different deletion strains at the locations of EPODs and nEPODs identified in our wild type strain (Supplementary figure 2). We observe that, while many EPODs are unaffected, a substantial subset show very strong loss of occupancy (based on negative tails in the IPOD-HR violin plots) in the Δhns, ΔtsrAΔhns, ΔtsrA, and Δihf strains. Similarly, while RNAP occupancy is in the average case unchanged, a subset of EPODs and nEPODs show a sharp increase in RNAP occupancy in the Δhns and .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint ΔtsrAΔhns strains, indicating that deletion of hns, but not of the other regulators considered here, is sufficient to trigger expression changes in those regions. This observation indicates that the selected NAPs result in changes in protein occupancy in EPODs/nEPODs, but only the lack of H-NS results in significant de-repression of those regions. Deletion of the vctA control gene shows minimal shifts in the centers of the occupancy distributions relative to WT, but in the opposite direction as any NAP deletion, and without any substantive tail that would indicate strongly affected specific loci. Because some NAPs like H-NS are involved in repression of HAEs, we investigated protein occupancy in some of the known V. cholerae HAEs: ctx and the RS1 phage satellite, the Vibrio pathogenicity islands (VPI-1 and VPI-2), the Vibrio seventh pandemic islands (VSP-I and VSP-II), the integrative and conjugative element SXT-VchInd6, the viral satellite PLE, and the superintegron (Figure 3A). We find that the absence of H-NS (in both the Δhns and ΔtsrAΔhns strains) results in strong de- repression in VPI-1 (Figure 3C), PLE (Figure 3E), and some de-repression in CTX (Figure 3D), VPI-2, VSP-I, VSP-II based on RNA polymerase ChIP-seq (Figure 3B, Supplementary Figure 3,4). Deletion of tsrA only largely affected the transcription in VPI-1 out of the considered HAEs (Supplementary Figure 4C). Consistent with previous H-NS ChIP-seq results [71] (Figure 3C, gray track) showing that H-NS covers almost the entirety of VPI-1, we observe that the absence of H-NS results in de- repression of VPI-1 as evidenced by the appearance of a large stretch of RNA- polymerase occupancy in the Δhns and ΔtsrAΔhns strains (Figure 3C; Supplementary Figure 5A). As with VPI-1, deletion of hns results in de-repression of ctxA and ctxB, but no significant de-repression is observed in the other deletion strains tested here (Figure 3D, Supplementary Figure 4C, 5C). Although H-NS regulation of VPI-1 and CTX have been studied in other V. cholerae strains [35,38], the regulatory factors that modulate some of the recently identified HAEs, like the phage satellite PLE, in different clinical isolates have not been investigated. In the absence of H-NS, the later ORFs in PLE become de-repressed as evidenced by the observed substantial increase in RNA polymerase occupancy coinciding with the SviR promoter (Figure 3E, Supplementary Figure 5D). This is consistent with SviR expression being below the limit of detection in the absence of ICP1 infection [72] and the absence of H-NS allowing for transcription from the SviR .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint promoter and RNA polymerase occupancy extending into downstream ORFs. Additionally, no dramatic changes are observed across PLE in RNA polymerase occupancy in the absence of tsrA, ihfA, or the control vctA, suggesting that H-NS is the primary repressor of PLE gene expression. However, the RNA polymerase occupancy in the absence of H-NS is inconsistent with the highly programmed gene expression profile of PLE responding to ICP1 infection, which is known to activate the PLE transcriptional program [46]. The incomplete activation of PLE gene expression is consistent with previous observations that other regulatory factors (presumably ICP1- derived) are needed to fully activate the PLE. Our IPOD-HR results show that among the tested NAPs, H-NS results in the highest level of repression of the V. cholerae HAEs, and that H-NS repression plays a considerable role in silencing aberrant PLE gene expression in the absence of ICP1 infection. Although there is a high correlation between the RNA polymerase ChIP-seq and RNA-seq results of our V. cholerae strain (Supplementary Figure 8A), we further assessed the effect sizes of fold changes in transcript levels of individual genes in the PLE with RNA-seq. We observed that orf15, coding for a nicking endonuclease, NixI, that cleaves and inhibits the ICP1 phage genome replication [13], is the most highly differentially expressed gene in cells lacking hns (with L2FC= 5.29/q-value=6.52e-19) out of the all annotated genes in the PLE (Table 1). Additionally, orf17/tcaP is also the most significantly differentially (with L2FC=3.11/q-value=5.09e-35) expressed gene that was recently demonstrated to encode a scaffolding protein of the phage ICP1 coat allowing better transmission of the PLE HAE [14]. Consistent with our findings that the loss of hns masks the effect of the loss of tsrA, H-NS is silencing the majority of the PLE (Table 1) and no further effect is seen when both H-NS and TsrA are absent. Although lack of H-NS results in de-repression of the majority of genes in the PLE, we did not observe the excision or replication of the phage satellite (based on the read counts of the PLE in the input samples of strains lacking hns, and a lack of read boundaries at the PLE ends that would indicate excision). This is consistent with ICP1-encoded PexA being necessary for excision of PLE [11]. Taken together, our IPOD-HR and RNA-seq

Results

show that out of the tested NAPs, H-NS repression plays a considerable role in silencing aberrant PLE gene expression in uninfected cells. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Figure 3: Effect of NAP deletions on the horizontally acquired elements of V. cholerae. A) Locations of known HAEs in the KDS1 strain of V. cholerae (outer rings). The blue and orange lines in both chromosomes indicate plus and minus strand genes, respectively. The two chromosomes are not depicted to scale. B) Changes (relative to WT) in robust rz-scores of IPOD-HR and RNA polymerase occupancy across the indicated genomic regions. Here and in subsequent panels summarizing IPOD-HR occupancies, a 50 bp rolling median was used as the fundamental unit of data for each genomic location, and the plotted values reflect the pseudomedian of those values across the indicated genomic features; for RNA polymerase ChIP-seq, we followed a similar procedure, except that we used gene-level means of the ChIP occupancies as individual units of data for the pseudomedian calculations. Individual biological replicates are shown in gray points and the larger colored points are the mean of replicates for each genotype. C) Total protein occupancy signals and RNA polymerase occupancy in VPI-1 for all of the genotypes studied. The gray track shows previously obtained V5-HNS ChIP-seq from C6706 [71] remapped and requantified on our reference genome (detailed in Materials and Methods).The tracks of total protein occupancy with RNAP ChIP subtraction (IPOD-HR) are found in Supplementary Figure 5A. D) As in panel C, for the CTX prophage region; the additional red tracks indicate IPOD-HR. Only wild type and Δhns are depicted in Figure 3D, the rest of the genotypes are found in Supplementary Figure 5C. E) Total protein occupancy signals and RNA polymerase occupancy in the PLE satellite in wild type and Δhns V. cholerae. The rest of the genotypes are found in Supplementary Figure 5D. Table 1: RNA-sequencing Log2 (Fold Change (FC)) of deletion strains (relative to wild type) in genes that have been annotated in the PLE of V. cholerae. We consider a q-value of ≤ 0.1 to be significant. The genes have been sorted by increasing q-values in the Δhns vs wild type. *gene was manually annotated. PLE genes Log2FC in Δhns vs wild type q-value in Δhns vs wild type Log2FC in ΔtsrA vs wild type q-value in ΔtsrA vs wild type ORF17 (TcaP) 3.113 5.09E-35 0.686 6.97E-03 ORF15 (NixI) 5.289 6.52E-19 2.287 6.28E-05 ORF13 2.049 1.55E-15 1.796 3.36E-15 ORF2 (CapR) 2.424 4.74E-15 2.320 1.62E-17 ORF21 1.885 2.77E-11 0.366 2.09E-01 ORF16 3.541 3.86E-09 0.293 6.69E-01 ORF15.1 3.806 1.22E-07 -0.149 8.69E-01 ORF14 1.836 5.19E-07 1.849 4.15E-09 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint ORF22 1.598 8.79E-06 -0.152 7.02E-01 ORF3 2.229 2.53E-05 1.360 3.78E-03 ORF23 1.347 5.73E-05 -0.225 5.18E-01 ORF12 1.658 5.13E-04 1.111 7.28E-03 ORF9 1.104 5.56E-04 1.068 7.55E-05 ORF5 2.436 1.36E-03 2.599 4.07E-05 ORF4 1.946 3.86E-03 0.836 1.64E-01 ORF8 0.764 5.35E-03 1.020 3.66E-06 SviR (sRNA) 1.414 2.79E-02 1.465 4.34E-03 ORF18.1 -1.095 3.85E-02 -1.547 1.59E-04 ORF20.1 (LidI)* 1.039 4.45E-02 -0.685 1.09E-01 ORF7 -1.015 5.88E-02 -1.096 8.87E-03 ORF18 -0.767 1.74E-01 -1.162 5.46E-03 ORF12.1 1.321 2.50E-01 0.521 5.98E-01 ORF11 0.615 2.68E-01 -0.176 7.19E-01 ORF19 -0.181 5.93E-01 -0.659 1.85E-03 ORF20 0.306 6.13E-01 -1.234 9.27E-04 ORF1 -0.240 6.34E-01 -0.268 4.53E-01 ORF10 -0.075 9.66E-01 -1.374 9.57E-02 The absence of H-NS masks the effect of absence of TsrA at the majority of loci TsrA (VC0070) has weak amino acid sequence similarity with the N-terminal oligomerization domain of H-NS in V. cholerae and was recently shown to play a role as a transcriptional regulator [44,45,69]. Because the regulon of TsrA, based on transcriptomic studies, overlaps with that of H-NS [44,45] and because H-NS plays an important role as a global regulator, we tested for epistasis between deletions of hns and tsrA in the V. cholerae clinical isolate under study. Using our RNA-seq results, we assessed the correlation between the regulation of genes from individual deletions of .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint hns and tsrA (Figure 4A). The majority of genes show similar changes in transcript abundance upon loss of H-NS or TsrA (and thus, we hypothesize that H-NS and TsrA co-regulate many of these targets), on the basis of the robust linear regression with a slope close to 1; however, there is a cluster of 49 genes that show substantially more increased transcript levels in the absence of H-NS compared to that of TsrA (thresholding at three standard errors above the curve of slope=1 in the top right quadrant). This cluster includes, for example, the toxin coregulated pilus VC0828/tcpA with significant log2FC=6.40/q-value=6.85e-122 upon deletion of hns, compared to the effect of tsrA deletion with a log2FC=1.79/q-value=3.55e-12. We also compared the effects of individual deletions of hns or tsrA with those of the double knockout (Figure 4B,C). The expression changes caused by the double knockout, especially for genes showing strong differential expression, largely track with the changes seen in the hns single knockout (genes along the line of slope=1), again confirming that deletion of hns is the primary cause of expression changes in the double knockout. For the HAEs studied here, this pattern of H-NS regulatory dominance largely holds true (Figure 4E). However, some exceptions exist: 55 genes are upregulated more strongly in the absence of both proteins out of 381 significantly upregulated genes in Δhns, suggesting independent and additive effects of H-NS and TsrA on some loci. For instance, the uncharacterized gene KDS1_02240/s003 [42] (a ParM/StbA family protein based on an NCBI blast search), located in the SXT, shows a large increase in expression in the double knockout (log2FC=5.87/q-value=7.57e-31) compared to the individual deletion strains (log2FC=1.72/q-value=0.0167 in the Δhns and log2FC=0.731/q-value=0.253 in the ΔtsrA). Similarly, the accessory colonization factor VC0844/AcfA, encoded within VPI-1, is higher in expression (log2FC=6.05/q- value=6.3e-24) in the double mutant compared to individual deletions of hns (log2FC=3.69/q-value=1e-6) or tsrA (log2FC=2.57/q-value=1e-4), respectively, indicating roughly additive effects of the deletions. These results suggest that both H- NS and TsrA synergistically regulate a subset of genes, but H-NS is responsible for the majority of expression changes in the double knockout. In contrast, at the majority of H- NS regulated sites, the additional loss of TsrA does not have any further impact on regulation, suggesting that TsrA’s role is to act via modulation of H-NS or another DNA- binding protein. The napFDABC (VCA0676 - VCA0680) locus, encoding for periplasmic nitrate .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint reductase [73], provides an example of some exceptions, based on IPOD-HR experiments, where we observe independent regulation by H-NS and TsrA (Figure 4D). De-repression of this locus is observed in the absence of TsrA, whereas expression drops in an hns knockout; however, the double mutant results in a further increase in RNAP occupancy (and thus presumably transcription) relative to tsrA deletion. These

Results

appear to indicate that TsrA and H-NS act independently of each other at the napFDABC locus and when they are both no longer present, this chromatin region becomes de-repressed on the basis of increased RNA polymerase occupancy (Figure 4D). RNA-sequencing also demonstrates an increase in napA transcript level in the double knockout (log2FC=1.28/q-value=6.05e-12) relative to either the hns (log2FC=0.61/q-value=1.86e-2) or tsrA (log2FC=0.43/q-value=4.76e-2) single mutants. This result suggests that TsrA may act alongside of (and independently of) H-NS at a few loci, similar to the behavior of H-NS paralog StpA in E. coli, where we have seen a synergistic de-repression upon the deletion of both stpA and hns [27]. However, unlike StpA, which binds directly to DNA [26], TsrA has not been predicted to have DNA- binding ability [44,45], suggesting it may function through H-NS and/or another protein. iPAGE analysis for gene set enrichment is consistent with previous findings that TsrA, like H-NS, regulates many virulence and HAE-associated biological pathways (Supplementary Figure 6A). Many metabolism-related GO terms such as “organic acid catabolic process”, “de novo IMP biosynthetic process”, and “peptide-transporting ATPase activity” are highly expressed in the absence of both H-NS and TsrA. Consistent with previous studies [44,45], we observe that the regulons of both H-NS and TsrA are both AT-rich, which is a characteristic of many horizontally acquired elements and targets of H-NS in gamma-proteobacteria (Supplementary Figure 6B). To further assess the interplay of regulation by H-NS and TsrA, we investigated the enrichment of GO term pathways that are uniquely upregulated in the single deletion of hns or tsrA and the double mutant (Supplementary Figure 7). We found that the shared regulons of H-NS and TsrA consist primarily of metabolic pathways, including “organic acid catabolic process”, “organic anion transport”, “generation of precursor metabolites and energy”, “tricarboxylic acid cycle enzyme complex” and others. In contrast, the regulatory targets uniquely affected by hns deletion and tsrA deletion appear to contain disjointed sets of genes involved in host colonization and virulence, including terms such as “host cell plasma membrane” (e.g., VC1451 and VC0822) in .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint hns deletion and “pilus” (e.g., VC2423, VC0857 and VC0409) in tsrA deletion. These findings indicate that TsrA in fact acts independently of H-NS in regulation of a few key targets involved in virulence, suggesting that TsrA may act with another DNA binding protein to affect gene expression at some loci .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Figure 4 Effect of the absence of both of H-NS and TsrA on transcript levels in V. cholerae. A-C) Pairwise comparison of log2FC (Fold Changes) in transcript abundances (measured via RNA-seq) for the indicated genotypes, relative to wild type. Blue fitted lines represent the robust .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint linear fit of significantly (q-value less than or equal to 0.1) differentially expressed genes in both of the genotypes (green points) compared; slopes for the fitted lines for each comparison are shown in blue. A line with slope of 1 is shown in black. The points represent four categories with indicated colors: significant genes in both genotypes, significant in only one or the other, significant in neither. The genotypes of pairwise comparisons relative to wild type are: A) Δhns vs. ΔtsrA; B) Δhns vs. ΔtsrAΔhns; C) ΔtsrA vs. ΔtsrAΔhns D) Protein occupancy (total and RNA polymerase) in an EPOD spanning the napFDABC locus in cells of the indicated genotypes, during exponential growth in LB media. E) Heatmap representing the transcript abundances (log10[TPMs]) of genes in the strains shown in various HAEs: VSP-I, VSP-II, VPI-1, VPI-2, SXT, RS1, PLE and CTX. The rows represent the genes within the HAEs, the columns represent individual biological replicates of the indicated genotypes. Gray rectangles in heatmaps represent genes with 0 TPMs. Genes in our revised

Reference

genome that mapped to multiple VC numbers were omitted from analysis. Structural Modeling Demonstrates a Potential Mechanism for TsrA to Modulate H-NS Binding As the results above suggest that the effects of TsrA are mediated largely, though not exclusively, through H-NS, we next sought to determine a potential mechanism for this behavior. Consistent with the absence of a recognizable DNA binding domain [74], modeling a TsrA dimer using DMFold [75] yields a high confidence structure with a predicted quality score (QS; see Materials and Methods) of 0.75, but no clear nucleic acid binding surface is present (Figure 5A-B), and a Foldseek [76] search of one of the modeled chains with an E-value cutoff of 1 yields no hits to known DNA-binding proteins. In contrast, modeling of a H-NS/TsrA heterotetramer using DMFold shows a striking potential binding mode in which TsrA inserts itself into an H-NS dimer, losing TsrA-TsrA contacts in favor of a close association with H-NS (Figure 5C), with a lower QS of 0.47. We note that the structure generally has high local confidence except in a potentially flexible loop connecting the oligomerization and DNA binding domains of the H-NS. In the modeled heterotetramer, the N-terminal dimerization domain (Figure 5D) and the C-terminal DNA binding domain (Figure 5E) of H-NS are both individually similar to existing experimental structures of E. coli H-NS, but TsrA presents an additional interface directly adjoining the H-NS DNA binding domain (Figure 5F). The proximity of TsrA in the heterotetramer model to the DNA binding domain of H-NS in fact presents a unified potential DNA binding interface, providing a ready molecular explanation for how TsrA may modulate H-NS - DNA interactions. While the majority of .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint TsrA effects appear to be H-NS mediated, and explicable through the model shown here, the presence of a minority of H-NS independent TsrA targets (as shown in Figure 4) suggests that TsrA may also interact with other regulators besides H-NS, perhaps in some similar mode. Figure 5: Structural modeling of TsrA/H-NS interactions. A) DMFold model of a TsrA dimer, with the two chains shown in blue and cyan. B) Equivalent view to panel A, showing a surface with residue types colored (white: hydrophobic; green: hydrophilic; blue: basic; red: acidic). C) Model of a 2:2 TsrA/H-NS heterotetramer, with the TsrA chains colored as in panel A, and the H-NS chains colored in red and orange; below is shown a surface representation colored by the residue-level predictive confidence (pLDDT scores). D) Superposition of a crystal structure of the oligomerization domain of E. coli H-NS (PDB code 1LR1, shown with chains in green and gold) with the equivalent portion of the heterotetramer model from panel C (transparent gray cartoon). E) Superposition of an NMR structure of the DNA binding domain of E. coli H-NS (PDB code 1HNR, in red) with the equivalent region of H-NS from the heterotetramer model in panel C. F) As in panel E, but including the modeled TsrA chain (shown in blue). .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint IPOD-HR reveals horizontally acquired elements affected and unaffected by the SOS response Environmental stresses, such as DNA damage, have been shown to affect the regulation of prophages and antibiotic resistance genes found on HAEs for mobilization by stimulating the SOS response [51,52,77]. Thus, we profiled the genome-wide effect of a DNA damage causing agent, mitomycin C (MMC), on transcription and protein occupancy of V. cholerae to identify DNA damage responsive regions of the genome. Transcription and protein occupancy of some of the HAEs were affected by MMC treatment, such as SXT, RS1, CTX, and VSP-1, and many EPODs/nEPODs. (Figure 6A, B; Supplementary Figure 2, 3, 4). However, the superintegron and PLE exhibit negative occupancy signal under MMC treatment, which is not associated with changes in RNA polymerase binding, suggesting that H-NS or other factors that are producing negative signal during interphase extraction are perhaps binding these regions to maintain silencing during DNA damage (Figure 6A-C, Supplementary Figure 3, 4), or that additional transcriptional activators would be needed to enhance transcription in those regions. SXT, RS1, and CTX show increased RNA polymerase binding compared to untreated cells (Figure 6D, Supplementary Figure 4,8B). The SXT integrative conjugative element is a large horizontally acquired genetic element that harbors antibiotic resistance genes and phage defense systems [48,52,78] and it has been shown to be mobilized when the lambda phage cI-like repressor in SXT is proteolyzed by RecA, allowing expression of the genes necessary for the HAE’s transfer that are otherwise repressed [51,52,77]. For example, the mobI gene and origin of transfer (oriT) of SXT [79], which is located and conserved in the intergenic region of mobI and s003 in many SXT elements (Figure 6D), showed increased RNA polymerase binding upon MMC treatment, suggesting increased expression to prepare for mobilization of the element. We also observed a localized region of high protein occupancy in the mobI- s003 intergenic region, perhaps corresponding to the recruitment of the relaxosome consisting of oriT binding proteins needed for the conjugative transfer, as is observed in the bacterial DNA conjugation process [80,81]. Other loci in the SXT exhibiting increased RNA polymerase occupancy during MMC treatment are shown in Supplementary Figure 8B, allowing us to assess which regions are activated during SOS response. In contrast to the chromosome 1 HAEs, PLE and the remainder of the superintegron exhibit a strongly negative IPOD signal upon MMC treatment with no .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint concurrent increase in RNA polymerase binding (Figure 6A-C). This observation suggests that MMC causes substantial occupancy of PLE by H-NS or some other atypical factor. This occupancy results in a negative signal in our assay, but clearly indicates the presence of a factor with the potential for maintaining repression of these regions. Figure 6: Effect of mitomycin C on the regulation of horizontally acquired elements. A-B) Changes (relative to WT) in robust rz-scores in IPOD-HR (A) and RNA pol occupancy (B) in untreated and mitomycin C treated (+MMC) V. cholerae cells over the regions shown. Individual replicates are shown in gray points and the larger colored points are the mean of each of the replicates for each condition. See Fig. 3B for details on how the summary statistics shown here were generated. C) Total protein occupancy signals (IPOD), RNA polymerase ChIP subtracted protein occupancy (IPOD-HR) and the RNA polymerase occupancy (RNAP ChIP-seq) in the PLE island in untreated and MMC treated cells. D) Total protein occupancy signals (IPOD), RNA polymerase ChIP subtracted protein occupancy (IPOD-HR) and the RNA polymerase occupancy (RNAP ChIP-seq) in a region of the SXT element in untreated and MMC treated cells. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint

Discussion

Proper regulation of virulence genes and horizontally acquired elements is critically important for bacterial fitness, as constitutive expression would be extremely metabolically costly, but failure to express those genes when needed would prevent optimal host colonization, nutrient acquisition, and growth. Through the application of whole-genome protein occupancy profiling and RNA polymerase ChIP-seq to a series of strains lacking different nucleoid-associated proteins (NAPs), we identified a division of labor across different NAPs for regulating different genetic elements associated with horizontal gene transfer and bacterial virulence. Consistent with previous data, we observe that the NAP H-NS is the main xenogeneic silencer in V. cholerae [38,39]; deletion of H-NS was sufficient to trigger increased transcription of many genes. Additionally, deletion of the recently described protein TsrA, a protein with a regulon heavily overlapping that of H-NS, appears to primarily act through H-NS and on H-NS dependent targets, as the regulatory effects of a tandem H-NS/TsrA deletion closely resembled those caused by loss of H-NS alone at the majority of H-NS upregulated targets (381 genes). Because TsrA does not have a predicted DNA-binding domain, it may modulate and associate with H-NS only at certain loci to repress the H-NS targets, much like Hha in E. coli [23,82]. However, at some loci (55 in all), deletions of tsrA and hns show additive or even supra-additive effects, perhaps caused by TsrA associating with another DNA binding protein. Similar observations were made with Hha in Salmonella Typhimurium where Hha may have an H-NS-independent effect on gene expression [83]. It is especially notable that some loci regulated by TsrA, even in the absence of H-NS, are involved in host colonization and virulence (e.g., some genes in “pilus” GO term: VC0409, VC0857, VC2423). Future studies directly tracking the binding sites of TsrA and H-NS in each others’ presence and absence, and possibly concerted efforts to identify potential protein-protein interaction partners of TsrA, will be fruitful in more fully resolving the role of TsrA in modulating V. cholerae gene expression. Other abundant NAPs such as integration host factor (IHF) appear to regulate a distinct set of genes from those covered by H-NS. For example, we found that IHF is involved in regulating the overall balance of iron metabolism. Loss of IHF also appears to alter protein occupancy in H-NS regulated regions, perhaps corresponding to .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint competition with H-NS binding (Supplementary Text); functional implications of these changes (e.g., whether the strength of repression by H-NS is increased in the absence of binding from competing factors in HAEs) remains unclear. Because of the ability of HAEs to mobilize and further promote horizontal gene transfer during DNA damage, we investigated the genome-wide effect of SOS response of V. cholerae with IPOD-HR as the induction of SOS response with DNA damaging agent mitomycin C (MMC). We observed that some loci in the HAEs have higher RNA polymerase occupancy in comparison with a no-treatment control. While some regions of HAEs became de-repressed upon the addition of MMC, other HAEs like the phage satellite PLE were not affected in transcription but we observed substantial changes in the protein occupancy profiles in those regions, perhaps suggesting that other factors such as H-NS may be binding and indicating a rearrangement of chromatin structure, but the absence of some essential signal, presumably ICP1-encoded, that would be needed for full activation of the HAE. The H-NS dependent silencing of PLE revealed here appears in stark contrast to the regulatory strategies for HAEs similar to PLEs that rely on helper phages. For example, for the well-studied Staphylococcus aureus pathogenicity islands (SaPIs) – the HAE itself encodes a master repressor (in this case, Stl) that keeps the SaPI in the prophage-like state. Stl-mediated repression is counteracted by Stl complexing with specific phage antirepressors that disrupt the formation of Stl-DNA complex [84,85]. Based on our data, we hypothesize that H-NS is silencing the majority of the PLE and thus not relying on a PLE-encoded master repressor, but still requiring a phage- encoded protein to relieve the repression by H-NS and TsrA (possibly through induction of some transcriptional activator that remains to be identified). Some examples of phage proteins that relieve the repression by an H-NS family protein (MvaT) include the Pseudomonas phage LUZ24 Gp4, which is proposed to inhibit MvaT-DNA bridged complex formation [86]. Thus, similar investigations may lead to the identification of the ICP1-encoded factor(s) needed for full activation of the PLE during ICP1 infection. We expect that future experiments assessing the extent to which DNA damage or NAP deletions license transcription of the PLE and similar regions will be highly informative (e.g., by testing whether the induction kinetics of PLE are altered by the changes in chromatin structure that we observed), as will additional efforts to determine the proximal signal (presumably some element from ICP1) triggering PLE activation .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint during phage infection. Acknowledgments This work was supported by NIH R35 GM128637 (to PLF) from the National Institute of General Medical Sciences and 1R01AI127652 (to KDS) from the National Institute of Allergy and Infectious Diseases; its contents are solely the responsibility of the authors and do not necessarily represent the official views of the National Institute of Allergy and Infectious Diseases, the National Institute of General Medical Sciences, or NIH. KDS holds an Investigators in the Pathogenesis of Infectious Disease Award from the Burroughs Wellcome Fund. DTD was supported by the National Science Foundation Graduate Research Fellowship (2018257700). Data availability IPOD-HR and RNA-sequencing data are available on GEO (accession number: GSE250408) with the URL: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi? acc=GSE250408 . Nanopore sequencing data for KDS1 are available from the Sequencing Read Archive (SRA) with accession PRJNA1056466.

Materials and methods

Strains, culture, and media conditions The genotypes of all V. cholerae strains used in this study are listed in Table S2. V. cholerae KDS1, an El tor clinical isolate that encodes PLE1, was used as the wild type strain for all experiments and genetic manipulation. All standard overnight bacterial cultures were grown from a single colony in 2 mL volume with aeration at 37oC in standard LB Miller liquid growth medium, supplemented with streptomycin (100 µg/mL). For cultures requiring mitomycin C treatment, an overnight culture grown in standard conditions was diluted to an OD600 of 0.003 into 140 mL of pre-warmed LB Miller liquid medium supplemented with mitomycin C to a final concentration of 20 ng/mL. Cultures were grown to a final OD600 of 0.3 as described and processed according to the methods below. Strain engineering All genetically manipulated strains were generated through splicing by overlap .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint extension (SOE) PCR to generate PCR products with FRT-SpecR/KanR-FRT in place of the gene of interest. This SOE PCR product was added to V. cholerae grown overnight on chitin to induce natural competence and transformants were selected with the appropriate antibiotics, as previously described [87]. Transformants were screened by colony PCR and Sanger sequencing to confirm the presence of the deletions of interest. IPOD sample preparation IPOD samples were collected as previously described [50] with minor modifications. Overnight cultures were diluted to a starting OD600 of approximately 0.003 in a final volume of 140 mL of LB medium in a 1 L flask. The diluted cultures were grown at 37oC with shaking at 250 RPM until a final OD600 of 0.3. 28.9 mL of culture was collected at each time point and mixed with 300 μL 1 M sodium phosphate buffer (pH 7.4) and 810 μL formaldehyde (37%, fresh) in a 50 mL conical tube, then allowed to crosslink for 5 minutes shaking at room temperature (notably, unlike in [50], no rifampicin was added prior to crosslinking). After 5 minutes, the crosslinking reactions were quenched with 6 mL of 2M glycine and returned to the room temperature shaker. After 10 minutes of shaking, tubes were placed in ice for 10 minutes and then pelleted at 7,000x g at 4oC for 4 minutes. Pellets were washed twice with 10 mL of ice-cold phosphate-buffered saline and the final cell pellets were snap-frozen in a dry ice-ethanol slurry and stored at - 80oC. Cell lysis, digestion, and lysate clarification The original IPOD-HR protocol is from ref. [50]; below we summarize the procedures applied here. Frozen cell pellets were resuspended in 600 μL of 1X IPOD lysis buffer (10 mM Tris HCl, pH 8.0; 50 mM NaCl) with 1x protease inhibitors tablet (Roche Complete Mini, EDTA free, Roche Diagnostics GmbH, Mannheim, Germany) and 1.5 μL lysozyme (ThermoScientific, REF90082, 50 mg/mL) incubated for 15 minutes at 30oC with gentle shaking and then placed on ice. Resuspended cells were sonicated with a Branson sonicator at 25% power with three bursts of 10s ON and 10s OFF while in an ice water bath. Sonicated cells were then digested with 6μL RNaseA (SIG 10109169001, Sigma-Aldrich/Roche, 10mg/mL), 5.4 μL of 100mM MnCl2, 4.5 μL of .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint 100mM CaCl2, 9 μL of DNase I, RNAse-free (ThermoScientific, #89836, 1U/μL) and incubated on ice for 30 minutes, after which reactions were quenched with 50 μL 500mM EDTA (pH 8.0); we aimed to obtain fragment sizes of about 150 bp. 400 μL of 1x IPOD lysis buffer without protease inhibitors and lysozyme was added to the digested mixture and vortexed. The mixture was clarified by centrifugation for 10 minutes at 15,700 x g at 4oC and transferred to new tubes. The clarified lysate was partitioned into three different tubes: 50 μL for INPUT (for baseline reference), 400 μL for IPOD (for total protein occupancy), and the rest for CHIP (for RNA polymerase occupancy). 450 μL of ChIP elution buffer (50mM Tris (pH 8.0), 10mM EDTA, 1% SDS) was added to the INPUT sample and kept on ice until reverse cross-linking step (mentioned in the next step). Interphase extraction and nucleic acid purification 400 μL of 100 mM Tris Base and 800 μL of 25:24:1 phenol:chloroform:isoamyl alcohol (Sigma Aldrich, #77617) was added to the 400 μL of clarified lysate kept for the IPOD- HR sample, vortexed for 10s, and then incubated for 10 minutes at room temperature. To obtain a separation of the organic and aqueous layers and formation of the white interphase disc, which is enriched in protein-DNA complexes, the mixture was spun for 2 minutes at 21,130 x g at room temperature. The aqueous and organic layers around the interphase were removed while minimizing disturbance of the interphase disc. The extracted disc was washed once in 350 μL TE (10 mM Tris, pH 8.0; 1 mM EDTA), 350 μL 100 mM Tris base, and 700 μL 24:1 chloroform:isoamyl alcohol. The layers were again separated by spinning for 2 minutes at 21,130 x g at room temperature. After removal of liquid around the interphase disc, a final wash of 700 μL TE and 700 μL 24:1 chloroform:isoamyl alcohol was applied. After vortexing and spinning the final wash as before, as much of the liquid around the white disc as possible was removed and discarded. The interphase disc was resuspended in 500 μL of ChIP elution buffer and vortexed. The resuspended interphase of IPOD sample and INPUT sample (kept on ice from above step) were incubated at 65oC overnight to reverse-crosslink (6-16 hours). RNA Polymerase Chromatin Immunoprecipitation The rest of the clarified lysate (about 500 μL) was mixed with 1 volume of 2x immunoprecipitation (IP) buffer (200mM Tris (pH 8.0), 600mM NaCl, 4% Triton X-100) .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint and incubated with 10μg of anti-E. coli RNA polymerase antibody (NeoClone WP023, NeoClone, Madison, WI, Lot: 2019G15-002) overnight with rocking at 4oC. After overnight incubation, 50 μL of protein G beads (New England Biolabs (NEB), S1430S) per sample was equilibrated in 1x IP buffer. 50 μL of equilibrated protein G beads was added to each antibody-lysate mixture and then incubated with rocking 2 hours at 4oC. After incubation, the beads were washed once in 1mL of the following buffers: IP wash buffer A, IP wash buffer B, IP wash buffer C, 1x IP buffer, and 1x TE. ● 1x wash buffer A (100mM Tris, pH 8.0; 250mM LiCl; 2% Triton X-100; 1mM EDTA) ● 1x wash buffer B (10mM Tris, pH 8.0; 500mM NaCl; 1% Triton X-100; 0.1% sodium deoxycholate; 1mM EDTA) ● 1x wash buffer C (10mM Tris, pH 8.0; 500mM NaCl; 1% Triton X-100; 1mM EDTA) After the wash steps, beads with the immunoprecipitated material were resuspended in 500 μL of ChIP elution buffer by rocking for 5 minutes; the samples were then incubated for 30 minutes at 65oC with vortexing every 5 minutes. The beads were then separated and discarded to retain only the eluted immunoprecipitated sample. The samples were then left to reverse-crosslink overnight (6-16 hours) at 65oC. DNA extraction post reverse cross-linking The DNA from INPUT, IPOD, and ChIP samples were isolated and purified following identical steps. After overnight incubation to reverse the formaldehyde crosslinks, each sample was incubated with 10 μL of RNAse A (Roche Diagnostics GmbH, #SIG10109169001, 10mg/mL) for 2 hours at 37 oC. Then we followed by adding 10 μL Proteinase K (ThermoScientific, #EO0491, 20mg/mL) and incubating for 2 hours at 50 oC. The DNA of each sample was isolated by phenol-chloroform extraction using one volume of 25:24:1 phenol:chloroform:isoamyl alcohol, then re-extracted with one volume of 24:1 chloroform:isoamyl alcohol. At the last stage the samples went into DNA LoBind tubes. The isolated DNA was then precipitated by adding 1/25th volume of 5M NaCl as a precipitating salt, 1/300th volume of GlycoBlue (Invitrogen, #AM9515, 15mg/mL) as a co- precipitant, and two volumes of ice cold 1:1 isopropanol:ethanol. The DNA was precipitated at 4oC for one hour and at -20 oC for more than 1 hour or overnight. Chilled samples were then centrifuged for 15 minutes at 16,100 x g at 4 oC to pellet the DNA. All the liquid was then removed without disturbing the DNA pellet. Pellets were then washed in freshly diluted 95% ethanol, vortexed and then centrifuged .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint for 5 minutes at 16,100 x g at 4oC. Finally, the liquid was removed, and the remaining ethanol was left to evaporate for 30 minutes or less. The following volumes of TEe (10 mM Tris, pH 8.0; 0.1 mM EDTA) were added to resuspend the DNA from the three sample types: 200 μL for INPUT pellets, 50 μL for IPOD, and 30 μL for ChIP. All samples were quantified with QuantiFluor dsDNA System (Promega, #2670) using a BioTek Synergy plate reader. Input samples were assessed for fragment sizes on 2% agarose gel electrophoresis. Illumina library preparation for IPOD-HR The purified and quantified DNA from INPUT, IPOD, and RNA Polymerase ChIP samples were then prepared for Illumina sequencing using NEBNext® Ultra™ II DNA Library Prep Kit for Illumina® (#7645S/L) with NEBNext® Multiplex Oligos for Illumina® Unique Dual Index UMI Adaptors DNA Set 1 (#E7395S) or NEBNext® Multiplex Oligos for Illumina® Dual Index Primers Set 1 (#E7600S) and Dual Index Primers Set 2 (#E7780S). Omega Biotek Mag-Bind DNA purification or Axygen purification beads were used for all SPRI bead cleanup steps. The bead purification at the post-adaptor ligation stage was modified as follows: 1.8x volume of DNA purification beads and 0.7x isopropanol were added instead of the normal 0.9x volume of beads. Each sample prepared for sequencing was quantified and assessed for fragment size and for the presence of any adaptor present on 2% agarose gel electrophoresis. The samples were pooled into libraries, which were sequenced on a NextSeq 550 instrument at the Michigan Advanced Genomics Core (GEO accession: GSE250408). rRNA depletion and RNA-sequencing library preparation Purified RNA was subjected to Baseline-ZERO™ DNase digestion in a 100 μL reaction of 20 μL of purified RNA, 5 μL Baseline-ZERO™ DNAse enzyme (LGC Biosearch Technologies, Pt# E0110-D1, 1 U/μL), 10 μL 10X Baseline-ZERO™ DNase Reaction Buffer, 2 μL RNase Inhibitor, Murine (NEB, #M0314S/L) and nuclease-free water at 37oC for 30 minutes. The RNA was purified with Zymo RNA clean and concentrate kit and eluted with nuclease-free water. The quality of purified RNA was assessed on a 2% agarose gel with guanidine thiocyanate and quantified with a NanoDrop spectrophotometer. Purified RNA was depleted of ribosomal RNA (rRNA) using a NEBNext® rRNA Depletion Kit (Bacteria) (NEB, #E7850L/X) and library preparation was done using NEBNext® Ultra™ II Directional RNA Library Prep Kit for .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Illumina® (NEB, #E7760S/L), according to the manufacturer’s instructions. The libraries were assessed for quality with agarose gel electrophoresis and quantified with a Promega Quantifluor High Sensitivity dsDNA kit. IPOD-HR pipeline Raw read data from the IPOD, ChIP and INPUT samples were demultiplexed using bcl2fastq2 software (Illumina). The data were then processed using version 2.7.2 of the IPOD-HR pipeline via a singularity container (accessible at https://github.com/freddolino-lab/ipod). The peak calling was performed with version 2.8.1. Briefly, sequences are processed to remove the adapters with cutadapt (4.3 with Python 3.8.6) [88] and low-quality reads were trimmed with trimmomatic (version 0.39) [89], then the processed reads were aligned with bowtie2 (version 2.4.4) [90] to the assembled reference genome of KDS1 strain of V. cholerae in this study. The quality of reads and alignment were assessed with chipqc and fastqc. The reads were quantile normalized and log ratios of IPOD vs INPUT and ChIP-seq vs INPUT were calculated. Further, to obtain occupancy scores without RNA polymerase bound regions, the pipeline subtracts RNA polymerase ChIP-seq normalized scores from the IPOD vs INPUT normalized scores. The pipeline generates bedgraph and narrowpeak files with the normalized scores and we use them for the analysis of data with in-house R, shell and python scripts as described throughout the text. Permutation Test for EPODs and nEPODs A permutation test was performed to identify whether the AT percentage in called EPODs or nEPODs was significantly different from the background (i.e., the remainder of the genome excluding EPODs and nEPODs). We applied the same method separately to EPODs and nEPODs. First, we obtained 1000 randomized EPOD (or nEPOD) distributions containing EPODs of the same length and total number as our real EPODs with bedtools (version 2.30.0) [91], making sure that randomized EPODs excluded the nEPODs, to generate the null distribution of EPODs. For each randomized set of EPOD locations, we then calculated the difference in AT content between the shuffled EPOD locations and the corresponding background, thus obtaining a summary statistic sampled from the null distribution. We compared these values to the difference in AT percentage median between the real EPODs and the real background. The p- .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint value was then calculated as follows: p-value = (# permutations greater than real EPODs + 1)/(1000 permutations + 1) Genomic reference sequence and annotations We obtained an initial genome reference sequence for KDS1 by sequencing high molecular weight genomic DNA purified from the strain. Raw reads are available at the SRA in PRJNA1056466. Initial assembly was performed on the Nanopore reads (Nanopore sequencing performed at SeqCenter in Pittsburgh, Pennsylvania) using NECAT (version 0.0.1 20200803) [92], resulting in two contigs which clearly correspond to the two V. cholerae chromosomes. The assembly was then polished using pilon (version 1.24) [93] using default arguments, with all available reads from INPUT samples (aligned using bowtie2 [90]) as the short read inputs. Manual finishing was performed using a set of iterative rounds of breseq (version 0.37.0) [94] runs to identify remaining discrepancies between the Illumina short read data and the in-progress assembly and resolving them by applying differences with gdtools [94]. We then manually assigned the starting position of each chromosome to match those of commonly used El Tor reference genomes. Annotation of the newly obtained assembly was performed using prokka (version 1.14.5) [95] with arguments --genus Vibrio --species cholerae --accver 2, using a draft genome obtained from the ref. [96] as a reference for potential proteins. We then assigned VC numbers to all identifiable genes matching the reference El Tor strain (EMBL reference sequences AE003852.1 and AE003853.1 for chromosomes 1 and 2, respectively). Annotated genes from the El Tor reference were aligned to those of our prokka-annotated KDS1 assembly, using nucmer (v. 3.1 [97]) with default settings. We further required that for each potential match, the starting and ending positions of the

Reference

El Tor version and the KDS1 version in the identified features differed by no more than 20 nucleotides. RNA-sequencing analysis RNA sequencing raw read data samples were demultiplexed analogously to the above IPOD-HR pipeline. The Illumina adapter sequence from the raw reads were cut with cutadapt (version 4.1 with Python 3.8.6) [88]: .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint cutadapt -j 24 --quality-base=33 -a AGATCGGAAGAGCACACGTCTGAACTCCAGTCA -A AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT Low quality reads were trimmed with trimmomatic (version 0.39) [89]: trimmomatic PE -threads 24 -phred33 -validatePairs TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:14 . The preprocessed reads were pseudoaligned to the transcriptome of KDS1 which was obtained based on our genome assembly/annotation (see above) and quantified with Kallisto (version 0.48.0) [98]. Arguments for indexing: -k 21 –make-unique Arguments for quantitation: -t 4 -b 200 –rf-stranded –bias Differential expression calling of pseudoaligned and quantified RNA-seq reads was performed with the Sleuth R package (version 0.30.1) [99] using Sleuth response error measurement (full) model where we fitted our model by condition ~ batch parameter where condition refers to each genotype: wild type, ΔtsrAΔhns, ΔtsrA, Δhns, and ΔihfA. The Wald test was performed to obtain differential expression values between the condition parameters and the wild type. Downstream data analysis was performed with in-house R and python scripts. The Gene Ontology (GO) terms utilized for the purple density plot above the volcano plot of ΔihfA vs wild type (Figure 2B) were: GO:0006826 GO:0006879 GO:0006880 GO:0010039 GO:0010106 GO:0033212 GO:0033214 GO:0034755 GO:0034756 GO:0034757 GO:0055072 GO:0071281 GO:0097577 GO:0098706 GO:0098711 GO:1901678 GO:0005381 GO:0005506 GO:0015093 GO:0015603, excluding the ChIP-seq-identified Fur regulon genes from ref. [66]. The numbers of genes (55 out of 381) identified as strongly affected in the ΔtsrAΔhns were filtered to only include the genes with log2FC above 3 times the mean of standard error in the log2FC of Δhns (0.9138129) after filtering the genes VC0070(tsrA) and VC1130(hns) that were deleted. Mapping of H-NS ChIP-seq from V. cholerae C6706 to KDS1 strain The SRA read files from H-NS-V5 ChIP-seq and input control from the published study [71] were converted to fastq.gz files with the SRA toolkit (https://github.com/ncbi/sra- tools). The fastq files were then preprocessed to clip the adaptors with cutadapt (4.1 with Python 3.8.6) [88]: .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint cutadapt -j 24 --quality-base=33 -a AGATCGGAAGAGCACACGTCTGAACTCCAGTCA -A AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT We then aligned the preprocessed fastq.gz files to the KDS1 reference genome using bowtie2 (version 2.4.4) [90]: bowtie2 -x reference_index -U fwd_reads.fq.gz -S sam_out -q –local –very-sensitive -p 24 –no-unal –phred33 The sam files were converted to sorted bam files with samtools (version 1.9, using htslib 1.9) [100]. The sorted bam files were quantified using bedtools (version 2.30.0) [91] to obtain aligned and quantified alignment bedgraph files. The quantified bedgraph files were normalized by rescaling the occupancy data so that the position-wise trimmed mean of each track was 100 (after excluding the top and bottom 5% of the positions), and then a pseudocount of 1 was added to each position. The extracted and input samples were each averaged across replicates for data from each experiment; final occupancy traces for the analysis displayed were the log ratios of each extracted sample relative to the corresponding input. iPAGE GO term enrichment analysis iPAGE analysis was performed with version 1.2a. We used iPAGE in discrete mode separately for EPODs and nEPODs, where the input file had two columns: the location, and a number either 1 or 0, where EPODs or nEPODs were assigned to bin 1 and the

Background

was assigned into bin 0. The gene annotation sets were derived by merging current annotations from the Uniprot El Tor proteome (taxon ID 243277; assigned to genes in our new genome following the procedure described above) with those resulting from running ATGO [101] on the called ORFs in our newly derived reference genome. The iPAGE non-default command line arguments were: --exptype=discrete For RNA-sequencing, we used the same iPAGE version but in a continuous mode, where we assigned 7 total bins to separate the GO term classification. The input files for each strain comparison had two columns: gene label and the wald statistic which is the b value divided by se_b from Sleuth RNA-seq analysis. Command line arguments were: --exptype=continuous --ebins=7 --max_p=0.05 The iPAGE on Supplementary Figure 7 was run in discrete mode, where the input file had the following two columns: genes and discrete categories 0 (all of the rest of the genes that were not upregulated in none of the three .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint categories above 3 standard errors), 1 (upregulated in ∆hns∆tsrA that exclude the genes upregulated in ∆hns), 2 (uniquely upregulated in ∆hns), and 3 (uniquely upregulated in ∆tsrA). The iPAGE non-default command line arguments were: --exptype=discrete

References

1. Sack DA, Sack RB, Nair GB, Siddique AK. Cholera. Lancet. 2004;363: 223–233. 2. Ali M, Nelson AR, Lopez AL, Sack DA. Updated global burden of cholera in endemic countries. PLoS Negl Trop Dis. 2015;9: e0003832. 3. Cholera. [cited 10 Apr 2023]. Available: https://www.who.int/news-room/fact-sheets/detail/cholera 4. Ochman H, Lawrence JG, Groisman EA. Lateral gene transfer and the nature of bacterial innovation. Nature. 2000;405: 299–304. 5. Waldor MK, Mekalanos JJ. Lysogenic conversion by a filamentous phage encoding cholera toxin. Science. 1996. pp. 1910–1914. 6. Karaolis DK, Johnson JA, Bailey CC, Boedeker EC, Kaper JB, Reeves PR. A Vibrio cholerae pathogenicity island associated with epidemic and pandemic strains. Proc Natl Acad Sci U S A. 1998;95: 3134–3139. 7. Seed KD, Bodi KL, Kropinski AM, Ackermann H-W, Calderwood SB, Qadri F, et al. Evidence of a dominant lineage of Vibrio cholerae-specific lytic bacteriophages shed by cholera patients over a 10-year period in Dhaka, Bangladesh. MBio. 2011;2: e00334–10. 8. Boyd CM, Angermeyer A, Hays SG, Barth ZK, Patel KM, Seed KD. Bacteriophage ICP1: A Persistent Predator of Vibrio cholerae. Annu Rev Virol. 2021;8: 285–304. 9. O’Hara BJ, Barth ZK, McKitterick AC, Seed KD. A highly specific phage defense system is a conserved feature of the Vibrio cholerae mobilome. PLoS Genet. 2017;13: e1006838. 10. Angermeyer A, Hays SG, Nguyen MHT, Johura F-T, Sultana M, Alam M, .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint et al. Evolutionary Sweeps of Subviral Parasites and Their Phage Host Bring Unique Parasite Variants and Disappearance of a Phage CRISPR-Cas System. MBio. 2021;13: e0308821. 11. McKitterick AC, Seed KD. Anti-phage islands force their target phage to directly mediate island excision and spread. doi:10.1101/218164 12. Hays SG, Seed KD. Dominant Vibrio cholerae phage exhibits lysis inhibition sensitive to disruption by a defensive phage satellite. Elife. 2020;9. doi:10.7554/eLife.53200 13. LeGault KN, Barth ZK, DePaola P, Seed KD. A phage parasite deploys a nicking nuclease effector to inhibit viral host replication. Nucleic Acids Research. 2022. pp. 8401–8417. doi:10.1093/nar/gkac002 14. Boyd CM, Subramanian S, Dunham DT, Parent KN, Seed KD. A Vibrio cholerae viral satellite maximizes its spread and inhibits phage by remodeling hijacked phage coat proteins into small capsids. bioRxiv. 2023. doi:10.1101/2023.03.01.530633 15. Hsueh BY, Severin GB, Elg CA, Waldron EJ, Kant A, Wessel AJ, et al. Phage defence by deaminase-mediated depletion of deoxynucleotides in bacteria. Nat Microbiol. 2022;7: 1210–1220. 16. Jaskólska M, Adams DW, Blokesch M. Two defence systems eliminate plasmids from seventh pandemic Vibrio cholerae. Nature. 2022;604: 323–329. 17. Cohen D, Melamed S, Millman A, Shulman G, Oppenheimer-Shaanan Y, Kacen A, et al. Cyclic GMP–AMP signalling protects bacteria against viral infection. Nature. 2019;574: 691–695. 18. O’Hara BJ, Alam M, Ng W-L. The Vibrio cholerae Seventh Pandemic Islands act in tandem to defend against a circulating phage. PLoS Genet. 2022;18: e1010250. 19. Duan B, Ding P, Navarre WW, Liu J, Xia B. Xenogeneic Silencing and Bacterial Genome Evolution: Mechanisms for DNA Recognition Imply Multifaceted Roles of Xenogeneic Silencers. Mol Biol Evol. 2021;38: 4135–4148. 20. Navarre WW, Porwollik S, Wang Y, McClelland M, Rosen H, Libby SJ, et al. Selective silencing of foreign DNA with low GC content by the H-NS protein in .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Salmonella. Science. 2006;313: 236–238. 21. Amemiya HM, Schroeder J, Freddolino PL. Nucleoid-associated proteins shape chromatin structure and transcriptional regulation across the bacterial kingdom. Transcription. 2021;12: 182–218. 22. Navarre WW, McClelland M, Libby SJ, Fang FC. Silencing of xenogeneic DNA by H-NS—facilitation of lateral gene transfer in bacteria by a defense system that recognizes foreign DNA. Genes Dev. 2007;21: 1456–1471. 23. Boudreau BA, Hron DR, Qin L, van der Valk RA, Kotlajich MV, Dame RT, et al. StpA and Hha stimulate pausing by RNA polymerase by promoting DNA– DNA bridging of H-NS filaments. Nucleic Acids Res. 2018;46: 5525–5546. 24. Shen BA, Hustmyer CM, Roston D, Wolfe MB, Landick R. Bacterial H-NS contacts DNA at the same irregularly spaced sites in both bridged and hemi- sequestered linear filaments. iScience. 2022;25: 104429. 25. Azam TA, Ishihama A. Twelve species of the nucleoid-associated protein from Escherichia coli. Sequence recognition specificity and DNA binding affinity. J Biol Chem. 1999;274: 33105–33113. 26. Sonnenfield JM, Burns CM, Higgins CF, Hinton JCD. The nucleoid- associated protein StpA binds curved DNA, has a greater DNA-binding affinity than H-NS and is present in significant levels in hns mutants. Biochimie. 2001;83: 243– 249. 27. Amemiya HM, Goss TJ, Nye TM, Hurto RL, Simmons LA, Freddolino PL. Distinct heterochromatin-like domains promote transcriptional memory and silence parasitic genetic elements in bacteria. EMBO J. 2022;41: e108708. 28. Gama-Castro S, Salgado H, Santos-Zavaleta A, Ledezma-Tejeida D, Muñiz-Rascado L, García-Sotelo JS, et al. RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond. Nucleic Acids Res. 2016;44: D133–43. 29. Beaufay F, Amemiya HM, Guan J, Basalla J, Meinen BA, Chen Z, et al. Polyphosphate drives bacterial heterochromatin formation. Sci Adv. 2021;7: eabk0233. 30. Wang B, Mittermeier M, Artsimovitch I. RfaH May Oppose Silencing by H- .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint NS and YmoA Proteins during Transcription Elongation. J Bacteriol. 2022;204: e0059921. 31. Picker MA, Karney MMA, Gerson TM, Karabachev AD, Duhart JC, McKenna JA, et al. Localized modulation of DNA supercoiling, triggered by the Shigella anti-silencer VirB, is sufficient to relieve H-NS-mediated silencing. Nucleic Acids Res. 2023;51: 3679–3695. 32. Son B, Patterson-West J, Arroyo-Mendoza M, Ramachandran R, Iben JR, Zhu J, et al. A phage-encoded nucleoid associated protein compacts both host and phage DNA and derepresses H-NS silencing. Nucleic Acids Res. 2021;49: 9229– 9245. 33. Stonehouse E, Kovacikova G, Taylor RK, Skorupski K. Integration host factor positively regulates virulence gene expression in Vibrio cholerae. J Bacteriol. 2008;190: 4736–4748. 34. Stonehouse EA, Hulbert RR, Nye MB, Skorupski K, Taylor RK. H-NS binding and repression of the ctx promoter in Vibrio cholerae. J Bacteriol. 2011;193: 979–988. 35. Nye MB, Pfau JD, Skorupski K, Taylor RK. Vibrio cholerae H-NS silences virulence gene expression at multiple steps in the ToxR regulatory cascade. J Bacteriol. 2000;182: 4295–4303. 36. Faruque SM, Albert MJ, Mekalanos JJ. Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae. Microbiol Mol Biol Rev. 1998;62: 1301–1314. 37. Hu D, Liu B, Feng L, Ding P, Guo X, Wang M, et al. Origins of the current seventh cholera pandemic. Proc Natl Acad Sci U S A. 2016;113: E7730–E7739. 38. Wang H, Ayala JC, Benitez JA, Silva AJ. RNA-Seq Analysis Identifies New Genes Regulated by the Histone-Like Nucleoid Structuring Protein (H-NS) Affecting Vibrio cholerae Virulence, Stress Response and Chemotaxis. PLOS ONE. 2015. p. e0118295. doi:10.1371/journal.pone.0118295 39. Ayala JC, Silva AJ, Benitez JA. H-NS: an overarching regulator of the Vibrio cholerae life cycle. Res Microbiol. 2017;168: 16–25. 40. Wang H, Ayala JC, Benitez JA, Silva AJ. Interaction of the histone-like nucleoid structuring protein and the general stress response regulator RpoS at .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Vibrio cholerae promoters that regulate motility and hemagglutinin/protease expression. J Bacteriol. 2012;194: 1205–1215. 41. McLeod SM, Burrus V, Waldor MK. Requirement for Vibrio cholerae integration host factor in conjugative DNA transfer. J Bacteriol. 2006;188: 5704– 5711. 42. Beaber JW, Hochhut B, Waldor MK. Genomic and functional analyses of SXT, an integrating antibiotic resistance gene transfer element derived from Vibrio cholerae. J Bacteriol. 2002;184: 4259–4269. 43. Wozniak RAF, Fouts DE, Spagnoletti M, Colombo MM, Ceccarelli D, Garriss G, et al. Comparative ICE genomics: insights into the evolution of the SXT/R391 family of ICEs. PLoS Genet. 2009;5: e1000786. 44. Caro F, Caro JA, Place NM, Mekalanos JJ. Transcriptional Silencing by TsrA in the Evolution of Pathogenic Vibrio cholerae Biotypes. mBio. 2020. doi:10.1128/mbio.02901-20 45. DuPai CD, Cunningham AL, Conrado AR, Wilke CO, Davies BW. TsrA Regulates Virulence and Intestinal Colonization in Vibrio cholerae. mSphere. 2020;5. doi:10.1128/mSphere.01014-20 46. Barth ZK, Netter Z, Angermeyer A, Bhardwaj P, Seed KD. A Family of Viral Satellites Manipulates Invading Virus Gene Expression and Can Affect Cholera Toxin Mobilization. mSystems. 2020;5. doi:10.1128/mSystems.00358-20 47. Poulin-Laprade D, Matteau D, Jacques P-É, Rodrigue S, Burrus V. Transfer activation of SXT/R391 integrative and conjugative elements: unraveling the SetCD regulon. Nucleic Acids Res. 2015;43: 2045–2056. 48. LeGault KN, Hays SG, Angermeyer A, McKitterick AC, Johura F-T, Sultana M, et al. Temporal shifts in antibiotic resistance elements govern phage- pathogen conflicts. Science. 2021;373. doi:10.1126/science.abg2166 49. Vora T, Hottes AK, Tavazoie S. Protein occupancy landscape of a bacterial genome. Mol Cell. 2009;35: 247–253. 50. Freddolino PL, Amemiya HM, Goss TJ, Tavazoie S. Correction: Dynamic landscape of protein occupancy across the Escherichia coli chromosome. PLoS Biol. 2022;20: e3001557. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint 51. Beaber JW, Hochhut B, Waldor MK. SOS response promotes horizontal dissemination of antibiotic resistance genes. Nature. 2004;427: 72–74. 52. Burrus V, Marrero J, Waldor MK. The current ICE age: biology and evolution of SXT-related integrating conjugative elements. Plasmid. 2006;55: 173– 183. 53. McKnight SL. Phage λ: A Genetic Switch. Gene Control and Phage λ. Marc Ptashne. Cell Press and Blackwell Scientific, Palo Alto, CA, 1986. x, 128 pp., illus. Paper, $16.95. Science. 1986;233: 1435–1436. 54. Seed KD, Lazinski DW, Calderwood SB, Camilli A. A bacteriophage encodes its own CRISPR/Cas adaptive response to evade host innate immunity. Nature. 2013;494: 489–491. 55. Pal RR, Bag S, Dasgupta S, Das B, Bhadra RK. Functional characterization of the stringent response regulatory gene dksA of Vibrio cholerae and its role in modulation of virulence phenotypes. J Bacteriol. 2012;194: 5638– 5648. 56. Lim B, Beyhan S, Yildiz FH. Regulation of Vibrio polysaccharide synthesis and virulence factor production by CdgC, a GGDEF-EAL domain protein, in Vibrio cholerae. J Bacteriol. 2007;189: 717–729. 57. Mueller RS, McDougald D, Cusumano D, Sodhi N, Kjelleberg S, Azam F, et al. Vibrio cholerae strains possess multiple strategies for abiotic and biotic surface colonization. J Bacteriol. 2007;189: 5348–5360. 58. Acosta N, Pukatzki S, Raivio TL. The Vibrio cholerae Cpx envelope stress response senses and mediates adaptation to low iron. J Bacteriol. 2015;197: 262– 276. 59. Taylor DL, Bina XR, Slamti L, Waldor MK, Bina JE. Reciprocal regulation of resistance-nodulation-division efflux systems and the Cpx two-component system in Vibrio cholerae. Infect Immun. 2014;82: 2980–2991. 60. Dziejman M, Balon E, Boyd D, Fraser CM, Heidelberg JF, Mekalanos JJ. Comparative genomic analysis of Vibrio cholerae: genes that correlate with cholera endemic and pandemic disease. Proc Natl Acad Sci U S A. 2002;99: 1556–1561. 61. O’Shea YA, Finnan S, Reen FJ, Morrissey JP, O’Gara F, Boyd EF. The .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Vibrio seventh pandemic island-II is a 26.9 kb genomic island present in Vibrio cholerae El Tor and O139 serogroup isolates that shows homology to a 43.4 kb genomic island in V. vulnificus. Microbiology. 2004;150: 4053–4063. 62. Goodarzi H, Elemento O, Tavazoie S. Revealing global regulatory perturbations across human cancers. Mol Cell. 2009;36: 900–911. 63. Bergendahl V, Thompson NE, Foley KM, Olson BM, Burgess RR. A cross- reactive polyol-responsive monoclonal antibody useful for isolation of core RNA polymerase from many bacterial species. Protein Expr Purif. 2003;31: 155–160. 64. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19: 1639–1645. 65. Kamp HD, Patimalla-Dipali B, Lazinski DW, Wallace-Gadsden F, Camilli A. Gene fitness landscapes of Vibrio cholerae at important stages of its life cycle. PLoS Pathog. 2013;9: e1003800. 66. Davies BW, Bogard RW, Mekalanos JJ. Mapping the regulon of Vibrio cholerae ferric uptake regulator expands its known network of gene regulation. Proc Natl Acad Sci U S A. 2011;108: 12467–12472. 67. Livny J, Fogel MA, Davis BM, Waldor MK. sRNAPredict: an integrative computational approach to identify sRNAs in bacterial genomes. Nucleic Acids Res. 2005;33: 4096–4105. 68. Bradley ES, Bodi K, Ismail AM, Camilli A. A genome-wide approach to discovery of small RNAs involved in regulation of virulence in Vibrio cholerae. PLoS Pathog. 2011;7: e1002126. 69. Zheng J, Shin OS, Cameron DE, Mekalanos JJ. Quorum sensing and a global regulator TsrA control expression of type VI secretion and virulence in Vibrio cholerae. Proc Natl Acad Sci U S A. 2010;107: 21128–21133. 70. Mey AR, Wyckoff EE, Oglesby AG, Rab E, Taylor RK, Payne SM. Identification of the Vibrio cholerae enterobactin receptors VctA and IrgA: IrgA is not required for virulence. Infect Immun. 2002;70: 3419–3426. 71. Kazi MI, Conrado AR, Mey AR, Payne SM, Davies BW. ToxR Antagonizes H-NS Regulation of Horizontally Acquired Genes to Drive Host Colonization. PLoS .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Pathog. 2016;12: e1005570. 72. Dunham DT, Angermeyer A, Seed KD. The RNA-RNA interactome between a phage and its satellite virus reveals a small RNA that differentially regulates gene expression across both genomes. Mol Microbiol. 2023;119: 515– 533. 73. Bueno E, Sit B, Waldor MK, Cava F. Genetic Dissection of the Fermentative and Respiratory Contributions Supporting Vibrio cholerae Hypoxic Growth. J Bacteriol. 2020;202. doi:10.1128/JB.00243-20 74. UniProt Consortium. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021;49: D480–D489. 75. Zheng W, Wuyun Q, Freddolino PL, Zhang Y. Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15. Proteins. 2023;91: 1684–1703. 76. van Kempen M, Kim SS, Tumescheit C, Mirdita M, Lee J, Gilchrist CLM, et al. Fast and accurate protein structure search with Foldseek. Nat Biotechnol. 2023. doi:10.1038/s41587-023-01773-0 77. Cannon RE. A genetic switch: Gene control and phage λ. Edited by Mark Ptashne. Cambridge, England: Cell Press; and Palo Alto, CA: Blackwell Press, 1986, 128 pp. Developmental Genetics. 1988. pp. 69–69. doi:10.1002/dvg.1020090107 78. Waldor MK, Tschäpe H, Mekalanos JJ. A new type of conjugative transposon encodes resistance to sulfamethoxazole, trimethoprim, and streptomycin in Vibrio cholerae O139. J Bacteriol. 1996;178: 4157–4165. 79. Ceccarelli D, Daccord A, René M, Burrus V. Identification of the origin of transfer (oriT) and a new gene required for mobilization of the SXT/R391 family of integrating conjugative elements. J Bacteriol. 2008;190: 5328–5338. 80. Grandoso G, Avila P, Cayón A, Hernando MA, Llosa M, de la Cruz F. Two active-site tyrosyl residues of protein TrwC act sequentially at the origin of transfer during plasmid R388 conjugation. J Mol Biol. 2000;295: 1163–1172. 81. Llosa M, de la Cruz F. Bacterial conjugation: a potential tool for genomic engineering. Res Microbiol. 2005;156: 1–6. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint 82. Nieto JM, Madrid C, Prenafeta A, Miquelay E, Balsalobre C, Carrascal M, et al. Expression of the hemolysin operon in Escherichia coli is modulated by a nucleoid-protein complex that includes the proteins Hha and H-NS. Mol Gen Genet. 2000;263: 349–358. 83. Solórzano C, Srikumar S, Canals R, Juárez A, Paytubi S, Madrid C. Hha has a defined regulatory role that is not dependent upon H-NS or StpA. Front Microbiol. 2015;6. doi:10.3389/fmicb.2015.00773 84. Miguel-Romero L, Alqasmi M, Bacarizo J, Tan JA, Cogdell RJ, Chen J, et al. Non-canonical Staphylococcus aureus pathogenicity island repression. Nucleic Acids Res. 2022;50: 11109–11127. 85. Tormo-Más MA, Mir I, Shrestha A, Tallent SM, Campoy S, Lasa I, et al. Moonlighting bacteriophage proteins derepress staphylococcal pathogenicity islands. Nature. 2010;465: 779–782. 86. Bdira FB, Erkelens AM, Qin L, Volkov AN, Lippa AM, Bowring N, et al. Novel anti-repression mechanism of H-NS proteins by a phage protein. Nucleic Acids Res. 2021;49: 10770–10784. 87. Dalia AB, McDonough E, Camilli A. Multiplex genome editing by natural transformation. Proc Natl Acad Sci U S A. 2014;111: 8937–8942. 88. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17: 10–12. 89. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114–2120. 90. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. 91. Quinlan AR. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics. 2014;47: 11.12.1–34. 92. Chen Y, Nie F, Xie S-Q, Zheng Y-F, Dai Q, Bray T, et al. Efficient assembly of nanopore reads via highly accurate and intact error correction. Nat Commun. 2021;12: 60. 93. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9: e112963. 94. Deatherage DE, Barrick JE. Identification of mutations in laboratory- evolved microbes from next-generation sequencing data using breseq. Methods Mol Biol. 2014;1151: 165–188. 95. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30: 2068–2069. 96. McKitterick AC, LeGault KN, Angermeyer A, Alam M, Seed KD. Competition between mobile genetic elements drives optimization of a phage- encoded CRISPR-Cas system: insights from a natural arms race. Philos Trans R Soc Lond B Biol Sci. 2019;374: 20180089. 97. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, et al. Versatile and open software for comparing large genomes. Genome Biol. 2004;5: R12. 98. Bray NL, Pimentel H, Melsted P, Pachter L. Erratum: Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34: 888. 99. Pimentel H, Bray NL, Puente S, Melsted P, Pachter L. Differential analysis of RNA-seq incorporating quantification uncertainty. Nat Methods. 2017;14: 687– 690. 100. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10. doi:10.1093/gigascience/giab008 101. Zhu Y-H, Zhang C, Yu D-J, Zhang Y. Integrating unsupervised language model with triplet neural networks for protein gene ontology prediction. PLoS Comput Biol. 2022;18: e1010793. .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted January 1, 2024. ; https://doi.org/10.1101/2023.12.30.573743doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-NC-ND-4.0