Tuning into xanthan: A conserved yet flexible polysaccharide utilization system in Microbacterium

preprint OA: closed CC-BY-NC-ND-4.0
📄 Open PDF Full text JSON View at publisher
Full text 70,909 characters · extracted from oa-pdf · 10 sections · click to expand

Keywords

xanthan, Microbacterium, polysaccharide utilization, regulation, microbial 19 ecosystems, evolution 20 21 Word counts: 22

Abstract

204 words 23 Text: 4,846 words 24 25 26 *Address correspondence to Marion Eisenhut, [email protected] 27 28 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 2

Abstract

29 Bacteria encounter structurally complex extracellular polysaccharides in natural environments, 30 yet the regulatory and evolutionary basis of their utilization remains poorly understood. Here, 31 we isolated a soil-derived Microbacterium strain, named Microbacterium xanthanicum UB-LE1, 32 that grows on xanthan as the sole carbon source. We dissected the genetic and regulatory 33 architecture underlying this capability. Genome sequencing combined with transcriptomic and 34 proteomic profiling uncovered a discrete, strongly inducible regulon associated with xanthan 35 utilization, encoding 23 proteins with five secreted proteins and three candidate transcriptional 36 regulators. DNA-affinity purification sequencing confirmed two regulators binding to operons 37 within the xanthan utilization locus. Comparative genomics across the Microbacteriaceae 38 revealed conserved and lineage-specific features of this system and supports recent 39 acquisition and modular integration of the locus, with at least two predominant architectural 40 variants possibly shaped by substrate availability and ecological specialization. Coordinated 41 induction at both the transcript and protein levels, together with two experimentally validated 42 regulators, points to tight regulatory control of complex polysaccharide degradation in 43 Microbacterium xanthanicum UB-LE1. 44 Together, these findings provide mechanistic and evolutionary insight into how bacteria 45 adapt to complex extracellular carbohydrates, expand current knowledge of xanthan turnover 46 in microbial ecosystems, and establish a framework for exploring the emergence and 47 diversification of specialized polysaccharide utilization pathways across bacterial taxa. 48 49 IMPORTANCE 50 Microorganisms are central drivers of carbon turnover in soils and other terrestrial ecosystems, 51 determining the availability of nutrients and shaping microbial community structure. A 52 significant portion of soil carbon is contained in extracellular polysaccharides, yet the pathways 53 by which microorganisms degrade these complex polymers remain poorly understood. 54 Xanthan, a structurally complex and widely produced microbial exopolysaccharide, represents 55 a persistent and largely overlooked carbon pool. By dissecting the genetic, regulatory, and 56 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 3 evolutionary basis of xanthan utilization in Microbacterium xanthanicum UB-LE1, this study 57 advances our understanding of how soil bacteria adapt to complex extracellular carbohydrates 58 and how substrate availability shapes the emergence and diversification of specialized 59 metabolic pathways. Importantly, the identification of additional xanthan-active enzymes and 60 regulatory components in M. xanthanicum UB-LE1 opens opportunities for targeted 61 modification of xanthan structure and properties, paving the way for new biotechnological 62 applications in food, materials, and industrial biotechnology, while linking microbial ecology to 63 functional innovation. 64 65

Introduction

66 Microorganisms play a central role in shaping carbon turnover in soils and other terrestrial 67 ecosystems through the secretion and degradation of extracellular polymers. Among these 68 polymers, exopolysaccharides (EPS) are secreted into the environment to form biofilms, which 69 provide structural support for microbial communities and act as carbon sources for specialist 70 degraders. Understanding how microorganisms sense, transport, and metabolize structurally 71 complex EPS is essential to uncovering both ecological interactions and the evolution of 72 specialized metabolic pathways. 73 Xanthan is a high-molecular-weight, anionic heteropolysaccharide produced by 74 members of the Xanthomonas genus. It consists of a β-(1,4)-glucan backbone substituted with 75 trisaccharide side chains containing acetylated or pyruvylated mannose and glucuronic acid. 76 In plant-pathogenic Xanthomonas campestris, xanthan contributes to virulence by suppressing 77 plant immune responses, protecting the bacteria from host-derived and environmental 78 stresses, and facilitating adhesion, aggregation, and colonization of the apoplast (1–3). Loss 79 of xanthan biosynthesis attenuates virulence, highlighting its central role in host–pathogen 80 interactions. Beyond its role in plant pathogenesis, xanthan persists in soil for extended 81 periods, with portions reported to resist degradation for several months (4) due to its high 82 molecular weight and complex chemical structure, providing a stable carbon pool for microbial 83 communities. 84 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 4 The structural complexity of xanthan has also driven its widespread industrial use. Its 85 high viscosity and shear-thinning properties make it a ubiquitous additive in food, cosmetics, 86 personal care, and industrial formulations (5, 6). Emerging applications in biomedical 87 engineering (7) and geoengineering (4) highlight the growing relevance of understanding 88 xanthan metabolism and biodegradation. Because it is biodegradable, identifying microbial 89 enzymes that can modify or degrade xanthan has both ecological and biotechnological 90 significance, offering potential avenues for tailored modification of xanthan properties for novel 91 applications. Despite the industrial and ecological relevance of xanthan, the mechanisms by 92 which fungi and bacteria degrade this polysaccharide (8) are only partially understood. 93 Xanthan degradation typically begins with an extracellular xanthan lyase (PL8) that liberates 94 terminal pyruvylated mannose residues, followed by an endoxanthanase (GH9 or GH5 family) 95 that cleaves the backbone into tetrasaccharides (8). Complete turnover requires intracellular 96 enzymes to remove acetyl groups and cleave the resulting oligosaccharides into 97 monosaccharides (8, 9), as well as transport systems to import these products and regulatory 98 proteins to coordinate expression of the catabolic machinery. A variety of xanthan-degrading 99 bacteria have been isolated from diverse environments including soil (10), sludge (11), and 100 mammalian intestines (8). These include Bacillus GL1 (12) and Microbacterium XT11 (13), 101 both of which possess secreted PL8 xanthan lyases, membrane-bound GH9 enzymes, and 102 intracellular GH38 and GH3 enzymes (10), and also include Paenibacillus nannanis (9, 14, 103 15), Bacteroides and Ruminococcus (8), Cohnella sp. 56 (VKM B-36720) (16), and 104 Paenarthrobacter ilicis (17). While many xanthan-degrading enzymes are known, information 105 on their regulatory control is scarce. In other extracellular polysaccharide systems, catabolic 106 genes are frequently organized into co-regulated clusters. For instance, polysaccharide 107 utilization (pul) loci in human gut Bacteroidetes comprise genes for extracellular enzymes, 108 transporters, intracellular enzymes, and regulators (18). Secretion of large or high-molecular-109 weight proteins is energetically costly, and these systems are typically under tight 110 transcriptional control (18). Similarly, gene clusters for pectin degradation in Erwinia 111 chrysanthemi (19) or rhamnogalacturonan degradation in Bacillus subtilis (20) are regulated 112 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 5 by specific transcription factors or two-component systems. However, no such regulatory 113 frameworks have been comprehensively described for xanthan utilization. 114 To investigate how bacteria adapt to structurally complex extracellular carbohydrates, 115 we focused on a newly isolated, soil-derived Microbacterium strain capable of using xanthan 116 as its sole carbon source. This strain, M. xanthanicum UB-LE1, provides a unique system to 117 connect ecological persistence of xanthan with the microbial strategies required for its 118 degradation. By integrating genomic, transcriptomic, and proteomic data, we map the 119 functional components involved in xanthan utilization, including secreted enzymes, 120 intracellular catabolic proteins, and regulatory elements coordinating their expression. 121 Comparative analyses across related Microbacteria indicate that these pathways combine 122 both conserved mechanisms and lineage-specific adaptations, reflecting evolutionary 123 pressures imposed by substrate availability and ecological specialization. Studying M. 124 xanthanicum UB-LE1 thus illuminates how bacteria detect, import, and metabolize complex 125 polysaccharides in natural environments, while also providing a framework for exploiting these 126 systems in biotechnology. 127 128

Results

129 Isolation of Microbacterium xanthanicum UB-LE1 as xanthan-degrading bacterium 130 To isolate a naturally occurring xanthan-degrading bacterium, a topsoil sample from the 131 eastern Teutoburger Forest in Bielefeld, Germany, at approximately 52,01387° N, 8,48334° E 132 was enriched for capable xanthan degraders. After several rounds of single colony restreaking, 133 a single orange colony was isolated and provisionally named UB-LE1. Nanopore sequenced 134 DNA assembled into a full genome as a single contig with a total size of 3,410,522 bp, a GC 135 content of 70.46%, and BUSCO completeness of 99.2% (details of genome assembly and 136 annotation in Laker et al. (Companion paper submitted to MRA). Since UB-LE1 showed the 137 highest average nucleotide identity (ANI) values of 98.22% with Microbacterium sp. Leaf436 138 (Table S2) and TYGS classified UB-LE1 as closest to Microbacterium enclense NIO-1002 139 (details of phylogenetic analysis in Laker et al. (Companion paper submitted to MRA)), the 140 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 6 strain was designated Microbacterium xanthanicum UB-LE1 (in the following M. xanthanicum 141 UB-LE1). 142 M. xanthanicum UB-LE1 is a Gram-positive, rod shaped (1 µm by 3 to 4 µm) bacterium 143 that lacks a flagellum and is non-motile (Fig. S1A). Its growth optimum was detected at 30 °C, 144 pH 7, without addition of NaCl (Fig. S1B-D). Growth experiments in replicates identified M. 145 xanthanicum UB-LE1 as an efficient user of xanthan as the sole carbon source with growth 146 rates on xanthan of µ = 0.0653 + 0.0185 h -1 in comparison to glucose with µ = 0.102 + 0.004 147 h-1 (Fig. 1). 148 Fig. 1. Growth and sampling of M. xanthanicum UB-LE1 for OMICS studies. Growth of M . 149 xanthanicum UB-LE1 in M9 medium with either glucose or xanthan as C-source was performed on a 150 shaker at 30 °C and 180 rpm. Sampling time points for transcriptomic and proteomic analyses are 151 indicated with arrows (blue: glucose sampling; yellow: xanthan sampling). Three biological replicates 152 were grown under each condition and sampled. 153 154 Response to xanthan on transcript and protein level 155 To gain a comprehensive view on M. xanthanicum´s response to xanthan we performed a 156 comparative OMICS approach. Samples for transcriptomic and proteomic (cellular and 157 extracellular samples) analyses were taken after 34 h (+ glucose) and 48 h (+ xanthan), 158 respectively, in the exponential grow phase and at a similar cell density (Fig. 1). 159 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 7 The transcriptome analysis detected 3,085 transcripts with xanthan as the carbon 160 source and 3,093 transcripts with glucose as the carbon source (Table S1: mothertable). 161 Samples were well separated in a principal component analysis (PCA) with 49% variation 162 represented in the first principal component indicating the major source of variation in the 163 transcriptome experiment is indeed the carbon source (Fig. 2A). The proteome analysis of the 164 cellular protein detected 1,285 proteins with xanthan as the carbon source and 1,274 proteins 165 with glucose as the carbon source (Table S1: mothertable). Samples were well separated in a 166 PCA analysis with 51% of variation represented in the first component (Fig. 2B). The genome 167 contained 202 genes with products that were predicted to be secreted by the secretion protein 168 (SP) or twin-arginine translocation (TAT) pathways (Table S1: mothertable). Seventy-nine of 169 these predicted extracellular proteins were detected in the supernatant with xanthan as the 170 carbon source and 56 proteins were detected with glucose as the carbon source. The 171 extracellular proteome samples carried higher variation as only 38% of variation in the first 172 principal component separated both groups (Fig. 2C). 173 Statistical analysis detected 257 transcripts with significantly ( q < 0.01) higher 174 abundance in xanthan-dependent growth compared to glucose and 186 transcripts with 175 significantly lower abundance in xanthan, that is higher abundance in glucose-dependent 176 growth (Fig. 2D). When the 34 transcripts with at least 32-fold induction in xanthan were 177 examined, they included MMX123_00115, MMX123_00472 and -473, MMX123_01228, 178 MMX123_01822 and -1823, genes in a genome region from gene MMX123_02529 to gene 179 MMX123_02549, genes in a genome region from gene MMX123_02601 to MMX123_02608, 180 and genes in a genome region from MMX123_03133 to MMX123_03136 (Fig. 2D). Among the 181 proteins, statistical analysis detected 149 cellular proteins as higher abundant in xanthan 182 compared to glucose and 144 cellular proteins as higher abundant in glucose (Fig. 2E, Table 183 S1: mothertable). When the proteins with fold-changes greater than 32-fold were examined, 184 they included the protein encoded by MMX123_00473, proteins which are encoded by genes 185 positioned between MMX123_02531 and MMX123_02549, MMX123_02679, 186 MMX123_03136, MMX123_03140, and MMX123_03143. Many of these are identical to the 187 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 8 maximally induced transcripts with additions of MMX123_02679 and MMX123_03143. Among 188 the proteins in the supernatant with secretion signals, 17 were detected as significantly higher 189 190 Fig 2. RNA-seq and proteomics analyses. Profiles of M. xanthanicum UB-LE1 grown under two 191 different carbon source conditions. Principal component analysis of (A) transcriptome, (B) cellular 192 proteome, and (C) extracellular proteome profiles from M. xanthanicum UB-LE1 grown on glucose or 193 xanthan as the sole carbon source. Yellow points represent samples grown on glucose medium and 194 TranscriptomeCellular proteomeExtracellular proteome A B C D E F .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 9 blue points represent samples grown on xanthan medium. Biological replicates are marked by their 195 labels. Volcano plot showing differential gene expression (D), differential cellular proteome (E) 196 accumulation and differential extracellular proteome (F) accumulation between xanthan- and glucose-197 grown cells. Each point represents a gene or protein. Not significantly regulated events are colored in 198 gray, significantly xanthan up-regulated events are colored in yellow, significantly glucose up-regulated 199 events are colored in blue. 200 201 abundant in xanthan and 4 as significantly higher abundant in glucose (Fig. 2F). The proteins 202 with fold-change greater than 32-fold included four of the five genes annotated as having 203 secreted protein products in the region MMX123_02528 to gene MMX123_02550 and five 204 additional proteins MMX123_00619, MMX123_02209, MMX123_02445, MMX123_02738, 205 and MMX123_02792. 206 The gene products of region MMX123_02528 - MMX123_02550 were consistently 207 detected as the most induced gene products in xanthan-dependent growth with similar 208 inductions in transcripts and proteins (Fig. S2). 209 210 Analysis of CAZymes and putative sugar utilization loci in the genome 211 In accordance with the focus on xanthan utilization we inspected the genomic repertoire of M. 212 xanthanicum UB-LE1 for carbohydrate active enzymes (CAZymes) and observed their 213 response towards the different carbon sources. In total, 106 genes are annotated as 214 CAZymes. The majority with 52 are glycoside hydrolases (GHs) followed by 39 215 glycosyltransferases (GTs), 5 carboesterases (CEs), 5 auxilliary enzymes, 3 polysaccharide 216 lyases, and 1 carbohydrate binding module (CBM) protein (Fig. S3). Of these 106 annotated 217 CAZymes, 14 CAZymes also contain a secretion signal. Seven of these belong to the GH43 218 family, with five significantly transcriptionally more abundant in xanthan, one significantly more 219 abundant in supernatant protein in xanthan. One of the CAZymes with secretion signal is 220 annotated as a putative cellulase (GH5 family) and is significantly more abundant in the 221 extracellular proteome in xanthan. Two of the 14 CAZymes with secretion signal are increased 222 significantly at least 32-fold on transcriptional and secreted protein level, namely a GH9 family 223 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 10 protein and a PL8 family protein. When pul-loci are defined as those, which contain a GH in 224 close proximity to a transport system consisting of periplasmic binding proteins, 15 pul-loci 225 (pul-locus I – XV) were identified in the M. xanthanicum UB-LE1 genome (Table S1: pul loci). 226 Of these, five contained at least three transcripts with significantly higher abundance in 227 xanthan medium while not any locus contained at least three transcripts with significantly 228 higher abundance in glucose medium. Induction in xanthan ranged from about 8-fold to up to 229 225-fold for a transcript in the locus with the highest induction, that is the locus 230 MMX123_02528 – MMX123_02550 (Table S1: pul loci). 231 232 Identification of the xanthan utilization (xut) locus 233 Apparently, OMICS analyses identified a genome region between gene MMX123_02528 to 234 gene MMX123_02550 as consistently upregulated during growth with xanthan as the carbon 235 source on both, the transcript and protein level (Fig. 2, Fig. S2). A genome-wide feature 236 analysis of GC content and transcript abundance patterns identified the same region 237 (MMX123_02528 - MMX123_02550) as having a reduced GC content compared to the 238 surrounding regions and as carrying genes with higher transcript abundance (Fig. 3A). 239 The genome analysis identified four additional regions longer than 10 kb of reduced 240 GC content relative to the surrounding genome (Fig. 3A, Table S1: lowGC content regions) 241 but none of these overlaps with regions of higher transcript abundance (Fig. 3A). Based on its 242 characteristic combination of low GC content and high expression levels during growth on 243 xanthan, genomic region 4 was designated the xanthan utilization (xut) locus (Fig. 3A, Table 244 S1: lowGC content regions). The xut-locus comprises 23 genes (Fig. 3B) with functional 245 annotations provided in Tables S1 (Table S1: xut_interproscan) and naming in Table S3. The 246 gene xanthan utilization regulator 1, xutR1 (MMX123_02528), encoded on the reverse strand, 247 contains a DeoR-type helix-turn-helix domain and a periplasmic binding domain, suggesting a 248 sugar-responsive transcription factor (TF). The adjacent gene xutA (MMX123_02529) encodes 249 a secreted GH9 enzyme likely capable of cleaving the xanthan backbone following removal of 250 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 11 terminal pyruvylated mannose residues (9). The next gene, xutB (MMX123_02530), encodes 251 a putative 2-hydroxyacid dehydrogenase. 252 253 Fig. 3: The xanthan utilization ( xut) locus and its organization in the genomic context in M. 254 xanthanicum UB-LE1. A) Circular genome map of the UB-LE1 genome. Track 1 displays forward 255 coding sequences (CDS; dark gray) and track 2 shows reverse CDS (light gray). In both tracks, rRNAs 256 are shown in green and tRNAs in brown. Track 3 represents GC content with positive GC content in 257 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 12 black and negative in blue-gray. Track 4 depicts GC skew with positive GC skew in bright red and 258 negative in dark red. Track 5 displays the log2 fold change (log2FC) with high log2FC in yellow and low 259 log2FC in blue. The position of the xut-locus is highlighted in light teal over all tracks, with the ±10 kbp 260 surrounding region shown in cyan. B) Organization of the xut-locus with its gene annotation. Genes are 261 shown as directional arrows, representing gene orientation and their colors denote functional gene 262 annotation. Transcript abundance for each gene is shown as two-colored rectangles below each gene. 263 The first rectangle displays the transcripts per million (TPMs) under growth on glucose and the second 264 rectangle represents the TPMs under xanthan growth, with increasing intensity from white to red 265 indicating higher expression levels. 266 267 Genes xutC1 to xutI (MMX123_02531–02539) form a candidate operon on the forward 268 strand. The first three genes encode a likely ABC transporter system, including a lipid-269 anchored binding protein (xutC1) and membrane-associated transporter components (xutC2–270 C3). The intracellular enzymes XutD (GH3) and XutE (GH38) are similar to proteins, which 271 hydrolyze glucose- and mannose-containing linkages. The downstream genes xutF–xutH 272 encode proteins with predicted roles in sugar modification, including oxidoreductase and 273 isomerase activities, while xutI encodes a glycerate kinase. 274 A second operon on the reverse strand includes xutJ (MMX123_02540), encoding a 275 secreted PL8 xanthan lyase that likely cleaves terminal pyruvylated mannose residues, 276 preceded by a second ABC transporter system ( xutK1–K3, MMX123_02541-02543) and the 277 putative esterase XutL ( MMX123_02544). The regulator XutR2 ( MMX123_02545), a TetR-278 type TF, may act as a repressor. A third operon comprises TF xutR3 (MMX123_02550) and 279 genes xutM1–M3 (MMX123_02546-02548), encoding a third transporter system, and XutN 280 (MMX123_02549), a predicted FAD-dependent oxidoreductase. The organization of the locus 281 suggests at least four promoters are required for full expression (Fig. 3B). 282 283 284 285 286 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 13 Identification of transcriptional regulators for the xut-locus 287 The position of three TFs within the xut-locus suggests they may be regulating the expression 288 of genes within and hence bind some or all the putative transcriptional starts (Fig. 3B). DNA 289 Affinity Purification sequencing (DAP-seq) was used to determine genome-wide binding-sites 290 for the three TF candidates XutR1, XutR2, and XutR3. The three candidates were expressed 291 in vitro with halo-tags and detected on Western Blots (Fig. S4). XutR2 protein was incubated 292 B A xut1-operon xut2-operon xut3-operon XutR3 XutR2 xut1-operon xut2-operon xut3-operon .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 14 Fig. 4: XutR2 and XutR3 bind genome regions in the xut-locus. DAP-seq binding data are shown 293 for A) XutR2 and B) XutR3 across the xut-locus and one adjacent flanking gene on either side. For each 294 TF, the top track depicts per-base read coverage of DAP-seq reads mapped the genome using Bowtie2. 295 The second track shows the corresponding empty-vector control coverage, scaled to the experimental 296 track to facilitate direct comparison of signal-to-noise ratios. The third track displays binding sites 297 identified by MACS3 and is based on the narrowPeak output. The width of the bars indicates the called 298 peak region, while the color intensity reflects the peak score. The bottom track illustrates schematic of 299 the xut-locus genomic architecture. Arrows indicate coding strand orientation, with gene colors 300 corresponding to the functional categories defined in Fig. 3B. Adjacent flanking genes are displayed in 301 light gray. 302 303 with DNA and sequencing of the bound DNA yielded two peaks with read pileups above 8,000 304 reads with a summit at genome position 2,766,454, which puts it 15 bases upstream of itself 305 and 114 bases upstream of the first gene, xutL, in the pentacistronic transcriptional unit within 306 the xut-locus (Fig. 4A). 307 The second peak is located at 1,117,528 in between MMX123_00995 and 308 MMX123_00996, neither of which is changed in transcript abundance in xanthan- or glucose-309 dependent growth. XutR3 yielded one peak at genomic position 2,772,971, which is 35 bases 310 upstream of itself, the first gene in a second pentacistronic operon (Fig. 4B). Additional peaks 311 were not detected. That is, XutR2 controls transcription of the xut2-operon (xutJ - xutL) and 312 XutR3 regulates expression of the xut3-operon (xutM1 – xutR3). Transcriptional control of the 313 large xut1-operon (possibly xutB, xutC1 - xutI) remains elusive. 314 315 Conservation of xut-loci among Microbacterium sp. 316 To identify additional Microbacterium species carrying a xut-locus, proteins encoded by 317 MMX123_02528–MMX123_02550 were queried against 1,206 available Microbacterium 318 genomes. Twelve genomes contained the core xanthan-degrading enzymes GH9 (XutA), GH3 319 (XutD), GH38 (XutE), and the xanthan lyase PL8 (XutJ) (Fig. 5,(10)). Three of these displayed 320 the type I arrangement, characterized by separation of GH3/GH38 from PL8 and conservation 321 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 15 of intervening genes, as observed in M. xanthanicum UB-LE1, the leaf-derived strains M. sp. 322 Leaf436 and M. testaceum StLB037, and the wastewater-derived M. sp S1037. In contrast, 323 M. testaceum DSM 20166 lacked the locus and was unable to grow on xanthan (Fig. S5). 324 325 Fig. 5: Synteny plot of genes belonging to the xut-locus region in different Microbacterium 326 species. Color coding represents functional annotation according to M. xanthanicum UB-LE1 (see Fig. 327 3B). Chromosome-level genome assemblies are shown in bold. Genomic coordinates are listed in Table 328 S4. Assemblies marked with Type I: Gene organization of the xut-locus in Microbacterium species with 329 the xanthan lyase ( xutJ) is oriented opposite to upstream genes, e.g., GH3 ( xutD), GH38 ( xutE). 330 Assemblies marked with Type II: Gene organization of the xut-locus in Microbacterium species with 331 the xanthan lyase ( xutJ) is co-oriented with upstream genes, e.g., GH3 ( xutD), GH38 ( xutE). The xut-332 locus organization in M. testaceum NS220 could not be classified and is marked with a question mark, 333 as the xanthan lyase (xutJ) is located on a separate contig from the other locus genes. 334 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 16 Eight genomes showed the type II arrangement, in which the genes encoding GH3, 335 GH38, and PL8 are co-located on the same strand. Known xanthan degraders, such as 336 Microbacterium sp. XT11 and fully sequenced strains including M. oleivorans I46 carried this 337 organization, whereas strains lacking the locus, such as M. oleivorans DSM 16091 (Fig. S5), 338 were deficient in xanthan utilization. 339 Across both types, the core gene block from x utR1 to xutE is conserved, whereas 340 downstream genes xutF, xutG, and xutH are variably distributed. The xanthan lyase xutJ is 341 present in all strains but differs in genomic context. The second transport system and its 342 regulator (XutR2) are restricted to type I, while the third transport system and XutR3 are 343 consistently present in type I and variably retained in type II strains. 344 345

Discussion

346 The identification of a tightly regulated xanthan utilization (xut) locus in M. xanthanicum UB-347 LE1 provides insight into how soil bacteria adapt to structurally complex and spatially 348 heterogeneous carbon sources. From an eco-evolutionary perspective, xanthan represents a 349 particularly challenging substrate: it is chemically recalcitrant (REFs from introduction) and 350 produced in highly localized biological contexts, for example during biofilm formation by 351 Xanthomonas species (3). The strong and selective induction of the xut-locus in response to 352 xanthan (Fig. 2, Fig.3), in contrast to other polysaccharide utilization (pul) loci (Table S1: pul 353 loci), indicates that M. xanthanicum UB-LE1 follows a strategy of conditional investment into 354 extracellular degradation. This strategy is consistent with the high energetic costs associated 355 with secreting large CAZymes, such as PL8 and GH9, which release diffusible oligo- and 356 monosaccharides(21) into the environment. 357 The presence of multiple, co-regulated transport systems within the same locus (Fig. 358 3B, Fig. 4) likely enhances substrate capture and limits loss of released sugars to competing 359 microorganisms. Such coupling of extracellular hydrolysis with high-affinity uptake may 360 stabilize the producer phenotype and reduce exploitation by non-degrading “cheaters” (22). 361 The genomic potential of M. xanthanicum UB-LE1 further suggests a role as an early colonizer 362 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 17 or primary degrader, analogous to pioneer degraders described in marine microbial 363 communities that initiate polymer breakdown and structure subsequent cross-feeding 364 interactions (23). The occurrence of xut-loci in leaf-associated Microbacterium strains (Fig. 5) 365 indicates that xanthan degraders may also influence plant–microbe interactions by 366 destabilizing pathogen biofilms or by redirecting carbon flow through interference competition 367 (24). Such activity could indirectly benefit plant hosts under pathogen pressure, while 368 simultaneously providing the degrader access to released sugars. Co-regulation of a 369 phenylacetic acid degradation operon may further contribute to competitive fitness in biofilm-370 associated niches, where detoxification of antimicrobial compounds can provide a selective 371 advantage (25). Notably, the broad repertoire of additional sugar transporters encoded by M. 372 xanthanicum UB-LE1 suggests that xanthan utilization is embedded within a metabolically 373 flexible, generalist lifestyle rather than representing a narrow specialization, consistent with 374 the enrichment of CAZymes in plant-associated bacteria (26–28). 375 The genomic architecture of the type I xut-locus (Fig. 5) supports the view that xanthan 376 utilization is an adaptive trait shaped by horizontal gene transfer, inducibility, and modular 377 evolution. The reduced GC content (Fig. 3A), tight clustering of functionally linked genes (Fig. 378 3B), and the presence of cognate transcriptional regulators within the locus (Fig. 3B, Fig. 4) 379 are all consistent with acquisition as a pre-assembled functional module. The organization of 380 the xut-locus parallels the general logic of pul-loci described in Bacteroidetes (29), combining 381 secreted and non-secreted CAZymes with tripartite transport systems. Unlike canonical 382 Bacteroidetes pul-loci (30), however, the xut-locus includes LacI-type regulators, here 383 designated XutR1 and XutR3 (Fig. 3B) that likely function as both sensors and transcriptional 384 regulators via fused substrate-binding domains (Table S1: interproscan). At least one such 385 regulator is conserved across both type I and type II xut-locus arrangements in Microbacterium 386 (Fig. 5), suggesting functional constraints on regulatory architecture. The apparent mobility of 387 this system is further supported by recent reports of xanthan utilization loci encoded on 388 plasmids (17). 389 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 18 Comparison of type I xut-loci in Microbacterium with type II arrangements, in which 390 xanthan-associated genes are dispersed across the genome, suggests that complete xanthan 391 utilization requires auxiliary functions beyond the core degradative enzymes. These include 392 enzymes, such as a 2-hydroxyacid reductase (XutB), a FAD-dependent oxidoreductase 393 (XutN), and additional sugar-modifying activities (XutF, XutG, XutI; Table S3, Fig. 5, Fig. 6) 394 that likely enable assimilation of unusual xanthan-derived sugars into central metabolism (Fig. 395 6). 396 397 Fig. 6: Hypothetical model of xanthan degradation and utilization (type I) in M. xanthanicum UB-398 BL1. Extracellular xanthan is initially processed by a secreted xanthan lyase (PL8 family, XutJ), which 399 removes terminal pyruvylated mannose residues, followed by an endoxanthanase (GH9 family, XutA) 400 that cleaves the β-(1,4)-glucan backbone into xanthan-derived oligosaccharides. These 401 oligosaccharides as well as pyruvylated mannose are captured by surface-associated substrate-binding 402 proteins and imported via dedicated transport systems (XutC1/2/3, XutK1/2/3, XutM1/2/3) encoded 403 within the xut-locus. In the cytoplasm, an esterase (likely XutL) removes acetyl groups (Ac), and 404 glycosidases (GH3 family, XutD and GH38 family, XutE) further hydrolyze the oligosaccharides into 405 monosaccharides glucose (G), mannose (M), glucuronic acid (GA), which are subsequently assimilated 406 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 19 into central carbon metabolism. The proteins XutB, XutF, XutG, XutI, and XutN are likely involved in this 407 metabolic conversion. XutH is a candidate enzyme to cleave pyruvylated (Pyr) mannose. The question 408 mark indicates hypothetical or unspecified functions. Expression of the xanthan utilization machinery is 409 tightly controlled by the substrate-responsive transcriptional regulators XutR2 and XutR3 encoded within 410 the xut-locus. The differently colored suns indicate transcriptional control by the respective XutR 411 transcription factor. Protein functions are not functionally tested but suggested according to interproscan 412 predictions. 413 414 Together, our findings position xanthan utilization as a model system for understanding 415 how bacteria adapt to structurally complex, spatially heterogeneous extracellular 416 carbohydrates in natural environments. The xut-locus in M. xanthanicum UB-LE1 illustrates 417 how modular gene clusters, tight regulatory control, and metabolic integration enable 418 conditional investment into energetically costly extracellular degradation strategies. This 419 architecture links ecological opportunity to evolutionary innovation, highlighting how substrate 420 availability can drive the emergence and diversification of specialized catabolic pathways. 421 Beyond advancing fundamental insight into microbial polysaccharide degradation, our work 422 establishes a framework for exploring and engineering xanthan-active systems. The regulatory 423 logic and enzymatic repertoire uncovered here provide entry points for the targeted 424 modification of xanthan structure and properties, connecting microbial ecology with future 425 biotechnological and material applications. 426 427

Materials and methods

428 Isolation of M. xanthanicum UB-BL1 and culture conditions 429 Details of sample collection and strain isolation of M. xanthanicum UB-LE1 is described in 430 Laker et al. (Companion paper submitted to MRA). 431 M. xanthanicum UB-LE1 was grown at 30 °C and 180 rpm in defined synthetic M9-432 medium (KH2PO4, 3 g L-1; NaCl, 0.5 g L-1; Na2HPO4, 6.78 g L-1; NH4Cl, 1 g L-1) containing 0.001 433 g L- 1 Biotin and Thiamin, 0.12 g L-1 MgSO4, 0.033 g L-1 CaCl2 and trace elements (EDTA, 0.05 434 g L-1; FeCl3 x 6H2O, 0.0083 g L- 1; ZnCl2, 0.00084 g L-1; CuCl2 x 2H2O, 0.00013 g L-1; CoCl2 x 435 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 20 2H2O, 0.00010 g L-1; H3BO3, 0.00010 g L-1). Experiments were conducted with UB-LE1 cells, 436 starting OD OD600 of 0.1, grown on 5 g L -1 xanthan or 6 g L -1 glucose to assess the influence 437 of the carbon sources. To obtain biological replicates, three or four batches of UB-LE1 cells 438 were grown on each carbon source. 439 440 RNA isolation and RNA-seq analysis 441 Cells for RNA isolation were harvested after 34 h (+ glucose) and 48 h (+ xanthan) by 442 centrifugation at 4 °C, 8,000 x g for 5 min. Pellets were frozen in liquid nitrogen and stored at 443 −80 °C. Total RNA was extracted using the Quick RNA miniprep kit (Zymo Research, USA) 444 following the manufacturer’s instructions. RNA quality and quantity were measured using a 445 Bioanalyzer 2100 (Agilent, USA). rRNA was depleted using RNA Depletion llumina® Ribo-446 Zero Plus (Illumina, USA). Libraries were prepared using the TruSeq RNA Library Prep Kit v2 447 kit (Illumina, USA), pooled, and sequenced 100 bp single-end on an Illumina NextSeq 2000 448 instrument (Illumina, USA). 449 All computational analyses were performed using default parameters unless otherwise 450 specified. Transcripts were quantified with Kallisto v0.44 (31) with parameters ‘--single -l 200 -451 s 20’ and the M. xanthanicum UB-LE1 transcript sequences as index). Downstream analyses 452 were conducted in R v4.5.2 https://www.R-project.org/ (32) and plotted with ggplot2 v3.5.2 453 (33). Genes with zero counts across all samples were excluded prior to analyzes. Principle 454 components were determined with prcomp (scale=T) and differential transcript accumulation 455 was determined with edgeR v4.6.3 (34), classic mode) and p-values were corrected with 456 Benjamini-Hochberg. A circular genome map of UB-LE1 plotted with pyCirclize 457 (github.com/moshi4/pyCirclize) visualized gene content, GC content, and genome-wide log2-458 fold changes 459 460 Protein isolation and proteome analysis 461 Samples were centrifuged at 8,000 x g for 5 min, washed with phosphate-buffered saline (PBS) 462 containing cOmplete Protease Inhibitor (Roche, Germany), and stored at -80 °C until 463 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 21 lyophilized. The supernatant fraction was passed through a sterile 0.2 µm filter (Fisher 464 Scientific) and stored at -80 °C until lyophilized. Six to twelve mg of lyophilized material were 465 resuspended in 1 mL of 100 mM ammonium bicarbonate (AmBic; Sigma-Aldrich), disrupted 466 mechanically using a Precellys 24 homogenizer (Bertin Instruments) with 0.1 mm glass beads 467 (BioSpec Products) at 6,500 rpm for 30 s in three cycles, refrozen in liquid nitrogen, and 468 lyophilized. 469 Proteins were reduced and alkylated by sequential treatment with trifluoroethanol 470 (VWR Chemicals), AmBic, 200 µM dithiothreitol (DTT), and 200 µM iodoacetamide (Sigma-471 Aldrich) for 90 min at RT, and again treated with DTT for 60 min at RT. Tryptic digestion was 472 performed by overnight-incubation at 37 °C with sequencing-grade Trypsin Gold (1 µg µL -1; 473 Promega) in AmBic and H2O (1:1). Peptides were purified with filter-aided sample preparation 474 (Wisniewski et al., 2009; Sep-Pak C18 Vac Cartridges) and reconstituted in 98% LC-MS water, 475 2% acetonitrile, and 0.1% trifluoroacetic acid (VWR Chemicals), diluted to 0.5-1.5 µg µL -1. 476 Peptides were analyzed by LC–ESI–MS/MS on a Q Exactive Plus Orbitrap instrument (Thermo 477 Fisher Scientific), linear gradient from 3.2% to 76% acetonitrile with 0.1% formic acid over 478 80 min, positive ion mode from 5 to 92 min with a resolution of 70,000, AGC target of 3 x 10 6 479 ions, maximum injection time (IT) of 64 ms, and a scan range of 350-2000 m/z. Data-480 dependent acquisition selected the top 10 precursor ions with a maximum IT of 100 ms, AGC 481 target of 2 x 105, isolation window of 1.6 m/z, and MS2 resolution of 17,500. 482 Raw data were processed using MaxQuant version 2.5.2 with default parameters 483 except “Match between runs” enabled. Proteins were considered identified if represented by 484 at least two unique peptides detected in a minimum of two biological replicates. For label-free 485 quantification (LFQ), intensity values were log 2-transformed and statistically significant 486 differential accumulation was determined by Welch’s t-test with missing values omitted 487 followed by Benjamini Hochberg correction (35). Secreted peptides were predicted with 488 DeepLocPro in Gram-positive mode (36), and carbohydrate-active enzymes were annotated 489 with dbCAN3 (37). 490 491 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 22 Synteny analysis 492 The protein sequences of UB-LE1 xut-locus genes were queried against 1,206 downloaded 493 proteomes from NCBI (retrieved 16.10.2025 with taxon filter “Microbacterium”; option 494 “annotated genomes only”) using BLASTP (e-value = 60%). Genes located 10 kbp upstream and downstream of the UB-LE1 xut-496 locus were queried with BLASTP against the proteomes with xut-locus proteins with 497 Microbacterium sp. MM2322 as a negative control (GCF_964186585.1; (38)). The xut-locus 498 regions were plotted with pyGenomeViz (https://github.com/moshi4/pyGenomeViz) with color 499 codes based on BLASTP results. 500 501 DNA-affinity purification sequencing (DAP-seq) and analysis 502 UB-LE1 was cultivated on LB agar medium over night at 30 °C. Genomic DNA was isolated 503 using the Nucleospin Microbial DNA Kit (Macherey-Nagel, Germany, including RNase 504 treatment), assessed on a 0.8 % agarose gel, and quantified using a High Sensitivity (HS) 505 dsDNA assay kit on a Qubit 4 fluorometer (Thermo Fisher Scientific, USA). For DNA library 506 preparation, 5 μg genomic DNA was fragmented, end-repaired, A ‑tailed, adapter ligated and 507 amplified from 15 ng following (39). 508 DAP-seq experiments were conducted as described by Schiller et al. (2025). Briefly, 509 coding sequences of genes MMX123_02528 (xutR1), MMX123_02545 (xutR2), and 510 MMX123_02550 (xutR3) were cloned in frame with an N-terminal Halo-tag encoded in the 511 pFN19A T7 SP6 Flexi vector (Promega, USA; discontinued) using Gibson assembly (40). 512 Primer sequences are listed in Table S5. Halo:XutR1, Halo:XutR2, Halo:XutR3, and a 513 barnase(-) vector control were expressed with the TnT SP6 High-Yield Wheat Germ Protein 514 Expression System (Promega, USA) using 2 µg plasmid DNA per reaction. Proteins were 515 bound to Magnet HaloTag beads (Promega, USA) for 1 h and washed three times with a PBS-516 NP40 solution. Expression and bead-binding of the Halo-Tag fusion proteins were verified by 517 western blot analysis with anti-HaloTag monoclonal antibody (Promega, USA) and Goat Anti-518 Mouse IgG H&L (HRP) (Abcam, UK). For DNA binding, 1 ng amplified DNA library and 1 µg 519 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 23 salmon sperm DNA (Thermo Fisher Scientific, USA) were added to the bead-bound proteins 520 and incubated for 1 h at 19 °C before four washes with PBS-NP40 solution and elution. The 521 optimal number of PCR cycles for final amplification was determined by qPCR in duplicate 522 using 1 µL recovered DNA and Luna Universal qPCR Master Mix (New England Biolabs, USA), 523 Final libraries were amplified from 23 µL of recovered DNA using the Illumina TruSeq Strand 524 A primer (5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3') and the i7 8-baseindex 525 primer (5'-CAAGCAGAAGACGGCATACGAGAT-[i7]-GTGACTGGAGTTCAGACGTGT-3') 526 with 15-19 PCR cycles. For sequencing, 5 µL of amplified DNA for each sample were pooled 527 and purified twice with AMPure XP beads (Beckman Coulter, USA) at a bead-to-DNA volume 528 ratio of 1:1. Library size distribution and DNA concentration were assessed using HS DNA 529 Analysis Kit on a Bioanalyzer (Agilent Technologies, USA). and Qubit dsDNA HS Assay Kit 530 (Thermo Fisher Scientific, USA), respectively before sequencing as single ‑end reads on a 531 NextSeq2000 (Illumina, USA). 532 Sequencing data were processed with the automated DAP-Seq analysis pipeline v1.0.1 533 described in (39) using parameters ‘useDuplicates=True subsample=200000’, with the 534 Brassica napus Darmor-bzh v10 genome ( GCA_905183035.1,(41)) used as the random 535 source file. Briefly, raw reads were trimmed with Trimmomatic v0.39 (42) and aligned to the M. 536 xanthanicum genome (Laker et al., Companion paper submitted to MRA) using Bowtie2 v2.5.4 537 (43) Aligned reads were filtered by mapping quality with samtools v1.20 (44) and sambamba 538 v1.0.1 (45) and then randomly subsampled to 200,000 per sample. Peaks were called with 539 MACS3 v3.0.2 (46), using the corresponding empty-vector control for each TF as input control. 540 Peaks with a fold enrichment < 3 relative to background were discarded. Genes were 541 considered bound if a peak summit overlapped with their promoters, defined as the 150 bp 542 region upstream of the transcription start site. 543 Per-base coverage across the xut-locus was obtained from processed Bowtie2 544 alignments using samtools depth and peak locations and scores were extracted from MACS3 545 narrowPeak output. The xut-locus coordinates and peak widths were displayed as 546 gene/feature models using the R package gggenes https://wilkox.org/gggenes/ (47). 547 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 24 DATA AVAILABILITY 548 The RNA-Seq data of the glucose and xanthan growth conditions are available in 549 GenBank/ENA under BioProject accession number PRJEB110862. The GenBank/ENA 550 accession number for the raw RNA-Seq read data under glucose are ERR16928898, 551 ERR16928899, ERR16928900 and under xanthan are ERR16928901, ERR16928902, 552 ERR16928903. 553 The mass spectrometry proteomics data have been deposited to the 554 ProteomeXchange Consortium via the PRIDE (48) partner repository with the dataset 555 identifier PXD069319. 556 Strain M. xanthanicum UB-LE1 has been deposited in the Deutsche Sammlung von 557 Mikroorganismen und Zellkulturen under the accession number DSM 120304 and in the 558 Laboratorium voor Microbiologie, Universiteit Gent, collection under the accession number 559 LMG 34097. 560 561

Acknowledgements

562 We thank Ulrike Harke and Sanne Wiersma for the handling of M. xanthanicum UB-LE1 and 563 the class (2023) of the master module ‘Methods and examples of functional genome research’ 564 at Bielefeld University for experimental support. The authors would like to thank Jungbunzlauer 565 Holding AG for their support and for providing a scholarship to fund this project, as well as for 566 supplying “cell free” xanthan. Additionally, we would like to thank Dietmar Althaus for his aid 567 in obtaining sampling permission. We thank the NGS team of the Bielefeld University Omics 568 CF NGS Unit (in development) and CeBiTec as well as the technical staff of the CeBiTec 569 Technology Platform Genomics, particularly Eva Schulte-Bernd, Yvonne Kutter, and Katharina 570 Hanuschka for technical assistance. 571 572 FUNDING 573 This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research 574 Foundation) – SFB1535 - Project ID 458090666. We acknowledge Bielefeld University core 575 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 25 funding and support for the publication costs by the Open Access Publication Fund of Bielefeld 576 University and the DFG. This work was supported by the BMBF-funded de.NBI Cloud within 577 the German Network for Bioinformatics Infrastructure (de.NBI). 578 579 AUTHOR CONTRIBUTIONS 580 Bianca Laker, Computational data analysis, Manuscript writing | Michael Thomas, Strain 581 isolation, Proteomics, Manuscript writing | Wiebke Weber, Experiments, Data analysis | Prisca 582 Viehoever, RNA-seq, DAP-seq | Anja Meierhenrich, DAP-seq | Levin J. Klages, Genome 583 sequencing, Computational data analysis | Tobias Busche, Genome sequencing, 584 Computational data analysis | Karsten Niehaus, Experimental design, Supervision | Andrea 585 Bräutigam, Experimental design, Data analysis, Supervision, Manuscript writing | Marion 586 Eisenhut, Experimental design, Data analysis, Supervision, Manuscript writing 587 588 AUTHOR ORCIDs 589 Bianca Laker ORCID: 0000-0002-5792-0102 590 Michael Thomas ORCID: 0000-0003-4646-5324 591 Wiebke Weber ORCID: 0009-0007-0721-3163 592 Prisca Viehoever ORCID: 0000-0003-3286-4121 593 Anja Meierhenrich ORCID: 0000-0001-6721-4220 594 Levin J. Klages ORCID: 0000-0003-1634-3705 595 Tobias Busche ORCID: 0000-0001-9211-8927 596 Karsten Niehaus ORCID: 0000-0003-4078-9870 597 Andrea Bräutigam ORCID: 0000-0002-5309-0527 598 Marion Eisenhut ORCID: 0000-0002-2743-8630 599 600 601 602 603 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 26 ADDITIONAL FILES 604 The following material is available online. 605 Supplemental material 606 Table S1: Additional data with mothertable, secreted CAZymes, CAZyme analysis, pul loci, 607 low GC content regions 608 Table S2: Average nucleotide identity (ANI) values for Microbacterium xanthanicum UB-LE1 609 with closely related species. 610 Table S3: Genes of the xut-locus in M. xanthanicum UB-BL1. 611 Table S4: Genomic coordinates of sequence regions used for synteny analyses of the xut-612 locus. 613 Table S5: Primers used during initial PCR for TF candidate amplification. 614 Figure S1: Features of M. xanthanicum UB-LE1. 615 Figure S2: Scatterplot comparing fold-changes between transcriptome and cellular proteome. 616 Figure S3: Pie chart representing the number of proteins assigned to CAZyme classes in M. 617 xanthanicum UB-LE1. 618 Figure S4: Western blotting to prove successful in vitro expression of XutR1, XutR2, and 619 XutR3. 620 Figure S5: Ability of Microbacterium species to utilize xanthan as carbon source. 621 622

References

623 1. Dow JM, Daniels MJ. 1994. Pathogenicity determinants and global regulation of 624 pathogenicity of Xanthomonas campestris pv. campestris. Curr Top Microbiol Immunol 625 192:29–41. 626 627 2. Yun MH, Torres PS, El Oirdi M, Rigano LA, Gonzalez-Lamothe R, Marano MR, 628 Castagnaro AP, Dankert MA, Bouarab K, Vojnov AA. 2006. Xanthan induces plant 629 susceptibility by suppressing callose deposition. Plant Physiol 141:178–87. 630 631 3. Torres PS, Malamud F, Rigano LA, Russo DM, Marano MR, Castagnaro AP, 632 Zorreguieta A, Bouarab K, Dow JM, Vojnov AA. 2007. Controlled synthesis of the DSF 633 cell-cell signal is required for biofilm formation and virulence in Xanthomonas 634 campestris. Environ Microbiol 9:2101–9. 635 636 4. Berninger T, Dietz N, González López Ó. 2021. Water ‑soluble polymers in agriculture: 637 xanthan gum as eco‑friendly alternative to synthetics. Microb Biotechnol 14:1881–638 1896. 639 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 27 640 5. Freire CD, Pinto FR, Almeida D, Gil MM. 2024. Linseed and xanthan gum in algae 641 pasta: textural, sensory and antioxidant characteristics. Int J Food Sci Technol 642 59:5477–5489. 643 644 6. Krystyjan M, Dobosz-Kobędza A, Sikora M, Baranowska HM. 2022. Influence of 645 Xanthan Gum Addition on the Short- and Long-Term Retrogradation of Corn Starches 646 of Various Amylose Content. Polymers (Basel) 14:452. 647 648 7. Singhvi G, Hans N, Shiva N, Kumar Dubey S. 2019. Xanthan gum in drug delivery 649 applications, p. 121–144. In Natural Polysaccharides in Drug Delivery and Biomedical 650 Applications. Elsevier. 651 652 8. Berezina O V., Rykov S V., Schwarz WH, Liebl W. 2024. Xanthan: enzymatic 653 degradation and novel perspectives of applications. Appl Microbiol Biotechnol 654 108:227. 655 656 9. Moroz O V., Jensen PF, McDonald SP, McGregor N, Blagova E, Comamala G, 657 Segura DR, Anderson L, Vasu SM, Rao VP, Giger L, Sørensen TH, Monrad RN, 658 Svendsen A, Nielsen JE, Henrissat B, Davies GJ, Brumer H, Rand KD, Wilson KS. 659 2018. Structural Dynamics and Catalytic Properties of a Multimodular Xanthanase. 660 ACS Catal 8:6021–6034. 661 662 10. Sun Z, Liu H, Wang X, Yang F, Li X. 2019. Proteomic Analysis of the Xanthan-663 Degrading Pathway of Microbacterium sp. XT11. ACS Omega 4:19096–19105. 664 665 11. Muchová M, Růžička J, Julinová M, Doležalová M, Houser J, Koutný M, Buňková L. 666 2009. Xanthan and gellan degradation by bacteria of activated sludge. Water Science 667 and Technology 60:965–973. 668 669 12. Nankai H, Hashimoto W, Miki H, Kawai S, Murata K. 1999. Microbial System for 670 Polysaccharide Depolymerization: Enzymatic Route for Xanthan Depolymerization by 671 Bacillus sp. Strain GL1. Appl Environ Microbiol 65:2520–2526. 672 673 13. Qian F, An L, Wang M, Li C, Li X. 2007. Isolation and characterization of a xanthan-674 degrading Microbacterium sp. strain XT11 from garden soil. J Appl Microbiol 675 102:1362–1371. 676 677 14. Khianngam S, Akaracharanya A, Tanasupawat S, Lee KC, Lee J-S. 2009. 678 Paenibacillus thailandensis sp. nov. and Paenibacillus nanensis sp. nov., xylanase-679 producing bacteria isolated from soil. Int J Syst Evol Microbiol 59:564–568. 680 681 15. Jensen PF, Kadziola A, Comamala G, Segura DR, Anderson L, Poulsen J-CN, 682 Rasmussen KK, Agarwal S, Sainathan RK, Monrad RN, Svendsen A, Nielsen JE, Lo 683 Leggio L, Rand KD. 2019. Structure and Dynamics of a Promiscuous Xanthan Lyase 684 from Paenibacillus nanensis and the Design of Variants with Increased Stability and 685 Activity. Cell Chem Biol 26:191-202.e6. 686 687 16. Han R, Baudrexl M, Ludwig C, Berezina O V., Rykov S V., Liebl W. 2024. 688 Identification of a novel xanthan-binding module of a multi-modular Cohnella sp. 689 xanthanase. Front Microbiol 15:1386552. 690 691 17. Thomas M, Schlüter A, Fjodorova J, Rückert C, Busche T, Niehaus K. 2026. Genomic 692 and proteomic characterization of a newly isolated Paenarthrobacter ilicis strain and 693 its plasmid-mediated xanthan degradation. Microbiol Spectr 14:e0007526. 694 695 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 28 18. McKee LS, La Rosa SL, Westereng B, Eijsink VG, Pope PB, Larsbrink J. 2021. 696 Polysaccharide degradation by the Bacteroidetes: mechanisms and nomenclature. 697 Environ Microbiol Rep 13:559–581. 698 699 19. Reverchon S, Nasser W, Robert ‑Baudouy J. 1991. Characterization of kdgR , a gene 700 of Erwinia chrysanthemi that regulates pectin degradation. Mol Microbiol 5:2203–701 2216. 702 703 20. Hirooka K. 2022. RhgKL and CcpA directly regulate the rhiLFGN-rhgR-yesTUV 704 operon involved in the metabolism of rhamnogalacturonan type I in Bacillus subtilis. 705 Biosci Biotechnol Biochem 86:1383–1397. 706 707 21. Cezairliyan B, Ausubel FM. 2017. Investment in secreted enzymes during nutrient-708 limited growth is utility dependent. Proceedings of the National Academy of Sciences 709 114:E7796–E7802. 710 711 22. Allison SD. 2005. Cheaters, diffusion and nutrients constrain decomposition by 712 microbial enzymes in spatially structured environments. Ecol Lett 8:626–635. 713 714 23. Enke TN, Datta MS, Schwartzman J, Cermak N, Schmitz D, Barrere J, Pascual-García 715 A, Cordero OX. 2019. Modular Assembly of Polysaccharide-Degrading Marine 716 Microbial Communities. Current Biology 29:1528-1535.e6. 717 718 24. Rendueles O, Ghigo J-M. 2015. Mechanisms of Competition in Biofilm Communities. 719 Microbiol Spectr 3. 720 721 25. Hooppaw AJ, McGuffey JC, Di Venanzio G, Ortiz-Marquez JC, Weber BS, Lightly TJ, 722 van Opijnen T, Scott NE, Cardona ST, Feldman MF. 2022. The Phenylacetic Acid 723 Catabolic Pathway Regulates Antibiotic and Oxidative Stress Responses in 724 Acinetobacter. mBio 13:e0186321. 725 726 26. Levy A, Salas Gonzalez I, Mittelviefhaus M, Clingenpeel S, Herrera Paredes S, Miao 727 J, Wang K, Devescovi G, Stillman K, Monteiro F, Rangel Alvarez B, Lundberg DS, Lu 728 T-Y, Lebeis S, Jin Z, McDonald M, Klein AP, Feltcher ME, Rio TG, Grant SR, Doty SL, 729 Ley RE, Zhao B, Venturi V, Pelletier DA, Vorholt JA, Tringe SG, Woyke T, Dangl JL. 730 2018. Genomic features of bacterial adaptation to plants. Nat Genet 50:138–150. 731 732 27. Zhalnina K, Louie KB, Hao Z, Mansoori N, da Rocha UN, Shi S, Cho H, Karaoz U, 733 Loqué D, Bowen BP, Firestone MK, Northen TR, Brodie EL. 2018. Dynamic root 734 exudate chemistry and microbial substrate preferences drive patterns in rhizosphere 735 microbial community assembly. Nat Microbiol 3:470–480. 736 737 28. Vorholt JA. 2012. Microbial life in the phyllosphere. Nat Rev Microbiol 10:828–840. 738 739 29. Grondin JM, Tamura K, Déjean G, Abbott DW, Brumer H. 2017. Polysaccharide 740 Utilization Loci: Fueling Microbial Communities. J Bacteriol 199:199(15):e00860–16. 741 742 30. Terrapon N, Lombard V, Gilbert HJ, Henrissat B. 2015. Automatic prediction of 743 polysaccharide utilization loci in Bacteroidetes species. Bioinformatics 31:647–655. 744 745 31. Bray NL, Pimentel H, Melsted P, Pachter L. 2016. Near-optimal probabilistic RNA-seq 746 quantification. Nat Biotechnol 34:525–527. 747 748 32. R Core Team. 2025. R: A Language and Environment for Statistical Computing. 749 Vienna, Austria. 750 751 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 29 33. Wickham H. 2016. ggplot2. Springer International Publishing, Cham. 752 753 34. Chen Y, Chen L, Lun ATL, Baldoni PL, Smyth GK. 2025. edgeR v4: powerful 754 differential analysis of sequencing data with expanded functionality and improved 755 support for small counts and larger datasets. Nucleic Acids Res 53: gkaf018 756 757 35. Diz AP, Carvajal-Rodríguez A, Skibinski DOF. 2011. Multiple Hypothesis Testing in 758 Proteomics: A Strategy for Experimental Work. Molecular & Cellular Proteomics 759 10:M110.004374. 760 761 36. Nielsen H. 2025. Practical Applications of Language Models in Protein Sorting 762 Prediction: SignalP 6.0, DeepLoc 2.1, and DeepLocPro 1.0. Methods Mol Biol 153–763 175. 764 765 37. Zheng J, Ge Q, Yan Y, Zhang X, Huang L, Yin Y. 2023. dbCAN3: automated 766 carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res 51:W115–767 W121. 768 769 38. Eisenhut M, Schulze T, Matten FNJ, Sengewald A, Böhner P, Falk D, Klinge CV, 770 Langschwager R, Lohoff S, Meyer N, Ponomarchuk V, Pünter F, Sherlock J, Pschik D, 771 Klages LJ, Busche T, Wobbe L, Bräutigam A, Laker B. 2024. Complete genome 772 sequences of Chitinophaga sp. strain MM2321 and Microbacterium sp. strain 773 MM2322, isolated from a sand sample in Harsewinkel, Germany. Microbiol Resour 774 Announc 13: e0070024. 775 776 39. Schiller K, Meierhenrich A, Zenker S, Sielmann LM, Laker B, Bräutigam A. 2025. 777 Optimized DNA affinity purification sequencing determines relative binding affinity of 778 transcription factors. Biol Methods Protoc 10:bpaf082. 779 780 40. Gibson DG, Young L, Chuang R-Y, Venter JC, Hutchison CA, Smith HO. 2009. 781 Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods 782 6:343–345. 783 784 41. Rousseau-Gueutin M, Belser C, Da Silva C, Richard G, Istace B, Cruaud C, Falentin 785 C, Boideau F, Boutte J, Delourme R, Deniot G, Engelen S, de Carvalho JF, 786 Lemainque A, Maillet L, Morice J, Wincker P, Denoeud F, Chèvre A-M, Aury J-M. 787 2020. Long-read assembly of the Brassica napus reference genome Darmor-bzh. 788 Gigascience 9:giaa137 789 790 42. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina 791 sequence data. Bioinformatics 30:2114–2120. 792 793 43. Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat 794

Methods

9:357–359. 795 796 44. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, 797 Durbin R. 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 798 25:2078–2079. 799 800 45. Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. 2015. Sambamba: fast 801 processing of NGS alignment formats. Bioinformatics 31:2032–2034. 802 803 46. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, 804 Myers RM, Brown M, Li W, Liu XS. 2008. Model-based Analysis of ChIP-Seq (MACS). 805 Genome Biol 9:R137. 806 807 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint 30 47. Wilkins D. 2023. gggenes: Draw Gene Arrow Maps in “ggplot2.” R package version 808 0.5.0. 809 810 48. Perez-Riverol Y, Bandla C, Kundu DJ, Kamatchinathan S, Bai J, Hewapathirana S, 811 John NS, Prakash A, Walzer M, Wang S, Vizcaíno JA. 2025. The PRIDE database at 812 20 years: 2025 update. Nucleic Acids Res 53:D543–D553. 813 814 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted April 22, 2026. ; https://doi.org/10.64898/2026.04.20.719765doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-NC-ND-4.0