CAZyme domain architectures suggest fine-scale functional differentiation among anaerobic fungi and bacteria during lignocellulose conversion to volatile fatty acids

preprint OA: closed CC-BY-NC-ND-4.0
📄 Open PDF Full text JSON View at publisher
Full text 76,152 characters · extracted from oa-pdf · 7 sections · click to expand

Abstract

1 Anaerobic fermentation with microbial communities (microbiomes) is an emerging platform for 2 conversion of lignocellulosic biomass to biofuels and bioproducts . The process relies on diverse 3 anaerobic microbes that interact to deconstruct and convert lignocellulosic biomass into a range of 4 products, such as volatile fatty acids (VFAs), which can be achieved by arresting methanogenesis 5 during fermentation. However, defining the distinct functional roles played by various fungi and 6 bacteria during anaerobic biodegradation remains poorly understood. Here, we performed parallel 7 enrichment experiments from cow faeces, goat faeces, and anaerobic digester sludge, selecting for 8 fungal or bacterial dominated communities that convert sorghum biomass into VFAs. 9 Subsequently we reconstructed metabolic networks across the se enrichments based on recovered 10 bacterial metagenome-assembled genomes (MAGs) and fungal isolate genomes and profiled their 11 metabolic activity using metatranscriptomics to identify potential functional niches. Our findings 12 implicate diverse bacteria affiliated with the Bacteroidales and Lachnospiraceae in the direct 13 conversion of lignocellulosic biomass to propionate and butyrate, respectivel y, whereas 14 Neocallimastix-dominated fungal enrichment s converted lignocellulose to lactate, acetate and 15 formate. Analysis of carbohydrate -active enzymes (CAZymes) revealed fine-scale differences 16 between microbes that expressed unique multi-functional enzymes linking two or more CAZymes 17 together with distinct carbohydrate binding motifs , implicating lignocellulose structure as a key 18 driver of selection and niche differentiation. Most of these multi -functional enzymes localized 19 complementary degradation functions together , likely conferring synergistic degradation effects 20 within and between microbiome members . We anticipate that these findings will help inform 21 efforts to develop synthetic microbiomes with tailored functionality for low -cost conversion of 22 lignocellulosic biomass to fuels and bio-based chemicals. 23 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 3

Introduction

1 Biomanufacturing of chemicals from renewable lignocellulosic biomass has the potential to offset 2 society's reliance on fossil fuels. One biomanufacturing strategy involves the use of anaerobic 3 microbial communities (microbiomes) to deconstruct and convert lignocellulosic biomass into 4 biogas (methane and carbon dioxide) via anaerobic digestion [1]. While biogas can be upgraded 5 to produce bioenergy (i.e., renewable natural gas or electricity), its market value is low, often 6 rendering anaerobic digestion not economically viable. Therefore, higher value chemicals, such as 7 volatile fatty acids (VFAs ; $1 -5/kg selling price ) represent an emerging target product for 8 anaerobic digestion that can be achieved by arresting methanogenesis during fermentation. These 9 products can be chemically upgraded to liquid biofuels [2] or used directly in consumer products, 10 antimicrobials or industrial materials [3]. 11 12 Several studies have demonstrated VFA production via arrested methanogenesis from different 13 organic feedstocks [4]. Conversion of lignocellulosic biomass has largely been achieved by 14 anaerobic fermentations using rumen contents as inocula, resulting in acetate, propionate, and 15 butyrate (C2-C4 VFAs) as the main fermentation products in the absence of an external electron 16 donor [5–7]. During fermentation, the microbiomes responsible for lignocellulose deconstruction 17 and conversion have been reported as containing diverse ruminal bacteria [6, 7], with no studies 18 reporting the presence of biomass degrading ruminal fungi , indicating that they are outcompeted 19 by bacteria . While the overall metabolic processes mediated by ruminal bacteria during 20 lignocellulose degradation to VFAs are generally understood, the specific metabolic networks and 21 interactions driving in situ microbiome metabolism remain largely unresolved. Moreover, the 22 contribution and specific metabolic activities expressed by ruminal fungi during lignocellulose 23 deconstruction are only starting to emerge [8]. 24 25 Here, we enriched five parallel anaerobic fungal and bacterial consortia derived from the cow and 26 goat rumen and an anaerobic digester on sorghum substrate in chemically defined media selecting 27 for VFA production. We quantified metabolite and gas production across the enrichments to 28 examine functional differences between bacteria l-dominated consortia and fung al-dominated 29 consortia, as well as inoculum source. Subsequently we reconstructed metabolic networks across 30 the enrichments based on recovered bacterial metagenome -assembled genomes (MAGs) and 31 fungal isolate genomes and profiled their gene expression using metatranscriptomics. Our results 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 4 identify specific organisms, carbohydrate -active enzymes (CAZymes), and metabolic pathways 1 active in the deconstruction and conversion of sorghum biomass to VFAs. These results were used 2 to infer key metabolic interactions responsible for producing specific VFA end products and 3 activities that compete with VFA production. Together, these findings offer molecular -level 4 insights on anaerobic microbiome function and will inform the rational design of microbiomes for 5 improved lignocellulose conversion to valuable chemicals. 6 7

Methods

8 Enrichment and cultivation of anaerobic bacterial and fungal cultures on sorghum biomass 9 Anaerobic consortia enriched for biomass-degrading bacteria and biomass-degrading fungi (herein 10 termed “enrichments”) were cultivated in anaerobic Hungate tubes on 1% (w/w) sorghum (2 mm 11 mill size; Idaho National Laboratory) as a sole substrate in 10 ml of chemically defined M2 media 12 [9] with a ~85% N2, 12% CO2, and 3% H2 headspace and resazurin (0.1% w/v) added as a visual 13 redox indicator. Cultures were initially inoculated inside an anaerobic chamber (AS-580, Anaerobe 14 Systems) with 1 mL of either cow (5 g wet feces in 25 mL M2 media) or goat (3 pellets ~2.5 g in 15 25 mL M2) fecal solutions, or with 0.5 mL of sludge from an anaerobic digester to a final volume 16 of 10 mL M2 media in the presence of either 5 mM chloramphenicol (CH, selecting for fungi) or 17 5 mM 2-bromoethanesulfonate (BES, selecting for bacteria) and passaged every 3 -4 days based 18 on 10% (v/v) transfers. Enrichment cultures were maintained at a pH of 7 -7.5 and a temperature 19 of 39 °C. This resulted in 3 bacterial enrichments from the cow, goat, and digester inocula 20 (cow+BES, goat+BES, digester+BES), and 2 fungal enrichments from the cow and goat inocula 21 (cow+CH, goat+CH). Cow and digester enrichments were passaged for 10 generations (~40 days), 22 while goat enrichments were passaged for 5 generations (~20 days). Analytical m easurements 23 were made on biological triplicate s from each of the 5 enrichments cultures daily over a 4-day 24 period. DNA/RNA samples were collected on Day 3 for metagenomic and metatranscriptomic 25 analysis (see methods below). 26 27 Measurement of fermentation products and total pressure accumulation 28 Total pressure accumulation in the headspace of Hungate tubes was monitored daily using a 29 pressure transducer method as described previously [10]. Quantification of liquid fermentation 30 products, including formate, acetate, propionate, butyrate, valerate, succinate, and lactate, was 31 performed on a 1260 Infinity high-performance liquid chromatography (HPLC) system equipped 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 5 with an Aminex HPX-87H column (part no. 1250140, Bio-Rad) with an inline 0.22 μm filter (Part 1 No. 50671551, Agilent) followed by a Micro -Guard Cation H guard column (Part No. 1250129, 2 Bio-Rad, Hercules, CA, USA) before the analytical column . Samples were prepared by 3 acidification to a concentration of 5 mM H 2SO4, followed by incubation at room temperature for 4 5 min, centrifugation at 21,000 x g for 5 min, and filtration through a 0.22 µm polyethersulfone 5 (PES) membrane filter into HPLC vials with 300 µl polypropylene inserts. Metabolites were 6 separated using a mobile phase of 5 mM sulfuric acid at 50°C with a flow rate of 0.6 ml/min, a run 7 length of 30 min , and an injection volume of 20 μl. Signals were measured using a variable 8 wavelength detector (λ = 210 nm) and s tandard curves for each compound were made at 5 9 concentrations bracketing the range expected in the samples. 10 11 Quantification of hydrogen (H2) and methane (CH4) gas in the headspace of Hungate tubes were 12 analyzed on a Thermo Fisher Scientific TRACE 1300 gas chromatograph (GC) equipped with a 13 TracePLOT™ TG-BOND Msieve 5 A (Part No. 26003 -6100, Thermo Fisher Scientific) and an 14 Instant Connect Pulsed Discharge Detector (PDD) (Part No. 19070014, Thermo Fisher Scientific). 15 100 µL of headspace gas was collected and subsequently purged three times in a 100 µL air -tight 16 syringe and needle. Then, 20 µL of headspace gas was collected and injected directly into the GC. 17 The oven temperature for each run was 30 ºC and the PDD temperature was 150 ºC. High -purity 18 helium (Part No. HE 5.0UH -55, Praxair, Danbury, CT, USA) was further purified with a heated 19 helium purifier (Part No. HP2, VICI) and used as the carrier gas with a flow rate of 0.5 mL/min. 20 The same flushing and analysis procedures were followed for methane and hydrogen standards 21 including 500 ppm H2, 2% H2, 5 % H2, 20% H2, 0.5 % CH4, 1% CH4, 5% CH4, 10% CH4, and 22 20 % CH4 with balance helium (Douglas Fluid & Integration Technology, Prosperity, SC), which 23 were run at each measurement timepoint to account for the PDD baseline that varied slightly each 24 day. 25 26 Extraction and sequencing of DNA and RNA 27 Biomass samples from bacterial enrichment cultures (cow, goat, digester) were collected for DNA 28 and RNA extraction after 72 hours (Day 3) following culture transfer. DNA was collected from 29 bacterial enrichment samples in triplicate, while RNA was collected from bacterial and fungal 30 enrichment samples in duplicate. Hungate tube contents were centrifuged at max speed for 5 min 31 at 4°C and biomass pellets were immediately flash frozen in liquid nitrogen. RNAlater was added 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 6 to samples intended for RNA extraction prior to centrifugation. DNA was extracted from two 1 replicates using the QIAGEN PowerSoil® DNA Isolation Kit (Qiagen, Hilden, Germany) and one 2 replicate using a CTAB protocol. This was done to improve differential coverage binning, where 3 only duplicate DNA samples from the PowerSoil® Kit extraction were used for assessing MAG 4 relative abundance. RNA was extracted from duplicate samples using the QIAGEN RNeasy® Mini 5 Kit with on-column DNAase 1 digestion (Qiagen, Hilden, Germany). DNA and RNA sequencing 6 was performed on the Illumina MiSeq System (v2, 600 cycles) using the Nextera XT DNA Library 7 Preparation Kit (Illumina, CA, USA) and the NEBNext® Ultra II Directional RNA Library Prep 8 Kit at the Biological NanoStructures Lab at UC Santa Barbara. For bacterial metatranscriptomes, 9 rRNA was depleted using the NEBNext® rRNA Depletion Kit (New England Biolabs, MA, USA). 10 For fungal metatranscriptomes, enrichment of mRNA and separation from rRNA was achieved 11 using the NEBNext® Poly(A) mRNA Magnetic Isolation Module. Strand-specific cDNA libraries 12 were prepared with Invitrogen SuperScript II Reverse Transcriptase (Thermo Fisher Scientific, 13 MA, USA). Nucleic acid quantity and quality were determined using a Qubit fluorometer (Thermo 14 Fisher Scientific, MA, USA) and an Agilent TapeStation system (Agilent, CA, USA), respectively. 15 16 Metagenomic assembly, binning, metabolic reconstruction 17 Raw paired-end DNA reads were quality-trimmed and filtered, adapter-trimmed, and contaminant-18 filtered using BBDuk v38.75 and read quality was assessed using FASTQC v0.11.8 . Trimmed 19 reads from the same sample (cow, goat, and digester bacterial enrichments) were co-assembled 20 using MetaSPAades v3.15.3 [11] and mapped to assembled contigs using BBMap v38.75 with 21 minid=0.95. Resulting contigs of at least 1000 bp were binned with MaxBin2 v2.2.7 [12], 22 MetaBAT2 v2.1512 [13], and CONCOCT v1.1.0 [14] and dereplicated using DAS-Tool v1.1.213 23 [15]. Metagenome-assembled genomes (MAGs) were taxonomically classified with GTDB -Tk 24 v2.1.1 (release 9514) [16]. Bacterial MAGs were manually assigned a gram stain based on the 25 taxonomic classification and literature references. Subsequently, gram stain specific prediction of 26 protein product localization for each open reading frame was performed using PSORTb v3.0 [17]. 27 Estimates of MAG completeness and contamination was performed using CheckM v1.1.2 [18]. 28 29 Bacterial MAGs were annotated using the NCBI Prokaryotic Genome Annotation Pipeline version 30 6.10 [40] . Metabolic pathways were annotated across MAGs using gapseq v1.4.0 using the 31 parameters “gapseq find”, “all pathways”, and “bacterial mode ” [41]. Fungal reference genomes 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 7 for the previously isolated and genomically sequenced Neocallimastix lanati [20], 1 Anaeromyces sp. S4 [21], Orpinomyces sp. strain C1A [22], Piromyces sp. E2 [21], 2 Piromyces sp.finn [21], Neocallimastix californiae G1 [21], and Neocallimastix frontalis var. 3 giraffae were retrieved from the Joint Genome Institute MycoCosm database. CAZymes and 4 CAZyme motifs in fungal genomes and bacterial MAGs we re predicted using dbcanLight v1.0.2 5 [23] using ‘cazyme mode’ and ‘sub mode’ options. If multiple distinct CAZyme motifs or substrate 6 targets were predicted for the same open reading frame predictions were concatenated into a single 7 feature. 8 9 Metatranscriptomic analysis 10 Complementary DNA (cDNA) reads were quality filtered as described above for DNA. 11 SortMeRNA [24] was used to remove rRNA sequences using multiple databases for RNA 12 sequences. The remaining non -rRNA reads were used for mapping. RNA reads from fungal 13 enrichment cultures were aligned to a combined fungal reference genome (concatenation of all 14 retrieved fungal reference genomes listed above ) using the bbwrap.sh command from BBMap 15 v38.86 with the usejni parameter set to true, ten threads and all other parameters set to their default 16 values. Bacterial RNA reads were aligned competitively against the total metagenomic assembly 17 using the parameter minid=0.95 and then analyzed per MAG. Aligned reads were counted using 18 the featureCounts command from Subread v2.0.3 using ten threads and all other parameters set to 19 their default values. 20 21 For fungal genomes and bacterial MAGs, read counts from each sequencing sample were 22 normalized for depth and gene length by calculating Transcripts Per Million (TPM) by Equation 23 1. 24 𝑇𝑃𝑀! = "!/$! ∑ ""/$"" × 10& Equation 1 25 26 where TPMi is the TMP for transcript i, Ni is the number of reads mapped to transcript i and Li is 27 the length of transcript i. 28 29 TPM values assigned to the same CAZyme motif, in the same genome and from the same sample 30 (and for bacterial MAGs, the same localization) were summed. The mean TPM values across 31 replicate samples were calculated and used for downstream analysis. For each source (digester, 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 8 cow or goat), bacterial MAGs in the upper quantile of CAZyme expression, and CAZyme features 1 predicted by PSORTb to be extracellular, were retained for differential expression analysis. 2 Similarly, the two fungal genomes with the highest CAZyme expression from either source were 3 retained. Differential expression analysis was performed separately on bacterial MAGs and fungal 4 genomes from the different sources using a Wilcoxon rank sum test (wilcoxoauc function from the 5 presto R-package, v1.0.0). Predicted CAZyme and substrate target features were mapped to the 6 read count data, generating a CAZyme expression matrix. 7 8

Results

9 Anaerobic fungal and bacterial enrichments efficiently degrade lignocellulose to VFAs 10 To establish different fungal and bacterial consortia capable of degrading crude lignocellulose into 11 VFAs, we first cultivated parallel anaerobic enrichment cultures on 1% milled sorghum in 12 chemically defined M2 media (see Methods). Cultures were inoculated anaerobically with either 13 cow or goat fecal pellets, or with sludge from an anaerobic digester in the presence of either 14 chloramphenicol (selecting for fungi) or 2-bromoethanesulfonate (BES, selecting for bacteria) and 15 passaged every 3-4 days over multiple generations (Figure 1 A). In total, inoculation of sorghum 16 media with cow and goat feces produced stable fungal and bacterial enrichments from each source, 17 whereas inoculation with anaerobic digester sludge only produced a bacterial enrichment, 18 suggesting the source digester had low (or no) anaerobic fungal abundance. 19 20 All enrichments were capable of sorghum deconstruction based on large increases in pressure 21 accumulation during 4 -day Hungate tube (i.e., batch) fermentations (Figure 1B). Fermentation 22 products produced by the fungal enrichments were consistent with a bacterial -like mixed acid 23 fermentation pathway previously reported for anaerobic fungi [25]. Both the cow and goat fungal 24 enrichments produced high yields of formate, acetate, lactate, and hydrogen gas (Figure 1C). In 25 comparison, the three bacterial enrichments from the cow, goat, and digester inocula produced 26 acetate, propionate, and butyrate (C2-C4 VFAs) as main fermentation products, as well as a small 27 amount of hydrogen gas (Figure 1C). These results are consistent with previous studies that 28 quantified VFA production during lignocellulose degradation in anaerobic bacterial fermentations 29 [5–7]. 30 31 Genome-centric metatranscriptomics reveals key microbes active across microbiomes 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 9 To identify key microbes responsible for lignocellulose degradation and conversion across the 1 fungal and bacterial enrichments we recovered genomes of individual species and profiled their 2 gene expression using metatranscriptomics. 3 4 A total of 92 MAGs were recovered across the cow, goat, and digester bacterial enrichments 5 affiliated with diverse bacteria belonging to the phyla Actinomycetota, Bacillota, Bacteroidota, 6 Desulfobacterota, Fusobacteriota, Pseudomonadota, and Spirochaetota, and Synergistota. A 7 summary of the genome statistics and taxonomy of these MAGs can be found in Supplementary 8 Table 1. Mapping of DNA and RNA reads back to these genomes revealed that MAGs affiliated 9 with the Lachnosporaceae and Bacteroidetes dominated in abundance and activity across all 10 bacterial enrichments (Figure 2A). In particular, several MAGs seemed to be especially important 11 to microbiome function based on their relative DNA and RNA abundance, including 12 Lachnosporaceae-1C (22% DNA; 30% RNA), Prevotella-1C (12%DNA; 8% RNA), and 13 Anaeroplasma-1C (2% DNA; 7% RNA) from the cow enrichment, Ruminococcaceae-1G (3% 14 DNA; 17% RNA), Lachnosporaceae-4G (12% DNA; 16% RNA), Lachnosporaceae-5G (3% 15 DNA; 8% RNA), Lachnosporaceae-7G (9% DNA; 5%RNA), Bacteroidales-1G (3% DNA; 11% 16 RNA), and Desulfovibrio-2G (1% DNA; 5% RNA) from the goat enrichment, and Bacteroides-17 3D (25% DNA; 27% RNA), Bacteroides-1D (25% DNA; 7% RNA), and Clostridium-1D (9% 18 DNA, 33% RNA) from the digester enrichment (Figure 2A). Interestingly, several of the 19 Bacteroides MAGs across the enrichments (e.g., Bacteroides -1C, Bacteroides -3G, and 20 Bacteroides-2G) had very low relative RNA abundance compared to their relative DNA abundance 21 (Figure 2A). This likely reflects temporal differences in activity across the 4-day batch 22 fermentation as RNA was collected on Day 3 , suggesting these species were more active during 23 the early stages of lignocellulose breakdown. 24 25 To identify active species from the fungal enrichments, RNA reads were mapped back to genomes 26 available at the time from seven isolated anaerobic fungi , including Anaeromyces robustus 27 (Anasp1), Neocallimastix californiae G1 (Neosp1), Neocallimastix frontalis var. giraffae 28 (NeoGfMa1), Neocallimastix lanati (Neolan1), Orpinomyces sp. (Orpsp1), Piromyces sp. E2 29 (PirE2), and Piromyces finnis (Pirfi3). A large fraction of RNA reads mapped successfully to these 30 fungal references (>95%), confirming substantial transcriptional activity from anaerobic fungi in 31 both cow and goat enrichments (Figure 2B ; Supplementary Data 1 ). However, due to extensive 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 10 sequence conservation and gene family expansion among anaerobic fungal genomes, many reads 1 mapped ambiguously across closely related taxa, limiting confident assignment to specific species 2 or genomes. Consequently, downstream analyses focused on functional profiles, especially CAZy 3 family expression, rather than species-resolved transcript abundances. At a higher taxonomic level, 4 read mapping patterns were broadly consistent with transcriptional activity from Neocallimastix-5 dominated enrichments (Figure 2B) , nevertheless high sequence similarity among genomes 6 precluded robust species-level resolution. 7 8 Carbohydrate active enzymes (CAZyme) expression profiles indicate substrate specialization 9 and niche partitioning during lignocellulose degradation 10 Sorghum biomass is predicted to consist of crystalline and amorphous cellulose that interacts with 11 xylan containing frequent and irregular arabinosyl substitutions. The monosaccharide composition 12 of non -cellulosic sorghum components in mole percent is approximately 63% xylose, 17% 13 glucose, 12% arabinose, 4% galactose, 3% galacturonic acid, and 1% glucuronic acid (Gao et al., 14 2020). This lignocellulosic matrix represents the primary carbon and energy substrates available 15 to the enrichment cultures and was expected to drive microbiome structure and function [26]. To 16 link the observed lignocellulose degradation activities to potential enzyme functions we analyzed 17 the expression of carbohydrate active enzymes (CAZymes) across all bacterial and fungal samples. 18 This revealed a wide range of expressed CAZymes, targeting diverse glycosidic linkages present 19 in cellulose, hemicellulose, starch, and pectin, especially cellulase (GH9), cellulose 1,4 -β-20 cellobiosidase (GH48/GH6), and β-glucosidase (GH3) involved in cellulose degradation and endo-21 1,4-β-xylanase (GH10/GH11), α-N-arabinofuranosidase (GH43), xylan 1,4-β-xylosidase (GH43), 22 and α -L-arabinofuranosidase (GH51) involved in hemicellulose degradation ( Supplementary 23 Table 2) that have previously been implicated in lignocellulose breakdown in the cow rumen [27]. 24 25 Across the rumen (cow and goat) and digester enrichments, bacterial MAGs affiliated with 26 Clostridia (Lachnospiraceae, Ruminococcus) and Bacteroidia (Bacteroides, Macellibacteroides, 27 Prevotella) were the main contributors to CAZyme expression (Figure 3), consistent with their 28 high abundance and activity based on read mapping (Figure 2A). In the digester bacterial 29 enrichment, Clostridium-1D was implicated as a key lignocellulose degrader with versatile 30 hydrolytic activities based on express ion of extracellular CAZymes involved in cellulose 31 hydrolysis (GH9), xylan backbone cleavage (GH 10/GH11), arabinoxylan debranching 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 11 (GH43/GH51), and ester bond hydrolysis between ferulic acid and arabinose moieties in 1 arabinoxylan (CE1) (Figure 3, Supplementary Table 3). As all these enzymes were extracellular, 2 byproducts of degradation (xylo-oligosaccharides, sugar monomers, etc.) likely became available 3 to other microbiome members. 4 5 In the goat bacterial enrichment, lignocellulose degradation activities appeared more distributed . 6 Diverse Ruminococcus MAGs (Ruminococcus-1G, Ruminococcus-2G, Ruminococcus-3G) in the 7 goat enrichment dominated CAZyme expression, including a wide array of CAZymes involved in 8 cellulose and arabinoxylan hydrolysis (Figure 3, Supplementary Table 3). The three Ruminococcus 9 MAGs expressed CAZymes with similar glycolsyl hydrolase motifs, such as GH9/GH48 (glucan) 10 and GH10/GH11 (arabinoxylan), but differed in the carbohydrate binding domains. This suggests 11 that in the goat enrichment, Rumminococcus MAGs were broadly competing for arabinoxylan but 12 could coexist through niche specialization which arose from differing modes of access to the 13 lignocellulosic biomass. For example , although all three Ruminococcus MAGs expressed 14 cellulases from the GH9 family, Ruminococcus-1G and Ruminococcus-3G expressed GH9 linked 15 to a CBM4, while Ruminococcus-2G expressed GH9 linked to a CBM37 , indicating that these 16 MAGs bind to differing sites on the glucan fraction of sorghum. 17 18 Lignocellulose degradation activities were expresedsed across an even broader set of taxa in the 19 cow bacterial enrichment, including Lachnospiraceae-1C, Prevotella-1C, and B acteroidales-2C. 20 Lachnospiraceae-1C and Prevotella-1C appeared to be the primary degraders of arabinoxylan, 21 while Cryptobacteroides-2C and Anaeroplasma-1C targeted pectin and starch, respectively. 22 Lachnospiraceae-1C, which had the highest expression of total and extracellular CAZymes , was 23 highly enriched for extracellular GH25 (lysozyme) which targets bacterial cell walls. The 24 simultaneous expression of GH25, and preference for arabinoxylan targets, by Lachnospiracea e-25 1C and Prevotella-1C suggests that the two MAGs were broadly competing for arabinoxylan but 26 may coexist through niche specialization conferred through diverse CBMs and debranching 27 activities. For example, Lachnospiraceae-1C and Prevotella-1C both express CAZymes from the 28 GH10 family, but GH10 CAZymes expressed by Prevotella-1C are further linked to bindin g 29 domains (CBM4, CBM 48) and esterases (CE1), which are not associated with the GH10 30 CAZymes from Lachnospiraceae-1C. 31 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 12 Across bacterial enrichments , several MAGs displayed high expression of multifunctional 1 enzymes consisting of several glycosyl hydrolases , glycosyl hydrolases with carbohydrate 2 esterases, or glycosyl hydrolyses with carbohydrate binding modules (Figure 3, Supplementary 3 Table 4). For example, Clostridium-1D had high expression of a CAZyme encoding two endo-β-4 1,4-xylanases (GH10/GH11) with a carbohydrate esterase (CE1) that may improve hydrolysis of 5 feruloylated arabinoxylan or oligosaccharides, whereas Ruminococcus-1G expressed CAZymes 6 encoding Endo-β-1,4-xylanase (GH10) with a α-L-arabinofuranosidase (GH43) as well as a 7 glucuronoarabinoxylan endo-1,4-beta-xylanase (GH30) with a carbohydrate esterase (CE1) that 8 also likely target branched arabinoxylans (xylan with arabinosyl and feruloyl substitutions). These 9 multifunctional enzymes were MAG specific, especially when comparing gene expression across 10 abundant Lachnospiraceae MAGs recovered from the different enrichments that encoded widely 11 varying lignocellulose hydrolysis capabilities, with Clostridium-1D appearing to be the most 12 robust degrader. Take n together, these results highlight different degradation strategies between 13 bacterial enrichments sourced from diverse inocula (digester sludge, cow feces, goat feces) 14 reflective of niche partitioning and substrate specialization. 15 16 Anaerobic fungal enrichments from the cow and goat samples also had high expression of diverse 17 CAZymes. The repertoire of expressed CAZymes in the goat and cow enrichments were highly 18 similar and primarily targeted arabinoxylan and glucan fractions of the lignocellulosic biomass. 19 This suggested that, although different fungi dominated, the cow and goat fungal enrichments 20 performed similar biochemical functions shaped by sorghum as a substrate. Compared to bacterial 21 enrichments from the same source, the fungal enrichments consistently had a lower diversity of 22 CAZymes, especially glycosyl hydrolases (GHs) (Figure 4) . However, when only consider ing 23 bacterial transcripts predicted to be extracellular, the diversity of GHs expressed by bacteria or 24 fungi became more comparable . Comparing fungal CAZymes and the extracellular fraction of 25 bacterial CAZymes a cross cow and goat sources, the fungal enrichments expressed 38 GHs not 26 expressed by in the bacterial enrichments, including the exo-type cellobiohydrolase, GH6 (Figure 27 4; Supplementary Table 4). Conversely, the bacterial enrichments expressed 11 GHs not expressed 28 by the fungal enrichments, such GH51_2, an α -L-arabinofuranosidase involved in hemicellulose 29 degradation (Figure 4; Supplementary Table 4) . This may reflect different degradation strategies 30 between anaerobic fungi and bacteria that could be complementary, as both are well known to co-31 exist in the rumen. 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 13 1 Across all bacterial and fungal enrichments, genes encoding canonical lignin -depolymerizing 2 enzymes, including lignin peroxidases, manganese peroxidases, and β-etherases, were not 3 detected. Instead, transcripts encoding carbohydrate esterases (CE1) were detected across several 4 abundant bacterial MAGs and anaerobic fungal genomes (Figure 3, Supplementary Data 2 and 3), 5 consistent with the potential to hydrolyze feruloyl- and p-coumaroyl ester linkages associated with 6 sorghum arabinoxylan. Canonical pathways for the subsequent catabolism of released ferulic 7 and/or p-coumaric acids (e.g., feruloyl-CoA synthetase, vanillate O-demethylase, or benzoyl-CoA 8 degradation pathways) were not detected. However, phenylacetate-CoA ligase genes were across 9 multiple bacterial MAGs, and near-complete phenylacetate catabolic pathways were detected in a 10 small number of low -abundance MAGs (Enterobacteriaceae-1D, S higella-1C; Supplementary 11 Data 4). Together, these results indicate that lignin was not extensively depolymerized under the 12 enrichment conditions, but instead functionally circumvented through uncoupling of lignin -13 carbohydrate complexes. Importantly, this analysis does not preclude lignin modification via non-14 canonical or uncharacterized anaerobic mechanisms, as previously demonstrated by direct 15 structural analyses of lignin during anaerobic fungal growth [42]. 16 17 Metatranscriptomic analysis reveals key fermentation pathways and microbes involved in VFA 18 production from lignocellulosic sugars 19 To examine the metabolic networks of different fungal and bacterial consortia degrading 20 lignocellulose into VFAs we reconstructed anaerobic fermentation and respiration pathways across 21 the recovered genomes and analyzed their gene expression. MAGs were assigned to different 22 functional guilds based on gene expression of key enzymes and metabolic pathway responsible for 23 specific substrate utilization and expected fermentation/respiration products. This analysis 24 revealed five major guilds across the bacterial enrichments , including hydrolytic propionate-25 producing bacteria, hydrolytic butyrate-producing bacteria, hydrolytic alcohol-producing bacteria, 26 succinate/lactate producing bacteria, and sulfate-reducing bacteria. 27 28 The most abundant and active MAGs were hydrolytic bacteria involved in complex polysaccharide 29 fermentation to propionate or butyrate, together with acetate and H 2 formation. Propionate 30 production was dominated by MAGs affiliated with the order Bacteroidales, including Prevotella-31 1C and Prevotella-2C from the cow enrichment, Bacteroides-2G, Bacteroidales-1G, Bacteroides-32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 14 1G, and Prevotella-1G from the goat enrichment, and Bacteroides-3D, Bacteroides_1D, and 1 Macellibacteroides-1D from the digester enrichment (Figure 5; Supplementary Dataset 4). These 2 MAGs all expressed enzymes of the methylmalonyl-CoA pathway (Supplementary Dataset) , 3 including a sodium ion pumping methylmalonyl-CoA decarboxylase [28], that couples propionate 4 formation with the generation of an ion motive force that can be used for ATP synthesis. These 5 MAGs also had high expression of pathways for sugar utilization (glucose, xylose, and arabinose) 6 (Figure 5; Supplementary Dataset 1) consistent with their high expression of CAZymes involved 7 in liberating these sugars during cellulose and hemicellulose degradation (Figure 3). 8 9 Butyrate production across the enrichments was dominated by MAGs affiliated with the 10 Lachnospiraceae (Figure 5). All Lachnospiraceae MAGs had high gene expression of the reverse 11 beta-oxidation pathway, including an electron bifurcating acyl-CoA dehydrogenase complex [29]. 12 They also expressed the terminal enzymes phosphate butyryltransferase and butyrate kinase that 13 couples butyrate synthesis with substrate level phosphorylation, except for Clostridium-1D, which 14 instead expressed a butyryl-CoA:acetate CoA transferase [30]. While all Lachnospiraceae MAGs 15 expressed diverse CAZymes (Figure 3) and sugar utilization pathways (Figure 5), Clostridium-1D 16 from the digester enrichment especially stood out among the Lachnospiraceae MAGs as a 17 specialized cellulose and hemicellulose degrad er based on its high copy number and gene 18 expression of 1,4-beta-xylanases, α-L-arabinofuranosidases, and cellulases (Figure 3). 19 20 In addition to propionate and butyrate, s everal MAGs affiliated with the Ruminococcus and 21 Sphaerochaetaceae were predicted to be involved in complex polysaccharide fermentation to 22 ethanol, acetate, and H2 formation. Ruminococcus-1G and Ruminococcus-2G had high expression 23 of a bifunctional acetaldehyde -CoA/alcohol dehydrogenase and NAD-dependent electron -24 bifurcating [FeFe] -hydrogenase (HydABC) , together with CAZymes and substrate utilization 25 pathways involved in both cellulose and hemicellulose degradation (Figures 5; Supplementary 26 Dataset 1 ). This metabolism is consistent with previous batch fermentation results from other 27 Ruminococcus species, such as Ruminococcus albus isolated from the cow rumen [31, 32]. Diverse 28 Sphaerochaetaceae MAGs, including Treponema-1C, Sphaerochaetaceae-1C, Sphaerochaeta-1D, 29 and Sphaerochaetaceae-1C were also implicated in ethanol, acetate, formate, and H 2 formation 30 based on the high gene expression of alcohol dehydrogenase s (iron-containing alcohol 31 dehydrogenase and/or bifunctional acetaldehyde-CoA/alcohol dehydrogenase) and hydrogenases 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 15 (iron-only hydrogenases and/or ferredoxin hydrogenases) (Figure 5; Supplementary Data set 1). 1 Additionally, Sphaerochaetaceae-1G also had high expression of lactate dehydrogenase and 2 hydroxyacid dehydrogenase suggesting they may also produce lactate as an end-product. 3 4 Aside from fermentation pathways , all enrichments contained bacteria affiliated with 5 Desulfovibrio sp. predicted to be capable of anaerobic sulfate respiration to hydrogen sulfide 6 coupled to ethanol, formate, lactate, and H 2 oxidation. In particular, Desulfovibrio-1C, 7 Desulfovibrio-1G, Desulfovibrio-2G, and Nitratidesulfovibrio-1D had high expression of sulfate 8 adenylyltransferase, adenylylsulfate reductase , and dissimilatory sulfite reductase involved in 9 dissimilatory sulfate respiration together with high expression of hydrogenases, formate 10 dehydrogenase, lactate dehydrogenase, alcohol dehydrogenase and aldehyde dehydrogenase 11 (Figure 5; Supplementary Dataset 1), suggesting co-utilization of ethanol, formate, lactate, and H2 12 for energy generation. This is consistent with the availability of these substrates as fermentation 13 by-products (Figure 1). 14 15

Discussion

16 The use of genome -centric metatranscriptomics allowed us to analyze the metabolism and gene 17 expression of 5 different anaerobic enrichment cultures degrading lignocellulosic biomass to 18 VFAs. Our results reveal that across all bacterial enrichments, MAGs affiliated with the order 19 Bacteroidales (Bacteroides and Prevotella) and family Lachnospiraceae were dominant 20 lignocellulose degraders that ferment ed biomass-derived sugars to propi onate and butyrate, 21 respectively, together with acetate and H 2 formation. Other MAGs affiliated with Ruminococcus 22 from the goat bacterial enrichment, and to a lesser extent the family Sphaerochaetaceae across all 23 bacterial enrichments, were also implicated in biomass breakdown to ethanol, lactate, formate, 24 acetate, and H 2 (Ruminococcus, Sphaerochaetaceae), consistent with the metabolism of closely 25 related representative isolates [33, 34]. The resulting fermentation products, especially ethanol, 26 formate, lactate, and H 2 were predicted to be substrates for Desulfovibrio MAGs that performed 27 dissimilatory sulfate reduction across all bacterial enrichments . In the fungal enrichments, 28 lignocellulose degradation was dominated by Neocallimastix sp. that employed a mixed acid 29 fermentation pathway, producing lactate, acetate, ethanol, H 2, and CO 2 as major end products , 30 consistent with the metabolism of representative isolates [20]. Taken together, these results 31 highlight that substrate (sorghum biomass) and chemical inhibitors (chloramphenicol and BES) 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 16 selected for microbiomes with similar overall functional guild structure, despite containing 1 different taxa and starting from disparate inocula. 2 3 Detailed analysis of CAZyme gene expression across the enrichments revealed common 4 biodegradation strategies between diverse bacteria and fungi while also highlighting potential 5 complementarities and fine -scale functional specialization . While all bacterial and fungal 6 enrichments expressed CAZymes necessary for cellulose and hemicellulose degradation, including 7 those involved in cellulose hydrolysis (GH9), arabinoxylan backbone cleavage (GH10/GH11), and 8 arabinoxylan debranching (GH43/GH51/CE1), fine-scale differences between microbes based on 9 multi-functional enzymes linking two or more CAZymes together with diverse CBMs were 10 widespread. Most of these multi-functional enzymes localized complementary degradation 11 functions together, such as xylan backbone cleavage with various debranching activities, likely 12 conferring synergistic degradation effects as shown by purified enzyme studies [35]. The 13 association of these enzymes with diverse CBMs also suggests targeted specialization towards 14 specific lignocellulose structures (e.g., substituted backbones or branched linkages), consistent 15 with recent observations among human gut microbiota [36]. These results implicit fine -scale 16 substrate specialization as a major driver of microbiome assembly, niche partitioning, and species 17 co-existence during anaerobic lignocellulose degradation. 18 19 Hydrolysis and subsequent fermentation of lignocellulosic substrates resulted in a range of 20 fermentation acids shaped by culture conditions. Indeed, it has been shown that the spectrum of 21 fermentation products is highly influenced by both substrate and environmental parameters, such 22 as pH and temperature. In our study , a neutral pH of 7 -7.5 and temperature of 39 °C was 23 maintained, selecting for bacterial enrichments that produced short-chain fatty acids (acetate, 24 propionate, butyrate) and H2 as major fermentation products, and fungal enrichments that produced 25 mixed acid fermentation products (lactate, acetate, formate) and H2. These acids appeared to be 26 directly produced from lignocellulose -derived sugars versus indirectly via lactate as an 27 intermediate, which can result via cross feeding between lactate -producing species (e.g., LAB, 28 Bifidobacteria) and lactate -utilizing species. This may have resulted from having neutral versus 29 mildly acid pH ( ~5-6) conditions, the latter of which has been shown to select for lactate cross-30 feeding. However, general principles on what controls the distribution of fermentation products 31 remains unclear and represents an important knowledge gap to advance anaerobic fermentation as 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 17 a platform for biorefining of specific volatile fatty acids, lactate, alcohols, and other bulk chemicals 1 [37]. 2 3 While further studies are still needed to comprehensively map the functional niches driving 4 anaerobic lignocellulose degradation to VFAs, our results provide important insights on fine-scale 5 substrate specialization and metabolic pathways structuring microbiome assembly and function . 6 We anticipate that these findings will help inform efforts to develop synthetic microbiomes with 7 tailored functionality for low-cost conversion of lignocellulosic biomass to fuels/chemicals [38]. 8 This will require isolating and assembling novel anaerobes , such as those identified here, into 9 defined consortia to eliminate competing metabolic pathways , as well as introducing novel 10 functionalities via metabolic engineering [39]. Such consortia would also serve to elucidate basic 11 principles governing microbiome assembly and function, resulting in robust anaerobic 12 fermentation platforms for sustainable biomanufacturing. 13 14 Conflicts of Interest 15 The authors declare no conflicts of interest. 16 17

Acknowledgements

18 This research was sponsored by the U.S. Department of Energy, Office of Science through grant 19 DE-SC0022142. The work conducted at the Joint BioEnergy Institute was supported by the U.S. 20 Department of Energy, Office of Science, Biological and Environmental Research Program, 21 through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the 22 U.S. Department of Energy. The United States Government retains and the publisher, by accepting 23 the article for publication, acknowledges that the United States Government retains a 24 nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form 25 of this manuscript, or allow others to do so, for United States Government purposes. Any 26 subjective views or opinions that might be expressed in the paper do not necessarily represent the 27 views of the U.S. Department of Energy or the United States Government. 28 29 Data Availability 30 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 18 Raw DNA and cDNA read data and annotated metagenome assembled genomes (MAGs) can be 1 found on the National Center for Biotechnology Information (NCBI) website under BioProject 2 accession no. PRJNA1379681. 3 4 Figure Legends 5 Figure 1 – Fungal and bacterial enrichment cultures converting lignocellulosic biomass to VFAs. 6 A) Enrichment of anaerobic fungal and bacterial consortia from the cow and goat rumen and an 7 anaerobic digester. B) Accumulated gas production (psig) after 4 days and C) VFA, lactate, and 8 H2 production by anaerobic fungal and bacterial enrichments sourced from the cow and goat rumen 9 and an anaerobic digester . VFA and lactate concentrations reported in mg/L, H 2 concentration 10 reported in nanomolar. H2 concentrations for fungal and bacterial enrichments shown on left and 11 right y-axes, respectively. 12 13 Figure 2 - Abundance and activity of bacterial MAGs and fungal isolate genomes recovered from 14 enrichments. Percentage of raw metagenomic (DNA - red) and metatranscriptomic (RNA - pink) 15 reads mapped to recovered MAGs (A) and existing fungal isolate genomes (B). Fungal isolate 16 genomes labels: Anasp1 - Anaeromyces robustus , Neosp - Neocallimastix californiae G1 , 17 NeoGfMa1 - Neocallimastix frontalis var. giraffae , Neolan1 - Neocallimastix lanati , Orpsp1 - 18 Orpinomyces sp., PirE2 - Piromyces sp. E2, Pirfi3 - Piromyces finnis. 19 20 Figure 3 - CAZyme expression across bacterial MAGs from (A) digester enrichment, (B) cow 21 enrichment, (C) goat enrichment. Left panel – RNA expression (transcript per million, TPM) of 22 annotated CAZymes based on predicted subcellular localization using PSORTb v3.0. Right panel 23 – log2 TPM expression across CAZyme motifs per MAG. Target substrate of CAZyme motif 24 (bottom row) predicted using dbCAN3. 25 26 Figure 4 – CAZyme expression across fungal reference genomes and comparison to bacterial 27 MAGs. (A) RNA expression (log 2 TPM) of CAZyme motifs across abundant fungal isolate 28 genomes. Target substrate of CAZyme motif (bottom row) predicted using dbCAN3. (B) Number 29 of motifs annotated per CAZyme family between bacterial MAGs and fungal isolate genomes. AA 30 - Auxiliary Activities, CBM – Carbohydrate Binding Motifs, CE - Carbohydrate Esterases, GH - 31 Glycoside Hydrolases, GT - GlycosylTransferases, PL - Polysaccharide Lyases. 32 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 19 1 Figure 5 - Expression of key metabolic pathways across MAGs . Rows indicate RNA expression 2 (log2[TPM]) of MAG open reading frames annotated as MetaCyc reactions across key metabolic 3 pathways via gapseq. Columns indicate bacterial MAGs from digester enrichment (left), cow 4 enrichment (middle), and goat enrichment (right). 5 6

References

7 1. Sawatdeenarunat C, Surendra KC, Takara D, Oechsner H, Khanal SK. Anaerobic 8 digestion of lignocellulosic biomass: challenges and opportunities. Bioresour Technol 9 2015; 178: 178–186. 10 2. Huq NA, Hafenstine GR, Huo X, Nguyen H, Tifft SM, Conklin DR, et al. Toward net-11 zero sustainable aviation fuel with wet waste-derived volatile fatty acids. Proceedings of 12 the National Academy of Sciences 2021; 118: e2023008118. 13 3. Agnihotri S, Yin D-M, Mahboubi A, Sapmaz T, Varjani S, Qiao W, et al. A Glimpse of 14 the World of Volatile Fatty Acids Production and Application: A review. Bioengineered 15 2022; 13: 1249–1275. 16 4. Stamatopoulou P, Malkowski J, Conrado L, Brown K, Scarborough M. Fermentation of 17 Organic Residues to Beneficial Chemicals: A Review of Medium-Chain Fatty Acid 18 Production. Processes 2020; 8. 19 5. Weimer PJ, Nerdahl M, Brandl DJ. Production of medium-chain volatile fatty acids by 20 mixed ruminal microorganisms is enhanced by ethanol in co-culture with Clostridium 21 kluyveri. Bioresour Technol 2015; 175: 97–101. 22 6. Agematu H, Takahashi T, Hamano Y. Continuous volatile fatty acid production from 23 lignocellulosic biomass by a novel rumen-mimetic bioprocess. J Biosci Bioeng 2017; 24 124: 528–533. 25 7. Fonoll X, Shrestha S, Khanal SK, Dosta J, Mata-Alvarez J, Raskin L. Understanding the 26 Anaerobic Digestibility of Lignocellulosic Substrates Using Rumen Content as a 27 Cosubstrate and an Inoculum. ACS ES&T Engineering 2021; 1: 424–435. 28 8. Hagen LH, Brooke CG, Shaw CA, Norbeck AD, Piao H, Arntzen MØ, et al. Proteome 29 specialization of anaerobic fungi during ruminal degradation of recalcitrant plant fiber. 30 ISME J 2021; 15: 421–434. 31 9. Teunissen MJ, den Camp HJM, Orpin CG, in ‘t Veld JHJ, Vogels GD. Comparison of 32 growth characteristics of anaerobic fungi isolated from ruminant and non-ruminant 33 herbivores during cultivation in a defined medium. Microbiology (N Y) 1991; 137: 1401–34 1408. 35 10. Haitjema CH, Solomon K V, Henske JK, Theodorou MK, O’Malley MA. Anaerobic gut 36 fungi: Advances in isolation, culture, and cellulolytic enzyme discovery for biofuel 37 production. Biotechnol Bioeng 2014; 111: 1471–1482. 38 11. Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes : a new versatile 39 metagenomic assembler. Genome Res 2017; 27: 824–834. 40 12. Wu Y-W, Tang Y-H, Tringe SG, Simmons B a, Singer SW. MaxBin: an automated 41 binning method to recover individual genomes from metagenomes using an expectation-42 maximization algorithm. Microbiome 2014; 2: 26. 43 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 20 13. Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately 1 reconstructing single genomes from complex microbial communities. PeerJ 2015; 3: 2 e1165. 3 14. Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, et al. Binning 4 metagenomic contigs by coverage and composition. Nat Methods 2014; 11: 1144–1146. 5 15. Sieber CMK, Probst AJ, Sharrar A, Thomas BC, Hess M, Tringe SG, et al. Recovery of 6 genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat 7 Microbiol 2018; 3: 836–843. 8 16. Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify 9 genomes with the Genome Taxonomy Database. Bioinformatics 2020; 36: 1925–1927. 10 17. Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, et al. PSORTb 3.0: improved 11 protein subcellular localization prediction with refined localization subcategories and 12 predictive capabilities for all prokaryotes. Bioinformatics 2010; 26: 1608–1615. 13 18. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the 14 quality of microbial genomes recovered from isolates, single cells, and metagenomes. 15 Genome Res 2015; 25: 1043–55. 16 19. Konwar KM, Hanson NW, Bhatia MP, Kim D, Wu SJ, Hahn AS, et al. MetaPathways 17 v2.5: Quantitative functional, taxonomic and usability improvements. Bioinformatics 18 2015; 31: 3345–3347. 19 20. Elmo W St., M. MJ, A. LP, E. LC, S. LT, Susanna S, et al. Experimentally Validated 20 Reconstruction and Analysis of a Genome-Scale Metabolic Model of an Anaerobic 21 Neocallimastigomycota Fungus. mSystems 2021; 6: e00002-21. 22 21. Haitjema CH, Gilmore SP, Henske JK, Solomon K V., De Groot R, Kuo A, et al. A parts 23 list for fungal cellulosomes revealed by comparative genomics. Nat Microbiol 2017; 2: 1–24 8. 25 22. H. YN, B. CM, G. SC, S. LA, A. PR, Z. NF, et al. The Genome of the Anaerobic Fungus 26 Orpinomyces sp. Strain C1A Reveals the Unique Evolutionary History of a Remarkable 27 Plant Biomass Degrader. Appl Environ Microbiol 2013; 79: 4620–4634. 28 23. Zheng J, Ge Q, Yan Y, Zhang X, Huang L, Yin Y. dbCAN3: automated carbohydrate-29 active enzyme and substrate annotation. Nucleic Acids Res 2023; 51: W115–W121. 30 24. Kopylova E, Noé L, Touzet H. SortMeRNA: Fast and accurate filtering of ribosomal 31 RNAs in metatranscriptomic data. Bioinformatics 2012; 28: 3211–3217. 32 25. Boxma B, Voncken F, Jannink S, Van Alen T, Akhmanova A, Van Weelden SWH, et al. 33 The anaerobic chytridiomycete fungus Piromyces sp. E2 produces ethanol via 34 pyruvate:formate lyase and an alcohol dehydrogenase E. Mol Microbiol 2004; 51: 1389–35 1399. 36 26. Lindemann SR. A piece of the pie: engineering microbiomes by exploiting division of 37 labor in complex polysaccharide consumption. Curr Opin Chem Eng 2020; 30: 96–102. 38 27. Comtet-Marre S, Parisot N, Lepercq P, Chaucheyras-Durand F, Mosoni P, Peyretaillade 39 E, et al. Metatranscriptomics Reveals the Active Bacterial and Eukaryotic Fibrolytic 40 Communities in the Rumen of Dairy Cow Fed a Mixed Diet . Frontiers in Microbiology . 41 2017. , 8 42 28. Hilpert W, Dimroth P. Conversion of the chemical energy of methylmalonyl-CoA 43 decarboxylation into a Na+gradient. Nature 1982; 296: 584–585. 44 29. Fuli L, Julia H, Henning S, Jin Z, Wolfgang B, K. TR. Coupled Ferredoxin and Crotonyl 45 Coenzyme A (CoA) Reduction with NADH Catalyzed by the Butyryl-CoA 46 Dehydrogenase/Etf Complex from Clostridium kluyveri. J Bacteriol 2008; 190: 843–850. 47 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 21 30. Louis P, Young P, Holtrop G, Flint HJ. Diversity of human colonic butyrate-producing 1 bacteria revealed by analysis of the butyryl-CoA:acetate CoA-transferase gene. Environ 2 Microbiol 2010; 12: 304–314. 3 31. Zheng Y, Kahnt J, Kwon IH, Mackie RI, Thauer RK. Hydrogen formation and its 4 regulation in Ruminococcus albus: involvement of an electron-bifurcating [FeFe]-5 hydrogenase, of a non-electron-bifurcating [FeFe]-hydrogenase, and of a putative 6 hydrogen-sensing [FeFe]-hydrogenase. J Bacteriol 2014; 196: 3840–3852. 7 32. Iannotti EL, Kafkewitz D, Wolin MJ, Bryant MP. Glucose fermentation products in 8 Ruminococcus albus grown in continuous culture with Vibrio succinogenes: changes 9 caused by interspecies transfer of H 2 . J Bacteriol 1973; 114: 1231–1240. 10 33. Paster BJ, Canale-Parola E. Physiological diversity of rumen spirochetes. Appl Environ 11 Microbiol 1982; 43: 686–693. 12 34. Paster BJ, Canale-Parola E. Treponema saccharophilum sp. nov., a large pectinolytic 13 spirochete from the bovine rumen. Appl Environ Microbiol 1985; 50: 212–219. 14 35. Sørensen HR, Jørgensen CT, Hansen CH, Jørgensen CI, Pedersen S, Meyer AS. A novel 15 GH43 α-l-arabinofuranosidase from Humicola insolens: mode of action and synergy with 16 GH51 α-l-arabinofuranosidases on wheat arabinoxylan. Appl Microbiol Biotechnol 2006; 17 73: 850–861. 18 36. Yao T, Deemer DG, Chen M-H, Reuhs BL, Hamaker BR, Lindemann SR. Differences in 19 fine arabinoxylan structures govern microbial selection and competition among human gut 20 microbiota. Carbohydr Polym 2023; 316: 121039. 21 37. Regueira A, Rombouts JuliusL, Wahl SA, Mauricio-Iglesias M, Lema JM, Kleerebezem 22 R. Resource allocation explains lactic acid production in mixed-culture anaerobic 23 fermentations. Biotechnol Bioeng 2021; 118: 745–758. 24 38. Lynd LR, Beckham GT, Guss AM, Jayakody LN, Karp EM, Maranas C, et al. Toward 25 low-cost biological and hybrid biological/catalytic conversion of cellulosic biomass to 26 fuels. Energy Environ Sci 2022; 15: 938–990. 27 39. Lawson CE. Retooling Microbiome Engineering for a Sustainable Future. mSystems 2021; 28 6: 10.1128/msystems.00925-21. 29 40. Caspi R, Billington R, Keseler IM, Kothari A, Krummenacker M, Midford PE, Ong WK, 30 Paley S, Subhraveti P, Karp PD. The MetaCyc database of metabolic pathways and 31 enzymes: a 2019 update. Nucleic Acids Res 2020; 48: D445–D453 32 41. Zimmermann J, Kaleta C, Waschina S. gapseq: informed prediction of bacterial metabolic 33 pathways and reconstruction of accurate metabolic models. Genome Biol 2021; 22: 81. 34 42. Lankiewicz, T.S., Choudhary, H., Gao, Y. et al. Lignin deconstruction by anaerobic fungi. 35 Nat Microbiol 2023 8, 596–610. https://doi.org/10.1038/s41564-023-01336-8 36 37 38 39 40 41 42 43 44 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 22 1 Figure 1. Fungal and bacterial enrichment cultures converting lignocellulosic biomass to VFAs. 2 3 4 5 6 7 0 200 400 600 800 0 1 2 3 4 Butyrate 0 4000 8000 12000 0 1 2 3 4 H2 0 500 1000 1500 0 1 2 3 4 Days Lactate 0 300 600 900 0 1 2 3 4 Conc. (mg/L) Formate 300 600 900 1200 0 1 2 3 4 Acetate 0 200 400 600 0 1 2 3 4 Propionate 0 400 800 1200 nM Sample Cow+BES Digester+BES Goat+BES Cow+Ch Goat+Ch control 0 3 6 9 12 Control Cow +BES Goat +BES Digester +BES Cow +Ch Goat +Ch Pressure Accumulation (psig) Fungal Enrichments Bacterial Enrichments Cow Goat No fungi from digester Cow Goat Digester A B C After 4 Days .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 23 1 Figure 2. Abundance and activity of bacterial MAGs and fungal isolate genomes recovered from 2 enrichments 3 Goat Fungi Cow Fungi 0 20 40 60 80 0 20 40 60 Neolan1 NeoGfMa1 Neospfc1 Neosp1 Orpspfg1 Orpsp1 PirE2 Anasp1 Pirfi3 Neolan1 NeoGfMa1 Neospfc1 Neosp1 Orpspfg1 Orpsp1 PirE2 Anasp1 Pirfi3 % Reads Mapped DNA RNA Digester Bacteria Goat Bacteria Cow Bacteria 0 10 20 30 0 5 10 15 20 0 10 20 30 40 Desulfovibrio_1_D Thomasclavelia_1_D Clostridium_2_D Nitratidesulfovibrio_1_D Fusobacterium_1_D Streptococcus_1_D Escherichia_1_D Lacrimispora_1_D Eggerthella_1_D Phascolarctobacterium_1_D Enterococcus_1_D Oscillibacter_1_D Kineothrix_1_D Aminobacterium_1_D Bacteroides_2_D Enterocloster_1_D Dethiosulfovibrio_1_D Sphaerochaeta_1_D Avimicrobium_1_D Scatacola_1_D Macellibacteroides_1_D Bacteroides_1_D Bacteroides_3_D Clostridium_1_D Escherichia_1_G Lachnospiraceae_3_G Streptococcus_1_G Bacilli_1_G Phascolarctobacterium_1_G Fimenecus_1_G Cryptobacteroides_2_G Ventricola_1_G Bacteroides_1_G Bacteroidaceae_1_G Cryptobacteroides_1_G Slackia_1_G Enteromonas_1_G Bacteroides_3_G Bacteroides_2_G Ruminococcaceae_4_G Sphaerochaetaceae_1_G Desulfovibrio_1_G Ruminococcaceae_3_G Anaeroplasma_1_G Pyramidobacter_1_G Lachnospiraceae_1_G Lachnospiraceae_2_G Prevotella_1_G Lachnospiraceae_6_G Ruminococcaceae_2_G Lachnospiraceae_7_G Desulfovibrio_2_G Lachnospiraceae_5_G Bacteroidales_1_G Lachnospiraceae_4_G Ruminococcaceae_1_G Succinivibrio_1_C Enterococcus_1_C Avilachnospira_1_C Escherichia_1_C Phocaeicola_1_C Christensenellales_1_C Eubacterium_1_C Synergistes_1_C Lachnospiraceae_5_C Streptococcus_1_C Cryptobacteroides_1_C Bianqueaceae_1_C Bacteroidaceae_1_C Faecalicoccus_1_C Butyricicoccaceae_1_C Bacteroides_1_C Bacteroidales_1_C Butyrivibrio_1_C Oscillospiraceae_1_C Prevotella_2_C Lachnospiraceae_6_C Enterocloster_1_C Bacilli_1_C Sphaerochaetaceae_1_C Lachnospiraceae_2_C Cryptobacteroides_2_C Desulfovibrio_1_C Lachnospiraceae_4_C Lachnospiraceae_3_C Treponema_1_C Prevotella_1_C Anaeroplasma_1_C Lachnospiraceae_1_C .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 24 1 Figure 3. CAZyme expression across bacterial MAGs from (A) digester enrichment, (B) cow 2 enrichment, (C) goat enrichment. 3 4 5 6 7 8 9 10 11 12 13 Sphaerochaeta_1_D Kineothrix_1_D Avimicrobium_1_D Bacteroides_1_D Bacteroides_3_D Clostridium_1_D 0 10 20 30 TPM x 103 MAG Extracellular Cell wall OM CM Periplasmic Cytoplasmic Unknown 2.5 5.0 7.5 10.0 12.5 log2(TPM + 1) CBM13;CBM2;GH11CBM13;CBM2;GH10 CBM3;GH9 CBM13;CBM2;CBM6;GH43_16 CBM13;CBM2;CE1;GH10 CBM36;GH11CBM3;GH48 CBM6;CE1;GH10;GH11 CBM2;GH5_4 CBM36;CE4;GH11 CBM48;CE1 CE1 GH5_2GH18GT51 CE8 GH16_3;GH43_24 GH28GH16_3 CBM6;GH43_2;GH8 CE12;CE8GH30_3GH25GH51_2 CBM48;CBM50;CBM83;GH13_41 CBM4;GH16_3 GH5_46 CAZyme Motif Target arabinoxylan chitin galactan glucan mucin pectin peptidoglycan starch Lachnospiraceae_4_C Lachnospiraceae_2_C Lachnospiraceae_3_C Treponema_1_C Prevotella_1_C Anaeroplasma_1_C Cryptobacteroides_2_C Lachnospiraceae_1_C 0 2 4 TPM x 103 MAG Extracellular Cell wall OM CM Periplasmic Cytoplasmic Unknown 2 4 6 8 log2(TPM + 1) GH25CE4GH10 CBM34;GH13_39 GH5_37 CBM61;GH53 GH43_26 GH28 CE12;CE8;PL1 CE8;GH95GH16_3 CE8 GH106;GH28 CBM56 CBM26;GH13_32CBM26;GH13_28 GH13_28GH51_2 CBM4;GH10 CBM48;CE1;GH10 GH5_38 PL1 CBM6;GH43_29 GH13_46 CBM50;GH73 GH5_46 CBM4;GH16_3 GH13_4CE12;CE8 PL1_6PL9_1GH5_2GH136 CBM5;GH18 GT51GH9GH73 CAZyme Motif Target arabinoxylan chitin galactan glucan mucin pectin peptidoglycan starch Prevotella_1_G Lachnospiraceae_7_G Ruminococcaceae_3_G Lachnospiraceae_5_G Bacteroidales_1_G Ruminococcaceae_2_G Lachnospiraceae_4_G Ruminococcaceae_1_G 0 5 10 TPM x 103 MAG Extracellular Cell wall OM CM Periplasmic Cytoplasmic Unknown 3 6 9 log2(TPM + 1) CBM4;GH9 GH48 CBM22;GH10;GH11 GH9 CBM3;GH9CBM79;GH9 GH25 CBM22;CBM6;GH10;GH43_16 CBM22;CE1 CBM22;CE1;GH30_8 GH28GH5_39 CE1CE8 CBM37;GH9 CBM22;CBM37;GH11 CBM37 CBM3;CBM37;GH9CBM22;CBM37;GH10 CBM22;GH30_8 CBM37;CBM77;PL1 CBM37;PL11 CBM22;CE4;GH11 CBM13;CBM35;CE12 CBM50GH13_36CE4;GH9 CBM50;GH23 GH73GH5_2 CBM22;GH10 GH11CBM3 GH10;GH11CBM35;GH97CBM22;GH11 GH10 CBM30;GH9 GH146CBM6CBM4PL9PL1_2 CE12;CE8 PL1PL4_4 CAZyme Motif Target arabinoxylan chitin galactan glucan mucin pectin peptidoglycan starch .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 25 1 Figure 4. CAZyme expression across fungal reference genomes and comparison to bacterial 2 MAGs. 3 4 5 6 7 8 9 10 11 12 13 NeoGfMa1 cow Neolan1 goat 0 20 40 60 TPM x 103 Genome 10.5 11.0 11.5 12.0 12.5 13.0 log2(TPM + 1) CBM10;GH6 CE4 GH11 GH43_1 GH48 CE1 GH1 GH10 GH9 GH45 CE1;GH11CE1;CBM10 CAZyme Motif Target arabinoxylan glucan other 0 50 100 150 200 AA CBM CE GH GT PL CAZyme Family Number of Motifs 0 50 100 150 AA CBM CE GH GT PL CAZyme Family Number of Motifs 0 20 40 60 AA CBM CE GH GT PL CAZyme Family Number of Motifs Kingdom Bacteria Fungi 0 20 40 60 AA CBM CE GH GT PL CAZyme Family Number of Motifs Kingdom Bacteria Fungi Cow Cow Goat Goat All ORFs Extracellular ORFs A B .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 26 1 Figure 5. Expression of key metabolic pathways across MAGs . Rows indicate RNA expression 2 (log2[TPM]) of MAG open reading frames annotated as MetaCyc reactions across key metabolic 3 pathways via gapseq. 4 5 6 7 8 9 10 11 12 13 14 15 Digester Cow Goat Xylose_utilization L−Arabinose_utilization Pentose_phosphate_pathway Glycolysis_and_Gluconeogenesis Pyruvate_metabolism_II:_acetyl−CoA, _acetogenesis_from_pyruvate Pyruvate:ferredoxin_oxidoreductase Fermentations:_Mixed_acid Ethanol Butanol pyruvate fermentation to (S)−lactate pyruvate fermentation to (R)−lactate Acetyl−CoA_fermentation_to_Butyrate lactate fermentation to propanoate dissimilatory sulfate reduction IMF Formate hydrogen oxidation/production Clost ridium_1_D Bacteroides_3_DBacteroides_1_DAvimicrobium_1_DKineoth rix_1_D Sphaerochaeta_1_D Nit ratidesul fovib rio_1_D Dethiosul fovib rio_1_D Desul fovib rio_1_D Lachnospi raceae_1_C Cryptobacteroides_2_CAnaeroplasma_1_C Prevotella_1_C Treponema_1_C Lachnospi raceae_3_C Lachnospi raceae_2_C Lachnospi raceae_4_C Desul fovib rio_1_C Ruminococcaceae_1_GLachnospi raceae_4_G Ruminococcaceae_2_G Bacteroidales_1_G Lachnospi raceae_5_G Ruminococcaceae_3_GLachnospi raceae_7_G Prevotella_1_G Desul fovib rio_2_G Desul fovib rio_1_G XYLISOM−RXN XYLULOKIN−RXN RIBULPEPIM−RXN RXN0−5116 ARABISOM−RXN RIBULP3EPIM−RXN RIB5PISOM−RXN TRANSALDOL−RXN 1TRANSKETO−RXN PHOSPHOKETOLASE−RXN GLUCOKIN−RXN PGLUCISOM−RXN 6PFRUCTPHOS−RXN PHOSACETYLTRANS−RXN ACETATEKIN−RXN PYRUFLAVREDUCT−RXN PYRUVFORMLY−RXN ACETALD−DEHYDROG−RXN ALCOHOL−DEHYDROG−RXN RXN−161 BUTANAL−DEHYDROGENASE−RXN L−LACTATE−DEHYDROGENASE−RXN DLACTDEHYDROGNAD−RXN ACETYL−COA−ACETYLTRANSFER−RXN RXN−11662 RXN−11667 RXN−16834 PHOSPHATE−BUTYRYLTRANSFERASE−RXN BUTYRATE−KINASE−RXN R11−RXN RXN0−268 METHYLMALONYL−COA−MUT−RXN METHYLMALONYL−COA−EPIM−RXN RXN−19738 ADENYLYLSULFATE−REDUCTASE−RXN RXN−17803 RXN−17804 SULFATE−ADENYLYLTRANS−RXN NA−TRANSLOCATING−RNF 1.2.1.2−RXN 1.12.1.3−RXN HYDROGEN−DEHYDROGENASE−RXN RXN−12215 FHLMULTI−RXN HYDROG−RXN MAG Reaction 2.5 5.0 7.5 10.0 12.5 log2(TPM + 1) .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 27 SUPPLEMENTARY INFORMATION 1 2 CAZyme architectures suggest fine-scale functional differentiation among anaerobic fungi 3 and bacteria during lignocellulose conversion to volatile fatty acids 4 5 Christopher E. Lawson1,2,3,†, Joel P. Howard3†, Thomas S. Lankiewicz1,4, James B. Brown2, 6 Steven W. Singer1,2, Héctor García Martín1,2,5,6, Michelle A. O’Malley1,4 7 8 1Joint BioEnergy Institute, Emeryville, CA, USA 9 2Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA 10 3Department of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, ON, Canada 11 4Department of Chemical Engineering, University of California, Santa Barbara, Santa Barbra, CA, USA 12 5DOE Agile BioFoundry, Emeryville, CA, USA 13 6BCAM, Basque Center for Applied Mathematics, Bilbao, Spain 14 †These authors contributed equally to this work 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint 28 Supplementary Figures 1 2 3 Supplementary Figure 1 – Top CAZyme motifs expressed across bacterial and fungal 4 enrichments. CAZyme motifs were predicted using dbcanLight v1.0.2 . BES – 2-5 bromoethansulfonate (bacterial enrichments), Ch-Chloramphenicol (fungal enrichments). 6 7 Supplementary Datasets 8 Supplementary Data 1 – MAG Statistics. MAG statistics, including MAG size, GC content, 9 completeness and redundancy, and DNA /mRNA read mapping, from cow (A), goat (B), and 10 digester (C) bacterial enrichments. 11 12 Supplementary Data 2 – Bacterial MAG CAZyme expression table. Annotation and expression 13 of CAZymes across all bacterial enrichment MAGs. 14 15 Supplementary Data 3 – Fungal CAZyme expression table . Annotation and expression of 16 CAZymes across fungal reference genomes. 17 18 Supplementary Data 4 – MAG pathway summary. Predicted MetaCyc reactions and metabolic 19 pathways across abundant MAGs recovered from cow, goat, and digester enrichments. 20 21 Cow+Ch 0 5 10 15 CBM10 GT8 GT35 CE15;CBM10 PL3_2 CBM6;GH43_16 CE6 GH45_1 CE1;GH11 CBM29 GH45 GH9 GH10 GH1 GH48 GH43_1 CE1 GH11 CE4 CBM10;GH6 log2(tpm + 1) Goat+Ch 0 5 10 15 CBM6;GH43_16 CBM13;CBM6;GH43_16 GH3;GH6 CBM29 GH45 CBM10;GH11 GH45_1 CE6 CE15 GH9 CE1 GH1 GH10 CE1;CBM10 CE1;GH11 CE4 GH11 GH43_1 GH48 CBM10;GH6 log2(tpm + 1) Digester+BES 0 5 10 15 CBM13;CBM2;CE4 GH5_37 CBM3;CBM86;GH10 CBM22;CBM3;GH10 GH73 CBM4;GH9 GH16_21 CBM13;CBM2;CBM6;GH43_29 CBM13;CBM2;PL11 GH9 CBM36;CE4;GH11 CBM2;GH5_4 CBM6;CE1;GH10;GH11 CBM3;GH48 CBM36;GH11 CBM13;CBM2;CE1;GH10 CBM13;CBM2;CBM6;GH43_16 CBM3;GH9 CBM13;CBM2;GH10 CBM13;CBM2;GH11 log2(tpm + 1) CAZyme Motif Cow+BES 0 5 10 15 GH5_38 CBM61;GH53 CE12;CE8;PL1 CE1 GH16_3 GH5_37 GH9 CBM48;CE1;GH10 CBM4;GH10 CBM34;GH13_39 CE8 GH10 GH51_2 GH13_28 CBM4 GH28 CE4 CBM26;GH13_28 CBM26;GH13_32 GH25 log2(tpm + 1) Goat+BES 0 5 10 15 GH5_1 CBM50 GH28 GH16_21 CBM22;CE1;GH30_8 CBM3;CBM37;GH9 CBM37 CBM22;CE1 GH11 CBM22;CBM6;GH10;GH43_16 CBM22;CBM37;GH11 CBM22;GH10 CBM79;GH9 GH25 CBM3;GH9 GH9 CBM37;GH9 CBM22;GH10;GH11 GH48 CBM4;GH9 log2(tpm + 1) .CC-BY-NC-ND 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted February 10, 2026. ; https://doi.org/10.64898/2026.02.09.704939doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-24T02:00:01.246996+00:00
License: CC-BY-NC-ND-4.0