Full text
93,118 characters
· extracted from
preprint-html
· click to expand
Modeling plant-microbe interactions with species distribution models | bioRxiv /* */ /* */ <!-- <!-- /*! * yepnope1.5.4 * (c) WTFPL, GPLv2 */ (function(a,b,c){function d(a){return"[object Function]"==o.call(a)}function e(a){return"string"==typeof a}function f(){}function g(a){return!a||"loaded"==a||"complete"==a||"uninitialized"==a}function h(){var a=p.shift();q=1,a?a.t?m(function(){("c"==a.t?B.injectCss:B.injectJs)(a.s,0,a.a,a.x,a.e,1)},0):(a(),h()):q=0}function i(a,c,d,e,f,i,j){function k(b){if(!o&&g(l.readyState)&&(u.r=o=1,!q&&h(),l.onload=l.onreadystatechange=null,b)){"img"!=a&&m(function(){t.removeChild(l)},50);for(var d in y[c])y[c].hasOwnProperty(d)&&y[c][d].onload()}}var j=j||B.errorTimeout,l=b.createElement(a),o=0,r=0,u={t:d,s:c,e:f,a:i,x:j};1===y[c]&&(r=1,y[c]=[]),"object"==a?l.data=c:(l.src=c,l.type=a),l.width=l.height="0",l.onerror=l.onload=l.onreadystatechange=function(){k.call(this,r)},p.splice(e,0,u),"img"!=a&&(r||2===y[c]?(t.insertBefore(l,s?null:n),m(k,j)):y[c].push(l))}function j(a,b,c,d,f){return q=0,b=b||"j",e(a)?i("c"==b?v:u,a,b,this.i++,c,d,f):(p.splice(this.i++,0,a),1==p.length&&h()),this}function k(){var a=B;return a.loader={load:j,i:0},a}var l=b.documentElement,m=a.setTimeout,n=b.getElementsByTagName("script")[0],o={}.toString,p=[],q=0,r="MozAppearance"in l.style,s=r&&!!b.createRange().compareNode,t=s?l:n.parentNode,l=a.opera&&"[object Opera]"==o.call(a.opera),l=!!b.attachEvent&&!l,u=r?"object":l?"script":"img",v=l?"script":u,w=Array.isArray||function(a){return"[object Array]"==o.call(a)},x=[],y={},z={timeout:function(a,b){return b.length&&(a.timeout=b[0]),a}},A,B;B=function(a){function b(a){var a=a.split("!"),b=x.length,c=a.pop(),d=a.length,c={url:c,origUrl:c,prefixes:a},e,f,g;for(f=0;f<d;f++)g=a[f].split("="),(e=z[g.shift()])&&(c=e(c,g));for(f=0;f<b;f++)c=x[f](c);return c}function g(a,e,f,g,h){var i=b(a),j=i.autoCallback;i.url.split(".").pop().split("?").shift(),i.bypass||(e&&(e=d(e)?e:e[a]||e[g]||e[a.split("/").pop().split("?")[0]]),i.instead?i.instead(a,e,f,g,h):(y[i.url]?i.noexec=!0:y[i.url]=1,f.load(i.url,i.forceCSS||!i.forceJS&&"css"==i.url.split(".").pop().split("?").shift()?"c":c,i.noexec,i.attrs,i.timeout),(d(e)||d(j))&&f.load(function(){k(),e&&e(i.origUrl,h,g),j&&j(i.origUrl,h,g),y[i.url]=2})))}function h(a,b){function c(a,c){if(a){if(e(a))c||(j=function(){var a=[].slice.call(arguments);k.apply(this,a),l()}),g(a,j,b,0,h);else if(Object(a)===a)for(n in m=function(){var b=0,c;for(c in a)a.hasOwnProperty(c)&&b++;return b}(),a)a.hasOwnProperty(n)&&(!c&&!--m&&(d(j)?j=function(){var a=[].slice.call(arguments);k.apply(this,a),l()}:j[n]=function(a){return function(){var b=[].slice.call(arguments);a&&a.apply(this,b),l()}}(k[n])),g(a[n],j,b,n,h))}else!c&&l()}var h=!!a.test,i=a.load||a.both,j=a.callback||f,k=j,l=a.complete||f,m,n;c(h?a.yep:a.nope,!!i),i&&c(i)}var i,j,l=this.yepnope.loader;if(e(a))g(a,0,l,0);else if(w(a))for(i=0;i (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0];var j=d.createElement(s);var dl=l!='dataLayer'?'&l='+l:'';j.src='//www.googletagmanager.com/gtm.js?id='+i+dl;j.type='text/javascript';j.async=true;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-M677548'); Skip to main content Home About Submit ALERTS / RSS Search for this keyword Advanced Search New Results Modeling plant-microbe interactions with species distribution models Zihui Wang , Sarah Piché-Choquette , Jocelyn Lauzon , Sarah Ishak , View ORCID Profile Steven W. Kembel doi: https://doi.org/10.1101/2024.07.12.603317 Zihui Wang 1 Département des sciences biologiques, Université du Québec à Montréal , Montréal, QC, H2X1Y4, Canada 2 Department of Botany, University of British Columbia , Vancouver, BC, V6T1Z4, Canada Find this author on Google Scholar Find this author on PubMed Search for this author on this site For correspondence: wang.zihui{at}courrier.uqam.ca kembel.steven_w{at}uqam.ca Sarah Piché-Choquette 1 Département des sciences biologiques, Université du Québec à Montréal , Montréal, QC, H2X1Y4, Canada Find this author on Google Scholar Find this author on PubMed Search for this author on this site Jocelyn Lauzon 1 Département des sciences biologiques, Université du Québec à Montréal , Montréal, QC, H2X1Y4, Canada Find this author on Google Scholar Find this author on PubMed Search for this author on this site Sarah Ishak 1 Département des sciences biologiques, Université du Québec à Montréal , Montréal, QC, H2X1Y4, Canada Find this author on Google Scholar Find this author on PubMed Search for this author on this site Steven W. Kembel 1 Département des sciences biologiques, Université du Québec à Montréal , Montréal, QC, H2X1Y4, Canada Find this author on Google Scholar Find this author on PubMed Search for this author on this site ORCID record for Steven W. Kembel For correspondence: wang.zihui{at}courrier.uqam.ca kembel.steven_w{at}uqam.ca Abstract Full Text Info/History Metrics Preview PDF Abstract Plants interact with diverse microorganisms that play a crucial role in plant growth and development. The diversity and distribution of plant microbiota are altered by anthropogenic environmental change, leading to subsequent impacts on ecosystems. Modeling the distribution of plant-associated microbes is critical for predicting and managing future changes in microbial function, but there remain challenges and open questions when developing these models. We present a conceptual framework for process-oriented predictive modeling of the distribution of plant-associated microbiota. We first describe different approaches to incorporating host plants into modeling microbial distributions, namely by including them as static variables, nesting them within microbial distribution models or incorporating them simultaneously via joint species distribution models. Additionally, we discuss issues associated with collecting and analyzing sequencing-based microbial data, emphasizing the importance of data normalization and careful interpretation of species distribution models. We further discuss how to incorporate evolutionary history into microbial distribution modeling. Finally, we present a case study demonstrating how incorporating host information can improve the prediction of microbial distributions. This study provides insights for predicting future distributions of plant-associated microbes under climate change and plant species redistribution, which can be generalized to other host-associated microbial systems. 1. Introduction Plants interact intimately with diverse microorganisms including bacteria, fungi, archaea, protists, and viruses that inhabit both external and internal plant tissues ( Berendsen et al., 2012 ; Fitzpatrick et al., 2020 ). These plant-microbe interactions comprise a range of symbiotic relationships including parasitism, commensalism, competition and mutualism, playing a pivotal role in shaping plant health and development ( Pattnaik et al., 2021 ; Vandenkoornhuyse et al., 2015 ). While certain microbial groups such as mycorrhizal fungi, phytopathogens and nitrogen-fixing bacteria have been extensively studied over the last century, it was only in recent decades that the advent of high-throughput sequencing techniques revealed the remarkable and complex taxonomic and functional diversity of the plant-associated microbiota. Over the past decades, there has been a growing consensus on the importance of plant-associated microbiota for plant diversity, productivity, biogeography, and indeed nearly every aspect of plant ecology and evolution ( Dastogeer et al., 2020 ; Delavaux et al., 2019 ; Hawkes et al., 2020 ). Moreover, plant-associated microbiota impact socio-economically important ecosystems such as agroecosystems and forests that are directly related to human well-being ( Fisher et al., 2012 ; Thirkell et al., 2017 ). In this context, understanding the drivers of plant-microbe associations and elucidating the factors determining the diversity and distribution of the plant-associated microbiota have been considered key questions in ecology and evolution, as well as in plant sciences, forestry and agronomy. Anthropogenic environmental changes — including climate change, nutrient deposition, land-use change and the introduction of invasive species — influence the diversity and distribution of plant microbiota both directly (i.e. by altering temperature) and indirectly (i.e. through altering plant traits or host species distributions) ( Trivedi et al., 2022 ). These impacts are expected to prompt a rapid shift in the composition and function of plant-associated microbiota, given the shorter generation time and higher sensitivity of microbes to stresses compared to plants and animals ( Cavicchioli et al., 2019 ). The alteration of the plant microbiota induced by global changes, on the one hand, represents a potential catastrophic risk to ecosystem health and human well-being. For example, the increased impact of phytopathogens under climate change has the potential to endanger food supplies for 10-60% of the world’s population and cause billions of dollars of economic damage annually ( Bebber et al., 2014 ; Fisher et al., 2012 ). On the other hand, engineering the plant microbiota represents a potential approach to mitigate the negative effects of climate change on plants, for instance by enhancing plant resistance and acclimatation to abiotic and biotic stresses ( Angulo et al., 2022 ). Despite the critical role of the plant microbiota in mediating the effect of global change on plants and ecosystems — either accelerating or mitigating depending on the context and the microorganisms that are present — they have seldom been the focus of climate change studies and are almost always overlooked in policy development. A major challenge that prevents us from harnessing the full power of plant-associated microbiota is our shallow understanding of the response of microbes to environmental change ( Cavicchioli et al., 2019 ). Therefore, there is an urgent need to develop process-oriented understanding and predictive modeling of plant-associated microbial distribution at large geographic scales. While recent studies have initiated investigations into the global diversity and biogeography of plant microbiota, i.e. the distribution of community diversity and composition ( Li et al., 2023 ; Linde et al., 2018 ; Z. Wang et al., 2023 ), we are lacking a general framework to predict the geographic distribution of individual plant-associated microbial taxa. This article presents a conceptual framework to address the challenges associated with developing process-oriented predictive models for the distribution of plant-associated microbiota. We begin by reviewing the use of species distribution models (SDMs) and their extensions, such as joint SDMs, for free-living microbes and host-associated microbes such as parasites. We argue that although these SDMs have offered insights into modeling the distribution of host-associated microbes, they are not readily applicable to plant microbiota due to 1) distinct constraints imposed on free-living and plant-associated microbes, 2) varying degrees of host specificity between the plant microbiota and parasites, and 3) differences in data types for plants and microbes. We then identify the major limitations of current SDM frameworks in modeling plant-associated microbiota including 1) challenges in integrating host-related biotic factors into microbial SDMs, 2) difficulties in handling microbial sequencing data, and 3) the evolutionary potential of microbes. To overcome these limitations, we propose different strategies for incorporating host information into microbial SDMs. These range from simply incorporating host taxa as static ‘environmental’ variables into microbial SDMs, to increasingly complex models such as nested SDMs and joint SDMs of plant-microbial interactions, which are further demonstrated using a case study on plant leaf-associated bacteria. Additionally, we discuss potential solutions for the compositional and highly dimensional nature of microbial sequencing data, as well as the hierarchical data structure and the challenges in collecting standardized microbial datasets. Lastly, we extend our discussion to the potential influence of microbial niche evolution on SDMs, proposing possible avenues for further exploration. Overall, the aim of this article is to initiate new discussions on SDM-based frameworks for modeling the distribution of plant-associated microbiota. 2. Recent progress and remaining challenges for modeling the distribution of plant-associated microbes 2.1 Species distribution models for microorganisms and parasites While SDMs have been widely used to predict the distribution of plant and animal species across space and time ( Elith & Leathwick, 2009 ), the characterization of microbial SDMs has historically lagged behind due to challenges in collecting microbial occurrence data, identifying microbial taxonomy and gathering environmental data at appropriate spatial and temporal scales ( Hao et al., 2020 ). Consequently, microbial SDM studies have primarily focused on a few disease-causing microorganisms such as the human pathogen Coccidioides immitis and the plant pathogen Pseudomonas syringae ( Gorris et al., 2019 ; Wang et al., 2018 ). Recent advances in environmental DNA approaches have facilitated the collection of microbial biodiversity data, opening new opportunities for modeling the distribution of microorganisms. For example, a recent study adapted SDMs to model the relative abundance of soil bacteria as a function of climate and edaphic variables, revealing that soil pH and climate can explain and predict the abundance distributions of most soil bacterial taxa ( Mod et al., 2021 ). Another study focused on arbuscular mycorrhizal fungi, quantifying their distribution using climate and edaphic variables at a larger spatial scale, which provided a basis for predicting the geographic range shift of arbuscular mycorrhizal fungi under climate change ( Davison et al., 2021 ). However, in the latter case of SDMs for plant-associated mycorrhizal fungi, a major limitation lies in the lack of consideration of host-related factors and the potential confounding effect between hosts and the abiotic environment ( Kivlin et al., 2021 ). The importance of host-related factors in shaping the distribution of host-associated microbes has been particularly emphasized in the context of parasitic interactions ( Cunha et al., 2018 ; Gougherty & Davies, 2022 ; Kuhn et al., 2016 ; Piwowarczyk & Kolanowska, 2023 ). Studies on the SDMs of parasites have demonstrated that incorporating both host distribution and abiotic factors achieved higher accuracy in predicting the distribution of parasites, compared to considering only one of the two elements ( Cunha et al., 2018 ; McDonough & Holloway, 2021 ). Depending on the data availability and the context of host-parasite interactions, host distributions can be incorporated into SDMs in various forms such as the presence-absence or abundance of host taxa, and the taxonomic, phylogenetic and trait attributes of host taxa and surrounding plant communities ( Gougherty & Davies, 2022 ; Wisz et al., 2013 ). However, incorporating host distributions in SDMs also means the need for constructing additional SDMs to forecast future host distributions in order to predict the future distribution of parasites ( Morales-Castilla et al., 2021 ). Although SDMs for parasites have been refined to incorporate host distributions, they are not readily applicable to plant-associated microbiota. Several unique features of plant-associated microbial data pose challenges in modeling their distributions. First, plant-associated microbes differ from parasites in host specificity. Unlike many parasites which infect one or a few host species (though some parasites can have a wide host breadth, see ( Park et al., 2018 )), most plant-associated microbes live across a broad range of host plants and may also be found in non-plant habitats ( Cordovez et al., 2019 ; Lajoie & Kembel, 2021 ). This makes it challenging to include the presence-absence of individual host species in the model. Secondly, microbial data are typically obtained through DNA sequencing, which provides relative abundances of microbial taxa such as amplicon sequence variants (ASVs) or operational taxonomic units (OTUs). This type of data raises additional challenges for microbial SDMs compared to parasites whose occurrence is generally measured through direct observation. For example, the presence-absence of microbes is highly sensitive to sequencing depth ( Gloor et al., 2017 ) and synthesizing data from different studies is challenging since microbial taxa are often defined differently across studies (Brunel, 2020). Lastly, plant-associated microbial data are mostly collected from host individuals, leading to a hierarchical structure in microbial datasets where samples are not independent but clustered within host taxa or other host-related factors. Current parasite SDMs do not accommodate this hierarchical data structure ( Björk et al., 2018 ). 2.2 Joint species distribution models and their application to host-associated microbes Recent developments in joint species distribution models (JSDMs) present an opportunity for incorporating host information into modeling the distribution of plant-associated microbes. JSDMs extend traditional SDMs by simultaneously modeling the distribution of multiple species based on species attributes and environmental variables ( Harris, 2015 ; Warton et al., 2015 ). JSDMs offer several advances over traditional SDMs. Firstly, JSDMs can hierarchically model distributions by incorporating factors that represent inherent relationships among species, such as species traits and phylogenetic relationships ( Jamil et al., 2013 ; Ovaskainen & Soininen, 2011 ; Pollock et al., 2012 ). This allows JSDMs to consider that species with similar traits or phylogeny respond in the same way to the environment, and thereby improve predictions of distributions for rare and under-sampled species by using information from more abundant species that are phylogenetically close or have similar functional traits ( Ovaskainen & Soininen, 2011 ). Secondly, JSDMs can infer associations among species based on model residuals, which reflect species co-occurrence patterns after controlling for the effect of environmental variables ( Pollock et al., 2014 ). While the interpretation of this co-occurrence matrix is challenging because it can result from various processes such as missing predictors, species interactions and model misspecification, it provides information on species associations that can be used to improve predictions ( Ovaskainen et al., 2017 ; Poggiato et al., 2021 ; Warton et al., 2015 ). Lastly, JSDMs can handle complex and hierarchical data structures by incorporating a variety of random factors such as sampling site, time, spatial distances, and other hierarchical layers ( Björk et al., 2018 ; Ovaskainen et al., 2017 ). This improves the estimation of independent effects of factors and can provide information about the relative importance of different factors and processes. Several recent studies have adapted JSDMs to predict the species distribution of host-associated microbes ( Aivelo et al., 2019 ; Björk et al., 2018 ; Minard et al., 2019 ; Odriozola et al., 2021 ; Wang et al., 2023 ). For example, Björk et al. (2018) introduced a novel extension of JSDMs to accommodate the hierarchical structure of host-associated microbial data, which incorporates host-related factors as random terms in the model. This methodology makes it possible to discern the effects of host filtering among other ecological processes, as the authors illustrated with bird and sponge microbiota datasets. In another study, Wang et al. (2023) utilized JSDMs to characterize the geographic distribution of plant phyllosphere bacteria. They predicted the presence and abundance of keystone bacterial taxa as a function of host plant traits and climate. Their models incorporated data on the spatial arrangement and host phylogenetic relationships among samples as random terms to control for the effects of spatially and phylogenetically clustered data structure. This study revealed strong effects of host, climate, and spatial factors in controlling the occurrence and abundance of phyllosphere bacteria, suggesting that host-related factors and spatial proximity should be considered in modeling the distribution of the leaf-associated microbiota. Other studies that applied JSDMs to host-associated microbes reveal the potential of this approach in disentangling the importance of different processes, obtaining species co-occurrence patterns, and testing for niche conservatism along phylogenetic and trait axes ( Aivelo et al., 2019 ; Minard et al., 2019 ; Odriozola et al., 2021 ). For example, Aivelo et al. (2019) used JSDMs to analyze Ixodes ricinus ticks’ bacterial communities along ecological gradients in the Swiss Alps, while Minard et al. (2019) studied the intraspecific variation of the gut microbial communities of Melitaea cinxia caterpillars to identify ecological factors driving their structure. Odriozola et al. (2021) used JSDMs to test whether shared habitat use or fungal-bacterial interactions were responsible for the co-occurrence observed between fungal and bacterial communities from deadwood. 2.3 Challenges of applying SDMs to plant-associated microbiota In reviewing recent studies on SDMs for microbes and host-associated parasites, we have identified three categories of challenges of applying SDMs to model the geographic distributions of plant-associated microbes. The first challenge is related to incorporating host information into microbial SDMs, including what types of host data should be considered, how to deal with varying degrees of host specificity, and whether host distributions should be treated as a constant or dynamic data layer. The second challenge arises from the distinctive format and structure of microbial distribution data compared to plant and animal distribution data for which SDMs were initially developed. Microbial distribution data often come from field surveys of microbial communities based on DNA sequences, which are highly dimensional, compositional and can possess hierarchical structures. The last challenge is associated with the rapid evolution of microbes, leading to the possibility for rapid niche adaptation ( Sousa et al., 2023 ) and host shifts ( Vries et al., 2020 ) that should be considered in species distribution modeling. 3. Developing process-oriented predictive models for the plant-associated microbial distribution 3.1 Incorporating host information in SDMs of plant-microbe associations To address the first challenge, we propose three scenarios for incorporating host plant distributions into the SDMs of plant-associated microbes. This is essential since microbial persistence requires host presence and a failure to incorporate information on host distributions may lead to a misestimation of the geographic range of plant-associated microbes. The first scenario is to incorporate plant distributions as a static layer that constrains microbial distributions. Secondly, the abiotic environment may influence the distribution of host plants, and thereby indirectly influence the distribution of plant-associated microbes. To account for this situation, plant species distributions can be incorporated into microbial SDMs as a dynamic layer, where plant distributions are modeled with a separate SDM and then used to predict microbial SDMs. The third scenario considers the feedback effect between plants and microbes, whereby plant distributions influence microbial distributions and vice versa. As a result, the distribution of microbes and their host plants may respond to environmental change simultaneously, which can be modeled using JSDMs by incorporating the co-occurrence patterns between plant and microbes ( Fig. 1 ). Below we explicitly discuss these three types of SDMs in terms of rationale, data requirements, model structure and computational tools. Download figure Open in new tab Fig. 1 Conceptual frameworks for incorporating plant species distributions into models of plant-associated microbial distribution. Three scenarios describing the influence of plant distributions on microbial distributions are described, including (a,b) plant distributions as a constant constraint on microbial distributions, (c, d) environments affecting both plant and microbial distributions, with plant distributions additionally affecting microbial distributions, and (e, f) plant species and their associated microbes responding to environmental change simultaneously. (a, c, e) graphically summarize the modeling structures and (b, d, f) demonstrate the projection of current and future distributions of plant and microbial species under three different scenarios. Scenario 1 – Incorporate host plants as a static layer Host information can simply be included as a variable in the SDMs of plant-associated microbes. However, the way host information is represented in the model may depend on microbial data sampling strategies. In general, plant-associated microbial samples are collected from host individuals (“host-explicit”). Still, there are cases where microbial communities are sampled from a location with mixed hosts (“host-inexplicit”, e.g. Davison et al., 2021 ; Li et al., 2023 ). Although both types of microbial data can be used to construct microbial SDMs, they address different questions and require different model structures. Microbial SDMs based on host-explicit data predict the occurrence of microbial taxa on a specific host taxon across environmental gradients, which requires a hierarchical structure in the model to account for host effects. For this type of model, microbial occurrence in a location can be inferred by combining the occurrence of the host and the known associations between microbes and the host taxon. In contrast, models based on host-inexplicit data can directly predict the occurrence of microbes in a location as a function of biotic and abiotic environments, which is similar to plant and animal SDMs. Here, we primarily focus on microbial SDMs with host-explicit data although host-inexplicit SDMs are briefly covered. Apart from microbial occurrence, data describing the host species, abiotic and biotic environments of the samples are required for plant-associated microbial SDMs ( Fig. 2 ). Host information includes the species identity, traits, taxonomy, and phylogeny of the host individuals where the microbes are sampled (for host-explicit microbial data only). Biotic variables consist of information on plant communities where the microbes are situated, such as the presence-absence or the abundance of host plant species, as well as the taxonomic diversity, traits and phylogenetic properties of plant communities. The abiotic environment can include climate as well as microhabitat characteristics such as soil properties (in particular for root-associated microbes; Davison et al., 2021 ; Linde et al., 2018 ) and light conditions (in particular for leaf-associated microbes; see Carvalho & Castillo, 2018 ). Additionally, data on microbial attributes including taxonomy, phylogeny and functional traits can be incorporated into the SDMs ( Fig. 2 ). Download figure Open in new tab Fig. 2 Data typically collected for plant-associated microbes. (a) Occurrence data of microbial taxa in each sampling unit. Sampling units are associated with host individuals and locations in host-explicit datasets, and only locations in host-inexplicit dataset; (b) Trait and phylogenetic data of microbial taxa; (c) Plant communities in which samples are collected; (d) Trait and phylogenetic data of plant species; (e) Abiotic environmental variables such as climate and edaphic variables, and (f) Plant-microbe interaction networks. Below we illustrate how these datasets can be used to model the distribution of plant-associated microbes and generate predictions based on host plant distributions and abiotic environments ( Fig. 1a ). We start by modeling the presence or absence (PA) of plant-associated microbial taxon j in sample i . The subscript ( h ) indicates the host plant taxon from which sample i was collected: We use a probit regression to model the probability of presence P ij ( h ) as a function of linear predictors in both fixed and random forms (E2). ∑ k A ik α jk represents the fixed effect of abiotic environments, where A ik denotes the abiotic factor k measured for sample i (e.g. temperature, soil nutrients or light), and the regression parameter a jk represents the response of microbial taxon j to abiotic factor k ( Fig. 3 ). Similarly, ∑ Z B iZ β jZ represents the fixed effect of biotic environments, where B iZ denotes the plant community-related factor z measured for sample i (e.g. the abundance and diversity of host species in the plant community), and β jZ is the response of microbial taxon j to factor z ( Fig. 3 ). The δ i and γ j terms represent sample- and microbial taxon-level random effects, which account for the difference in sampling design and taxon abundance, respectively. Download figure Open in new tab Fig. 3. Graphical summary of the statistical framework integrating plant species distributions into microbial SDMs. Rectangles refer to data (explanatory or response variables), while circles represent parameters to be estimated. Plant distribution is integrated into the model as a static variable (orange color), dynamic variable (green) or as response variables in joint species distribution modeling (purple). Solid lines describe essential statistical inferences, while dotted lines refer to inferences that might be considered depending on the model structure and data availability (e.g. the hierarchical structure is included only when the microbial data is host-explicit). The sample-level random term δ i provides an opportunity to accommodate the hierarchical structure of microbial sampling. Here we consider two types of hierarchy ( Fig. 3 ). The first hierarchy arises from the nested microbial sampling within host plant species, where microbial communities collected from the same plant species are more compositionally similar compared to those collected from different plant species. To account for this effect, we included a random term µ h[i] to indicate the host species-specific effect on sample i (where h [ i ] is the plant species that sample i is collected from, see E3). This item can be further extended to include information on plant traits and phylogeny, which allows us to account for the observed trend that phylogenetically closer plant species tend to harbor a more similar composition of plant-associated microbes (see Björk et al., 2018 for detailed mathematics). The second hierarchy of microbial data is spatial hierarchy, in which microbial samples are nested within study sites, and microbial communities from the same site or nearby sites are more similar than those from spatially distant sites ( Finkel et al., 2012 ). A random term µ s [ i ] was included in E3 to indicate the effect of study site and spatial proximity ( Ovaskainen et al., 2016 ). As explained before, SDMs based on host-explicit microbial data predict the occurrence of microbial taxa on a specific plant species. To determine the microbial occurrence at a location, we need to account for microbial occurrence on all plant species at that location, defined as: where h represents a given plant species at location i , and P ij(h) is the probability of presence of microbial taxon j on plant species h at location i . In contrast, host-inexplicit microbial samples are collected from a location instead of a host individual. As a result, there is no need to consider host-specific random effects µ h[i] in the SDMs with host-inexplicit microbial data. In this case, the occurrence of microbes in a location can be directly predicted as a function of abiotic environments (such as climates) and biotic factors (including occurrence of potential host species or surrounding plant communities). However, such models are very sensitive to under-sampling (i.e. when not all plant species present at a location are sampled), in which case a random effect accounting for differential sampling effort should be considered ( Morales-Castilla et al., 2021 ). A variety of tools can be used for parameter estimation in the above models, including both single-species and multi-species distribution modeling. For example, generalized linear mixed-effect models (GLMMs) are often used to construct SDMs with hierarchical data, where factors such as host species identity and study site can be incorporated as random effects. With some extension, GLMMs can also be used to account for more complex hierarchical structures such as host phylogeny and spatial autocorrelation ( Dormann et al., 2007 ). In addition, many multi-species distribution modeling tools such as JSDMs can model microbial SDMs while accounting for the random effects of host and space, which may take greater computing resource than single-species modeling because of the additional inference of the microbe-microbe co-occurrence matrix ( Ovaskainen et al., 2017 ; Poggiato et al., 2021 ). Scenario 2 – Incorporate host plants as a dynamic layer Plant species distributions shift in response to environmental change, and therefore the future distributions of plant-associated microbes will depend on the redistribution of host species and the reassembly of plant communities. The model described above treats host plant occurrence as a static data layer; therefore, it can predict the geographic distributions of plant-associated microbes based on the current distributions of plant species. However, for predicting future microbial distributions, it is essential to construct separate SDMs to predict future plant distributions. The model structure of plant SDMs has been established previously. Here we use a simplified plant SDM formula to illustrate how future plant distribution can be incorporated to predict microbial distribution. We model the presence-absence of host plant species h at location i ( PA ih ) using a probit regression, where the probability of presence ( P ih ) is modelled as a function of abiotic environments: ∑ k A ik θ hk , where the term A ik denotes the abiotic factor k measured for location i , and the regression parameter θ hk represents the response of plant species h to abiotic factor k ( Fig. 3 ). These abiotic environments are not necessarily the same as those used in microbial SDMs since plants and microbes may respond to different environmental variables. σ i is the residual related to sample i , which can be extended to include other random effects if necessary. The reassembly of plant communities under environmental change can alter the values of plant community-related factors ( B iZ ) in E2, which represents the future properties of plant communities that the microbes may encounter. This value can be predicted in two different ways. One is through building plant SDMs for all possible plant species (E6) and extracting plant community information based on the predicted plant species distributions. The other is by building models to directly predict these factors, i.e. by modeling the abundance or diversity of plant species based on environmental variables. However, predicting plant community-level factors require additional data on plant community composition across environmental gradients ( Fig. 3 ). Based on the predicted occurrence of plant species h at location i ( P ih ) and the predicted occurrence of microbial taxon j on plant species h at location i ( P ij ( h ) ), the occurrence of microbial taxon j at location i can be calculated as E7, where h represents all potential host species of the microbial taxon j . Scenario 3 – Predict plant distribution and microbial distribution simultaneously using JSDMs While the models presented above focus on the influence of plant distributions and community properties on the distribution of plant-associated microbes, microbial distributions can have significant impacts on plant distributions. Many studies have emphasized the importance of plant-associated microbes in shaping plant species biogeographic distributions, for example, through positive effects of beneficial microbes on plant establishment under harsh environmental conditions, or negative effects of phytopathogens on plant fitness in otherwise suitable habitats ( Angulo et al., 2022 ; Delavaux et al., 2019 ). Taking the feedback effects between plant and microbes into account when modeling the geographic distribution of plant-microbial interactions is essential for a realistic prediction of both plant and microbial SDMs. The joint species distribution model (JSDM) approach, which has the feature of inferring and incorporating species associations into SDMs, provides a potential solution for accounting for the feedback effects between plant SDMs and microbial SDMs. To illustrate the use of JSDMs in predicting the distribution of plant-microbial interactions, we assume k microbial taxa and n host plant species collected at the same spatial scale. We predict the presence-absence of plant and microbes simultaneously by: where PA ij is denoted as the presence of species j ( j = 1… k …( k + n ), including k microbial taxa and n plant species) at site i , which is modeled as a function of fixed term ∑ k A ik φ jk and a random term ε ij . The random term was drawn from a normal distribution with a mean of 0 and a variance-covariance matrix W . Here, W is designed to infer residual species-to-species relationships (hereafter residual correlation matrix), indicating the co-occurrence among species after controlling for the influence of fixed environmental terms. By inferring this residual correlation matrix in our model, we can estimate plant-microbe, plant-plant and microbe-microbe co-occurrence patterns. Although the inference of the residual correlation matrix in JSDMs does not necessarily change the estimation of fixed effects of environmental variables compared to single-species SDMs, it can alter the prediction of species distributions by incorporating information on species co-occurrence patterns as a random term ( Poggiato et al., 2021 ). The interpretation of residual correlation is not clear, but it conveys biological information that JSDMs can utilize to better estimate species associations and to provide joint and conditional predictions ( Wilkinson et al., 2021 ). The conditional prediction of JSDMs based on residual correlation provides an opportunity to exploit the causality of the correlations between plant distributions and microbial distributions. For example, assuming that the observed plant-microbial co-occurrence is a result of plant species’ influence on microbial occurrence, one could build a JSDM to determine the probability of microbial occurrence given the occurrence of plant species (microbial occurrence conditional on plant occurrence). Similarly, JSDMs can predict plant occurrence conditional on microbial occurrence to test the influence of microbes on plant distributions. Additionally, plant occurrence and microbial occurrence can be predicted simultaneously as a conditional joint prediction ( Fig. 3 ), assuming bidirectional effects between plants and microbes ( Wilkinson et al., 2021 ). After building these different types of JSDMs, one could evaluate and compare the model performance to reveal insights into the causality of species co-occurrence between plants and their associated microbes ( Wilkinson et al., 2021 ; Zurell et al., 2020 ). 3.2 Addressing challenges associated with microbial sequencing data High dimensionality The most commonly used method in collecting microbial distribution data is metabarcoding, a marker gene-based sequencing approach that amplifies a specific genomic region using short DNA strands as primers to detect sequences that are present and quantify their relative abundance. By comparing those amplified sequences to a database of reference sequences with known taxonomic identity, this method can simultaneously identify many microbial taxa within a sample. Microbial studies based on metabarcoding often cluster amplified sequences into operational taxonomic units (OTUs) based on their sequence similarity, or identify exact amplicon sequence variants (ASVs) from error-corrected sequences( Callahan et al., 2017 ). Both OTUs and ASVs can be used as the fundamental taxonomic unit in SDMs, serving as a pragmatic proxy for ‘species’ in microbial systems where traditional biological classifications are challenging. Metabarcoding of microbial communities often reveals a high microbial diversity, ranging from hundreds to thousands of taxa ( Piché-Choquette et al., 2023 ). These highly dimensional community data pose a challenge given the computational cost of microbial JSDMs. One potential way to reduce the high dimensionality of microbial community data is to focus either on a subset of microbial taxa of high research interest, or on a certain number of most abundant taxa, which discards rare species that are harder to model ( Breiner et al., 2015 ). For example, keystone microbial taxa can exert considerable influence on microbial communities, and changes in the occurrence of these keystone taxa may cause drastic shifts in microbial composition and function( Banerjee et al., 2018 ). In this case, modeling the distribution of keystone taxa could be a priority. In addition, microbial SDMs may focus on microbial taxa with functions of interest such as nitrogen-fixing bacteria or mycorrhizal fungi. Focusing on a subset of microbial taxa can reduce the computational costs associated with JSDMs but requires prior knowledge of the keystone taxa in the study system or of the functional traits of microbial taxa. Another approach to reduce the high dimensionality of microbial community data is to model microbial distributions at a higher taxonomic level, such as the genus or family rank. For example, the bacterial genera Pseudomonas and Methylobacterium can be important for plant health (Mabood et al., 2014), and modeling the distribution of these plant-growth promoting bacterial genera is often of high research interest. Compositional nature of microbial data Metabarcoding-based microbial community data describe the relative abundance of microbial taxa. Most protocols for microbial metabarcoding attempt to normalize sequencing depth per sample to avoid introducing biases when comparing diversity metrics among samples, and as a result, variation in the number of sequences across samples do not reflect the variation in true abundance of microbes across those samples ( Gloor et al., 2017 ). The compositional nature of metabarcoding-based microbial data poses two challenges for microbial SDMs. First, changes in the absolute abundance of a single taxon can alter the relative abundance of all other taxa in a sample, which may generate false correlations in the relative abundances among microbial taxa and between microbial abundances and environmental variables. Moreover, the total amount of sequences in a microbial community is influenced by microbial load (the number of microbial cells or amount of microbial DNA per sample), as well as DNA amplification protocols and sequencing library size (the number of DNA sequences obtained per sample). Microbial loads may vary depending on the unit volumes of microbial samples, and library size can be influenced by the efficiency of DNA extraction, PCR protocols, and other library preparation manipulations ( McLaren et al., 2019 ). As a result, the total abundance of microbes in metabarcoding samples are, in general, biologically meaningless. That is, the total abundance of sequences does not necessarily reflect the true abundance of microbes in the ecosystem and the observed abundance of individual taxa is not comparable among samples. Therefore, it is inappropriate to use raw abundance data from metabarcoding to build microbial SDMs. One solution to this issue is to estimate the absolute abundance of microbial taxa, for example, by combining metabarcoding with techniques that estimate total microbial abundance such as digital PCR and quantitative PCR ( Barlow et al., 2020 ), or by inserting synthetic chimeric DNA spikes into microbial samples as a standard for absolute abundance estimation ( Kim et al., 2021 ; Stämmler et al., 2016 ; Tkacz et al., 2018 ). However, methods for measuring absolute abundance are still in active development and are not as commonly used as relative abundance estimates in microbial studies ( Morton et al., 2019 ). Relative abundance data of microbial communities can be potentially useful for microbial SDMs if sampling effort and library size are well controlled during data collection and analysis. For example, sampling effort can be standardized by collecting the same volume of material for all microbial samples, and library size can be normalized by rarefaction ( Schloss, 2024 ). However, if the microbial loads and library sizes are not standardized during data collection, they must be considered during data analysis, potentially by incorporating sample volume and/or library size as a random term in SDMs ( Odriozola et al., 2021 ). Overall, there is a lack of empirical data on the performance of these different approaches to normalizing abundances in microbial metabarcoding data, and studies are needed to determine the best approach to dealing with the compositional nature of microbial data in SDMs (but see Schloss, 2024 ). Collection of large-scale microbial datasets While the SDMs of plants and animals benefit greatly from the development of global observational datasets such as GBIF and eBird, collection of large-scale datasets for plant-associated microbes remains challenging. Numerous studies over the past decades have aimed to characterize microbial distribution at broad spatial scales, but most of them have focused on free-living microbes in habitats such as soils and oceans (e.g. Earth Microbiome Project; Thompson et al., 2017 ), or for a single host species (Human Microbiome Project Consortium; Methé et al., 2012 ) or specific microbial functional groups (GlobalFungi; Větrovský et al., 2019 , 2020 , and GlobalAMFungi; Větrovský et al., 2023 ). Large-scale studies on plant-associated microbes require not only a broad geographic range of sampling but also a wide host taxonomic coverage. This makes collecting data for host-associated microbes complex and logistically challenging. While large-scale sampling campaigns to quantify plant-associated microbiota remain challenging, an alternative approach is to compile existing datasets to achieve a broad geographical and plant-host taxonomic coverage. An increasing number of datasets on plant-microbial associations has been published over the past decades ( Sengupta et al., 2023 ). These data are accessible through international nucleotide sequence database collaborations such as the European Nucleotide Archive and the U.S. National Center for Biotechnology Information. These databases provide both sequence of the microbes and the metadata of samples such as the host plant, geographical location and the sampling protocol, representing a potential opportunity to investigate the global distribution and host specialization of plant-associated microbes. However, a significant obstacle for merging independent datasets together is the difference in the methods used across studies for sampling, DNA extraction, sequencing and data processing. These differences make it difficult to separate methodological artefacts from biological patterns ( Brunel et al., 2020 ). Therefore, it is essential to control for technical biases and use standardized bioinformatics protocols to synthesize microbial datasets. Examples of this approach include a study that integrated sequences from different studies targeting different 16S rRNA gene regions by mapping them to near full-length 16S sequences based on a reference database ( Lozupone et al., 2013 ). This reference mapping protocol provided a potential way to compare sequences from studies targeting heterogeneous regions of the 16S rRNA gene. Another study merged 30 independent bacterial datasets using a machine-learning approach that accounted for technical bias among studies ( Ramirez et al., 2018 ). The authors showed that disparate amplicon sequences could be integrated to estimate bacterial community structure. However, there remain numerous challenges related to integrating microbial datasets across studies, and the development of standardized sampling, sequencing, and data analysis protocols will aid in the quantification of broad-scale plant-associated microbial distributions. 3.3 Incorporating microbial evolution into species distribution models Microbes have the potential to evolve extremely rapidly given their high abundance, short generation time and the frequency of evolutionary events such as horizontal gene transfer and mutation. Consequently, microbes can quickly adapt to new environments and expand their geographic and host taxonomic range in response to climate change ( Brennan & Logares, 2023 ; Chase et al., 2021 ). Incorporating this evolutionary potential into a SDM represents a key challenge for accurate prediction of microbial distributions. Many studies have demonstrated that the evolutionary history and functional traits of microbes were associated with their geographic and host range. For example, a study on the spatial distribution of soil bacteria showed that genomic traits and phylogenetic relatedness could be used to predict the habitat breadth of soil bacteria: bacteria with larger genomes and more metabolic versatility were more likely to have wider environmental and geographic distributions ( Barberán et al., 2014 ). Another recent study on the host specialization of plant leaf-associated bacteria revealed that microbial genomic traits and phylogeny were associated with the host range of bacteria ( Wang et al., in process ). The importance of microbial phylogeny and traits in predicting their host range has also been broadly documented in the literature on plant-pathogen interactions, in which the potential hosts of poorly studied and newly identified pathogens were estimated based on host ranges of their relatives ( Barwell et al., 2021 ; Gougherty & Davies, 2021 ; Kahn & Almeida, 2022 ; Marcot et al., 2023 ). Overall, these studies suggested that microbial traits and phylogeny provided valuable information of microbial niche evolution that can be used in models of the distribution of plant-associated microbes. A potential way to incorporate microbial traits into SDMs is to assume that the microbial responses to abiotic and biotic factors (i.e. the α jk and β jz in E2) are not independent but correlated with microbial taxon-specific traits ( Abrego et al., 2017 ). This can be achieved by including a nested model of α jk or β jz : where T jl is the value of trait l for microbial taxon j , ω lk and ω lz represent the effect of trait l on the parameter α jk and β jz , respectively, and τ j is the error term. To account for the phylogenetic signal in microbial niches, the τ j should be drawn from a correlation matrix that assigns phylogenetically closer microbial taxa similar values, which can be indicated as where C is a microbial taxon-taxon correlation matrix obtained from microbial phylogenies. However, by including C as the sole variance-covariance matrix, we assume that microbial niche is fully structured by the phylogeny, which is not always the case (Webb & Gaston, 2003). To acknowledge this, a parameter ρ is included to allow for varying strength of phylogenetic conservatism on microbial niche: where I is a microbial taxon-taxon identity matrix and ρ ranges from 0 to 1, with a larger value indicating stronger effect of phylogenetic signal on microbial niche (Ives & Helmus, 2011). 4. Case study We applied SDMs to a dataset of plant leaf-associated bacteria to demonstrate how the future distribution of plant-associated microbes can be predicted under different scenarios of climate change and host species redistribution. The dataset was released from a biogeographic survey of leaf microbes in 10 forest sites along a latitudinal gradient (18.7°N-47.1°N) in China ( Wang et al., 2023 ). A total of 1453 microbial samples from 329 plant species were collected and sequenced using metabarcoding of the bacterial 16S rRNA gene to quantify bacterial community composition, revealing a total of 49,254 bacterial amplicon sequence variants (ASVs). To reduce the dimensionality of this dataset, we selected a small subset of the data by focusing on 12 of the most common plant species and 30 of the most abundant but not ubiquitous bacterial ASVs. The resulting dataset consists of 188 microbial samples nested within 12 plant species and 10 forest sites. The metadata of these samples included, apart from the site and plant species, three site-level climatic variables: annual mean temperature, annual precipitation, and temperature isothermality; and three host plant attributes: specific leaf area, leaf nitrogen concentration, and leaf phosphorus concentration. In addition, the presence-absence of host species in plant communities across study sites was included. Overall, this dataset represents a broad geographic and host species coverage that is essential for a large-scale SDM of host-associated microbes, though its spatial grain is limited by the relatively small number of study sites. We built three models to predict the distribution of bacterial ASVs as a function of three climatic variables and three host attributes while taking the hierarchical structure of the data into account. Bacterial ASV community data (i.e. 188 samples and 30 ASVs) – the response variables – were transformed to presence-absence values before probit regression was applied. In the first model, we considered host plants as a static layer, where the 3 host plant leaf traits and the 3 climatic variables were incorporated as fixed effects (i.e. A 1 -A 3 and B 1 -B 3 in E2). The model included study site and plant species identity as two random terms to account for the variation in bacterial presence-absence within study sites and plant species (i.e. δ i in E2 and E3). In the second model, we considered host plants as a dynamic layer: the distribution of plant species was first modeled as a function of the 3 climatic variables following E6, where the presence-absence of 12 plant species across 10 study sites was used as the response variable in a probit regression. Following E7, we estimated the probability of presence for each bacterial ASV based on the predicted occurrence of host plant species. Finally, in the third model, we predicted plant and bacteria distributions simultaneously with joint species distribution modeling. The presence-absence of 30 bacterial ASVs and 12 plant species were combined as the response variables, 3 climatic variables were used as fixed effects, and study site was treated as a random effect. All these models were estimated using Markov Chain Monte Carlo (MCMC) implemented within the R package ‘Hmsc’ (v.3.0.13) assuming the default prior distributions (Tikhonov et al., 2020). The first model, where host plants were considered a static variable, explained ∼57% of the variation in the presence-absence of leaf bacterial ASVs, most of which was explained by climatic variables (particularly annual mean temperature) ( Fig. 4 ). We found that considering hosts as a dynamic layer expanded the estimated thermal range of leaf bacteria compared to models where host occurrences were treated as static variables ( Fig. 5 ). This indicated that the range shift of host species can influence the prediction of bacterial distributions. However, predicting bacteria and plant distributions simultaneously with joint species distribution modeling only slightly altered the range of predicted bacterial distribution, where the range of some bacterial ASVs distributions expanded while others shrank (Fig. S1). Download figure Open in new tab Fig. 4. Explained variation in the presence-absence of bacterial ASVs in a species distribution model with host attributes considered as static variables. Each column represents a bacterial ASV, ranked in order of total explained variation. Colors represent the variation explained by each variable as noted in the caption box. Abbreviations: Random host and plot, the random effect of host plant species and study plot; SLA, specific leaf area; LeafN and LeafP, leaf nitrogen and phosphorus concentrations; BIO1, BIO12 and BIO3, the annual mean temperature, annual precipitation and temperature isothermality. Download figure Open in new tab Fig. 5. Predictions of bacterial presence across a temperature gradient based on three models, where host plants were considered as a static variable, a dynamic variable and a joint response (model type). The figure demonstrates the prediction for an example bacterial taxon (ASV_16); predictions for the other ASVs can be found in Supplementary Figure 1. The mean and 95% confidence intervals of the predictions are shown. Dashed lines indicate the temperature at which each model predicts a 50% probability of occurrence. Overall, this case study suggests that incorporating the dynamics of host plant distributions is important for predicting the distributions of plant-associated microbes ( Morales-Castilla et al., 2021 ). The persistence of plant-associated microbes relies on the presence of host plants, and therefore ignoring the range expansion of host plant species may underestimate the future distribution of plant-associated microbes, as illustrated in Fig.1 . At the same time, the geographic distribution of plant species can be influenced by their associated microbes. For example, the lack of beneficial microbes in a new habitat may limit the expansion of plant species ( Delavaux et al., 2019 ), while the lack of pathogenic microbes could facilitate plant invasion and expand plant species distribution (Keane & Crawley, 2002). This feedback effect between plant distributions and microbial distributions can be represented by a residual plant-microbe association matrix in the joint species distribution modeling framework, which is useful to predict the distribution of plant and microbes while considering the complex plant-microbial interactions ( Poggiato et al., 2021 ; Wilkinson et al., 2021 ). 5. Future directions Predicting how plant communities and their associated microbes will respond to environmental changes will be crucial in developing ecosystem management policies aiming to prevent or mitigate phytopathogen invasions and preserve ecosystem services provided by microbes to human populations. The conceptual framework presented in this review is an attempt to orient future research in the field of microbial biogeography by addressing key challenges and proposing alternatives to model the future distribution of host-associated microbes. Promising future directions for the study of host-associated microbial distributions include collecting large-scale datasets on microbial distributions, developing novel statistical and computational approaches to model microbial distributions, and assessing the validity of models of host-associated microbial distributions. Need for large-scale data and standardized protocols An important future direction for modeling plant-associated microbial distributions will be the collection of large-scale standardized datasets to allow modeling of distributions at global scales. Given the difficulty of comparing microbial community data collected using different technologies and protocols ( Kool et al., 2023 ), a collective effort should be made to encourage the use of consistent and documented protocols as a roadmap for collecting and analyzing environmental samples yielding plant-associated microbial data. Need for computationally efficient modeling approaches A major bottleneck in the development of host-associated microbial SDMs is that building SDMs from large and complex datasets is computationally intensive. Strategies to reduce model complexity and computing costs when fitting microbial SDMs remain to be evaluated quantitatively. We suggest that a promising research priority will be to compare different approaches such as model selection, ridge regression, dimensionality reduction, or other statistical methods that reduce the complexity of host-associated microbial SDMs. Another way to reduce computational demand while slightly compromising the robustness of the model is to use less greedy species distribution modeling approaches, such as approximated Bayesian methods like Integrated Nested Laplace Approximation and Variational Bayes ( Blei et al., 2017 ; Rue et al., 2009 ). Lastly, advances in computational power such as the use of GPU parallel computing should permit the analysis of increasingly complex models ( Pichler & Hartig, 2021 ). Need to test and validate distribution models Finally, an important future direction will be to assess the validity of predictive models of microbial distributions, by verifying and comparing true microbial occurrences in ecosystems against the predictions obtained with SDMs. For example, in the short term, this could be achieved by modeling the distribution of some chosen microbial taxa with data taken from a restricted spatial range, and collecting samples along a spatial gradient within and beyond their predicted range to survey their presence and test the accuracy of the SDMs. In the medium term, another possibility to test the predictions of SDMs would be to model the distribution of microbes associated with invasive plant species, since these plants tend to rapidly expand their geographic range under climate change ( Dukes et al., 2009 ; Wang et al., 2019 ). By doing so, researchers might be able to compare in just a few years from now the true novel microbial distributions against their predicted distributions computed with presently available data. These kinds of assessments will also be an invaluable opportunity to compare the performance of competing model types, with the aim of gaining a better understanding of which approach is best suited for different study systems and data structures. Ultimately, our ability to better model the distributions of host-associated microbes will make it possible to predict potential responses of plants and their associated microbes to global change. Data accessibility The data that support the findings of the case study are openly available in Sequence Read Archive at https://www.ncbi.nlm.nih.gov/sra/PRJNA1001021 , with accession number: PRJNA1001021 References ↵ Abrego , N. , Norberg , A. , & Ovaskainen , O . ( 2017 ). Measuring and predicting the influence of traits on the assembly processes of wood inhabiting fungi . Journal of Ecology , 105 ( 4 ), 1070 – 1081 . doi: 10.1111/1365-2745.12722 OpenUrl CrossRef ↵ Aivelo , T. , Norberg , A. , & Tschirren , B . ( 2019 ). Bacterial microbiota composition of Ixodes ricinus ticks: the role of environmental variation, tick characteristics and microbial interactions . PeerJ , 7 , e8217 . doi: 10.7717/peerj.8217 OpenUrl CrossRef ↵ Angulo , V. , Beriot , N. , Garcia Hernandez , E. , Li , E. , Masteling , R. , & Lau , J. A . ( 2022 ). Plant– microbe eco evolutionary dynamics in a changing world . New Phytologist , 234 ( 6 ), 1919 – 1928 . doi: 10.1111/nph.18015 OpenUrl CrossRef ↵ Banerjee , S. , Schlaeppi , K. , & Heijden , M. G. A. van der . ( 2018 ). Keystone taxa as drivers of microbiome structure and functioning . Nature Reviews Microbiology , 16 ( 9 ), 567 – 576 . doi: 10.1038/s41579-018-0024-1 OpenUrl CrossRef ↵ Barberán , A. , Ramirez , K. S. , Leff , J. W. , Bradford , M. A. , Wall , D. H. , & Fierer , N . ( 2014 ). Why are some microbes more ubiquitous than others? Predicting the habitat breadth of soil bacteria . Ecology Letters , 17 ( 7 ), 794 – 802 . doi: 10.1111/ele.12282 OpenUrl CrossRef PubMed ↵ Barlow , J. T. , Bogatyrev , S. R. , & Ismagilov , R. F . ( 2020 ). A quantitative sequencing framework for absolute abundance measurements of mucosal and lumenal microbial communities . Nature Communications , 11 ( 1 ), 2590 . doi: 10.1038/s41467-020-16224-6 OpenUrl CrossRef ↵ Barwell , L. J. , Perez Sierra , A. , Henricot , B. , Harris , A. , Burgess , T. I. , Hardy , G. , Scott , P. , Williams , N. , Cooke , D. E. L. , Green , S. , Chapman , D. S. , & Purse , B. V . ( 2021 ). Evolutionary trait based approaches for predicting future global impacts of plant pathogens in the genus Phytophthora . Journal of Applied Ecology , 58 ( 4 ), 718 – 730 . doi: 10.1111/1365-2664.13820 OpenUrl CrossRef ↵ Bebber , D. P. , Holmes , T. , & Gurr , S. J . ( 2014 ). The global spread of crop pests and pathogens . Global Ecology and Biogeography , 23 ( 12 ), 1398 – 1407 . doi: 10.1111/geb.12214 OpenUrl CrossRef ↵ Berendsen , R. L. , Pieterse , C. M. J. , & Bakker , P. A. H. M . ( 2012 ). The rhizosphere microbiome and plant health . Trends in Plant Science , 17 ( 8 ), 478 – 486 . doi: 10.1016/j.tplants.2012.04.001 OpenUrl CrossRef PubMed Web of Science ↵ Björk , J. R. , Hui , F. K. C. , O’Hara , R. B. , & Montoya , J. M . ( 2018 ). Uncovering the drivers of host associated microbiota with joint species distribution modelling . Molecular Ecology , 27 ( 12 ), 2714 – 2724 . doi: 10.1111/mec.14718 OpenUrl CrossRef ↵ Blei , D. M. , Kucukelbir , A. , & McAuliffe , J. D . ( 2017 ). Variational Inference: A Review for Statisticians . Journal of the American Statistical Association , 112 ( 518 ), 859 – 877 . doi: 10.1080/01621459.2017.1285773 OpenUrl CrossRef ↵ Breiner , F. T. , Guisan , A. , Bergamini , A. , & Nobis , M. P . ( 2015 ). Overcoming limitations of modelling rare species by using ensembles of small models . Methods in Ecology and Evolution , 6 ( 10 ), 1210 – 1218 . doi: 10.1111/2041-210x.12403 OpenUrl CrossRef ↵ Brennan , G. L. , & Logares , R . ( 2023 ). Tracking contemporary microbial evolution in a changing ocean . Trends in Microbiology , 31 ( 4 ), 336 – 345 . doi: 10.1016/j.tim.2022.09.001 OpenUrl CrossRef ↵ Brunel , C. , Pouteau , R. , Dawson , W. , Pester , M. , Ramirez , K. S. , & Kleunen , M. van. ( 2020 ). Towards unraveling macroecological patterns in rhizosphere microbiomes . Trends in Plant Science , 25 ( 10 ), 1017 – 1029 . doi: 10.1016/j.tplants.2020.04.015 OpenUrl CrossRef ↵ Callahan , B. J. , McMurdie , P. J. , & Holmes , S. P . ( 2017 ). Exact sequence variants should replace operational taxonomic units in marker-gene data analysis . The ISME Journal , 11 ( 12 ), 2639 – 2643 . doi: 10.1038/ismej.2017.119 OpenUrl CrossRef PubMed ↵ Carvalho , S. D. , & Castillo , J. A . ( 2018 ). Influence of Light on Plant–Phyllosphere Interaction . Frontiers in Plant Science , 9 , 1482 . doi: 10.3389/fpls.2018.01482 OpenUrl CrossRef ↵ Cavicchioli , R. , Ripple , W. J. , Timmis , K. N. , Azam , F. , Bakken , L. R. , Baylis , M. , Behrenfeld , M. J. , Boetius , A. , Boyd , P. W. , Classen , A. T. , Crowther , T. W. , Danovaro , R. , Foreman , C. M. , Huisman , J. , Hutchins , D. A. , Jansson , J. K. , Karl , D. M. , Koskella , B. , Welch , D. B. M. , … Webster , N. S . ( 2019 ). Scientists’ warning to humanity: microorganisms and climate change . Nature Reviews Microbiology , 17 ( 9 ), 569 – 586 . doi: 10.1038/s41579-019-0222-5 OpenUrl CrossRef ↵ Chase , A. B. , Weihe , C. , & Martiny , J. B. H . ( 2021 ). Adaptive differentiation and rapid evolution of a soil bacterium along a climate gradient . Proceedings of the National Academy of Sciences , 118 ( 18 ), e2101254118 . doi: 10.1073/pnas.2101254118 OpenUrl Abstract / FREE Full Text ↵ Cordovez , V. , Dini-Andreote , F. , Carrión , V. J. , & Raaijmakers , J. M . ( 2019 ). Ecology and evolution of plant microbiomes . Annual Review of Microbiology , 73 ( 1 ), 1 – 20 . doi: 10.1146/annurev-micro-090817-062524 OpenUrl CrossRef ↵ Cunha , H. F. , Ferreira , É. D. , Tessarolo , G. , & Nabout , J. C . ( 2018 ). Host plant distributions and climate interact to affect the predicted geographic distribution of a Neotropical termite . Biotropica , 50 ( 4 ), 625 – 632 . doi: 10.1111/btp.12555 OpenUrl CrossRef ↵ Dastogeer , K. M. G. , Tumpa , F. H. , Sultana , A. , Akter , M. A. , & Chakraborty , A . ( 2020 ). Plant microbiome–an account of the factors that shape community composition and diversity . Current Plant Biology , 23 , 100161 . doi: 10.1016/j.cpb.2020.100161 OpenUrl CrossRef ↵ Davison , J. , Moora , M. , Semchenko , M. , Adenan , S. B. , Ahmed , T. , Akhmetzhanova , A. A. , Alatalo , J. M. , Al Quraishy , S. , Andriyanova , E. , Anslan , S. , Bahram , M. , Batbaatar , A. , Brown , C. , Bueno , C. G. , Cahill , J. , Cantero , J. J. , Casper , B. B. , Cherosov , M. , Chideh , S. , … Öpik , M. ( 2021 ). Temperature and pH define the realised niche space of arbuscular mycorrhizal fungi . New Phytologist , 231 ( 2 ), 763 – 776 . doi: 10.1111/nph.17240 OpenUrl CrossRef ↵ Delavaux , C. S. , Weigelt , P. , Dawson , W. , Duchicela , J. , Essl , F. , Kleunen , M. van , König , C. , Pergl , J. , Pyšek , P. , Stein , A. , Winter , M. , Schultz , P. , Kreft , H. , & Bever , J. D. ( 2019 ). Mycorrhizal fungi influence global plant biogeography . Nature Ecology & Evolution , 3 ( 3 ), 424 – 429 . doi: 10.1038/s41559-019-0823-4 OpenUrl CrossRef ↵ Dormann , C. F. , McPherson , J. M. , Araújo , M. B. , Bivand , R. , Bolliger , J. , Carl , G. , Davies , R. G. , Hirzel , A. , Jetz , W. , Kissling , W. D. , Kühn , I. , Ohlemüller , R. , Peres Neto , P. R. , Reineking , B. , Schröder , B. , Schurr , F. M. , & Wilson , R . ( 2007 ). Methods to account for spatial autocorrelation in the analysis of species distributional data: a review . Ecography , 30 ( 5 ), 609 – 628 . doi: 10.1111/j.2007.0906-7590.05171.x OpenUrl CrossRef Web of Science ↵ Dukes , J. S. , Pontius , J. , Orwig , D. , Garnas , J. R. , Rodgers , V. L. , Brazee , N. , Cooke , B. , Theoharides , K. A. , Stange , E. E. , Harrington , R. , Ehrenfeld , J. , Gurevitch , J. , Lerdau , M. , Stinson , K. , Wick , R. , & Ayres , M . ( 2009 ). Responses of insect pests, pathogens, and invasive plant species to climate change in the forests of northeastern North America: What can we predict? This article is one of a selection of papers from NE Forests 2100 : A Synthesis of Climate Change Impacts on Forests of the Northeastern US and Eastern Canada. Canadian Journal of Forest Research, 39(2), 231–248. doi: 10.1139/x08-171 OpenUrl CrossRef ↵ Elith , J. , & Leathwick , J. R . ( 2009 ). Species Distribution Models: Ecological Explanation and Prediction Across Space and Time . Annual Review of Ecology, Evolution, and Systematics , 40 ( 1 ), 677 – 697 . doi: 10.1146/annurev.ecolsys.110308.120159 OpenUrl CrossRef ↵ Finkel , O. M. , Burch , A. Y. , Elad , T. , Huse , S. M. , Lindow , S. E. , Post , A. F. , & Belkin , S . ( 2012 ). Distance-decay relationships partially determine diversity patterns of phyllosphere bacteria on tamrix trees across the sonoran desert . Applied and Environmental Microbiology , 78 ( 17 ), 6187 – 6193 . doi: 10.1128/aem.00888-12 OpenUrl Abstract / FREE Full Text ↵ Fisher , M. C. , Henk Daniel . A. , Briggs , C. J. , Brownstein , J. S. , Madoff , L. C. , McCraw , S. L. , & Gurr , S. J . ( 2012 ). Emerging fungal threats to animal, plant and ecosystem health . Nature , 484 ( 7393 ), 186 – 194 . doi: 10.1038/nature10947 OpenUrl CrossRef PubMed Web of Science ↵ Fitzpatrick , C. R. , Salas-González , I. , Conway , J. M. , Finkel , O. M. , Gilbert , S. , Russ , D. , Teixeira , P. J. P. L. , & Dangl , J. L . ( 2020 ). The plant microbiome: from ecology to reductionism and beyond . Annual Review of Microbiology , 74 ( 1 ), 1 – 20 . doi: 10.1146/annurev-micro-022620-014327 OpenUrl CrossRef ↵ Gloor , G. B. , Macklaim , J. M. , Pawlowsky-Glahn , V. , & Egozcue , J. J . ( 2017 ). Microbiome Datasets Are Compositional: And This Is Not Optional . Frontiers in Microbiology , 8 , 2224 . doi: 10.3389/fmicb.2017.02224 OpenUrl CrossRef PubMed ↵ Gorris , M. E. , Treseder , K. K. , Zender , C. S. , & Randerson , J. T . ( 2019 ). Expansion of Coccidioidomycosis Endemic Regions in the United States in Response to Climate Change . GeoHealth , 3 ( 10 ), 308 – 327 . doi: 10.1029/2019gh000209 OpenUrl CrossRef ↵ Gougherty , A. V. , & Davies , T. J . ( 2021 ). Towards a phylogenetic ecology of plant pests and pathogens . Philosophical Transactions of the Royal Society B , 376 ( 1837 ), 20200359 . doi: 10.1098/rstb.2020.0359 OpenUrl CrossRef ↵ Gougherty , A. V. , & Davies , T. J . ( 2022 ). A global analysis of tree pests and emerging pest threats . Proceedings of the National Academy of Sciences , 119 ( 13 ), e2113298119 . doi: 10.1073/pnas.2113298119 OpenUrl CrossRef ↵ Hao , T. , Guillera-Arroita , G. , May , T. W. , Lahoz-Monfort , J. J. , & Elith , J . ( 2020 ). Using Species Distribution Models For Fungi . Fungal Biology Reviews , 34 ( 2 ), 74 – 88 . doi: 10.1016/j.fbr.2020.01.002 OpenUrl CrossRef ↵ Harris , D. J . ( 2015 ). Generating realistic assemblages with a joint species distribution model . Methods in Ecology and Evolution , 6 ( 4 ), 465 – 473 . doi: 10.1111/2041-210x.12332 OpenUrl CrossRef ↵ Hawkes , C. V. , Bull , J. J. , & Lau , J. A . ( 2020 ). Symbiosis and stress: how plant microbiomes affect host evolution . Philosophical Transactions of the Royal Society B: Biological Sciences , 375 ( 1808 ), 20190590 . doi: 10.1098/rstb.2019.0590 OpenUrl CrossRef ↵ Jamil , T. , Ozinga , W. A. , Kleyer , M. , & Braak , C. J. F. ter . ( 2013 ). Selecting traits that explain species–environment relationships: a generalized linear mixed model approach . Journal of Vegetation Science , 24 ( 6 ), 988 – 1000 . doi: 10.1111/j.1654-1103.2012.12036.x OpenUrl CrossRef ↵ Kahn , A. K. , & Almeida , R. P. P . ( 2022 ). Phylogenetics of Historical Host Switches in a Bacterial Plant Pathogen . Applied and Environmental Microbiology , 88 ( 7 ), e02356 – 21 . doi: 10.1128/aem.02356-21 OpenUrl CrossRef ↵ Kim , J. Y. , Yi , M. , Kim , M. , Lee , S. , Moon , H. S. , Yong , D. , & Yong , T . ( 2021 ). Measuring the absolute abundance of the microbiome by adding yeast containing 16S rRNA gene from a hyperthermophile . MicrobiologyOpen , 10 ( 4 ), e1220 . doi: 10.1002/mbo3.1220 OpenUrl CrossRef ↵ Kivlin , S. N. , Hawkes , C. V. , Papeş , M. , Treseder , K. K. , & Averill , C . ( 2021 ). The future of microbial ecological niche theory and modeling . New Phytologist , 231 ( 2 ), 508 – 511 . doi: 10.1111/nph.17373 OpenUrl CrossRef ↵ Kool , J. , Tymchenko , L. , Shetty , S. A. , & Fuentes , S . ( 2023 ). Reducing bias in microbiome research: Comparing methods from sample collection to sequencing . Frontiers in Microbiology , 14 , 1094800 . doi: 10.3389/fmicb.2023.1094800 OpenUrl CrossRef ↵ Kuhn , T. , Cunze , S. , Kochmann , J. , & Klimpel , S . ( 2016 ). Environmental variables and definitive host distribution: a habitat suitability modelling for endohelminth parasites in the marine realm . Scientific Reports , 6 ( 1 ), 30246 . doi: 10.1038/srep30246 OpenUrl CrossRef ↵ Lajoie , G. , & Kembel , S. W . ( 2021 ). Host neighborhood shapes bacterial community assembly and specialization on tree species across a latitudinal gradient . Ecological Monographs . doi: 10.1002/ecm.1443 OpenUrl CrossRef ↵ Li , P. , Tedersoo , L. , Crowther , T. W. , Wang , B. , Shi , Y. , Kuang , L. , Li , T. , Wu , M. , Liu , M. , Luan , L. , Liu , J. , Li , D. , Li , Y. , Wang , S. , Saleem , M. , Dumbrell , A. J. , Li , Z. , & Jiang , J . ( 2023 ). Global diversity and biogeography of potential phytopathogenic fungi in a changing world . Nature Communications , 14 ( 1 ), 6482 . doi: 10.1038/s41467-023-42142-4 OpenUrl CrossRef ↵ Linde , S. van der , Suz , L. M. , Orme , C. D. L. , Cox , F. , Andreae , H. , Asi , E. , Atkinson , B. , Benham , S. , Carroll , C. , Cools , N. , Vos , B. D. , Dietrich , H.-P. , Eichhorn , J. , Gehrmann , J. , Grebenc , T. , Gweon , H. S. , Hansen , K. , Jacob , F. , Kristöfel , F. , … Bidartondo , M. I. ( 2018 ). Environment and host as large-scale controls of ectomycorrhizal fungi . Nature , 558 ( 7709 ), 243 – 248 . doi: 10.1038/s41586-018-0189-9 OpenUrl CrossRef ↵ Lozupone , C. A. , Stombaugh , J. , Gonzalez , A. , Ackermann , G. , Wendel , D. , Vázquez-Baeza , Y. , Jansson , J. K. , Gordon , J. I. , & Knight , R . ( 2013 ). Meta-analyses of studies of the human microbiota . Genome Research , 23 ( 10 ), 1704 – 1714 . doi: 10.1101/gr.151803.112 OpenUrl Abstract / FREE Full Text ↵ Marcot , B. G. , Scott , P. , & Burgess , T. I . ( 2023 ). Multivariate Bayesian analysis to predict invasiveness of Phytophthora pathogens . Ecosphere , 14 ( 6 ). doi: 10.1002/ecs2.4573 OpenUrl CrossRef ↵ McDonough , S. , & Holloway , P . ( 2021 ). Incorporating host-parasite biotic factors in species distribution models: Modelling the distribution of the castor bean tick, Ixodes ricinus . Irish Geography , 53 ( 2 ). doi: 10.55650/igj.2020.1416 OpenUrl CrossRef ↵ McLaren , M. R. , Willis , A. D. , & Callahan , B. J . ( 2019 ). Consistent and correctable bias in metagenomic sequencing experiments . ELife , 8 , e46923 . doi: 10.7554/elife.46923 OpenUrl CrossRef PubMed ↵ Methé , B. A. , Nelson , K. E. , Pop , M. , Creasy , H. H. , Giglio , M. G. , Huttenhower , C. , Gevers , D. , Petrosino , J. F. , Abubucker , S. , Badger , J. H. , Chinwalla , A. T. , Earl , A. M. , FitzGerald , M. G. , Fulton , R. S. , Hallsworth-Pepin , K. , Lobos , E. A. , Madupu , R. , Magrini , V. , Martin , J. C. , … White , O . ( 2012 ). A framework for human microbiome research . Nature , 486 ( 7402 ), 215 – 221 . doi: 10.1038/nature11209 OpenUrl CrossRef PubMed Web of Science ↵ Minard , G. , Tikhonov , G. , Ovaskainen , O. , & Saastamoinen , M . ( 2019 ). The microbiome of the Melitaea cinxia butterfly shows marked variation but is only little explained by the traits of the butterfly or its host plant . Environmental Microbiology , 21 ( 11 ), 4253 – 4269 . doi: 10.1111/1462-2920.14786 OpenUrl CrossRef ↵ Mod , H. K. , Buri , A. , Yashiro , E. , Guex , N. , Malard , L. , Pinto-Figueroa , E. , Pagni , M. , Niculita-Hirzel , H. , Meer, J. R. van der, & Guisan, A. ( 2021 ). Predicting spatial patterns of soil bacteria under current and future environmental conditions . The ISME Journal , 15 ( 9 ), 2547 – 2560 . doi: 10.1038/s41396-021-00947-5 OpenUrl CrossRef ↵ Morales-Castilla , I. , Pappalardo , P. , Farrell , M. J. , Aguirre , A. A. , Huang , S. , Gehman , A.-L. M. , Dallas , T. , Gravel , D. , & Davies , T. J . ( 2021 ). Forecasting parasite sharing under climate change . Philosophical Transactions of the Royal Society B , 376 ( 1837 ), 20200360 . doi: 10.1098/rstb.2020.0360 OpenUrl CrossRef ↵ Morton , J. T. , Marotz , C. , Washburne , A. , Silverman , J. , Zaramela , L. S. , Edlund , A. , Zengler , K. , & Knight , R . ( 2019 ). Establishing microbial composition measurement standards with reference frames . Nature Communications , 10 ( 1 ), 2719 . doi: 10.1038/s41467-019-10656-5 OpenUrl CrossRef PubMed ↵ Odriozola , I. , Abrego , N. , Tláskal , V. , Zrůstová , P. , Morais , D. , Větrovský , T. , Ovaskainen , O. , & Baldrian , P . ( 2021 ). Fungal Communities Are Important Determinants of Bacterial Community Composition in Deadwood . MSystems , 6 ( 1 ), e01017 – 20 . doi: 10.1128/msystems.01017-20 OpenUrl CrossRef ↵ Ovaskainen , O. , Roy , D. B. , Fox , R. , & Anderson , B. J . ( 2016 ). Uncovering hidden spatial structure in species communities with spatially explicit joint species distribution models . Methods in Ecology and Evolution , 7 ( 4 ), 428 – 436 . doi: 10.1111/2041-210x.12502 OpenUrl CrossRef ↵ Ovaskainen , O. , & Soininen , J . ( 2011 ). Making more out of sparse data: hierarchical modeling of species communities . Ecology , 92 ( 2 ), 289 – 295 . doi: 10.1890/10-1251.1 OpenUrl CrossRef PubMed Web of Science ↵ Ovaskainen , O. , Tikhonov , G. , Norberg , A. , Blanchet , F. G. , Duan , L. , Dunson , D. , Roslin , T. , & Abrego , N . ( 2017 ). How to make more out of community data? A conceptual framework and its implementation as models and software . Ecology Letters , 20 ( 5 ), 561 – 576 . doi: 10.1111/ele.12757 OpenUrl CrossRef ↵ Park , A. W. , Farrell , M. J. , Schmidt , J. P. , Huang , S. , Dallas , T. A. , Pappalardo , P. , Drake , J. M. , Stephens , P. R. , Poulin , R. , Nunn , C. L. , & Davies , T. J . ( 2018 ). Characterizing the phylogenetic specialism–generalism spectrum of mammal parasites . Proceedings of the Royal Society B: Biological Sciences , 285 ( 1874 ), 20172613 . doi: 10.1098/rspb.2017.2613 OpenUrl CrossRef PubMed ↵ Pattnaik , S. S. , Paramanantham , P. , & Busi , S . ( 2021 ). The Plant Microbiome in Sustainable Agriculture . 119 – 139 . doi: 10.1002/9781119505457.ch7 OpenUrl CrossRef ↵ Piché-Choquette , S. , Tláskal , V. , Vrška , T. , Jiráska , L. , Větrovský , T. , & Baldrian , P . ( 2023 ). Continuous microhabitats as crossroads of fungal communities in a primeval temperate forest . Soil Biology and Biochemistry , 187 , 109187 . doi: 10.1016/j.soilbio.2023.109187 OpenUrl CrossRef ↵ Pichler , M. , & Hartig , F . ( 2021 ). A new joint species distribution model for faster and more accurate inference of species associations from big community data . Methods in Ecology and Evolution , 12 ( 11 ), 2159 – 2173 . doi: 10.1111/2041-210x.13687 OpenUrl CrossRef ↵ Piwowarczyk , R. , & Kolanowska , M . ( 2023 ). Effect of global warming on the potential distribution of a holoparasitic plant (Phelypaea tournefortii): both climate and host distribution matter . Scientific Reports , 13 ( 1 ), 10741 . doi: 10.1038/s41598-023-37897-1 OpenUrl CrossRef ↵ Poggiato , G. , Münkemüller , T. , Bystrova , D. , Arbel , J. , Clark , J. S. , & Thuiller , W . ( 2021 ). On the Interpretations of Joint Modeling in Community Ecology . Trends in Ecology & Evolution , 36 ( 5 ), 391 – 401 . doi: 10.1016/j.tree.2021.01.002 OpenUrl CrossRef ↵ Pollock , L. J. , Morris , W. K. , & Vesk , P. A . ( 2012 ). The role of functional traits in species distributions revealed through a hierarchical model . Ecography , 35 ( 8 ), 716 – 725 . doi: 10.1111/j.1600-0587.2011.07085.x OpenUrl CrossRef Web of Science ↵ Pollock , L. J. , Tingley , R. , Morris , W. K. , Golding , N. , O’Hara , R. B. , Parris , K. M. , Vesk , P. A. , & McCarthy , M. A . ( 2014 ). Understanding co occurrence by modelling species simultaneously with a Joint Species Distribution Model (JSDM) . Methods in Ecology and Evolution , 5 ( 5 ), 397 – 406 . doi: 10.1111/2041-210x.12180 OpenUrl CrossRef ↵ Ramirez , K. S. , Knight , C. G. , Hollander , M. de , Brearley , F. Q. , Constantinides , B. , Cotton , A. , Creer , S. , Crowther , T. W. , Davison , J. , Delgado-Baquerizo , M. , Dorrepaal , E. , Elliott , D. R. , Fox , G. , Griffiths , R. I. , Hale , C. , Hartman , K. , Houlden , A. , Jones , D. L. , Krab , E. J. , … Vries , F. T. de . ( 2018 ). Detecting macroecological patterns in bacterial communities across independent studies of global soils . Nature Microbiology , 3 ( 2 ), 189 – 196 . doi: 10.1038/s41564-017-0062-x OpenUrl CrossRef ↵ Rue , H. , Martino , S. , & Chopin , N . ( 2009 ). Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations . Journal of the Royal Statistical Society , 71 ( 2 ), 319 – 392 . doi: 10.1111/j.1467-9868.2008.00700.x OpenUrl CrossRef PubMed Web of Science ↵ Schloss , P. D . ( 2024 ). Rarefaction is currently the best approach to control for uneven sequencing effort in amplicon sequence analyses. MSphere , e 0035423 . doi: 10.1128/msphere.00354-23 OpenUrl CrossRef ↵ Sengupta , P. , Sivabalan , S. K. M. , Mahesh , A. , Palanikumar , I. , Baskaran , D. K. K. , & Raman , K . ( 2023 ). Big Data for a Small World: A Review on Databases and Resources for Studying Microbiomes . Journal of the Indian Institute of Science , 103 ( 3 ), 891 – 907 . doi: 10.1007/s41745-023-00370-z OpenUrl CrossRef ↵ Sousa , J. M. de , Lourenço , M. , & Gordo , I. ( 2023 ). Horizontal gene transfer among host-associated microbes . Cell Host & Microbe , 31 ( 4 ), 513 – 527 . doi: 10.1016/j.chom.2023.03.017 OpenUrl CrossRef ↵ Stämmler , F. , Gläsner , J. , Hiergeist , A. , Holler , E. , Weber , D. , Oefner , P. J. , Gessner , A. , & Spang , R . ( 2016 ). Adjusting microbiome profiles for differences in microbial load by spike-in bacteria . Microbiome , 4 ( 1 ), 28 . doi: 10.1186/s40168-016-0175-0 OpenUrl CrossRef ↵ Thirkell , T. J. , Charters , M. D. , Elliott , A. J. , Sait , S. M. , & Field , K. J . ( 2017 ). Are mycorrhizal fungi our sustainable saviours? Considerations for achieving food security . Journal of Ecology , 105 ( 4 ), 921 – 929 . doi: 10.1111/1365-2745.12788 OpenUrl CrossRef ↵ Thompson , L. R. , Sanders , J. G. , McDonald , D. , Amir , A. , Ladau , J. , Locey , K. J. , Prill , R. J. , Tripathi , A. , Gibbons , S. M. , Ackermann , G. , Navas-Molina , J. A. , Janssen , S. , Kopylova , E. , Vázquez-Baeza , Y. , González , A. , Morton , J. T. , Mirarab , S. , Xu , Z. Z. , Jiang , L. , … Zhao , H . ( 2017 ). A communal catalogue reveals Earth’s multiscale microbial diversity . Nature , 551 ( 7681 ), 457 – 463 . doi: 10.1038/nature24621 OpenUrl CrossRef PubMed ↵ Tkacz , A. , Hortala , M. , & Poole , P. S . ( 2018 ). Absolute quantitation of microbiota abundance in environmental samples . Microbiome , 6 ( 1 ), 110 . doi: 10.1186/s40168-018-0491-7 OpenUrl CrossRef ↵ Trivedi , P. , Batista , B. D. , Bazany , K. E. , & Singh , B. K . ( 2022 ). Plant–microbiome interactions under a changing world: responses, consequences and perspectives . New Phytologist , 234 ( 6 ), 1951 – 1959 . doi: 10.1111/nph.18016 OpenUrl CrossRef ↵ Vandenkoornhuyse , P. , Quaiser , A. , Duhamel , M. , Van , A. L. , & Dufresne , A . ( 2015 ). The importance of the microbiome of the plant holobiont . New Phytologist , 206 ( 4 ), 1196 – 1206 . doi: 10.1111/nph.13312 OpenUrl CrossRef PubMed ↵ Větrovský , T. , Kohout , P. , Kopecký , M. , Machac , A. , Man , M. , Bahnmann , B. D. , Brabcová , V. , Choi , J. , Meszárošová , L. , Human , Z. R. , Lepinay , C. , Lladó , S. , López-Mondéjar , R. , Martinović , T. , Mašínová , T. , Morais , D. , Navrátilová , D. , Odriozola , I. , Štursová , M. , … Baldrian , P . ( 2019 ). A meta-analysis of global fungal distribution reveals climate-driven patterns . Nature Communications , 10 ( 1 ), 5142 . doi: 10.1038/s41467-019-13164-8 OpenUrl CrossRef ↵ Větrovský , T. , Kolaříková , Z. , Lepinay , C. , Hollá , S. A. , Davison , J. , Fleyberková , A. , Gromyko , A. , Jelínková , B. , Kolařík , M. , Krüger , M. , Lejsková , R. , Michalčíková , L. , Michalová , T. , Moora , M. , Moravcová , A. , Moulíková , Š. , Odriozola , I. , Öpik , M. , Pappová , M. , … Kohout , P . ( 2023 ). GlobalAMFungi: a global database of arbuscular mycorrhizal fungal occurrences from high throughput sequencing metabarcoding studies . New Phytologist , 240 ( 5 ), 2151 – 2163 . doi: 10.1111/nph.19283 OpenUrl CrossRef ↵ Větrovský , T. , Morais , D. , Kohout , P. , Lepinay , C. , Algora , C. , Hollá , S. A. , Bahnmann , B. D. , Bílohnědá , K. , Brabcová , V. , D’Alò , F. , Human , Z. R. , Jomura , M. , Kolařík , M. , Kvasničková , J. , Lladó , S. , López-Mondéjar , R. , Martinović , T. , Mašínová , T. , Meszárošová , L. , … Baldrian , P . ( 2020 ). GlobalFungi, a global database of fungal occurrences from high-throughput-sequencing metabarcoding studies . Scientific Data , 7 ( 1 ), 228 . doi: 10.1038/s41597-020-0567-7 OpenUrl CrossRef ↵ Vries , S. de , Stukenbrock , E. H. , & Rose , L. E. ( 2020 ). Rapid evolution in plant–microbe interactions – an evolutionary genomics perspective . New Phytologist , 226 ( 5 ), 1256 – 1262 . doi: 10.1111/nph.16458 OpenUrl CrossRef ↵ Wang , C.-J. , Li , Q.-F. , & Wan , J.-Z . ( 2019 ). Potential invasive plant expansion in global ecoregions under climate change . PeerJ , 7 , e6479 . doi: 10.7717/peerj.6479 OpenUrl CrossRef ↵ Wang , R. , Li , Q. , He , S. , Liu , Y. , Wang , M. , & Jiang , G . ( 2018 ). Modeling and mapping the current and future distribution of Pseudomonas syringae pv. actinidiae under climate change in China . PLoS ONE , 13 ( 2 ), e0192153 . doi: 10.1371/journal.pone.0192153 OpenUrl CrossRef ↵ Wang , Z. , Jiang , Y. , Zhang , M. , Chu , C. , Chen , Y. , Fang , S. , Jin , G. , Jiang , M. , Lian , J. , Li , Y. , Liu , Y. , Ma , K. , Mi , X. , Qiao , X. , Wang , X. , Wang , X. , Xu , H. , Ye , W. , Zhu , L. , … Kembel , S. W . ( 2023 ). Diversity and biogeography of plant phyllosphere bacteria are governed by latitude dependent mechanisms . New Phytologist . doi: 10.1111/nph.19235 OpenUrl CrossRef ↵ Wang , Z. , Lajoie , G. , Jiang , Y. , Zhang , M. , Chu , C. , Chen , Y. , Fang , S. , Jin , G. , Jiang , M. , Lian , J. , Li , Y. , Liu , Y. , Ma , K. , Mi , X. , Qiao , X. , Wang , X. , Wang , X. , Xu , H. , Ye , W. , Zhu , L. , … Kembel , S. W . ( in process ) Host specificity of plant-associated bacteria is negatively associated with genome size and host abundance along a latitudinal gradient . Ecology Letter . ↵ Warton , D. I. , Blanchet , F. G. , O’Hara , R. B. , Ovaskainen , O. , Taskinen , S. , Walker , S. C. , & Hui , F. K. C . ( 2015 ). So Many Variables: Joint Modeling in Community Ecology . Trends in Ecology & Evolution , 30 ( 12 ), 766 – 779 . doi: 10.1016/j.tree.2015.09.007 OpenUrl CrossRef PubMed ↵ Wilkinson , D. P. , Golding , N. , Guillera Arroita , G. , Tingley , R. , & McCarthy , M. A . ( 2021 ). Defining and evaluating predictions of joint species distribution models . Methods in Ecology and Evolution , 12 ( 3 ), 394 – 404 . doi: 10.1111/2041-210x.13518 OpenUrl CrossRef ↵ Wisz , M. S. , Pottier , J. , Kissling , W. D. , Pellissier , L. , Lenoir , J. , Damgaard , C. F. , Dormann , C. F. , Forchhammer , M. C. , Grytnes , J. , Guisan , A. , Heikkinen , R. K. , Høye , T. T. , Kühn , I. , Luoto , M. , Maiorano , L. , Nilsson , M. , Normand , S. , Öckinger , E. , Schmidt , N. M. , … Svenning , J . ( 2013 ). The role of biotic interactions in shaping distributions and realised assemblages of species: implications for species distribution modelling . Biological Reviews , 88 ( 1 ), 15 – 30 . doi: 10.1111/j.1469-185x.2012.00235.x OpenUrl CrossRef PubMed ↵ Zurell , D. , Zimmermann , N. E. , Gross , H. , Baltensweiler , A. , Sattler , T. , & Wüest , R. O . ( 2020 ). Testing species assemblage predictions from stacked and joint species distribution models . Journal of Biogeography , 47 ( 1 ), 101 – 113 . doi: 10.1111/jbi.13608 OpenUrl CrossRef View the discussion thread. Back to top Previous Next Posted July 16, 2024. Download PDF Email Thank you for your interest in spreading the word about bioRxiv. NOTE: Your email address is requested solely to identify you as the sender of this article. Your Email * Your Name * Send To * Enter multiple addresses on separate lines or separate them with commas. You are going to email the following Modeling plant-microbe interactions with species distribution models Message Subject (Your Name) has forwarded a page to you from bioRxiv Message Body (Your Name) thought you would like to see this page from the bioRxiv website. Your Personal Message CAPTCHA This question is for testing whether or not you are a human visitor and to prevent automated spam submissions. Share Modeling plant-microbe interactions with species distribution models Zihui Wang , Sarah Piché-Choquette , Jocelyn Lauzon , Sarah Ishak , Steven W. Kembel bioRxiv 2024.07.12.603317; doi: https://doi.org/10.1101/2024.07.12.603317 Share This Article: Copy Citation Tools Modeling plant-microbe interactions with species distribution models Zihui Wang , Sarah Piché-Choquette , Jocelyn Lauzon , Sarah Ishak , Steven W. Kembel bioRxiv 2024.07.12.603317; doi: https://doi.org/10.1101/2024.07.12.603317 Citation Manager Formats BibTeX Bookends EasyBib EndNote (tagged) EndNote 8 (xml) Medlars Mendeley Papers RefWorks Tagged Ref Manager RIS Zotero Tweet Widget Facebook Like Google Plus One Subject Area Ecology Subject Areas All Articles Animal Behavior and Cognition (7651) Biochemistry (17746) Bioengineering (13928) Bioinformatics (42066) Biophysics (21499) Cancer Biology (18650) Cell Biology (25579) Clinical Trials (138) Developmental Biology (13409) Ecology (19947) Epidemiology (2067) Evolutionary Biology (24374) Genetics (15633) Genomics (22557) Immunology (17775) Microbiology (40505) Molecular Biology (17217) Neuroscience (88796) Paleontology (667) Pathology (2845) Pharmacology and Toxicology (4836) Physiology (7664) Plant Biology (15179) Scientific Communication and Education (2047) Synthetic Biology (4304) Systems Biology (9839) Zoology (2272)
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.