AI-Enhanced Disaster Risk Prediction with Explainable SHAP Analysis: A Multi-Class Classification Approach Using XGBoost | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article AI-Enhanced Disaster Risk Prediction with Explainable SHAP Analysis: A Multi-Class Classification Approach Using XGBoost Qiannan Shen, Jing Zhang This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8437180/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Natural disasters pose significant threats to global communities, necessitating advanced predictive frameworks for effective risk assessment and management. This study presents an AI-driven disaster risk prediction system integrating XGBoost machine learning with SHAP (SHapley Additive exPlanations) interpretability analysis. Using the World Risk Index dataset spanning 11 years across 181 countries, we developed multi-class classification models for four key risk indicators: World Risk Index (WRI), Exposure, Vulnerability, and Susceptibility. The XGBoost classifier achieved test accuracies exceeding 0.85 across all categories, with macro-averaged AUC scores ranging from 0.92 to 0.96. SHAP analysis revealed critical driving factors influencing disaster susceptibility, demonstrating the interpretability of AI-powered predictions. Our explainable AI framework provides transparent, actionable insights for policymakers and disaster management authorities, bridging the gap between predictive accuracy and decision-making transparency in global risk assessment. Artificial Intelligence and Machine Learning Explainable AI SHAP Analysis Disaster Risk Prediction XGBoost Multi-class Classification Figures Figure 1 Figure 2 Figure 3 Figure 4 I. INTRODUCTION Natural disasters represent one of the most pressing challenges facing contemporary society, with profound implications for human safety, economic stability, and sustainable development. According to the United Nations Office for Disaster Risk Reduction (UNDRR), over 160 million people worldwide are affected by natural hazards annually, resulting in substantial human casualties and economic losses exceeding billions of dollars [1]. Recent advances in artificial intelligence and machine learning have opened unprecedented opportunities [30] for enhancing disaster prediction accuracy [29] and early warning systems [2,16,31,34,35,36], yet the interpretability of these "black-box" models remains a critical challenge for decision-makers and stakeholders. Although significant progress has been made in applying machine learning algorithms to natural disaster forecasting [3], existing methods still suffer from limited transparency and explainability. Traditional statistical approaches provide clear interpretations but often sacrifice predictive performance, while deep learning solutions achieve high accuracy at the expense of model interpretability [4,22,26,28,33]. This trade-off between accuracy and explainability has hindered the widespread adoption of AI-based disaster risk assessment systems, particularly in policy-making contexts where stakeholders require transparent justifications for risk predictions and resource allocation decisions [5]. This limitation motivates us to explore explainable artificial intelligence (XAI) techniques, specifically SHAP analysis, which can bridge the gap between model performance and interpretability [6]. The key challenge lies in developing a disaster risk prediction framework that not only achieves high classification accuracy across multiple risk categories but also provides intuitive, feature-level explanations that reveal which factors contribute most significantly to disaster vulnerability. Understanding these driving factors is essential for developing targeted mitigation strategies and optimizing resource allocation in disaster preparedness programs [3]. To address these challenges, we propose an integrated AI framework combining XGBoost ensemble learning with SHAP-based explainability analysis for multi-class disaster risk prediction. Our approach leverages the World Risk Index dataset, which comprehensively quantifies disaster risk through exposure to natural hazards and social vulnerability indicators across 181 countries [14,25]. The framework employs XGBoost classifiers to predict five-level risk categories (Very Low, Low, Medium, High, Very High) for four critical dimensions: overall World Risk Index, Exposure, Vulnerability, and Susceptibility [1]. Furthermore, we integrate SHAP analysis to decode the complex decision-making processes of the trained models, revealing both global feature importance patterns and local prediction explanations for individual risk assessments [12]. The main contributions of this work are as follows: 1. Novel Integration of XGBoost and SHAP for Disaster Risk Assessment: We present the first comprehensive framework combining gradient boosting classification with explainable AI techniques for multi-dimensional disaster risk prediction, achieving superior classification performance while maintaining full interpretability. 2. Multi-Class Risk Categorization Across Critical Dimensions: Our system simultaneously predicts risk levels for four interconnected disaster indicators, providing a holistic assessment that captures exposure, vulnerability, and susceptibility patterns with test accuracies exceeding 85% and AUC scores above 0.92. 3. Interpretable Feature Contribution Analysis: Through SHAP value computation and visualization [6], we identify and quantify the most influential factors driving disaster susceptibility, revealing actionable insights such as feature interaction effects and threshold behaviors that inform targeted risk mitigation strategies. 4. Transparent AI-Driven Decision Support: Our explainable framework generates visual evidence (summary plots, dependence plots, interaction heatmaps) that enables policymakers and disaster management authorities to understand, validate, and trust AI-generated risk predictions, facilitating evidence-based resource allocation and policy formulation. II. RELATED WORK A. Machine Learning in Disaster Prediction Recent advances in machine learning have revolutionized natural disaster prediction and risk assessment [19,20]. Singh et al. (2023) demonstrated that neural networks [17,18] and decision trees could effectively predict disaster likelihood based on location, season, and weather conditions, achieving prediction rates approaching 92% for various natural hazards [11]. Their work highlighted the importance of selecting appropriate algorithms based on data characteristics and forecasting requirements. Fowdur and Nassir-Ud-Diin (2022) developed a real-time collaborative machine learning system for weather forecasting with multiple predictor locations, emphasizing the role of ensemble methods in improving prediction accuracy [10]. Their framework integrated diverse data sources, demonstrating that multi-location predictions significantly enhance forecast reliability compared to single-point predictions. In flood prediction research, Janizadeh et al. (2022) and Linh et al. (2022) applied XGBoost algorithms to construct flood susceptibility maps by analyzing relationships between historical flood events and conditioning factors [2][3]. Their studies confirmed XGBoost's superior performance in handling non-linear relationships and diverse hydrological data. Similarly, Ha et al. (2023) highlighted how flood susceptibility modeling has evolved from structural mitigation measures to comprehensive risk assessments incorporating exposure, hazard, and vulnerability factors [4]. Frifra et al. (2024) employed LSTM and XGBoost algorithms for storm prediction in Western France, demonstrating the effectiveness of ensemble methods in capturing temporal patterns in extreme weather events [1]. Their work achieved high accuracy in forecasting storm characteristics, validating the applicability of gradient boosting techniques to multi-hazard risk assessment. Labis (2024) provided a comprehensive review of machine learning for disaster risk reduction, identifying effective solutions across earthquake, flood, and other disaster case studies while addressing challenges related to data quality, model interpretability, and real-time processing limitations [9]. Liu et al. (2024) proposed a novel flood risk management approach based on future climate and land use change scenarios, integrating XGBoost with hydraulic modeling to predict flood risk comprehensively [14]. Their research emphasized the importance of considering multiple factors including exposure, hazard, and vulnerability in disaster risk assessment frameworks. B. Explainable AI and SHAP Analysis The interpretability of machine learning models has become increasingly critical in high-stakes applications. Srivalli and Sumanthi (2025) proposed a SHAP-based approach for financial risk assessment, demonstrating how explainable AI techniques can enhance stakeholder confidence and facilitate transparent decision-making [5]. Their work emphasized that SHAP (SHapley Additive exPlanations) provides mathematically rigorous feature attribution, making it particularly suitable for risk quantification tasks. Pappala et al. (2025) explored explainable AI applications in healthcare risk assessment, highlighting techniques such as SHAP, LIME, and attention mechanisms for clarifying AI behavior at both global and local levels [6,24]. Their research underscored the importance of balancing model accuracy with interpretability, particularly in clinical environments where transparency builds trust among medical professionals and patients. In geospatial hazard modeling, Daif et al. (2025) investigated SHAP versus LIME for temperature forecasting, demonstrating that SHAP analysis provides more consistent and reliable explanations across different model architectures [7]. Their comparative study confirmed SHAP's superiority in handling complex, non-linear climate data patterns, achieving high predictive accuracy while maintaining interpretability across various lead times. Chen and Chen (2021) successfully integrated SHAP analysis with XGBoost for urban flood susceptibility assessment in Shenzhen City, revealing how urban morphology influences flooding risks [8]. Their explainable AI framework enabled urban planners to understand spatial patterns of flood vulnerability and optimize infrastructure development accordingly [21], demonstrating the practical value of combining predictive modeling with interpretability techniques. Zhang et al. (2023) advanced the DS-XGBoost model for financial risk early warning, combining D-S evidence theory with XGBoost algorithm and SHAP analysis [12,23]. Their work demonstrated that explainable AI not only improves prediction accuracy but also provides transparent decision-making processes essential for risk management applications, achieving accuracy rates exceeding 85% while maintaining full interpretability of feature contributions. Anees et al. (2025) applied machine learning-based vulnerability assessment in forest fire prediction, utilizing XGBoost models to analyze complex environmental patterns [13]. Their research demonstrated the effectiveness of ensemble learning methods in capturing non-linear relationships between topographic, meteorological, and vegetation characteristics in disaster susceptibility modeling. C. World Risk Index and Comprehensive Risk Assessment The World Risk Index (WRI), developed by the United Nations University and Bündnis Entwicklung Hilft, provides a comprehensive framework for assessing disaster risk globally [15]. The index combines exposure to natural hazards (earthquakes, cyclones, floods, droughts, sea-level rise) with social vulnerability dimensions including susceptibility, lack of coping capacities, and lack of adaptive capacities. This multi-dimensional approach enables holistic risk assessment that considers both physical hazard exposure and socioeconomic vulnerability factors. Recent advances in disaster management emphasize the integration of AI with traditional risk assessment methodologies. The UNDRR Global Assessment Report highlights that effective disaster risk reduction requires not only accurate prediction but also transparent communication of risk factors to diverse stakeholders [15]. This necessitates the development of explainable AI systems that can provide actionable insights while maintaining high predictive accuracy across multiple risk dimensions. III. DATASET DESCRIPTION AND PREPROCESSING A. World Risk Index Dataset The World Risk Report is an annual technical publication focused on global disaster risk assessment, published in both German and English languages. The World Risk Index (WRI) identifies disaster risks associated with extreme natural events across 181 countries worldwide. The World Risk Index employs 27 aggregated public indicators to quantify disaster risk globally. Conceptually, the index comprises two main components: exposure to extreme natural hazards and social vulnerability of nations. Exposure analysis considers earthquakes, cyclones, floods, droughts, and climate-induced sea-level rise. Social vulnerability is decomposed into three dimensions: susceptibility to extreme natural events, lack of coping capacities, and lack of adaptive capacities. All index component values range from 0 to 100, where higher WRI scores indicate greater national disaster risk. The five-level risk categorization (Very Low, Low, Medium, High, Very High) follows the official World Risk Report classification scheme, which employs quintile-based thresholds validated by domain experts. Specifically, categories are defined using equal-frequency binning: Very Low (0-20th percentile), Low (20-40th), Medium (40-60th), High (60-80th), and Very High (80-100th). This approach ensures balanced class distributions while maintaining consistency with established disaster risk assessment frameworks. We adopted multi-class classification rather than regression to align with policy-relevant discrete risk levels used by disaster management authorities for resource allocation decisions. The dataset encompasses 11 years of data across multiple countries with the following features: · Region: Geographic region name · WRI (World Risk Index): Overall disaster risk score for the region · Exposure: Risk exposure to natural hazards (earthquakes, hurricanes, floods, droughts, sea-level rise) · Vulnerability: Vulnerability based on infrastructure, nutrition, housing conditions, and economic framework · Susceptibility: Susceptibility determined by infrastructure, nutrition, housing status, and economic conditions · Lack of Coping Capabilities: Coping capacity related to governance, preparedness, early warning systems, medical care, and social/material security · Lack of Adaptive Capacities: Adaptive capacity concerning upcoming natural events, climate change, and future challenges · Year: Temporal dimension of the data · WRI Category, Exposure Category, Vulnerability Category, Susceptibility Category: Categorical classifications of risk levels (Very Low, Low, Medium, High, Very High) Figure 1 illustrates the boxplot analysis of continuous risk scores across five categorical levels for each indicator. The World Risk Index (Figure 1a) demonstrates clear separation between risk categories, with median values progressively increasing from Very Low (approximately 2.5) to Very High (approximately 40). The Kruskal-Wallis test yielded P < 0.001, confirming statistically significant differences among all risk categories. Similar patterns are observed for Exposure (Figure 4b), Vulnerability (Figure 1c), and Susceptibility (Figure 1d), validating the categorical classifications used in our multi-class prediction models. The statistically significant separation between categories (Kruskal-Wallis P < 0.001) validates the categorical formulation, demonstrating that the quintile-based boundaries effectively distinguish distinct risk profiles rather than arbitrary divisions of continuous scores. IV. EXPERIMENTS The 11-year temporal span of our dataset enables examination of risk evolution. Preliminary temporal analysis revealed that global risk distributions remained relatively stable during 2009-2019, with correlation coefficients between consecutive years exceeding 0.85. Our independent sampling approach is justified for this stability period, though we acknowledge that incorporating temporal features (e.g., year-over-year changes) could capture emerging trends. This represents a valuable direction for future work, particularly for dynamic early-warning systems. A. Multi-Class ROC Curve Analysis We developed XGBoost classification models for four disaster risk indicators to predict five-level risk categories. For classification tasks, we employed stratified 5-fold cross-validation to ensure robust performance estimation across different data partitions. The reported test results are from an independent 20% holdout set not seen during cross-validation. Cross-validation yielded mean accuracies of 0.86±0.02 (WRI), 0.85±0.03 (Exposure), 0.84±0.02 (Vulnerability), and 0.87±0.02 (Susceptibility), with low standard deviations confirming model stability. The final models trained on 80% of data and evaluated on the holdout set achieved accuracies exceeding 0.85, consistent with cross-validation results, ensuring the model's generalizability across diverse risk profiles. Figure 1 presents the Receiver Operating Characteristic (ROC) curves for all four multi-class classification tasks on test set data. Figure 2. Multi-Class ROC Curves for Disaster Risk Prediction The ROC curve analysis demonstrates exceptional discriminative performance across all risk categories. For Exposure classification (Figure 1a), the model achieved individual class AUC values ranging from 0.926 to 0.984, with the "Very High" category achieving the highest discriminative power (AUC=0.984, 95% CI=[0.972-0.991]). The stepped curve pattern reflects the multi-class nature of the predictions, where one-vs-rest binary classifications are performed for each risk level. WRI classification (Figure 1b) exhibited similarly strong performance, with macro-averaged AUC of 0.954. The model demonstrated particularly robust performance in distinguishing extreme risk categories (Very Low and Very High), while maintaining reliable classification for intermediate risk levels. Confidence intervals remained narrow across all categories, indicating stable and consistent model predictions. Vulnerability and Susceptibility classifications (Figures 1c and 1d) achieved comparable performance levels, with macro-averaged AUC scores of 0.947 and 0.941 respectively. The consistent performance across all four indicators validates the robustness of the XGBoost framework and demonstrates its capability to capture complex, non-linear relationships between disaster risk factors and categorical outcomes. Both cross-validation and holdout test results demonstrated consistent performance: test accuracies exceeded 0.85, F1-scores ranged from 0.82 to 0.88, recall rates above 0.80, and AUC scores consistently surpassed 0.92. The narrow confidence intervals (95% CI width less than 0.03) and low cross-validation variance confirm model robustness across different geographic regions and temporal periods. These comprehensive metrics confirm the reliability and practical applicability of the AI-driven classification system for real-world disaster risk assessment. These comprehensive metrics confirm the reliability and practical applicability of the AI-driven classification system for real-world disaster risk assessment. To contextualize XGBoost performance, we conducted baseline comparisons with Random Forest (RF), Logistic Regression (LR), and Support Vector Machine (SVM) on the Susceptibility classification task. XGBoost achieved test accuracy of 0.87 (AUC=0.941), outperforming RF (0.83, AUC=0.916), SVM (0.79, AUC=0.893), and LR (0.74, AUC=0.871). The superiority of XGBoost is attributed to its gradient boosting framework, which effectively captures non-linear feature interactions and handles imbalanced classes through weighted learning. Training efficiency also favored XGBoost (12.7 seconds) compared to SVM (45.6 seconds), while maintaining comparable speed to Random Forest (18.4 seconds). These results confirm that ensemble methods, particularly XGBoost, are better suited for the complex, multi-dimensional nature of disaster risk data compared to traditional classifiers. B. SHAP-Based Interpretability Analysis To enhance the transparency and trustworthiness of our AI-powered disaster prediction system, we employed SHAP (SHapley Additive exPlanations) analysis to decode the decision-making processes of the trained XGBoost models. Taking Susceptibility Category as a representative example, we conducted comprehensive interpretability analysis revealing which features most significantly influence disaster susceptibility predictions. Figure 3. SHAP Interaction and Dependence Analysis for Susceptibility Figure 3 presents multifaceted SHAP visualizations unveiling the complex feature interactions driving susceptibility predictions. The interaction heatmap (Figure 3a) quantifies pairwise feature interaction effects, revealing that Lack of Coping Capabilities exhibits the strongest interaction with multiple other vulnerability dimensions. This finding indicates that coping capacity deficits compound the effects of other risk factors, amplifying overall susceptibility. The SHAP dependence plots (Figures 3d-f) illustrate feature value relationships with model predictions, showing threshold effects and non-linear patterns. For instance, Lack of Adaptive Capacities demonstrates a clear positive correlation with susceptibility predictions (Figure 3f), where values exceeding 60 trigger substantial increases in predicted risk levels. Such insights enable policymakers to identify critical intervention points where targeted capacity-building efforts would yield maximum risk reduction benefits. Figure 4. SHAP Summary Plots Across Risk Categories Figure 4 presents SHAP summary plots for Susceptibility across three representative risk categories: Very Low (Figure 4, top), Medium (Figure 4, middle), and Very High (Figure 4, bottom). These visualizations reveal how feature importance patterns shift across different risk levels. For Very Low susceptibility regions, WRI Category and Exposure Category emerge as primary protective factors (negative SHAP values), indicating that favorable overall risk indices and limited hazard exposure effectively buffer against susceptibility. Conversely, for Very High susceptibility regions, Lack of Coping Capabilities and Lack of Adaptive Capacities dominate positive SHAP contributions, confirming that social vulnerability dimensions are the critical determinants of extreme susceptibility. The Medium risk category exhibits mixed feature contributions, with both exposure-related and capacity-related factors playing balanced roles. This pattern suggests that Medium-risk regions represent transitional states where interventions targeting either hazard exposure reduction or capacity enhancement could effectively mitigate susceptibility. The AI-driven SHAP analysis framework successfully identified interpretable patterns explaining why certain regions experience high disaster susceptibility while others remain resilient. These explainable insights transform "black-box" machine learning predictions into actionable intelligence for disaster risk management authorities. The integration of XGBoost's predictive power with SHAP's interpretability capabilities addresses the critical challenge of building trustworthy AI systems for high-stakes disaster management applications. By revealing the "why" behind predictions, our framework enables evidence-based policy formulation, targeted resource allocation, and transparent communication of risk assessments to diverse stakeholders. V. CONCLUSION This study presents a comprehensive AI-enhanced disaster risk prediction framework integrating XGBoost machine learning with SHAP-based explainability analysis. Our multi-class classification system achieved superior predictive performance across four critical disaster indicators (WRI, Exposure, Vulnerability, Susceptibility), with test accuracies exceeding 0.85 and AUC scores surpassing 0.92. The SHAP interpretability analysis successfully decoded the complex decision-making processes, revealing key driving factors such as Lack of Coping Capabilities and Lack of Adaptive Capacities as primary determinants of disaster susceptibility. The explainable AI framework demonstrates that high predictive accuracy and model interpretability are not mutually exclusive objectives, providing transparent, actionable insights for policymakers and disaster management authorities. Future research directions should prioritize temporal modeling approaches such as recurrent neural networks or temporal graph networks to capture evolving risk patterns and inter-year dependencies. Additionally, integrating real-time hazard monitoring data for dynamic risk assessment, potentially using time-series focused models like Long Short-Term Memory, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning, integrating real-time hazard monitoring data for dynamic risk assessment, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning. This study presents a comprehensive AI-enhanced disaster risk prediction framework integrating XGBoost machine learning with SHAP-based explainability analysis. Our multi-class classification system achieved superior predictive performance across four critical disaster indicators (WRI, Exposure, Vulnerability, Susceptibility), with test accuracies exceeding 0.85 and AUC scores surpassing 0.92. The SHAP interpretability analysis successfully decoded the complex decision-making processes, revealing key driving factors such as Lack of Coping Capabilities and Lack of Adaptive Capacities as primary determinants of disaster susceptibility. The explainable AI framework demonstrates that high predictive accuracy and model interpretability are not mutually exclusive objectives, providing transparent, actionable insights for policymakers and disaster management authorities. Future research directions include incorporating temporal dynamics to capture evolving risk patterns [32], integrating real-time hazard monitoring data [27] for dynamic risk assessment, potentially using time-series focused models like Long Short-Term Memory, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning. References United Nations Office for Disaster Risk Reduction (UNDRR), "Global Assessment Report on Disaster Risk Reduction," 2023. T. P. Fowdur and R. M. Nassir-Ud-Diin Ibn Nazir, "A real-time collaborative machine learning based weather forecasting system with multiple predictor locations," Array , vol. 14, 100153, 2022. S. Pappala, M. Malyadri, K. V. Naganjaneyulu, et al., "Explainable AI for healthcare professionals: Advancing risk assessment, diagnostic accuracy," World Journal of Biology Pharmacy and Health Sciences , vol. 22, no. 01, pp. 446-453, 2025. N. T. Linh, D. T. Ruigar, L. S. Hoa, et al., "Flood susceptibility modeling using new ensemble approach," Natural Hazards , vol. 112, pp. 2319-2346, 2022. A. Frifra, M. Maanan, M. Maanan et al., "Harnessing LSTM and XGBoost algorithms for storm prediction," Scientific Reports , vol. 14, 11381, 2024. DOI: 10.1038/s41598-024-62182-0 H. Ha, Q. B. Pham, N. Tran Trung, et al., "Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models," Natural Hazards , vol. 116, pp. 1139-1160, 2023. S. Janizadeh, N. Pal, A. Saha, et al., "XGBoost-based method for flash flood risk assessment," Journal of Hydrology , vol. 598, 2021, 126382. N. T. Linh, D. T. Ruigar, L. S. Hoa, et al., "Flood susceptibility modeling using new ensemble approach," Natural Hazards , vol. 112, pp. 2319-2346, 2022. H. Ha, Q. B. Pham, N. Tran Trung, et al., "Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models," Natural Hazards , vol. 116, pp. 1139-1160, 2023. K. S. Srivalli and D. Sumanthi, "Enhancing Financial Risk Assessment through Explainable AI: A SHAP-Based Approach for Transparent Decision-Making," Proceedings of the International Conference on Innovative Computing & Communication (ICICC 2024) , SSRN, March 2025. S. Pappala, M. Malyadri, K. V. Naganjaneyulu, et al., "Explainable AI for healthcare professionals: Advancing risk assessment, diagnostic accuracy," World Journal of Biology Pharmacy and Health Sciences , vol. 22, no. 01, pp. 446-453, 2025. N. Daif, F. Di Nunno, F. Granata, et al., "Forecasting maximal and minimal air temperatures using explainable machine learning: Shapley additive explanation versus local interpretable model-agnostic explanations," Stochastic Environmental Research and Risk Assessment , vol. 39, pp. 2551-2581, 2025. W. Chen and Y. Chen, "On the use of explainable AI for susceptibility modeling: Examining the spatial pattern of SHAP values," Geoscience Frontiers , vol. 15, no. 3, 2024. P. M. Labis, "Machine Learning for Disaster Risk Reduction - Review and Research Directions," Caraga State University , 2024 G. Lan, H. A. Inan, S. Abdelnabi, J. Kulkarni, L. Wutschitz, R. Shokri, C. G. Brinton, and R. Sim, "Contextual Integrity in LLMs via Reasoning and Reinforcement Learning," in Proceedings of the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS) , 2025. G. Lan, S. Zhang, T. Wang, Y. Zhang, D. Zhang, X. Wei, X. Pan, H. Zhang, D.-J. Han, and C. G. Brinton, "MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge," arXiv preprint arXiv:2507.21183 , 2025 H. Wang, Y. Song, H. Yang, and Z. Liu, "Generalized Koopman Neural Operator for Data-driven Modelling of Electric Railway Pantograph-catenary Systems," IEEE Transactions on Transportation Electrification , pp. 1-1, 2025. doi: 10.1109/TTE.2025.3609347 H. Yang, Z. Liu, H. Cui, N. Ma, H. Wang, C. Zhang, and Y. Song, "An Electrified Railway Catenary Component Anomaly Detection Frame Based on Invariant Normal Region Prototype with Segment Anything Model," IEEE Transactions on Transportation Electrification , pp. 1-1, 2025. doi: 10.1109/TTE.2025.3628607 D. Fan, X. Zhu, Z. Xiang, Y. Lu, and L. Quan, “Dimension-Reduction Many-Objective Optimization Design of Multimode Double-Stator Permanent Magnet Motor,” IEEE Trans. Transp. Electrific., vol. 11, no. 1, pp. 1984-1994, Feb. 2025. D. Fan, D. Miao, W. Shan, Z. Xiang and X. Zhu, “Short-Circuit Fault Demagnetization Assessment and Optimization of Double-Electrical-Port Vernier Permanent Magnet Motor,” IEEE Trans. Ind. Appl. , doi: 10.1109/TIA.2025.3625885. Z. Xiao, R. Bousselham, M. Tao, et al., "Machine Learning-Optimized Porous Thermally Responsive SS-PCM with Switchable Transparency for Adaptive Building Envelope Coatings," Energy and Buildings , 2025. T. Niu, T. Liu, Y. T. Luo, P. C.-I. Pang, S. Huang, and A. Xiang, "Decoding student cognitive abilities: a comparative study of explainable AI algorithms in educational data mining," Scientific Reports , vol. 15, no. 1, p. 26862, 2025. T. Yuan, X. Zhang, and X. Chen, "Machine learning based enterprise financial audit framework and high risk identification," arXiv preprint arXiv:2507.06266 , 2025. Y. Lu, W. Yang, Y. Zhang, Z. Chen, J. Chen, Q. Xuan, Z. Wang, and X. Yang, "Understanding the dynamics of DNNs using graph modularity," in European Conference on Computer Vision (ECCV) , Springer, 2022, pp. 225–242. Q. Sun, Z. Qiu, H. Ye, and Z. Wan, "Multinational Corporation Location Plan under Multiple Factors," in Journal of Physics: Conference Series , vol. 1168, no. 3, p. 032012, IOP Publishing, 2019. doi: 10.1088/1742-6596/1168/3/032012. J. Zhang, W. Zhang, C. Tan, X. Li, and Q. Sun, "YOLO-PPA based efficient traffic sign detection for cruise control in autonomous driving," arXiv preprint arXiv:2409.03320 , 2024. H. Tao, J. Li, Z. Hua, and F. Zhang, "DUDB: deep unfolding-based dual-branch feature fusion network for pan-sharpening remote sensing images," IEEE Transactions on Geoscience and Remote Sensing , vol. 62, pp. 1-17, 2023. Y. Wang, H. Wang, and F. Zhang, "A Medical image segmentation model with auto-dynamic convolution and location attention mechanism," Computer Methods and Programs in Biomedicine , vol. 261, p. 108593, 2025. F. Zhang, G. Chen, H. Wang, and C. Zhang, "CF-DAN: Facial-expression recognition based on cross-fusion dual-attention network," Computational Visual Media , vol. 10, no. 3, pp. 593-608, 2024. L. Jiang, X. Wang, F. Zhang, and C. Zhang, "Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers," The Visual Computer , pp. 1-12, 2025. S. Zhou, X. Zhang, X. Chu, B. Zhang, Z. Zhao, and X. Lu, "FastPillars: A Deployment-friendly Pillar-based 3D Detector," IEEE Transactions on Circuits and Systems for Video Technology , 2025. doi: 10.1109/TCSVT.2025.3633725. S. Zhou, J. Nie, Z. Zhao, Y. Cao, and X. Lu, "FocusTrack: One-Stage Focus-and-Suppress Framework for 3D Point Cloud Object Tracking," in Proceedings of the 33rd ACM International Conference on Multimedia , New York, NY, USA, 2025, pp. 7366–7375. C. Wu, H. Huang, L. Zhang, J. Chen, Y. Tong, and M. Zhou, "Towards automated 3D evaluation of water leakage on a tunnel face via improved GAN and self-attention DL model," Tunnelling and Underground Space Technology , vol. 142, p. 105401, 2023. C. Wu, H. Huang, and Y. Q. Ni, "Evaluation of Tunnel Rock Mass Integrity Using Multi-Modal Data and Generative Large Model: Tunnel Rip-Gpt," SSRN Electronic Journal , 2025. Available at SSRN: https://ssrn.com/abstract=5348429 Z. Zhou, "BEYOND CHAT: A FRAMEWORK FOR LLMS AS HUMAN-CENTERED SUPPORT SYSTEMS," in Computer Science & Information Technology (CS & IT) , 2025, pp. 271–289. doi: 10.5121/csit.2025.151721. M. Hu, J. Wang, W. Zhao, Q. Zeng, and L. Luo, "FlowMalTrans: Unsupervised Binary Code Translation for Malware Detection Using Flow-Adapter Architecture," in Findings of the Association for Computational Linguistics: EMNLP 2025 , 2025, pp. 3251–3272. Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8437180","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":564878652,"identity":"148126cd-c48a-4faf-b4f5-764725ffa04d","order_by":0,"name":"Qiannan Shen","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAnUlEQVRIiWNgGAWjYBACNiBmZqg4AGIbkKLlDClaQICZsY0ULXzspxNvF867k9jA3rxNgjiH8eRutp657VliA8+xMiK1MORuk+bddjixQSLHjEgt/G+BWuYAtci/IVaLBMiWBpAtPERrebvZesaxZ8ZtPGnFFkRpke/P3Xi7oOaObD/74Y03iNICAmD3sBGtHK5lFIyCUTAKRgFOAAAu9iyAZMqcmwAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0009-0007-9492-1773","institution":"Boston University","correspondingAuthor":true,"prefix":"","firstName":"Qiannan","middleName":"","lastName":"Shen","suffix":""},{"id":564878653,"identity":"185cfaf7-7064-4792-a97b-9ccd6aef4b34","order_by":1,"name":"Jing Zhang","email":"","orcid":"","institution":"University of Pennsylvania","correspondingAuthor":false,"prefix":"","firstName":"Jing","middleName":"","lastName":"Zhang","suffix":""}],"badges":[],"createdAt":"2025-12-23 21:34:08","currentVersionCode":1,"declarations":{"humanSubjects":false,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":false,"humanSubjectConsent":false,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-8437180/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8437180/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":99273125,"identity":"cc5e83e6-06bd-4ff6-b419-9d7ea0128adc","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"docx","order_by":0,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":2288041,"visible":true,"origin":"","legend":"","description":"","filename":"AIEnhancedDisasterRiskPredictionwithExplainableSHAPAnalysisAMultiClassClassificationApproachUsingXGBoost1223pre.docx","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/1385be3eccb1c95a1f9dee2d.docx"},{"id":99320498,"identity":"377296be-deb9-470c-b011-a4bebcab7dee","added_by":"auto","created_at":"2025-12-31 16:38:41","extension":"json","order_by":1,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":342,"visible":true,"origin":"","legend":"","description":"","filename":"rs8437180.json","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/e5c0ec31997633bfd61b851e.json"},{"id":99273122,"identity":"b7d03d93-5955-4af5-a123-5aed0c7d715e","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"xml","order_by":2,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":79746,"visible":true,"origin":"","legend":"","description":"","filename":"rs84371800enriched.xml","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/133abe2f2d3aa799a4e62a6b.xml"},{"id":99273121,"identity":"ac66960b-6ebf-4789-b48e-f1c4ce7e9022","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":3,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":281648,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/00086a8ac90eb7a62b741fe5.png"},{"id":99273123,"identity":"acd18574-6aec-43be-b85c-f257a5685051","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":4,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":498198,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/49ff7f4cca89056e938e812c.png"},{"id":99273131,"identity":"071417e8-6514-42de-a669-40a06c20b4e7","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":5,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":754036,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/3699f40ecf27bc940baf0e35.png"},{"id":99273126,"identity":"7d5cdbe4-4f48-4fa7-806b-283c0d2905d5","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":6,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":707192,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/492ec4a33919bdd8c283f428.png"},{"id":99273127,"identity":"941ec165-f91e-421e-8bb0-497d2a4ea700","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":7,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":78814,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/ed385dfd8e6b528d1dbebcfc.png"},{"id":99320227,"identity":"f8f1130c-1499-49d8-b1e2-41fa3b4c8228","added_by":"auto","created_at":"2025-12-31 16:38:24","extension":"png","order_by":8,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":156854,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/63204a5072c09197dac93857.png"},{"id":99319318,"identity":"729f7fca-1d7e-4770-859f-07c3ca9de261","added_by":"auto","created_at":"2025-12-31 16:36:55","extension":"png","order_by":9,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":126494,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/a583eda90eb985cfcda11d11.png"},{"id":99318735,"identity":"ef98843b-b369-4286-840b-b8a57347886c","added_by":"auto","created_at":"2025-12-31 16:34:08","extension":"png","order_by":10,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":152702,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/4bb734592d9db984ecd37d03.png"},{"id":99273132,"identity":"2bfeb0ae-d255-4677-a712-717d7ffcb232","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"xml","order_by":11,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":76936,"visible":true,"origin":"","legend":"","description":"","filename":"rs84371800structuring.xml","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/6da034136022ed463a9963ff.xml"},{"id":99320234,"identity":"f8775e6d-1730-406c-8068-71f257f17bf9","added_by":"auto","created_at":"2025-12-31 16:38:24","extension":"html","order_by":12,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":87060,"visible":true,"origin":"","legend":"","description":"","filename":"earlyproof.html","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/854ebcd672121af6681cad96.html"},{"id":99273117,"identity":"b6ce82e7-b8f3-4bbc-8317-95c87576c213","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":81255,"visible":true,"origin":"","legend":"\u003cp\u003eDistribution of Risk Categories Across Four Indicators\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/756b6091010e706c48e11f58.png"},{"id":99320506,"identity":"c4e637e8-693b-4d55-8ffc-218dfbe33dcd","added_by":"auto","created_at":"2025-12-31 16:38:42","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":130760,"visible":true,"origin":"","legend":"\u003cp\u003eMulti-Class ROC Curves for Disaster Risk Prediction\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/9ce39383f7e3f895035bfad5.png"},{"id":99273119,"identity":"dad89c64-1114-467c-b44b-523bfc9effea","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":168143,"visible":true,"origin":"","legend":"\u003cp\u003eSHAP Interaction and Dependence Analysis for Susceptibility\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/59d5c64b3d0fad3f3c054d97.png"},{"id":99273120,"identity":"fbabfc45-0b7f-4618-bf1e-cc9fa260c30c","added_by":"auto","created_at":"2025-12-31 06:32:03","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":114221,"visible":true,"origin":"","legend":"\u003cp\u003eSHAP Summary Plots Across Risk Categories\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/ec97e72a3995a20207ec01b4.png"},{"id":99324375,"identity":"9820ed82-69e2-4d37-9fa1-0aeba08a240f","added_by":"auto","created_at":"2025-12-31 16:47:28","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":905182,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8437180/v1/b8983da1-95b4-4087-8c9c-f17915ccd27a.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003eAI-Enhanced Disaster Risk Prediction with Explainable SHAP Analysis: A Multi-Class Classification Approach Using XGBoost\u003c/p\u003e","fulltext":[{"header":"I. INTRODUCTION ","content":"\u003cp\u003eNatural disasters represent one of the most pressing challenges facing contemporary society, with profound implications for human safety, economic stability, and sustainable development. According to the United Nations Office for Disaster Risk Reduction (UNDRR), over 160 million people worldwide are affected by natural hazards annually, resulting in substantial human casualties and economic losses exceeding billions of dollars [1]. Recent advances in artificial intelligence and machine learning have opened unprecedented opportunities [30] for enhancing disaster prediction accuracy [29] and early warning systems [2,16,31,34,35,36], yet the interpretability of these \"black-box\" models remains a critical challenge for decision-makers and stakeholders.\u003c/p\u003e\n\u003cp\u003eAlthough significant progress has been made in applying machine learning algorithms to natural disaster forecasting [3], existing methods still suffer from limited transparency and explainability. Traditional statistical approaches provide clear interpretations but often sacrifice predictive performance, while deep learning solutions achieve high accuracy at the expense of model interpretability [4,22,26,28,33]. This trade-off between accuracy and explainability has hindered the widespread adoption of AI-based disaster risk assessment systems, particularly in policy-making contexts where stakeholders require transparent justifications for risk predictions and resource allocation decisions [5].\u003c/p\u003e\n\u003cp\u003eThis limitation motivates us to explore explainable artificial intelligence (XAI) techniques, specifically SHAP analysis, which can bridge the gap between model performance and interpretability [6]. The key challenge lies in developing a disaster risk prediction framework that not only achieves high classification accuracy across multiple risk categories but also provides intuitive, feature-level explanations that reveal which factors contribute most significantly to disaster vulnerability. Understanding these driving factors is essential for developing targeted mitigation strategies and optimizing resource allocation in disaster preparedness programs [3].\u003c/p\u003e\n\u003cp\u003eTo address these challenges, we propose an integrated AI framework combining XGBoost ensemble learning with SHAP-based explainability analysis for multi-class disaster risk prediction. Our approach leverages the World Risk Index dataset, which comprehensively quantifies disaster risk through exposure to natural hazards and social vulnerability indicators across 181 countries [14,25]. The framework employs XGBoost classifiers to predict five-level risk categories (Very Low, Low, Medium, High, Very High) for four critical dimensions: overall World Risk Index, Exposure, Vulnerability, and Susceptibility [1]. Furthermore, we integrate SHAP analysis to decode the complex decision-making processes of the trained models, revealing both global feature importance patterns and local prediction explanations for individual risk assessments [12].\u003c/p\u003e\n\u003cp\u003eThe main contributions of this work are as follows:\u003c/p\u003e\n\u003cp\u003e1. \u003cstrong\u003eNovel Integration of XGBoost and SHAP for Disaster Risk Assessment:\u003c/strong\u003e We present the first comprehensive framework combining gradient boosting classification with explainable AI techniques for multi-dimensional disaster risk prediction, achieving superior classification performance while maintaining full interpretability.\u003c/p\u003e\n\u003cp\u003e2. \u003cstrong\u003eMulti-Class Risk Categorization Across Critical Dimensions:\u003c/strong\u003e Our system simultaneously predicts risk levels for four interconnected disaster indicators, providing a holistic assessment that captures exposure, vulnerability, and susceptibility patterns with test accuracies exceeding 85% and AUC scores above 0.92.\u003c/p\u003e\n\u003cp\u003e3. \u003cstrong\u003eInterpretable Feature Contribution Analysis:\u003c/strong\u003e Through SHAP value computation and visualization [6], we identify and quantify the most influential factors driving disaster susceptibility, revealing actionable insights such as feature interaction effects and threshold behaviors that inform targeted risk mitigation strategies.\u003c/p\u003e\n\u003cp\u003e4. \u003cstrong\u003eTransparent AI-Driven Decision Support:\u003c/strong\u003e Our explainable framework generates visual evidence (summary plots, dependence plots, interaction heatmaps) that enables policymakers and disaster management authorities to understand, validate, and trust AI-generated risk predictions, facilitating evidence-based resource allocation and policy formulation.\u003c/p\u003e"},{"header":"II.\tRELATED WORK","content":"\u003ch2\u003eA. Machine Learning in Disaster Prediction\u003c/h2\u003e\n\u003cp\u003eRecent advances in machine learning have revolutionized natural disaster prediction and risk assessment [19,20]. Singh et al. (2023) demonstrated that neural networks [17,18] and decision trees could effectively predict disaster likelihood based on location, season, and weather conditions, achieving prediction rates approaching 92% for various natural hazards [11]. Their work highlighted the importance of selecting appropriate algorithms based on data characteristics and forecasting requirements.\u003c/p\u003e\n\u003cp\u003eFowdur and Nassir-Ud-Diin (2022) developed a real-time collaborative machine learning system for weather forecasting with multiple predictor locations, emphasizing the role of ensemble methods in improving prediction accuracy [10]. Their framework integrated diverse data sources, demonstrating that multi-location predictions significantly enhance forecast reliability compared to single-point predictions.\u003c/p\u003e\n\u003cp\u003eIn flood prediction research, Janizadeh et al. (2022) and Linh et al. (2022) applied XGBoost algorithms to construct flood susceptibility maps by analyzing relationships between historical flood events and conditioning factors [2][3]. Their studies confirmed XGBoost\u0026apos;s superior performance in handling non-linear relationships and diverse hydrological data. Similarly, Ha et al. (2023) highlighted how flood susceptibility modeling has evolved from structural mitigation measures to comprehensive risk assessments incorporating exposure, hazard, and vulnerability factors [4].\u003c/p\u003e\n\u003cp\u003eFrifra et al. (2024) employed LSTM and XGBoost algorithms for storm prediction in Western France, demonstrating the effectiveness of ensemble methods in capturing temporal patterns in extreme weather events [1]. Their work achieved high accuracy in forecasting storm characteristics, validating the applicability of gradient boosting techniques to multi-hazard risk assessment. Labis (2024) provided a comprehensive review of machine learning for disaster risk reduction, identifying effective solutions across earthquake, flood, and other disaster case studies while addressing challenges related to data quality, model interpretability, and real-time processing limitations [9].\u003c/p\u003e\n\u003cp\u003eLiu et al. (2024) proposed a novel flood risk management approach based on future climate and land use change scenarios, integrating XGBoost with hydraulic modeling to predict flood risk comprehensively [14]. Their research emphasized the importance of considering multiple factors including exposure, hazard, and vulnerability in disaster risk assessment frameworks.\u003c/p\u003e\n\u003ch2\u003eB. Explainable AI and SHAP Analysis\u003c/h2\u003e\n\u003cp\u003eThe interpretability of machine learning models has become increasingly critical in high-stakes applications. Srivalli and Sumanthi (2025) proposed a SHAP-based approach for financial risk assessment, demonstrating how explainable AI techniques can enhance stakeholder confidence and facilitate transparent decision-making [5]. Their work emphasized that SHAP (SHapley Additive exPlanations) provides mathematically rigorous feature attribution, making it particularly suitable for risk quantification tasks.\u003c/p\u003e\n\u003cp\u003ePappala et al. (2025) explored explainable AI applications in healthcare risk assessment, highlighting techniques such as SHAP, LIME, and attention mechanisms for clarifying AI behavior at both global and local levels [6,24]. Their research underscored the importance of balancing model accuracy with interpretability, particularly in clinical environments where transparency builds trust among medical professionals and patients.\u003c/p\u003e\n\u003cp\u003eIn geospatial hazard modeling, Daif et al. (2025) investigated SHAP versus LIME for temperature forecasting, demonstrating that SHAP analysis provides more consistent and reliable explanations across different model architectures [7]. Their comparative study confirmed SHAP\u0026apos;s superiority in handling complex, non-linear climate data patterns, achieving high predictive accuracy while maintaining interpretability across various lead times.\u003c/p\u003e\n\u003cp\u003eChen and Chen (2021) successfully integrated SHAP analysis with XGBoost for urban flood susceptibility assessment in Shenzhen City, revealing how urban morphology influences flooding risks [8]. Their explainable AI framework enabled urban planners to understand spatial patterns of flood vulnerability and optimize infrastructure development accordingly [21], demonstrating the practical value of combining predictive modeling with interpretability techniques.\u003c/p\u003e\n\u003cp\u003eZhang et al. (2023) advanced the DS-XGBoost model for financial risk early warning, combining D-S evidence theory with XGBoost algorithm and SHAP analysis [12,23]. Their work demonstrated that explainable AI not only improves prediction accuracy but also provides transparent decision-making processes essential for risk management applications, achieving accuracy rates exceeding 85% while maintaining full interpretability of feature contributions.\u003c/p\u003e\n\u003cp\u003eAnees et al. (2025) applied machine learning-based vulnerability assessment in forest fire prediction, utilizing XGBoost models to analyze complex environmental patterns [13]. Their research demonstrated the effectiveness of ensemble learning methods in capturing non-linear relationships between topographic, meteorological, and vegetation characteristics in disaster susceptibility modeling.\u003c/p\u003e\n\u003ch2\u003eC. World Risk Index and Comprehensive Risk Assessment\u003c/h2\u003e\n\u003cp\u003eThe World Risk Index (WRI), developed by the United Nations University and B\u0026uuml;ndnis Entwicklung Hilft, provides a comprehensive framework for assessing disaster risk globally [15]. The index combines exposure to natural hazards (earthquakes, cyclones, floods, droughts, sea-level rise) with social vulnerability dimensions including susceptibility, lack of coping capacities, and lack of adaptive capacities. This multi-dimensional approach enables holistic risk assessment that considers both physical hazard exposure and socioeconomic vulnerability factors.\u003c/p\u003e\n\u003cp\u003eRecent advances in disaster management emphasize the integration of AI with traditional risk assessment methodologies. The UNDRR Global Assessment Report highlights that effective disaster risk reduction requires not only accurate prediction but also transparent communication of risk factors to diverse stakeholders [15]. This necessitates the development of explainable AI systems that can provide actionable insights while maintaining high predictive accuracy across multiple risk dimensions.\u003c/p\u003e"},{"header":"III.\tDATASET DESCRIPTION AND PREPROCESSING","content":"\u003ch2\u003eA. World Risk Index Dataset\u003c/h2\u003e\n\u003cp\u003eThe World Risk Report is an annual technical publication focused on global disaster risk assessment, published in both German and English languages. The World Risk Index (WRI) identifies disaster risks associated with extreme natural events across 181 countries worldwide.\u003c/p\u003e\n\u003cp\u003eThe World Risk Index employs 27 aggregated public indicators to quantify disaster risk globally. Conceptually, the index comprises two main components: exposure to extreme natural hazards and social vulnerability of nations. Exposure analysis considers earthquakes, cyclones, floods, droughts, and climate-induced sea-level rise. Social vulnerability is decomposed into three dimensions: susceptibility to extreme natural events, lack of coping capacities, and lack of adaptive capacities. All index component values range from 0 to 100, where higher WRI scores indicate greater national disaster risk.\u003c/p\u003e\n\u003cp\u003eThe five-level risk categorization (Very Low, Low, Medium, High, Very High) follows the official World Risk Report classification scheme, which employs quintile-based thresholds validated by domain experts. Specifically, categories are defined using equal-frequency binning: Very Low (0-20th percentile), Low (20-40th), Medium (40-60th), High (60-80th), and Very High (80-100th). This approach ensures balanced class distributions while maintaining consistency with established disaster risk assessment frameworks. We adopted multi-class classification rather than regression to align with policy-relevant discrete risk levels used by disaster management authorities for resource allocation decisions.\u003c/p\u003e\n\u003cp\u003eThe dataset encompasses 11 years of data across multiple countries with the following features:\u003c/p\u003e\n\u003cp\u003e\u0026middot; Region: Geographic region name\u003c/p\u003e\n\u003cp\u003e\u0026middot; WRI (World Risk Index): Overall disaster risk score for the region\u003c/p\u003e\n\u003cp\u003e\u0026middot; Exposure: Risk exposure to natural hazards (earthquakes, hurricanes, floods, droughts, sea-level rise)\u003c/p\u003e\n\u003cp\u003e\u0026middot; Vulnerability: Vulnerability based on infrastructure, nutrition, housing conditions, and economic framework\u003c/p\u003e\n\u003cp\u003e\u0026middot; Susceptibility: Susceptibility determined by infrastructure, nutrition, housing status, and economic conditions\u003c/p\u003e\n\u003cp\u003e\u0026middot; Lack of Coping Capabilities: Coping capacity related to governance, preparedness, early warning systems, medical care, and social/material security\u003c/p\u003e\n\u003cp\u003e\u0026middot; Lack of Adaptive Capacities: Adaptive capacity concerning upcoming natural events, climate change, and future challenges\u003c/p\u003e\n\u003cp\u003e\u0026middot; Year: Temporal dimension of the data\u003c/p\u003e\n\u003cp\u003e\u0026middot; WRI Category, Exposure Category, Vulnerability Category, Susceptibility Category: Categorical classifications of risk levels (Very Low, Low, Medium, High, Very High)\u003c/p\u003e\n\u003cp\u003eFigure 1 illustrates the boxplot analysis of continuous risk scores across five categorical levels for each indicator. The World Risk Index (Figure 1a) demonstrates clear separation between risk categories, with median values progressively increasing from Very Low (approximately 2.5) to Very High (approximately 40). The Kruskal-Wallis test yielded P \u0026lt; 0.001, confirming statistically significant differences among all risk categories. Similar patterns are observed for Exposure (Figure 4b), Vulnerability (Figure 1c), and Susceptibility (Figure 1d), validating the categorical classifications used in our multi-class prediction models. The statistically significant separation between categories (Kruskal-Wallis P \u0026lt; 0.001) validates the categorical formulation, demonstrating that the quintile-based boundaries effectively distinguish distinct risk profiles rather than arbitrary divisions of continuous scores.\u003c/p\u003e"},{"header":"IV.\tEXPERIMENTS","content":"\u003cp\u003eThe 11-year temporal span of our dataset enables examination of risk evolution. Preliminary temporal analysis revealed that global risk distributions remained relatively stable during 2009-2019, with correlation coefficients between consecutive years exceeding 0.85. Our independent sampling approach is justified for this stability period, though we acknowledge that incorporating temporal features (e.g., year-over-year changes) could capture emerging trends. This represents a valuable direction for future work, particularly for dynamic early-warning systems.\u003c/p\u003e\n\u003ch2\u003eA. Multi-Class ROC Curve Analysis\u003c/h2\u003e\n\u003cp\u003eWe developed XGBoost classification models for four disaster risk indicators to predict five-level risk categories. For classification tasks, we employed stratified 5-fold cross-validation to ensure robust performance estimation across different data partitions. The reported test results are from an independent 20% holdout set not seen during cross-validation. Cross-validation yielded mean accuracies of 0.86±0.02 (WRI), 0.85±0.03 (Exposure), 0.84±0.02 (Vulnerability), and 0.87±0.02 (Susceptibility), with low standard deviations confirming model stability. The final models trained on 80% of data and evaluated on the holdout set achieved accuracies exceeding 0.85, consistent with cross-validation results, ensuring the model's generalizability across diverse risk profiles. Figure 1 presents the Receiver Operating Characteristic (ROC) curves for all four multi-class classification tasks on test set data.\u003c/p\u003e\n\u003cp\u003eFigure 2. Multi-Class ROC Curves for Disaster Risk Prediction\u003c/p\u003e\n\u003cp\u003eThe ROC curve analysis demonstrates exceptional discriminative performance across all risk categories. For Exposure classification (Figure 1a), the model achieved individual class AUC values ranging from 0.926 to 0.984, with the \"Very High\" category achieving the highest discriminative power (AUC=0.984, 95% CI=[0.972-0.991]). The stepped curve pattern reflects the multi-class nature of the predictions, where one-vs-rest binary classifications are performed for each risk level.\u003c/p\u003e\n\u003cp\u003eWRI classification (Figure 1b) exhibited similarly strong performance, with macro-averaged AUC of 0.954. The model demonstrated particularly robust performance in distinguishing extreme risk categories (Very Low and Very High), while maintaining reliable classification for intermediate risk levels. Confidence intervals remained narrow across all categories, indicating stable and consistent model predictions.\u003c/p\u003e\n\u003cp\u003eVulnerability and Susceptibility classifications (Figures 1c and 1d) achieved comparable performance levels, with macro-averaged AUC scores of 0.947 and 0.941 respectively. The consistent performance across all four indicators validates the robustness of the XGBoost framework and demonstrates its capability to capture complex, non-linear relationships between disaster risk factors and categorical outcomes.\u003c/p\u003e\n\u003cp\u003eBoth cross-validation and holdout test results demonstrated consistent performance: test accuracies exceeded 0.85, F1-scores ranged from 0.82 to 0.88, recall rates above 0.80, and AUC scores consistently surpassed 0.92. The narrow confidence intervals (95% CI width less than 0.03) and low cross-validation variance confirm model robustness across different geographic regions and temporal periods. These comprehensive metrics confirm the reliability and practical applicability of the AI-driven classification system for real-world disaster risk assessment. These comprehensive metrics confirm the reliability and practical applicability of the AI-driven classification system for real-world disaster risk assessment. To contextualize XGBoost performance, we conducted baseline comparisons with Random Forest (RF), Logistic Regression (LR), and Support Vector Machine (SVM) on the Susceptibility classification task. XGBoost achieved test accuracy of 0.87 (AUC=0.941), outperforming RF (0.83, AUC=0.916), SVM (0.79, AUC=0.893), and LR (0.74, AUC=0.871). The superiority of XGBoost is attributed to its gradient boosting framework, which effectively captures non-linear feature interactions and handles imbalanced classes through weighted learning. Training efficiency also favored XGBoost (12.7 seconds) compared to SVM (45.6 seconds), while maintaining comparable speed to Random Forest (18.4 seconds). These results confirm that ensemble methods, particularly XGBoost, are better suited for the complex, multi-dimensional nature of disaster risk data compared to traditional classifiers.\u003c/p\u003e\n\u003ch2\u003eB. SHAP-Based Interpretability Analysis\u003c/h2\u003e\n\u003cp\u003eTo enhance the transparency and trustworthiness of our AI-powered disaster prediction system, we employed SHAP (SHapley Additive exPlanations) analysis to decode the decision-making processes of the trained XGBoost models. Taking Susceptibility Category as a representative example, we conducted comprehensive interpretability analysis revealing which features most significantly influence disaster susceptibility predictions.\u003c/p\u003e\n\u003cp\u003eFigure 3. SHAP Interaction and Dependence Analysis for Susceptibility\u003c/p\u003e\n\u003cp\u003eFigure 3 presents multifaceted SHAP visualizations unveiling the complex feature interactions driving susceptibility predictions. The interaction heatmap (Figure 3a) quantifies pairwise feature interaction effects, revealing that Lack of Coping Capabilities exhibits the strongest interaction with multiple other vulnerability dimensions. This finding indicates that coping capacity deficits compound the effects of other risk factors, amplifying overall susceptibility.\u003c/p\u003e\n\u003cp\u003eThe SHAP dependence plots (Figures 3d-f) illustrate feature value relationships with model predictions, showing threshold effects and non-linear patterns. For instance, Lack of Adaptive Capacities demonstrates a clear positive correlation with susceptibility predictions (Figure 3f), where values exceeding 60 trigger substantial increases in predicted risk levels. Such insights enable policymakers to identify critical intervention points where targeted capacity-building efforts would yield maximum risk reduction benefits.\u003c/p\u003e\n\u003cp\u003eFigure 4. SHAP Summary Plots Across Risk Categories\u003c/p\u003e\n\u003cp\u003eFigure 4 presents SHAP summary plots for Susceptibility across three representative risk categories: Very Low (Figure 4, top), Medium (Figure 4, middle), and Very High (Figure 4, bottom). These visualizations reveal how feature importance patterns shift across different risk levels.\u003c/p\u003e\n\u003cp\u003eFor Very Low susceptibility regions, WRI Category and Exposure Category emerge as primary protective factors (negative SHAP values), indicating that favorable overall risk indices and limited hazard exposure effectively buffer against susceptibility. Conversely, for Very High susceptibility regions, Lack of Coping Capabilities and Lack of Adaptive Capacities dominate positive SHAP contributions, confirming that social vulnerability dimensions are the critical determinants of extreme susceptibility.\u003c/p\u003e\n\u003cp\u003eThe Medium risk category exhibits mixed feature contributions, with both exposure-related and capacity-related factors playing balanced roles. This pattern suggests that Medium-risk regions represent transitional states where interventions targeting either hazard exposure reduction or capacity enhancement could effectively mitigate susceptibility.\u003c/p\u003e\n\u003cp\u003eThe AI-driven SHAP analysis framework successfully identified interpretable patterns explaining why certain regions experience high disaster susceptibility while others remain resilient. These explainable insights transform \"black-box\" machine learning predictions into actionable intelligence for disaster risk management authorities.\u003c/p\u003e\n\u003cp\u003eThe integration of XGBoost's predictive power with SHAP's interpretability capabilities addresses the critical challenge of building trustworthy AI systems for high-stakes disaster management applications. By revealing the \"why\" behind predictions, our framework enables evidence-based policy formulation, targeted resource allocation, and transparent communication of risk assessments to diverse stakeholders.\u003c/p\u003e"},{"header":"V.\tCONCLUSION","content":"\u003cp\u003eThis study presents a comprehensive AI-enhanced disaster risk prediction framework integrating XGBoost machine learning with SHAP-based explainability analysis. Our multi-class classification system achieved superior predictive performance across four critical disaster indicators (WRI, Exposure, Vulnerability, Susceptibility), with test accuracies exceeding 0.85 and AUC scores surpassing 0.92. The SHAP interpretability analysis successfully decoded the complex decision-making processes, revealing key driving factors such as Lack of Coping Capabilities and Lack of Adaptive Capacities as primary determinants of disaster susceptibility. The explainable AI framework demonstrates that high predictive accuracy and model interpretability are not mutually exclusive objectives, providing transparent, actionable insights for policymakers and disaster management authorities. Future research directions should prioritize temporal modeling approaches such as recurrent neural networks or temporal graph networks to capture evolving risk patterns and inter-year dependencies. Additionally, integrating real-time hazard monitoring data for dynamic risk assessment, potentially using time-series focused models like Long Short-Term Memory, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning, integrating real-time hazard monitoring data for dynamic risk assessment, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning.\u003c/p\u003e\n\u003cp\u003eThis study presents a comprehensive AI-enhanced disaster risk prediction framework integrating XGBoost machine learning with SHAP-based explainability analysis. Our multi-class classification system achieved superior predictive performance across four critical disaster indicators (WRI, Exposure, Vulnerability, Susceptibility), with test accuracies exceeding 0.85 and AUC scores surpassing 0.92. The SHAP interpretability analysis successfully decoded the complex decision-making processes, revealing key driving factors such as Lack of Coping Capabilities and Lack of Adaptive Capacities as primary determinants of disaster susceptibility. The explainable AI framework demonstrates that high predictive accuracy and model interpretability are not mutually exclusive objectives, providing transparent, actionable insights for policymakers and disaster management authorities. Future research directions include incorporating temporal dynamics to capture evolving risk patterns [32], integrating real-time hazard monitoring data [27] for dynamic risk assessment, potentially using time-series focused models like Long Short-Term Memory, and extending the framework to sub-national and community-level risk predictions for enhanced spatial granularity in disaster preparedness planning.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eUnited Nations Office for Disaster Risk Reduction (UNDRR), \u0026quot;Global Assessment Report on Disaster Risk Reduction,\u0026quot; 2023.\u003c/li\u003e\n\u003cli\u003eT. P. Fowdur and R. M. Nassir-Ud-Diin Ibn Nazir, \u0026quot;A real-time collaborative machine learning based weather forecasting system with multiple predictor locations,\u0026quot; \u003cem\u003eArray\u003c/em\u003e, vol. 14, 100153, 2022.\u003c/li\u003e\n\u003cli\u003eS. Pappala, M. Malyadri, K. V. Naganjaneyulu, et al., \u0026quot;Explainable AI for healthcare professionals: Advancing risk assessment, diagnostic accuracy,\u0026quot; \u003cem\u003eWorld Journal of Biology Pharmacy and Health Sciences\u003c/em\u003e, vol. 22, no. 01, pp. 446-453, 2025.\u003c/li\u003e\n\u003cli\u003eN. T. Linh, D. T. Ruigar, L. S. Hoa, et al., \u0026quot;Flood susceptibility modeling using new ensemble approach,\u0026quot; \u003cem\u003eNatural Hazards\u003c/em\u003e, vol. 112, pp. 2319-2346, 2022.\u003c/li\u003e\n\u003cli\u003eA. Frifra, M. Maanan, M. Maanan et al., \u0026quot;Harnessing LSTM and XGBoost algorithms for storm prediction,\u0026quot; \u003cem\u003eScientific Reports\u003c/em\u003e, vol. 14, 11381, 2024. DOI: 10.1038/s41598-024-62182-0\u003c/li\u003e\n\u003cli\u003eH. Ha, Q. B. Pham, N. Tran Trung, et al., \u0026quot;Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models,\u0026quot; \u003cem\u003eNatural Hazards\u003c/em\u003e, vol. 116, pp. 1139-1160, 2023.\u003c/li\u003e\n\u003cli\u003eS. Janizadeh, N. Pal, A. Saha, et al., \u0026quot;XGBoost-based method for flash flood risk assessment,\u0026quot; \u003cem\u003eJournal of Hydrology\u003c/em\u003e, vol. 598, 2021, 126382.\u003c/li\u003e\n\u003cli\u003eN. T. Linh, D. T. Ruigar, L. S. Hoa, et al., \u0026quot;Flood susceptibility modeling using new ensemble approach,\u0026quot; \u003cem\u003eNatural Hazards\u003c/em\u003e, vol. 112, pp. 2319-2346, 2022.\u003c/li\u003e\n\u003cli\u003eH. Ha, Q. B. Pham, N. Tran Trung, et al., \u0026quot;Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models,\u0026quot; \u003cem\u003eNatural Hazards\u003c/em\u003e, vol. 116, pp. 1139-1160, 2023.\u003c/li\u003e\n\u003cli\u003eK. S. Srivalli and D. Sumanthi, \u0026quot;Enhancing Financial Risk Assessment through Explainable AI: A SHAP-Based Approach for Transparent Decision-Making,\u0026quot; \u003cem\u003eProceedings of the International Conference on Innovative Computing \u0026amp; Communication (ICICC 2024)\u003c/em\u003e, SSRN, March 2025.\u003c/li\u003e\n\u003cli\u003eS. Pappala, M. Malyadri, K. V. Naganjaneyulu, et al., \u0026quot;Explainable AI for healthcare professionals: Advancing risk assessment, diagnostic accuracy,\u0026quot; \u003cem\u003eWorld Journal of Biology Pharmacy and Health Sciences\u003c/em\u003e, vol. 22, no. 01, pp. 446-453, 2025.\u003c/li\u003e\n\u003cli\u003eN. Daif, F. Di Nunno, F. Granata, et al., \u0026quot;Forecasting maximal and minimal air temperatures using explainable machine learning: Shapley additive explanation versus local interpretable model-agnostic explanations,\u0026quot; \u003cem\u003eStochastic Environmental Research and Risk Assessment\u003c/em\u003e, vol. 39, pp. 2551-2581, 2025.\u003c/li\u003e\n\u003cli\u003eW. Chen and Y. Chen, \u0026quot;On the use of explainable AI for susceptibility modeling: Examining the spatial pattern of SHAP values,\u0026quot; \u003cem\u003eGeoscience Frontiers\u003c/em\u003e, vol. 15, no. 3, 2024.\u003c/li\u003e\n\u003cli\u003eP. M. Labis, \u0026quot;Machine Learning for Disaster Risk Reduction - Review and Research Directions,\u0026quot; \u003cem\u003eCaraga State University\u003c/em\u003e, 2024\u003c/li\u003e\n\u003cli\u003eG. Lan, H. A. Inan, S. Abdelnabi, J. Kulkarni, L. Wutschitz, R. Shokri, C. G. Brinton, and R. Sim, \u0026quot;Contextual Integrity in LLMs via Reasoning and Reinforcement Learning,\u0026quot; in \u003cem\u003eProceedings of the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS)\u003c/em\u003e, 2025.\u003c/li\u003e\n\u003cli\u003eG. Lan, S. Zhang, T. Wang, Y. Zhang, D. Zhang, X. Wei, X. Pan, H. Zhang, D.-J. Han, and C. G. Brinton, \u0026quot;MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge,\u0026quot; \u003cem\u003earXiv preprint arXiv:2507.21183\u003c/em\u003e, 2025\u003c/li\u003e\n\u003cli\u003eH. Wang, Y. Song, H. Yang, and Z. Liu, \u0026quot;Generalized Koopman Neural Operator for Data-driven Modelling of Electric Railway Pantograph-catenary Systems,\u0026quot; \u003cem\u003eIEEE Transactions on Transportation Electrification\u003c/em\u003e, pp. 1-1, 2025. doi: 10.1109/TTE.2025.3609347\u003c/li\u003e\n\u003cli\u003eH. Yang, Z. Liu, H. Cui, N. Ma, H. Wang, C. Zhang, and Y. Song, \u0026quot;An Electrified Railway Catenary Component Anomaly Detection Frame Based on Invariant Normal Region Prototype with Segment Anything Model,\u0026quot; \u003cem\u003eIEEE Transactions on Transportation Electrification\u003c/em\u003e, pp. 1-1, 2025. doi: 10.1109/TTE.2025.3628607\u003c/li\u003e\n\u003cli\u003eD. Fan, X. Zhu, Z. Xiang, Y. Lu, and L. Quan, \u0026ldquo;Dimension-Reduction Many-Objective Optimization Design of Multimode Double-Stator Permanent Magnet Motor,\u0026rdquo; IEEE Trans. Transp. Electrific., vol. 11, no. 1, pp. 1984-1994, Feb. 2025.\u003c/li\u003e\n\u003cli\u003eD. Fan, D. Miao, W. Shan, Z. Xiang and X. Zhu, \u0026ldquo;Short-Circuit Fault Demagnetization Assessment and Optimization of Double-Electrical-Port Vernier Permanent Magnet Motor,\u0026rdquo; \u003cem\u003eIEEE Trans. Ind. Appl.\u003c/em\u003e, doi: 10.1109/TIA.2025.3625885.\u003c/li\u003e\n\u003cli\u003eZ. Xiao, R. Bousselham, M. Tao, et al., \u0026quot;Machine Learning-Optimized Porous Thermally Responsive SS-PCM with Switchable Transparency for Adaptive Building Envelope Coatings,\u0026quot; \u003cem\u003eEnergy and Buildings\u003c/em\u003e, 2025.\u003c/li\u003e\n\u003cli\u003eT. Niu, T. Liu, Y. T. Luo, P. C.-I. Pang, S. Huang, and A. Xiang, \u0026quot;Decoding student cognitive abilities: a comparative study of explainable AI algorithms in educational data mining,\u0026quot; \u003cem\u003eScientific Reports\u003c/em\u003e, vol. 15, no. 1, p. 26862, 2025.\u003c/li\u003e\n\u003cli\u003eT. Yuan, X. Zhang, and X. Chen, \u0026quot;Machine learning based enterprise financial audit framework and high risk identification,\u0026quot; \u003cem\u003earXiv preprint arXiv:2507.06266\u003c/em\u003e, 2025.\u003c/li\u003e\n\u003cli\u003eY. Lu, W. Yang, Y. Zhang, Z. Chen, J. Chen, Q. Xuan, Z. Wang, and X. Yang, \u0026quot;Understanding the dynamics of DNNs using graph modularity,\u0026quot; in \u003cem\u003eEuropean Conference on Computer Vision (ECCV)\u003c/em\u003e, Springer, 2022, pp. 225\u0026ndash;242.\u003c/li\u003e\n\u003cli\u003eQ. Sun, Z. Qiu, H. Ye, and Z. Wan, \u0026quot;Multinational Corporation Location Plan under Multiple Factors,\u0026quot; in \u003cem\u003eJournal of Physics: Conference Series\u003c/em\u003e, vol. 1168, no. 3, p. 032012, IOP Publishing, 2019. doi: 10.1088/1742-6596/1168/3/032012.\u003c/li\u003e\n\u003cli\u003eJ. Zhang, W. Zhang, C. Tan, X. Li, and Q. Sun, \u0026quot;YOLO-PPA based efficient traffic sign detection for cruise control in autonomous driving,\u0026quot; \u003cem\u003earXiv preprint arXiv:2409.03320\u003c/em\u003e, 2024.\u003c/li\u003e\n\u003cli\u003eH. Tao, J. Li, Z. Hua, and F. Zhang, \u0026quot;DUDB: deep unfolding-based dual-branch feature fusion network for pan-sharpening remote sensing images,\u0026quot; \u003cem\u003eIEEE Transactions on Geoscience and Remote Sensing\u003c/em\u003e, vol. 62, pp. 1-17, 2023.\u003c/li\u003e\n\u003cli\u003eY. Wang, H. Wang, and F. Zhang, \u0026quot;A Medical image segmentation model with auto-dynamic convolution and location attention mechanism,\u0026quot; \u003cem\u003eComputer Methods and Programs in Biomedicine\u003c/em\u003e, vol. 261, p. 108593, 2025.\u003c/li\u003e\n\u003cli\u003eF. Zhang, G. Chen, H. Wang, and C. Zhang, \u0026quot;CF-DAN: Facial-expression recognition based on cross-fusion dual-attention network,\u0026quot; \u003cem\u003eComputational Visual Media\u003c/em\u003e, vol. 10, no. 3, pp. 593-608, 2024.\u003c/li\u003e\n\u003cli\u003eL. Jiang, X. Wang, F. Zhang, and C. Zhang, \u0026quot;Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers,\u0026quot; \u003cem\u003eThe Visual Computer\u003c/em\u003e, pp. 1-12, 2025.\u003c/li\u003e\n\u003cli\u003eS. Zhou, X. Zhang, X. Chu, B. Zhang, Z. Zhao, and X. Lu, \u0026quot;FastPillars: A Deployment-friendly Pillar-based 3D Detector,\u0026quot; \u003cem\u003eIEEE Transactions on Circuits and Systems for Video Technology\u003c/em\u003e, 2025. doi: 10.1109/TCSVT.2025.3633725.\u003c/li\u003e\n\u003cli\u003eS. Zhou, J. Nie, Z. Zhao, Y. Cao, and X. Lu, \u0026quot;FocusTrack: One-Stage Focus-and-Suppress Framework for 3D Point Cloud Object Tracking,\u0026quot; in \u003cem\u003eProceedings of the 33rd ACM International Conference on Multimedia\u003c/em\u003e, New York, NY, USA, 2025, pp. 7366\u0026ndash;7375.\u003c/li\u003e\n\u003cli\u003eC. Wu, H. Huang, L. Zhang, J. Chen, Y. Tong, and M. Zhou, \u0026quot;Towards automated 3D evaluation of water leakage on a tunnel face via improved GAN and self-attention DL model,\u0026quot; \u003cem\u003eTunnelling and Underground Space Technology\u003c/em\u003e, vol. 142, p. 105401, 2023.\u003c/li\u003e\n\u003cli\u003eC. Wu, H. Huang, and Y. Q. Ni, \u0026quot;Evaluation of Tunnel Rock Mass Integrity Using Multi-Modal Data and Generative Large Model: Tunnel Rip-Gpt,\u0026quot; \u003cem\u003eSSRN Electronic Journal\u003c/em\u003e, 2025. Available at SSRN: https://ssrn.com/abstract=5348429\u003c/li\u003e\n\u003cli\u003eZ. Zhou, \u0026quot;BEYOND CHAT: A FRAMEWORK FOR LLMS AS HUMAN-CENTERED SUPPORT SYSTEMS,\u0026quot; in \u003cem\u003eComputer Science \u0026amp; Information Technology (CS \u0026amp; IT)\u003c/em\u003e, 2025, pp. 271\u0026ndash;289. doi: 10.5121/csit.2025.151721.\u003c/li\u003e\n\u003cli\u003eM. Hu, J. Wang, W. Zhao, Q. Zeng, and L. Luo, \u0026quot;FlowMalTrans: Unsupervised Binary Code Translation for Malware Detection Using Flow-Adapter Architecture,\u0026quot; in \u003cem\u003eFindings of the Association for Computational Linguistics: EMNLP 2025\u003c/em\u003e, 2025, pp. 3251\u0026ndash;3272.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Bond University","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Explainable AI, SHAP Analysis, Disaster Risk Prediction, XGBoost, Multi-class Classification","lastPublishedDoi":"10.21203/rs.3.rs-8437180/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8437180/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eNatural disasters pose significant threats to global communities, necessitating advanced predictive frameworks for effective risk assessment and management. This study presents an AI-driven disaster risk prediction system integrating XGBoost machine learning with SHAP (SHapley Additive exPlanations) interpretability analysis. Using the World Risk Index dataset spanning 11 years across 181 countries, we developed multi-class classification models for four key risk indicators: World Risk Index (WRI), Exposure, Vulnerability, and Susceptibility. The XGBoost classifier achieved test accuracies exceeding 0.85 across all categories, with macro-averaged AUC scores ranging from 0.92 to 0.96. SHAP analysis revealed critical driving factors influencing disaster susceptibility, demonstrating the interpretability of AI-powered predictions. Our explainable AI framework provides transparent, actionable insights for policymakers and disaster management authorities, bridging the gap between predictive accuracy and decision-making transparency in global risk assessment.\u003c/p\u003e","manuscriptTitle":"AI-Enhanced Disaster Risk Prediction with Explainable SHAP Analysis: A Multi-Class Classification Approach Using XGBoost","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-12-31 06:31:53","doi":"10.21203/rs.3.rs-8437180/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"b9ca4f3d-b07b-45de-8cc3-674d06947a11","owner":[],"postedDate":"December 31st, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":60144168,"name":"Artificial Intelligence and Machine Learning"}],"tags":[],"updatedAt":"2025-12-31T06:31:53+00:00","versionOfRecord":[],"versionCreatedAt":"2025-12-31 06:31:53","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8437180","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8437180","identity":"rs-8437180","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.