LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation

doi:10.21203/rs.3.rs-8554813/v1

LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation

2026 · doi:10.21203/rs.3.rs-8554813/v1

preprint OA: closed

Full text JSON View at publisher

Full text 107,156 characters · extracted from preprint-html · click to expand

LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation Filippo Lococo, Carolina Sassorossi, Davide Dalfovo, Annalisa Campanella, and 17 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8554813/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 11 You are reading this latest preprint version Abstract Introduction : Artificial intelligence(AI) techniques may combine various omics datasets to create more accurate predictive models for lung cancer patients management. The aim of this study from the LANTERN project, is to develop an AI-based predictive model of post-operative complications after lung resection for NSCLC. Methods In the framework of LANTERN Consortium we prospectively collected data from 3/2023 to 12/2024 of patients who underwent curative surgery for Stage I-III NSCLC and were herein analyzed considering 80 preoperative clinical features and 43 spirometric variables to predict the occurrence of post-operative complications. Prediction models were developed on the basis of different feature selection algorithms and machine-learning models. An external dataset composed by a surgical series of 232 patients was used for validation. Results The final analysis was conducted on 231 surgical patients. Post-operative complications were observed in 37 patients (16%). AI-based models showed that pathologic score (AUC = 0.72, 95% CI [0.62–0.81]) and pre-opFEV1_TEOR (AUC = 0.77, CI [0.67–0.89]) was the most relevant variables. Testing the models on the external dataset, while the predictive value of Pathologic score alone was reduced, pre-opFEV1_TEOR and the combination of Pathologic score and pre-opFEV1_TEOR were confirmed to predict post-op outcome. Postoperative risk rises by about 6% per pathologic score level and 11–12% for each 10% decrease in FEV1_TEOR. Conclusions Combining the FEV1_TEOR and the pathologic score permits to predict the complications risk in significant way. This model could be tested in a further prospective cohort of patients to verify its effectiveness in order to improve the perioperative management of lung resection candidates. NSCLC AI post operative complications personalized medicine Figures Figure 1 Figure 2 Figure 3 Figure 4 INTRODUCTION The application of artificial intelligence (AI) and machine learning (ML) into lung cancer treatment has shown potential in improving diagnostic accuracy, treatment personalization, and patient outcomes [ 1 ]. AI applications in thoracic surgery is revealing the potential to improve surgical precision, patient safety, and overall care efficiency [ 2 ]. Despite remarkable advances in lung cancer management, postoperative complications continue to represent a major determinant of outcomes, as they may increase morbidity and mortality. In the era of novel multimodal strategies for resectable NSCLC, which may also affect the complexity of surgical procedures, the identification of patient subgroups at higher risk of postoperative complications is of paramount importance. Such stratification could enable the implementation of personalized perioperative strategies aimed at improving overall outcomes. Under this point of view, Advances in precision medicine offer hope to improve also surgical outcomes [ 3 ]. Previous studies investigated the possibility to individuate high risk patient in different setting (5), considering also the radiological or pre-operative features and using AI based models (5,6) In this frame, the LANTERN (Lung cancer multi-omics digital human avatars for integrating precision medicine) project [ 4 ] arises, with the aims to deliver a novel approach for comprehensive lung cancer decision making solutions, based on predictive digital platforms powered by the integration multi-omics data. The current study, namely LANTERN1, has been led with the aim of creating a machine learning algorithm, able to analyze all the pre operative features of patients candidate to thoracic surgery, to predict the post operative complication risk. In literature, different studies have been published with the aim of predicting post operative complications after non cardiac thoracic surgery. For example, He and coworkers [ 5 ] analysed a ICU (intensive care unit) cohort, to predict in particular respiratory complications and they used univariate and multilogistic regression to perform the analysis. Another interesting experience, from Nonomura and coworkers [ 6 ] tries to predict the risk of pulmonary complications on the base of pre operative radiological features. A very interesting experience is from Salati et al [ 7 ]. They already proposed an AI based model able to define the risk of cardiac and pulmonary complications in the early postoperative period for patients submitted to anatomic lung resection. By the way, as reported by the authors, their experience is a monocentric one and they analyzed the risk on the base of 50 preoperative features. Our study considered more than 100 variables and comprises an external cohort to validate the results. To the best of our knowledge, our analysis would be the first one with these characteristics, with the aim of predicting post operative complications, in patients candidate to surgery for NSCLC, with AI algorithms. This would improve the process of risk definition and therapy planning for patients undergoing lung surgery With this study, we aimed to develop an AI based model for complication risk in a large cohort of patients who underwent curative surgery for NSCLC. METHODS Ethics The protocol of the study has been drawn up in accordance with the Standards of Good Clinical Practice of the European Union and the current review of the Helsinki Declaration and has been approved by the Ethics Committee of this structure. All methods will be carried out in accordance with relevant guidelines and regulations. The informed consent has been obtained from all subjects and/or their legal guardian(s). Approval of Fondazione Policlinico Universitario Agostino Gemelli IRCCS – Università Cattolica del Sacro Cuore Ethics Committee Number: 5420 − 0002485/23; Trial registration - clinicaltrial.gov: NCT05802771. Study Design and Population This study was designed as a prospective, multicenter observational trial followed by external validation, in accordance with the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis) guidelines [ 8 ]. Data for 337 patients were collected prospectively between March 2023 and December 2024 within the framework of the LANTERN consortium [ 4 ]. Of these, 231 patients with Stage I-III Non-Small Cell Lung Cancer (NSCLC) underwent curative-intent surgery. Spirometry was performed for 176 of these patients (Fig. 1 ). To assess the generalizability of the final model, we used an independent, retrospective surgical cohort of 232 patients collected from a separate cancer research center.. Data Acquisition and Endpoints We collected and categorized a comprehensive set of preoperative variables. The clinical dataset included 80 variables, covering among other things, patient demographics, clinical characteristics, and a panel of comorbidities (see Supplementary Table 1A). The second part of the dataset consisted of 43 spirometry variables derived from preoperative pulmonary function tests, including but not limited to Forced Expiratory Volume in one second (FEV1), Diffusing Capacity of the Lungs for Carbon Monoxide (DLCO), and other derived flow and volume parameters (see Supplementary Table 1B). The primary endpoint was the occurrence of any postoperative complication, graded according to the Clavien-Dindo classification [ 9 ]. We considered 20 adverse events as defined as Fernandez and coworkers reported [ 10 ] : air leakage, pleural effusion, subcutaneous emphysema, pneumonia, arrhythmia, hemorrhage, sepsis, wound dehiscence, abdominal collection, anemia, acute myocardial ischemia, deep venous thrombosis, fistula, infected wound, occlusion, pulmonary embolism, renal failure, infection of the urinary tract. A further sub-analysis was performed to predict specific complications, grouped as cardiovascular events (defined as acute myocardial ischemia, arrhythmia, deep venous thrombosis pulmonary embolism) and respiratory events (defined as pleural effusion, air leakage, subcutaneous emphysema, pneumonia). The cardiopulmonary as well as respiratory complications were analyzed as a binary outcome (complicated vs not-complicated patient). Only complications occurred within the first 30 days after the operation, or later if experienced within the same postoperative hospital stay were included in the present analysis. Before proceeding with the AI analysis, the database underwent a preprocessing with specific data quality assessment procedures for optimizing data completeness (reported in Supplementary Table 1A and 1B) and consistency. Feature Selection, Model Optimization and Evaluation To identify preoperative predictors, we designed and implemented a machine learning workflow. First, we performed a feature selection protocol embedded within a 20-repetition bootstrapping framework to ensure the stability of feature rankings. Three distinct algorithms were applied: Minimum Redundancy Maximum Relevance (mRMR) [ 11 ], Lasso logistic regression [ 12 ], and Elastic-Net logistic regression [ 13 ]. Feature importance was aggregated across all repetitions to generate a stable ranked list, and the algorithm demonstrating the highest average Area Under the Receiver Operating Characteristic (AUROC) curve was chosen as the optimal method (Supplementary Figs. 1A and 1B). Using the best ranked feature list, we then identified the optimal combination of signature length and machine learning model. We evaluated four supervised learning algorithms: logistic regression (LR), random forest (RF) [ 14 ], a support-vector machine (SVM) with a radial basis function (RBF) kernel [ 15 ], and eXtreme Gradient Boosting (XGBoost) [ 16 ]. For each algorithm, we incrementally constructed feature signatures of increasing length, from 1 (the most important feature) to 10 (the top 10 most important features). The performance of each algorithm and signature length combination was evaluated using a 5-repetition, 3-fold cross-validation scheme on the training data. Model-specific hyperparameters were tuned to maximize performance. The configuration that maximized the mean AUROC across all cross-validation folds was selected as optimal. Finally, predictive models were trained on the entire training dataset using the optimal learner and feature signature length. This process was performed using clinical dataset only, spirometry dataset only, and integrating top features from both datasets. The performance of these final models was then assessed on the independent, external validation cohort by calculating the AUROC with 95% confidence intervals. All data processing, statistical analysis, and machine learning model development were conducted using R (version 4.5.1). The feature ranking, signature determination, and final model development were supported by the “Fully Automated Machine Learning with Interpretable Analysis of Results” (FAMILIAR) framework [ 17 ]. The initial step in our modeling process was to identify the most stable and performant feature selection algorithm. We evaluated three distinct methods within a 20-repetition bootstrapping framework. The Elastic-Net algorithm consistently achieved the highest average Area Under the Receiver Operating Characteristic (AUROC) curve for both the clinical and spirometric datasets (Supplementary Fig. 1A and 1B). RESULTS Patient Characteristics and Outcomes A total of 231 patients who underwent curative-intent surgery for NSCLC were included in the training cohort. The baseline clinical and pathological characteristics of this cohort are summarized in Table 1 . The mean age was 68.6 years (SD ± 9.5), and the majority of patients were former or current smokers (184 patients, 79.7%). Postoperative complications of any grade occurred in 37 patients, corresponding to an overall complication rate of 16.0%. A total of 48 complications occurred in these 37 patients, as nine patients experienced two complications and one patient experienced three. Among the patients with complications, the majority experienced low-grade events (Clavien-Dindo grade I-II: 23 patients, 63.9%), with a mean time from surgery to complication onset of 7 days (SD ± 13). The characteristics of the complications are summarized in Fig. 2 and Table 2 . Model Development and Internal Performance To identify the most stable and performant feature selection algorithm, we compared the performance of three methods. The Elastic-Net algorithm consistently achieved the highest average Area Under the Receiver Operating Characteristic (AUROC) curve for both the clinical and spirometric datasets (Supplementary Fig. 1A and 1B) and was therefore selected to generate the final feature importance rankings.Based on these Elastic-Net rankings (Supplementary Figs. 2 and 3), the machine learning workflow identified the Charlson Comorbidity index (also defines as Pathologic Score in text (see supplementary Table 1)) (Fig. 3 A) and the ratio of preoperative Forced Expiratory Volume in one second to its theoretical value (FEV1_TEOR) (Fig. 3 B) as the most influential variables for predicting postoperative complications. This was determined by evaluating four different machine learning models across various feature set sizes (signature lengths). The tables in Figs. 3 A and 3 B report the peak performance for each model. The selection of these features is the result of a cross-validated hyperparameter search that included optimizing the signature length for four distinct machine learning algorithms. The tables report the optimal cross-validated AUROC for each algorithm. A logistic regression model with a signature length of one demonstrated the highest predictive power for both clinical (AUROC of 0.674) and spirometric (AUROC of 0.738) datasets. When performed on the entire training cohort, a model based on the pathologic score alone achieved an AUROC of 0.72 (95% CI [0.62–0.81]), while a model using only preoperative FEV1_TEOR demonstrated superior predictive power with an AUROC of 0.77 (95% CI [0.67–0.89]). By integrating these two key features, the combined model produced the highest performance on the training data, achieving an AUROC of 0.82 (95% CI [0.72–0.92]), indicating a strong predictive signal within the development dataset (Fig. 3 C). To facilitate clinical application, this final two-feature model was translated into a nomogram, providing a graphical tool to estimate the probability of postoperative complications (Fig. 3 D). When fixing the pathologic score at 5, changes in FEV1_TEOR are associated with significant variations in postoperative risk: a value of 75% corresponds to an estimated risk of 26.4%, 65% to 37.0%, and 55% to 49.1%. Conversely, when fixing FEV1_TEOR at 65%, variations in the pathologic score also show a clear trend: a score of 4 corresponds to a 31.0% risk, 5 to 37.0%, and 6 to 43.3%. These findings suggest an approximate 6% increase in postoperative risk for each incremental step in pathologic score, and an increase of about 11–12% for every 10% decrease in FEV1_TEOR. External Validation of Predictive Models When the trained models were evaluated on the independent external validation cohort, their performance varied. The model based only on the pathologic score failed to generalize, showing no predictive ability beyond random chance (AUROC = 0.50, 95% CI [0.43–0.58]). In contrast, the preoperative FEV1_TEOR was confirmed as a robust and generalizable predictor (Fig. 4 A), with the spirometry-only model achieving an AUROC of 0.72 (95% CI [0.66–0.79]) on the external dataset. The combined model, integrating both pathologic score and FEV1_TEOR, also demonstrated valid predictive performance (Fig. 4 B), with an AUROC of 0.68 (95% CI [0.61–0.75]), although this was slightly lower than the performance of the FEV1_TEOR-only model. Analysis of Specific Complication Types To investigate the prediction of specific adverse events, the modeling pipeline was re-applied to develop distinct models for cardiovascular and respiratory complications. Using the previously identified optimal features, we trained and validated models specifically for these endpoints. In the external validation cohort, similar to the overall complication analysis, the pathologic score alone showed no predictive power for these specific endpoints. Whereas, model using only preoperative FEV1_TEOR (Supplementary Fig. 4A) proved to be effective for predicting both cardiovascular events (AUROC = 0.76, 95% CI [0.64–0.87]) and respiratory events (AUROC = 0.69, 95% CI [0.61–0.77]). Finally, the combined model, incorporating both FEV1_TEOR and pathologic score (Supplementary Fig. 4B), also remained predictive for both sub-endpoints, though its performance was somewhat attenuated, achieving an AUROC of 0.69 (95% CI [0.56–0.81]) for cardiovascular and 0.62 (95% CI [0.53–0.71]) for respiratory complications. To facilitate clinical use, nomograms derived from these combined models are provided for estimating the probability of cardiovascular and respiratory events (Supplementary Fig. 4C). DISCUSSION The present analysis, based on LANTERN data, focused on a still not well investigated topic, the prediction of post operative complications in patients candidate to thoracic surgery for NSCLC with curative intent with ML algorithms. The LANTERN project was ideated [ 4 ] with the purpose to create personalized models of lung cancer treatment through the collection of a great amount of patients data (big data) in digitalized human avatar (DHA). Using an artificial intelligence–based approach, we were able to include a large number of variables (over one hundred), which are typically excluded from analyses relying on traditional statistical methods due to limited data collection or intrinsic methodological constraints. Only few studies already described the AI application for post-operative complication prediction. Matar and coworkers [ 18 ], trained a machine learning model to create a nomogram to predict osteonecrosis after fibula free flap reconstruction in oral cancer patients. In their model, pre-operative radiation therapy, post-operative wound infection, plate exposure, and surgical re-exploration were the most influential features in model predictions. Another interesting experience comes from Wang and colleagues [ 19 ]. They focused on a machine learning-based model to predict of gastroparesis risk following complete mesocolic excision. They found this complication to be associated with advanced age, prolonged surgeries, extensive intraoperative blood loss, surgical techniques, low serum protein levels, anaemia, diabetes, and hypothyroidism and the strength of their results stays in the fact that they validated their results with an internal validation. Also in pediatric surgery ML algorithms have been studied. Liu et al. [ 20 ] presented an integrated machine learning model delayed graft function prediction in pediatric renal transplantation from deceased donors. While the aforementioned studies successfully apply specific ML algorithms, our work highlights the value of a structured machine learning workflow. We systematically tested combinations of feature selection techniques and ML learners to ensure the final model represents the most robust and predictive configuration for identifying patients at risk of postoperative complications. Concerning instead the prediction of complication specifically in thoracic surgery, some studied formulated analysis based on traditional statistics to create nomograms. For example, Wang et al. [ 21 ] created indeed a nomogram, based on four objective and easily assessed factors. It demonstrates excellent predictive performance for pediatric postoperative pulmonary complications after one-lung ventilation, enabling early risk assessment and targeted interventions to improve patient outcomes. Zhou and colleagues [ 22 ] led a prospective bicentric study for the elderly patients submitted to thoracic surgery. They developed and validated of a nomogram for predicting postoperative pulmonary complications in this cohort of patients. This nomogram could identify patients at risk for complication within 30 days. Liu and coworkers developed a very similar experience [ 25 ] creating a nomogram to predict complications in older patients. They identified preoperative presence of chronic obstructive pulmonary disease (COPD), elevated leukocyte count, higher partial pressure of arterial carbon dioxide (PaCO2) level, surgical site, thoracotomy, intraoperative hypotension, blood loss > 100 mL, surgery duration > 180 min, and malignant tumor to be good prognosticator of complications. A very interesting experience comes from Shixin and colleagues [ 24 ], who focused their analysis on post operative complication prediction in patients surgically treated for NSCLC. The population of interest is the same we considered but they developed a nomogram through traditional statistic approach. Taking into account the prediction of complications in thoracic surgery for NSCLC, with the application of AI, to the best of our knowledge, only 2 experiences are reported. A similar analysis to ours, was reported by Salati et al. [ 7 ], but they considered patients submitted to surgery for both malignant and non-malignant conditions and analysed a lower number of variables (n = 50). Furthermore, their one was a retrospective experience, instead ours is a prospective one with the validation on an external cohort. Another one comes from Wand and colleagues [ 25 ], who considered patients operated for NSCLC, but then took into account for the analysis, only pneumonectomies. They applied ML models, as multiple machine-learning models (Logistic regression, eXtreme Gradient Boosting, Random forest, Light Gradient Boosting Machine and Naïve Bayes). Multivariate logistic regression analysis revealed that age, surgery duration, intraoperative intercostal nerve block, postoperative patient-controlled analgesia, bronchial blocker use and sufentanil use were independent predictors of cardiovascular and neurologic complications. Our analysis brings up some novelties if compared to literature. To our knowledge, this is the first study to evaluate a cohort of patients surgically treated for NSCLC with curative intent, encompassing all types of resections performed in this setting—both anatomical and non-anatomical—according to the best available evidence. Furthermore, one interesting aspect, is that more than a hundred variables were considered, of which nearly a half about spirometric parameters. Our results show the weight of the variables in predicting overall complications and the different type of complication with several clinical implications. In particular, pre operative FEV1_TEOR and the combination of Charlson comorbity index and pre operative FEV1_TEOR are the most powerlful factors in predicting post operative complications both in our series and in the external cohort, even if in this cohort with a lower AUROC. Charlson comorbity index alone could predict post operative complication only in our cohort. These results were also confirmed when analysing in particular cardiovascular and respiratory complications. The external cohort used for validation consisted of surgical patients, and therefore the indications for surgery were comparable to those applied at our center. However, the patient populations differed, as did the distribution of lobectomies and segmentectomies and the overall rate of postoperative complications. This represents a strength of our study, since the reproducibility of the results in an independent external cohort supports the robustness of the proposed model and demonstrates that it can be trained on heterogeneous patient populations while maintaining satisfactory performance. The observed differences between the two cohorts may be partly attributable to the ongoing process of data reporting within the dataset, which relies on human input and may introduce variability related to the subjective assessment of complications. Indeed, a well-recognized limitation of AI-based analyses is the challenge of reproducibility, as data codification, interpretation, and reporting are inherently dependent on human oversight and may vary across centers, even when the ontology and definitions are prospectively standardized. In addition, population characteristics may differ between institutions, despite our efforts to select cohorts with comparable features in terms of ethnicity, lifestyle habits, and geographical background. From this perspective, the implementation of cloud-based infrastructures enabling continuous and automated data updating will be of paramount importance, as it would allow models to be continuously retrained, thereby reducing variability, improving reproducibility, and ultimately enhancing their generalizability.In light of the considerations outlined above, the proposed model is not yet ready for routine clinical implementation. In accordance with the objectives of the LANTERN project, prospective validation in our patient cohort will be required once enrollment is completed. The use of a machine learning approach enabled the integration of a broad range of variables that are frequently excluded from conventional statistical analyses because of data quality issues or methodological limitations. This represents a clear advantage over traditional regression models, as it allows the simultaneous evaluation of multiple clinical features that have been insufficiently explored in relation to postoperative complications, largely due to the constraint of limiting the number of covariates in standard multivariate analyses. The ultimate goal is the development of an online application (e.g., for smartphones and tablets, already widely adopted in clinical practice) capable of estimating the overall risk of postoperative complications, as well as the specific risk of cardiovascular and respiratory events, based on patient data, including at least the most clinically relevant variables. In the near future, such a tool could support personalized perioperative management by enabling targeted preventive strategies. For instance, a predicted increased risk of postoperative atrial fibrillation could prompt the initiation of preoperative beta-blocker therapy. More broadly, postoperative intensive care unit (ICU) utilization could be anticipated according to the estimated surgical risk, potentially reducing both healthcare costs and ICU-related complications. Although cardiovascular and respiratory complication categories encompass events requiring different clinical interventions (e.g., antibiotic therapy for pneumonia versus surgical sealants for air leaks), identifying patients at increased risk for a specific complication group may heighten clinical awareness and facilitate the implementation of tailored preventive measures, such as prophylactic sealant use, early antibiotic treatment, or intensified respiratory physiotherapy. Overall, this modeling approach has the potential to enhance clinical practice by informing training strategies, optimizing diagnostic and therapeutic decision-making, supporting interventional planning, and improving patient monitoring and follow-up. CONCLUSIONS In the field of AI application in surgery, the implementation of a model including preoperative data, in particular, pre-operative FEV1_TEOR and the pathologic score permits to predict post-operative complications, selecting ad hoc strategies for patients management. Declarations The study was led in accordance to Helsinki Declaration Ethics approval and consent to participate : The informed consent has been obtained from all subjects and/or their legal guardian(s). Approval of Fondazione Policlinico Universitario Agostino Gemelli IRCCS – Università Cattolica del Sacro Cuore Ethics Committee Number: 5420 − 0002485/23; Trial registration - clinicaltrial.gov: NCT05802771. Availability of data and materials : The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request Consent for publication : not applicable Funding: This project was supported by the Ministry of Health under the frame of ERA PerMed JTC2022 Authors' contributions : FL:Conceptualization, ; visualization: writing-review and editing, surpervision, methodology CS: Conceptualization, writing- original draft preparation, visualization, writing-review and editing , DD and AC: data curation and formal analsysis LB, FG. MB, VP, AB, ET, SL, RA, NF, EO, EM: resources, MC: visualization, Alessandra Cancellieri: resources, investigation RT: resources, EB: resources, validation AG: supersivion, validation, Stefano Margaritora: supervision Acknowledgements : none References Bonci EA, Bandura A, Dooley A, Erjan A, Gebreslase HW, Hategan M, Khanduja D, Lai E, Lescaie A, Nitescu GV, Ramalho S, Thiha A, Bertolaccini L. Artificial intelligence in NSCLC management for revolutionizing diagnosis, prognosis, and treatment optimization: A systematic review. Crit Rev Oncol Hematol. 2025;216:104929. Epub ahead of print. PMID: 40967442. Aleem MU, Khan JA, Younes A, Sabbah BN, Saleh W, Migliore M. Enhancing Thoracic Surgery with AI: A Review of Current Practices and Emerging Trends. Curr Oncol. 2024;31(10):6232–44. 10.3390/curroncol31100464 . PMID: 39451768; PMCID: PMC11506543. Lococo F, Ghaly G, Chiappetta M, Flamini S, Evangelista J, Bria E, Stefani A, Vita E, Martino A, Boldrini L, Sassorossi C, Campanella A, Margaritora S, Mohammed A. Implementation of Artificial Intelligence in Personalized Prognostic Assessment of Lung Cancer: A Narrative Review. Cancers (Basel). 2024;16(10):1832. 10.3390/cancers16101832 . PMID: 38791910; PMCID: PMC11119930. Lococo F, Boldrini L, Diepriye CD, Evangelista J, Nero C, Flamini S, Minucci A, De Paolis E, Vita E, Cesario A, Annunziata S, Calcagni ML, Chiappetta M, Cancellieri A, Larici AR, Cicchetti G, Troost EGC, Ádány R, Farré N, Öztürk E, Van Doorne D, Leoncini F, Urbani A, Trisolini R, Bria E, Giordano A, Rindi G, Sala E, Tortora G, Valentini V, Boccia S, Margaritora S, Scambia G. Lung cancer multi-omics digital human avatars for integrating precision medicine into clinical practice: the LANTERN study. BMC Cancer. 2023;23(1):540. doi: 10.1186/s12885-023-10997-x. Erratum in: BMC Cancer. 2023;23(1):1082. 10.1186/s12885-023-11606-7 . PMID: 37312079; PMCID: PMC10262371. He X, Dong M, Xiong H, Zhu Y, Ping F, Wang B, Kang Y. Prediction models for postoperative pulmonary complications in intensive care unit patients after noncardiac thoracic surgery. BMC Pulm Med. 2024;24(1):420. 10.1186/s12890-024-03153-z . PMID: 39210309; PMCID: PMC11360767. Nonomura R, Tabata T, Sasaki T, Mitomo H, Ishibashi N, Sugawara T, Metoki H. Prediction of Postoperative Respiratory Complications after Lobectomy in Lung Cancer Patients with COPD by Quantitative Image Analysis: A Historical Cohort Study. Ann Thorac Cardiovasc Surg. 2022;28(6):411–9. 10.5761/atcs.oa.22-00133 . Epub 2022 Oct 7. PMID: 36216576; PMCID: PMC9763719. Salati M, Migliorelli L, Moccia S, Andolfi M, Roncon A, Guiducci GM, Xiumè F, Tiberi M, Frontoni E, Refai M. World J Surg. 2021;45(5):1585–94. 10.1007/s00268-020-05948-7 . Epub 2021 Feb 16. PMID: 33594578. A Machine Learning Approach for Postoperative Outcome Prediction: Surgical Data Science Application in a Thoracic Surgery Setting. Collins GS, Reitsma JB, Altman DG, Moons K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med. 2015;13:1. Dindo D, Demartines N, Clavien P-A. Classification of Surgical Complications: A New Proposal With Evaluation in a Cohort of 6336 Patients and Results of a Survey. Ann Surg. 2004;240:205–13. Fernandez FG, Falcoz PE, Kozower BD, et al. The societyof thoracic surgeons and the european society of thoracic sur-geons general thoracic surgery databases: joint standardization ofvariables definitions and terminology. Ann Thorac Surg. 2015;99:368–76. Peng H, Long F, Ding C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. 2005;27:1226–38. Tibshirani R. Regression Shrinkage and Selection Via the Lasso. J R Stat Soc Ser B Stat Methodol. 1996;58:267–88. Zou H, Hastie T. Regularization and Variable Selection Via the Elastic Net. J R Stat Soc Ser B Stat Methodol. 2005;67:301–20. Tin Kam Ho. Random decision forests. in Proceedings of 3rd International Conference on Document Analysis and Recognition vol. 1 278–282. (IEEE Comput. Soc. Press, Montreal, Que., Canada, 1995). Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273–97. Chen T, Guestrin C, XGBoost:. A Scalable Tree Boosting System. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794ACM, San Francisco California USA, (2016). 10.1145/2939672.2939785 Zwanenburg A, Löck S. familiar: End-to-End Automated Machine Learning and Model Evaluation. 1.5.0 https://doi.org/10.32614/CRAN.package.familiar (2022). Matar DY, Mackert GA, Matar AY, Chen AC, Panayi AC, Knoedler L, Knoedler S, Yang R, Mady LJ, Kao HK. Artificial Intelligence Outperforms a Nomogram for Osteoradionecrosis Prognostication Following Fibula Free Flap Reconstruction in Oral Cancer Patients. J Stomatol Oral Maxillofac Surg. 2025 Oct 1:102584. doi: 10.1016/j.jormas.2025.102584. Epub ahead of print. PMID: 41043770. Wang W, Yan Z, Zhang Z, Zhang Q, Jia Y. Machine learning-based prediction of gastroparesis risk following complete mesocolic excision. Discov Oncol. 2024;15(1):483. 10.1007/s12672-024-01355-9 . PMID: 39331201; PMCID: PMC11436699. Liu XY, Feng RT, Feng WX, Jiang WW, Chen JA, Zhong GL, Chen CW, Li ZJ, Zeng JD, Liu D, Zhou S, Hu JM, Liao GR, Liao J, Guo ZF, Li YZ, Yang SQ, Li SC, Chen H, Guo Y, Li M, Fan LP, Yan HY, Chen JR, Li LY, Liu YG. An integrated machine learning model enhances delayed graft function prediction in pediatric renal transplantation from deceased donors. BMC Med. 2024;22(1):407. 10.1186/s12916-024-03624-4 . PMID: 39304842; PMCID: PMC11415997. Wang L, Xiao T, Du Z, Chen T, Pei D, Qu S. Development and validation of a nomogram to pediatric postoperative pulmonary complications following thoracic surgery. BMC Anesthesiol. 2025;25(1):255. 10.1186/s12871-025-03122-x . PMID: 40399778; PMCID: PMC12096645. Zhou Y, Wang H, Lu D, Jiang T, Huang Z, Wang F, Yao Y, Gu Y, Wei W. Development and validation of a nomogram for predicting postoperative pulmonary complications in older patients undergoing noncardiac thoracic surgery: a prospective, bicentric cohort study. BMC Geriatr. 2025;25(1):169. 10.1186/s12877-025-05791-2 . PMID: 40082767; PMCID: PMC11905546. Liu J, Xue D, Wang L, Li Y, Liu L, Liao G, Cao J, Liu Y, Lou J, Li H, Yang Y, Mi W, Fu Q. Development and validation of a nomogram for predicting pulmonary complications in elderly patients undergoing thoracic surgery. Aging Clin Exp Res. 2024;36(1):197. 10.1007/s40520-024-02844-1 . PMID: 39368046; PMCID: PMC11455794. Ma S, Li F, Li J, Wang L, Song H. Risk factor analysis and nomogram prediction model construction of postoperative complications of thoracoscopic non-small cell lung cancer. J Thorac Dis. 2024;16(6):3655–67. 10.21037/jtd-24-113 . Epub 2024 Jun 12. PMID: 38983183; PMCID: PMC11228728. Wang Y, Xie S, Liu J, Wang H, Yu J, Li W, Guan A, Xu S, Cui Y, Tan W. Predicting postoperative complications after pneumonectomy using machine learning: a 10-year study. Ann Med. 2025;57(1):2487636. Epub 2025 Apr 7. PMID: 40193241; PMCID: PMC11980193. Tables Tables 1 and 2 are available in the Supplementary Files section. Additional Declarations No competing interests reported. Supplementary Files SupplementaryfiguresBMC.docx Supplementary figures: Supplementary Figure 1. Performance comparison of feature selection algorithms. Violin plots show the distribution of the Area Under the Receiver Operating Characteristic (AUROC) curve for three feature selection methods: Minimum Redundancy Maximum Relevance (mRMR), Lasso logistic regression, and Elastic-Net logistic regression. Performance was evaluated within a 20-repetition bootstrapping framework on the training data. Panel (A) displays the results using the clinical dataset, and panel (B) shows the results using the spirometric dataset. Supplementary Figure 2. Elastic-net feature importance ranking for clinical variables. The higher the number, the greater the relevance of the predictor for the algorithm. Supplementary Figure 3. Elastic-net feature importance ranking for spirometric variables. The higher the number, the greater the relevance of the predictor for the algorithm. Supplementary Figure 4. Model performance in the external validation cohort, evaluated by Area Under the Receiver-Operator Characteristic (AUROC) curves. The panels show the performance of deployment models for two distinct outcomes: (A) cardiovascular complications and (B) respiratory complications. Within each panel, the curve on the left represents a model using FEV1_TEOR alone, while the curve on the right represents the combined model using both the Pathologic Score and FEV1_TEOR. For all curves, the light grey shading represents the bootstrapped 95% confidence interval, and the red line indicates the performance of a random classifier (AUC = 0.50). SupplementarytablesBMC.docx Supplementary tables: Supplementary Table 1A. Description of preoperative clinical variables and their completeness rates. Supplementary Table 1B. Description of preoperative spirometric variables and their completeness rates. Tables.docx Cite Share Download PDF Status: Under Review Version 1 posted Reviews received at journal 13 Mar, 2026 Reviews received at journal 07 Mar, 2026 Reviewers agreed at journal 04 Mar, 2026 Reviews received at journal 02 Mar, 2026 Reviewers agreed at journal 02 Mar, 2026 Reviewers agreed at journal 27 Feb, 2026 Reviewers invited by journal 24 Feb, 2026 Editor invited by journal 05 Feb, 2026 Editor assigned by journal 13 Jan, 2026 Submission checks completed at journal 12 Jan, 2026 First submitted to journal 12 Jan, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8554813","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":596156679,"identity":"bf11a2c6-52ca-4d4d-b561-0c78af5ce65e","order_by":0,"name":"Filippo Lococo","email":"","orcid":"","institution":"Catholic University of the Sacred Heart","correspondingAuthor":false,"prefix":"","firstName":"Filippo","middleName":"","lastName":"Lococo","suffix":""},{"id":596156683,"identity":"03833757-7e27-4445-830f-bb35a13f208d","order_by":1,"name":"Carolina Sassorossi","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABAklEQVRIiWNgGAWjYBAC+QYYixlCyYGIAw/waDE4gKbFGKwlAZ8WdIFEsLV4tbAff/zh4x4be/l2BsYPP/fYpM8PO/wQaIudnG4Ddi3yPTlmkjOepSVuOMzALNnzLC134+00A6CWZGOzA9i1MBzIYWPmOXA4wYCZgY0ByMjdODsBpOVA4jZcWs4/f/wZqNJevpmBjfHPgcPphrPTP+DXciPBQBqohbHhMAPEOnnpHPy2GNx4A/TLAZBfGJulZQ6kGW6Qzik4kGCA2y/y/emPP3w4AAyx/sMHP745YCMvPzt984cPFXZyOL2PAIwNEHvBKjHiCx9AJKFRMApGwSgYBRAAAEgIYv6WKl+1AAAAAElFTkSuQmCC","orcid":"","institution":"Catholic University of the Sacred Heart","correspondingAuthor":true,"prefix":"","firstName":"Carolina","middleName":"","lastName":"Sassorossi","suffix":""},{"id":596156686,"identity":"34826bae-7685-4a41-946a-fd5e96e711e1","order_by":2,"name":"Davide Dalfovo","email":"","orcid":"","institution":"University Hospital Carl Gustav Carus, TUD Dresden University of Technology","correspondingAuthor":false,"prefix":"","firstName":"Davide","middleName":"","lastName":"Dalfovo","suffix":""},{"id":596156688,"identity":"b620139c-0b9d-46b2-9657-3b0f862e5306","order_by":3,"name":"Annalisa Campanella","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Annalisa","middleName":"","lastName":"Campanella","suffix":""},{"id":596156689,"identity":"0a18d6d3-86c1-401d-831b-c21a14d5960f","order_by":4,"name":"Luca Boldrini","email":"","orcid":"","institution":"Catholic University of the Sacred Heart","correspondingAuthor":false,"prefix":"","firstName":"Luca","middleName":"","lastName":"Boldrini","suffix":""},{"id":596156690,"identity":"c519d3f1-dd04-4c08-955c-a7b4e5ac2a1d","order_by":5,"name":"Filippo Gallina","email":"","orcid":"","institution":"Regina Elena National Cancer Institute","correspondingAuthor":false,"prefix":"","firstName":"Filippo","middleName":"","lastName":"Gallina","suffix":""},{"id":596156691,"identity":"45b5a852-5e0a-4060-86d3-1df747052dd1","order_by":6,"name":"Mattia Bruschi","email":"","orcid":"","institution":"Regina Elena National Cancer Institute","correspondingAuthor":false,"prefix":"","firstName":"Mattia","middleName":"","lastName":"Bruschi","suffix":""},{"id":596156693,"identity":"bc4d04e3-5654-428f-8fa4-c1445c2a78b5","order_by":7,"name":"Virginia Proietti","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Virginia","middleName":"","lastName":"Proietti","suffix":""},{"id":596156694,"identity":"bae2fa19-9284-4c2e-b1ad-857cf2df949c","order_by":8,"name":"Akshaya Balamurugan","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Akshaya","middleName":"","lastName":"Balamurugan","suffix":""},{"id":596156695,"identity":"aa7ce759-d8e7-48ff-8172-8d0819b16493","order_by":9,"name":"Esther G C Troost","email":"","orcid":"","institution":"OncoRay – National Center for Radiation Research in Oncology, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Helmholtz-Zentrum Dresden- Rossendorf","correspondingAuthor":false,"prefix":"","firstName":"Esther","middleName":"G C","lastName":"Troost","suffix":""},{"id":596156696,"identity":"28219491-e098-481c-8338-4a36b8c82f73","order_by":10,"name":"Steffen Löck","email":"","orcid":"","institution":"University Hospital Carl Gustav Carus, TUD Dresden University of Technology","correspondingAuthor":false,"prefix":"","firstName":"Steffen","middleName":"","lastName":"Löck","suffix":""},{"id":596156697,"identity":"184c4769-a9a2-4862-b72d-d9548812a79b","order_by":11,"name":"Róza Ádány","email":"","orcid":"","institution":"ELKH-DE Public Health Research Group, Faculty of Medicine, University of Debrecen","correspondingAuthor":false,"prefix":"","firstName":"Róza","middleName":"","lastName":"Ádány","suffix":""},{"id":596156698,"identity":"284d0143-37e0-4aaa-a8c3-a3d6be0004e4","order_by":12,"name":"Núria Farré","email":"","orcid":"","institution":"Hospital de la Santa Creu i Sant Pau (IR-HSCSP)","correspondingAuthor":false,"prefix":"","firstName":"Núria","middleName":"","lastName":"Farré","suffix":""},{"id":596156699,"identity":"97d25e76-199a-4c13-b149-bbd584ab5884","order_by":13,"name":"Ece Öztürk","email":"","orcid":"","institution":"School of Medicine, Turkey and Koç University Research Center for Translational Medicine (KUTTAM), Sariyer, Koç University, , Turkey.","correspondingAuthor":false,"prefix":"","firstName":"Ece","middleName":"","lastName":"Öztürk","suffix":""},{"id":596156700,"identity":"6f1849f8-1e35-4d3a-a825-b7b26a586831","order_by":14,"name":"Edoardo Mercadante","email":"","orcid":"","institution":"Regina Elena National Cancer Institute","correspondingAuthor":false,"prefix":"","firstName":"Edoardo","middleName":"","lastName":"Mercadante","suffix":""},{"id":596156701,"identity":"87509862-02c9-4cf5-84e3-c48e40b03c2e","order_by":15,"name":"Marco Chiappetta","email":"","orcid":"","institution":"Thoracic Surgery Unit, Magna Graecia University","correspondingAuthor":false,"prefix":"","firstName":"Marco","middleName":"","lastName":"Chiappetta","suffix":""},{"id":596156702,"identity":"ee22abe8-dc79-4fac-8935-f43a19dda561","order_by":16,"name":"Alessandra Cancellieri","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Alessandra","middleName":"","lastName":"Cancellieri","suffix":""},{"id":596156703,"identity":"8050cb78-7226-4e82-acf9-9bb1df758828","order_by":17,"name":"Rocco Trisolini","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Rocco","middleName":"","lastName":"Trisolini","suffix":""},{"id":596156704,"identity":"b90be12e-7131-49b6-bddf-5d2b5a9ffc83","order_by":18,"name":"Emilio Bria","email":"","orcid":"","institution":"A. Gemelli University Hospital Foundation IRCCS","correspondingAuthor":false,"prefix":"","firstName":"Emilio","middleName":"","lastName":"Bria","suffix":""},{"id":596156705,"identity":"9ffc934c-59a3-4487-8559-5cd8459c8d2a","order_by":19,"name":"Antonio Gasbarrini","email":"","orcid":"","institution":"Catholic University of the Sacred Heart","correspondingAuthor":false,"prefix":"","firstName":"Antonio","middleName":"","lastName":"Gasbarrini","suffix":""},{"id":596156706,"identity":"a9a7c194-c06d-412b-a446-cc1285d6712f","order_by":20,"name":"Stefano Margaritora","email":"","orcid":"","institution":"Catholic University of the Sacred Heart","correspondingAuthor":false,"prefix":"","firstName":"Stefano","middleName":"","lastName":"Margaritora","suffix":""}],"badges":[],"createdAt":"2026-01-08 20:08:17","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8554813/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8554813/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":103585160,"identity":"a22b89e9-e7d5-4245-9c09-c23d9ab99657","added_by":"auto","created_at":"2026-02-27 11:02:22","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":275966,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eWorkflow for Patient Selection and Machine Learning Model Development and Evaluation\u003c/strong\u003e. The process involves data collection, preprocessing, feature selection, training of different machine learning models and evaluation on an external cohort.\u003c/p\u003e","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/30f7581e18c3f492d0e9f61a.png"},{"id":103585166,"identity":"5657545b-a821-4816-b6b8-08ec460d7907","added_by":"auto","created_at":"2026-02-27 11:02:22","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":148508,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDistribution of post operative complications\u003c/strong\u003e. The pie chart illustrates the relative frequency of all 48 postoperative complication events.\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/b460e01abaa2e1c17601a98f.png"},{"id":103585165,"identity":"16e625de-0588-46f5-94a7-555a101795b4","added_by":"auto","created_at":"2026-02-27 11:02:22","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":143617,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eMachine learning workflow and model performance on the training cohort. (A)\u003c/strong\u003e The bar plot shows the top 10 most important clinical features ranked by elastic net feature importance. The table below displays the peak cross-validated AUROC for four ML models, identifying the Pathologic Score as the single best predictor (AUROC = 0.674) using Logistic Regression. \u003cstrong\u003e(B)\u003c/strong\u003e The bar plot shows the top 10 most important spirometric features. The corresponding table reports the peak cross-validated AUROC for each model, identifying FEV1_TEOR as the single best predictor (AUROC = 0.738), also via Logistic Regression. \u003cstrong\u003e(C)\u003c/strong\u003e AUROC for the final model, which combines the Pathologic Score and FEV1_TEOR. The model achieves an AUROC of 0.82 (95% CI [0.72-0.92]). The shaded area represents the 95% confidence interval, and the red diagonal line indicates a random classifier (AUROC = 0.50). \u003cstrong\u003e(D)\u003c/strong\u003e A nomogram developed from the final logistic regression model to provide a graphical tool for calculating the probability of postoperative complications based on a patient's Pathologic Score and FEV1_TEOR.\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/ef1b7dd9853da04526b61677.png"},{"id":104399254,"identity":"a2110747-65d0-44d8-9f4d-65f772d0a6ac","added_by":"auto","created_at":"2026-03-11 12:05:15","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":84747,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eModel performance in the external validation cohort to predict postoperative complications.\u003c/strong\u003e The panels show the performance,\u003cstrong\u003e \u003c/strong\u003eevaluated by Area Under the Receiver-Operator Characteristic (AUROC) curves, of models using: \u003cstrong\u003e(A)\u003c/strong\u003e FEV1_TEOR alone; and \u003cstrong\u003e(B)\u003c/strong\u003e the combined Pathology Score and FEV1_TEOR. For all curves, the light grey shading represents the bootstrapped 95% confidence interval, and the red line indicates the performance of a random classifier (AUC = 0.50).\u003c/p\u003e","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/2fabf7bd4e585a0a3f82ba88.png"},{"id":104407465,"identity":"63a06d8e-e853-474f-8fee-094f82dcacc3","added_by":"auto","created_at":"2026-03-11 12:38:17","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1436648,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/799c966f-f246-4cf3-a363-fe830064866a.pdf"},{"id":103585164,"identity":"b2de42c1-4a1f-45cf-b434-72a39dd268f5","added_by":"auto","created_at":"2026-02-27 11:02:22","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":502737,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary figures:\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Figure 1. \u003c/strong\u003ePerformance comparison of feature selection algorithms. Violin plots show the distribution of the Area Under the Receiver Operating Characteristic (AUROC) curve for three feature selection methods: Minimum Redundancy Maximum Relevance (mRMR), Lasso logistic regression, and Elastic-Net logistic regression. Performance was evaluated within a 20-repetition bootstrapping framework on the training data. Panel (A) displays the results using the clinical dataset, and panel (B) shows the results using the spirometric dataset.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Figure 2.\u003c/strong\u003e Elastic-net feature importance ranking for clinical variables. The higher the number, the greater the relevance of the predictor for the algorithm.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Figure 3.\u003c/strong\u003e Elastic-net feature importance ranking for spirometric variables. The higher the number, the greater the relevance of the predictor for the algorithm.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Figure 4.\u003c/strong\u003e Model performance in the external validation cohort, evaluated by Area Under the Receiver-Operator Characteristic (AUROC) curves. The panels show the performance of deployment models for two distinct outcomes: (A) cardiovascular complications and (B) respiratory complications. Within each panel, the curve on the left represents a model using FEV1_TEOR alone, while the curve on the right represents the combined model using both the Pathologic Score and FEV1_TEOR. For all curves, the light grey shading represents the bootstrapped 95% confidence interval, and the red line indicates the performance of a random classifier (AUC = 0.50).\u003c/p\u003e","description":"","filename":"SupplementaryfiguresBMC.docx","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/ecf23c03ae4b330973df6d25.docx"},{"id":104399147,"identity":"d9aa4479-3977-4faa-9777-0c821cb3b6d1","added_by":"auto","created_at":"2026-03-11 12:04:49","extension":"docx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":27890,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary tables\u003c/strong\u003e:\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Table 1A\u003c/strong\u003e. Description of preoperative clinical variables and their completeness rates.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSupplementary Table 1B\u003c/strong\u003e. Description of preoperative spirometric variables and their completeness rates.\u003c/p\u003e","description":"","filename":"SupplementarytablesBMC.docx","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/ed9c08ea0f8f405ee6aab384.docx"},{"id":104398352,"identity":"08d45408-5baa-4374-9fa3-0824bf309697","added_by":"auto","created_at":"2026-03-11 12:01:58","extension":"docx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":21402,"visible":true,"origin":"","legend":"","description":"","filename":"Tables.docx","url":"https://assets-eu.researchsquare.com/files/rs-8554813/v1/bda3f2ec7af77a315060c3c3.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation","fulltext":[{"header":"INTRODUCTION","content":"\u003cp\u003eThe application of artificial intelligence (AI) and machine learning (ML) into lung cancer treatment has shown potential in improving diagnostic accuracy, treatment personalization, and patient outcomes [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e]. AI applications in thoracic surgery is revealing the potential to improve surgical precision, patient safety, and overall care efficiency [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. Despite remarkable advances in lung cancer management, postoperative complications continue to represent a major determinant of outcomes, as they may increase morbidity and mortality. In the era of novel multimodal strategies for resectable NSCLC, which may also affect the complexity of surgical procedures, the identification of patient subgroups at higher risk of postoperative complications is of paramount importance. Such stratification could enable the implementation of personalized perioperative strategies aimed at improving overall outcomes. Under this point of view, Advances in precision medicine offer hope to improve also surgical outcomes [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e].\u003c/p\u003e \u003cp\u003ePrevious studies investigated the possibility to individuate high risk patient in different setting (5), considering also the radiological or pre-operative features and using AI based models (5,6)\u003c/p\u003e \u003cp\u003eIn this frame, the LANTERN (Lung cancer multi-omics digital human avatars for integrating precision medicine) project [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e] arises, with the aims to deliver a novel approach for comprehensive lung cancer decision making solutions, based on predictive digital platforms powered by the integration multi-omics data. The current study, namely LANTERN1, has been led with the aim of creating a machine learning algorithm, able to analyze all the pre operative features of patients candidate to thoracic surgery, to predict the post operative complication risk.\u003c/p\u003e \u003cp\u003eIn literature, different studies have been published with the aim of predicting post operative complications after non cardiac thoracic surgery. For example, He and coworkers [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e] analysed a ICU (intensive care unit) cohort, to predict in particular respiratory complications and they used univariate and multilogistic regression to perform the analysis. Another interesting experience, from Nonomura and coworkers [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e] tries to predict the risk of pulmonary complications on the base of pre operative radiological features. A very interesting experience is from Salati et al [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. They already proposed an AI based model able to define the risk of cardiac and pulmonary complications in the early postoperative period for patients submitted to anatomic lung resection. By the way, as reported by the authors, their experience is a monocentric one and they analyzed the risk on the base of 50 preoperative features. Our study considered more than 100 variables and comprises an external cohort to validate the results. To the best of our knowledge, our analysis would be the first one with these characteristics, with the aim of predicting post operative complications, in patients candidate to surgery for NSCLC, with AI algorithms. This would improve the process of risk definition and therapy planning for patients undergoing lung surgery\u003c/p\u003e \u003cp\u003eWith this study, we aimed to develop an AI based model for complication risk in a large cohort of patients who underwent curative surgery for NSCLC.\u003c/p\u003e"},{"header":"METHODS","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eEthics\u003c/h2\u003e \u003cp\u003e The protocol of the study has been drawn up in accordance with the Standards of Good Clinical Practice of the European Union and the current review of the Helsinki Declaration and has been approved by the Ethics Committee of this structure. All methods will be carried out in accordance with relevant guidelines and regulations. The informed consent has been obtained from all subjects and/or their legal guardian(s). Approval of Fondazione Policlinico Universitario Agostino Gemelli IRCCS \u0026ndash; Universit\u0026agrave; Cattolica del Sacro Cuore Ethics Committee Number: 5420\u0026thinsp;\u0026minus;\u0026thinsp;0002485/23; Trial registration - clinicaltrial.gov: NCT05802771.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eStudy Design and Population\u003c/h3\u003e\n\u003cp\u003eThis study was designed as a prospective, multicenter observational trial followed by external validation, in accordance with the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis) guidelines [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]. Data for 337 patients were collected prospectively between March 2023 and December 2024 within the framework of the LANTERN consortium [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]. Of these, 231 patients with Stage I-III Non-Small Cell Lung Cancer (NSCLC) underwent curative-intent surgery. Spirometry was performed for 176 of these patients (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e). To assess the generalizability of the final model, we used an independent, retrospective surgical cohort of 232 patients collected from a separate cancer research center..\u003c/p\u003e\n\u003ch3\u003eData Acquisition and Endpoints\u003c/h3\u003e\n\u003cp\u003eWe collected and categorized a comprehensive set of preoperative variables. The clinical dataset included 80 variables, covering among other things, patient demographics, clinical characteristics, and a panel of comorbidities (see Supplementary Table\u0026nbsp;1A). The second part of the dataset consisted of 43 spirometry variables derived from preoperative pulmonary function tests, including but not limited to Forced Expiratory Volume in one second (FEV1), Diffusing Capacity of the Lungs for Carbon Monoxide (DLCO), and other derived flow and volume parameters (see Supplementary Table\u0026nbsp;1B). The primary endpoint was the occurrence of any postoperative complication, graded according to the Clavien-Dindo classification [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eWe considered 20 adverse events as defined as Fernandez and coworkers reported [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e] : air leakage, pleural effusion, subcutaneous emphysema, pneumonia, arrhythmia, hemorrhage, sepsis, wound dehiscence, abdominal collection, anemia, acute myocardial ischemia, deep venous thrombosis, fistula, infected wound, occlusion, pulmonary embolism, renal failure, infection of the urinary tract.\u003c/p\u003e \u003cp\u003eA further sub-analysis was performed to predict specific complications, grouped as cardiovascular events (defined as acute myocardial ischemia, arrhythmia, deep venous thrombosis pulmonary embolism) and respiratory events (defined as pleural effusion, air leakage, subcutaneous emphysema, pneumonia).\u003c/p\u003e \u003cp\u003eThe cardiopulmonary as well as respiratory complications were analyzed as a binary outcome (complicated vs not-complicated patient). Only complications occurred within the first 30 days after the operation, or later if experienced within the same postoperative hospital stay were included in the present analysis.\u003c/p\u003e \u003cp\u003eBefore proceeding with the AI analysis, the database underwent a preprocessing with specific data quality assessment procedures for optimizing data completeness (reported in Supplementary Table\u0026nbsp;1A and 1B) and consistency.\u003c/p\u003e\n\u003ch3\u003eFeature Selection, Model Optimization and Evaluation\u003c/h3\u003e\n\u003cp\u003eTo identify preoperative predictors, we designed and implemented a machine learning workflow. First, we performed a feature selection protocol embedded within a 20-repetition bootstrapping framework to ensure the stability of feature rankings. Three distinct algorithms were applied: Minimum Redundancy Maximum Relevance (mRMR) [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e], Lasso logistic regression [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e], and Elastic-Net logistic regression [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e]. Feature importance was aggregated across all repetitions to generate a stable ranked list, and the algorithm demonstrating the highest average Area Under the Receiver Operating Characteristic (AUROC) curve was chosen as the optimal method (Supplementary Figs.\u0026nbsp;1A and 1B).\u003c/p\u003e \u003cp\u003eUsing the best ranked feature list, we then identified the optimal combination of signature length and machine learning model. We evaluated four supervised learning algorithms: logistic regression (LR), random forest (RF) [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e], a support-vector machine (SVM) with a radial basis function (RBF) kernel [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e], and eXtreme Gradient Boosting (XGBoost) [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]. For each algorithm, we incrementally constructed feature signatures of increasing length, from 1 (the most important feature) to 10 (the top 10 most important features). The performance of each algorithm and signature length combination was evaluated using a 5-repetition, 3-fold cross-validation scheme on the training data. Model-specific hyperparameters were tuned to maximize performance. The configuration that maximized the mean AUROC across all cross-validation folds was selected as optimal.\u003c/p\u003e \u003cp\u003eFinally, predictive models were trained on the entire training dataset using the optimal learner and feature signature length. This process was performed using clinical dataset only, spirometry dataset only, and integrating top features from both datasets. The performance of these final models was then assessed on the independent, external validation cohort by calculating the AUROC with 95% confidence intervals.\u003c/p\u003e \u003cp\u003eAll data processing, statistical analysis, and machine learning model development were conducted using R (version 4.5.1). The feature ranking, signature determination, and final model development were supported by the \u0026ldquo;Fully Automated Machine Learning with Interpretable Analysis of Results\u0026rdquo; (FAMILIAR) framework [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe initial step in our modeling process was to identify the most stable and performant feature selection algorithm. We evaluated three distinct methods within a 20-repetition bootstrapping framework. The Elastic-Net algorithm consistently achieved the highest average Area Under the Receiver Operating Characteristic (AUROC) curve for both the clinical and spirometric datasets (Supplementary Fig.\u0026nbsp;1A and 1B).\u003c/p\u003e"},{"header":"RESULTS","content":"\u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003ePatient Characteristics and Outcomes\u003c/h2\u003e \u003cp\u003eA total of 231 patients who underwent curative-intent surgery for NSCLC were included in the training cohort. The baseline clinical and pathological characteristics of this cohort are summarized in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e. The mean age was 68.6 years (SD\u0026thinsp;\u0026plusmn;\u0026thinsp;9.5), and the majority of patients were former or current smokers (184 patients, 79.7%). Postoperative complications of any grade occurred in 37 patients, corresponding to an overall complication rate of 16.0%. A total of 48 complications occurred in these 37 patients, as nine patients experienced two complications and one patient experienced three. Among the patients with complications, the majority experienced low-grade events (Clavien-Dindo grade I-II: 23 patients, 63.9%), with a mean time from surgery to complication onset of 7 days (SD\u0026thinsp;\u0026plusmn;\u0026thinsp;13). The characteristics of the complications are summarized in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e and Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eModel Development and Internal Performance\u003c/h3\u003e\n\u003cp\u003eTo identify the most stable and performant feature selection algorithm, we compared the performance of three methods. The Elastic-Net algorithm consistently achieved the highest average Area Under the Receiver Operating Characteristic (AUROC) curve for both the clinical and spirometric datasets (Supplementary Fig.\u0026nbsp;1A and 1B) and was therefore selected to generate the final feature importance rankings.Based on these Elastic-Net rankings (Supplementary Figs.\u0026nbsp;2 and 3), the machine learning workflow identified the Charlson Comorbidity index (also defines as Pathologic Score in text (see supplementary Table\u0026nbsp;1)) (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA) and the ratio of preoperative Forced Expiratory Volume in one second to its theoretical value (FEV1_TEOR) (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB) as the most influential variables for predicting postoperative complications. This was determined by evaluating four different machine learning models across various feature set sizes (signature lengths). The tables in Figs.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA and \u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB report the peak performance for each model. The selection of these features is the result of a cross-validated hyperparameter search that included optimizing the signature length for four distinct machine learning algorithms. The tables report the optimal cross-validated AUROC for each algorithm. A logistic regression model with a signature length of one demonstrated the highest predictive power for both clinical (AUROC of 0.674) and spirometric (AUROC of 0.738) datasets.\u003c/p\u003e \u003cp\u003eWhen performed on the entire training cohort, a model based on the pathologic score alone achieved an AUROC of 0.72 (95% CI [0.62\u0026ndash;0.81]), while a model using only preoperative FEV1_TEOR demonstrated superior predictive power with an AUROC of 0.77 (95% CI [0.67\u0026ndash;0.89]). By integrating these two key features, the combined model produced the highest performance on the training data, achieving an AUROC of 0.82 (95% CI [0.72\u0026ndash;0.92]), indicating a strong predictive signal within the development dataset (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eC). To facilitate clinical application, this final two-feature model was translated into a nomogram, providing a graphical tool to estimate the probability of postoperative complications (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD).\u003c/p\u003e \u003cp\u003eWhen fixing the pathologic score at 5, changes in FEV1_TEOR are associated with significant variations in postoperative risk: a value of 75% corresponds to an estimated risk of 26.4%, 65% to 37.0%, and 55% to 49.1%. Conversely, when fixing FEV1_TEOR at 65%, variations in the pathologic score also show a clear trend: a score of 4 corresponds to a 31.0% risk, 5 to 37.0%, and 6 to 43.3%. These findings suggest an approximate 6% increase in postoperative risk for each incremental step in pathologic score, and an increase of about 11\u0026ndash;12% for every 10% decrease in FEV1_TEOR.\u003c/p\u003e\n\u003ch3\u003eExternal Validation of Predictive Models\u003c/h3\u003e\n\u003cp\u003eWhen the trained models were evaluated on the independent external validation cohort, their performance varied. The model based only on the pathologic score failed to generalize, showing no predictive ability beyond random chance (AUROC\u0026thinsp;=\u0026thinsp;0.50, 95% CI [0.43\u0026ndash;0.58]). In contrast, the preoperative FEV1_TEOR was confirmed as a robust and generalizable predictor (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eA), with the spirometry-only model achieving an AUROC of 0.72 (95% CI [0.66\u0026ndash;0.79]) on the external dataset. The combined model, integrating both pathologic score and FEV1_TEOR, also demonstrated valid predictive performance (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eB), with an AUROC of 0.68 (95% CI [0.61\u0026ndash;0.75]), although this was slightly lower than the performance of the FEV1_TEOR-only model.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eAnalysis of Specific Complication Types\u003c/h2\u003e \u003cp\u003eTo investigate the prediction of specific adverse events, the modeling pipeline was re-applied to develop distinct models for cardiovascular and respiratory complications. Using the previously identified optimal features, we trained and validated models specifically for these endpoints. In the external validation cohort, similar to the overall complication analysis, the pathologic score alone showed no predictive power for these specific endpoints. Whereas, model using only preoperative FEV1_TEOR (Supplementary Fig.\u0026nbsp;4A) proved to be effective for predicting both cardiovascular events (AUROC\u0026thinsp;=\u0026thinsp;0.76, 95% CI [0.64\u0026ndash;0.87]) and respiratory events (AUROC\u0026thinsp;=\u0026thinsp;0.69, 95% CI [0.61\u0026ndash;0.77]). Finally, the combined model, incorporating both FEV1_TEOR and pathologic score (Supplementary Fig.\u0026nbsp;4B), also remained predictive for both sub-endpoints, though its performance was somewhat attenuated, achieving an AUROC of 0.69 (95% CI [0.56\u0026ndash;0.81]) for cardiovascular and 0.62 (95% CI [0.53\u0026ndash;0.71]) for respiratory complications. To facilitate clinical use, nomograms derived from these combined models are provided for estimating the probability of cardiovascular and respiratory events (Supplementary Fig.\u0026nbsp;4C).\u003c/p\u003e \u003c/div\u003e"},{"header":"DISCUSSION","content":"\u003cp\u003eThe present analysis, based on LANTERN data, focused on a still not well investigated topic, the prediction of post operative complications in patients candidate to thoracic surgery for NSCLC with curative intent with ML algorithms.\u003c/p\u003e \u003cp\u003eThe LANTERN project was ideated [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e] with the purpose to create personalized models of lung cancer treatment through the collection of a great amount of patients data (big data) in digitalized human avatar (DHA). Using an artificial intelligence\u0026ndash;based approach, we were able to include a large number of variables (over one hundred), which are typically excluded from analyses relying on traditional statistical methods due to limited data collection or intrinsic methodological constraints.\u003c/p\u003e \u003cp\u003eOnly few studies already described the AI application for post-operative complication prediction. Matar and coworkers [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e], trained a machine learning model to create a nomogram to predict osteonecrosis after fibula free flap reconstruction in oral cancer patients. In their model, pre-operative radiation therapy, post-operative wound infection, plate exposure, and surgical re-exploration were the most influential features in model predictions. Another interesting experience comes from Wang and colleagues [\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. They focused on a machine learning-based model to predict of gastroparesis risk following complete mesocolic excision. They found this complication to be associated with advanced age, prolonged surgeries, extensive intraoperative blood loss, surgical techniques, low serum protein levels, anaemia, diabetes, and hypothyroidism and the strength of their results stays in the fact that they validated their results with an internal validation.\u003c/p\u003e \u003cp\u003eAlso in pediatric surgery ML algorithms have been studied. Liu et al. [\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e] presented an integrated machine learning model delayed graft function prediction in pediatric renal transplantation from deceased donors.\u003c/p\u003e \u003cp\u003eWhile the aforementioned studies successfully apply specific ML algorithms, our work highlights the value of a structured machine learning workflow. We systematically tested combinations of feature selection techniques and ML learners to ensure the final model represents the most robust and predictive configuration for identifying patients at risk of postoperative complications.\u003c/p\u003e \u003cp\u003eConcerning instead the prediction of complication specifically in thoracic surgery, some studied formulated analysis based on traditional statistics to create nomograms. For example, Wang et al. [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e] created indeed a nomogram, based on four objective and easily assessed factors. It demonstrates excellent predictive performance for pediatric postoperative pulmonary complications after one-lung ventilation, enabling early risk assessment and targeted interventions to improve patient outcomes. Zhou and colleagues [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e] led a prospective bicentric study for the elderly patients submitted to thoracic surgery. They developed and validated of a nomogram for predicting postoperative pulmonary complications in this cohort of patients. This nomogram could identify patients at risk for complication within 30 days. Liu and coworkers developed a very similar experience [\u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e] creating a nomogram to predict complications in older patients. They identified preoperative presence of chronic obstructive pulmonary disease (COPD), elevated leukocyte count, higher partial pressure of arterial carbon dioxide (PaCO2) level, surgical site, thoracotomy, intraoperative hypotension, blood loss\u0026thinsp;\u0026gt;\u0026thinsp;100 mL, surgery duration\u0026thinsp;\u0026gt;\u0026thinsp;180 min, and malignant tumor to be good prognosticator of complications. A very interesting experience comes from Shixin and colleagues [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e], who focused their analysis on post operative complication prediction in patients surgically treated for NSCLC. The population of interest is the same we considered but they developed a nomogram through traditional statistic approach.\u003c/p\u003e \u003cp\u003eTaking into account the prediction of complications in thoracic surgery for NSCLC, with the application of AI, to the best of our knowledge, only 2 experiences are reported. A similar analysis to ours, was reported by Salati et al. [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e], but they considered patients submitted to surgery for both malignant and non-malignant conditions and analysed a lower number of variables (n\u0026thinsp;=\u0026thinsp;50). Furthermore, their one was a retrospective experience, instead ours is a prospective one with the validation on an external cohort. Another one comes from Wand and colleagues [\u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e], who considered patients operated for NSCLC, but then took into account for the analysis, only pneumonectomies. They applied ML models, as multiple machine-learning models (Logistic regression, eXtreme Gradient Boosting, Random forest, Light Gradient Boosting Machine and Na\u0026iuml;ve Bayes). Multivariate logistic regression analysis revealed that age, surgery duration, intraoperative intercostal nerve block, postoperative patient-controlled analgesia, bronchial blocker use and sufentanil use were independent predictors of cardiovascular and neurologic complications.\u003c/p\u003e \u003cp\u003eOur analysis brings up some novelties if compared to literature. To our knowledge, this is the first study to evaluate a cohort of patients surgically treated for NSCLC with curative intent, encompassing all types of resections performed in this setting\u0026mdash;both anatomical and non-anatomical\u0026mdash;according to the best available evidence. Furthermore, one interesting aspect, is that more than a hundred variables were considered, of which nearly a half about spirometric parameters. Our results show the weight of the variables in predicting overall complications and the different type of complication with several clinical implications. In particular, pre operative FEV1_TEOR and the combination of Charlson comorbity index and pre operative FEV1_TEOR are the most powerlful factors in predicting post operative complications both in our series and in the external cohort, even if in this cohort with a lower AUROC. Charlson comorbity index alone could predict post operative complication only in our cohort. These results were also confirmed when analysing in particular cardiovascular and respiratory complications.\u003c/p\u003e \u003cp\u003eThe external cohort used for validation consisted of surgical patients, and therefore the indications for surgery were comparable to those applied at our center. However, the patient populations differed, as did the distribution of lobectomies and segmentectomies and the overall rate of postoperative complications. This represents a strength of our study, since the reproducibility of the results in an independent external cohort supports the robustness of the proposed model and demonstrates that it can be trained on heterogeneous patient populations while maintaining satisfactory performance.\u003c/p\u003e \u003cp\u003eThe observed differences between the two cohorts may be partly attributable to the ongoing process of data reporting within the dataset, which relies on human input and may introduce variability related to the subjective assessment of complications. Indeed, a well-recognized limitation of AI-based analyses is the challenge of reproducibility, as data codification, interpretation, and reporting are inherently dependent on human oversight and may vary across centers, even when the ontology and definitions are prospectively standardized. In addition, population characteristics may differ between institutions, despite our efforts to select cohorts with comparable features in terms of ethnicity, lifestyle habits, and geographical background. From this perspective, the implementation of cloud-based infrastructures enabling continuous and automated data updating will be of paramount importance, as it would allow models to be continuously retrained, thereby reducing variability, improving reproducibility, and ultimately enhancing their generalizability.In light of the considerations outlined above, the proposed model is not yet ready for routine clinical implementation. In accordance with the objectives of the LANTERN project, prospective validation in our patient cohort will be required once enrollment is completed.\u003c/p\u003e \u003cp\u003eThe use of a machine learning approach enabled the integration of a broad range of variables that are frequently excluded from conventional statistical analyses because of data quality issues or methodological limitations. This represents a clear advantage over traditional regression models, as it allows the simultaneous evaluation of multiple clinical features that have been insufficiently explored in relation to postoperative complications, largely due to the constraint of limiting the number of covariates in standard multivariate analyses.\u003c/p\u003e \u003cp\u003eThe ultimate goal is the development of an online application (e.g., for smartphones and tablets, already widely adopted in clinical practice) capable of estimating the overall risk of postoperative complications, as well as the specific risk of cardiovascular and respiratory events, based on patient data, including at least the most clinically relevant variables.\u003c/p\u003e \u003cp\u003eIn the near future, such a tool could support personalized perioperative management by enabling targeted preventive strategies. For instance, a predicted increased risk of postoperative atrial fibrillation could prompt the initiation of preoperative beta-blocker therapy. More broadly, postoperative intensive care unit (ICU) utilization could be anticipated according to the estimated surgical risk, potentially reducing both healthcare costs and ICU-related complications.\u003c/p\u003e \u003cp\u003eAlthough cardiovascular and respiratory complication categories encompass events requiring different clinical interventions (e.g., antibiotic therapy for pneumonia versus surgical sealants for air leaks), identifying patients at increased risk for a specific complication group may heighten clinical awareness and facilitate the implementation of tailored preventive measures, such as prophylactic sealant use, early antibiotic treatment, or intensified respiratory physiotherapy.\u003c/p\u003e \u003cp\u003eOverall, this modeling approach has the potential to enhance clinical practice by informing training strategies, optimizing diagnostic and therapeutic decision-making, supporting interventional planning, and improving patient monitoring and follow-up.\u003c/p\u003e"},{"header":"CONCLUSIONS","content":"\u003cp\u003eIn the field of AI application in surgery, the implementation of a model including preoperative data, in particular, pre-operative FEV1_TEOR and the pathologic score permits to predict post-operative complications, selecting ad hoc strategies for patients management.\u0026nbsp;\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003eThe study was led in accordance to Helsinki Declaration\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003c/strong\u003e: The informed consent has been obtained from all subjects and/or their legal guardian(s). Approval of Fondazione Policlinico Universitario Agostino Gemelli IRCCS \u0026ndash; Universit\u0026agrave; Cattolica del Sacro Cuore Ethics Committee Number: 5420 \u0026minus; 0002485/23; Trial registration - clinicaltrial.gov: NCT05802771.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials\u003c/strong\u003e: The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent for publication\u003c/strong\u003e: not applicable\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding:\u003c/strong\u003e This project was supported by the Ministry of Health under the frame of ERA PerMed JTC2022\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026apos; contributions\u003c/strong\u003e: FL:Conceptualization, ; visualization: writing-review and editing, surpervision, methodology CS: Conceptualization, writing- \u0026nbsp;original draft preparation, visualization, writing-review and editing , DD and AC: data curation and formal analsysis LB, FG. MB, VP, AB, ET, SL, RA, NF, EO, EM: resources, MC: visualization, Alessandra Cancellieri: resources, investigation RT: resources, EB: resources, validation AG: supersivion, validation, Stefano Margaritora: supervision\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e: none\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eBonci EA, Bandura A, Dooley A, Erjan A, Gebreslase HW, Hategan M, Khanduja D, Lai E, Lescaie A, Nitescu GV, Ramalho S, Thiha A, Bertolaccini L. Artificial intelligence in NSCLC management for revolutionizing diagnosis, prognosis, and treatment optimization: A systematic review. Crit Rev Oncol Hematol. 2025;216:104929. Epub ahead of print. PMID: 40967442.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAleem MU, Khan JA, Younes A, Sabbah BN, Saleh W, Migliore M. Enhancing Thoracic Surgery with AI: A Review of Current Practices and Emerging Trends. Curr Oncol. 2024;31(10):6232\u0026ndash;44. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3390/curroncol31100464\u003c/span\u003e\u003cspan address=\"10.3390/curroncol31100464\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 39451768; PMCID: PMC11506543.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLococo F, Ghaly G, Chiappetta M, Flamini S, Evangelista J, Bria E, Stefani A, Vita E, Martino A, Boldrini L, Sassorossi C, Campanella A, Margaritora S, Mohammed A. Implementation of Artificial Intelligence in Personalized Prognostic Assessment of Lung Cancer: A Narrative Review. Cancers (Basel). 2024;16(10):1832. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3390/cancers16101832\u003c/span\u003e\u003cspan address=\"10.3390/cancers16101832\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 38791910; PMCID: PMC11119930.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLococo F, Boldrini L, Diepriye CD, Evangelista J, Nero C, Flamini S, Minucci A, De Paolis E, Vita E, Cesario A, Annunziata S, Calcagni ML, Chiappetta M, Cancellieri A, Larici AR, Cicchetti G, Troost EGC, \u0026Aacute;d\u0026aacute;ny R, Farr\u0026eacute; N, \u0026Ouml;zt\u0026uuml;rk E, Van Doorne D, Leoncini F, Urbani A, Trisolini R, Bria E, Giordano A, Rindi G, Sala E, Tortora G, Valentini V, Boccia S, Margaritora S, Scambia G. Lung cancer multi-omics digital human avatars for integrating precision medicine into clinical practice: the LANTERN study. BMC Cancer. 2023;23(1):540. doi: 10.1186/s12885-023-10997-x. Erratum in: BMC Cancer. 2023;23(1):1082. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12885-023-11606-7\u003c/span\u003e\u003cspan address=\"10.1186/s12885-023-11606-7\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 37312079; PMCID: PMC10262371.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHe X, Dong M, Xiong H, Zhu Y, Ping F, Wang B, Kang Y. Prediction models for postoperative pulmonary complications in intensive care unit patients after noncardiac thoracic surgery. BMC Pulm Med. 2024;24(1):420. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12890-024-03153-z\u003c/span\u003e\u003cspan address=\"10.1186/s12890-024-03153-z\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 39210309; PMCID: PMC11360767.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNonomura R, Tabata T, Sasaki T, Mitomo H, Ishibashi N, Sugawara T, Metoki H. Prediction of Postoperative Respiratory Complications after Lobectomy in Lung Cancer Patients with COPD by Quantitative Image Analysis: A Historical Cohort Study. Ann Thorac Cardiovasc Surg. 2022;28(6):411\u0026ndash;9. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.5761/atcs.oa.22-00133\u003c/span\u003e\u003cspan address=\"10.5761/atcs.oa.22-00133\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. Epub 2022 Oct 7. PMID: 36216576; PMCID: PMC9763719.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSalati M, Migliorelli L, Moccia S, Andolfi M, Roncon A, Guiducci GM, Xium\u0026egrave; F, Tiberi M, Frontoni E, Refai M. World J Surg. 2021;45(5):1585\u0026ndash;94. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1007/s00268-020-05948-7\u003c/span\u003e\u003cspan address=\"10.1007/s00268-020-05948-7\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. Epub 2021 Feb 16. PMID: 33594578. A Machine Learning Approach for Postoperative Outcome Prediction: Surgical Data Science Application in a Thoracic Surgery Setting.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCollins GS, Reitsma JB, Altman DG, Moons K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med. 2015;13:1.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDindo D, Demartines N, Clavien P-A. Classification of Surgical Complications: A New Proposal With Evaluation in a Cohort of 6336 Patients and Results of a Survey. Ann Surg. 2004;240:205\u0026ndash;13.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFernandez FG, Falcoz PE, Kozower BD, et al. The societyof thoracic surgeons and the european society of thoracic sur-geons general thoracic surgery databases: joint standardization ofvariables definitions and terminology. Ann Thorac Surg. 2015;99:368\u0026ndash;76.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePeng H, Long F, Ding C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. 2005;27:1226\u0026ndash;38.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTibshirani R. Regression Shrinkage and Selection Via the Lasso. J R Stat Soc Ser B Stat Methodol. 1996;58:267\u0026ndash;88.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZou H, Hastie T. Regularization and Variable Selection Via the Elastic Net. J R Stat Soc Ser B Stat Methodol. 2005;67:301\u0026ndash;20.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTin Kam Ho. Random decision forests. in Proceedings of 3rd International Conference on Document Analysis and Recognition vol. 1 278\u0026ndash;282. (IEEE Comput. Soc. Press, Montreal, Que., Canada, 1995).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273\u0026ndash;97.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen T, Guestrin C, XGBoost:. A Scalable Tree Boosting System. in \u003cem\u003eProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining\u003c/em\u003e 785\u0026ndash;794ACM, San Francisco California USA, (2016). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1145/2939672.2939785\u003c/span\u003e\u003cspan address=\"10.1145/2939672.2939785\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZwanenburg A, L\u0026ouml;ck S. familiar: End-to-End Automated Machine Learning and Model Evaluation. 1.5.0 \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.32614/CRAN.package.familiar\u003c/span\u003e\u003cspan address=\"10.32614/CRAN.package.familiar\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2022).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMatar DY, Mackert GA, Matar AY, Chen AC, Panayi AC, Knoedler L, Knoedler S, Yang R, Mady LJ, Kao HK. Artificial Intelligence Outperforms a Nomogram for Osteoradionecrosis Prognostication Following Fibula Free Flap Reconstruction in Oral Cancer Patients. J Stomatol Oral Maxillofac Surg. 2025 Oct 1:102584. doi: 10.1016/j.jormas.2025.102584. Epub ahead of print. PMID: 41043770.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang W, Yan Z, Zhang Z, Zhang Q, Jia Y. Machine learning-based prediction of gastroparesis risk following complete mesocolic excision. Discov Oncol. 2024;15(1):483. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1007/s12672-024-01355-9\u003c/span\u003e\u003cspan address=\"10.1007/s12672-024-01355-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 39331201; PMCID: PMC11436699.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLiu XY, Feng RT, Feng WX, Jiang WW, Chen JA, Zhong GL, Chen CW, Li ZJ, Zeng JD, Liu D, Zhou S, Hu JM, Liao GR, Liao J, Guo ZF, Li YZ, Yang SQ, Li SC, Chen H, Guo Y, Li M, Fan LP, Yan HY, Chen JR, Li LY, Liu YG. An integrated machine learning model enhances delayed graft function prediction in pediatric renal transplantation from deceased donors. BMC Med. 2024;22(1):407. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12916-024-03624-4\u003c/span\u003e\u003cspan address=\"10.1186/s12916-024-03624-4\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 39304842; PMCID: PMC11415997.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang L, Xiao T, Du Z, Chen T, Pei D, Qu S. Development and validation of a nomogram to pediatric postoperative pulmonary complications following thoracic surgery. BMC Anesthesiol. 2025;25(1):255. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12871-025-03122-x\u003c/span\u003e\u003cspan address=\"10.1186/s12871-025-03122-x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 40399778; PMCID: PMC12096645.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhou Y, Wang H, Lu D, Jiang T, Huang Z, Wang F, Yao Y, Gu Y, Wei W. Development and validation of a nomogram for predicting postoperative pulmonary complications in older patients undergoing noncardiac thoracic surgery: a prospective, bicentric cohort study. BMC Geriatr. 2025;25(1):169. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12877-025-05791-2\u003c/span\u003e\u003cspan address=\"10.1186/s12877-025-05791-2\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 40082767; PMCID: PMC11905546.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLiu J, Xue D, Wang L, Li Y, Liu L, Liao G, Cao J, Liu Y, Lou J, Li H, Yang Y, Mi W, Fu Q. Development and validation of a nomogram for predicting pulmonary complications in elderly patients undergoing thoracic surgery. Aging Clin Exp Res. 2024;36(1):197. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1007/s40520-024-02844-1\u003c/span\u003e\u003cspan address=\"10.1007/s40520-024-02844-1\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PMID: 39368046; PMCID: PMC11455794.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMa S, Li F, Li J, Wang L, Song H. Risk factor analysis and nomogram prediction model construction of postoperative complications of thoracoscopic non-small cell lung cancer. J Thorac Dis. 2024;16(6):3655\u0026ndash;67. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.21037/jtd-24-113\u003c/span\u003e\u003cspan address=\"10.21037/jtd-24-113\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. Epub 2024 Jun 12. PMID: 38983183; PMCID: PMC11228728.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang Y, Xie S, Liu J, Wang H, Yu J, Li W, Guan A, Xu S, Cui Y, Tan W. Predicting postoperative complications after pneumonectomy using machine learning: a 10-year study. Ann Med. 2025;57(1):2487636. Epub 2025 Apr 7. PMID: 40193241; PMCID: PMC11980193.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"},{"header":"Tables","content":"\u003cp\u003eTables 1 and 2 are available in the Supplementary Files section.\u003c/p\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"bmc-cancer","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"bcan","sideBox":"Learn more about [BMC Cancer](http://bmccancer.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/bcan/default.aspx","title":"BMC Cancer","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"NSCLC, AI, post operative complications, personalized medicine","lastPublishedDoi":"10.21203/rs.3.rs-8554813/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8554813/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eIntroduction\u003c/h2\u003e \u003cp\u003e: Artificial intelligence(AI) techniques may combine various omics datasets to create more accurate predictive models for lung cancer patients management. The aim of this study from the LANTERN project, is to develop an AI-based predictive model of post-operative complications after lung resection for NSCLC.\u003c/p\u003e\u003ch2\u003eMethods\u003c/h2\u003e \u003cp\u003eIn the framework of LANTERN Consortium we prospectively collected data from 3/2023 to 12/2024 of patients who underwent curative surgery for Stage I-III NSCLC and were herein analyzed considering 80 preoperative clinical features and 43 spirometric variables to predict the occurrence of post-operative complications. Prediction models were developed on the basis of different feature selection algorithms and machine-learning models. An external dataset composed by a surgical series of 232 patients was used for validation.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003eThe final analysis was conducted on 231 surgical patients. Post-operative complications were observed in 37 patients (16%). AI-based models showed that pathologic score (AUC\u0026thinsp;=\u0026thinsp;0.72, 95% CI [0.62\u0026ndash;0.81]) and pre-opFEV1_TEOR (AUC\u0026thinsp;=\u0026thinsp;0.77, CI [0.67\u0026ndash;0.89]) was the most relevant variables. Testing the models on the external dataset, while the predictive value of Pathologic score alone was reduced, pre-opFEV1_TEOR and the combination of Pathologic score and pre-opFEV1_TEOR were confirmed to predict post-op outcome. Postoperative risk rises by about 6% per pathologic score level and 11\u0026ndash;12% for each 10% decrease in FEV1_TEOR.\u003c/p\u003e\u003ch2\u003eConclusions\u003c/h2\u003e \u003cp\u003eCombining the FEV1_TEOR and the pathologic score permits to predict the complications risk in significant way. This model could be tested in a further prospective cohort of patients to verify its effectiveness in order to improve the perioperative management of lung resection candidates.\u003c/p\u003e","manuscriptTitle":"LANTERN-01: AI model for Postoperative Complications Prediction after NSCLC Lung Resection: Prospective Multicentric Study and Extern Validation","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-27 11:02:17","doi":"10.21203/rs.3.rs-8554813/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"editorInvitedReview","content":"","date":"2026-03-13T16:21:19+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-03-07T17:11:45+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"127845331869665059192138970921449018921","date":"2026-03-04T07:57:39+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-03-02T20:33:07+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"111601280289212252936473164393569842089","date":"2026-03-02T12:59:31+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"160652044095748210001466139958084974931","date":"2026-02-27T15:12:36+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-02-24T08:42:51+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-02-05T11:17:30+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-01-13T05:25:16+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-01-12T14:22:51+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Cancer","date":"2026-01-12T14:11:34+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"bmc-cancer","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"bcan","sideBox":"Learn more about [BMC Cancer](http://bmccancer.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/bcan/default.aspx","title":"BMC Cancer","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"ee84e436-df47-4287-b10a-513aefb490b1","owner":[],"postedDate":"February 27th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-02-27T11:02:17+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-27 11:02:17","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8554813","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8554813","identity":"rs-8554813","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00