Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study

doi:10.21203/rs.3.rs-8733857/v1

Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study

2026 · doi:10.21203/rs.3.rs-8733857/v1

preprint OA: closed

Full text JSON View at publisher

Full text 109,886 characters · extracted from preprint-html · click to expand

Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study Kang Yu, Chunhui Qin, Yuxin Li, Chunxiao Wang, Yunjia Shuai, Ao Li, and 2 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8733857/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Background Patients with chronic obstructive pulmonary disease (COPD) are at increased risk of cardiovascular disease (CVD) events. We aimed to develop and internally validate a multimodal deep learning model that integrates chest CT–derived lung-component information with clinical variables to predict 1-year composite CVD events in COPD. Methods We enrolled COPD patients between January 2023 and January 2025 who underwent baseline thin-slice non-contrast chest CT and completed 12-month follow-up. The primary endpoint was the first occurrence within 12 months of a composite CVD event (acute myocardial infarction, ischaemic stroke, hospitalisation for unstable angina, or hospitalisation for heart failure). CT images were standardised and resampled to 1.0×1.0×1.0 mm voxels. Lung parenchyma was segmented using a U-Net–based model; emphysema regions were identified by thresholding (HU < − 950); pulmonary vessels were enhanced using a Frangi filter and extracted via connected component analysis. A three-branch 3D convolutional neural network (3D-CNN) was constructed to extract imaging features from normal lung tissue, emphysematous regions, and pulmonary vascular structures, which were fused with clinical features (including age, smoking pack-years, and FEV1%) using an attention mechanism. The dataset was split into training/validation/test sets (8:1:1) with stratified sampling. Discrimination and classification metrics were reported on the independent test set; calibration was assessed using a calibration curve, Brier score, and calibration intercept/slope. Results A total of 306 COPD patients were included (training n = 244, validation n = 30, test n = 32). Within 12 months, 182 composite CVD events occurred (59.5%). On the independent test set (events = 17/32), the model achieved an AUC of 0.8588 with an accuracy of 78.12%, precision of 81.25%, recall/sensitivity of 76.47%, specificity of 80.00%, and F1-score of 0.7879. The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of − 0.606. Conclusions A multimodal deep learning framework combining lung-component CT features and clinical variables demonstrated good discrimination for predicting 1-year composite CVD events in COPD. Further external validation and calibration refinement are warranted before clinical deployment. chronic obstructive pulmonary disease chest computed tomography lung segmentation deep learning multimodal fusion cardiovascular events Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Introduction Chronic obstructive pulmonary disease (COPD) is a common chronic respiratory condition characterized by persistent airflow limitation. Its core pathophysiological mechanisms involve multi-level and multi-system alterations, primarily including airway obstruction, destruction of lung parenchyma, and structural and functional remodeling of pulmonary vessels.[ 1 ] Long-term persistent airway inflammation leads to thickening of the small airway walls, luminal narrowing, and dynamic collapse, thereby exacerbating airflow limitation. Concurrently, alveolar wall destruction and decreased elastic recoil result in gas trapping and pulmonary hyperinflation, which significantly increase the work of breathing and reduce ventilation efficiency. Furthermore, chronic hypoxemia can induce pulmonary vasoconstriction, smooth muscle hyperplasia, and vascular wall remodeling. In some patients, this progresses to COPD-associated pulmonary hypertension, increasing right ventricular afterload and affecting systemic circulatory function. These pathological changes not only determine the respiratory manifestations of COPD but also lay the foundation for its multi-system complications.[ 2 ] In recent years, increasing research has focused on the close association between COPD and cardiovascular diseases. Epidemiological studies indicate that COPD patients have a significantly higher risk of cardiovascular and cerebrovascular events, such as coronary artery disease, myocardial infarction, heart failure, and stroke, compared to the general population. Cardiovascular disease has become one of the leading causes of death in COPD patients. The risk of cardiovascular events is particularly prominent during or shortly after acute exacerbations, suggesting that respiratory inflammation, hypoxemia, and hemodynamic changes may act synergistically in the process of cardiovascular injury.[ 3 ] Therefore, early identification and assessment of cardiovascular disease risk in the COPD population hold significant clinical importance for optimizing individualized treatment strategies and reducing the incidence of adverse events.[ 4 ] Currently, cardiovascular disease risk assessment primarily relies on clinical risk factors, hematological biomarkers, and cardiac imaging examinations such as cardiac CT or echocardiography. However, the routine application of these methods in COPD patients faces certain limitations, including high costs, complex procedures, and insufficient patient compliance. In contrast, chest CT has become the most commonly used imaging modality in the diagnosis, phenotyping, and treatment evaluation of COPD, offering relatively high accessibility and repeatability in clinical practice.[ 5 ] In terms of imaging quantitative analysis, previous studies have mostly used whole-lung average-based metrics to assess COPD, such as the low attenuation area percentage (LAA%). While these methods can reflect the extent of parenchymal destruction to a certain degree, they overlook the spatial heterogeneity in the distribution of different lung tissue components, making it difficult to comprehensively characterize local structural abnormalities and their potential impact on cardiopulmonary interactions. In fact, variations in the proportion and spatial distribution of emphysematous regions, relatively normal lung tissue, and pulmonary vascular structures may be closely related to cardiovascular disease risk by influencing pulmonary vascular resistance and cardiac load, among other pathways.[ 6 ] Therefore, in-depth exploration of chest CT imaging information at a finer tissue-structural level is of significant value. With the rapid development of deep learning technology, segmentation and feature extraction methods based on convolutional neural networks provide new tools for analyzing complex medical imaging information. By performing fine segmentation of lung tissue components and extracting high-dimensional representative features from images, it is possible to more comprehensively reflect the structural and functional status of the lungs in COPD patients. Moreover, integrating imaging features with clinical data to construct multi-modal fusion models may further enhance the predictive capability for cardiovascular disease risk.[ 7 ] Based on the above background, this study aims to use chest CT images from COPD patients as the research subject. It will involve automatic segmentation and quantitative analysis of normal lung tissue, emphysematous regions, and pulmonary vascular structures. Combined with deep learning methods to extract imaging features and integrated with clinical information, a multi-modal prediction model will be constructed to explore its application value in assessing cardiovascular disease risk in COPD patients. Through this research, we hope to provide new imaging-based evidence for the early risk identification of COPD combined with cardiovascular disease and offer references for formulating individualized clinical management strategies. Methods Study design and setting This was a single-centre retrospective cohort study conducted at the Fourth Affiliated Hospital of Harbin Medical University, Harbin, China. Ethics approval The study protocol was approved by the institutional ethics committee and complied with the Declaration of Helsinki .The requirement for written informed consent was waived because of the retrospective design and the use of de-identified data. Participants We enrolled COPD patients between January 2023 and January 2025 who completed baseline thin-slice non-contrast chest CT and 12-month follow-up. Inclusion criteria were: (1) spirometry-confirmed COPD (FEV1/FVC < 70%); and (2) availability of baseline thin-slice chest CT. Exclusion criteria were: (1) severe CT artefacts affecting analysis; (2) missing key clinical or follow-up information; or (3) incomplete follow-up. (Figure 1) Figure 1. Patient inclusion flow diagram. Records were identified from the hospital database (Jan 2023–Jan 2025, N = 1343 ) and screened for eligibility. A total of 1037 records were excluded due to missing key clinical data ( n = 377 ), incomplete 12-month follow-up ( n = 659 ), or severe CT artifacts affecting analysis ( n = 1 ). The final study cohort included 306 eligible COPD patients with baseline CT and was stratified into a training set ( n = 244 ), validation set ( n = 30 ), and test set ( n = 32 ) at an 8:1:1 ratio. Outcome definition Outcome definition The primary endpoint was the first occurrence within 12 months of a composite CVD event: acute myocardial infarction, ischaemic stroke, hospitalisation for unstable angina, or hospitalisation for heart failure. If multiple events occurred, only the first event was recorded.[ 8 ] Outcomes were ascertained using electronic medical records, discharge summaries and follow-up telephone calls.[ 9 ] CT acquisition and image preprocessing All patients underwent thin-slice chest CT on multi-vendor spiral CT scanners (eg, Siemens and Philips). Typical parameters included 120 kVp, automatic tube current modulation, 1.0 mm slice thickness, 512×512 matrix, pitch approximately 1.0, with iterative reconstruction. To reduce inter-scanner variability, CT intensities (Hounsfield units) were standardised and images were resampled to isotropic 1.0×1.0×1.0 mm voxels before analysis. Lung-component segmentation This study employed a deep learning network based on the U-Net architecture for the automatic segmentation of lung parenchyma. Emphysematous regions were identified using a threshold method (HU < − 950). The pulmonary vascular regions were first enhanced using a Frangi filter to highlight vascular structures, followed by extraction through connected component analysis.[ 10 ] To ensure the reliability of the segmentation results, all segmented outputs were independently visually assessed by two radiologists, with a focus on the integrity of lung parenchyma boundaries, the accuracy of emphysematous region identification, and the continuity of vascular structures. The evaluation results showed that the segmentation outputs for most cases were highly consistent with the clinical anatomical structures. Minor discrepancies were observed only in some regions with severe emphysema or complex vascular courses. As the primary objective of this study was the construction of a risk prediction model, and the segmentation results were primarily used for subsequent imaging feature extraction, a systematic quantitative evaluation of segmentation accuracy was not performed. Instead, visual consistency assessment by the imaging physicians was adopted as the quality control method. Deep Learning-Based Feature Extraction and Fusion Based on the segmentation results, this study constructed a three-branch three-dimensional convolutional neural network (3D-CNN) to extract imaging features from normal lung tissue, emphysematous regions, and pulmonary vascular structures, respectively. The design of the three-branch architecture aims to independently characterize the structural heterogeneity of different lung tissue components, preventing interference between features of distinct tissues within a single network. Each imaging branch extracts multi-scale features through multiple layers of convolution, batch normalization, nonlinear activation, and pooling operations. The process culminates in a global average pooling layer to obtain fixed-dimensional imaging feature representations. The selection of clinical variables was based on previous research and clinical availability, taking into account sample size and variable stability. Variables such as age, smoking pack-years, and FEV1% were included as clinical input features. During the feature fusion stage, imaging features and clinical features were concatenated. An attention mechanism was introduced to enhance the weighted representation of key features, followed by fully connected layers to perform the classification task. The final output is the probability of the patient developing a cardiovascular disease (CVD) event.(Figure 2) Figure 2. Overview of data processing, model training, and evaluation. Chest CT images were preprocessed (including intensity normalization and fixed-size resampling). Lung parenchyma was automatically segmented using a U-Net–based network and further decomposed into three components—relatively normal lung, emphysema (HU < − 950), and pulmonary vasculature (Frangi enhancement followed by connected component analysis). The three-component information was encoded into a single component mask and stacked with the corresponding 3D CT crop to form a two-channel input (CT + component-encoded mask) for a multimodal model. A 3D-CNN imaging encoder extracted imaging representations, which were fused with clinical features using an MLP-based clinical branch (with an attention-enhanced fusion module) to predict 1-year composite cardiovascular events. The dataset was split into training, validation, and independent test sets (8:1:1). Model performance was evaluated using ROC curves, confusion matrices, and standard classification metrics. Model training and internal validation The dataset was partitioned into training, validation, and test sets in an 8:1:1 ratio, using a stratified sampling method to ensure a relatively balanced proportion of CVD events across the different sets. Training employed the cross-entropy loss function and the Adam optimizer. During the training process, the F1-score on the validation set served as the primary monitoring metric, and a learning rate scheduler was utilized to automatically adjust the learning rate when validation performance plateaued. Hyperparameters, including the number of training epochs, batch size, initial learning rate, and dropout rate, were uniformly set in the training script. A fixed random seed was used throughout the experiments to minimize randomness associated with data splitting and the training process. Calibration assessment In addition to discrimination metrics (AUC, accuracy, sensitivity, specificity, and F1-score), calibration was assessed using predicted probabilities of the 1-year composite cardiovascular event. A calibration curve was generated on the independent test set by grouping predicted probabilities into 10 uniform bins (n_bins = 10) and comparing the mean predicted probability with the observed event rate in each bin against the ideal line (y = x). Calibration error was quantified using the Brier score. Calibration intercept and slope were estimated by fitting a logistic regression model of the observed outcome on the logit-transformed predicted probability (intercept closer to 0 and slope closer to 1 indicate better calibration).[ 11 ] Model Evaluation Methods Model performance was evaluated in the training, validation, and test sets. The primary evaluation metrics included accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC). ROC curves were plotted to assess the model's discriminative ability. A confusion matrix was also generated to display the distribution of classification error types, and from this matrix, clinically interpretable metrics such as sensitivity and specificity were further reported. All final performance results were reported on the independent test set. To provide an interpretable baseline comparison relevant to real-world clinical scenarios, this study constructed a clinical baseline model. Common cardiovascular risk factors were selected as independent variables, including age, gender, smoking history (yes/no), hypertension, and diabetes. A multivariate logistic regression model was used to predict the occurrence of a composite cardiovascular event within one year. The model was fitted on the training set, and its discriminative ability (AUC) was evaluated on the independent test set. Additionally, accuracy, F1-score, sensitivity, and specificity at a threshold of 0.5 were reported. The 95% confidence interval for the AUC was obtained using bootstrap resampling. Results Participant characteristics A total of 306 patients with COPD were included and randomly split (stratified by outcome) into a training set (n = 244), a validation set (n = 30) and an independent test set (n = 32) at an 8:1:1 ratio. The three datasets were mutually exclusive and derived from the same centre and enrolment framework. Baseline demographic and clinical characteristics are summarised in Table 1 . Table 1 Baseline characteristics of the study population (N = 306) Characteristic Value Characteristic Overall Age, years 69.9 ± 8.9 Male sex, n (%) 183 (59.8) Weight, kg 61.7 ± 14.7 (n = 263) Height, cm 164.6 ± 8.8 (n = 99) Diabetes, n (%) 52 (17.0) Renal insufficiency, n (%) 23 (7.5) Other vascular disease, n (%) 127 (41.5) Hypertension, n (%) 134 (43.8) Coronary artery disease, n (%) 84 (27.5) Smoking history, n (%) 195 (63.7) Alcohol history, n (%) 64 (20.9) Hemoglobin, g/L 138.3 ± 19.9 (n = 269) Hematocrit, % 42.3 ± 6.0 (n = 258) C-reactive protein (CRP), mg/L 15.76 (4.01–57.70) (n = 275) D-dimer, mg/L 0.51 (0.33–0.85) (n = 249) Total cholesterol, mmol/L 4.8 ± 1.1 (n = 151) LDL-C, mmol/L 2.9 ± 0.9 (n = 136) HDL-C, mmol/L 1.2 ± 0.3 (n = 136) Triglycerides, mmol/L 1.3 ± 1.0 (n = 151) B-type natriuretic peptide (BNP), pg/mL 196 (58–850) (n = 187) Interleukin-6 (IL-6), pg/mL 20.86 (8.02–36.89) (n = 30) Follow-up outcomes Within 12 months, 182 composite cardiovascular events occurred (59.5%, 182/306). Event rates were 61.5% (150/244) in the training set, 50.0% (15/30) in the validation set and 53.1% (17/32) in the test set. During follow-up, 219 patients (71.6%) experienced at least one acute exacerbation, while 87 (28.4%) had no acute exacerbations; among those with exacerbations, the mean number of exacerbations was 1.39. Detailed outcomes are provided in Table 2 . Table 2 Follow-up outcomes within 1 year (N = 306) Outcome Value Composite CVD events within 1 year, n (%) 182 (59.5%) Acute exacerbations during follow-up, n (%) 219 (71.6%) No exacerbations, n (%) 87 (28.4%) Mean exacerbations per patient (among those with any) 1.39 Performance of the multimodal deep learning model Model discrimination and classification performance are presented in Table 3 . Training set (n = 244) accuracy 85.25%; AUC 0.8794. The confusion matrix yielded sensitivity 83.33% (125/150) and specificity 88.30% (83/94) (TN = 83, FP = 11, FN = 25, TP = 125).( Fig. 3 ) Figure 3. Model discrimination and classification performance in the training set (n = 244). Receiver operating characteristic (ROC) curve and confusion matrix for the training set. The model achieved AUC = 0.8794 . Confusion matrix counts: TN = 83, FP = 11, FN = 25, TP = 125 . Validation set (n = 30) accuracy 96.67%; precision 93.75%; recall 100.00%; F1-score 0.9677; AUC 0.9822. Sensitivity was 100.00% (15/15) and specificity 93.33% (14/15) (TN = 14, FP = 1, FN = 0, TP = 15). ( Fig. 4 ) Figure 4. Model discrimination and classification performance in the validation set (n = 30). ROC curve and confusion matrix for the validation set. The model achieved AUC = 0.9822 . Confusion matrix counts: TN = 14, FP = 1, FN = 0, TP = 15 . Test set (n = 32) accuracy 78.12%; precision 81.25%; recall 76.47%; F1-score 0.7879; AUC 0.8588. Sensitivity was 76.47% (13/17) and specificity 80.00% (12/15) (TN = 12, FP = 3, FN = 4, TP = 13). ( Fig. 5 ) Figure 5. Model discrimination and classification performance in the test set (n = 32). ROC curve and confusion matrix for the independent test set. The model achieved AUC = 0.8588 . Confusion matrix counts: TN = 12, FP = 3, FN = 4, TP = 13 . On the independent test set (n = 32, events = 17), the model demonstrated acceptable calibration (Fig. 4). The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of − 0.606. The discrimination performance remained good (AUC = 0.859). Table 3 Performance of the deep learning model on the training, validation, and test sets Dataset Accuracy F1-score Loss Training 0.8525 0.8741 0.3940 Validation 0.9667 0.9677 0.2163 Test 0.7812 0.7879 0.5906 Clinical baseline model To provide an interpretable clinical benchmark, we fitted a multivariable logistic regression model including age, sex, smoking history (yes/no), hypertension and diabetes (Table 4 ). In the full cohort, hypertension showed the strongest association with 1-year composite events (OR 17.38, 95% CI 8.65 to 34.92; p < 0.001). Age (per 1-year increase) was associated with higher risk (OR 1.06, 95% CI 1.03 to 1.10; p = 0.001), and smoking history was also associated with events (OR 1.94, 95% CI 1.01 to 3.70; p = 0.045). Sex and diabetes were not significantly associated with events (both p > 0.05). Table 4 Multivariable logistic regression for 1-year composite CVD events (N = 306) Predictor OR (95% CI) P value Age (per 1-year increase) 1.06 (1.03–1.10) 0.001 Male sex 0.78 (0.42–1.47) 0.449 Smoking history (yes vs no) 1.94 (1.01–3.70) 0.045 Hypertension (yes vs no) 17.38 (8.65–34.92) < 0.001 Diabetes (yes vs no) 1.58 (0.65–3.83) 0.310 Using the same stratified split (244/30/32), the clinical baseline model achieved: training set accuracy 0.6352 and F1-score 0.6764; validation set accuracy 0.6333 and F1-score 0.6207; and test set accuracy 0.4688 and F1-score 0.3704. On the test set, the confusion matrix was TN = 10, FP = 5, FN = 12 and TP = 5, corresponding to sensitivity 29.41% (5/17) and specificity 66.67% (10/15).(Table 5 ) On the independent test set (n = 32, events = 17), the model demonstrated acceptable calibration (Fig. 6). The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of − 0.606. The discrimination performance remained good (AUC = 0.859). Figure 6. Calibration curve of the multimodal model in the independent test set. Calibration plot comparing mean predicted probability and observed event rate in the test set ( n = 32 ). The dashed line indicates perfect calibration. Model calibration statistics are reported in the panel ( Brier score = 0.187; calibration slope = 0.435; intercept = − 0.606; AUC = 0.859 ). Table 5 Performance of the clinical baseline logistic regression model Dataset Event / Non-event Accuracy F1-score Training 150 / 94 0.6352 0.6764 Validation 15 / 15 0.6333 0.6207 Test 17 / 15 0.4688 0.3704 Comparison across datasets Across the training, validation and test sets, the multimodal deep learning model showed higher discrimination and classification performance than the clinical baseline model. Discussion In this single-centre retrospective cohort study of patients with COPD, we developed a multimodal deep learning framework that integrates routine chest CT–derived lung-component representations with clinical variables to predict 1-year composite cardiovascular events. The model showed good discrimination in the training and validation sets and maintained performance in an independent held-out test set, suggesting predictive potential within the current sample size. A key contribution of this work is not simply the use of deep learning for classification but the component-level decomposition of lung structure. By partitioning the lung into three biologically interpretable regions—relatively normal lung, emphysema and pulmonary vasculature—and learning representations from each region separately before multimodal fusion, the framework aims to capture structural heterogeneity in COPD in a more organised and potentially interpretable manner for outcome-oriented risk stratification. [ 12 ] COPD-related structural injury extends beyond airway and parenchymal changes and involves multi-pathway alterations, including reduction and remodelling of the pulmonary vascular bed. Previous studies have suggested that emphysema-related alveolar destruction may reduce the alveolar–capillary bed, thereby affecting pulmonary vascular resistance and right-heart load; pulmonary vascular pruning and perfusion abnormalities may also interact with systemic inflammation, oxidative stress and endothelial dysfunction, potentially contributing to elevated cardiovascular risk in COPD.[ 13 ] The present study is designed for outcome prediction, and the observed associations between imaging phenotypes and subsequent events do not establish causality.[ 14 ] Nonetheless, a component-level representation may offer a more granular imaging perspective than conventional whole-lung or single-threshold metrics, which can be limited in capturing spatially heterogeneous structural patterns. Future work that combines interpretability approaches (eg, saliency or occlusion analyses) with comparisons against established quantitative CT measures may help examine whether the model’s informative regions align with plausible physiological mechanisms. We observed higher performance in the validation set than in the test set. This pattern is common in medical imaging deep learning studies and may be amplified under small-sample conditions. [ 15 ]The limited size of the validation and test sets means performance estimates may be sensitive to individual cases. In addition, although stratified sampling was used to balance event proportions, residual heterogeneity between subsets (eg, comorbidity profiles, CT acquisition parameters and reconstruction settings) may remain and influence generalisation. Moreover, some degree of adaptation to the training distribution is expected during optimisation, which can manifest as reduced performance on held-out data. Notably, the test-set AUC remained relatively high, indicating discrimination in the internal held-out evaluation. However, given the single split and the small test set, these estimates are uncertain and should not be overgeneralised as evidence of clinical generalisability. A more cautious interpretation is that the model demonstrated predictive potential in an internal test set, warranting further evaluation using repeated resampling/bootstrapping and, critically, external cohorts, together with calibration assessment and clinical utility analyses. A practical advantage of the proposed approach is that the imaging input is derived from routine chest CT, which is commonly used in COPD assessment and follow-up and does not require additional tests. If the model proves stable in multi-centre external validation and undergoes appropriate calibration and clinical utility assessment, it could potentially be incorporated into imaging workflows to provide short-term cardiovascular risk estimates at the time of CT interpretation or follow-up review, supporting more refined risk stratification and targeted cardiovascular evaluation. [ 16 ]Importantly, the net benefit of any downstream clinical action based on model predictions must be demonstrated before implementation in real-world settings. This study has limitations. First, it is a single-centre retrospective study with a limited sample size, particularly in the validation and test sets, which may affect the stability of performance estimates. Second, although multi-vendor CT scanners were included, differences in scanning protocols and reconstruction algorithms may introduce variability, and no domain adaptation or batch-effect correction was applied. Third, the work focused on predictive performance and did not include formal interpretability analyses, limiting transparency regarding model decision-making. Finally, follow-up duration was relatively short, and longer-term cardiovascular risk prediction was not assessed. [ 16 ]Future studies should validate the model in larger, multi-centre cohorts to better assess generalisability and clinical applicability, incorporate additional clinical indicators and longer follow-up, and apply interpretability and robustness analyses to enhance transparency and support translation toward clinical use. [ 17 ] Conclusion In a cohort of 306 patients with COPD, we developed and internally evaluated a multimodal deep learning framework that integrates component-level lung structural representations from routine chest CT (relatively normal lung, emphysema and pulmonary vasculature) with clinical variables to predict 1-year composite cardiovascular events. The model achieved AUCs of 0.8794, 0.9822 and 0.8588 in the training, validation and test sets, respectively, and outperformed an interpretable clinical baseline model based on common cardiovascular risk factors, suggesting that CT-derived imaging representations may provide incremental information for short-term outcome prediction in this setting. Nevertheless, this was a single-centre retrospective study with limited validation/test sample sizes and internal testing only; external validation, calibration assessment and evaluation of clinical net benefit are required to determine generalisability and real-world utility. Declarations Ethics approval and consent to participate The study protocol was approved by the Ethics Committee of the Fourth Affiliated Hospital of Harbin Medical University (approval No. 2025-115, 15 August 2025). The requirement for written informed consent was waived due to the retrospective design and the use of de-identified data. Consent for publication Not applicable. Availability of data and materials The data that support the findings of this study are not publicly available due to patient privacy and ethical restrictions. De-identified data may be made available from the corresponding author upon reasonable request and with approval from the Ethics Committee of the Fourth Affiliated Hospital of Harbin Medical University (or the institutional data access committee, as applicable). Competing interests The authors declare that they have no competing interests. Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors. Authors’ contributions Yu Kang: Data curation; Methodology; Writing—original draft. Chunhui Qin: Data acquisition; Validation. Yuxin Li: Image processing; Software. Chunxiao Wang: Formal analysis (statistical analysis). Shuai Yunjia: Data curation. Li Ao: Data curation. Chen Jiahui: Data curation. Tong Zhang: Conceptualization; Supervision; Project administration; Writing—review & editing; Guarantor. All authors read and approved the final manuscript. Patient and public involvement Patients and/or the public were not involved in the design, conduct, reporting or dissemination plans of this research. Acknowledgements Not applicable. Authors’ information Tong Zhang, MD Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University No. 37, YiYuan Street, NanGang District, Harbin 150001, Heilongjiang, China Tel: +86-451-82576831 E-mail: [email protected] References Cornelius T. Clinical guideline highlights for the hospitalist: GOLD COPD update 2024. J Hosp Med. 2024;19(9):818–20. Agustí AG, Noguera A, Sauleda J, Sala E, Pons J, Busquets X. Systemic effects of chronic obstructive pulmonary disease. Eur Respir J. 2003;21(2):347–60. Chen W, Thomas J, Sadatsafavi M, FitzGerald JM. Risk of cardiovascular comorbidity in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Respir Med. 2015;3(8):631–9. Donaldson GC, Hurst JR, Smith CJ, Hubbard RB, Wedzicha JA. Increased risk of myocardial infarction and stroke following exacerbation of COPD. Chest. 2010;137(5):1091–7. Lynch DA, Al-Qaisi MA. Quantitative computed tomography in chronic obstructive pulmonary disease. J Thorac Imaging. 2013;28(5):284–90. Washko GR, Nardelli P, Ash SY, Vegas Sanchez-Ferrero G, Rahaghi FN, Come CE, Dransfield MT, Kalhan R, Han MK, Bhatt SP, et al. Arterial Vascular Pruning, Right Ventricular Size, and Clinical Outcomes in Chronic Obstructive Pulmonary Disease. A Longitudinal Observational Study. Am J Respir Crit Care Med. 2019;200(4):454–61. Falk T, Mai D, Bensch R, Çiçek Ö, Abdulkadir A, Marrakchi Y, Böhm A, Deubner J, Jäckel Z, Seiwald K, et al. U-Net: deep learning for cell counting, detection, and morphometry. Nat Methods. 2019;16(1):67–70. Weintraub WS. Statistical Approaches to Composite Endpoints. JACC Cardiovasc Interv. 2016;9(22):2289–91. Sinaiko AR, Jacobs DR Jr., Woo JG, Bazzano L, Burns T, Hu T, Juonala M, Prineas R, Raitakari O, Steinberger J, et al. The International Childhood Cardiovascular Cohort (i3C) consortium outcomes study of childhood cardiovascular risk factors and adult cardiovascular morbidity and mortality: Design and recruitment. Contemp Clin Trials. 2018;69:55–64. Hasenstab KA, Tabalon J, Yuan N, Retson T, Hsiao A. CNN-based Deformable Registration Facilitates Fast and Accurate Air Trapping Measurements at Inspiratory and Expiratory CT. Radiol Artif Intell. 2022;4(1):e210211. Van Calster B, McLernon DJ, van Smeden M, Wynants L, Steyerberg EW. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019;17(1):230. Pistenmaa CL, Nardelli P, Ash SY, Come CE, Diaz AA, Rahaghi FN, Barr RG, Young KA, Kinney GL, Simmons JP, et al. Pulmonary Arterial Pruning and Longitudinal Change in Percent Emphysema and Lung Function: The Genetic Epidemiology of COPD Study. Chest. 2021;160(2):470–80. Barr RG, Bluemke DA, Ahmed FS, Carr JJ, Enright PL, Hoffman EA, Jiang R, Kawut SM, Kronmal RA, Lima JA, et al. Percent emphysema, airflow obstruction, and impaired left ventricular filling. N Engl J Med. 2010;362(3):217–27. Polverino F, Celli BR, Owen CA. COPD as an endothelial disorder: endothelial injury linking lesions in the lungs and other organs? (2017 Grover Conference Series). Pulm Circ 2018, 8(1):2045894018758528. Zantvoort K, Nacke B, Görlich D, Hornstein S, Jacobi C, Funk B. Estimation of minimal data sets sizes for machine learning predictions in digital mental health interventions. NPJ Digit Med. 2024;7(1):361. Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Mak. 2006;26(6):565–74. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162(1):55–63. Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8733857","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":597029130,"identity":"5a252e00-a4cd-47e8-bd53-b9906628b185","order_by":0,"name":"Kang Yu","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Kang","middleName":"","lastName":"Yu","suffix":""},{"id":597029131,"identity":"90ff7db5-fa3f-4627-92d5-b7e31e31b55b","order_by":1,"name":"Chunhui Qin","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Chunhui","middleName":"","lastName":"Qin","suffix":""},{"id":597029132,"identity":"4d392334-4ac9-4d85-8572-947ac5275d2f","order_by":2,"name":"Yuxin Li","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Yuxin","middleName":"","lastName":"Li","suffix":""},{"id":597029133,"identity":"4bba9b39-824a-4112-bc40-914ab3c838bf","order_by":3,"name":"Chunxiao Wang","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Chunxiao","middleName":"","lastName":"Wang","suffix":""},{"id":597029134,"identity":"f761eaec-9b75-4841-88dc-18da75235316","order_by":4,"name":"Yunjia Shuai","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Yunjia","middleName":"","lastName":"Shuai","suffix":""},{"id":597029135,"identity":"ab0695d1-7a6c-4b3d-ab80-1687521eea67","order_by":5,"name":"Ao Li","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Ao","middleName":"","lastName":"Li","suffix":""},{"id":597029136,"identity":"be5824f6-c9bf-4b14-893d-306eccca201e","order_by":6,"name":"Jiahui Chen","email":"","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":false,"prefix":"","firstName":"Jiahui","middleName":"","lastName":"Chen","suffix":""},{"id":597029137,"identity":"75bec6ed-c979-430d-a402-db69be8d8b90","order_by":7,"name":"tong zhang","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA6klEQVRIiWNgGAWjYDACZhBhAMTs7QcffDCwsSNBC8+ZZMMZBWnJJFgnkWAmzfPhEGMDIYUGx3kPv+YpsMuTj0hIk7YxOMDMwH746Aa8Wg7zpVnOMEguNjzz8LB1jsEdPgaetLQb+LXwmBl8MGBO3NiekHg7x+AZM4MEjxlhLQkG9YkbGxIMpC0MDjM2EKHFGBi2hxPncyQYSTMQo0USaAvjDIPjiRtAgdxjkJbMRsgvfOfPGH/m+VOdOL8dGJU//tjY8bMfPoZXi8IBBjYJsAsPQEXY8CkHAfkGBuYPUMYoGAWjYBSMAuwAAI6DTimOLVklAAAAAElFTkSuQmCC","orcid":"","institution":"Department of Radiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin,Heilongjiang, China.","correspondingAuthor":true,"prefix":"","firstName":"tong","middleName":"","lastName":"zhang","suffix":""}],"badges":[],"createdAt":"2026-01-29 16:53:09","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8733857/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8733857/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":103587431,"identity":"65e34602-0f14-4b17-b897-637e9d25be52","added_by":"auto","created_at":"2026-02-27 11:27:59","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":126573,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003ePatient inclusion flow diagram.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eRecords were identified from the hospital database (Jan 2023–Jan 2025, \u003cstrong\u003eN = 1343\u003c/strong\u003e) and screened for eligibility. A total of \u003cstrong\u003e1037\u003c/strong\u003erecords were excluded due to missing key clinical data (\u003cstrong\u003en = 377\u003c/strong\u003e), incomplete 12-month follow-up (\u003cstrong\u003en = 659\u003c/strong\u003e), or severe CT artifacts affecting analysis (\u003cstrong\u003en = 1\u003c/strong\u003e). The final study cohort included \u003cstrong\u003e306\u003c/strong\u003eeligible COPD patients with baseline CT and was stratified into a training set (\u003cstrong\u003en = 244\u003c/strong\u003e), validation set (\u003cstrong\u003en = 30\u003c/strong\u003e), and test set (\u003cstrong\u003en = 32\u003c/strong\u003e) at an 8:1:1 ratio.\u003c/p\u003e","description":"","filename":"Figure1.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/278621225dfa8735001321a7.png"},{"id":103587425,"identity":"619caf98-91e6-45b6-8b07-6ce444fe59c2","added_by":"auto","created_at":"2026-02-27 11:27:58","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":1075833,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eOverview of data processing, model training, and evaluation.\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"Figure2.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/40547df712ddef68d665044e.png"},{"id":103587427,"identity":"6afe764b-f9aa-406a-9f41-5486ce298ade","added_by":"auto","created_at":"2026-02-27 11:27:58","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":50066,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eModel discrimination and classification performance in the training set (n = 244).\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eReceiver operating characteristic (ROC) curve and confusion matrix for the training set. The model achieved \u003cstrong\u003eAUC = 0.8794\u003c/strong\u003e. Confusion matrix counts: \u003cstrong\u003eTN = 83, FP = 11, FN = 25, TP = 125\u003c/strong\u003e.\u003c/p\u003e","description":"","filename":"Figure3.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/daebe5e73962fae2b315d0b6.png"},{"id":103587432,"identity":"1755025a-f243-4236-a5e5-263c64d11747","added_by":"auto","created_at":"2026-02-27 11:27:59","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":45876,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eModel discrimination and classification performance in the validation set (n = 30).\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eROC curve and confusion matrix for the validation set. The model achieved \u003cstrong\u003eAUC = 0.9822\u003c/strong\u003e. Confusion matrix counts: \u003cstrong\u003eTN = 14, FP = 1, FN = 0, TP = 15\u003c/strong\u003e.\u003c/p\u003e","description":"","filename":"Figure4.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/0e6294dff468218149570a41.png"},{"id":103587430,"identity":"3a76b307-6cf4-42b4-a6c8-fb425610bd7e","added_by":"auto","created_at":"2026-02-27 11:27:59","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":43817,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eModel discrimination and classification performance in the test set (n = 32).\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eROC curve and confusion matrix for the independent test set. The model achieved \u003cstrong\u003eAUC = 0.8588\u003c/strong\u003e. Confusion matrix counts: \u003cstrong\u003eTN = 12, FP = 3, FN = 4, TP = 13\u003c/strong\u003e.\u003c/p\u003e","description":"","filename":"Figure5.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/e7d380e31c655e0412994889.png"},{"id":103587433,"identity":"3ad5703a-e32a-45bc-bc7a-a1759171dcf2","added_by":"auto","created_at":"2026-02-27 11:27:59","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":156801,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCalibration curve of the multimodal model in the independent test set.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eCalibration plot comparing mean predicted probability and observed event rate in the test set (\u003cstrong\u003en = 32\u003c/strong\u003e). The dashed line indicates perfect calibration. Model calibration statistics are reported in the panel (\u003cstrong\u003eBrier score = 0.187; calibration slope = 0.435; intercept = −0.606; AUC = 0.859\u003c/strong\u003e).\u003c/p\u003e","description":"","filename":"Figure6.png","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/21a75840b7c9292a32cd9e4e.png"},{"id":107657318,"identity":"d1e63409-0290-4386-a08f-e342f7beb61c","added_by":"auto","created_at":"2026-04-23 16:10:35","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1762324,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8733857/v1/d0970de4-dbc3-4afc-95d5-89646f2daf98.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study","fulltext":[{"header":"Introduction","content":"\u003cp\u003eChronic obstructive pulmonary disease (COPD) is a common chronic respiratory condition characterized by persistent airflow limitation. Its core pathophysiological mechanisms involve multi-level and multi-system alterations, primarily including airway obstruction, destruction of lung parenchyma, and structural and functional remodeling of pulmonary vessels.[\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e] Long-term persistent airway inflammation leads to thickening of the small airway walls, luminal narrowing, and dynamic collapse, thereby exacerbating airflow limitation. Concurrently, alveolar wall destruction and decreased elastic recoil result in gas trapping and pulmonary hyperinflation, which significantly increase the work of breathing and reduce ventilation efficiency. Furthermore, chronic hypoxemia can induce pulmonary vasoconstriction, smooth muscle hyperplasia, and vascular wall remodeling. In some patients, this progresses to COPD-associated pulmonary hypertension, increasing right ventricular afterload and affecting systemic circulatory function. These pathological changes not only determine the respiratory manifestations of COPD but also lay the foundation for its multi-system complications.[\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eIn recent years, increasing research has focused on the close association between COPD and cardiovascular diseases. Epidemiological studies indicate that COPD patients have a significantly higher risk of cardiovascular and cerebrovascular events, such as coronary artery disease, myocardial infarction, heart failure, and stroke, compared to the general population. Cardiovascular disease has become one of the leading causes of death in COPD patients. The risk of cardiovascular events is particularly prominent during or shortly after acute exacerbations, suggesting that respiratory inflammation, hypoxemia, and hemodynamic changes may act synergistically in the process of cardiovascular injury.[\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e] Therefore, early identification and assessment of cardiovascular disease risk in the COPD population hold significant clinical importance for optimizing individualized treatment strategies and reducing the incidence of adverse events.[\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eCurrently, cardiovascular disease risk assessment primarily relies on clinical risk factors, hematological biomarkers, and cardiac imaging examinations such as cardiac CT or echocardiography. However, the routine application of these methods in COPD patients faces certain limitations, including high costs, complex procedures, and insufficient patient compliance. In contrast, chest CT has become the most commonly used imaging modality in the diagnosis, phenotyping, and treatment evaluation of COPD, offering relatively high accessibility and repeatability in clinical practice.[\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eIn terms of imaging quantitative analysis, previous studies have mostly used whole-lung average-based metrics to assess COPD, such as the low attenuation area percentage (LAA%). While these methods can reflect the extent of parenchymal destruction to a certain degree, they overlook the spatial heterogeneity in the distribution of different lung tissue components, making it difficult to comprehensively characterize local structural abnormalities and their potential impact on cardiopulmonary interactions. In fact, variations in the proportion and spatial distribution of emphysematous regions, relatively normal lung tissue, and pulmonary vascular structures may be closely related to cardiovascular disease risk by influencing pulmonary vascular resistance and cardiac load, among other pathways.[\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e] Therefore, in-depth exploration of chest CT imaging information at a finer tissue-structural level is of significant value.\u003c/p\u003e \u003cp\u003eWith the rapid development of deep learning technology, segmentation and feature extraction methods based on convolutional neural networks provide new tools for analyzing complex medical imaging information. By performing fine segmentation of lung tissue components and extracting high-dimensional representative features from images, it is possible to more comprehensively reflect the structural and functional status of the lungs in COPD patients. Moreover, integrating imaging features with clinical data to construct multi-modal fusion models may further enhance the predictive capability for cardiovascular disease risk.[\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eBased on the above background, this study aims to use chest CT images from COPD patients as the research subject. It will involve automatic segmentation and quantitative analysis of normal lung tissue, emphysematous regions, and pulmonary vascular structures. Combined with deep learning methods to extract imaging features and integrated with clinical information, a multi-modal prediction model will be constructed to explore its application value in assessing cardiovascular disease risk in COPD patients. Through this research, we hope to provide new imaging-based evidence for the early risk identification of COPD combined with cardiovascular disease and offer references for formulating individualized clinical management strategies.\u003c/p\u003e"},{"header":"Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eStudy design and setting\u003c/h2\u003e \u003cp\u003eThis was a single-centre retrospective cohort study conducted at the Fourth Affiliated Hospital of Harbin Medical University, Harbin, China.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eEthics approval\u003c/h3\u003e\n\u003cp\u003e The study protocol was approved by the institutional ethics committee and complied with the Declaration of Helsinki .The requirement for written informed consent was waived because of the retrospective design and the use of de-identified data.\u003c/p\u003e\n\u003ch3\u003eParticipants\u003c/h3\u003e\n\u003cp\u003eWe enrolled COPD patients between January 2023 and January 2025 who completed baseline thin-slice non-contrast chest CT and 12-month follow-up.\u003c/p\u003e \u003cp\u003eInclusion criteria were: (1) spirometry-confirmed COPD (FEV1/FVC\u0026thinsp;\u0026lt;\u0026thinsp;70%); and (2) availability of baseline thin-slice chest CT.\u003c/p\u003e \u003cp\u003eExclusion criteria were: (1) severe CT artefacts affecting analysis; (2) missing key clinical or follow-up information; or (3) incomplete follow-up. (Figure 1)\u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 1. Patient inclusion flow diagram.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eRecords were identified from the hospital database (Jan 2023\u0026ndash;Jan 2025, \u003cb\u003eN\u0026thinsp;=\u0026thinsp;1343\u003c/b\u003e) and screened for eligibility. A total of \u003cb\u003e1037\u003c/b\u003e records were excluded due to missing key clinical data (\u003cb\u003en\u0026thinsp;=\u0026thinsp;377\u003c/b\u003e), incomplete 12-month follow-up (\u003cb\u003en\u0026thinsp;=\u0026thinsp;659\u003c/b\u003e), or severe CT artifacts affecting analysis (\u003cb\u003en\u0026thinsp;=\u0026thinsp;1\u003c/b\u003e). The final study cohort included \u003cb\u003e306\u003c/b\u003e eligible COPD patients with baseline CT and was stratified into a training set (\u003cb\u003en\u0026thinsp;=\u0026thinsp;244\u003c/b\u003e), validation set (\u003cb\u003en\u0026thinsp;=\u0026thinsp;30\u003c/b\u003e), and test set (\u003cb\u003en\u0026thinsp;=\u0026thinsp;32\u003c/b\u003e) at an 8:1:1 ratio.\u003c/p\u003e\n\u003ch3\u003eOutcome definition\u003c/h3\u003e\n\u003cdiv class=\"Heading\"\u003eOutcome definition\u003c/div\u003e \u003cp\u003eThe primary endpoint was the first occurrence within 12 months of a composite CVD event: acute myocardial infarction, ischaemic stroke, hospitalisation for unstable angina, or hospitalisation for heart failure. If multiple events occurred, only the first event was recorded.[\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eOutcomes were ascertained using electronic medical records, discharge summaries and follow-up telephone calls.[\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]\u003c/p\u003e\n\u003ch3\u003eCT acquisition and image preprocessing\u003c/h3\u003e\n\u003cp\u003eAll patients underwent thin-slice chest CT on multi-vendor spiral CT scanners (eg, Siemens and Philips). Typical parameters included 120 kVp, automatic tube current modulation, 1.0 mm slice thickness, 512\u0026times;512 matrix, pitch approximately 1.0, with iterative reconstruction.\u003c/p\u003e \u003cp\u003eTo reduce inter-scanner variability, CT intensities (Hounsfield units) were standardised and images were resampled to isotropic 1.0\u0026times;1.0\u0026times;1.0 mm voxels before analysis.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eLung-component segmentation\u003c/h2\u003e \u003cp\u003eThis study employed a deep learning network based on the U-Net architecture for the automatic segmentation of lung parenchyma. Emphysematous regions were identified using a threshold method (HU\u0026thinsp;\u0026lt;\u0026thinsp;\u0026minus;\u0026thinsp;950). The pulmonary vascular regions were first enhanced using a Frangi filter to highlight vascular structures, followed by extraction through connected component analysis.[\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eTo ensure the reliability of the segmentation results, all segmented outputs were independently visually assessed by two radiologists, with a focus on the integrity of lung parenchyma boundaries, the accuracy of emphysematous region identification, and the continuity of vascular structures. The evaluation results showed that the segmentation outputs for most cases were highly consistent with the clinical anatomical structures. Minor discrepancies were observed only in some regions with severe emphysema or complex vascular courses. As the primary objective of this study was the construction of a risk prediction model, and the segmentation results were primarily used for subsequent imaging feature extraction, a systematic quantitative evaluation of segmentation accuracy was not performed. Instead, visual consistency assessment by the imaging physicians was adopted as the quality control method.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eDeep Learning-Based Feature Extraction and Fusion\u003c/h3\u003e\n\u003cp\u003eBased on the segmentation results, this study constructed a three-branch three-dimensional convolutional neural network (3D-CNN) to extract imaging features from normal lung tissue, emphysematous regions, and pulmonary vascular structures, respectively. The design of the three-branch architecture aims to independently characterize the structural heterogeneity of different lung tissue components, preventing interference between features of distinct tissues within a single network.\u003c/p\u003e \u003cp\u003eEach imaging branch extracts multi-scale features through multiple layers of convolution, batch normalization, nonlinear activation, and pooling operations. The process culminates in a global average pooling layer to obtain fixed-dimensional imaging feature representations. The selection of clinical variables was based on previous research and clinical availability, taking into account sample size and variable stability. Variables such as age, smoking pack-years, and FEV1% were included as clinical input features.\u003c/p\u003e \u003cp\u003eDuring the feature fusion stage, imaging features and clinical features were concatenated. An attention mechanism was introduced to enhance the weighted representation of key features, followed by fully connected layers to perform the classification task. The final output is the probability of the patient developing a cardiovascular disease (CVD) event.(Figure 2)\u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 2. Overview of data processing, model training, and evaluation.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eChest CT images were preprocessed (including intensity normalization and fixed-size resampling). Lung parenchyma was automatically segmented using a U-Net\u0026ndash;based network and further decomposed into three components\u0026mdash;relatively normal lung, emphysema (HU\u0026thinsp;\u0026lt;\u0026thinsp;\u0026minus;\u0026thinsp;950), and pulmonary vasculature (Frangi enhancement followed by connected component analysis). The three-component information was encoded into a single component mask and stacked with the corresponding 3D CT crop to form a two-channel input (CT\u0026thinsp;+\u0026thinsp;component-encoded mask) for a multimodal model. A 3D-CNN imaging encoder extracted imaging representations, which were fused with clinical features using an MLP-based clinical branch (with an attention-enhanced fusion module) to predict 1-year composite cardiovascular events. The dataset was split into training, validation, and independent test sets (8:1:1). Model performance was evaluated using ROC curves, confusion matrices, and standard classification metrics.\u003c/p\u003e\n\u003ch3\u003eModel training and internal validation\u003c/h3\u003e\n\u003cp\u003e The dataset was partitioned into training, validation, and test sets in an 8:1:1 ratio, using a stratified sampling method to ensure a relatively balanced proportion of CVD events across the different sets.\u003c/p\u003e \u003cp\u003eTraining employed the cross-entropy loss function and the Adam optimizer. During the training process, the F1-score on the validation set served as the primary monitoring metric, and a learning rate scheduler was utilized to automatically adjust the learning rate when validation performance plateaued. Hyperparameters, including the number of training epochs, batch size, initial learning rate, and dropout rate, were uniformly set in the training script. A fixed random seed was used throughout the experiments to minimize randomness associated with data splitting and the training process.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eCalibration assessment\u003c/h2\u003e \u003cp\u003eIn addition to discrimination metrics (AUC, accuracy, sensitivity, specificity, and F1-score), calibration was assessed using predicted probabilities of the 1-year composite cardiovascular event. A calibration curve was generated on the independent test set by grouping predicted probabilities into 10 uniform bins (n_bins\u0026thinsp;=\u0026thinsp;10) and comparing the mean predicted probability with the observed event rate in each bin against the ideal line (y\u0026thinsp;=\u0026thinsp;x). Calibration error was quantified using the Brier score. Calibration intercept and slope were estimated by fitting a logistic regression model of the observed outcome on the logit-transformed predicted probability (intercept closer to 0 and slope closer to 1 indicate better calibration).[\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e]\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eModel Evaluation Methods\u003c/h2\u003e \u003cp\u003eModel performance was evaluated in the training, validation, and test sets. The primary evaluation metrics included accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC). ROC curves were plotted to assess the model's discriminative ability. A confusion matrix was also generated to display the distribution of classification error types, and from this matrix, clinically interpretable metrics such as sensitivity and specificity were further reported. All final performance results were reported on the independent test set.\u003c/p\u003e \u003cp\u003eTo provide an interpretable baseline comparison relevant to real-world clinical scenarios, this study constructed a clinical baseline model. Common cardiovascular risk factors were selected as independent variables, including age, gender, smoking history (yes/no), hypertension, and diabetes. A multivariate logistic regression model was used to predict the occurrence of a composite cardiovascular event within one year. The model was fitted on the training set, and its discriminative ability (AUC) was evaluated on the independent test set. Additionally, accuracy, F1-score, sensitivity, and specificity at a threshold of 0.5 were reported. The 95% confidence interval for the AUC was obtained using bootstrap resampling.\u003c/p\u003e \u003c/div\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eParticipant characteristics\u003c/h2\u003e \u003cp\u003eA total of 306 patients with COPD were included and randomly split (stratified by outcome) into a training set (n\u0026thinsp;=\u0026thinsp;244), a validation set (n\u0026thinsp;=\u0026thinsp;30) and an independent test set (n\u0026thinsp;=\u0026thinsp;32) at an 8:1:1 ratio. The three datasets were mutually exclusive and derived from the same centre and enrolment framework. Baseline demographic and clinical characteristics are summarised in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eBaseline characteristics of the study population (N\u0026thinsp;=\u0026thinsp;306)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eCharacteristic\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eValue\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eCharacteristic\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eOverall\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAge, years\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e69.9\u0026thinsp;\u0026plusmn;\u0026thinsp;8.9\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMale sex, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e183 (59.8)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eWeight, kg\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e61.7\u0026thinsp;\u0026plusmn;\u0026thinsp;14.7 (n\u0026thinsp;=\u0026thinsp;263)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHeight, cm\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e164.6\u0026thinsp;\u0026plusmn;\u0026thinsp;8.8 (n\u0026thinsp;=\u0026thinsp;99)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDiabetes, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e52 (17.0)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eRenal insufficiency, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e23 (7.5)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eOther vascular disease, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e127 (41.5)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHypertension, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e134 (43.8)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eCoronary artery disease, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e84 (27.5)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSmoking history, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e195 (63.7)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAlcohol history, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e64 (20.9)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHemoglobin, g/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e138.3\u0026thinsp;\u0026plusmn;\u0026thinsp;19.9 (n\u0026thinsp;=\u0026thinsp;269)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHematocrit, %\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e42.3\u0026thinsp;\u0026plusmn;\u0026thinsp;6.0 (n\u0026thinsp;=\u0026thinsp;258)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eC-reactive protein (CRP), mg/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e15.76 (4.01\u0026ndash;57.70) (n\u0026thinsp;=\u0026thinsp;275)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eD-dimer, mg/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e0.51 (0.33\u0026ndash;0.85) (n\u0026thinsp;=\u0026thinsp;249)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTotal cholesterol, mmol/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e4.8\u0026thinsp;\u0026plusmn;\u0026thinsp;1.1 (n\u0026thinsp;=\u0026thinsp;151)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLDL-C, mmol/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e2.9\u0026thinsp;\u0026plusmn;\u0026thinsp;0.9 (n\u0026thinsp;=\u0026thinsp;136)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHDL-C, mmol/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e1.2\u0026thinsp;\u0026plusmn;\u0026thinsp;0.3 (n\u0026thinsp;=\u0026thinsp;136)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTriglycerides, mmol/L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e1.3\u0026thinsp;\u0026plusmn;\u0026thinsp;1.0 (n\u0026thinsp;=\u0026thinsp;151)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eB-type natriuretic peptide (BNP), pg/mL\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e196 (58\u0026ndash;850) (n\u0026thinsp;=\u0026thinsp;187)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eInterleukin-6 (IL-6), pg/mL\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e20.86 (8.02\u0026ndash;36.89) (n\u0026thinsp;=\u0026thinsp;30)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section2\"\u003e \u003ch2\u003eFollow-up outcomes\u003c/h2\u003e \u003cp\u003eWithin 12 months, 182 composite cardiovascular events occurred (59.5%, 182/306). Event rates were 61.5% (150/244) in the training set, 50.0% (15/30) in the validation set and 53.1% (17/32) in the test set. During follow-up, 219 patients (71.6%) experienced at least one acute exacerbation, while 87 (28.4%) had no acute exacerbations; among those with exacerbations, the mean number of exacerbations was 1.39. Detailed outcomes are provided in Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eFollow-up outcomes within 1 year (N\u0026thinsp;=\u0026thinsp;306)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eOutcome\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eValue\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eComposite CVD events within 1 year, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e182 (59.5%)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAcute exacerbations during follow-up, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e219 (71.6%)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eNo exacerbations, n (%)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e87 (28.4%)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMean exacerbations per patient (among those with any)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e1.39\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003ePerformance of the multimodal deep learning model\u003c/h2\u003e \u003cp\u003eModel discrimination and classification performance are presented in Table\u0026nbsp;\u003cspan refid=\"Tab3\" class=\"InternalRef\"\u003e3\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eTraining set (n\u0026thinsp;=\u0026thinsp;244)\u003c/strong\u003e \u003cp\u003eaccuracy 85.25%; AUC 0.8794. The confusion matrix yielded sensitivity 83.33% (125/150) and specificity 88.30% (83/94) (TN\u0026thinsp;=\u0026thinsp;83, FP\u0026thinsp;=\u0026thinsp;11, FN\u0026thinsp;=\u0026thinsp;25, TP\u0026thinsp;=\u0026thinsp;125).( \u003cb\u003eFig.\u0026nbsp;3\u003c/b\u003e)\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 3. Model discrimination and classification performance in the training set (n\u0026thinsp;=\u0026thinsp;244).\u003c/b\u003e \u003c/p\u003e \u003cp\u003eReceiver operating characteristic (ROC) curve and confusion matrix for the training set. The model achieved \u003cb\u003eAUC\u0026thinsp;=\u0026thinsp;0.8794\u003c/b\u003e. Confusion matrix counts: \u003cb\u003eTN\u0026thinsp;=\u0026thinsp;83, FP\u0026thinsp;=\u0026thinsp;11, FN\u0026thinsp;=\u0026thinsp;25, TP\u0026thinsp;=\u0026thinsp;125\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eValidation set (n\u0026thinsp;=\u0026thinsp;30)\u003c/strong\u003e \u003cp\u003eaccuracy 96.67%; precision 93.75%; recall 100.00%; F1-score 0.9677; AUC 0.9822. Sensitivity was 100.00% (15/15) and specificity 93.33% (14/15) (TN\u0026thinsp;=\u0026thinsp;14, FP\u0026thinsp;=\u0026thinsp;1, FN\u0026thinsp;=\u0026thinsp;0, TP\u0026thinsp;=\u0026thinsp;15). ( \u003cb\u003eFig.\u0026nbsp;4\u003c/b\u003e)\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 4. Model discrimination and classification performance in the validation set (n\u0026thinsp;=\u0026thinsp;30).\u003c/b\u003e \u003c/p\u003e \u003cp\u003eROC curve and confusion matrix for the validation set. The model achieved \u003cb\u003eAUC\u0026thinsp;=\u0026thinsp;0.9822\u003c/b\u003e. Confusion matrix counts: \u003cb\u003eTN\u0026thinsp;=\u0026thinsp;14, FP\u0026thinsp;=\u0026thinsp;1, FN\u0026thinsp;=\u0026thinsp;0, TP\u0026thinsp;=\u0026thinsp;15\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eTest set (n\u0026thinsp;=\u0026thinsp;32)\u003c/strong\u003e \u003cp\u003eaccuracy 78.12%; precision 81.25%; recall 76.47%; F1-score 0.7879; AUC 0.8588. Sensitivity was 76.47% (13/17) and specificity 80.00% (12/15) (TN\u0026thinsp;=\u0026thinsp;12, FP\u0026thinsp;=\u0026thinsp;3, FN\u0026thinsp;=\u0026thinsp;4, TP\u0026thinsp;=\u0026thinsp;13). ( \u003cb\u003eFig.\u0026nbsp;5\u003c/b\u003e)\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 5. Model discrimination and classification performance in the test set (n\u0026thinsp;=\u0026thinsp;32).\u003c/b\u003e \u003c/p\u003e \u003cp\u003eROC curve and confusion matrix for the independent test set. The model achieved \u003cb\u003eAUC\u0026thinsp;=\u0026thinsp;0.8588\u003c/b\u003e. Confusion matrix counts: \u003cb\u003eTN\u0026thinsp;=\u0026thinsp;12, FP\u0026thinsp;=\u0026thinsp;3, FN\u0026thinsp;=\u0026thinsp;4, TP\u0026thinsp;=\u0026thinsp;13\u003c/b\u003e.\u003c/p\u003e \u003cp\u003eOn the independent test set (n\u0026thinsp;=\u0026thinsp;32, events\u0026thinsp;=\u0026thinsp;17), the model demonstrated acceptable calibration (Fig.\u0026nbsp;4). The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of \u0026minus;\u0026thinsp;0.606. The discrimination performance remained good (AUC\u0026thinsp;=\u0026thinsp;0.859).\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003ePerformance of the deep learning model on the training, validation, and test sets\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDataset\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eAccuracy\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eF1-score\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eLoss\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTraining\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.8525\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.8741\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.3940\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eValidation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.9667\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.9677\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.2163\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTest\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7812\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.7879\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.5906\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eClinical baseline model\u003c/h2\u003e \u003cp\u003eTo provide an interpretable clinical benchmark, we fitted a multivariable logistic regression model including age, sex, smoking history (yes/no), hypertension and diabetes (Table\u0026nbsp;\u003cspan refid=\"Tab4\" class=\"InternalRef\"\u003e4\u003c/span\u003e). In the full cohort, hypertension showed the strongest association with 1-year composite events (OR 17.38, 95% CI 8.65 to 34.92; p\u0026thinsp;\u0026lt;\u0026thinsp;0.001). Age (per 1-year increase) was associated with higher risk (OR 1.06, 95% CI 1.03 to 1.10; p\u0026thinsp;=\u0026thinsp;0.001), and smoking history was also associated with events (OR 1.94, 95% CI 1.01 to 3.70; p\u0026thinsp;=\u0026thinsp;0.045). Sex and diabetes were not significantly associated with events (both p\u0026thinsp;\u0026gt;\u0026thinsp;0.05).\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab4\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 4\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eMultivariable logistic regression for 1-year composite CVD events (N\u0026thinsp;=\u0026thinsp;306)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"3\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003ePredictor\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eOR (95% CI)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eP value\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAge (per 1-year increase)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e1.06 (1.03\u0026ndash;1.10)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMale sex\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.78 (0.42\u0026ndash;1.47)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.449\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSmoking history (yes vs no)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e1.94 (1.01\u0026ndash;3.70)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.045\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHypertension (yes vs no)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e17.38 (8.65\u0026ndash;34.92)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e\u0026lt;\u0026thinsp;0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDiabetes (yes vs no)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e1.58 (0.65\u0026ndash;3.83)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.310\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eUsing the same stratified split (244/30/32), the clinical baseline model achieved: training set accuracy 0.6352 and F1-score 0.6764; validation set accuracy 0.6333 and F1-score 0.6207; and test set accuracy 0.4688 and F1-score 0.3704. On the test set, the confusion matrix was TN\u0026thinsp;=\u0026thinsp;10, FP\u0026thinsp;=\u0026thinsp;5, FN\u0026thinsp;=\u0026thinsp;12 and TP\u0026thinsp;=\u0026thinsp;5, corresponding to sensitivity 29.41% (5/17) and specificity 66.67% (10/15).(Table\u0026nbsp;\u003cspan refid=\"Tab5\" class=\"InternalRef\"\u003e5\u003c/span\u003e) On the independent test set (n\u0026thinsp;=\u0026thinsp;32, events\u0026thinsp;=\u0026thinsp;17), the model demonstrated acceptable calibration (Fig.\u0026nbsp;6). The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of \u0026minus;\u0026thinsp;0.606. The discrimination performance remained good (AUC\u0026thinsp;=\u0026thinsp;0.859).\u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 6. Calibration curve of the multimodal model in the independent test set.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eCalibration plot comparing mean predicted probability and observed event rate in the test set (\u003cb\u003en\u0026thinsp;=\u0026thinsp;32\u003c/b\u003e). The dashed line indicates perfect calibration. Model calibration statistics are reported in the panel (\u003cb\u003eBrier score\u0026thinsp;=\u0026thinsp;0.187; calibration slope\u0026thinsp;=\u0026thinsp;0.435; intercept\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;0.606; AUC\u0026thinsp;=\u0026thinsp;0.859\u003c/b\u003e).\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab5\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 5\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003ePerformance of the clinical baseline logistic regression model\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDataset\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eEvent / Non-event\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eAccuracy\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eF1-score\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTraining\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e150 / 94\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.6352\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.6764\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eValidation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e15 / 15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.6333\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.6207\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTest\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e17 / 15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.4688\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.3704\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eComparison across datasets\u003c/h2\u003e \u003cp\u003eAcross the training, validation and test sets, the multimodal deep learning model showed higher discrimination and classification performance than the clinical baseline model.\u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eIn this single-centre retrospective cohort study of patients with COPD, we developed a multimodal deep learning framework that integrates routine chest CT\u0026ndash;derived lung-component representations with clinical variables to predict 1-year composite cardiovascular events. The model showed good discrimination in the training and validation sets and maintained performance in an independent held-out test set, suggesting predictive potential within the current sample size. A key contribution of this work is not simply the use of deep learning for classification but the component-level decomposition of lung structure. By partitioning the lung into three biologically interpretable regions\u0026mdash;relatively normal lung, emphysema and pulmonary vasculature\u0026mdash;and learning representations from each region separately before multimodal fusion, the framework aims to capture structural heterogeneity in COPD in a more organised and potentially interpretable manner for outcome-oriented risk stratification. [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]\u003c/p\u003e \u003cp\u003eCOPD-related structural injury extends beyond airway and parenchymal changes and involves multi-pathway alterations, including reduction and remodelling of the pulmonary vascular bed. Previous studies have suggested that emphysema-related alveolar destruction may reduce the alveolar\u0026ndash;capillary bed, thereby affecting pulmonary vascular resistance and right-heart load; pulmonary vascular pruning and perfusion abnormalities may also interact with systemic inflammation, oxidative stress and endothelial dysfunction, potentially contributing to elevated cardiovascular risk in COPD.[\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e] The present study is designed for outcome prediction, and the observed associations between imaging phenotypes and subsequent events do not establish causality.[\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] Nonetheless, a component-level representation may offer a more granular imaging perspective than conventional whole-lung or single-threshold metrics, which can be limited in capturing spatially heterogeneous structural patterns. Future work that combines interpretability approaches (eg, saliency or occlusion analyses) with comparisons against established quantitative CT measures may help examine whether the model\u0026rsquo;s informative regions align with plausible physiological mechanisms.\u003c/p\u003e \u003cp\u003eWe observed higher performance in the validation set than in the test set. This pattern is common in medical imaging deep learning studies and may be amplified under small-sample conditions. [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]The limited size of the validation and test sets means performance estimates may be sensitive to individual cases. In addition, although stratified sampling was used to balance event proportions, residual heterogeneity between subsets (eg, comorbidity profiles, CT acquisition parameters and reconstruction settings) may remain and influence generalisation. Moreover, some degree of adaptation to the training distribution is expected during optimisation, which can manifest as reduced performance on held-out data. Notably, the test-set AUC remained relatively high, indicating discrimination in the internal held-out evaluation. However, given the single split and the small test set, these estimates are uncertain and should not be overgeneralised as evidence of clinical generalisability. A more cautious interpretation is that the model demonstrated predictive potential in an internal test set, warranting further evaluation using repeated resampling/bootstrapping and, critically, external cohorts, together with calibration assessment and clinical utility analyses.\u003c/p\u003e \u003cp\u003eA practical advantage of the proposed approach is that the imaging input is derived from routine chest CT, which is commonly used in COPD assessment and follow-up and does not require additional tests. If the model proves stable in multi-centre external validation and undergoes appropriate calibration and clinical utility assessment, it could potentially be incorporated into imaging workflows to provide short-term cardiovascular risk estimates at the time of CT interpretation or follow-up review, supporting more refined risk stratification and targeted cardiovascular evaluation. [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]Importantly, the net benefit of any downstream clinical action based on model predictions must be demonstrated before implementation in real-world settings.\u003c/p\u003e \u003cp\u003eThis study has limitations. First, it is a single-centre retrospective study with a limited sample size, particularly in the validation and test sets, which may affect the stability of performance estimates. Second, although multi-vendor CT scanners were included, differences in scanning protocols and reconstruction algorithms may introduce variability, and no domain adaptation or batch-effect correction was applied. Third, the work focused on predictive performance and did not include formal interpretability analyses, limiting transparency regarding model decision-making. Finally, follow-up duration was relatively short, and longer-term cardiovascular risk prediction was not assessed. [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]Future studies should validate the model in larger, multi-centre cohorts to better assess generalisability and clinical applicability, incorporate additional clinical indicators and longer follow-up, and apply interpretability and robustness analyses to enhance transparency and support translation toward clinical use. [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eIn a cohort of 306 patients with COPD, we developed and internally evaluated a multimodal deep learning framework that integrates component-level lung structural representations from routine chest CT (relatively normal lung, emphysema and pulmonary vasculature) with clinical variables to predict 1-year composite cardiovascular events. The model achieved AUCs of 0.8794, 0.9822 and 0.8588 in the training, validation and test sets, respectively, and outperformed an interpretable clinical baseline model based on common cardiovascular risk factors, suggesting that CT-derived imaging representations may provide incremental information for short-term outcome prediction in this setting. Nevertheless, this was a single-centre retrospective study with limited validation/test sample sizes and internal testing only; external validation, calibration assessment and evaluation of clinical net benefit are required to determine generalisability and real-world utility.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003eEthics approval and consent to participate\u003c/p\u003e\n\u003cp\u003eThe study protocol was approved by the Ethics Committee of the Fourth Affiliated Hospital of Harbin Medical University (approval No. 2025-115, 15 August 2025). The requirement for written informed consent was waived due to the retrospective design and the use of de-identified data.\u003c/p\u003e\n\u003cp\u003eConsent for publication\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003eAvailability of data and materials\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe data that support the findings of this study are not publicly available due to patient privacy and ethical restrictions. De-identified data may be made available from the corresponding author upon reasonable request and with approval from the Ethics Committee of the Fourth Affiliated Hospital of Harbin Medical University (or the institutional data access committee, as applicable).\u003c/p\u003e\n\u003cp\u003eCompeting interests\u003c/p\u003e\n\u003cp\u003eThe authors declare that they have no competing interests.\u003c/p\u003e\n\u003cp\u003eFunding\u003c/p\u003e\n\u003cp\u003eThis research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.\u003c/p\u003e\n\u003cp\u003eAuthors\u0026rsquo; contributions\u003c/p\u003e\n\u003cp\u003eYu Kang: Data curation; Methodology; Writing\u0026mdash;original draft.\u003c/p\u003e\n\u003cp\u003eChunhui Qin: Data acquisition; Validation.\u003c/p\u003e\n\u003cp\u003eYuxin Li: Image processing; Software.\u003c/p\u003e\n\u003cp\u003eChunxiao Wang: Formal analysis (statistical analysis).\u003c/p\u003e\n\u003cp\u003eShuai Yunjia: Data curation.\u003c/p\u003e\n\u003cp\u003eLi Ao: Data curation.\u003c/p\u003e\n\u003cp\u003eChen Jiahui: Data curation.\u003c/p\u003e\n\u003cp\u003eTong Zhang: Conceptualization; Supervision; Project administration; Writing\u0026mdash;review \u0026amp; editing; Guarantor.\u003c/p\u003e\n\u003cp\u003eAll authors read and approved the final manuscript.\u003c/p\u003e\n\u003cp\u003ePatient and public involvement\u003c/p\u003e\n\u003cp\u003ePatients and/or the public were not involved in the design, conduct, reporting or dissemination plans of this research.\u003c/p\u003e\n\u003cp\u003eAcknowledgements\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003eAuthors\u0026rsquo; information\u003c/p\u003e\n\u003cp\u003eTong Zhang, MD\u003c/p\u003e\n\u003cp\u003eDepartment of Radiology, The Fourth Affiliated Hospital of Harbin Medical University\u003c/p\u003e\n\u003cp\u003eNo. 37, YiYuan Street, NanGang District, Harbin 150001, Heilongjiang, China\u003c/p\u003e\n\u003cp\u003eTel: +86-451-82576831\u003c/p\u003e\n\u003cp\u003eE-mail: [email protected]\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eCornelius T. Clinical guideline highlights for the hospitalist: GOLD COPD update 2024. J Hosp Med. 2024;19(9):818\u0026ndash;20.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAgust\u0026iacute; AG, Noguera A, Sauleda J, Sala E, Pons J, Busquets X. Systemic effects of chronic obstructive pulmonary disease. Eur Respir J. 2003;21(2):347\u0026ndash;60.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen W, Thomas J, Sadatsafavi M, FitzGerald JM. Risk of cardiovascular comorbidity in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Respir Med. 2015;3(8):631\u0026ndash;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDonaldson GC, Hurst JR, Smith CJ, Hubbard RB, Wedzicha JA. Increased risk of myocardial infarction and stroke following exacerbation of COPD. Chest. 2010;137(5):1091\u0026ndash;7.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLynch DA, Al-Qaisi MA. Quantitative computed tomography in chronic obstructive pulmonary disease. J Thorac Imaging. 2013;28(5):284\u0026ndash;90.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWashko GR, Nardelli P, Ash SY, Vegas Sanchez-Ferrero G, Rahaghi FN, Come CE, Dransfield MT, Kalhan R, Han MK, Bhatt SP, et al. Arterial Vascular Pruning, Right Ventricular Size, and Clinical Outcomes in Chronic Obstructive Pulmonary Disease. A Longitudinal Observational Study. Am J Respir Crit Care Med. 2019;200(4):454\u0026ndash;61.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFalk T, Mai D, Bensch R, \u0026Ccedil;i\u0026ccedil;ek \u0026Ouml;, Abdulkadir A, Marrakchi Y, B\u0026ouml;hm A, Deubner J, J\u0026auml;ckel Z, Seiwald K, et al. U-Net: deep learning for cell counting, detection, and morphometry. Nat Methods. 2019;16(1):67\u0026ndash;70.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWeintraub WS. Statistical Approaches to Composite Endpoints. JACC Cardiovasc Interv. 2016;9(22):2289\u0026ndash;91.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSinaiko AR, Jacobs DR Jr., Woo JG, Bazzano L, Burns T, Hu T, Juonala M, Prineas R, Raitakari O, Steinberger J, et al. The International Childhood Cardiovascular Cohort (i3C) consortium outcomes study of childhood cardiovascular risk factors and adult cardiovascular morbidity and mortality: Design and recruitment. Contemp Clin Trials. 2018;69:55\u0026ndash;64.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHasenstab KA, Tabalon J, Yuan N, Retson T, Hsiao A. CNN-based Deformable Registration Facilitates Fast and Accurate Air Trapping Measurements at Inspiratory and Expiratory CT. Radiol Artif Intell. 2022;4(1):e210211.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVan Calster B, McLernon DJ, van Smeden M, Wynants L, Steyerberg EW. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019;17(1):230.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePistenmaa CL, Nardelli P, Ash SY, Come CE, Diaz AA, Rahaghi FN, Barr RG, Young KA, Kinney GL, Simmons JP, et al. Pulmonary Arterial Pruning and Longitudinal Change in Percent Emphysema and Lung Function: The Genetic Epidemiology of COPD Study. Chest. 2021;160(2):470\u0026ndash;80.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBarr RG, Bluemke DA, Ahmed FS, Carr JJ, Enright PL, Hoffman EA, Jiang R, Kawut SM, Kronmal RA, Lima JA, et al. Percent emphysema, airflow obstruction, and impaired left ventricular filling. N Engl J Med. 2010;362(3):217\u0026ndash;27.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePolverino F, Celli BR, Owen CA. COPD as an endothelial disorder: endothelial injury linking lesions in the lungs and other organs? (2017 Grover Conference Series). \u003cem\u003ePulm Circ\u003c/em\u003e 2018, 8(1):2045894018758528.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZantvoort K, Nacke B, G\u0026ouml;rlich D, Hornstein S, Jacobi C, Funk B. Estimation of minimal data sets sizes for machine learning predictions in digital mental health interventions. NPJ Digit Med. 2024;7(1):361.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Mak. 2006;26(6):565\u0026ndash;74.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCollins GS, Reitsma JB, Altman DG, Moons KG. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162(1):55\u0026ndash;63.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"chronic obstructive pulmonary disease, chest computed tomography, lung segmentation, deep learning, multimodal fusion, cardiovascular events","lastPublishedDoi":"10.21203/rs.3.rs-8733857/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8733857/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eBackground\u003c/h2\u003e \u003cp\u003ePatients with chronic obstructive pulmonary disease (COPD) are at increased risk of cardiovascular disease (CVD) events. We aimed to develop and internally validate a multimodal deep learning model that integrates chest CT\u0026ndash;derived lung-component information with clinical variables to predict 1-year composite CVD events in COPD.\u003c/p\u003e\u003ch2\u003eMethods\u003c/h2\u003e \u003cp\u003eWe enrolled COPD patients between January 2023 and January 2025 who underwent baseline thin-slice non-contrast chest CT and completed 12-month follow-up. The primary endpoint was the first occurrence within 12 months of a composite CVD event (acute myocardial infarction, ischaemic stroke, hospitalisation for unstable angina, or hospitalisation for heart failure). CT images were standardised and resampled to 1.0\u0026times;1.0\u0026times;1.0 mm voxels. Lung parenchyma was segmented using a U-Net\u0026ndash;based model; emphysema regions were identified by thresholding (HU\u0026thinsp;\u0026lt;\u0026thinsp;\u0026minus;\u0026thinsp;950); pulmonary vessels were enhanced using a Frangi filter and extracted via connected component analysis. A three-branch 3D convolutional neural network (3D-CNN) was constructed to extract imaging features from normal lung tissue, emphysematous regions, and pulmonary vascular structures, which were fused with clinical features (including age, smoking pack-years, and FEV1%) using an attention mechanism. The dataset was split into training/validation/test sets (8:1:1) with stratified sampling. Discrimination and classification metrics were reported on the independent test set; calibration was assessed using a calibration curve, Brier score, and calibration intercept/slope.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003eA total of 306 COPD patients were included (training n\u0026thinsp;=\u0026thinsp;244, validation n\u0026thinsp;=\u0026thinsp;30, test n\u0026thinsp;=\u0026thinsp;32). Within 12 months, 182 composite CVD events occurred (59.5%). On the independent test set (events\u0026thinsp;=\u0026thinsp;17/32), the model achieved an AUC of 0.8588 with an accuracy of 78.12%, precision of 81.25%, recall/sensitivity of 76.47%, specificity of 80.00%, and F1-score of 0.7879. The Brier score was 0.187, with a calibration slope of 0.435 and an intercept of \u0026minus;\u0026thinsp;0.606.\u003c/p\u003e\u003ch2\u003eConclusions\u003c/h2\u003e \u003cp\u003eA multimodal deep learning framework combining lung-component CT features and clinical variables demonstrated good discrimination for predicting 1-year composite CVD events in COPD. Further external validation and calibration refinement are warranted before clinical deployment.\u003c/p\u003e","manuscriptTitle":"Multimodal Deep Learning Integrating Chest CT Lung-Component Segmentation and Clinical Variables for Prediction of 1-Year Composite Cardiovascular Events in COPD: A Single-Centre Retrospective Cohort Study","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-27 11:27:53","doi":"10.21203/rs.3.rs-8733857/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"8623a513-63b5-40a5-b902-6b0d4266d083","owner":[],"postedDate":"February 27th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-04-23T16:10:09+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-27 11:27:53","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8733857","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8733857","identity":"rs-8733857","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00