Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study

doi:10.21203/rs.3.rs-9036340/v1

Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study

2026 · doi:10.21203/rs.3.rs-9036340/v1

preprint OA: closed

Full text JSON View at publisher

Full text 102,310 characters · extracted from preprint-html · click to expand

Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study Samuel Chiacchia, Katie Lebold, Andrew Moore, Hayley Hedlin, Christian Rose, and 2 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9036340/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 6 You are reading this latest preprint version Abstract Background Prediction of organ support needs, rather than mortality or critical care transfer alone, may improve the utility of early warning scores (EWS). Existing EWS may have limited sensitivity in predicting organ support due to reliance on cross-sectional snapshots of patient physiology, limiting their ability to account for changes in patient status. We aimed to develop and compare novel models capable of using longitudinal clinical data to predict organ support or death (OSD) within 48 hours of hospital admission. Methods We leveraged a retrospective cohort of adult ED encounters at a U.S. quaternary academic medical center from March 1, 2022, to February 5, 2024. Encounters were included if patients were ≥ 18 years and admitted to a medical service; those receiving organ support in the ED were excluded. The primary outcome was a composite of vasopressor initiation, invasive mechanical ventilation, continuous renal replacement therapy, or death within 48 hours of admission. Performance metrics included AUROC, AUPRC, sensitivity, and specificity. Results 1.7% (549/32,329) experienced organ support or death within 48 hours of admission. The transformer-based neural net demonstrated the strongest overall performance, with an AUROC of 0.84 and AUPRC of 0.20, outperforming the baseline to National Early Warning Score 2 (NEWS2) with higher sensitivity for the primary outcome (0.78 vs. 0.61) while maintaining sufficient specificity (0.71 vs. 0.83). XGBoost and elastic-net regression showed similar improvements in sensitivity (both 0.83) with modest reductions in specificity relative to NEWS2 calculated at time of admission. Conclusions Organ support represents a potentially modifiable and temporally proximal marker of critical illness. Models trained to interpret longitudinal trends in clinical variables—rather than cross-sectional snapshots—may better mirror clinician reasoning. emergency critical care organ support time-series analysis Figures Figure 1 Figure 2 Figure 3 Figure 4 BACKGROUND The volume and complexity of patients presenting to emergency departments (EDs) have increased substantially, without proportional growth in inpatient capacity. 1 As a result, emergency physicians face increasing pressure to triage and disposition patients accurately and efficiently. Among admitted patients, unplanned ICU transfers are associated with higher morbidity and mortality, making early identification of patients at risk for near-term decompensation essential for the equitable and effective use of critical care resources 2–8 . Although emergency clinicians excel at recognizing and stabilizing overt critical illness, it remains difficult to determine which patients who appear stable at the time of admission will ultimately require ICU-level care early in their hospital course. The Society of Critical Care Medicine has emphasized the lack of reliable, objective criteria for ICU triage, while endorsing the use of early warning scores (EWS) to support risk stratification. 9 Traditionally, EWS development has focused on mortality and ICU transfer as endpoints. Yet mortality endpoints may reflect an unpreventable outcome while failing to identify patients more likely to respond to timely interventions. 10 Transfer-based endpoints, while more sensitive than mortality, are at risk for bias shaped by institution-specific ICU admission practices and policies, limiting generalizability. Thus, scores like NEWS2 may miss patients who require escalation of care but lack immediate ICU transfer criteria. 10,11 Shifting focus to prediction of future organ support needs could overcome these limitations and provide important resolution with respect to delivery of specific critical care resources and appropriate disposition at the time of admission. 10 However, existing early warning scores, like NEWS2, appear limited in their ability to predict near-term organ support needs using clinical data collected from ED patients at the time of admission. 11,12 We hypothesize that addressing these limitations using novel models that incorporate longitudinal vital sign trends, in addition to laboratory and demographic data, could improve prediction of organ support needs at the time of hospital admission, thereby supporting triage of admitted patients at-risk for near-term decompensation. While similar approaches have been applied in ICU patient populations, few have evaluated such an approach could be applied to ED patient populations 13–17 . There are multiple model architectures able to process longitudinal and multi-modal data streams; each with their own strengths and limitations. For example, linear regression is easily interpretable but fails to capture non-linear relationships between input data and outcomes of interest and requires feature engineering to first be applied to time-series data. Tree-based models, such as XG-Boost, can capture such non-linear relationships and handle missing data natively, but like regression, require feature engineering to provide temporal context to time-series data. Newer machine learning approaches, such as transformer-based neural nets, are well suited for processing missing and time-series data and can capture non-linear relationships between variables, but require vast amounts of training data and therefore may be limited in prediction of rare clinical outcomes such as organ support or mortality events. To test these strengths and limitations, and to compare performance across different model architectures, we developed three binary classifiers that leverage time series data to predict organ support or death (OSD) within 48-hours of admission from the ED. METHODS Study Design, Setting, and Population We conducted a retrospective cohort study using all adult ED encounters at a U.S. quaternary academic medical center between March 1, 2022 and February 5, 2024, where the disposition was admission to an inpatient medical service. Patients were included if they were ≥18 years old and admitted from the ED to a predefined set of adult medical services, including general medicine, subspecialty medicine (e.g., cardiology, hepatology, oncology), and intensive care units. We excluded encounters of patients admitted to surgical or psychiatric services, ED observation units, those with missing patient identifiers, or without an ED evaluation (i.e. direct admissions). Patients admitted to surgical services were excluded due to the different system of care used for the triage and initial management of traumatically injured patients. Patients who received organ support (invasive mechanical ventilation, vasopressors, or continuous renal replacement therapy) or died in the emergency department were excluded from the analysis, as they had already met the endpoint of interest. We treated each ED encounter as unique in our models such that multiple admissions from an individual patient were included and handled as described below. This study was conducted in accordance with the ethical principles outlined in the Declaration of Helsinki and was approved by the Stanford University Institutional Review Board IRB 69934. The requirement for informed consent was waived due to the retrospective nature of the study and use of de-identified electronic health record data. Primary and Secondary Outcomes The primary composite outcome was organ support or death within 48 hours of hospital admission. Organ support was defined as initiation of vasopressors, invasive mechanical ventilation, or continuous renal replacement therapy (CRRT). Non-invasive positive pressure ventilation and high-flow nasal oxygen were not considered organ support for the purpose of this analysis. Extracorporeal membrane oxygenation (ECMO) was also not included as organ support, as it was assumed patients received other organ support therapies before initiation of ECMO. Secondary outcomes include each type of organ support individually or death within 48 hours of admission. Data Collection, Processing, and Partitioning Structured electronic health record (EHR) data were extracted from the research data warehouse, including demographics, vital signs, laboratory values, and hospital course details relevant to our primary and secondary outcomes including administration of vasoactive medications, mechanical ventilation, continuous renal replacement therapy, or death. Use of vasopressors was identified via continuous infusion orders for norepinephrine, epinephrine, vasopressin, dobutamine, dopamine, or phenylephrine; push-dose pressors were excluded. Receipt of mechanical ventilation was determined using respiratory care flowsheets. Continuous renal replacement therapy was identified by dialysis nursing orders documenting initiation of CRRT. Time-to-event variables were calculated relative to the admission timestamp, and not a patient’s physical location, to assess outcome timing. Predictor variables were chosen based on previous work exploring performance of longitudinal vital signs and laboratory values in predicting clinical outcomes among ICU patients 18–23 . Charlson comorbidity scores were derived using validated mappings of ICD-9 and ICD-10 codes 24 . To include time-series data in linear and tree based models, minimum, maximum, and composite metrics for ED-based vital signs were calculated. Composite metrics were calculated by calculating the slope of the line of best fit of each vital sign with respect to time multiplied by the R^2 value to control for strength of fit. 25 Shock index was calculated each time a systolic blood pressure and heart rate were recorded within 5 minutes of each other. Multiply imputed data were used for the elastic net regression models. We performed predictive mean matching (PMM) multiple imputation using the {mice} package 3.19.0. 26 Twenty imputations were performed using five iterations per chain. All continuous variables were standardized using z-score normalization prior to imputation. A 75/15/10 split, constrained to maintain patient-level temporal ordering, for training, validation, and testing sets was used. To prevent data leakage from repeat patient encounters, we implemented a patient-level temporal data split. The dataset was partitioned by medical record number using a temporal ordering approach, where each patient was sorted by earliest ED arrival time, producing. All encounters for a given patient were assigned to a single partition. Training set class distributions could not be balanced due to positive class rarity; thus class imbalance was addressed using a weighted loss function. The validation and test sets retained their natural class distributions for realistic performance assessment. All data collection, processing, partitioning, and subsequent analysis was completed using R 4.5.1. 27 Model Development We trained logistic elastic net regression models to predict the primary outcome within 48-hours after admission using {glmnet} 4.1-10. 28 Each model was trained using 5-fold time-series cross-validation, with hyperparameters (penalty λ and mixture α) tuned over a predefined grid (α 0-1 by 0.05; λ 10 -4 -10 -1 by 10 -1/3 ). The final model was selected based on the highest area under the precision recall curve (AUPRC) evaluated on validation data. A gradient-boosted decision tree model was developed using the {xgboost} 3.1.3.1 engine to predict primary outcome within 48-hours. 29 A Latin hypercube sampling strategy was used to explore combinations of hyperparameters, including number of trees, learning rate, maximum tree depth, and regularization parameters. Model tuning used the same time-series cross-validation strategy as the elastic net regression. The final model was selected based on the highest area under the precision recall curve (AUPRC) evaluated on validation data. We developed a transformer-based neural network model using the {torch} 0.16.3 to predict the primary outcome. 30,31 The model processes sequences of up to 150 timesteps containing vital signs, laboratory values, patient demographics, ED length of stay, and Charlson comorbidity index. To preserve information about data availability, we implemented a dual-feature architecture where each clinical variable is accompanied by a binary indicator denoting whether the value was observed (1) or missing (0). Missing values were set to zero after normalization, allowing the model to learn appropriate uncertainty based on data completeness. Continuous features were normalized using z-score standardization, with means and standard deviations calculated exclusively from observed (non-missing, non-padded) values in the training set; these parameters were then applied to validation and test sets for normalization. The transformer architecture consisted of 4 encoder layers with 2 attention heads, a feedforward dimension of 128, sinusoidal positional encodings to capture temporal ordering, and a 3-layer classification head with ReLU activations and dropout (0.2, 0.1) for regularization. We employed padding masks to exclude completely empty timesteps from attention computations. The model was trained using the AdamW optimizer (learning rate=1×10⁻⁴, weight decay=1×10⁻³) with binary cross-entropy loss weighted by class frequency to address outcome imbalance. Training proceeded for 20 epochs with early stopping based on validation set precision-recall area under the curve (AUPRC), and gradient clipping (max norm=1) was applied to prevent instability. Platt Scaling and Model Performance Evaluation Model calibration was performed using Platt scaling, where raw logits from the validation set were used to fit a logistic regression model. 32 Calibrated models were then used to transform raw predictions on the held-out test set into final probabilities. Performance was assessed on the held-out test set using area under the receiver operating curve (AUROC), AUPRC, sensitivity, specificity, and predictive values. The optimal threshold for binary classification was determined using Youden’s J statistic. Model discrimination for secondary outcomes was evaluated using PRC and ROC curves with AUC calculations. We employed SHapley Additive exPlanations (SHAP) to quantify feature importance and interpretability of model predictions in python. 33 We generated summary plots displaying the distribution of SHAP values for each feature, where the magnitude indicates importance and the sign indicates the direction of effect on prediction. For continuous features, color-coding represents feature values, revealing non-linear relationships and interaction effects between feature values and their impact on predictions. NEWS2 Calculation NEWS2 was calculated using the closest documented vital signs and blood gas values (arterial or venous) to inpatient admission, defined as the time of entry of an admission order to a medical inpatient service. Hypercapnic respiratory failure was defined as pCO₂ >45 mmHg (ABG) or >50 mmHg (VBG), in keeping with NEWS2 precedent. Patients without blood gas data were presumed not to be hypercapnic. Per original recommendations, NEWS2 ≥5 or any individual component score of 3 was considered high risk. RESULTS Cohort characteristics and outcomes Of 148,727 adult ED encounters between March 2022 and February 2024, 32,329 (21%) met all inclusion criteria and were used for model development (Figure 1). ED encounters were most commonly excluded due to discharge home (n= 105,978, 71%) or admission to a surgical or psychiatric service (n= 9,877, 6%). Among included patient encounters, the median age was 64 years, 50% were male, and 44% had greater than one co-morbidity included in the Charlson co-morbidity index. Median ED length of stay was 4.9 hours, and median hospital length of stay was 3.8 days. Cohort selection, demographics, and hospital outcomes are further summarized in Figure 1. Six hundred ninety-six encounters required organ support or resulted in death during their hospital admission, with 576 (81%) meeting the primary outcome of organ support or death within 48 hours of hospital admission (Figure 2A). Patients who received organ support within 48 hours of admission most frequently required only one support modality (Figure 2B). Among subtypes of organ support, vasopressor use (n = 333, 1% of total cohort) and invasive mechanical ventilation (n = 304, 0.9%) were most common, followed by CRRT (n = 45, 0.1%). There were only 34 deaths within 48 hours of admission (0.1%). Patients who required organ support or died within 48 hours of admission had similar age to those that did not meet our primary outcome (66 years [43-89] vs 66 years [39-93]) but were more likely to be male (56% vs 50%) and have at least one chronic co-morbidity (56% vs 43%). ED length of stay was shorter among encounters with decompensation (median 3.9 hours [3.4] vs 5.0 hours [4.0]) while hospital length of stay was longer (8.0 days [11.9] vs 3.8 days [4.8]). Performance and feature importance across model architectures is similar The 32,329 samples in our cohort were randomly assigned to training (n = 26,346, 81%), validation (n = 3796, 12%), and test (2,187, 7%) sets. The prevalence of primary and secondary outcomes in training, validation, and test sets were similar to those observed in the full cohort (Supplemental Figure 1). We trained logistic elastic net regression models to predict the primary outcome within 48-hours after admission. Hyperparameter tuning over a predefined grid (α 0-1 by 0.05; λ 10 -4 -10 -1 by 10 -1/3 ) identified the highest AUPRC of 0.12 for ENR with alpha 0.3 and lambda 0.04. AUPRC for validation and test sets was 0.07 (Supplement Table 1). For gradient-boosted decision tree models, a Latin hypercube sampling strategy was used to explore combinations of hyperparameters, including number of trees, learning rate, maximum tree depth, and regularization parameters. The highest AUPRC of 0.15 was achieved with a model with 652 trees of depth 3, learn rate of 5.81×10 -7 , and loss reduction 3.3×10 -6 . AUPRC for validation and test sets were 0.08 and 0.12, respectively (Supplement Table 1). The small learning rate reflects convergence of the tuning procedure to the lower bound of a wide log-scaled search range. When the learning rate was constrained to conventional XGBoost values (0.001–0.1), model discrimination was largely unchanged (AUPRC 0.19), suggesting performance was not sensitive to this parameter. Finally, we trained a transformer using the AdamW optimizer (learning rate=1×10 -4 , weight decay=1×10 -3 ) with binary cross-entropy loss weighted by class frequency to address outcome imbalance. Training proceeded for 20 epochs; the strongest performing model had an AUPRC of 0.35. AUPRC for validation and test sets were 0.14 and 0.2 respectively (Supplement Table 1). Insight into feature importance was guided by model architecture. For ENR, we used the absolute value of the model coefficients to assess feature importance; for XGB, we used average gain (i.e. the mean improvement in model loss attributed to splits on each feature across all trees); for the transformer we used SHapley Additive exPlanations (SHAP). Similar features were important to model performance across all three architectures, specifically trends in systolic blood pressure, heart rate, respirations, pulse oximetry, as well as lab values such as venous pH, lactate, and blood urea nitrogen (Supplemental Figure 2). Figure 3 summarizes model performance in predicting our primary and secondary outcomes. In the prediction of our primary outcome among samples in the hold-out test set, the transformer had the highest AUPRC (0.2) while the XGB had the highest AUROC (0.86). Among secondary outcomes, performance was strongest in prediction of future vasopressors and CRRT and weakest in prediction of invasive mechanical ventilation and mortality across all model architectures (Figure 3). Times series models are more sensitive than NEWS2 in predicting organ support or death within 48 hours of admission Previously, we evaluated the performance of NEWS2 in prediction of future organ support or death after admission. 12 To compare performance of NEWS2 at time of admission to our time series models, we subset our test sample (n = 2,187) to include samples in which a NEWS2 score could be reliably calculated using our previous methods. 12 Among this subset of 1,545 patient encounters, 43 (2%) met our primary outcome of organ support or death within 48 hours of admission (Figure 4A). Of the 43 cases that met our primary outcome, 14 (33%) cases correctly identified by at least one time series model were missed by NEWS2 calculated at time of admission; 2 (4%) cases correctly flagged by NEWS2 were missed by all three time series models. Of the 1,497 encounters that did not involve organ support or result in death within 48 hours, 218 (14%) correctly identified by a time series model were missed by NEWS2 calculated at time of admission; 119 (7%) encounters correctly flagged by NEWS2 were missed by all three time series models. All three time series models had greater sensitivity than NEWS2 in prediction of our primary outcome as well as use of vasopressors or invasive mechanical ventilation within 48 hours of admission (78-83% vs 61%, Table 1). NEWS2 had greater specificity in prediction of our primary outcome (78% vs. 70-71%). Assessment of performance in prediction of CRRT and death in this sub-group was limited by the low prevalence of CRRT and death. Table 1: Comparison of Test Characteristics Among Times Series Models and NEWS2 NEWS2 ENR XGB Transformer Test Sample (N = 1,545) Sensitivity Specificity Sensitivity Specificity Sensitivity Specificity Sensitivity Specificity OSD (n = 38) 0.61 0.78 0.83 0.71 0.83 0.70 0.78 0.71 Vasopressors (n = 33) 0.60 0.78 0.85 0.71 0.90 0.70 0.85 0.71 Intubation (n = 21) 0.57 0.77 0.76 0.70 0.76 0.69 0.67 0.70 CRRT (n = 3) 0.67 0.77 1 0.70 1 0.69 1 0.70 Death (n = 2) 1 0.77 1 0.70 0.5 0.68 1 0.70 Case series analysis of discordant classifications illustrates limitations of cross-sectional risk scores like NEWS2 While feature importance highlights the role of longitudinal trends in driving model performance for predicting near-term organ support (Supplemental Figure 2), we sought to directly visualize how temporal changes in vital signs may result in misclassification of high risk patients by cross-sectional risk scores, like NEWS. To accomplish this, we conducted a case series analysis of positive samples with discordant predictions between NEWS2 and our time-series models. Figure 4A-4B illustrate discordant true positive and true negative predictions between NEWS2 and the time series models. We examined the 10 samples correctly predicted by all three time-series models to meet OSD within 48 hours of admission that were not identified by NEWS2 calculated at the time of admission (Figure 4A). We plotted the vital signs used to calculate NEWS2 with respect to time (Figure 4C–F). These figures illustrate vital sign trends over time; these trends are not captured by cross sectional risk scores and could explain differences in sensitivity observed in Table 1. DISCUSSION In this retrospective study of approximately 32,000 ED admissions, we explored whether real-world, longitudinal clinical data improved prediction of organ support or death within 48-hours of admission relative to the existing cross sectional risk score, NEWS2. A central contribution of this work is the use of organ support, rather than mortality or ICU transfer alone, as a primary endpoint. Organ support represents a potentially modifiable and temporally proximal marker of critical illness. Unlike mortality, which may reflect irreversible disease processes, or ICU transfer, which is influenced by local policy and bed availability, initiation of vasopressors, mechanical ventilation, or CRRT more directly reflects physiologic decompensation requiring intervention. Aligning prediction models with such actionable endpoints may improve relevance for future interventional studies. Second, we hypothesized that longitudinal modeling better aligns with clinician reasoning, and observed similar discrimination and improved sensitivity relative to a cross-sectional score (NEWS2), suggesting that incorporating physiologic trajectories may provide incremental predictive information beyond admission-time snapshots. The similar discrimination achieved across regression, tree-based, and transformer models suggests that the predictive signal may lie primarily in the longitudinal physiologic information itself rather than in any specific modeling architecture. This finding is important, as it indicates that improvements in prediction may stem from better alignment of features with clinical trajectories rather than increasing algorithmic complexity. While encouraging, discrimination in the “80/80” range implies substantial misclassification. At low outcome prevalence (1.7%), even modest false-positive rates generate a large number of alerts relative to true events, raising questions about acceptable tradeoffs between sensitivity and alert burden. Our study was not designed to determine whether this level of misclassification is clinically acceptable, nor whether the model adds value beyond careful physician assessment. Future work must quantify the net clinical benefit across plausible risk thresholds and compare model performance directly against clinician gestalt in prospective settings. This study has several strengths. First, we analyzed a contemporary patient cohort from a large quaternary care center, capturing real-world clinical practice and preserving outcome prevalence in validation and test sets. Second, the use of patient-level temporal splits minimized data leakage and provide a rigorous assessment of generalizability. Third, we present a novel framework for predicting organ support outcomes by demonstrating models trained to predict composite outcomes (i.e. vasopressor, intubation, CRRT, or death) are also able to predict individual organ support outcomes with comparable sensitivity and specificity. This has important implications for future efforts to predict organ support that may be constrained by limited outcome prevalence, proving especially relevant for deep learning models like transformers. Fourth, we compared three distinct modeling paradigms against an existing risk score, NEWS2. Others have shown similar superior performance of machine learning models in predicting ICU admission, however, we chose organ support or death, which is subject to less institutional and provider variability and to increase generalizability. 34–36 Finally, this analysis aimed to compare performance across different model architectures, a crucial step prior to external validation and deployment. This study also has important limitations. It was conducted at a single academic medical center, limiting evaluation of generalizability to settings with different patient populations, resource availability, and ED workflows. The two-year timeframe constrained the number of positive cases and class imbalance may have influenced model behavior despite use of weighted loss functions. Although we selected a composite endpoint to address concerns that critically ill patients often require multiple organ support modalities, many patients in our cohort required only one form of support within the 48-hour window. Finally, although transformers demonstrated encouraging discrimination, their performance likely remains sensitive to the quality, density, and ordering of time-series data. Missingness patterns in EHR data are non-random and may encode clinical decision-making rather than pure physiology; while our dual-feature representation attempts to capture this, it may not fully resolve sampling bias. Finally, our models were trained on structured EHR data drawn from a heterogeneous population encompassing diverse disease states and organ failures. While this reflects real-world practice, it limits mechanistic insight and biological interpretability. Future efforts integrating physiologic trajectories with molecular or disease-specific data may offer greater explanatory power and potentially improved discrimination. CONCLUSIONS Organ support represents a potentially modifiable and temporally proximal marker of critical illness. Models trained to interpret longitudinal trends in clinical variables—rather than cross-sectional snapshots—may better mirror clinician reasoning. Our findings support the hypothesis that incorporating longitudinal data improves sensitivity in predicting near-term organ support or death. Declarations Ethics approval and consent to participate: This study was approved by the institutional review board (IRB 69934). Consent for publication: Not applicable. The study used de-identified data and did not include identifiable individual information Availability of data and materials: The datasets generated and analyzed during the current study are not publicly available due institutional data privacy policies but are available from the corresponding author on reasonable request. Competing interests: The authors of this manuscript have no competing interests to declare. Funding: This work was supported, in part, by a 2025-2026 Society for Academic Emergency Medicine (SAEM) resident research grant (RE2022- 0000000651) and by the following NIH funding source of Stanford’s Center for Clinical and Translational Education and Research award, under the Biostatistics, Epidemiology and Research Design (BERD) Program: UM1TR004921. Author contributions: SRC, JW, and DAK conceived of the initial research question and study design. SRC, DAK, and HH developed and implemented the statistical analysis plan. CR coordinated data acquisition from the institutional data warehouse. SRC, KL, and JW drafted the initial manuscript. All authors contributed to editing of the final manuscript. Acknowledgements: The authors would like to acknowledge the Stanford’s Emergency Department Analytics Committee (EDAC) for their contributions to EHR database management that supported this project. References Weinick, R. M., Bruna, S., Boicourt, R. M., Michael, S. S. & Sessums, L. L. AHRQ summit to address emergency department boarding. https://www.ahrq.gov/sites/default/files/wysiwyg/topics/ed-boarding-summit-report.pdf (2025). Delgado, M. K. et al. Risk factors for unplanned transfer to intensive care within 24 hours of admission from the emergency department in an integrated healthcare system. J. Hosp. Med. 8 , 13–19 (2013). Dahn, C. M. et al. A critical analysis of unplanned ICU transfer within 48 hours from ED admission as a quality measure. Am. J. Emerg. Med. 34 , 1505–1510 (2016). Solano, J. J. et al. Hospital ward transfer to intensive care unit as a quality marker in emergency medicine. Am. J. Emerg. Med. 35 , 753–756 (2017). Escobar, G. J. et al. Intra-hospital transfers to a higher level of care: contribution to total hospital and intensive care unit (ICU) mortality and length of stay (LOS). J. Hosp. Med. 6 , 74–80 (2011). Nates, J. L. et al. ICU admission, discharge, and triage guidelines: A framework to enhance clinical operations, development of institutional policies, and further research. Crit. Care Med. 44 , 1553–1602 (2016). Fernando, S. M. et al. Emergency Department disposition decisions and associated mortality and costs in ICU patients with suspected infection. Crit. Care 22 , (2018). Weissman, G. E. et al. Potentially preventable intensive care unit admissions in the United States, 2006-2015. Ann. Am. Thorac. Soc. 17 , 81–88 (2020). Honarmand, K. et al. Society of critical care medicine guidelines on recognizing and responding to clinical deterioration outside the ICU: 2023. Crit. Care Med. 52 , 314–330 (2024). Goodacre, S. Using clinical risk models to predict outcomes: what are we predicting and why? Emerg. Med. J. 40 , 728–730 (2023). Goodacre, S., Sutton, L., Fuller, G., Trimble, A. & Pilbery, R. Accuracy of the National Early Warning Score version 2 (NEWS2) in predicting need for time-critical treatment: retrospective observational cohort study. Emerg. Med. J. 42 , 396–402 (2025). Chiacchia, S. R. et al. Early warning score performance at time of admission in the prediction of future organ support needs. Acad. Emerg. Med. (2025) doi:10.1111/acem.70182. Choi, A. et al. Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department. Sci. Rep. 13 , 8561 (2023). Kwak, G. H., Ling, L. & Hui, P. Predicting the need for vasopressors in the Intensive Care unit using an attention based deep learning model. Shock 56 , 73–79 (2021). Duval, L. et al. Early prediction of vasopressor initiation in ICU sepsis patients using an interpretable EHR-based ML model. BMC Med. Inform. Decis. Mak. 25 , 442 (2025). Contreras, M. et al. Real-time prediction of intensive care unit patient acuity and therapy requirements using state-space modelling. Nat. Commun. 16 , 7315 (2025). Hyland, S. L. et al. Early prediction of circulatory failure in the intensive care unit using machine learning. Nat. Med. 26 , 364–373 (2020). Thorsen-Meyer, H.-C. et al. Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. Lancet Digit. Health 2 , e179–e191 (2020). Meiring, C. et al. Optimal intensive care outcome prediction over time using machine learning. PLoS One 13 , e0206862 (2018). Brekke, I. J., Puntervoll, L. H., Pedersen, P. B., Kellett, J. & Brabrand, M. The value of vital sign trends in predicting and monitoring clinical deterioration: A systematic review. PLoS One 14 , e0210875 (2019). Zheng, Z. et al. Development and validation of a dynamic real-time risk prediction model for Intensive Care units patients based on longitudinal irregular data: Multicenter retrospective study. J. Med. Internet Res. 27 , e69293 (2025). Deng, Y. et al. Explainable time-series deep learning models for the prediction of mortality, prolonged length of stay and 30-day readmission in intensive care patients. Front. Med. (Lausanne) 9 , 933037 (2022). Redfern, O. C. et al. Predicting in-hospital mortality and unanticipated admissions to the intensive care unit using routinely collected blood tests and vital signs: Development and validation of a multivariable model. Resuscitation 133 , 75–81 (2018). Deyo, R. A., Cherkin, D. C. & Ciol, M. A. Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases. J. Clin. Epidemiol. 45 , 613–619 (1992). Stein, D. F. et al. Prediction cardiovascular deterioration in a paediatric intensive care unit (PicEWS): a machine learning modelling study of routinely collected health-care data. EClinicalMedicine 85 , 103255 (2025). van Buuren, S. & Groothuis-Oudshoorn, K. mice: Multivariate Imputation by Chained Equations inR. J. Stat. Softw. 45 , (2011). The R project for statistical computing. https://www.R-project.org/. Tay, J. K., Narasimhan, B. & Hastie, T. Elastic net regularization paths for all generalized linear models. J. Stat. Softw. 106 , (2023). Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, New York, NY, USA, 2016). doi:10.1145/2939672.2939785. Falbel D, L. J. torch: Tensors and Neural Networks with “GPU” Acceleration . R package version 0.16.3. https://torch.mlverse.org/docs. (2025). Vaswani, A. et al. Attention is all you need. arXiv [cs.CL] (2025) doi:10.65215/2q58a426. Platt1999. Lundberg, S. & Lee, S.-I. A unified approach to interpreting model predictions. arXiv [cs.AI] (2017). Joseph, J. W. et al. Deep-learning approaches to identify critically Ill patients at emergency department triage using limited information. J. Am. Coll. Emerg. Physicians Open 1 , 773–781 (2020). Nguyen, M. et al. Developing machine learning models to personalize care levels among emergency room patients for hospital admission. J. Am. Med. Inform. Assoc. 28 , 2423–2432 (2021). Boulitsakis Logothetis, S., Green, D., Holland, M. & Al Moubayed, N. Predicting acute clinical deterioration with interpretable machine learning to support emergency care decision making. Sci. Rep. 13 , 13563 (2023). Additional Declarations No competing interests reported. Supplementary Files SUPPLEMENTAL.docx Cite Share Download PDF Status: Under Review Version 1 posted Reviewers agreed at journal 05 Apr, 2026 Reviewers invited by journal 30 Mar, 2026 Editor invited by journal 13 Mar, 2026 Editor assigned by journal 12 Mar, 2026 Submission checks completed at journal 12 Mar, 2026 First submitted to journal 05 Mar, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9036340","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":614260036,"identity":"a74357e4-50ff-46fe-a1f1-7b53dbf2b0bc","order_by":0,"name":"Samuel Chiacchia","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAxUlEQVRIie3OsQrCMBCA4SuBTHmDon2FKwF1qPoqlkC37o6dOlVni32IrG6FQqbiK1jo2qGjQ0CrOAmSdnPIv9xyH3cANtt/Rpr+PUoABuAkIwj1T4DD2I0n4LJJxDtXDQ+09paUqL6DYCZLA8FrhG2con9JqcgLiLiZMAY8TtCRt4wTBlVoJF7GwF1p3ErFXuRhJlAPBCiGH1KaCdZ04R9SLqSiwilQ8Nz8GGmbu56vpSIVdPvN7Gh87OvotHWbzWaz/egJpp49927kSigAAAAASUVORK5CYII=","orcid":"","institution":"Stanford Medicine","correspondingAuthor":true,"prefix":"","firstName":"Samuel","middleName":"","lastName":"Chiacchia","suffix":""},{"id":614260037,"identity":"0d70531b-e2af-4b49-a710-b4b2bec363cb","order_by":1,"name":"Katie Lebold","email":"","orcid":"","institution":"Oregon Health \u0026 Science University","correspondingAuthor":false,"prefix":"","firstName":"Katie","middleName":"","lastName":"Lebold","suffix":""},{"id":614260038,"identity":"8066b07b-67d5-4234-8a18-a672128c1b2d","order_by":2,"name":"Andrew Moore","email":"","orcid":"","institution":"Stanford Medicine","correspondingAuthor":false,"prefix":"","firstName":"Andrew","middleName":"","lastName":"Moore","suffix":""},{"id":614260039,"identity":"9d978a5d-3a27-42fb-affa-32830b5805e1","order_by":3,"name":"Hayley Hedlin","email":"","orcid":"","institution":"Stanford Medicine","correspondingAuthor":false,"prefix":"","firstName":"Hayley","middleName":"","lastName":"Hedlin","suffix":""},{"id":614260040,"identity":"e60fffe9-0781-4060-9dee-f38e5eb328a3","order_by":4,"name":"Christian Rose","email":"","orcid":"","institution":"Stanford Medicine","correspondingAuthor":false,"prefix":"","firstName":"Christian","middleName":"","lastName":"Rose","suffix":""},{"id":614260041,"identity":"267444f0-c283-4923-82bc-dff1a2c4fd4f","order_by":5,"name":"David Kim","email":"","orcid":"","institution":"Stanford Medicine","correspondingAuthor":false,"prefix":"","firstName":"David","middleName":"","lastName":"Kim","suffix":""},{"id":614260042,"identity":"10464832-9b3f-47f0-934d-c4aeb960c716","order_by":6,"name":"Jenny Wilson","email":"","orcid":"","institution":"Stanford Medicine","correspondingAuthor":false,"prefix":"","firstName":"Jenny","middleName":"","lastName":"Wilson","suffix":""}],"badges":[],"createdAt":"2026-03-05 05:54:12","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9036340/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9036340/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":106069840,"identity":"f0b9b1ab-3a1b-40a2-9dd5-7ba835c010ab","added_by":"auto","created_at":"2026-04-03 06:25:48","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":154980,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eConsort diagram illustrating cohort selection with associated demographics and clinical outcomes.\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"Figure1OSD.jpg","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/7a870b84f94e1a58106e6062.jpg"},{"id":106069812,"identity":"0d5ca2be-ddec-4873-ac45-6a3fce4d34a2","added_by":"auto","created_at":"2026-04-03 06:25:38","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":111339,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTiming and co-occurrence of critical care outcomes (i.e., vasopressors, intubation, CRRT, or death) among admitted patients. \u003c/strong\u003eA) Density plot representing distribution of individual critical care outcomes with respect to timing of hospital admission order placement among all patients that experienced organ support or death after admission; x-axis represents time after admission order is placed; y-axis represents probability density of individual critical care outcomes; dashed vertical line indicates 48-hours after admission. B) Upset plot illustrating overlap between critical care outcomes organized by individual critical care outcomex.\u003c/p\u003e","description":"","filename":"Figure2OSD.jpg","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/0a9ccf41e013de012f2715fd.jpg"},{"id":106069797,"identity":"08ef9c9e-05d3-4a0e-ae04-58928e588192","added_by":"auto","created_at":"2026-04-03 06:25:32","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":752269,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eComparison of Model Performance by AUROC and AUPRC.\u003c/strong\u003e A,B,C) ROCs for hold-out test sample (N = 2,187) generated by ENR (A), XGB (B), and Transformer (C) models in prediction of organ support or death (OSD) or individual critical care outcomes within 48 hours of admission. D) Table summarizing AUPRC, AUROC by model architecture and outcome.\u003c/p\u003e","description":"","filename":"Figure3OSD.jpg","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/9f93759e3eb2197053bf5263.jpg"},{"id":106069798,"identity":"a52fc683-537f-4eb1-8106-7434239ba2f5","added_by":"auto","created_at":"2026-04-03 06:25:33","extension":"jpeg","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":198007,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eNEWS2 at Time of Admission vs Time Series Models in Prediction of Organ Support or Death\u003c/strong\u003e A-B) Upset plot illustrating discordance in true positives and true negatives in our hold out test set. C-F) Vital signs used for NEWS2 calculation with respect to relative collection time during ED course among 10 patients correctly identified by all three time series models but missed by NEWS2 at time of admission; sample intersect denoted in Figure 4A by red asterix.\u003c/p\u003e","description":"","filename":"FIgure4OSD.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/6aeb0260c840e39e6bdc63d4.jpeg"},{"id":106069974,"identity":"bfb78d8f-a880-4f17-a23e-7b8070747917","added_by":"auto","created_at":"2026-04-03 06:26:12","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2008880,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/7e39b5ff-0d55-4fe8-8993-238d29833d66.pdf"},{"id":106069815,"identity":"ef7290af-c183-4c10-ab95-aa32eb12f866","added_by":"auto","created_at":"2026-04-03 06:25:39","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":441048,"visible":true,"origin":"","legend":"","description":"","filename":"SUPPLEMENTAL.docx","url":"https://assets-eu.researchsquare.com/files/rs-9036340/v1/09351b6bbbbec6b0ab3655c4.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study","fulltext":[{"header":"BACKGROUND","content":"\u003cp\u003eThe volume and complexity of patients presenting to emergency departments (EDs) have increased substantially, without proportional growth in inpatient capacity.\u003csup\u003e1\u003c/sup\u003e As a result, emergency physicians face increasing pressure to triage and disposition patients accurately and efficiently. Among admitted patients, unplanned ICU transfers are associated with higher morbidity and mortality, making early identification of patients at risk for near-term decompensation essential for the equitable and effective use of critical care resources\u003csup\u003e2\u0026ndash;8\u003c/sup\u003e. Although emergency clinicians excel at recognizing and stabilizing overt critical illness, it remains difficult to determine which patients who appear stable at the time of admission will ultimately require ICU-level care early in their hospital course.\u003c/p\u003e \u003cp\u003eThe Society of Critical Care Medicine has emphasized the lack of reliable, objective criteria for ICU triage, while endorsing the use of early warning scores (EWS) to support risk stratification.\u003csup\u003e9\u003c/sup\u003e Traditionally, EWS development has focused on mortality and ICU transfer as endpoints. Yet mortality endpoints may reflect an unpreventable outcome while failing to identify patients more likely to respond to timely interventions.\u003csup\u003e10\u003c/sup\u003e Transfer-based endpoints, while more sensitive than mortality, are at risk for bias shaped by institution-specific ICU admission practices and policies, limiting generalizability. Thus, scores like NEWS2 may miss patients who require escalation of care but lack immediate ICU transfer criteria.\u003csup\u003e10,11\u003c/sup\u003e\u003c/p\u003e \u003cp\u003eShifting focus to prediction of future organ support needs could overcome these limitations and provide important resolution with respect to delivery of specific critical care resources and appropriate disposition at the time of admission.\u003csup\u003e10\u003c/sup\u003e However, existing early warning scores, like NEWS2, appear limited in their ability to predict near-term organ support needs using clinical data collected from ED patients at the time of admission.\u003csup\u003e11,12\u003c/sup\u003e We hypothesize that addressing these limitations using novel models that incorporate longitudinal vital sign trends, in addition to laboratory and demographic data, could improve prediction of organ support needs at the time of hospital admission, thereby supporting triage of admitted patients at-risk for near-term decompensation. While similar approaches have been applied in ICU patient populations, few have evaluated such an approach could be applied to ED patient populations\u003csup\u003e13\u0026ndash;17\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eThere are multiple model architectures able to process longitudinal and multi-modal data streams; each with their own strengths and limitations. For example, linear regression is easily interpretable but fails to capture non-linear relationships between input data and outcomes of interest and requires feature engineering to first be applied to time-series data. Tree-based models, such as XG-Boost, can capture such non-linear relationships and handle missing data natively, but like regression, require feature engineering to provide temporal context to time-series data. Newer machine learning approaches, such as transformer-based neural nets, are well suited for processing missing and time-series data and can capture non-linear relationships between variables, but require vast amounts of training data and therefore may be limited in prediction of rare clinical outcomes such as organ support or mortality events. To test these strengths and limitations, and to compare performance across different model architectures, we developed three binary classifiers that leverage time series data to predict organ support or death (OSD) within 48-hours of admission from the ED.\u003c/p\u003e"},{"header":"METHODS","content":"\u003cp\u003e\u003cstrong\u003eStudy Design, Setting, and Population\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe conducted a retrospective cohort study using all adult ED encounters at a U.S. quaternary academic medical center between March 1, 2022 and February 5, 2024, where the disposition was admission to an inpatient medical service. Patients were included if they were \u0026ge;18 years old and admitted from the ED to a predefined set of adult medical services, including general medicine, subspecialty medicine (e.g., cardiology, hepatology, oncology), and intensive care units. We excluded encounters of patients admitted to surgical or psychiatric services, ED observation units, those with missing patient identifiers, or without an ED evaluation (i.e. direct admissions). Patients admitted to surgical services were excluded due to the different system of care used for the triage and initial management of traumatically injured patients. Patients who received organ support (invasive mechanical ventilation, vasopressors, or continuous renal replacement therapy) or died in the emergency department were excluded from the analysis, as they had already met the endpoint of interest. We treated each ED encounter as unique in our models such that multiple admissions from an individual patient were included and handled as described below. This study was conducted in accordance with the ethical principles outlined in the \u003cstrong\u003eDeclaration of Helsinki\u003c/strong\u003e and was approved by the \u003cstrong\u003eStanford University Institutional Review Board \u003c/strong\u003eIRB 69934. The requirement for informed consent was waived due to the retrospective nature of the study and use of de-identified electronic health record data.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003ePrimary and Secondary Outcomes\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe primary composite outcome was organ support or death within 48 hours of hospital admission. Organ support was defined as initiation of vasopressors, invasive mechanical ventilation, or continuous renal replacement therapy (CRRT). Non-invasive positive pressure ventilation and high-flow nasal oxygen were not considered organ support for the purpose of this analysis. Extracorporeal membrane oxygenation (ECMO) was also not included as organ support, as it was assumed patients received other organ support therapies before initiation of ECMO. Secondary outcomes include each type of organ support individually or death within 48 hours of admission. \u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eData Collection, Processing, and Partitioning\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eStructured electronic health record (EHR) data were extracted from the research data warehouse, including demographics, vital signs, laboratory values, and hospital course details relevant to our primary and secondary outcomes including administration of vasoactive medications, mechanical ventilation, continuous renal replacement therapy, or death. Use of vasopressors was identified via continuous infusion orders for norepinephrine, epinephrine, vasopressin, dobutamine, dopamine, or phenylephrine; push-dose pressors were excluded. Receipt of mechanical ventilation was determined using respiratory care flowsheets. Continuous renal replacement therapy was identified by dialysis nursing orders documenting initiation of CRRT. Time-to-event variables were calculated relative to the admission timestamp, and not a patient\u0026rsquo;s physical location, to assess outcome timing. \u003c/p\u003e\n\u003cp\u003ePredictor variables were chosen based on previous work exploring performance of longitudinal vital signs and laboratory values in predicting clinical outcomes among ICU patients\u003csup\u003e18\u0026ndash;23\u003c/sup\u003e. Charlson comorbidity scores were derived using validated mappings of ICD-9 and ICD-10 codes \u003csup\u003e24\u003c/sup\u003e. To include time-series data in linear and tree based models, minimum, maximum, and composite metrics for ED-based vital signs were calculated. Composite metrics were calculated by calculating the slope of the line of best fit of each vital sign with respect to time multiplied by the R^2 value to control for strength of fit.\u003csup\u003e25\u003c/sup\u003e Shock index was calculated each time a systolic blood pressure and heart rate were recorded within 5 minutes of each other. \u003c/p\u003e\n\u003cp\u003eMultiply imputed data were used for the elastic net regression models. We performed predictive mean matching (PMM) multiple imputation using the {mice} package 3.19.0.\u003csup\u003e26\u003c/sup\u003e Twenty imputations were performed using five iterations per chain. All continuous variables were standardized using z-score normalization prior to imputation. \u003c/p\u003e\n\u003cp\u003eA 75/15/10 split, constrained to maintain patient-level temporal ordering, for training, validation, and testing sets was used. To prevent data leakage from repeat patient encounters, we implemented a patient-level temporal data split. The dataset was partitioned by medical record number using a temporal ordering approach, where each patient was sorted by earliest ED arrival time, producing. All encounters for a given patient were assigned to a single partition. Training set class distributions could not be balanced due to positive class rarity; thus class imbalance was addressed using a weighted loss function. The validation and test sets retained their natural class distributions for realistic performance assessment. All data collection, processing, partitioning, and subsequent analysis was completed using R 4.5.1.\u003csup\u003e27\u003c/sup\u003e \u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eModel Development\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe trained logistic elastic net regression models to predict the primary outcome within 48-hours after admission using {glmnet} 4.1-10. \u003csup\u003e28\u003c/sup\u003e Each model was trained using 5-fold time-series cross-validation, with hyperparameters (penalty \u0026lambda; and mixture \u0026alpha;) tuned over a predefined grid (\u0026alpha; 0-1 by 0.05; \u0026lambda; 10\u003csup\u003e-4\u003c/sup\u003e-10\u003csup\u003e-1\u003c/sup\u003e by 10\u003csup\u003e-1/3\u003c/sup\u003e). The final model was selected based on the highest area under the precision recall curve (AUPRC) evaluated on validation data.\u003c/p\u003e\n\u003cp\u003eA gradient-boosted decision tree model was developed using the {xgboost} 3.1.3.1 engine to predict primary outcome within 48-hours.\u003csup\u003e29\u003c/sup\u003e A Latin hypercube sampling strategy was used to explore combinations of hyperparameters, including number of trees, learning rate, maximum tree depth, and regularization parameters. Model tuning used the same time-series cross-validation strategy as the elastic net regression. The final model was selected based on the highest area under the precision recall curve (AUPRC) evaluated on validation data.\u003c/p\u003e\n\u003cp\u003eWe developed a transformer-based neural network model using the {torch} 0.16.3 to predict the primary outcome.\u003csup\u003e30,31\u003c/sup\u003e The model processes sequences of up to 150 timesteps containing vital signs, laboratory values, patient demographics, ED length of stay, and Charlson comorbidity index. To preserve information about data availability, we implemented a dual-feature architecture where each clinical variable is accompanied by a binary indicator denoting whether the value was observed (1) or missing (0). Missing values were set to zero after normalization, allowing the model to learn appropriate uncertainty based on data completeness. \u003c/p\u003e\n\u003cp\u003eContinuous features were normalized using z-score standardization, with means and standard deviations calculated exclusively from observed (non-missing, non-padded) values in the training set; these parameters were then applied to validation and test sets for normalization. The transformer architecture consisted of 4 encoder layers with 2 attention heads, a feedforward dimension of 128, sinusoidal positional encodings to capture temporal ordering, and a 3-layer classification head with ReLU activations and dropout (0.2, 0.1) for regularization. We employed padding masks to exclude completely empty timesteps from attention computations. The model was trained using the AdamW optimizer (learning rate=1\u0026times;10⁻⁴, weight decay=1\u0026times;10⁻\u0026sup3;) with binary cross-entropy loss weighted by class frequency to address outcome imbalance. Training proceeded for 20 epochs with early stopping based on validation set precision-recall area under the curve (AUPRC), and gradient clipping (max norm=1) was applied to prevent instability. \u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003ePlatt Scaling and Model Performance Evaluation\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eModel calibration was performed using Platt scaling, where raw logits from the validation set were used to fit a logistic regression model.\u003csup\u003e32\u003c/sup\u003e Calibrated models were then used to transform raw predictions on the held-out test set into final probabilities. Performance was assessed on the held-out test set using area under the receiver operating curve (AUROC), AUPRC, sensitivity, specificity, and predictive values. The optimal threshold for binary classification was determined using Youden\u0026rsquo;s J statistic. Model discrimination for secondary outcomes was evaluated using PRC and ROC curves with AUC calculations.\u003c/p\u003e\n\u003cp\u003eWe employed SHapley Additive exPlanations (SHAP) to quantify feature importance and interpretability of model predictions in python.\u003csup\u003e33\u003c/sup\u003e We generated summary plots displaying the distribution of SHAP values for each feature, where the magnitude indicates importance and the sign indicates the direction of effect on prediction. For continuous features, color-coding represents feature values, revealing non-linear relationships and interaction effects between feature values and their impact on predictions.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eNEWS2 Calculation\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNEWS2 was calculated using the closest documented vital signs and blood gas values (arterial or venous) to inpatient admission, defined as the time of entry of an admission order to a medical inpatient service. Hypercapnic respiratory failure was defined as pCO₂ \u0026gt;45 mmHg (ABG) or \u0026gt;50 mmHg (VBG), in keeping with NEWS2 precedent. Patients without blood gas data were presumed not to be hypercapnic. Per original recommendations, NEWS2 \u0026ge;5 or any individual component score of 3 was considered high risk. \u003c/p\u003e"},{"header":"RESULTS","content":"\u003cp\u003e\u003cstrong\u003e\u003cem\u003eCohort characteristics and outcomes\u003c/em\u003e\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eOf 148,727 adult ED encounters between March 2022 and February 2024, 32,329 (21%) met all inclusion criteria and were used for model development (Figure 1). ED encounters were most commonly excluded due to discharge home (n= 105,978, 71%) or admission to a surgical or psychiatric service (n= 9,877, 6%). Among included patient encounters, the median age was 64 years, 50% were male, and 44% had greater than one co-morbidity included in the Charlson co-morbidity index. Median ED length of stay was 4.9 hours, and median hospital length of stay was 3.8 days. Cohort selection, demographics, and hospital outcomes are further summarized in Figure 1.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eSix hundred ninety-six encounters required organ support or resulted in death during their hospital admission, with 576 (81%) meeting the primary outcome of organ support or death within 48 hours of hospital admission (Figure 2A). Patients who received organ support within 48 hours of admission most frequently required only one support modality (Figure 2B). Among subtypes of organ support, vasopressor use (n = 333, 1% of total cohort) and invasive mechanical ventilation (n = 304, 0.9%) were most common, followed by CRRT (n = 45, 0.1%). There were only 34 deaths within 48 hours of admission (0.1%).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003ePatients who required organ support or died within 48 hours of admission had similar age to those that did not meet our primary outcome (66 years [43-89] vs 66 years [39-93]) but were more likely to be male (56% vs 50%) and have at least one chronic co-morbidity (56% vs 43%). ED length of stay was shorter among encounters with decompensation (median 3.9 hours [3.4] vs 5.0 hours [4.0]) while hospital length of stay was longer (8.0 days [11.9] vs 3.8 days [4.8]).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e\u003cem\u003ePerformance and feature importance across model architectures is similar\u0026nbsp;\u003c/em\u003e\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe 32,329 samples in our cohort were randomly assigned to training (n = 26,346, 81%), validation (n = 3796, 12%), and test (2,187, 7%) sets. The prevalence of primary and secondary outcomes in training, validation, and test sets were similar to those observed in the full cohort (Supplemental Figure 1).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWe trained logistic elastic net regression models to predict the primary outcome within 48-hours after admission. Hyperparameter tuning over a predefined grid (\u0026alpha; 0-1 by 0.05; \u0026lambda; 10\u003csup\u003e-4\u003c/sup\u003e-10\u003csup\u003e-1\u003c/sup\u003e by 10\u003csup\u003e-1/3\u003c/sup\u003e) identified the highest AUPRC of 0.12 for ENR with alpha 0.3 and lambda 0.04. AUPRC for validation and test sets was 0.07 (Supplement Table 1). For gradient-boosted decision tree models, a Latin hypercube sampling strategy was used to explore combinations of hyperparameters, including number of trees, learning rate, maximum tree depth, and regularization parameters. The highest AUPRC of 0.15 was achieved with a model with 652 trees of depth 3, learn rate of 5.81\u0026times;10\u003csup\u003e-7\u003c/sup\u003e, and loss reduction 3.3\u0026times;10\u003csup\u003e-6\u003c/sup\u003e. AUPRC for validation and test sets were 0.08 and 0.12, respectively (Supplement Table 1).\u0026nbsp;The small learning rate reflects convergence of the tuning procedure to the lower bound of a wide log-scaled search range. When the learning rate was constrained to conventional XGBoost values (0.001\u0026ndash;0.1), model discrimination was largely unchanged (AUPRC 0.19), suggesting performance was not sensitive to this parameter. Finally, we trained a transformer using the AdamW optimizer (learning rate=1\u0026times;10\u003csup\u003e-4\u003c/sup\u003e, weight decay=1\u0026times;10\u003csup\u003e-3\u003c/sup\u003e) with binary cross-entropy loss weighted by class frequency to address outcome imbalance. Training proceeded for 20 epochs; the strongest performing model had an AUPRC of 0.35. AUPRC for validation and test sets were 0.14 and 0.2 respectively (Supplement Table 1).\u003c/p\u003e\n\u003cp\u003e\u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; Insight into feature importance was guided by model architecture. For ENR, we used the absolute value of the model coefficients to assess feature importance; for XGB, we used average gain (i.e. the mean improvement in model loss attributed to splits on each feature across all trees); for the transformer we used SHapley Additive exPlanations (SHAP). Similar features were important to model performance across all three architectures, specifically trends in systolic blood pressure, heart rate, respirations, pulse oximetry, as well as lab values such as venous pH, lactate, and blood urea nitrogen (Supplemental Figure 2).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; Figure 3 summarizes model performance in predicting our primary and secondary outcomes. In the prediction of our primary outcome among samples in the hold-out test set, the transformer had the highest AUPRC (0.2) while the XGB had the highest AUROC (0.86). Among secondary outcomes, performance was strongest in prediction of future vasopressors and CRRT and weakest in prediction of invasive mechanical ventilation and mortality across all model architectures (Figure 3).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e\u003cem\u003eTimes series models are more sensitive than NEWS2 in predicting organ support or death within 48 hours of admission\u003c/em\u003e\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003ePreviously, we evaluated the performance of NEWS2 in prediction of future organ support or death after admission.\u003csup\u003e12\u003c/sup\u003e To compare performance of NEWS2 at time of admission to our time series models, we subset our test sample (n = 2,187) to include samples in which a NEWS2 score could be reliably calculated using our previous methods.\u003csup\u003e12\u003c/sup\u003e Among this subset of 1,545 patient encounters, 43 (2%) met our primary outcome of organ support or death within 48 hours of admission (Figure 4A).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eOf the 43 cases that met our primary outcome, 14 (33%) cases correctly identified by at least one time series model were missed by NEWS2 calculated at time of admission; 2 (4%) cases\u0026nbsp;correctly flagged by NEWS2 were\u0026nbsp;missed by all three time series models. Of the 1,497 encounters that did not involve organ support or result in death within 48 hours, 218 (14%) correctly identified by a time series model were\u0026nbsp;missed by NEWS2 calculated at time of admission; 119 (7%) encounters correctly flagged by NEWS2 were missed by all three time series models.\u003c/p\u003e\n\u003cp\u003eAll three time series models had greater sensitivity than NEWS2 in prediction of our primary outcome as well as use of vasopressors or invasive mechanical ventilation within 48 hours of admission (78-83% vs 61%, Table 1). NEWS2 had greater specificity in prediction of our primary outcome (78% vs. 70-71%). Assessment of performance in prediction of CRRT and death in this sub-group was limited by the low prevalence of CRRT and death.\u0026nbsp;\u003c/p\u003e\n\u003ctable border=\"1\" cellspacing=\"0\" cellpadding=\"0\"\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd colspan=\"9\" valign=\"top\" style=\"width: 623px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eTable 1: Comparison of Test Characteristics Among Times Series Models and NEWS2\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd colspan=\"2\" valign=\"top\" style=\"width: 132px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eNEWS2\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd colspan=\"2\" valign=\"top\" style=\"width: 132px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eENR\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd colspan=\"2\" valign=\"top\" style=\"width: 132px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eXGB\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd colspan=\"2\" valign=\"top\" style=\"width: 132px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eTransformer\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eTest Sample\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(N = 1,545)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSensitivity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSpecificity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSensitivity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSpecificity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSensitivity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSpecificity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSensitivity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eSpecificity\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eOSD\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(n = 38)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.61\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.78\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.83\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.71\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.83\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.78\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.71\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eVasopressors\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(n = 33)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.60\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.78\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.85\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.71\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.90\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.85\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.71\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eIntubation\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(n = 21)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.57\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.77\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.76\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.76\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.69\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.67\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eCRRT\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(n = 3)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.67\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.77\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.69\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 94px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eDeath\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003e(n = 2)\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.77\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.68\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e1\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 66px;\"\u003e\n \u003cp\u003e0.70\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n\u003c/table\u003e\n\u003cp\u003e\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e\u003cem\u003eCase series analysis of discordant classifications illustrates limitations of cross-sectional risk scores like NEWS2\u003c/em\u003e\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWhile feature importance highlights the role of longitudinal trends in driving model performance for predicting near-term organ support (Supplemental Figure 2), we sought to directly visualize how temporal changes in vital signs may result in misclassification of high risk patients by cross-sectional risk scores, like NEWS. To accomplish this, we conducted a case series analysis of positive samples with discordant predictions between NEWS2 and our time-series models. Figure 4A-4B illustrate discordant true positive and true negative predictions between NEWS2 and the time series models. We examined the 10 samples correctly predicted by all three time-series models to meet OSD within 48 hours of admission that were not identified by NEWS2 calculated at the time of admission (Figure 4A). We plotted the vital signs used to calculate NEWS2 with respect to time \u0026nbsp; (Figure 4C\u0026ndash;F). These figures illustrate vital sign trends over time; these trends are not captured by cross sectional risk scores and could explain differences in sensitivity observed in Table 1.\u0026nbsp;\u003c/p\u003e"},{"header":"DISCUSSION","content":"\u003cp\u003eIn this retrospective study of approximately 32,000 ED admissions, we explored whether real-world, longitudinal clinical data improved prediction of organ support or death within 48-hours of admission relative to the existing cross sectional risk score, NEWS2. A central contribution of this work is the use of organ support, rather than mortality or ICU transfer alone, as a primary endpoint. Organ support represents a potentially modifiable and temporally proximal marker of critical illness. Unlike mortality, which may reflect irreversible disease processes, or ICU transfer, which is influenced by local policy and bed availability, initiation of vasopressors, mechanical ventilation, or CRRT more directly reflects physiologic decompensation requiring intervention. Aligning prediction models with such actionable endpoints may improve relevance for future interventional studies.\u003c/p\u003e \u003cp\u003eSecond, we hypothesized that longitudinal modeling better aligns with clinician reasoning, and observed similar discrimination and improved sensitivity relative to a cross-sectional score (NEWS2), suggesting that incorporating physiologic trajectories may provide incremental predictive information beyond admission-time snapshots. The similar discrimination achieved across regression, tree-based, and transformer models suggests that the predictive signal may lie primarily in the longitudinal physiologic information itself rather than in any specific modeling architecture. This finding is important, as it indicates that improvements in prediction may stem from better alignment of features with clinical trajectories rather than increasing algorithmic complexity.\u003c/p\u003e \u003cp\u003eWhile encouraging, discrimination in the \u0026ldquo;80/80\u0026rdquo; range implies substantial misclassification. At low outcome prevalence (1.7%), even modest false-positive rates generate a large number of alerts relative to true events, raising questions about acceptable tradeoffs between sensitivity and alert burden. Our study was not designed to determine whether this level of misclassification is clinically acceptable, nor whether the model adds value beyond careful physician assessment. Future work must quantify the net clinical benefit across plausible risk thresholds and compare model performance directly against clinician gestalt in prospective settings.\u003c/p\u003e \u003cp\u003eThis study has several strengths. First, we analyzed a contemporary patient cohort from a large quaternary care center, capturing real-world clinical practice and preserving outcome prevalence in validation and test sets. Second, the use of patient-level temporal splits minimized data leakage and provide a rigorous assessment of generalizability. Third, we present a novel framework for predicting organ support outcomes by demonstrating models trained to predict composite outcomes (i.e. vasopressor, intubation, CRRT, or death) are also able to predict individual organ support outcomes with comparable sensitivity and specificity. This has important implications for future efforts to predict organ support that may be constrained by limited outcome prevalence, proving especially relevant for deep learning models like transformers. Fourth, we compared three distinct modeling paradigms against an existing risk score, NEWS2. Others have shown similar superior performance of machine learning models in predicting ICU admission, however, we chose organ support or death, which is subject to less institutional and provider variability and to increase generalizability.\u003csup\u003e34\u0026ndash;36\u003c/sup\u003e Finally, this analysis aimed to compare performance across different model architectures, a crucial step prior to external validation and deployment.\u003c/p\u003e \u003cp\u003eThis study also has important limitations. It was conducted at a single academic medical center, limiting evaluation of generalizability to settings with different patient populations, resource availability, and ED workflows. The two-year timeframe constrained the number of positive cases and class imbalance may have influenced model behavior despite use of weighted loss functions. Although we selected a composite endpoint to address concerns that critically ill patients often require multiple organ support modalities, many patients in our cohort required only one form of support within the 48-hour window. Finally, although transformers demonstrated encouraging discrimination, their performance likely remains sensitive to the quality, density, and ordering of time-series data. Missingness patterns in EHR data are non-random and may encode clinical decision-making rather than pure physiology; while our dual-feature representation attempts to capture this, it may not fully resolve sampling bias. Finally, our models were trained on structured EHR data drawn from a heterogeneous population encompassing diverse disease states and organ failures. While this reflects real-world practice, it limits mechanistic insight and biological interpretability. Future efforts integrating physiologic trajectories with molecular or disease-specific data may offer greater explanatory power and potentially improved discrimination.\u003c/p\u003e"},{"header":"CONCLUSIONS","content":"\u003cp\u003eOrgan support represents a potentially modifiable and temporally proximal marker of critical illness. Models trained to interpret longitudinal trends in clinical variables\u0026mdash;rather than cross-sectional snapshots\u0026mdash;may better mirror clinician reasoning. Our findings support the hypothesis that incorporating longitudinal data improves sensitivity in predicting near-term organ support or death.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003eEthics approval and consent to participate:\u0026nbsp;This study was approved by the institutional review board (IRB 69934). \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eConsent for publication:\u0026nbsp;Not applicable. The study used de-identified data and did not include identifiable individual information\u003c/p\u003e\n\u003cp\u003eAvailability of data and materials: The datasets generated and analyzed during the current study are not publicly available due institutional data privacy policies but are available from the corresponding author on reasonable request.\u003c/p\u003e\n\u003cp\u003eCompeting interests: The authors of this manuscript have no competing interests to declare.\u003c/p\u003e\n\u003cp\u003eFunding:\u0026nbsp;This work was supported, in part, by a 2025-2026 Society for Academic Emergency Medicine (SAEM) resident research grant (RE2022- 0000000651) and by the following NIH funding source of Stanford\u0026rsquo;s Center for Clinical and Translational Education and Research award, under the Biostatistics, Epidemiology and Research Design (BERD) Program: UM1TR004921.\u003c/p\u003e\n\u003cp\u003eAuthor contributions: SRC, JW, and DAK conceived of the initial research question and study design. SRC, DAK, and HH developed and implemented the statistical analysis plan. CR coordinated data acquisition from the institutional data warehouse. SRC, KL, and JW drafted the initial manuscript. All authors contributed to editing of the final manuscript.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eAcknowledgements: The authors would like to acknowledge the Stanford\u0026rsquo;s Emergency Department Analytics Committee (EDAC) for their contributions to EHR database management that supported this project.\u0026nbsp;\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eWeinick, R. M., Bruna, S., Boicourt, R. M., Michael, S. S. \u0026amp; Sessums, L. L. AHRQ summit to address emergency department boarding. https://www.ahrq.gov/sites/default/files/wysiwyg/topics/ed-boarding-summit-report.pdf (2025).\u003c/li\u003e\n\u003cli\u003eDelgado, M. K. \u003cem\u003eet al.\u003c/em\u003e Risk factors for unplanned transfer to intensive care within 24 hours of admission from the emergency department in an integrated healthcare system. \u003cem\u003eJ. Hosp. Med.\u003c/em\u003e \u003cstrong\u003e8\u003c/strong\u003e, 13\u0026ndash;19 (2013).\u003c/li\u003e\n\u003cli\u003eDahn, C. M. \u003cem\u003eet al.\u003c/em\u003e A critical analysis of unplanned ICU transfer within 48 hours from ED admission as a quality measure. \u003cem\u003eAm. J. Emerg. Med.\u003c/em\u003e \u003cstrong\u003e34\u003c/strong\u003e, 1505\u0026ndash;1510 (2016).\u003c/li\u003e\n\u003cli\u003eSolano, J. J. \u003cem\u003eet al.\u003c/em\u003e Hospital ward transfer to intensive care unit as a quality marker in emergency medicine. \u003cem\u003eAm. J. Emerg. Med.\u003c/em\u003e \u003cstrong\u003e35\u003c/strong\u003e, 753\u0026ndash;756 (2017).\u003c/li\u003e\n\u003cli\u003eEscobar, G. J. \u003cem\u003eet al.\u003c/em\u003e Intra-hospital transfers to a higher level of care: contribution to total hospital and intensive care unit (ICU) mortality and length of stay (LOS). \u003cem\u003eJ. Hosp. Med.\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, 74\u0026ndash;80 (2011).\u003c/li\u003e\n\u003cli\u003eNates, J. L. \u003cem\u003eet al.\u003c/em\u003e ICU admission, discharge, and triage guidelines: A framework to enhance clinical operations, development of institutional policies, and further research. \u003cem\u003eCrit. Care Med.\u003c/em\u003e \u003cstrong\u003e44\u003c/strong\u003e, 1553\u0026ndash;1602 (2016).\u003c/li\u003e\n\u003cli\u003eFernando, S. M. \u003cem\u003eet al.\u003c/em\u003e Emergency Department disposition decisions and associated mortality and costs in ICU patients with suspected infection. \u003cem\u003eCrit. Care\u003c/em\u003e \u003cstrong\u003e22\u003c/strong\u003e, (2018).\u003c/li\u003e\n\u003cli\u003eWeissman, G. E. \u003cem\u003eet al.\u003c/em\u003e Potentially preventable intensive care unit admissions in the United States, 2006-2015. \u003cem\u003eAnn. Am. Thorac. Soc.\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, 81\u0026ndash;88 (2020).\u003c/li\u003e\n\u003cli\u003eHonarmand, K. \u003cem\u003eet al.\u003c/em\u003e Society of critical care medicine guidelines on recognizing and responding to clinical deterioration outside the ICU: 2023. \u003cem\u003eCrit. Care Med.\u003c/em\u003e \u003cstrong\u003e52\u003c/strong\u003e, 314\u0026ndash;330 (2024).\u003c/li\u003e\n\u003cli\u003eGoodacre, S. Using clinical risk models to predict outcomes: what are we predicting and why? \u003cem\u003eEmerg. Med. J.\u003c/em\u003e \u003cstrong\u003e40\u003c/strong\u003e, 728\u0026ndash;730 (2023).\u003c/li\u003e\n\u003cli\u003eGoodacre, S., Sutton, L., Fuller, G., Trimble, A. \u0026amp; Pilbery, R. Accuracy of the National Early Warning Score version 2 (NEWS2) in predicting need for time-critical treatment: retrospective observational cohort study. \u003cem\u003eEmerg. Med. J.\u003c/em\u003e \u003cstrong\u003e42\u003c/strong\u003e, 396\u0026ndash;402 (2025).\u003c/li\u003e\n\u003cli\u003eChiacchia, S. R. \u003cem\u003eet al.\u003c/em\u003e Early warning score performance at time of admission in the prediction of future organ support needs. \u003cem\u003eAcad. Emerg. Med.\u003c/em\u003e (2025) doi:10.1111/acem.70182.\u003c/li\u003e\n\u003cli\u003eChoi, A. \u003cem\u003eet al.\u003c/em\u003e Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department. \u003cem\u003eSci. Rep.\u003c/em\u003e \u003cstrong\u003e13\u003c/strong\u003e, 8561 (2023).\u003c/li\u003e\n\u003cli\u003eKwak, G. H., Ling, L. \u0026amp; Hui, P. Predicting the need for vasopressors in the Intensive Care unit using an attention based deep learning model. \u003cem\u003eShock\u003c/em\u003e \u003cstrong\u003e56\u003c/strong\u003e, 73\u0026ndash;79 (2021).\u003c/li\u003e\n\u003cli\u003eDuval, L. \u003cem\u003eet al.\u003c/em\u003e Early prediction of vasopressor initiation in ICU sepsis patients using an interpretable EHR-based ML model. \u003cem\u003eBMC Med. Inform. Decis. Mak.\u003c/em\u003e \u003cstrong\u003e25\u003c/strong\u003e, 442 (2025).\u003c/li\u003e\n\u003cli\u003eContreras, M. \u003cem\u003eet al.\u003c/em\u003e Real-time prediction of intensive care unit patient acuity and therapy requirements using state-space modelling. \u003cem\u003eNat. Commun.\u003c/em\u003e \u003cstrong\u003e16\u003c/strong\u003e, 7315 (2025).\u003c/li\u003e\n\u003cli\u003eHyland, S. L. \u003cem\u003eet al.\u003c/em\u003e Early prediction of circulatory failure in the intensive care unit using machine learning. \u003cem\u003eNat. Med.\u003c/em\u003e \u003cstrong\u003e26\u003c/strong\u003e, 364\u0026ndash;373 (2020).\u003c/li\u003e\n\u003cli\u003eThorsen-Meyer, H.-C. \u003cem\u003eet al.\u003c/em\u003e Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. \u003cem\u003eLancet Digit. Health\u003c/em\u003e \u003cstrong\u003e2\u003c/strong\u003e, e179\u0026ndash;e191 (2020).\u003c/li\u003e\n\u003cli\u003eMeiring, C. \u003cem\u003eet al.\u003c/em\u003e Optimal intensive care outcome prediction over time using machine learning. \u003cem\u003ePLoS One\u003c/em\u003e \u003cstrong\u003e13\u003c/strong\u003e, e0206862 (2018).\u003c/li\u003e\n\u003cli\u003eBrekke, I. J., Puntervoll, L. H., Pedersen, P. B., Kellett, J. \u0026amp; Brabrand, M. The value of vital sign trends in predicting and monitoring clinical deterioration: A systematic review. \u003cem\u003ePLoS One\u003c/em\u003e \u003cstrong\u003e14\u003c/strong\u003e, e0210875 (2019).\u003c/li\u003e\n\u003cli\u003eZheng, Z. \u003cem\u003eet al.\u003c/em\u003e Development and validation of a dynamic real-time risk prediction model for Intensive Care units patients based on longitudinal irregular data: Multicenter retrospective study. \u003cem\u003eJ. Med. Internet Res.\u003c/em\u003e \u003cstrong\u003e27\u003c/strong\u003e, e69293 (2025).\u003c/li\u003e\n\u003cli\u003eDeng, Y. \u003cem\u003eet al.\u003c/em\u003e Explainable time-series deep learning models for the prediction of mortality, prolonged length of stay and 30-day readmission in intensive care patients. \u003cem\u003eFront. Med. (Lausanne)\u003c/em\u003e \u003cstrong\u003e9\u003c/strong\u003e, 933037 (2022).\u003c/li\u003e\n\u003cli\u003eRedfern, O. C. \u003cem\u003eet al.\u003c/em\u003e Predicting in-hospital mortality and unanticipated admissions to the intensive care unit using routinely collected blood tests and vital signs: Development and validation of a multivariable model. \u003cem\u003eResuscitation\u003c/em\u003e \u003cstrong\u003e133\u003c/strong\u003e, 75\u0026ndash;81 (2018).\u003c/li\u003e\n\u003cli\u003eDeyo, R. A., Cherkin, D. C. \u0026amp; Ciol, M. A. Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases. \u003cem\u003eJ. Clin. Epidemiol.\u003c/em\u003e \u003cstrong\u003e45\u003c/strong\u003e, 613\u0026ndash;619 (1992).\u003c/li\u003e\n\u003cli\u003eStein, D. F. \u003cem\u003eet al.\u003c/em\u003e Prediction cardiovascular deterioration in a paediatric intensive care unit (PicEWS): a machine learning modelling study of routinely collected health-care data. \u003cem\u003eEClinicalMedicine\u003c/em\u003e \u003cstrong\u003e85\u003c/strong\u003e, 103255 (2025).\u003c/li\u003e\n\u003cli\u003evan Buuren, S. \u0026amp; Groothuis-Oudshoorn, K. mice: Multivariate Imputation by Chained Equations inR. \u003cem\u003eJ. Stat. Softw.\u003c/em\u003e \u003cstrong\u003e45\u003c/strong\u003e, (2011).\u003c/li\u003e\n\u003cli\u003eThe R project for statistical computing. https://www.R-project.org/.\u003c/li\u003e\n\u003cli\u003eTay, J. K., Narasimhan, B. \u0026amp; Hastie, T. Elastic net regularization paths for all generalized linear models. \u003cem\u003eJ. Stat. Softw.\u003c/em\u003e \u003cstrong\u003e106\u003c/strong\u003e, (2023).\u003c/li\u003e\n\u003cli\u003eChen, T. \u0026amp; Guestrin, C. XGBoost: A Scalable Tree Boosting System. in \u003cem\u003eProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining\u003c/em\u003e (ACM, New York, NY, USA, 2016). doi:10.1145/2939672.2939785.\u003c/li\u003e\n\u003cli\u003eFalbel D, L. J. \u003cem\u003etorch: Tensors and Neural Networks with \u0026ldquo;GPU\u0026rdquo; Acceleration\u003c/em\u003e. R package version 0.16.3. \u003cem\u003ehttps://torch.mlverse.org/docs.\u003c/em\u003e (2025).\u003c/li\u003e\n\u003cli\u003eVaswani, A. \u003cem\u003eet al.\u003c/em\u003e Attention is all you need. \u003cem\u003earXiv [cs.CL]\u003c/em\u003e (2025) doi:10.65215/2q58a426.\u003c/li\u003e\n\u003cli\u003ePlatt1999.\u003c/li\u003e\n\u003cli\u003eLundberg, S. \u0026amp; Lee, S.-I. A unified approach to interpreting model predictions. \u003cem\u003earXiv [cs.AI]\u003c/em\u003e (2017).\u003c/li\u003e\n\u003cli\u003eJoseph, J. W. \u003cem\u003eet al.\u003c/em\u003e Deep-learning approaches to identify critically Ill patients at emergency department triage using limited information. \u003cem\u003eJ. Am. Coll. Emerg. Physicians Open\u003c/em\u003e \u003cstrong\u003e1\u003c/strong\u003e, 773\u0026ndash;781 (2020).\u003c/li\u003e\n\u003cli\u003eNguyen, M. \u003cem\u003eet al.\u003c/em\u003e Developing machine learning models to personalize care levels among emergency room patients for hospital admission. \u003cem\u003eJ. Am. Med. Inform. Assoc.\u003c/em\u003e \u003cstrong\u003e28\u003c/strong\u003e, 2423\u0026ndash;2432 (2021).\u003c/li\u003e\n\u003cli\u003eBoulitsakis Logothetis, S., Green, D., Holland, M. \u0026amp; Al Moubayed, N. Predicting acute clinical deterioration with interpretable machine learning to support emergency care decision making. \u003cem\u003eSci. Rep.\u003c/em\u003e \u003cstrong\u003e13\u003c/strong\u003e, 13563 (2023). \u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"bmc-emergency-medicine","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"emmd","sideBox":"Learn more about [BMC Emergency Medicine](http://bmcemergmed.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/emmd","title":"BMC Emergency Medicine","twitterHandle":"@BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"emergency critical care, organ support, time-series analysis","lastPublishedDoi":"10.21203/rs.3.rs-9036340/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9036340/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eBackground\u003c/h2\u003e \u003cp\u003ePrediction of organ support needs, rather than mortality or critical care transfer alone, may improve the utility of early warning scores (EWS). Existing EWS may have limited sensitivity in predicting organ support due to reliance on cross-sectional snapshots of patient physiology, limiting their ability to account for changes in patient status. We aimed to develop and compare novel models capable of using longitudinal clinical data to predict organ support or death (OSD) within 48 hours of hospital admission.\u003c/p\u003e\u003ch2\u003eMethods\u003c/h2\u003e \u003cp\u003eWe leveraged a retrospective cohort of adult ED encounters at a U.S. quaternary academic medical center from March 1, 2022, to February 5, 2024. Encounters were included if patients were \u0026ge;\u0026thinsp;18 years and admitted to a medical service; those receiving organ support in the ED were excluded. The primary outcome was a composite of vasopressor initiation, invasive mechanical ventilation, continuous renal replacement therapy, or death within 48 hours of admission. Performance metrics included AUROC, AUPRC, sensitivity, and specificity.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003e1.7% (549/32,329) experienced organ support or death within 48 hours of admission. The transformer-based neural net demonstrated the strongest overall performance, with an AUROC of 0.84 and AUPRC of 0.20, outperforming the baseline to National Early Warning Score 2 (NEWS2) with higher sensitivity for the primary outcome (0.78 vs. 0.61) while maintaining sufficient specificity (0.71 vs. 0.83). XGBoost and elastic-net regression showed similar improvements in sensitivity (both 0.83) with modest reductions in specificity relative to NEWS2 calculated at time of admission.\u003c/p\u003e\u003ch2\u003eConclusions\u003c/h2\u003e \u003cp\u003eOrgan support represents a potentially modifiable and temporally proximal marker of critical illness. Models trained to interpret longitudinal trends in clinical variables\u0026mdash;rather than cross-sectional snapshots\u0026mdash;may better mirror clinician reasoning.\u003c/p\u003e","manuscriptTitle":"Predicting Future Organ Support Needs Using Longitudinal Emergency Department Data: A Proof-of-Concept Study","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-04-03 06:23:35","doi":"10.21203/rs.3.rs-9036340/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"reviewerAgreed","content":"320951258746997121570997403217440817463","date":"2026-04-05T13:55:51+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-03-30T04:26:26+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-03-13T10:59:51+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-03-12T09:24:17+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-03-12T09:23:43+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Emergency Medicine","date":"2026-03-05T05:41:08+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"bmc-emergency-medicine","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"emmd","sideBox":"Learn more about [BMC Emergency Medicine](http://bmcemergmed.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/emmd","title":"BMC Emergency Medicine","twitterHandle":"@BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"2978e352-1062-46a3-b962-502b414eca68","owner":[],"postedDate":"April 3rd, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-04-03T06:23:35+00:00","versionOfRecord":[],"versionCreatedAt":"2026-04-03 06:23:35","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9036340","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9036340","identity":"rs-9036340","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00