ShinyEvents: harmonizing longitudinal data for real world survival estimation | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article ShinyEvents: harmonizing longitudinal data for real world survival estimation Alyssa Obermayer, Joshua Davis, Divya Priyanka Talada, Mingxiang Teng, and 18 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7231850/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 13 Jan, 2026 Read the published version in npj Precision Oncology → Version 1 posted 13 You are reading this latest preprint version Abstract Longitudinal data analysis of the patient’s treatment course is critical to uncovering variables that influence outcomes. However, existing tools have significant limitations in integrating multilayered time-series data. Here, we developed ShinyEvents, a web-based framework for complex longitudinal data analysis. ShinyEvents allows users to upload data and generate interactive timelines of the patient’s clinical events. Our tool can perform cohort-level analysis, including the assignment of treatment clusters and clinical endpoints. Our tool also provides informative cohort visualizations, such as a Sankey diagram of the treatment line and Swimmer diagram of the clinical course. Finally, our tool can infer a real-world progression-free survival (rwPFS) based on user-defined endpoints to perform Kaplan-Meier and Cox proportional hazards regression analysis. With these features, the tool can then associate the lines of treatment with clinical outcomes. Altogether, ShinyEvents facilitates the integration of multilayered longitudinal data and enables survival analysis in real-time. A live link to the tool is available https://shawlab-moffitt.shinyapps.io/shinyevents/ . Biological sciences/Computational biology and bioinformatics Health sciences/Health care Physical sciences/Mathematics and computing Health sciences/Medical research Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Introduction Longitudinal real-world data (RWD) are complex and heterogeneous, with inconsistent quality that makes them difficult to utilize for several reasons 1 . First, RWD is observational and often unstructured, varying in naming due to different documentation practices across providers 2 , 3 . Second, RWD can be challenging to analyze due to their overwhelming size, as they are generated from dynamic datasets from private and public repositories, such as the Oncology Research Information Exchange Network (ORIEN) AVATAR 4 – 8 , FLATIRON 9 , and AACR Project GENIE 10 . Third, RWD may lack important annotations for research purposes, such as treatment response labeling or surrogate endpoints 1 . Therefore, establishing standardized approaches for visualizing longitudinal data will be essential to leverage RWD to inform best practices in clinical decision-making and for the planning of future clinical trials. Several tools have been developed for visualizing longitudinal clinical data, including cBioPortal 11 , OncoThreads 12 , Read-TV 13 , PlotTwist 14 , and Pergola-web 15 . These tools can be categorized based on the following objectives: 1) visualization of outcome measurements over time, 2) the sequence of the treatment course relative to the genomics data, and 3) the identification of temporal patterns. While these longitudinal visualization methods are useful in their respective field, they were primarily designed with a single focus, such as genomic profiling or blood/serum measurements. Notably, there is no existing tool for cohort-level analysis of treatment-associated information from RWD, especially tools that can define treatment response and surragate endpoint for survival analysis, which is critical for disease trajectory modeling 16 , 17 and personalized decision support 18 . To address these challenges, we developed a web-based viewer, ShinyEvents, that facilitates the analysis and presentation of longitudinal data focusing on the length of treatment, clinical response, and overall survival (Fig. 1 ). In addition to the time-series visualization, the tool can cluster treatment-associated data into treatment lines, define endpoints for real-world progression-free survival (rwPFS), and summarize population characteristics. We applied a public GENIE non-small cell lung cancer (NSCLC) cohort as well a dataset of patients with non-metastatic bladder cancer treated by cystectomy at Moffitt Cancer Center to demonstrate the functionalities of ShinyEvents. Results ShinyEvents Preprocessing with the GENIE non-small cell lung cancer cohort example As an example, we assembled a population of non-metastatic (stage II and III) NSCLC patients with adenocarcinoma histology from GENIE 19 . We collected data associated with the patient demographics, pathological staging, gene panel sequencing, treatment, imaging, and clinical notes. We filtered the cohort based on diagnosis stage II and III and restricted it to adenocarcinoma patients treated with Cisplatin/Vinorelbine or Cisplatin/Pemetrexed (n = 71). During the preprocessing step, the ShinyEvent would guide the user in generating an Events table indexed based on the patient’s age and their unique identifier. Furthermore, the following characteristics were defined for each event: event name (time-series name), treatment (oncologic medicine, surgery, and radiation), response (potential end point), and duration of the event (start and end time points). For each patient, the tool would generate a Swimmer's plot to visualize the multi-dimensional data available for that patient. An example of a patient with an extensive history of adenocarcinoma lung cancer is presented in Fig. 2 , with each category of longitudinal data harmonized across the timeline. Key Preprocessing steps are documented in the supplementary method and tutorial page in https://shawlab-moffitt.shinyapps.io/shinyevents/ . Navigating in ShinyEvents ShinyEvents offers four categories of analysis. 1) Patient Visual Analytics, 2) Treatment-associated Analytics, 3) Time-to-Event Analysis, and 4) Cohort Overview. Two key files are required as initial input. 1) a tab-separated text or Excel file with longitudinal event information. 2) a parameter file defining the Data table, Event Name, Category, Start Time, and End Time. The user can assign events as Treatment, Pathology, Clinical Response, and Imaging Assessment, which will facilitate the downstream analytical workflow. In general, the tool is organized with user input selection on the left panel of the graphical user interface, while the right panel displays the visual output, accompanied by a downloadable table below. When setting up an institutional local instance of ShinyEvents, the tool can be password protected if personal health information is included. A live version of the tool is accessible from https://shawlab-moffitt.shinyapps.io/shinyevents/ . The Shiny app does not save or retain user uploaded files. A complete list of features is described in Table 1 and below. Table 1 List of Features. Feature Description Patient Visual Analytics Patient Timeline Single patient view Event Summary A simplified summary of events Treatment-associated Analytics Sankey Visualization Tracking changes in lines-of-treatment Heatmap Visualization Visualizing the treatment frequency, duration, and combination. Swimmer's Plot Cohort level visualization of a particular event Outlier Analysis Identify extended treatment regimen Time-to-Event Analytics Swimmer's Plot Visualization of the Time origin and Event time Survival Analysis Kaplan Meier and Cox proportional hazard analysis. Cohort Overview Event Summary A table of all events Swimmer's Plot Cohort level visualization of all events Annotate Event Position Integrate table Cohort Stratification Patient Stratification Patient Visual Analytics offers a patient-centric view of longitudinal data. The user can perform generalized filtering of the patient population. Major clinical events, such as metastasis and progressive disease, can be highlighted in the visualization. Based on the filtered patient population, the user can explore the patient’s events throughout the treatment course with mouse-over functionality to highlight additional details of an event. As cancer care often involves a multimodal regimen encompassing systemic therapies, surgical procedures, and radiation, representing these interventions as an aggregated event facilitates computational modeling and downstream analytical workflows. Thus, we implemented an approach to perform clustering of temporally proximal events within a user-defined timeframe (i.e., one month window). The aggregated events can then be visualized in the Patient Event Summary tab (Fig. 3 A). Treatment-associated Analytics provides the user the ability to aggregate treatments into lines-of-treatment (Fig. 3 B). Each treatment within a specified window is grouped together as individual treatment lines, which are named the Treatment Summary Cluster (Fig. 4 A). A Sankey plot can then be used to visualize the lines of treatment across the cohort (Fig. 4 B), where the width of the bars and lines represent the number of patients in that group and their treatment path. For example, in GENIE, patients with lung adenocarcinoma were first treated with Cisplatin/Pemetrexed or Cisplatin/Vinorelbine followed by diverse second line treatments which included EGFR-inhibitors (e.g., Erlotinib, Osimertinib, Afatinib), ALK-inhibitors (Alectinib/Crizotinib), Bevacizumab (anti-VEGF-antibody), Carboplatin, and immune checkpoint inhibitors (e.g., Pembrolizumab, Nivolumab, Durvalumab, Atezolizumab, and Ipilimumab). This can then be further visualized using a Heatmap (Fig. 4 C), which illustrates the frequency of a particular treatment across treatment lines and individual patients ( Supplementary S1 ). Overall, the Sankey plot and heatmap shows primary treatments with Cisplatin/Pemetrexed or Cisplatin/Vinorelbine followed by next line treatment with either 1) an investigational agent (clinical trial) if available, 2) salvage therapy with next line chemotherapy (carboplatin), 3) profiling with EGFRmut or ALK-fusion to give targeted agents, 4) immunotherapy agents, 5) and CDK4/6 inhibitor (palbociclib) after multiple lines of unsuccessful therapy. The duration of the treatment can be evaluated by a Swimmer’s plot, Box plot, and Heatmap to identify patients with a durable treatment. Altogether, the analytics can group patients with similar treatment regimens for downstream analysis. Time-to-Event Analysis enables users to perform survival analysis (Fig. 3 C). To perform survival analysis, the tool enables the user to define start and end points based on the patient’s timeline. An overview of the endpoint definition categories is presented in Table 2 . Typically, the starting point is based on an oncologic treatment, treatment line, diagnosis, and molecular profiling of a subgroup of patients with a similar clinical baseline. The combination of endpoints would then dictate the type of clinical endpoint analysis. To estimate overall survival, we would use death as an endpoint. To estimate time to treatment change, we would use the next treatment line as an endpoint. To estimate a real-world progression-free survival (rwPFS), we would use the following four endpoints as progression events, including pathological recurrence, death, and radiological progression/metastasis. The output of these estimations can then be utilized for downstream survival analysis 20 . Altogether, ShinyEvents facilitates the estimation of rwPFS from the longitudinal data. Table 2 Start and end point definitions. Start End point Overall Survival From Diagnosis Death or last follow-up/contact Time-to-treatment-end Treatment Start Change in Treatment Treatment-associated toxicity Death Real-World Progression Free Survival Treatment Start Change in Treatment-Line Progression/Metastasis Death Cohort Overview ShinyEvents provides several methods for summarizing cohort-level information (Fig. 3 D). Based on the GENIE lung adenocarcinoma population, the clinical table’s data elements can be viewed through an expanded tree viewer (Supplementary Figure S2A ). The user can subset the data and generate a tabular summary of molecular and patient/sample data (Supplementary Figure S2B ). A Swimmers plot can then be used to highlight the clinical course of each patient as well as cross cohort heterogeneity (Fig. 5 ). From the Swimmer’s plot, the user can cursor over (mouse over) each event for a dropdown with additional details, such as location of treatment site. The tool can stratify patients based on clinicopathologic characteristics, such as sex, clinical grade, and tumor histology. Furthermore, the user can position the major event (such as medication) relative to another event (i.e., sequencing data), annotating the molecular profiling before or after a particular drug regimen for downstream sequencing analysis. Altogether, this collection of tools offers several important preprocessing step overviews. A live version of the GENIE data is available https://shawlab-moffitt.shinyapps.io/aacr_genie_nsclc_squamouscell_shinyevents/ , and a tutorial of the tools is also made available through our GitHub page https://github.com/shawlab-moffitt/shinyEvents . Use case. Comparing real-world progression-free survival between line-of-treatment For our separate use case example, we assembled a population of patients diagnosed with non-metastatic muscle-invasive bladder cancer treated with neoadjuvant chemotherapy followed by surgical resection at Moffitt Cancer Center who were also enrolled in the Oncology Research Information Exchange Network (ORIEN) AVATAR project. We collected tables associated with diagnosis, outcome, vital status, metastatic disease, and treatment. Altogether, the cohort consisted of 51 patients with a median age of 64 (range: 45–82) years at diagnosis. We have implemented a general strategy to define treatment-associated rwPFS. We first summarized the treatments rendered into treatment lines, revealing that the patients are predominantly treated with neoadjuvant Gemcitabine, and Cisplatin/or Carboplatin as first-line therapy ( Supplementary Figure S3A ). Using a Sankey plot, we found that line 1 primarily consisted of the Carboplatin Gemcitabine combination (n = 10) and the Cisplatin Gemcitabine combination (n = 21) ( Supplementary Figure S3A, B ). To estimate an rwPFS, we defined progression end points to be associated with treatment change, progression/metastasis, and death (Fig. 6 A). To perform this analysis, the user would need to perform the following steps (Fig. 6 B: 1) define a window for clustering the samples ( Supplementary Figure S4A ), 2) select a starting time point based on a specific treatment ( Supplementary Figure S4B ), 3) defining the surrogate endpoint ( Supplementary Figure S4C ), and 4) define the two groups for survival analysis ( Supplementary Figure S4D ). A tutorial video is presented in the tutorial section, Time-To-Event Analysis . A Swimmer’s plot was used to verify the start and end point of the patient’s clinical course (Fig. 6 C). We performed a Kaplan-Meier analysis and found that patients treated with Carboplatin/Gemcitabine tend to have a worse clinical outcome compared to those treated with the Cisplatin/Gemcitabine combination (Fig. 6 D). Clinically, Carboplatin may be given to patients with poor renal function or performance status, potentially explaining this observation. Discussion The visualization of longitudinal data improves our understanding of the clinical course encompassing various lines of treatment rendered 21 . Current longitudinal data analytics primarily focuses on a single modality that is genomic-centric, which is often limited to a single biopsy or surgery throughout the treatment course 22 . Notably, there is a lack of visualization tools that can facilitate time-to-event analysis. Here we developed a ShinyEvents framework, that offers an expanded function for the user to define clinical start and endpoint for real-world survival analysis. Our use case highlights our tool’s ability to evaluate treatment regimen and its association with progression-free survival in a real-world bladder cancer cohort from Moffitt Cancer Center. Our tool can also be easily adapted to other publicly available longitudinal datasets, such as GENIE and Flatiron. We anticipate that this tool will be valuable to other institutions, allowing them to effectively utilize their private institutional patient treatment data for meaningful analysis and to derive clinical insights. As the field moves toward developing personalized models grounded in RWD, effective preprocessing of time-series data and extraction of clinically relevant features will be essential to realizing the potential of the digital twin technology 23 . To accomplish this goal, it will be critical to maintain effective interaction between clinician, bioinformatician, biostatistician, and machine learning experts, as each brings a unique expertise that maximizes the utility of the patient’s longitudinal clinical information 24 – 26 . One of the major goals of ShinyEvents is to provide web visualization for effective interdisciplinary communication. The integration of clinical knowledge from experienced clinicians is critical to the development of a valid digital twin model, ensuring that the model captures the complexity of patient care and aligns with real-world clinical practice. Furthermore, there is the need to ensure robustness of the applied statistics and genomic inquiries that are required, which is often dependent on the chosen clinical end point. As the field advances toward big data–driven science, it is crucial to avoid indiscriminate data input, especially in machine learning applications where the risk of selection bias is pronounced. In summary, ShinyEvents fulfills a critical unmet need of a web-based tool that facilitates joint interaction between clinicians and data scientists to enable the development of robust and clinically meaningful models. One major limitation of our tool is that it is not optimized for changepoint analysis from longitudinal measurements without extensive preprocessing. In its current version, the tool is designed for case-by-case evaluation of a patient’s disease; however, while it is theoretically possible, a systematic algorithm to determine disease stage (such as early non-metastatic, locally advanced, and metastatic) over the course of their disease can be established. We aim to incorporate these features in future versions of the tool. Methods Sofware Implemention R Shiny and environmental setup. The ShinyEvents R Shiny application and package was built using R version 4.4.1 and can be installed via GitHub. To setup a ShinyEvents application interface a parameter file is required. This file outlines the data tables, event column names, and event descriptions which guide the app during startup to generate a comprehensive event data table which fuels the applications functionality. A detailed explanation of how to set up this parameter file can be found in the Github Page https://github.com/shawlab-moffitt/shinyEvents . To deploy the application the app folder should be structured to include the ‘app.R’ script, the ‘R’ folder, the parameter file, and the supplementary files that are annotated in the parameter file. At the top of the ‘app.R’ script there is a line to designate the file name and path of the parameter file “Event_Param_File” and if previously processed, the user can define the event data file and patient annotation file. With these criteria met, the user can deploy the app via the ‘Run App’ button or using the ‘runApp’ R function. Online User Tutorial Page Within the ShinyEvents application users can view a tutorial page with an in-depth guide on file formatting as well as video tutorials and helpful text that overview the user interface. In the guide on file formatting, we breakdown the input file options to further explain what data is required at minimum and provide descriptions and examples for the file contents ( Supplementary Figure S5A, S5B ). This will cover the event data file and optional supplementary data that can be uploaded directly through the application interface, including a figure outlining how these files can be connected in the app. For more advanced use, we demonstrate how to set up the applications working directory and assemble a parameter file that can be read in through the ‘app.R’ script which guides the generation of detailed event data for exploration in the app. To assist users, we provide an R function that can be used as a pre-processing step to derive the event data from the parameter file. This is recommended when deploying the app with a parameter file or working with larger data sets, as the application startup may require additional computing time if the event data needs to be generated. Additionally, we include tutorial videos and helpful tips that demonstrate the features available in the app. Online Example Source code and tutorial to ShinyEvents can be accessed on our GitHub page https://github.com/shawlab-moffitt/shinyEvents which also contains example data, instructions on setting up a local environment for the application, and R functions to assist in data pre-processing. A version of the tool is accessible by https://shawlab-moffitt.shinyapps.io/shinyevents/ . Here users can load example data provided in the app to view the appropriate formatting and explore the features available. Within the app we provide a tutorial page, helpful tips, as well as a guided layout to simplify data upload and app customization. A Zenodo DOI is available https://doi.org/10.5281/zenodo.16527381 . Quantification and Statistical Analysis Highlight details about the Kaplan Meier analysis, Fisher’s Exact Test. Statistical tests were based on functions implemented in R. Declarations Competing Interests G.D.G, T.I.S., and D.C have a patent pending based on the submitted work. Author Contribution Contribution: A.O., T.I.S., led the software development and testing. J.D., D.P.T, M.T., D.S., D.D., S.E., V.Y., X.W., A.T., J.G., G.D.G., contributed to the software development and testing; B.J.M., A.A.T., S.G., J.M., R.L., R.R.P., G.D.G., provided guidance on the clinical interpretation; V.Y., D.C., provided guidance on statistical analysis. A.T., R.J.R., M.L.C., R.L., G.D.G., provided patient samples and clinical data. A.O., G.D.G., D.C., T.I.S. designed the study, analyzed data, and wrote the manuscript; and A.T., G.D.G., D.C., T.I.S. oversaw the study. All authors assisted in preparation of the manuscript. Acknowledgement This work has been supported in part by the Biostatistics and Bioinformatics Shared Resource at the Moffitt Cancer Center (NCI P30 CA076292), the Moffitt Cancer Center Department of Biostatistics and Bioinformatics Pilot Project (T.I.S.). Funding for this project was provided by the Department of Defense, Grant No. HT94252510691 (T.I.S., B.M.), Florida Department of Health, Grant No. MOAAX (D.C.), and National Institute of Health, Grant No. R21 CA286417-01, (D.C.) and Grant No. T32 CA233399-04 (J.D.). The authors thank all the ORIEN AVATAR Collaborative Members. Data Availability Software code has been deposited inside the Zenodo repository: https://doi.org/10.5281/zenodo.16527381. References Verkerk, K. & Voest, E. E. Generating and using real-world data: A worthwhile uphill battle. Cell 187 , 1636-1650 (2024). https://doi.org:10.1016/j.cell.2024.02.012 Liu, F. & Panagiotakos, D. Real-world data: a brief review of the methods, applications, challenges and opportunities. BMC Med Res Methodol 22 , 287 (2022). https://doi.org:10.1186/s12874-022-01768-6 Ramsey, S. D., Onar-Thomas, A. & Wheeler, S. B. Real-World Database Studies in Oncology: A Call for Standards. J Clin Oncol 42 , 977-980 (2024). https://doi.org:10.1200/JCO.23.02399 Soupir, A. et al. Genomic, transcriptomic, and immunogenomic landscape of over 1300 sarcomas of diverse histology subtypes. Nat Commun 16 , 4206 (2025). https://doi.org:10.1038/s41467-025-58678-6 Park, M. A. et al. Molecular Pathway and Immune Profile Analysis of IPMN-Derived Versus PanIN-Derived Pancreatic Ductal Adenocarcinomas. Int J Mol Sci 25 (2024). https://doi.org:10.3390/ijms252313164 Wang, X. et al. Multicellular immune ecotypes within solid tumors predict real-world therapeutic benefits with immune checkpoint inhibitors. medRxiv (2024). https://doi.org:10.1101/2024.07.19.24310726 Eule, C. J. et al. Clinical and Genomic Features of Patients with Renal Cell Carcinoma and Advanced Chronic Kidney Disease: Analysis of a Multi-Institutional Database. Cancers (Basel) 16 (2024). https://doi.org:10.3390/cancers16101920 Demetriou, A. N. et al. Profiling the molecular and clinical landscape of glioblastoma utilizing the Oncology Research Information Exchange Network brain cancer database. Neurooncol Adv 6 , vdae046 (2024). https://doi.org:10.1093/noajnl/vdae046 Castellanos, E. H., Wittmershaus, B. K. & Chandwani, S. Raising the Bar for Real-World Data in Oncology: Approaches to Quality Across Multiple Dimensions. JCO Clin Cancer Inform 8 , e2300046 (2024). https://doi.org:10.1200/CCI.23.00046 Consortium, A. P. G. AACR Project GENIE: Powering Precision Medicine through an International Consortium. Cancer Discov 7 , 818-831 (2017). https://doi.org:10.1158/2159-8290.CD-17-0151 de Bruijn, I. et al. Analysis and Visualization of Longitudinal Genomic and Clinical Data from the AACR Project GENIE Biopharma Collaborative in cBioPortal. Cancer Res 83 , 3861-3867 (2023). https://doi.org:10.1158/0008-5472.CAN-23-0816 Harbig, T. A. et al. OncoThreads: visualization of large-scale longitudinal cancer molecular data. Bioinformatics 37 , i59-i66 (2021). https://doi.org:10.1093/bioinformatics/btab289 Del Gaizo, J., Catchpole, K. R. & Alekseyenko, A. V. Research and Exploratory Analysis Driven-Time-data Visualization (read-tv) software. JAMIA Open 4 , ooab007 (2021). https://doi.org:10.1093/jamiaopen/ooab007 Goedhart, J. PlotTwist: A web app for plotting and annotating continuous data. PLoS Biol 18 , e3000581 (2020). https://doi.org:10.1371/journal.pbio.3000581 Espinosa-Carrasco, J. et al. Pergola-web: a web server for the visualization and analysis of longitudinal behavioral data using repurposed genomics tools and standards. Nucleic Acids Res 47 , W600-W604 (2019). https://doi.org:10.1093/nar/gkz414 Maheux, E. et al. Forecasting individual progression trajectories in Alzheimer's disease. Nat Commun 14 , 761 (2023). https://doi.org:10.1038/s41467-022-35712-5 Nguena Nguefack, H. L. et al. Trajectory Modelling Techniques Useful to Epidemiological Research: A Comparative Narrative Review of Approaches. Clin Epidemiol 12 , 1205-1222 (2020). https://doi.org:10.2147/CLEP.S265287 Johnson, K. B. et al. Precision Medicine, AI, and the Future of Personalized Health Care. Clin Transl Sci 14 , 86-93 (2021). https://doi.org:10.1111/cts.12884 Choudhury, N. J. et al. The GENIE BPC NSCLC Cohort: A Real-World Repository Integrating Standardized Clinical and Genomic Data for 1,846 Patients with Non-Small Cell Lung Cancer. Clin Cancer Res 29 , 3418-3428 (2023). https://doi.org:10.1158/1078-0432.CCR-23-0580 Obermayer, A. N. et al. PATH-SURVEYOR: pathway level survival enquiry for immuno-oncology and drug repurposing. BMC Bioinformatics 24 , 266 (2023). https://doi.org:10.1186/s12859-023-05393-y Turchioe, M. R. et al. A Systematic Review of Patient-Facing Visualizations of Personal Health Data. Appl Clin Inform 10 , 751-770 (2019). https://doi.org:10.1055/s-0039-1697592 Avila, M. & Meric-Bernstam, F. Next-generation sequencing for the general cancer patient. Clin Adv Hematol Oncol 17 , 447-454 (2019). Stahlberg, E. A. et al. Exploring approaches for predictive cancer patient digital twins: Opportunities for collaboration and innovation. Front Digit Health 4 , 1007784 (2022). https://doi.org:10.3389/fdgth.2022.1007784 Altman, R. B. The interactions between clinical informatics and bioinformatics: a case study. J Am Med Inform Assoc 7 , 439-443 (2000). https://doi.org:10.1136/jamia.2000.0070439 Martin-Sanchez, F. et al. Synergy between medical informatics and bioinformatics: facilitating genomic medicine for future health care. J Biomed Inform 37 , 30-42 (2004). https://doi.org:10.1016/j.jbi.2003.09.003 Alowais, S. A. et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Med Educ 23 , 689 (2023). https://doi.org:10.1186/s12909-023-04698-z Additional Declarations Competing interest reported. G.D.G, T.I.S., and D.C have a patent pending based on the submitted work. Supplementary Files ShinyEventsSupplementaryFiguresv120250724.pdf Cite Share Download PDF Status: Published Journal Publication published 13 Jan, 2026 Read the published version in npj Precision Oncology → Version 1 posted Editorial decision: Revision requested 11 Sep, 2025 Reviews received at journal 09 Sep, 2025 Reviews received at journal 07 Sep, 2025 Reviews received at journal 05 Sep, 2025 Reviewers agreed at journal 03 Sep, 2025 Reviewers agreed at journal 31 Aug, 2025 Reviewers agreed at journal 31 Aug, 2025 Reviewers agreed at journal 17 Aug, 2025 Reviewers agreed at journal 17 Aug, 2025 Reviewers invited by journal 11 Aug, 2025 Editor assigned by journal 11 Aug, 2025 Submission checks completed at journal 31 Jul, 2025 First submitted to journal 28 Jul, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7231850","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":494045811,"identity":"56a5e408-7c90-40b0-9bae-696d7aafe339","order_by":0,"name":"Alyssa Obermayer","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Alyssa","middleName":"","lastName":"Obermayer","suffix":""},{"id":494045812,"identity":"534eb23a-34fa-409e-8dd8-0c52e42b7124","order_by":1,"name":"Joshua Davis","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Joshua","middleName":"","lastName":"Davis","suffix":""},{"id":494045813,"identity":"7c8207a1-0fdd-4461-9d46-f4ce0ab6783f","order_by":2,"name":"Divya Priyanka Talada","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Divya","middleName":"Priyanka","lastName":"Talada","suffix":""},{"id":494045814,"identity":"b2b8a8f6-86b1-4f3d-9687-5e36d1a70f41","order_by":3,"name":"Mingxiang Teng","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Mingxiang","middleName":"","lastName":"Teng","suffix":""},{"id":494045815,"identity":"27242e70-4597-435a-96a7-fba9c2724706","order_by":4,"name":"Steven Eschrich","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Steven","middleName":"","lastName":"Eschrich","suffix":""},{"id":494045816,"identity":"984e4c76-4590-4165-9b51-c5b1ca177023","order_by":5,"name":"Vivien Yin","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Vivien","middleName":"","lastName":"Yin","suffix":""},{"id":494045817,"identity":"cf298eba-832b-4b17-8d4c-d4b4d668424a","order_by":6,"name":"Daniel Spakowicz","email":"","orcid":"","institution":"The Ohio State University Comprehensive Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Daniel","middleName":"","lastName":"Spakowicz","suffix":""},{"id":494045818,"identity":"96787e2d-8b55-4cf1-8d76-f371150c1849","order_by":7,"name":"Dipankor Dhrubo","email":"","orcid":"","institution":"The Ohio State University Comprehensive Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Dipankor","middleName":"","lastName":"Dhrubo","suffix":""},{"id":494045819,"identity":"7ac0c5f8-be4f-48cf-b973-ef8cbcba4cc9","order_by":8,"name":"Robert J Rounbehler","email":"","orcid":"","institution":"Aster Insights","correspondingAuthor":false,"prefix":"","firstName":"Robert","middleName":"J","lastName":"Rounbehler","suffix":""},{"id":494045820,"identity":"295abad7-cc95-496e-9d32-1d4c9e83aaa9","order_by":9,"name":"Michelle L. Churchman","email":"","orcid":"","institution":"Aster Insights","correspondingAuthor":false,"prefix":"","firstName":"Michelle","middleName":"L.","lastName":"Churchman","suffix":""},{"id":494045821,"identity":"dda11ca7-3960-49ec-b11c-10109e97b1b2","order_by":10,"name":"Ahmad A. Tarhini","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Ahmad","middleName":"A.","lastName":"Tarhini","suffix":""},{"id":494045822,"identity":"fb781b9c-e379-4f60-be16-4be9336d8229","order_by":11,"name":"Xuefeng Wang","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Xuefeng","middleName":"","lastName":"Wang","suffix":""},{"id":494045823,"identity":"03f1f467-799b-46db-869d-00bf8838d062","order_by":12,"name":"Sumati Gupta","email":"","orcid":"","institution":"Huntsman Cancer Institute","correspondingAuthor":false,"prefix":"","firstName":"Sumati","middleName":"","lastName":"Gupta","suffix":""},{"id":494045824,"identity":"2d84122b-ebf6-41d1-9a61-b80dcda0fa66","order_by":13,"name":"Joseph Markowitz","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Joseph","middleName":"","lastName":"Markowitz","suffix":""},{"id":494045825,"identity":"43088075-e822-4b21-8b27-5953de16e01d","order_by":14,"name":"Jeremy Goecks","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Jeremy","middleName":"","lastName":"Goecks","suffix":""},{"id":494045826,"identity":"cf488922-92fd-44f8-81e0-50feacdd296b","order_by":15,"name":"Roger Li","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Roger","middleName":"","lastName":"Li","suffix":""},{"id":494045827,"identity":"7eb9edd1-acc4-4c56-8711-20e9a992831b","order_by":16,"name":"Rodrigo Rodriguez-Pessoa","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Rodrigo","middleName":"","lastName":"Rodriguez-Pessoa","suffix":""},{"id":494045828,"identity":"55a16a25-58d5-46eb-948f-6baeb3fccf04","order_by":17,"name":"Brandon J. Manley","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Brandon","middleName":"J.","lastName":"Manley","suffix":""},{"id":494045829,"identity":"56a81e65-7bb9-4115-9fc0-331e9315d0ae","order_by":18,"name":"Aik-Choon Tan","email":"","orcid":"","institution":"Huntsman Cancer Institute","correspondingAuthor":false,"prefix":"","firstName":"Aik-Choon","middleName":"","lastName":"Tan","suffix":""},{"id":494045830,"identity":"c52ea9ce-0eaa-4d8e-aaff-24af0c7c931c","order_by":19,"name":"G Daniel Grass","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"G","middleName":"Daniel","lastName":"Grass","suffix":""},{"id":494045831,"identity":"0d867c70-575d-4095-9d83-53ed8fa3f88b","order_by":20,"name":"Dung-tsa Chen","email":"","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":false,"prefix":"","firstName":"Dung-tsa","middleName":"","lastName":"Chen","suffix":""},{"id":494045832,"identity":"4f320fd0-a58f-4365-893c-e4451c0dbdab","order_by":21,"name":"Timothy I. Shaw","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA/UlEQVRIiWNgGAWjYBAC+8NAgrGBgUF+BpDxgYGBn72ZgBY2ZiBxEKjF4AZQK1CbZM9hZpAReLQwwLRIMDAw84C0HCCkhZ078fHHHYcZDKSbn0nb5thI8LDzH3/AUHHPDpc+NmbezQYHzxxmkJ9zzEw6d1uaBA8zyJYzxcl4tGyTONgGDIQbCWa3c7cdrrMHaWFsS0jG7X24lvRvty23HYbYQqSWHLPbjEha7PBo2Wxwti2dx+BGTvnPXohfDGcknElIwKmF/+zGB5Vt1nLyM9I3G/zcBgwx/oMPPnyoSLDHpQUKmnlQ+UArEhvwa6nDFCJkyygYBaNgFIwcAABNB1Ub/xEpIQAAAABJRU5ErkJggg==","orcid":"","institution":"H. Lee Moffitt Cancer Center and Research Institute","correspondingAuthor":true,"prefix":"","firstName":"Timothy","middleName":"I.","lastName":"Shaw","suffix":""}],"badges":[],"createdAt":"2025-07-28 09:08:07","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7231850/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7231850/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s41698-025-01212-0","type":"published","date":"2026-01-13T16:28:26+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":88410089,"identity":"59b0aea6-02ad-4add-8128-40c759f38271","added_by":"auto","created_at":"2025-08-06 08:22:03","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":202365,"visible":true,"origin":"","legend":"\u003cp\u003eShinyEvents integrates longitudinal data as input. The tool then aligns and summarizes the events into clusters. The tool provides effective visualization while allowing the user to define informative real-world PFS, which can then be associated with other biomarkers generated within the study.\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/b83a91e6241f03e964fd8c2c.png"},{"id":88410088,"identity":"1d5940ad-ea9c-4b56-af90-8e0dba67d263","added_by":"auto","created_at":"2025-08-06 08:22:03","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":269133,"visible":true,"origin":"","legend":"\u003cp\u003ePatient-level summary of a patient with lung adenocarcinoma. Each block of longitudinal data is highlighted with a different color. The “Summary Block” provides a high-level summary of each set of longitudinal data.\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/b53a03aa74d7d9773995459d.png"},{"id":88410093,"identity":"358202e0-61c9-4a71-87bb-4c269663ab27","added_by":"auto","created_at":"2025-08-06 08:22:03","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":226415,"visible":true,"origin":"","legend":"\u003cp\u003eShinyEvents Tab-based Navigation. \u003cstrong\u003eA)\u003c/strong\u003e Patient Visual Analytics. \u003cstrong\u003eB)\u003c/strong\u003e Treatment Associated Analytics. \u003cstrong\u003eC)\u003c/strong\u003e Time-to-Event Analytics. \u003cstrong\u003eD)\u003c/strong\u003e Cohort Overview.\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/58e0d615c61f5a7f90752fc4.png"},{"id":88411915,"identity":"28bd3638-66fd-4ee7-b6fd-79bc66c5a6a3","added_by":"auto","created_at":"2025-08-06 08:30:03","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":334175,"visible":true,"origin":"","legend":"\u003cp\u003eShinyEvents facilitates the exploration of the treatment data. \u003cstrong\u003eA)\u003c/strong\u003e The treatment for each patient is grouped into Treatment Summary Cluster, defining the treatment lines. \u003cstrong\u003eB)\u003c/strong\u003e The user can define treatment clusters and visualized as a Sankey plot of the treatment line. \u003cstrong\u003eC)\u003c/strong\u003e The heatmap shows the number of patients treated with the drug in each treatment line. Treatment cluster based on a 3 month window in the GENIE NSCLC cohort.\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/5cd56adfd7688bd24c078dca.png"},{"id":88411918,"identity":"bf16039a-3e01-4a7e-9d17-abbac4134de6","added_by":"auto","created_at":"2025-08-06 08:30:04","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":377154,"visible":true,"origin":"","legend":"\u003cp\u003eShinyEvents facilitates the generation of a Swimmer’s Plot summary. Each node shows an event summary.\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/4360d7671dccbbfb1799af94.png"},{"id":88410094,"identity":"be46da31-d3cf-4a01-97db-ca782cff3586","added_by":"auto","created_at":"2025-08-06 08:22:03","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":135032,"visible":true,"origin":"","legend":"\u003cp\u003eDefining informative endpoints. \u003cstrong\u003eA)\u003c/strong\u003e Endpoint definition that can be defined. \u003cstrong\u003eB)\u003c/strong\u003eFour key decisions required in the user interface. These include 1. setting the treatment clustering window, 2. selecting the starting time, 3. defining the surrogate end point, and 4. defining the groups for the survival analysis. \u003cstrong\u003eC)\u003c/strong\u003e Swimmer plot example showing starting neo-adjuvant treatment with Cisplatin/Gemcitabine or Carboplatin/Gemcitabine in patients with bladder cancer; the endpoint defined was based on treatment change, death, progression, and metastasis. \u003cstrong\u003eD)\u003c/strong\u003eThe real-world progression-free survival of patients with bladder cancer was stratified based on treatment with carboplatin- or cisplatin-based treatment.\u003c/p\u003e","description":"","filename":"6.png","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/bc9c6575966b348f18e0d0a9.png"},{"id":100615729,"identity":"f5837cf3-5208-4717-acec-d74d9273f937","added_by":"auto","created_at":"2026-01-19 17:36:11","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2168437,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/c5eb95a9-42e0-4f7b-baac-bdaeb77a2aab.pdf"},{"id":88411914,"identity":"7a6707ed-574d-41a4-97bd-e636a4747c09","added_by":"auto","created_at":"2025-08-06 08:30:03","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":1307457,"visible":true,"origin":"","legend":"","description":"","filename":"ShinyEventsSupplementaryFiguresv120250724.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7231850/v1/75dc190436d999e60c1c6767.pdf"}],"financialInterests":"Competing interest reported. G.D.G, T.I.S., and D.C have a patent pending based on the submitted work.","formattedTitle":"ShinyEvents: harmonizing longitudinal data for real world survival estimation","fulltext":[{"header":"Introduction","content":"\u003cp\u003eLongitudinal real-world data (RWD) are complex and heterogeneous, with inconsistent quality that makes them difficult to utilize for several reasons\u003csup\u003e\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u003c/sup\u003e. First, RWD is observational and often unstructured, varying in naming due to different documentation practices across providers \u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e,\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u003c/sup\u003e. Second, RWD can be challenging to analyze due to their overwhelming size, as they are generated from dynamic datasets from private and public repositories, such as the Oncology Research Information Exchange Network (ORIEN) AVATAR \u003csup\u003e\u003cspan additionalcitationids=\"CR5 CR6 CR7\" citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e\u003c/sup\u003e, FLATIRON \u003csup\u003e\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e\u003c/sup\u003e, and AACR Project GENIE \u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e\u003c/sup\u003e. Third, RWD may lack important annotations for research purposes, such as treatment response labeling or surrogate endpoints\u003csup\u003e\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u003c/sup\u003e. Therefore, establishing standardized approaches for visualizing longitudinal data will be essential to leverage RWD to inform best practices in clinical decision-making and for the planning of future clinical trials.\u003c/p\u003e\u003cp\u003eSeveral tools have been developed for visualizing longitudinal clinical data, including cBioPortal\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e, OncoThreads\u003csup\u003e\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e\u003c/sup\u003e, Read-TV\u003csup\u003e\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e\u003c/sup\u003e, PlotTwist\u003csup\u003e\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e\u003c/sup\u003e, and Pergola-web\u003csup\u003e\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e\u003c/sup\u003e. These tools can be categorized based on the following objectives: 1) visualization of outcome measurements over time, 2) the sequence of the treatment course relative to the genomics data, and 3) the identification of temporal patterns. While these longitudinal visualization methods are useful in their respective field, they were primarily designed with a single focus, such as genomic profiling or blood/serum measurements. Notably, there is no existing tool for cohort-level analysis of treatment-associated information from RWD, especially tools that can define treatment response and surragate endpoint for survival analysis, which is critical for disease trajectory modeling\u003csup\u003e\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e,\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e\u003c/sup\u003e and personalized decision support\u003csup\u003e\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eTo address these challenges, we developed a web-based viewer, ShinyEvents, that facilitates the analysis and presentation of longitudinal data focusing on the length of treatment, clinical response, and overall survival (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e). In addition to the time-series visualization, the tool can cluster treatment-associated data into treatment lines, define endpoints for real-world progression-free survival (rwPFS), and summarize population characteristics. We applied a public GENIE non-small cell lung cancer (NSCLC) cohort as well a dataset of patients with non-metastatic bladder cancer treated by cystectomy at Moffitt Cancer Center to demonstrate the functionalities of ShinyEvents.\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e\u003cb\u003eShinyEvents Preprocessing with the GENIE non-small cell lung cancer cohort example\u003c/b\u003e\u003c/p\u003e\u003cp\u003eAs an example, we assembled a population of non-metastatic (stage II and III) NSCLC patients with adenocarcinoma histology from GENIE\u003csup\u003e\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e\u003c/sup\u003e. We collected data associated with the patient demographics, pathological staging, gene panel sequencing, treatment, imaging, and clinical notes. We filtered the cohort based on diagnosis stage II and III and restricted it to adenocarcinoma patients treated with Cisplatin/Vinorelbine or Cisplatin/Pemetrexed (n\u0026thinsp;=\u0026thinsp;71). During the preprocessing step, the ShinyEvent would guide the user in generating an Events table indexed based on the patient\u0026rsquo;s age and their unique identifier. Furthermore, the following characteristics were defined for each event: event name (time-series name), treatment (oncologic medicine, surgery, and radiation), response (potential end point), and duration of the event (start and end time points). For each patient, the tool would generate a Swimmer's plot to visualize the multi-dimensional data available for that patient. An example of a patient with an extensive history of adenocarcinoma lung cancer is presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e, with each category of longitudinal data harmonized across the timeline. Key Preprocessing steps are documented in the supplementary method and tutorial page in \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://shawlab-moffitt.shinyapps.io/shinyevents/\u003c/span\u003e\u003cspan address=\"https://shawlab-moffitt.shinyapps.io/shinyevents/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003eNavigating in ShinyEvents\u003c/b\u003e\u003c/p\u003e\u003cp\u003eShinyEvents offers four categories of analysis. 1) Patient Visual Analytics, 2) Treatment-associated Analytics, 3) Time-to-Event Analysis, and 4) Cohort Overview. Two key files are required as initial input. 1) a tab-separated text or Excel file with longitudinal event information. 2) a parameter file defining the Data table, Event Name, Category, Start Time, and End Time. The user can assign events as Treatment, Pathology, Clinical Response, and Imaging Assessment, which will facilitate the downstream analytical workflow. In general, the tool is organized with user input selection on the left panel of the graphical user interface, while the right panel displays the visual output, accompanied by a downloadable table below. When setting up an institutional local instance of ShinyEvents, the tool can be password protected if personal health information is included. A live version of the tool is accessible from \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://shawlab-moffitt.shinyapps.io/shinyevents/\u003c/span\u003e\u003cspan address=\"https://shawlab-moffitt.shinyapps.io/shinyevents/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. The Shiny app does not save or retain user uploaded files. A complete list of features is described in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e and below.\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003cp\u003eList of Features.\u003c/p\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"3\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003eFeature\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eDescription\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003ePatient Visual Analytics\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003ePatient Timeline\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eSingle patient view\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eEvent Summary\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eA simplified summary of events\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eTreatment-associated Analytics\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSankey Visualization\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eTracking changes in lines-of-treatment\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eHeatmap Visualization\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eVisualizing the treatment frequency, duration, and combination.\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSwimmer's Plot\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eCohort level visualization of a particular event\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eOutlier Analysis\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eIdentify extended treatment regimen\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eTime-to-Event Analytics\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSwimmer's Plot\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eVisualization of the Time origin and Event time\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSurvival Analysis\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eKaplan Meier and Cox proportional hazard analysis.\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eCohort Overview\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eEvent Summary\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eA table of all events\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSwimmer's Plot\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eCohort level visualization of all events\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eAnnotate Event Position\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eIntegrate table\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eCohort Stratification\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003ePatient Stratification\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003ePatient Visual Analytics\u003c/b\u003e offers a patient-centric view of longitudinal data. The user can perform generalized filtering of the patient population. Major clinical events, such as metastasis and progressive disease, can be highlighted in the visualization. Based on the filtered patient population, the user can explore the patient\u0026rsquo;s events throughout the treatment course with mouse-over functionality to highlight additional details of an event. As cancer care often involves a multimodal regimen encompassing systemic therapies, surgical procedures, and radiation, representing these interventions as an aggregated event facilitates computational modeling and downstream analytical workflows. Thus, we implemented an approach to perform clustering of temporally proximal events within a user-defined timeframe (i.e., one month window). The aggregated events can then be visualized in the Patient Event Summary tab (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA).\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003eTreatment-associated Analytics\u003c/b\u003e provides the user the ability to aggregate treatments into lines-of-treatment (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB). Each treatment within a specified window is grouped together as individual treatment lines, which are named the \u003cb\u003eTreatment Summary Cluster\u003c/b\u003e (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eA). A Sankey plot can then be used to visualize the lines of treatment across the cohort (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eB), where the width of the bars and lines represent the number of patients in that group and their treatment path. For example, in GENIE, patients with lung adenocarcinoma were first treated with Cisplatin/Pemetrexed or Cisplatin/Vinorelbine followed by diverse second line treatments which included EGFR-inhibitors (e.g., Erlotinib, Osimertinib, Afatinib), ALK-inhibitors (Alectinib/Crizotinib), Bevacizumab (anti-VEGF-antibody), Carboplatin, and immune checkpoint inhibitors (e.g., Pembrolizumab, Nivolumab, Durvalumab, Atezolizumab, and Ipilimumab). This can then be further visualized using a Heatmap (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eC), which illustrates the frequency of a particular treatment across treatment lines and individual patients (\u003cb\u003eSupplementary S1\u003c/b\u003e). Overall, the Sankey plot and heatmap shows primary treatments with Cisplatin/Pemetrexed or Cisplatin/Vinorelbine followed by next line treatment with either 1) an investigational agent (clinical trial) if available, 2) salvage therapy with next line chemotherapy (carboplatin), 3) profiling with EGFRmut or ALK-fusion to give targeted agents, 4) immunotherapy agents, 5) and CDK4/6 inhibitor (palbociclib) after multiple lines of unsuccessful therapy. The duration of the treatment can be evaluated by a Swimmer\u0026rsquo;s plot, Box plot, and Heatmap to identify patients with a durable treatment. Altogether, the analytics can group patients with similar treatment regimens for downstream analysis.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003eTime-to-Event Analysis\u003c/b\u003e enables users to perform survival analysis (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eC). To perform survival analysis, the tool enables the user to define start and end points based on the patient\u0026rsquo;s timeline. An overview of the endpoint definition categories is presented in Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e. Typically, the starting point is based on an oncologic treatment, treatment line, diagnosis, and molecular profiling of a subgroup of patients with a similar clinical baseline. The combination of endpoints would then dictate the type of clinical endpoint analysis. To estimate overall survival, we would use death as an endpoint. To estimate time to treatment change, we would use the next treatment line as an endpoint. To estimate a real-world progression-free survival (rwPFS), we would use the following four endpoints as progression events, including pathological recurrence, death, and radiological progression/metastasis. The output of these estimations can then be utilized for downstream survival analysis\u003csup\u003e\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e\u003c/sup\u003e. Altogether, ShinyEvents facilitates the estimation of rwPFS from the longitudinal data.\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003cp\u003eStart and end point definitions.\u003c/p\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"3\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003eStart\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eEnd point\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eOverall Survival\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eFrom Diagnosis\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eDeath or last follow-up/contact\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eTime-to-treatment-end\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eTreatment Start\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eChange in Treatment\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eTreatment-associated toxicity\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eDeath\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eReal-World Progression Free Survival\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003eTreatment Start\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eChange in Treatment-Line\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eProgression/Metastasis\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003eDeath\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003eCohort Overview\u003c/b\u003e ShinyEvents provides several methods for summarizing cohort-level information (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD). Based on the GENIE lung adenocarcinoma population, the clinical table\u0026rsquo;s data elements can be viewed through an expanded tree viewer \u003cb\u003e(Supplementary Figure S2A\u003c/b\u003e). The user can subset the data and generate a tabular summary of molecular and patient/sample data \u003cb\u003e(Supplementary Figure S2B\u003c/b\u003e). A Swimmers plot can then be used to highlight the clinical course of each patient as well as cross cohort heterogeneity (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e). From the Swimmer\u0026rsquo;s plot, the user can cursor over (mouse over) each event for a dropdown with additional details, such as location of treatment site. The tool can stratify patients based on clinicopathologic characteristics, such as sex, clinical grade, and tumor histology. Furthermore, the user can position the major event (such as medication) relative to another event (i.e., sequencing data), annotating the molecular profiling before or after a particular drug regimen for downstream sequencing analysis. Altogether, this collection of tools offers several important preprocessing step overviews.\u003c/p\u003e\u003cp\u003eA live version of the GENIE data is available \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://shawlab-moffitt.shinyapps.io/aacr_genie_nsclc_squamouscell_shinyevents/\u003c/span\u003e\u003cspan address=\"https://shawlab-moffitt.shinyapps.io/aacr_genie_nsclc_squamouscell_shinyevents/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e, and a tutorial of the tools is also made available through our GitHub page \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/shawlab-moffitt/shinyEvents\u003c/span\u003e\u003cspan address=\"https://github.com/shawlab-moffitt/shinyEvents\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e\u003cp\u003e\u003cb\u003eUse case. Comparing real-world progression-free survival between line-of-treatment\u003c/b\u003e\u003c/p\u003e\u003cp\u003eFor our separate use case example, we assembled a population of patients diagnosed with non-metastatic muscle-invasive bladder cancer treated with neoadjuvant chemotherapy followed by surgical resection at Moffitt Cancer Center who were also enrolled in the Oncology Research Information Exchange Network (ORIEN) AVATAR project. We collected tables associated with diagnosis, outcome, vital status, metastatic disease, and treatment. Altogether, the cohort consisted of 51 patients with a median age of 64 (range: 45\u0026ndash;82) years at diagnosis.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eWe have implemented a general strategy to define treatment-associated rwPFS. We first summarized the treatments rendered into treatment lines, revealing that the patients are predominantly treated with neoadjuvant Gemcitabine, and Cisplatin/or Carboplatin as first-line therapy (\u003cb\u003eSupplementary Figure S3A\u003c/b\u003e). Using a Sankey plot, we found that line 1 primarily consisted of the Carboplatin Gemcitabine combination (n\u0026thinsp;=\u0026thinsp;10) and the Cisplatin Gemcitabine combination (n\u0026thinsp;=\u0026thinsp;21) (\u003cb\u003eSupplementary Figure S3A, B\u003c/b\u003e). To estimate an rwPFS, we defined progression end points to be associated with treatment change, progression/metastasis, and death (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eA). To perform this analysis, the user would need to perform the following steps (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eB: 1) define a window for clustering the samples (\u003cb\u003eSupplementary Figure S4A\u003c/b\u003e), 2) select a starting time point based on a specific treatment (\u003cb\u003eSupplementary Figure S4B\u003c/b\u003e), 3) defining the surrogate endpoint (\u003cb\u003eSupplementary Figure S4C\u003c/b\u003e), and 4) define the two groups for survival analysis (\u003cb\u003eSupplementary Figure S4D\u003c/b\u003e). A tutorial video is presented in the tutorial section, \u003cb\u003eTime-To-Event Analysis\u003c/b\u003e. A Swimmer\u0026rsquo;s plot was used to verify the start and end point of the patient\u0026rsquo;s clinical course (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eC). We performed a Kaplan-Meier analysis and found that patients treated with Carboplatin/Gemcitabine tend to have a worse clinical outcome compared to those treated with the Cisplatin/Gemcitabine combination (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eD). Clinically, Carboplatin may be given to patients with poor renal function or performance status, potentially explaining this observation.\u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eThe visualization of longitudinal data improves our understanding of the clinical course encompassing various lines of treatment rendered \u003csup\u003e\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e\u003c/sup\u003e. Current longitudinal data analytics primarily focuses on a single modality that is genomic-centric, which is often limited to a single biopsy or surgery throughout the treatment course \u003csup\u003e\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e\u003c/sup\u003e. Notably, there is a lack of visualization tools that can facilitate time-to-event analysis. Here we developed a ShinyEvents framework, that offers an expanded function for the user to define clinical start and endpoint for real-world survival analysis. Our use case highlights our tool’s ability to evaluate treatment regimen and its association with progression-free survival in a real-world bladder cancer cohort from Moffitt Cancer Center. Our tool can also be easily adapted to other publicly available longitudinal datasets, such as GENIE and Flatiron. We anticipate that this tool will be valuable to other institutions, allowing them to effectively utilize their private institutional patient treatment data for meaningful analysis and to derive clinical insights.\u003c/p\u003e\u003cp\u003eAs the field moves toward developing personalized models grounded in RWD, effective preprocessing of time-series data and extraction of clinically relevant features will be essential to realizing the potential of the digital twin technology \u003csup\u003e\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e\u003c/sup\u003e. To accomplish this goal, it will be critical to maintain effective interaction between clinician, bioinformatician, biostatistician, and machine learning experts, as each brings a unique expertise that maximizes the utility of the patient’s longitudinal clinical information \u003csup\u003e\u003cspan additionalcitationids=\"CR25\" citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e–\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e\u003c/sup\u003e. One of the major goals of ShinyEvents is to provide web visualization for effective interdisciplinary communication. The integration of clinical knowledge from experienced clinicians is critical to the development of a valid digital twin model, ensuring that the model captures the complexity of patient care and aligns with real-world clinical practice. Furthermore, there is the need to ensure robustness of the applied statistics and genomic inquiries that are required, which is often dependent on the chosen clinical end point. As the field advances toward big data–driven science, it is crucial to avoid indiscriminate data input, especially in machine learning applications where the risk of selection bias is pronounced. In summary, ShinyEvents fulfills a critical unmet need of a web-based tool that facilitates joint interaction between clinicians and data scientists to enable the development of robust and clinically meaningful models.\u003c/p\u003e\u003cp\u003eOne major limitation of our tool is that it is not optimized for changepoint analysis from longitudinal measurements without extensive preprocessing. In its current version, the tool is designed for case-by-case evaluation of a patient’s disease; however, while it is theoretically possible, a systematic algorithm to determine disease stage (such as early non-metastatic, locally advanced, and metastatic) over the course of their disease can be established. We aim to incorporate these features in future versions of the tool.\u003c/p\u003e"},{"header":"Methods","content":"\u003cp\u003e\u003cb\u003eSofware Implemention R Shiny and environmental setup.\u003c/b\u003e\u003c/p\u003e\u003cp\u003eThe ShinyEvents R Shiny application and package was built using R version 4.4.1 and can be installed via GitHub. To setup a ShinyEvents application interface a parameter file is required. This file outlines the data tables, event column names, and event descriptions which guide the app during startup to generate a comprehensive event data table which fuels the applications functionality. A detailed explanation of how to set up this parameter file can be found in the Github Page \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/shawlab-moffitt/shinyEvents\u003c/span\u003e\u003cspan address=\"https://github.com/shawlab-moffitt/shinyEvents\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. To deploy the application the app folder should be structured to include the ‘app.R’ script, the ‘R’ folder, the parameter file, and the supplementary files that are annotated in the parameter file. At the top of the ‘app.R’ script there is a line to designate the file name and path of the parameter file “Event_Param_File” and if previously processed, the user can define the event data file and patient annotation file. With these criteria met, the user can deploy the app via the ‘Run App’ button or using the ‘runApp’ R function.\u003c/p\u003e\u003cp\u003e\u003cb\u003eOnline User Tutorial Page\u003c/b\u003e\u003c/p\u003e\u003cp\u003eWithin the ShinyEvents application users can view a tutorial page with an in-depth guide on file formatting as well as video tutorials and helpful text that overview the user interface. In the guide on file formatting, we breakdown the input file options to further explain what data is required at minimum and provide descriptions and examples for the file contents (\u003cb\u003eSupplementary Figure S5A, S5B\u003c/b\u003e). This will cover the event data file and optional supplementary data that can be uploaded directly through the application interface, including a figure outlining how these files can be connected in the app. For more advanced use, we demonstrate how to set up the applications working directory and assemble a parameter file that can be read in through the ‘app.R’ script which guides the generation of detailed event data for exploration in the app. To assist users, we provide an R function that can be used as a pre-processing step to derive the event data from the parameter file. This is recommended when deploying the app with a parameter file or working with larger data sets, as the application startup may require additional computing time if the event data needs to be generated. Additionally, we include tutorial videos and helpful tips that demonstrate the features available in the app.\u003c/p\u003e\u003cp\u003e\u003cb\u003eOnline Example\u003c/b\u003e\u003c/p\u003e\u003cp\u003eSource code and tutorial to ShinyEvents can be accessed on our GitHub page \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/shawlab-moffitt/shinyEvents\u003c/span\u003e\u003cspan address=\"https://github.com/shawlab-moffitt/shinyEvents\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e which also contains example data, instructions on setting up a local environment for the application, and R functions to assist in data pre-processing. A version of the tool is accessible by \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://shawlab-moffitt.shinyapps.io/shinyevents/\u003c/span\u003e\u003cspan address=\"https://shawlab-moffitt.shinyapps.io/shinyevents/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. Here users can load example data provided in the app to view the appropriate formatting and explore the features available. Within the app we provide a tutorial page, helpful tips, as well as a guided layout to simplify data upload and app customization. A Zenodo DOI is available \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.5281/zenodo.16527381\u003c/span\u003e\u003cspan address=\"10.5281/zenodo.16527381\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e\u003cp\u003e\u003cb\u003eQuantification and Statistical Analysis\u003c/b\u003e\u003c/p\u003e\u003cp\u003eHighlight details about the Kaplan Meier analysis, Fisher’s Exact Test. Statistical tests were based on functions implemented in R.\u003c/p\u003e"},{"header":"Declarations","content":"\u003ch2\u003eCompeting Interests\u003c/h2\u003e\n\u003cp\u003eG.D.G, T.I.S., and D.C have a patent pending based on the submitted work.\u003c/p\u003e\n\u003ch2\u003eAuthor Contribution\u003c/h2\u003e\n\u003cp\u003eContribution: A.O., T.I.S., led the software development and testing. J.D., D.P.T, M.T., D.S., D.D., S.E., V.Y., X.W., A.T., J.G., G.D.G., contributed to the software development and testing; B.J.M., A.A.T., S.G., J.M., R.L., R.R.P., G.D.G., provided guidance on the clinical interpretation; V.Y., D.C., provided guidance on statistical analysis. A.T., R.J.R., M.L.C., R.L., G.D.G., provided patient samples and clinical data. A.O., G.D.G., D.C., T.I.S. designed the study, analyzed data, and wrote the manuscript; and A.T., G.D.G., D.C., T.I.S. oversaw the study. All authors assisted in preparation of the manuscript.\u003c/p\u003e\n\u003ch2\u003eAcknowledgement\u003c/h2\u003e\n\u003cp\u003eThis work has been supported in part by the Biostatistics and Bioinformatics Shared Resource at the Moffitt Cancer Center (NCI P30 CA076292), the Moffitt Cancer Center Department of Biostatistics and Bioinformatics Pilot Project (T.I.S.). Funding for this project was provided by the Department of Defense, Grant No. HT94252510691 (T.I.S., B.M.), Florida Department of Health, Grant No. MOAAX (D.C.), and National Institute of Health, Grant No. R21 CA286417-01, (D.C.) and Grant No. T32 CA233399-04 (J.D.). The authors thank all the ORIEN AVATAR Collaborative Members.\u003c/p\u003e\n\u003ch2\u003eData Availability\u003c/h2\u003e\n\u003cp\u003eSoftware code has been deposited inside the Zenodo repository: https://doi.org/10.5281/zenodo.16527381.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eVerkerk, K. \u0026amp; Voest, E. E. Generating and using real-world data: A worthwhile uphill battle. \u003cem\u003eCell\u003c/em\u003e \u003cstrong\u003e187\u003c/strong\u003e, 1636-1650 (2024). https://doi.org:10.1016/j.cell.2024.02.012\u003c/li\u003e\n\u003cli\u003eLiu, F. \u0026amp; Panagiotakos, D. Real-world data: a brief review of the methods, applications, challenges and opportunities. \u003cem\u003eBMC Med Res Methodol\u003c/em\u003e \u003cstrong\u003e22\u003c/strong\u003e, 287 (2022). https://doi.org:10.1186/s12874-022-01768-6\u003c/li\u003e\n\u003cli\u003eRamsey, S. D., Onar-Thomas, A. \u0026amp; Wheeler, S. B. Real-World Database Studies in Oncology: A Call for Standards. \u003cem\u003eJ Clin Oncol\u003c/em\u003e \u003cstrong\u003e42\u003c/strong\u003e, 977-980 (2024). https://doi.org:10.1200/JCO.23.02399\u003c/li\u003e\n\u003cli\u003eSoupir, A.\u003cem\u003e et al.\u003c/em\u003e Genomic, transcriptomic, and immunogenomic landscape of over 1300 sarcomas of diverse histology subtypes. \u003cem\u003eNat Commun\u003c/em\u003e \u003cstrong\u003e16\u003c/strong\u003e, 4206 (2025). https://doi.org:10.1038/s41467-025-58678-6\u003c/li\u003e\n\u003cli\u003ePark, M. A.\u003cem\u003e et al.\u003c/em\u003e Molecular Pathway and Immune Profile Analysis of IPMN-Derived Versus PanIN-Derived Pancreatic Ductal Adenocarcinomas. \u003cem\u003eInt J Mol Sci\u003c/em\u003e \u003cstrong\u003e25\u003c/strong\u003e (2024). https://doi.org:10.3390/ijms252313164\u003c/li\u003e\n\u003cli\u003eWang, X.\u003cem\u003e et al.\u003c/em\u003e Multicellular immune ecotypes within solid tumors predict real-world therapeutic benefits with immune checkpoint inhibitors. \u003cem\u003emedRxiv\u003c/em\u003e (2024). https://doi.org:10.1101/2024.07.19.24310726\u003c/li\u003e\n\u003cli\u003eEule, C. J.\u003cem\u003e et al.\u003c/em\u003e Clinical and Genomic Features of Patients with Renal Cell Carcinoma and Advanced Chronic Kidney Disease: Analysis of a Multi-Institutional Database. \u003cem\u003eCancers (Basel)\u003c/em\u003e \u003cstrong\u003e16\u003c/strong\u003e (2024). https://doi.org:10.3390/cancers16101920\u003c/li\u003e\n\u003cli\u003eDemetriou, A. N.\u003cem\u003e et al.\u003c/em\u003e Profiling the molecular and clinical landscape of glioblastoma utilizing the Oncology Research Information Exchange Network brain cancer database. \u003cem\u003eNeurooncol Adv\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, vdae046 (2024). https://doi.org:10.1093/noajnl/vdae046\u003c/li\u003e\n\u003cli\u003eCastellanos, E. H., Wittmershaus, B. K. \u0026amp; Chandwani, S. Raising the Bar for Real-World Data in Oncology: Approaches to Quality Across Multiple Dimensions. \u003cem\u003eJCO Clin Cancer Inform\u003c/em\u003e \u003cstrong\u003e8\u003c/strong\u003e, e2300046 (2024). https://doi.org:10.1200/CCI.23.00046\u003c/li\u003e\n\u003cli\u003eConsortium, A. P. G. AACR Project GENIE: Powering Precision Medicine through an International Consortium. \u003cem\u003eCancer Discov\u003c/em\u003e \u003cstrong\u003e7\u003c/strong\u003e, 818-831 (2017). https://doi.org:10.1158/2159-8290.CD-17-0151\u003c/li\u003e\n\u003cli\u003ede Bruijn, I.\u003cem\u003e et al.\u003c/em\u003e Analysis and Visualization of Longitudinal Genomic and Clinical Data from the AACR Project GENIE Biopharma Collaborative in cBioPortal. \u003cem\u003eCancer Res\u003c/em\u003e \u003cstrong\u003e83\u003c/strong\u003e, 3861-3867 (2023). https://doi.org:10.1158/0008-5472.CAN-23-0816\u003c/li\u003e\n\u003cli\u003eHarbig, T. A.\u003cem\u003e et al.\u003c/em\u003e OncoThreads: visualization of large-scale longitudinal cancer molecular data. \u003cem\u003eBioinformatics\u003c/em\u003e \u003cstrong\u003e37\u003c/strong\u003e, i59-i66 (2021). https://doi.org:10.1093/bioinformatics/btab289\u003c/li\u003e\n\u003cli\u003eDel Gaizo, J., Catchpole, K. R. \u0026amp; Alekseyenko, A. V. Research and Exploratory Analysis Driven-Time-data Visualization (read-tv) software. \u003cem\u003eJAMIA Open\u003c/em\u003e \u003cstrong\u003e4\u003c/strong\u003e, ooab007 (2021). https://doi.org:10.1093/jamiaopen/ooab007\u003c/li\u003e\n\u003cli\u003eGoedhart, J. PlotTwist: A web app for plotting and annotating continuous data. \u003cem\u003ePLoS Biol\u003c/em\u003e \u003cstrong\u003e18\u003c/strong\u003e, e3000581 (2020). https://doi.org:10.1371/journal.pbio.3000581\u003c/li\u003e\n\u003cli\u003eEspinosa-Carrasco, J.\u003cem\u003e et al.\u003c/em\u003e Pergola-web: a web server for the visualization and analysis of longitudinal behavioral data using repurposed genomics tools and standards. \u003cem\u003eNucleic Acids Res\u003c/em\u003e \u003cstrong\u003e47\u003c/strong\u003e, W600-W604 (2019). https://doi.org:10.1093/nar/gkz414\u003c/li\u003e\n\u003cli\u003eMaheux, E.\u003cem\u003e et al.\u003c/em\u003e Forecasting individual progression trajectories in Alzheimer\u0026apos;s disease. \u003cem\u003eNat Commun\u003c/em\u003e \u003cstrong\u003e14\u003c/strong\u003e, 761 (2023). https://doi.org:10.1038/s41467-022-35712-5\u003c/li\u003e\n\u003cli\u003eNguena Nguefack, H. L.\u003cem\u003e et al.\u003c/em\u003e Trajectory Modelling Techniques Useful to Epidemiological Research: A Comparative Narrative Review of Approaches. \u003cem\u003eClin Epidemiol\u003c/em\u003e \u003cstrong\u003e12\u003c/strong\u003e, 1205-1222 (2020). https://doi.org:10.2147/CLEP.S265287\u003c/li\u003e\n\u003cli\u003eJohnson, K. B.\u003cem\u003e et al.\u003c/em\u003e Precision Medicine, AI, and the Future of Personalized Health Care. \u003cem\u003eClin Transl Sci\u003c/em\u003e \u003cstrong\u003e14\u003c/strong\u003e, 86-93 (2021). https://doi.org:10.1111/cts.12884\u003c/li\u003e\n\u003cli\u003eChoudhury, N. J.\u003cem\u003e et al.\u003c/em\u003e The GENIE BPC NSCLC Cohort: A Real-World Repository Integrating Standardized Clinical and Genomic Data for 1,846 Patients with Non-Small Cell Lung Cancer. \u003cem\u003eClin Cancer Res\u003c/em\u003e \u003cstrong\u003e29\u003c/strong\u003e, 3418-3428 (2023). https://doi.org:10.1158/1078-0432.CCR-23-0580\u003c/li\u003e\n\u003cli\u003eObermayer, A. N.\u003cem\u003e et al.\u003c/em\u003e PATH-SURVEYOR: pathway level survival enquiry for immuno-oncology and drug repurposing. \u003cem\u003eBMC Bioinformatics\u003c/em\u003e \u003cstrong\u003e24\u003c/strong\u003e, 266 (2023). https://doi.org:10.1186/s12859-023-05393-y\u003c/li\u003e\n\u003cli\u003eTurchioe, M. R.\u003cem\u003e et al.\u003c/em\u003e A Systematic Review of Patient-Facing Visualizations of Personal Health Data. \u003cem\u003eAppl Clin Inform\u003c/em\u003e \u003cstrong\u003e10\u003c/strong\u003e, 751-770 (2019). https://doi.org:10.1055/s-0039-1697592\u003c/li\u003e\n\u003cli\u003eAvila, M. \u0026amp; Meric-Bernstam, F. Next-generation sequencing for the general cancer patient. \u003cem\u003eClin Adv Hematol Oncol\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, 447-454 (2019). \u003c/li\u003e\n\u003cli\u003eStahlberg, E. A.\u003cem\u003e et al.\u003c/em\u003e Exploring approaches for predictive cancer patient digital twins: Opportunities for collaboration and innovation. \u003cem\u003eFront Digit Health\u003c/em\u003e \u003cstrong\u003e4\u003c/strong\u003e, 1007784 (2022). https://doi.org:10.3389/fdgth.2022.1007784\u003c/li\u003e\n\u003cli\u003eAltman, R. B. The interactions between clinical informatics and bioinformatics: a case study. \u003cem\u003eJ Am Med Inform Assoc\u003c/em\u003e \u003cstrong\u003e7\u003c/strong\u003e, 439-443 (2000). https://doi.org:10.1136/jamia.2000.0070439\u003c/li\u003e\n\u003cli\u003eMartin-Sanchez, F.\u003cem\u003e et al.\u003c/em\u003e Synergy between medical informatics and bioinformatics: facilitating genomic medicine for future health care. \u003cem\u003eJ Biomed Inform\u003c/em\u003e \u003cstrong\u003e37\u003c/strong\u003e, 30-42 (2004). https://doi.org:10.1016/j.jbi.2003.09.003\u003c/li\u003e\n\u003cli\u003eAlowais, S. A.\u003cem\u003e et al.\u003c/em\u003e Revolutionizing healthcare: the role of artificial intelligence in clinical practice. \u003cem\u003eBMC Med Educ\u003c/em\u003e \u003cstrong\u003e23\u003c/strong\u003e, 689 (2023). https://doi.org:10.1186/s12909-023-04698-z\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"npj-precision-oncology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"npjprecisiononcology","sideBox":"Learn more about [npj Precision Oncology](http://www.nature.com/npjprecisiononcology/)","snPcode":"41698","submissionUrl":"https://submission.springernature.com/new-submission/41698/3","title":"npj Precision Oncology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-7231850/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7231850/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eLongitudinal data analysis of the patient\u0026rsquo;s treatment course is critical to uncovering variables that influence outcomes. However, existing tools have significant limitations in integrating multilayered time-series data. Here, we developed ShinyEvents, a web-based framework for complex longitudinal data analysis. ShinyEvents allows users to upload data and generate interactive timelines of the patient\u0026rsquo;s clinical events. Our tool can perform cohort-level analysis, including the assignment of treatment clusters and clinical endpoints. Our tool also provides informative cohort visualizations, such as a Sankey diagram of the treatment line and Swimmer diagram of the clinical course. Finally, our tool can infer a real-world progression-free survival (rwPFS) based on user-defined endpoints to perform Kaplan-Meier and Cox proportional hazards regression analysis. With these features, the tool can then associate the lines of treatment with clinical outcomes. Altogether, ShinyEvents facilitates the integration of multilayered longitudinal data and enables survival analysis in real-time. A live link to the tool is available \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://shawlab-moffitt.shinyapps.io/shinyevents/\u003c/span\u003e\u003cspan address=\"https://shawlab-moffitt.shinyapps.io/shinyevents/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e","manuscriptTitle":"ShinyEvents: harmonizing longitudinal data for real world survival estimation","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-08-06 08:21:58","doi":"10.21203/rs.3.rs-7231850/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-09-11T17:13:02+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-09-09T22:03:35+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-09-08T02:12:52+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-09-05T21:36:11+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"204194095873434641159492154993318815792","date":"2025-09-03T07:42:18+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"35070991701516783151230905119533943268","date":"2025-09-01T00:15:32+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"296119190198648905507950306572840153925","date":"2025-09-01T00:10:45+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"31573752937552014903294499135411076695","date":"2025-08-18T00:59:37+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"332999162613971136846493358997177086794","date":"2025-08-17T21:08:11+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-08-12T02:16:15+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-08-11T18:08:18+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-08-01T03:41:14+00:00","index":"","fulltext":""},{"type":"submitted","content":"npj Precision Oncology","date":"2025-07-28T08:52:06+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"npj-precision-oncology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"npjprecisiononcology","sideBox":"Learn more about [npj Precision Oncology](http://www.nature.com/npjprecisiononcology/)","snPcode":"41698","submissionUrl":"https://submission.springernature.com/new-submission/41698/3","title":"npj Precision Oncology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"965ea3e5-cd34-4523-ad44-da4f0a1477c8","owner":[],"postedDate":"August 6th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":52468459,"name":"Biological sciences/Computational biology and bioinformatics"},{"id":52468460,"name":"Health sciences/Health care"},{"id":52468461,"name":"Physical sciences/Mathematics and computing"},{"id":52468462,"name":"Health sciences/Medical research"}],"tags":[],"updatedAt":"2026-01-19T17:01:22+00:00","versionOfRecord":{"articleIdentity":"rs-7231850","link":"https://doi.org/10.1038/s41698-025-01212-0","journal":{"identity":"npj-precision-oncology","isVorOnly":false,"title":"npj Precision Oncology"},"publishedOn":"2026-01-13 16:28:26","publishedOnDateReadable":"January 13th, 2026"},"versionCreatedAt":"2025-08-06 08:21:58","video":"","vorDoi":"10.1038/s41698-025-01212-0","vorDoiUrl":"https://doi.org/10.1038/s41698-025-01212-0","workflowStages":[]},"version":"v1","identity":"rs-7231850","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7231850","identity":"rs-7231850","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.