Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques

doi:10.21203/rs.3.rs-8937650/v1

Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques

2026 · doi:10.21203/rs.3.rs-8937650/v1

preprint OA: closed

Full text JSON View at publisher

Full text 66,055 characters · extracted from preprint-html · click to expand

Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques Rifat Burak Ergül, Poyraz Doğan, Muhammed Edip Görür, Mücahit Bayram, and 4 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8937650/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 6 You are reading this latest preprint version Abstract Objective Urolithiasis is a common condition in urological practice, and accurate imaging is essential for proper diagnosis and management. Computed tomography (CT) is widely accepted as the gold standard for evaluating kidney stones due to its high sensitivity in assessing stone location, size, and morphology. However, manual interpretation of CT images is time-consuming and subject to observer-dependent variability. This study aimed to develop a deep learning–based model for automatic kidney stone segmentation and size estimation from CT images and to integrate this model into a web-based clinical decision support system. Methods An open-access dataset consisting of 3,584 CT images and corresponding kidney stone segmentation masks was used. The dataset was divided into training (70%), validation (20%), and test (10%) subsets. Image preprocessing and model development were performed using Python and the TensorFlow framework. A U-Net + + architecture was employed for kidney stone segmentation. Model training was conducted in the Google Colab environment. Stone size estimation was performed using the equivalent circular diameter method based on the segmented stone area. Model performance was evaluated using Precision, Recall, Precision–Recall Area Under the Curve (PR-AUC), Intersection over Union (IoU), and Dice/F1-score metrics. The trained model was additionally deployed via a web-based interface to demonstrate potential clinical applicability. Results On the test dataset, the proposed model achieved a PR-AUC of 90.58%. The Dice/F1-score and IoU values were 82.10% and 69.63%, respectively. Precision and Recall were 76.66% and 88.36%. Qualitative evaluation demonstrated accurate localization and segmentation of kidney stones, including small calculi. Conclusion This study presents an effective deep learning–based approach for automated kidney stone segmentation and size estimation from CT images. The favorable quantitative results and consistent visual performance suggest that the proposed system may support clinical decision-making by reducing workload and improving diagnostic consistency in routine urological practice. Kidney stone computed tomography deep learning artificial intelligence segmentation Figures Figure 1 Figure 2 Figure 3 Figure 4 Introduction Kidney stone disease is one of the most common urological disorders worldwide and represents a significant public health burden due to its high prevalence and recurrence rate. The disease may present with acute renal colic, hematuria, urinary tract infections, and, in advanced cases, obstructive uropathy leading to loss of renal function. Owing to its frequent acute presentation, urolithiasis accounts for a substantial proportion of emergency department admissions, making rapid and accurate diagnosis essential for effective clinical management [1–3]. Non-contrast computed tomography (NCCT) is currently regarded as the gold standard imaging modality for the evaluation of kidney stones, owing to its high sensitivity and specificity in detecting calculi, as well as its ability to accurately assess stone size, location, density, and anatomical relationships [4, 5]. Despite these advantages, manual interpretation of NCCT images remains time-consuming and prone to inter-observer variability. Factors such as increased workload, limited availability of experienced radiologists, and time pressure, particularly in emergency settings, may adversely affect diagnostic accuracy. [6]. Moreover, small stone size, low-contrast appearance, and complex anatomical variations can further contribute to missed or misinterpreted findings in routine clinical practice [7]. These limitations have prompted growing interest in automated and reliable tools to support radiological assessment. In recent years, artificial intelligence (AI), particularly deep learning–based approaches using convolutional neural networks (CNNs), has demonstrated remarkable performance in medical image analysis tasks, including classification, detection, and segmentation [8, 9] Several studies have applied AI-based techniques to kidney stone detection and segmentation on NCCT images, reporting encouraging diagnostic performance and potential reductions in clinical workload [10, 11]. However, many previously published studies are limited by small sample sizes, single-center datasets, class imbalance, and insufficient consideration of clinical integration and real-time applicability [12, 13]. In addition, issues related to model explainability and deployment within clinical decision support systems have often been inadequately addressed. More recent investigations suggest that deep learning models trained on large, diverse datasets and designed with practical clinical workflows in mind may yield more robust and generalizable results [14]. Therefore, the present study aimed to develop a deep learning–based model for the automatic segmentation and size estimation of kidney stones from NCCT images using a large open-access dataset. Furthermore, the trained model was integrated into a web-based clinical decision support system to facilitate practical deployment and to support diagnostic decision-making in routine urological practice. Materials and Methods Dataset and Study Design In this study, a deep learning–based approach was developed for the automatic segmentation of kidney stones from computed tomography (CT) images. The dataset consisted of CT images paired with corresponding binary segmentation masks indicating stone regions. Data organization and processing were performed in accordance with recommended best practices for medical image analysis to ensure reproducibility and methodological transparency [8]. Input data included common raster image formats (PNG, JPG, TIFF) and DICOM (.dcm) files. For DICOM images, pixel intensities were normalized using VOI-LUT transformation and subsequently converted into a three-channel representation compatible with the model input. This preprocessing step aimed to reduce intensity variability arising from different CT acquisition protocols [15]. To minimize data leakage and improve patient-level generalizability, a group-based data splitting strategy was applied whenever patient-related identifiers could be inferred. This ensured that images originating from the same patient were not distributed across training, validation, and test subsets simultaneously. The dataset was divided into training (70%), validation (20%), and test (10%) sets, following commonly accepted practices in medical artificial intelligence studies [12]. Model Architecture Kidney stone segmentation was performed using a U-Net + + architecture. Compared to the classical U-Net, U-Net + + employs nested skip connections that enable more effective multi-scale feature aggregation, which is particularly advantageous for segmenting small and low-contrast targets [16]. An ImageNet-pretrained EfficientNet-B3 backbone was used as the network's encoder. EfficientNet architectures are widely adopted in medical imaging due to their favorable balance between parameter efficiency and strong feature extraction capability [17]. Dropout was applied in the decoder pathway to reduce overfitting. The overall model architecture is illustrated in Fig. 1 . Preprocessing, Data Augmentation, and Mixing Strategies All images were resized to match the model input dimensions while preserving the original aspect ratio using a letterbox/padding approach. Images were then normalized using ImageNet mean and standard deviation values before model ingestion. To enhance robustness against variations in acquisition conditions and anatomical appearance, strong data augmentation was applied during training. Augmentation techniques included random brightness and contrast adjustment, noise injection, gamma correction, perspective transformation, elastic deformation, and horizontal and vertical flipping. Such augmentation strategies have been shown to improve generalization performance in medical image segmentation tasks substantially [18]. To further mitigate class imbalance and reduce overfitting, probabilistic MixUp and CutMix data-mixing strategies were applied during training. These approaches increase sample diversity and encourage smoother decision boundaries, contributing to more stable and generalizable learning. [18] Training Configuration and Overfitting Prevention Model training was conducted using the PyTorch framework, with AdamW employed as the optimization algorithm. To control overfitting, weight decay, gradient clipping, and early stopping were applied. Learning rate scheduling was performed using CosineAnnealingWarmRestarts, and Stochastic Weight Averaging (SWA) was applied during the later stages of training to improve generalization by averaging model weights across epochs [19]. Automatic Mixed Precision (AMP) was utilized throughout training to improve computational efficiency and reduce memory consumption. During validation, the probability threshold used to binarize the segmentation output was not fixed; instead, a threshold sweep was performed on the validation set to identify the value that maximized the Dice (F1) score. This strategy is particularly appropriate for segmentation problems with imbalanced pixel distributions [20]. Testing and Evaluation Metrics Model performance was evaluated exclusively on the held-out test set. During inference, predictions were enhanced using test-time augmentation (TTA), where horizontally and vertically flipped versions of each image were processed, and the resulting probability maps were averaged. This approach improves prediction stability and robustness [21]. Segmentation performance was assessed using pixel-level Dice (F1-score) and Intersection over Union (IoU) metrics, as well as ROC and precision–recall (PR) curve–based area under the curve (AUC) values. These metrics are widely accepted for evaluating segmentation quality, particularly for small target structures such as kidney stones [22]. To generate clinically interpretable outputs, a subset of positive test cases was visualized by overlaying predicted masks, contours, bounding boxes, and equivalent circular diameter measurements onto the original CT images. Web-Based Application The developed segmentation model was deployed as a web-based clinical decision support tool (www.kidney-stone-segmentation.com) to facilitate practical use scenarios. The application allows users to upload CT images and receive real-time model outputs through an interactive interface. Uploaded images are processed using the same preprocessing pipeline as in training, and the model outputs probability maps and binary segmentation masks. Results are displayed visually with stone contours and measurement annotations. To ensure safe clinical use, the interface includes a disclaimer stating that the system does not replace physician judgment. An example screenshot of the web interface is shown in Fig. 2. Results The quantitative performance of the proposed kidney stone segmentation model is summarized in Table 1 , which reports the precision, recall, PR-AUC, Dice (F1), and IoU values obtained on the independent test set. The model achieved 76.66% precision and 88.36% recall, indicating high sensitivity for kidney stone detection. The PR-AUC of 90.58% demonstrates strong discriminative performance under class-imbalanced conditions, as shown in the precision–recall curve in Fig. 3 . In terms of spatial overlap, the model yielded a Dice (F1) score of 82.10% and an IoU of 69.63%, reflecting accurate pixel-wise agreement between predicted masks and ground truth annotations. Representative qualitative examples of kidney stone segmentations overlaid on CT images are presented in Fig. 4 , further supporting the quantitative results. Table 1 Performance metrics of the model Precision (%) Recall (%) PR-AUC (%) Dice (%) IoU (%) 76,66 88,36 90,58 82,10 69,63 Discussions In this study, a deep learning–based framework was developed for the automatic segmentation of kidney stones from non-contrast computed tomography (NCCT) images, and its performance was evaluated using both pixel-level overlap metrics and precision–recall–based measures. The results demonstrate that the proposed model achieves a favorable balance between sensitivity and segmentation accuracy, supporting its potential as a clinically relevant decision-support tool. In particular, the high PR-AUC value highlights the model’s robustness to substantial class imbalance, a common characteristic of kidney stone segmentation tasks in which stone pixels represent only a small fraction of the image [8]. Non-contrast CT remains the imaging modality of choice for evaluating nephrolithiasis due to its superior sensitivity and detailed anatomical depiction [23]. However, accurately interpreting NCCT images can be challenging in routine clinical practice, especially in high-workload environments and emergency settings. Small stones, low contrast relative to surrounding tissues, and complex anatomical variations may increase the likelihood of missed or inconsistent detections [6]. In this context, automated segmentation methods can provide consistent and objective delineation of stone regions, potentially reducing inter-observer variability and supporting radiologists during image interpretation. Several deep learning–based approaches for kidney stone detection and segmentation have been reported in recent years. Prior studies have explored convolutional neural network–based frameworks for automated stone detection, volumetric segmentation, and stone burden assessment on CT images, reporting encouraging diagnostic performance [10, 11, 24]. More recent work has investigated advanced architectures, including hybrid and attention-based models, to improve robustness and segmentation accuracy further [25, 26]. Compared with these approaches, the present study demonstrates competitive performance, particularly in precision–recall–based evaluation, which is increasingly recognized as a more informative metric than ROC-based measures in highly imbalanced medical imaging problems [27]. Beyond algorithmic performance, the clinical utility of artificial intelligence systems depends on their robustness, generalizability, and ease of integration into existing workflows. Previous methodological reviews emphasize that strong internal validation results alone are insufficient for clinical adoption, and that external validation and real-world testing are critical next steps [28, 29]. In the present study, integrating the segmentation model into a web-based interface represents an initial step toward practical deployment, enabling real-time visualization of segmentation outputs and facilitating interaction with end users in a clinical context. Explainability and transparency are also increasingly recognized as key determinants of clinician trust in artificial intelligence–based systems. Although the current work focuses primarily on segmentation performance, future incorporation of explainable AI techniques, such as saliency maps or gradient-based visualization methods, may further enhance interpretability and support clinical acceptance [29, 30]. In addition, recent methodological studies highlight that combining overlap-based metrics with precision–recall–oriented evaluation provides a more comprehensive assessment of segmentation performance, particularly for small target structures such as kidney stones [27]. Several limitations of this study should be acknowledged. First, the model was evaluated on an internal test dataset derived from publicly available imaging sources, which may not fully capture the variability encountered in routine clinical practice. Second, the current framework focuses on binary stone segmentation and does not explicitly address clinically relevant attributes such as stone composition, anatomical location, or volumetric burden. Finally, although a web-based deployment was implemented, prospective and multicenter validation studies are required to assess the model’s real-world impact and generalizability. Conclusion In this study, a deep learning–based approach was developed for the automatic segmentation of kidney stones from non-contrast computed tomography (NCCT) images. It demonstrated robust performance on the internal test dataset, with reliable pixel-level delineation and strong discriminative capability under class-imbalanced conditions. The deployment of the model as a web-based application (www.kidney-stone-segmentation.com) represents an important step toward clinical translation by enabling real-time visualization of segmentation outputs through an accessible interface. Although external validation and prospective multicenter studies are required to confirm its generalizability and clinical impact, the results of this study suggest that artificial intelligence–driven segmentation systems have the potential to support radiological assessment and enhance urological diagnostic workflows. Declarations Ethics Ethical approval was not required for this study. Footnotes Authorship Contributions Concept and design of the study: R.B.E., P.D., T.T.; Data collection: M.E.G., M.E.E, İ.T.G.; Data preprocessing: R.B.E., İ.T.G.; AI model development and training: P.D., M.B., Y.B..; Statistical analysis and model evaluation: R.B.E., M.E.E.; Software development and coding: P.D., M.E.G.; Manuscript writing: P.D., M.E.G., M.B., Y.B..; Review and editing: R.B.E., İ.T.G., M.E.E., T.T.; Project supervision: T.T.. Conflict of Interest: No conflict of interest was declared by the authors. Financial Disclosure: The authors declared that this study received no financial support Data Availability The dataset used in this study is publicly available from an open-access repository. Processed data and model outputs are available from the corresponding author upon reasonable request. References Finger M, Finger E, Bellucci A, Malieckal DA. Medical management for the prevention of kidney stones. Postgrad Med J. 2023;99(1169):112–8. doi: 10.1136/postgradmedj-2021-140971. Scales CD, Jr., Smith AC, Hanley JM, Saigal CS, Urologic Diseases in America P. Prevalence of kidney stones in the United States. Eur Urol. 2012;62(1):160–5. doi: 10.1016/j.eururo.2012.03.052. Sorokin I, Mamoulakis C, Miyazawa K, Rodgers A, Talati J, Lotan Y. Epidemiology of stone disease across the world. World J Urol. 2017;35(9):1301–20. doi: 10.1007/s00345-017-2008-6. Turk C, Petrik A, Sarica K, Seitz C, Skolarikos A, Straub M, et al. EAU Guidelines on Diagnosis and Conservative Management of Urolithiasis. Eur Urol. 2016;69(3):468–74. doi: 10.1016/j.eururo.2015.07.040. Waite S, Scott J, Gale B, Fuchs T, Kolla S, Reede D. Interpretive Error in Radiology. AJR Am J Roentgenol. 2017;208(4):739–49. doi: 10.2214/AJR.16.16963. Brady AP. Error and discrepancy in radiology: inevitable or avoidable? Insights Imaging. 2017;8(1):171–82. doi: 10.1007/s13244-016-0534-1. Moore CL, Daniels B, Ghita M, Gunabushanam G, Luty S, Molinaro AM, et al. Accuracy of reduced-dose computed tomography for ureteral stones in emergency department patients. Ann Emerg Med. 2015;65(2):189–98 e2. doi: 10.1016/j.annemergmed.2014.09.008. Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, et al. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60–88. doi: 10.1016/j.media.2017.07.005. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. 2019;25(1):24–9. doi: 10.1038/s41591-018-0316-z. Elton DC, Turkbey EB, Pickhardt PJ, Summers RM. A deep learning system for automated kidney stone detection and volumetric segmentation on noncontrast CT scans. Med Phys. 2022;49(4):2545–54. doi: 10.1002/mp.15518. Cui Y, Sun Z, Ma S, Liu W, Wang X, Zhang X, et al. Automatic Detection and Scoring of Kidney Stones on Noncontrast CT Images Using S.T.O.N.E. Nephrolithometry: Combined Deep Learning and Thresholding Methods. Mol Imaging Biol. 2021;23(3):436–45. doi: 10.1007/s11307-020-01554-0. Park SH, Han K. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction. Radiology. 2018;286(3):800–9. doi: 10.1148/radiol.2017171920. Kelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195. doi: 10.1186/s12916-019-1426-2. Mazurowski MA, Buda M, Saha A, Bashir MR. Deep learning in radiology: An overview of the concepts and a survey of the state of the art with focus on MRI. J Magn Reson Imaging. 2019;49(4):939–54. doi: 10.1002/jmri.26534. Dapamede T, Li F, Khosravi B, Purkayastha S, Trivedi H, Gichoya J. DICOM LUT is a Key Step in Medical Image Preprocessing Towards AI Generalizability. J Imaging Inform Med. 2025;38(5):3040–8. doi: 10.1007/s10278-025-01418-5. Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J. UNet++: A Nested U-Net Architecture for Medical Image Segmentation. Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018;11045:3–11. doi: 10.1007/978-3-030-00889-5_1. Keishing V, Ahn ES, Khashim Z, Loudermilk AD, Erickson BJ. Towards Automated Craniosynostosis Diagnosis Using EfficientNet-Based Artificial Intelligence Models: A Two-Class and Multi-Class Approach. J Imaging Inform Med. 2025. doi: 10.1007/s10278-025-01694-1. Zeng W. Image data augmentation techniques based on deep learning: A survey. Math Biosci Eng. 2024;21(6):6190–224. doi: 10.3934/mbe.2024272. Kunkel KJ, Anwaruddin S. Papillary Muscle Rupture Due to Delayed STEMI Presentation in a Patient Self-Isolating for Presumed COVID-19. JACC Case Rep. 2020;2(10):1633–6. doi: 10.1016/j.jaccas.2020.06.036. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–5. doi: 10.1002/1097-0142(1950)3:13.0.co;2-3. Rylaarsdam LE, Johnecheck GN, Looyenga BD, Louters LL. GLUT1 is associated with sphingolipid-organized, cholesterol-independent domains in L929 mouse fibroblast cells. Biochimie. 2019;162:88–96. doi: 10.1016/j.biochi.2019.04.010. Rodriguez-Espinosa N, Gonzalez-Colaco Harmand M, Miranda Saavedra F, Galvan Gonzalez MA, Plasencia Nunez M, Aldea Perona AM, et al. [Prescription of acetylcholinesterase inhibitors and memantine in the Canary Islands, comparison with the Spanish population.]. Rev Esp Salud Publica. 2021;95. Smith RC, Rosenfield AT, Choe KA, Essenmacher KR, Verga M, Glickman MG, et al. Acute flank pain: comparison of non-contrast-enhanced CT and intravenous urography. Radiology. 1995;194(3):789–94. doi: 10.1148/radiology.194.3.7862980. Frenken MWE, Thijssen KMJ, Vlemminx MWC, van den Heuvel ER, Westerhuis M, Oei SG. Clinical evaluation of electrohysterography as method of monitoring uterine contractions during labor: A propensity score matched study. Eur J Obstet Gynecol Reprod Biol. 2021;259:178–84. doi: 10.1016/j.ejogrb.2021.02.029. Wu N, Gao H, Xu Q, Zhang Z. Characterization and Whole-Genome Analysis of a Zearalenone-Degrading Stappia sp. WLB 29. Curr Microbiol. 2022;79(6):179. doi: 10.1007/s00284-022-02874-w. Hada AM, Potara M, Astilean S, Cordaro A, Neri G, Malanga M, et al. Linezolid nanoAntiobiotics and SERS-nanoTags based on polymeric cyclodextrin bimetallic core-shell nanoarchitectures. Carbohydr Polym. 2022;293:119736. doi: 10.1016/j.carbpol.2022.119736. Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One. 2015;10(3):e0118432. doi: 10.1371/journal.pone.0118432. Miller H, Johns L. Interoperability of Electronic Health Records: A Physician-Driven Redesign. Manag Care. 2018;27(1):37–40. Shi X, Du R, Zhang J, Lei Y, Guo H. Evaluation of the anti-cancer potential of Cedrus deodara total lignans by inducing apoptosis of A549 cells. BMC Complement Altern Med. 2019;19(1):281. doi: 10.1186/s12906-019-2682-6. Wei J, Wang L, Yang X. Game analysis on the evolution of COVID-19 epidemic under the prevention and control measures of the government. PLoS One. 2020;15(10):e0240961. doi: 10.1371/journal.pone.0240961. Additional Declarations No competing interests reported. Cite Share Download PDF Status: Under Review Version 1 posted Reviews received at journal 17 Mar, 2026 Reviewers agreed at journal 02 Mar, 2026 Reviewers invited by journal 02 Mar, 2026 Editor assigned by journal 23 Feb, 2026 Submission checks completed at journal 23 Feb, 2026 First submitted to journal 22 Feb, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8937650","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":599448794,"identity":"4a2489cd-f6c0-41d1-bcc1-0da6bc5021d0","order_by":0,"name":"Rifat Burak Ergül","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA8UlEQVRIiWNgGAWjYDACZsYHMFbjYwjN3EBAC7MBwwEwi7HZmIHBAGQIAS0MCC1t0mAtDAS0mLczM37+wGCTLx+R2FZdUPEnmr8dqOVHxTacWmQOMzNLHGBIs9x4I7Ht9owzBrkzDjM2MPacuY1TiwQz/wGJg/8OGxjOAGrhbTPIbQBqYWZsw6eFmfnHAYb/YC3FIC3zidDCBnTYAQN5icQ2ZpCWDcRosTjDkGxgwPOwWZrnjHHuRqCWg3j9wn+Y+UYFg52BfHvywc88FXK5884fPvjgRwVuLXBgcACJcwCHIlQg30CUslEwCkbBKBiJAAB+DlJgtk9KVwAAAABJRU5ErkJggg==","orcid":"","institution":"Mus State Hospital","correspondingAuthor":true,"prefix":"","firstName":"Rifat","middleName":"Burak","lastName":"Ergül","suffix":""},{"id":599448795,"identity":"9983a73f-16db-4f0e-9f8f-7b0681128338","order_by":1,"name":"Poyraz Doğan","email":"","orcid":"","institution":"Istanbul University","correspondingAuthor":false,"prefix":"","firstName":"Poyraz","middleName":"","lastName":"Doğan","suffix":""},{"id":599448803,"identity":"3de2b801-6277-47aa-a82a-78d0a3b86087","order_by":2,"name":"Muhammed Edip Görür","email":"","orcid":"","institution":"Istanbul University","correspondingAuthor":false,"prefix":"","firstName":"Muhammed","middleName":"Edip","lastName":"Görür","suffix":""},{"id":599448804,"identity":"ca460297-75de-4027-be27-d533c6064f77","order_by":3,"name":"Mücahit Bayram","email":"","orcid":"","institution":"Istanbul Health and Technology University","correspondingAuthor":false,"prefix":"","firstName":"Mücahit","middleName":"","lastName":"Bayram","suffix":""},{"id":599448814,"identity":"acbe819b-1ea7-4e8c-9b74-861c1d7a2d13","order_by":4,"name":"Yakuphan Bilmez","email":"","orcid":"","institution":"Istanbul Health and Technology University","correspondingAuthor":false,"prefix":"","firstName":"Yakuphan","middleName":"","lastName":"Bilmez","suffix":""},{"id":599448820,"identity":"6ac03e04-d597-44a1-bf0c-77516a6fefa5","order_by":5,"name":"İsmail Taha Gürlek","email":"","orcid":"","institution":"Demre State Hospital","correspondingAuthor":false,"prefix":"","firstName":"İsmail","middleName":"Taha","lastName":"Gürlek","suffix":""},{"id":599448825,"identity":"ace66349-b3da-475a-bc1f-a785db32e922","order_by":6,"name":"Mert Emre Erden","email":"","orcid":"","institution":"Istanbul University","correspondingAuthor":false,"prefix":"","firstName":"Mert","middleName":"Emre","lastName":"Erden","suffix":""},{"id":599448828,"identity":"e2a36b0a-c518-491f-a0c4-bb5e9ff9301b","order_by":7,"name":"Tzevat Tefik","email":"","orcid":"","institution":"Istanbul University","correspondingAuthor":false,"prefix":"","firstName":"Tzevat","middleName":"","lastName":"Tefik","suffix":""}],"badges":[],"createdAt":"2026-02-22 07:53:13","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8937650/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8937650/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":104181786,"identity":"00270613-63d9-4ed3-8c9e-113d509524fc","added_by":"auto","created_at":"2026-03-08 17:30:03","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":283586,"visible":true,"origin":"","legend":"\u003cp\u003eU-Net++–based kidney stone segmentation model with an EfficientNet-B3 encoder.\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-8937650/v1/e7febdba3d2fa2db6b5576b8.png"},{"id":104181784,"identity":"e27fd23d-2a92-4ad6-b7c9-42a1a1b7efec","added_by":"auto","created_at":"2026-03-08 17:30:03","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":35852,"visible":true,"origin":"","legend":"\u003cp\u003eScreenshot of the web-based kidney stone segmentation interface (www.kidney-stone-segmentation.com).\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-8937650/v1/9629331666e795178163f20d.png"},{"id":104181785,"identity":"fa5c8233-2515-4356-ae03-10c47ddd9071","added_by":"auto","created_at":"2026-03-08 17:30:03","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":85534,"visible":true,"origin":"","legend":"\u003cp\u003ePR-AUC curve\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-8937650/v1/1d654b003bf8cf5428e4a521.png"},{"id":104181787,"identity":"35b6868f-b4bd-4505-baa9-c1aa11ba6514","added_by":"auto","created_at":"2026-03-08 17:30:03","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":175066,"visible":true,"origin":"","legend":"\u003cp\u003eOverlaid CT image\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-8937650/v1/c051c16d2bdc74f8bab35ebe.png"},{"id":104181802,"identity":"af25ac29-f038-4ef9-b789-8cfeb84bf738","added_by":"auto","created_at":"2026-03-08 17:30:08","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1124134,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8937650/v1/9e27cd26-a0c0-4745-a8c6-356776e8988a.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques","fulltext":[{"header":"Introduction","content":"\u003cp\u003eKidney stone disease is one of the most common urological disorders worldwide and represents a significant public health burden due to its high prevalence and recurrence rate. The disease may present with acute renal colic, hematuria, urinary tract infections, and, in advanced cases, obstructive uropathy leading to loss of renal function. Owing to its frequent acute presentation, urolithiasis accounts for a substantial proportion of emergency department admissions, making rapid and accurate diagnosis essential for effective clinical management [1\u0026ndash;3]. Non-contrast computed tomography (NCCT) is currently regarded as the gold standard imaging modality for the evaluation of kidney stones, owing to its high sensitivity and specificity in detecting calculi, as well as its ability to accurately assess stone size, location, density, and anatomical relationships [4, 5]. Despite these advantages, manual interpretation of NCCT images remains time-consuming and prone to inter-observer variability. Factors such as increased workload, limited availability of experienced radiologists, and time pressure, particularly in emergency settings, may adversely affect diagnostic accuracy. [6]. Moreover, small stone size, low-contrast appearance, and complex anatomical variations can further contribute to missed or misinterpreted findings in routine clinical practice [7].\u003c/p\u003e \u003cp\u003eThese limitations have prompted growing interest in automated and reliable tools to support radiological assessment. In recent years, artificial intelligence (AI), particularly deep learning\u0026ndash;based approaches using convolutional neural networks (CNNs), has demonstrated remarkable performance in medical image analysis tasks, including classification, detection, and segmentation [8, 9] Several studies have applied AI-based techniques to kidney stone detection and segmentation on NCCT images, reporting encouraging diagnostic performance and potential reductions in clinical workload [10, 11]. However, many previously published studies are limited by small sample sizes, single-center datasets, class imbalance, and insufficient consideration of clinical integration and real-time applicability [12, 13]. In addition, issues related to model explainability and deployment within clinical decision support systems have often been inadequately addressed. More recent investigations suggest that deep learning models trained on large, diverse datasets and designed with practical clinical workflows in mind may yield more robust and generalizable results [14].\u003c/p\u003e \u003cp\u003eTherefore, the present study aimed to develop a deep learning\u0026ndash;based model for the automatic segmentation and size estimation of kidney stones from NCCT images using a large open-access dataset. Furthermore, the trained model was integrated into a web-based clinical decision support system to facilitate practical deployment and to support diagnostic decision-making in routine urological practice.\u003c/p\u003e"},{"header":"Materials and Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eDataset and Study Design\u003c/h2\u003e \u003cp\u003eIn this study, a deep learning\u0026ndash;based approach was developed for the automatic segmentation of kidney stones from computed tomography (CT) images. The dataset consisted of CT images paired with corresponding binary segmentation masks indicating stone regions. Data organization and processing were performed in accordance with recommended best practices for medical image analysis to ensure reproducibility and methodological transparency [8].\u003c/p\u003e \u003cp\u003eInput data included common raster image formats (PNG, JPG, TIFF) and DICOM (.dcm) files. For DICOM images, pixel intensities were normalized using VOI-LUT transformation and subsequently converted into a three-channel representation compatible with the model input. This preprocessing step aimed to reduce intensity variability arising from different CT acquisition protocols [15].\u003c/p\u003e \u003cp\u003eTo minimize data leakage and improve patient-level generalizability, a group-based data splitting strategy was applied whenever patient-related identifiers could be inferred. This ensured that images originating from the same patient were not distributed across training, validation, and test subsets simultaneously. The dataset was divided into training (70%), validation (20%), and test (10%) sets, following commonly accepted practices in medical artificial intelligence studies [12].\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eModel Architecture\u003c/h3\u003e\n\u003cp\u003eKidney stone segmentation was performed using a U-Net\u0026thinsp;+\u0026thinsp;+\u0026thinsp;architecture. Compared to the classical U-Net, U-Net\u0026thinsp;+\u0026thinsp;+\u0026thinsp;employs nested skip connections that enable more effective multi-scale feature aggregation, which is particularly advantageous for segmenting small and low-contrast targets [16].\u003c/p\u003e \u003cp\u003eAn ImageNet-pretrained EfficientNet-B3 backbone was used as the network's encoder. EfficientNet architectures are widely adopted in medical imaging due to their favorable balance between parameter efficiency and strong feature extraction capability [17]. Dropout was applied in the decoder pathway to reduce overfitting. The overall model architecture is illustrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e\n\u003ch3\u003ePreprocessing, Data Augmentation, and Mixing Strategies\u003c/h3\u003e\n\u003cp\u003eAll images were resized to match the model input dimensions while preserving the original aspect ratio using a letterbox/padding approach. Images were then normalized using ImageNet mean and standard deviation values before model ingestion.\u003c/p\u003e \u003cp\u003eTo enhance robustness against variations in acquisition conditions and anatomical appearance, strong data augmentation was applied during training. Augmentation techniques included random brightness and contrast adjustment, noise injection, gamma correction, perspective transformation, elastic deformation, and horizontal and vertical flipping. Such augmentation strategies have been shown to improve generalization performance in medical image segmentation tasks substantially [18].\u003c/p\u003e \u003cp\u003eTo further mitigate class imbalance and reduce overfitting, probabilistic MixUp and CutMix data-mixing strategies were applied during training. These approaches increase sample diversity and encourage smoother decision boundaries, contributing to more stable and generalizable learning. [18]\u003c/p\u003e\n\u003ch3\u003eTraining Configuration and Overfitting Prevention\u003c/h3\u003e\n\u003cp\u003eModel training was conducted using the PyTorch framework, with AdamW employed as the optimization algorithm. To control overfitting, weight decay, gradient clipping, and early stopping were applied. Learning rate scheduling was performed using CosineAnnealingWarmRestarts, and Stochastic Weight Averaging (SWA) was applied during the later stages of training to improve generalization by averaging model weights across epochs [19].\u003c/p\u003e \u003cp\u003eAutomatic Mixed Precision (AMP) was utilized throughout training to improve computational efficiency and reduce memory consumption. During validation, the probability threshold used to binarize the segmentation output was not fixed; instead, a threshold sweep was performed on the validation set to identify the value that maximized the Dice (F1) score. This strategy is particularly appropriate for segmentation problems with imbalanced pixel distributions [20].\u003c/p\u003e\n\u003ch3\u003eTesting and Evaluation Metrics\u003c/h3\u003e\n\u003cp\u003eModel performance was evaluated exclusively on the held-out test set. During inference, predictions were enhanced using test-time augmentation (TTA), where horizontally and vertically flipped versions of each image were processed, and the resulting probability maps were averaged. This approach improves prediction stability and robustness [21].\u003c/p\u003e\n\u003cp\u003eSegmentation performance was assessed using pixel-level Dice (F1-score) and Intersection over Union (IoU) metrics, as well as ROC and precision–recall (PR) curve–based area under the curve (AUC) values. These metrics are widely accepted for evaluating segmentation quality, particularly for small target structures such as kidney stones [22].\u003c/p\u003e\n\u003cp\u003eTo generate clinically interpretable outputs, a subset of positive test cases was visualized by overlaying predicted masks, contours, bounding boxes, and equivalent circular diameter measurements onto the original CT images.\u003c/p\u003e\n\u003cdiv id=\"Sec8\"\u003e\n \u003ch2\u003eWeb-Based Application\u003c/h2\u003e\n \u003cp\u003eThe developed segmentation model was deployed as a web-based clinical decision support tool (www.kidney-stone-segmentation.com) to facilitate practical use scenarios. The application allows users to upload CT images and receive real-time model outputs through an interactive interface. Uploaded images are processed using the same preprocessing pipeline as in training, and the model outputs probability maps and binary segmentation masks. Results are displayed visually with stone contours and measurement annotations. To ensure safe clinical use, the interface includes a disclaimer stating that the system does not replace physician judgment. An example screenshot of the web interface is shown in Fig.\u0026nbsp;2.\u003c/p\u003e\n\u003c/div\u003e"},{"header":"Results","content":"\u003cp\u003eThe quantitative performance of the proposed kidney stone segmentation model is summarized in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, which reports the precision, recall, PR-AUC, Dice (F1), and IoU values obtained on the independent test set. The model achieved 76.66% precision and 88.36% recall, indicating high sensitivity for kidney stone detection. The PR-AUC of 90.58% demonstrates strong discriminative performance under class-imbalanced conditions, as shown in the precision\u0026ndash;recall curve in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e. In terms of spatial overlap, the model yielded a Dice (F1) score of 82.10% and an IoU of 69.63%, reflecting accurate pixel-wise agreement between predicted masks and ground truth annotations. Representative qualitative examples of kidney stone segmentations overlaid on CT images are presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e, further supporting the quantitative results.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003ePerformance metrics of the model\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"5\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003ePrecision\u003c/p\u003e \u003cp\u003e(%)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRecall\u003c/p\u003e \u003cp\u003e(%)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePR-AUC (%)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eDice (%)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eIoU (%)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e76,66\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e88,36\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e90,58\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e82,10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c5\"\u003e \u003cp\u003e69,63\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Discussions","content":"\u003cp\u003eIn this study, a deep learning\u0026ndash;based framework was developed for the automatic segmentation of kidney stones from non-contrast computed tomography (NCCT) images, and its performance was evaluated using both pixel-level overlap metrics and precision\u0026ndash;recall\u0026ndash;based measures. The results demonstrate that the proposed model achieves a favorable balance between sensitivity and segmentation accuracy, supporting its potential as a clinically relevant decision-support tool. In particular, the high PR-AUC value highlights the model\u0026rsquo;s robustness to substantial class imbalance, a common characteristic of kidney stone segmentation tasks in which stone pixels represent only a small fraction of the image [8].\u003c/p\u003e \u003cp\u003eNon-contrast CT remains the imaging modality of choice for evaluating nephrolithiasis due to its superior sensitivity and detailed anatomical depiction [23]. However, accurately interpreting NCCT images can be challenging in routine clinical practice, especially in high-workload environments and emergency settings. Small stones, low contrast relative to surrounding tissues, and complex anatomical variations may increase the likelihood of missed or inconsistent detections [6]. In this context, automated segmentation methods can provide consistent and objective delineation of stone regions, potentially reducing inter-observer variability and supporting radiologists during image interpretation.\u003c/p\u003e \u003cp\u003eSeveral deep learning\u0026ndash;based approaches for kidney stone detection and segmentation have been reported in recent years. Prior studies have explored convolutional neural network\u0026ndash;based frameworks for automated stone detection, volumetric segmentation, and stone burden assessment on CT images, reporting encouraging diagnostic performance [10, 11, 24]. More recent work has investigated advanced architectures, including hybrid and attention-based models, to improve robustness and segmentation accuracy further [25, 26]. Compared with these approaches, the present study demonstrates competitive performance, particularly in precision\u0026ndash;recall\u0026ndash;based evaluation, which is increasingly recognized as a more informative metric than ROC-based measures in highly imbalanced medical imaging problems [27].\u003c/p\u003e \u003cp\u003eBeyond algorithmic performance, the clinical utility of artificial intelligence systems depends on their robustness, generalizability, and ease of integration into existing workflows. Previous methodological reviews emphasize that strong internal validation results alone are insufficient for clinical adoption, and that external validation and real-world testing are critical next steps [28, 29]. In the present study, integrating the segmentation model into a web-based interface represents an initial step toward practical deployment, enabling real-time visualization of segmentation outputs and facilitating interaction with end users in a clinical context.\u003c/p\u003e \u003cp\u003eExplainability and transparency are also increasingly recognized as key determinants of clinician trust in artificial intelligence\u0026ndash;based systems. Although the current work focuses primarily on segmentation performance, future incorporation of explainable AI techniques, such as saliency maps or gradient-based visualization methods, may further enhance interpretability and support clinical acceptance [29, 30]. In addition, recent methodological studies highlight that combining overlap-based metrics with precision\u0026ndash;recall\u0026ndash;oriented evaluation provides a more comprehensive assessment of segmentation performance, particularly for small target structures such as kidney stones [27].\u003c/p\u003e \u003cp\u003eSeveral limitations of this study should be acknowledged. First, the model was evaluated on an internal test dataset derived from publicly available imaging sources, which may not fully capture the variability encountered in routine clinical practice. Second, the current framework focuses on binary stone segmentation and does not explicitly address clinically relevant attributes such as stone composition, anatomical location, or volumetric burden. Finally, although a web-based deployment was implemented, prospective and multicenter validation studies are required to assess the model\u0026rsquo;s real-world impact and generalizability.\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eIn this study, a deep learning\u0026ndash;based approach was developed for the automatic segmentation of kidney stones from non-contrast computed tomography (NCCT) images. It demonstrated robust performance on the internal test dataset, with reliable pixel-level delineation and strong discriminative capability under class-imbalanced conditions. The deployment of the model as a web-based application (www.kidney-stone-segmentation.com) represents an important step toward clinical translation by enabling real-time visualization of segmentation outputs through an accessible interface. Although external validation and prospective multicenter studies are required to confirm its generalizability and clinical impact, the results of this study suggest that artificial intelligence\u0026ndash;driven segmentation systems have the potential to support radiological assessment and enhance urological diagnostic workflows.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eEthics\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eEthical approval was not required for this study.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFootnotes\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthorship Contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConcept and design of the study: R.B.E., P.D., T.T.; Data collection: M.E.G., M.E.E, İ.T.G.; Data preprocessing: R.B.E., İ.T.G.; AI model development and training: P.D., M.B., Y.B..; Statistical analysis and model evaluation: R.B.E., M.E.E.; Software development and coding: P.D., M.E.G.; Manuscript writing: P.D., M.E.G., M.B., Y.B..; Review and editing: R.B.E., İ.T.G., M.E.E., T.T.; Project supervision: T.T..\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflict of Interest:\u003c/strong\u003e No conflict of interest was declared by the authors.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFinancial Disclosure:\u003c/strong\u003e The authors declared that this study received no financial support\u003c/p\u003e\u003ch2\u003eData Availability\u003c/h2\u003e\u003cp\u003eThe dataset used in this study is publicly available from an open-access repository. Processed data and model outputs are available from the corresponding author upon reasonable request.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eFinger M, Finger E, Bellucci A, Malieckal DA. Medical management for the prevention of kidney stones. Postgrad Med J. 2023;99(1169):112\u0026ndash;8. doi: 10.1136/postgradmedj-2021-140971.\u003c/li\u003e\n\u003cli\u003eScales CD, Jr., Smith AC, Hanley JM, Saigal CS, Urologic Diseases in America P. Prevalence of kidney stones in the United States. Eur Urol. 2012;62(1):160\u0026ndash;5. doi: 10.1016/j.eururo.2012.03.052.\u003c/li\u003e\n\u003cli\u003eSorokin I, Mamoulakis C, Miyazawa K, Rodgers A, Talati J, Lotan Y. Epidemiology of stone disease across the world. World J Urol. 2017;35(9):1301\u0026ndash;20. doi: 10.1007/s00345-017-2008-6.\u003c/li\u003e\n\u003cli\u003eTurk C, Petrik A, Sarica K, Seitz C, Skolarikos A, Straub M, et al. EAU Guidelines on Diagnosis and Conservative Management of Urolithiasis. Eur Urol. 2016;69(3):468\u0026ndash;74. doi: 10.1016/j.eururo.2015.07.040.\u003c/li\u003e\n\u003cli\u003eWaite S, Scott J, Gale B, Fuchs T, Kolla S, Reede D. Interpretive Error in Radiology. AJR Am J Roentgenol. 2017;208(4):739\u0026ndash;49. doi: 10.2214/AJR.16.16963.\u003c/li\u003e\n\u003cli\u003eBrady AP. Error and discrepancy in radiology: inevitable or avoidable? Insights Imaging. 2017;8(1):171\u0026ndash;82. doi: 10.1007/s13244-016-0534-1.\u003c/li\u003e\n\u003cli\u003eMoore CL, Daniels B, Ghita M, Gunabushanam G, Luty S, Molinaro AM, et al. Accuracy of reduced-dose computed tomography for ureteral stones in emergency department patients. Ann Emerg Med. 2015;65(2):189\u0026ndash;98 e2. doi: 10.1016/j.annemergmed.2014.09.008.\u003c/li\u003e\n\u003cli\u003eLitjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, et al. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60\u0026ndash;88. doi: 10.1016/j.media.2017.07.005.\u003c/li\u003e\n\u003cli\u003eEsteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. 2019;25(1):24\u0026ndash;9. doi: 10.1038/s41591-018-0316-z.\u003c/li\u003e\n\u003cli\u003eElton DC, Turkbey EB, Pickhardt PJ, Summers RM. A deep learning system for automated kidney stone detection and volumetric segmentation on noncontrast CT scans. Med Phys. 2022;49(4):2545\u0026ndash;54. doi: 10.1002/mp.15518.\u003c/li\u003e\n\u003cli\u003eCui Y, Sun Z, Ma S, Liu W, Wang X, Zhang X, et al. Automatic Detection and Scoring of Kidney Stones on Noncontrast CT Images Using S.T.O.N.E. Nephrolithometry: Combined Deep Learning and Thresholding Methods. Mol Imaging Biol. 2021;23(3):436\u0026ndash;45. doi: 10.1007/s11307-020-01554-0.\u003c/li\u003e\n\u003cli\u003ePark SH, Han K. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction. Radiology. 2018;286(3):800\u0026ndash;9. doi: 10.1148/radiol.2017171920.\u003c/li\u003e\n\u003cli\u003eKelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195. doi: 10.1186/s12916-019-1426-2.\u003c/li\u003e\n\u003cli\u003eMazurowski MA, Buda M, Saha A, Bashir MR. Deep learning in radiology: An overview of the concepts and a survey of the state of the art with focus on MRI. J Magn Reson Imaging. 2019;49(4):939\u0026ndash;54. doi: 10.1002/jmri.26534.\u003c/li\u003e\n\u003cli\u003eDapamede T, Li F, Khosravi B, Purkayastha S, Trivedi H, Gichoya J. DICOM LUT is a Key Step in Medical Image Preprocessing Towards AI Generalizability. J Imaging Inform Med. 2025;38(5):3040\u0026ndash;8. doi: 10.1007/s10278-025-01418-5.\u003c/li\u003e\n\u003cli\u003eZhou Z, Siddiquee MMR, Tajbakhsh N, Liang J. UNet++: A Nested U-Net Architecture for Medical Image Segmentation. Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018;11045:3\u0026ndash;11. doi: 10.1007/978-3-030-00889-5_1.\u003c/li\u003e\n\u003cli\u003eKeishing V, Ahn ES, Khashim Z, Loudermilk AD, Erickson BJ. Towards Automated Craniosynostosis Diagnosis Using EfficientNet-Based Artificial Intelligence Models: A Two-Class and Multi-Class Approach. J Imaging Inform Med. 2025. doi: 10.1007/s10278-025-01694-1.\u003c/li\u003e\n\u003cli\u003eZeng W. Image data augmentation techniques based on deep learning: A survey. Math Biosci Eng. 2024;21(6):6190\u0026ndash;224. doi: 10.3934/mbe.2024272.\u003c/li\u003e\n\u003cli\u003eKunkel KJ, Anwaruddin S. Papillary Muscle Rupture Due to Delayed STEMI Presentation in a Patient Self-Isolating for Presumed COVID-19. JACC Case Rep. 2020;2(10):1633\u0026ndash;6. doi: 10.1016/j.jaccas.2020.06.036.\u003c/li\u003e\n\u003cli\u003eYouden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32\u0026ndash;5. doi: 10.1002/1097-0142(1950)3:1\u0026lt;32::aid-cncr2820030106\u0026gt;3.0.co;2-3.\u003c/li\u003e\n\u003cli\u003eRylaarsdam LE, Johnecheck GN, Looyenga BD, Louters LL. GLUT1 is associated with sphingolipid-organized, cholesterol-independent domains in L929 mouse fibroblast cells. Biochimie. 2019;162:88\u0026ndash;96. doi: 10.1016/j.biochi.2019.04.010.\u003c/li\u003e\n\u003cli\u003eRodriguez-Espinosa N, Gonzalez-Colaco Harmand M, Miranda Saavedra F, Galvan Gonzalez MA, Plasencia Nunez M, Aldea Perona AM, et al. [Prescription of acetylcholinesterase inhibitors and memantine in the Canary Islands, comparison with the Spanish population.]. Rev Esp Salud Publica. 2021;95. \u003c/li\u003e\n\u003cli\u003eSmith RC, Rosenfield AT, Choe KA, Essenmacher KR, Verga M, Glickman MG, et al. Acute flank pain: comparison of non-contrast-enhanced CT and intravenous urography. Radiology. 1995;194(3):789\u0026ndash;94. doi: 10.1148/radiology.194.3.7862980.\u003c/li\u003e\n\u003cli\u003eFrenken MWE, Thijssen KMJ, Vlemminx MWC, van den Heuvel ER, Westerhuis M, Oei SG. Clinical evaluation of electrohysterography as method of monitoring uterine contractions during labor: A propensity score matched study. Eur J Obstet Gynecol Reprod Biol. 2021;259:178\u0026ndash;84. doi: 10.1016/j.ejogrb.2021.02.029.\u003c/li\u003e\n\u003cli\u003eWu N, Gao H, Xu Q, Zhang Z. Characterization and Whole-Genome Analysis of a Zearalenone-Degrading Stappia sp. WLB 29. Curr Microbiol. 2022;79(6):179. doi: 10.1007/s00284-022-02874-w.\u003c/li\u003e\n\u003cli\u003eHada AM, Potara M, Astilean S, Cordaro A, Neri G, Malanga M, et al. Linezolid nanoAntiobiotics and SERS-nanoTags based on polymeric cyclodextrin bimetallic core-shell nanoarchitectures. Carbohydr Polym. 2022;293:119736. doi: 10.1016/j.carbpol.2022.119736.\u003c/li\u003e\n\u003cli\u003eSaito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One. 2015;10(3):e0118432. doi: 10.1371/journal.pone.0118432.\u003c/li\u003e\n\u003cli\u003eMiller H, Johns L. Interoperability of Electronic Health Records: A Physician-Driven Redesign. Manag Care. 2018;27(1):37\u0026ndash;40. \u003c/li\u003e\n\u003cli\u003eShi X, Du R, Zhang J, Lei Y, Guo H. Evaluation of the anti-cancer potential of Cedrus deodara total lignans by inducing apoptosis of A549 cells. BMC Complement Altern Med. 2019;19(1):281. doi: 10.1186/s12906-019-2682-6.\u003c/li\u003e\n\u003cli\u003eWei J, Wang L, Yang X. Game analysis on the evolution of COVID-19 epidemic under the prevention and control measures of the government. PLoS One. 2020;15(10):e0240961. doi: 10.1371/journal.pone.0240961.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"world-journal-of-urology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"wjur","sideBox":"Learn more about [World Journal of Urology](https://link.springer.com/journal/345)","snPcode":"345","submissionUrl":"https://submission.nature.com/new-submission/345/3","title":"World Journal of Urology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Kidney stone, computed tomography, deep learning, artificial intelligence, segmentation","lastPublishedDoi":"10.21203/rs.3.rs-8937650/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8937650/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eObjective\u003c/h2\u003e \u003cp\u003eUrolithiasis is a common condition in urological practice, and accurate imaging is essential for proper diagnosis and management. Computed tomography (CT) is widely accepted as the gold standard for evaluating kidney stones due to its high sensitivity in assessing stone location, size, and morphology. However, manual interpretation of CT images is time-consuming and subject to observer-dependent variability. This study aimed to develop a deep learning\u0026ndash;based model for automatic kidney stone segmentation and size estimation from CT images and to integrate this model into a web-based clinical decision support system.\u003c/p\u003e\u003ch2\u003eMethods\u003c/h2\u003e \u003cp\u003eAn open-access dataset consisting of 3,584 CT images and corresponding kidney stone segmentation masks was used. The dataset was divided into training (70%), validation (20%), and test (10%) subsets. Image preprocessing and model development were performed using Python and the TensorFlow framework. A U-Net\u0026thinsp;+\u0026thinsp;+\u0026thinsp;architecture was employed for kidney stone segmentation. Model training was conducted in the Google Colab environment. Stone size estimation was performed using the equivalent circular diameter method based on the segmented stone area. Model performance was evaluated using Precision, Recall, Precision\u0026ndash;Recall Area Under the Curve (PR-AUC), Intersection over Union (IoU), and Dice/F1-score metrics. The trained model was additionally deployed via a web-based interface to demonstrate potential clinical applicability.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003eOn the test dataset, the proposed model achieved a PR-AUC of 90.58%. The Dice/F1-score and IoU values were 82.10% and 69.63%, respectively. Precision and Recall were 76.66% and 88.36%. Qualitative evaluation demonstrated accurate localization and segmentation of kidney stones, including small calculi.\u003c/p\u003e\u003ch2\u003eConclusion\u003c/h2\u003e \u003cp\u003eThis study presents an effective deep learning\u0026ndash;based approach for automated kidney stone segmentation and size estimation from CT images. The favorable quantitative results and consistent visual performance suggest that the proposed system may support clinical decision-making by reducing workload and improving diagnostic consistency in routine urological practice.\u003c/p\u003e","manuscriptTitle":"Kidney Stone Segmentation from Computed Tomography Images Using Computer Vision Techniques","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-08 17:29:58","doi":"10.21203/rs.3.rs-8937650/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"editorInvitedReview","content":"","date":"2026-03-17T08:16:27+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"118880875630731713152113522745547736227","date":"2026-03-02T15:50:39+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-03-02T14:56:16+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-02-24T02:50:25+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-02-23T16:49:18+00:00","index":"","fulltext":""},{"type":"submitted","content":"World Journal of Urology","date":"2026-02-22T07:40:49+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"world-journal-of-urology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"wjur","sideBox":"Learn more about [World Journal of Urology](https://link.springer.com/journal/345)","snPcode":"345","submissionUrl":"https://submission.nature.com/new-submission/345/3","title":"World Journal of Urology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"18929861-c8b7-4ed5-a1af-8d99195804f2","owner":[],"postedDate":"March 8th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-03-08T17:29:58+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-08 17:29:58","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8937650","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8937650","identity":"rs-8937650","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00