Deep Learning Approach for the Diagnosis of Pediatric Pneumonia Using Chest X-ray Imaging

preprint OA: closed
Full text JSON View at publisher
Full text 65,115 characters · extracted from preprint-html · click to expand
Deep Learning Approach for the Diagnosis of Pediatric Pneumonia Using Chest X-ray Imaging | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Deep Learning Approach for the Diagnosis of Pediatric Pneumonia Using Chest X-ray Imaging Fatemeh Hosseinabadi, Mohammad Mojtaba Rohani This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7240894/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Pediatric pneumonia remains a leading cause of morbidity and mortality in children worldwide. Timely and accurate diagnosis is critical but often challenged by limited radiological expertise and the physiological and procedural complexity of pediatric imaging. This study investigates the performance of state-of-the-art convolutional neural network (CNN) architectures—ResNetRS, RegNet, and EfficientNetV2—using transfer learning for the automated classification of pediatric chest X-ray images as either pneumonia or normal.A curated subset of 1,000 chest X-ray images was extracted from a publicly available dataset originally comprising 5,856 pediatric images. All images were preprocessed and labeled for binary classification. Each model was fine-tuned using pretrained ImageNet weights and evaluated based on accuracy and sensitivity. RegNet achieved the highest classification performance with an accuracy of 92.4% and a sensitivity of 90.1%, followed by ResNetRS (accuracy: 91.9%, sensitivity: 89.3%) and EfficientNetV2 (accuracy: 88.5%, sensitivity: 88.1%). Artificial Intelligence and Machine Learning Pediatric Pneumonia Chest X-ray RegNet ResNetRS EfficientNetV2 Figures Figure 1 Figure 2 Figure 3 Introduction Pediatric pneumonia remains one of the most critical health challenges affecting children globally, accounting for over 700,000 deaths annually in children under the age of five, according to the World Health Organization. It is caused by a variety of pathogens, including bacteria such as Streptococcus pneumoniae, viruses like respiratory syncytial virus (RSV), and, less commonly, fungi. The clinical presentation of pneumonia in children often includes fever, cough, rapid breathing, and chest indrawing, but these symptoms can overlap with other common pediatric illnesses such as asthma or bronchiolitis, complicating clinical diagnosis. Additionally, young children may not be able to effectively communicate their symptoms, making objective diagnostic tools all the more essential. Prompt diagnosis is vital not only to initiate timely treatment but also to avoid unnecessary antibiotic usage and prevent disease progression, hospitalization, or death [ 1 ]. Medical imaging, particularly chest X-ray radiography, is the most widely used diagnostic tool for confirming suspected cases of pneumonia in children [ 2 – 4 ]. It offers a non-invasive means of visualizing lung infiltrates, consolidation, and pleural effusion—hallmarks of pneumonia. However, interpreting pediatric chest X-rays poses unique challenges. Children have anatomical differences compared to adults, such as smaller thoracic cavities and less-defined lung structures, which can complicate image interpretation. Moreover, acquiring high-quality images can be technically difficult due to patient movement, lack of cooperation, or suboptimal exposure settings. In resource-limited settings, the situation is further exacerbated by a shortage of trained radiologists or pediatric imaging specialists, leading to diagnostic delays or inaccuracies. Consequently, there is a growing demand for automated and reliable diagnostic support systems. In recent years, machine learning (ML) and deep learning (DL) technologies have shown substantial promise in addressing such diagnostic challenges. Machine learning encompasses a broad set of algorithms that learn from data to make predictions or decisions without being explicitly programmed [ 5 – 8 ]. Deep learning, a subfield of ML, involves neural networks with multiple layers that automatically extract complex features from raw data. Among various DL architectures, convolutional neural networks (CNNs) have gained prominence for their exceptional performance in image-related tasks. CNNs are capable of learning hierarchical patterns, from simple edges and textures to more abstract structures, making them particularly suitable for medical image analysis. These models not only reduce the burden on clinicians but also introduce consistency and objectivity to the diagnostic process. The application of deep learning in medicine has rapidly expanded, demonstrating state-of-the-art results across a range of clinical domains. In radiology, CNN-based models have been developed to detect diabetic retinopathy in retinal images, classify skin lesions in dermatology, and identify lung nodules or tuberculosis in chest X-rays. These systems can assist in triage, serve as second readers, or even function as autonomous diagnostic tools in environments lacking human expertise. In addition to classification, deep learning has been used for medical image segmentation, anomaly detection, and survival prediction [ 9 – 12 ]. The integration of AI in healthcare promises to improve diagnostic speed, accuracy, and accessibility, particularly in underserved or high-volume clinical settings. Building on these advancements, this study explores the use of three cutting-edge CNN architectures—ResNetRS, RegNet, and EfficientNetV2—for the detection of pediatric pneumonia using chest X-ray images, with the goal of improving diagnostic outcomes through automated image classification. In this study, we present a comparative analysis of three modern convolutional neural network architectures—ResNetRS, RegNet, and EfficientNetV2—for the task of pediatric pneumonia classification using chest X-ray images. We create a balanced subset of 1,000 images from a publicly available pediatric chest radiograph dataset and apply standardized preprocessing and augmentation techniques to improve model robustness. Leveraging transfer learning with pretrained ImageNet weights, each model is fine-tuned and evaluated based on its classification accuracy and sensitivity. Related Work The use of deep learning in medical imaging has garnered significant attention over the past decade, with numerous studies demonstrating its efficacy in diagnosing thoracic diseases, including pneumonia. One of the most influential works in this domain is by [ 1 ], which introduced a large, publicly available pediatric chest X-ray dataset comprising over 5,000 images labeled as pneumonia (bacterial or viral) or normal. Using a transfer learning approach with the InceptionV3 architecture, their model achieved over 90% accuracy in distinguishing pneumonia from normal cases, showcasing the potential of deep learning in pediatric diagnostic imaging. Following this foundational work, [ 13 ] developed CheXNet, a 121-layer DenseNet trained on the NIH ChestX-ray14 dataset and achieved radiologist-level performance in detecting pneumonia in adult chest X-rays. While not specifically targeted at pediatric patients, this work laid the groundwork for the use of deeper architectures in chest pathology detection. Building upon these advances, researchers have sought to adapt and evaluate CNNs specifically for pediatric populations, recognizing the anatomical and pathological differences. Researchers in [ 14 ] explored the performance of VGG16 and ResNet50 architectures in classifying pediatric chest X-rays and emphasized the importance of dataset balance and augmentation in achieving high sensitivity. Similarly, [ 15 ] applied a deep convolutional neural network for pneumonia diagnosis and highlighted the potential of automated systems in reducing diagnostic workload. More recently, [ 16 ] used a combination of DenseNet201 and Grad-CAM visualization to not only classify pediatric pneumonia but also provide explainability by highlighting the affected lung regions, aiding clinical interpretability. Several comparative studies have also evaluated the performance of multiple CNN models in pediatric pneumonia detection. For instance, [ 17 ] proposed a hybrid model combining CNN feature extraction with support vector machines (SVMs) and achieved notable performance improvements over standalone models. Additionally, [ 18 ] conducted an in-depth review of deep learning methods for pneumonia detection and noted that while transfer learning significantly enhances performance, model generalizability across populations and image acquisition settings remains a critical challenge. Despite the promising results reported across these studies, most research has focused on a limited set of models or datasets, and comparative analyses using more recent architectures are relatively scarce. In response, our study explores and benchmarks three state-of-the-art CNNs—ResNetRS, RegNet, and EfficientNetV2—on pediatric chest X-ray images to assess their effectiveness in real-world diagnostic settings. By incorporating modern training procedures and robust evaluation metrics, this work contributes to the growing body of literature on AI-based pediatric pneumonia diagnosis and offers insights into the clinical applicability of advanced deep learning models. Method Dataset This study utilized a curated subset of a publicly available pediatric chest X-ray (CXR) dataset originally compiled by Kermany et al. (2018). The complete dataset contains 5,856 anterior-posterior chest radiographs of pediatric patients aged between one and five years, categorized into 3,883 pneumonia cases (including 2,538 bacteria and 1,345 viral) and 1,349 normal cases. For our experiments, we selected a representative subset of 1,000 images through stratified random sampling to preserve the original class distribution across pneumonia and normal categories. All images were visually inspected to exclude low-quality or unreadable scans. The Institutional Review Board (IRB)/Ethics Committee approvals were obtained for this dataset [ 1 , 19 ]. To ensure consistency and compatibility with pretrained convolutional neural network (CNN) models, all images were resized to 224 × 224 pixels. Preprocessing included normalization using the standard ImageNet mean and standard deviation values. To enhance generalization and reduce overfitting, we applied data augmentation techniques such as random horizontal flipping, small rotations (± 10 degrees), zooming, and brightness adjustments. The dataset was split into 80% training and 20% testing sets, with stratification to maintain class balance. Five-fold cross-validation was used on the training set to evaluate model stability and performance across different data splits. Deep Learning Models In this study, we employed three state-of-the-art convolutional neural network (CNN) architectures—ResNetRS, RegNet, and EfficientNetV2—to perform binary classification of pediatric chest X-ray images into pneumonia or normal categories. All models were pretrained on the ImageNet dataset to leverage rich feature representations and were subsequently fine-tuned on our pediatric X-ray dataset using transfer learning. ResNetRS ResNetRS (ResNet-Ranked Scaled) [ 20 ] is a family of deep residual networks proposed by Google Research in 2021 as an improved variant of the traditional ResNet architecture. While standard ResNets use a fixed design, ResNetRS introduces several refinements aimed at enhancing training stability and accuracy. These include: Standardized convolutions and weight standardization, which normalize the weight distributions during training to stabilize gradients and improve convergence. Squeeze-and-Excitation (SE) blocks, which adaptively recalibrate channel-wise feature responses by modeling interdependencies between channels. Stochastic depth, a regularization technique that randomly drops residual connections during training, thereby improving model robustness and preventing overfitting. Swish activation function and adaptive scaling of width and depth across blocks. We utilized the ResNetRS-50 variant, which has 50 layers and offers a strong balance between accuracy and computational efficiency. The final classification layer was replaced with a dense output layer containing a single neuron with sigmoid activation. Only the top convolutional block and batch normalization layers were unfrozen during fine-tuning, allowing the model to adapt high-level features to the pediatric pneumonia domain while preserving the generalized representations learned from ImageNet. RegNet RegNet (Regularized Network Design Space) [ 21 ], introduced by Facebook AI Research, is a family of CNNs derived from a simple design space where width, depth, and bottleneck ratios are parameterized using quantized linear functions. This structured approach leads to architectures that are computationally efficient and scalable without compromising accuracy. RegNet differs from earlier models by emphasizing design regularity over manual tuning or neural architecture search. The specific variant used in our study is RegNetY-064, which incorporates: Group convolutions to reduce parameter redundancy while maintaining representational power. Bottleneck residual blocks that compress and expand features for efficient information flow. Squeeze-and-Excitation (SE) attention blocks to model channel-wise interdependencies and improve feature calibration. Efficient stage-wise block scaling, where both width and depth are increased progressively throughout the network. Due to its modular and systematic architecture, RegNetY is well-suited for transfer learning tasks in medical imaging. The classification head was replaced with a dropout layer followed by a sigmoid-activated neuron for binary output. Full model fine-tuning was performed with a gradual unfreezing strategy across the training epochs. EfficientNetV2 EfficientNetV2 [ 22 ], developed by Google Brain, is the successor to EfficientNet and is designed for both accuracy and speed. It achieves faster training with higher accuracy by incorporating two key innovations: Fused-MBConv layers, which replace the standard MBConv (Mobile Inverted Bottleneck Convolution) blocks in the early stages with regular convolution + batch normalization layers, resulting in faster convergence and reduced memory consumption. Progressive learning, a training approach that gradually increases the input image size and data augmentation strength throughout training. This helps stabilize early learning and reduce overfitting, particularly in small datasets. EfficientNetV2 uses compound scaling to balance network depth, width, and resolution. We adopted the EfficientNetV2-Small (EffNetV2-S) variant due to its excellent performance on lightweight medical tasks and suitability for smaller datasets like pediatric chest X-rays. This model incorporates SE blocks and Swish activation functions, which enhance learning capacity and nonlinearity. We customized the classifier head to include batch normalization, dropout (p = 0.3), and a final sigmoid activation for binary classification. Unlike earlier EfficientNet models that required extensive tuning for small medical datasets, EfficientNetV2’s improved learning dynamics make it particularly well-suited for our fine-tuning workflow. The entire model was unfrozen during training to enable comprehensive domain adaptation. These three models were selected due to their robust performance in general image classification, their proven transferability, and their ability to handle limited, class-imbalanced datasets—a common constraint in pediatric imaging tasks. Their architectural diversity also allowed us to compare CNNs with different building principles (residual, regularized, and mobile-efficient designs) under a uniform experimental setting. Experiment Setup All models were implemented in PyTorch and trained in an NVIDIA RTX GPU environment. The training configuration included the Adam optimizer with a learning rate of 1e-4, batch size of 32, and up to 30 epochs with early stopping based on validation loss. To further stabilize training, we employed gradient clipping with a maximum norm of 5.0 and used mixed-precision training via automatic mixed precision (AMP). Results We evaluated the performance of ResNetRS, RegNet, and EfficientNetV2 models on the held-out test set derived from our curated pediatric chest X-ray dataset. Each model was trained using 5-fold cross-validation and evaluated based on classification accuracy and sensitivity (recall)—two metrics of particular clinical importance in diagnostic settings. Table 1 Classification performance of the deep learning models Model Accuracy Sensitivity ResNetRS 91.9 89.3 RegNet 92.4 90.1 EfficientNetV2 88.5 88.1 Accuracy reflects overall classification correctness, while sensitivity quantifies the model’s ability to correctly identify pneumonia cases, which is essential in avoiding missed diagnoses. The results are summarized in Table 1 . Among the tested architectures, RegNet achieved the highest classification performance with an accuracy of 92.4% and sensitivity of 90.1%, indicating its strong ability to correctly identify pneumonia cases while maintaining overall reliability. ResNetRS followed closely, achieving an accuracy of 91.9% and sensitivity of 89.3%, which demonstrates consistent performance and generalization. EfficientNetV2 also performed well, with an accuracy of 88.5% and sensitivity of 88.1%, although slightly lower than the other two models. Overall, the results indicate that modern CNN architecture pretrained on large-scale natural image datasets can be successfully adapted for pediatric medical imaging tasks. Among the tested models, RegNet emerges as the most promising candidate for real-world deployment in pneumonia screening tools, balancing predictive performance with architectural efficiency. The confusion matrix presented in Fig. 3 provides a detailed breakdown of the model’s performance on the held-out test set. Out of the total pneumonia cases, the model correctly identified 83 true positives, while 17 cases were missed, resulting in false negatives. Among the normal cases, 54 were correctly classified as true negatives, and 11 were incorrectly predicted as pneumonia, leading to false positives. These results reflect a strong balance between sensitivity and specificity, with the model demonstrating a reliable ability to detect pneumonia while maintaining a low false positive rate. The relatively low number of false negatives is particularly important in clinical contexts, as it reduces the risk of untreated pneumonia progressing to severe complications. Overall, the confusion matrix confirms the model’s robust diagnostic potential for pediatric pneumonia detection. Conclusion In this study, we investigated the performance of three state-of-the-art convolutional neural network architectures—ResNetRS, RegNet, and EfficientNetV2—for the task of automated pediatric pneumonia classification using chest X-ray images. We curated and preprocessed a subset of 1,000 labeled images from a publicly available pediatric dataset and applied transfer learning techniques to adapt each model for binary classification. All models were trained and evaluated using a standardized pipeline involving five-fold cross-validation and an 80 − 20 train-test split. Our results demonstrated that all three models were capable of effectively distinguishing between pneumonia and normal chest radiographs in pediatric patients. Among them, RegNet achieved the highest accuracy (92.4%) and sensitivity (90.1%), followed closely by ResNetRS. EfficientNetV2 also showed competitive performance, although slightly lower than the other two. The confusion matrix further confirmed the models’ ability to minimize false negatives and false positives, both of which are critical in clinical diagnostics. Overall, this study highlighted the viability of modern CNN-based transfer learning approaches in supporting pediatric pneumonia diagnosis. The models we evaluated showed strong diagnostic performance and could serve as reliable tools in real-world settings, particularly in low-resource environments where expert radiological interpretation may be limited. Future work may focus on extending this analysis to larger and more diverse datasets, incorporating multi-class classification (e.g., distinguishing bacterial vs. viral pneumonia), and integrating model predictions into clinical decision-support systems. References Kermany DS, Goldbaum M, Cai W, Valentim CC, Liang H, Baxter SL, McKeown A, Yang G, Wu X, Yan F, Dong J (2018) Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5):1122–1131 Irede EL, Aworinde OR, Lekan OK, Amienghemhen OD, Okonkwo TP, Onivefu AP, Ifijen IH Medical imaging: a critical review on X-ray imaging for the detection of infection. Biomedical Mater Devices 2024 Jul 15:1–45 Çallı E, Sogancioglu E, Van Ginneken B, van Leeuwen KG, Murphy K (2021) Deep learning for chest X-ray analysis: A survey. Med Image Anal 72:102125 Sharifi S, Donyadadi A (2025) Detection and diagnosis of congenital heart disease from chest X-rays with deep learning models. Int J Appl Data Sci Eng Health 1(1):1–9 Kohli PS, Arora S Application of machine learning in disease prediction. In2018 4th International conference on computing communication and automation (ICCCA) 2018 Dec 14 (pp. 1–4). IEEE Norouzi F, Machado BL (2024) Predicting Mental Health Outcomes: A Machine Learning Approach to Depression, Anxiety, and Stress. Int J Appl Data Sci Eng Health 1(2):98–104 Uddin S, Khan A, Hossain ME, Moni MA (2019) Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inf Decis Mak 19(1):1–6 Ghasemi F, Sharifi S (2025) Heart Failure Prediction Using Support Vector Machine. Int J Novel Res Life Sci. Feb 1 Anantharajan S, Gunasekaran S, Subramanian T (2024) MRI brain tumor detection using deep learning and machine learning approaches. Measurement: Sens 31:101026 Whig P, Shadadi E, Kouser S, Alamer L (2025) Machine learning approaches for early detection and management of musculoskeletal conditions. Int J Comput Vis Rob 15(1):104–117 Wang L (2024) Mammography with deep learning for breast cancer detection. Front Oncol 14:1281922 Abbasi H, Afrazeh F, Ghasemi Y, Ghasemi F (2024) A shallow review of artificial intelligence applications in brain disease: stroke, Alzheimer's, and aneurysm. Int J Appl Data Sci Eng Health 1(2):32–43 Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren MP Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:1711.05225. 2017 Nov 14. Stephen O, Sain M, Maduh UJ, Jeong DU (2019) An efficient deep learning approach to pneumonia classification in healthcare. J Healthc Eng 2019(1):4180949 Abiyev RH, Ma’aitaH MK (2018) Deep convolutional neural networks for chest diseases detection. J Healthc Eng 2018(1):4168538 Ali SA, Kumar MS Detecting pneumonia from X-ray images using transfer learning. In2020 IEEE International Conference on Advent Trends in Multidisciplinary Research and Innovation (ICATMRI) 2020 Dec 30 (pp. 1–4). IEEE Toğaçar M, Ergen B, Cömert Z (2020) COVID-19 detection using deep learning models to exploit Social Mimic Optimization and structured chest X-ray images using fuzzy color and stacking approaches. Comput Biol Med 121:103805 Chowdhury ME, Rahman T, Khandakar A, Mazhar R, Kadir MA, Mahbub ZB, Islam KR, Khan MS, Iqbal A, Al Emadi N, Reaz MB (2020) Can AI help in screening viral and COVID-19 pneumonia? Ieee Access 8:132665–132676 Kermany D, Zhang K, Goldbaum M (2018) Labeled optical coherence tomography (oct) and chest x-ray images for classification Mendeley Data, v2 https://doi.org/10.17632/rscbjbr9sj https://nihcc.app.box.com/v/ChestXray-NIHCC Bello I, Fedus W, Du X, Cubuk ED, Srinivas A, Lin TY, Shlens J, Zoph B (2021) Revisiting resnets: Improved training and scaling strategies. Adv Neural Inf Process Syst 34:22614–22627 Radosavovic I, Kosaraju RP, Girshick R, He K, Dollár P Designing network design spaces. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition 2020 (pp. 10428–10436) Tan M, Le Q Efficientnetv2: Smaller models and faster training. InInternational conference on machine learning 2021 Jul 1 (pp. 10096–10106). PMLR Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7240894","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":492446025,"identity":"aab8bf18-b3ae-487b-aa1d-61724d218519","order_by":0,"name":"Fatemeh Hosseinabadi","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA/0lEQVRIiWNgGAWjYBACgwNgEoIOMNgwyIH4Bx4QryWNwRisJYGgFriuNIbEBhAPr5bj7Y8/fCi4I2/OfnjjAYYEu/T5YYcfAm2xk9NtwK7F/swZM8kZBs8Md/akFQC1JOduvJ1mANSSbGx2ALsWgxs5bMw8BocZNxzIMTjA+IM5d+PsBJCWA4nbcGpJf/z5j8Fh+w3n3wD9lVCfbjg7/QMBLQkG0gwGhxM33MgBaTmcIC+dQ8AWkF96DA4n75zxrOBAQsJxww3SOUCGAR6/gELsx5/Dttv5kzd/+JBQLS8/Ox3IqLCTw6UFFSQwIEUu8UC+gRTVo2AUjIJRMBIAAHR/bk8EfeKOAAAAAElFTkSuQmCC","orcid":"","institution":"Assistant Professor of Radiology, Zahedan University of medical Sciences, Iran","correspondingAuthor":true,"prefix":"","firstName":"Fatemeh","middleName":"","lastName":"Hosseinabadi","suffix":""},{"id":492446026,"identity":"cfc32004-0a89-4f1e-ac53-f3013c5a9758","order_by":1,"name":"Mohammad Mojtaba Rohani","email":"","orcid":"","institution":"Department of Radiology, Poursina Hospital, Guilan University of Medical Sciences, Rasht, Iran","correspondingAuthor":false,"prefix":"","firstName":"Mohammad","middleName":"Mojtaba","lastName":"Rohani","suffix":""}],"badges":[],"createdAt":"2025-07-29 08:33:09","currentVersionCode":1,"declarations":{"humanSubjects":true,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":true,"humanSubjectConsent":true,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-7240894/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7240894/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":88004272,"identity":"49dae61e-41e2-4d36-a7d6-61f4a9c0bf8f","added_by":"auto","created_at":"2025-07-31 10:36:04","extension":"jpeg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":657559,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eExamples of Pediatric Chest X-ray. Top: Normal, Down: Pneumonia [1]\u003c/em\u003e\u003c/p\u003e","description":"","filename":"floatimage1.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7240894/v1/edc9eac41a7c117e89053a41.jpeg"},{"id":88004276,"identity":"e1c62108-c07a-450c-8207-e53ccfccd62a","added_by":"auto","created_at":"2025-07-31 10:36:04","extension":"jpeg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":192009,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eResidual block in the original ResNet (left). The RegNet module (right) [20]\u003c/em\u003e\u003c/p\u003e","description":"","filename":"floatimage2.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7240894/v1/f80aef7774a73a2d2f1b1f48.jpeg"},{"id":88004274,"identity":"57829579-43d6-441c-999d-8aaa3cd98165","added_by":"auto","created_at":"2025-07-31 10:36:04","extension":"jpeg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":45774,"visible":true,"origin":"","legend":"\u003cp\u003eConfusion matrix of RegNet model on testing data\u003c/p\u003e","description":"","filename":"floatimage3.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7240894/v1/b63264eee3c4ce40fe5ef7d7.jpeg"},{"id":88004282,"identity":"15502f72-7991-4b44-9d31-66f13f6e1b7f","added_by":"auto","created_at":"2025-07-31 10:36:09","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1213069,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7240894/v1/7d6a1cd9-047d-49ea-9c46-207bcf2e5ae9.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003eDeep Learning Approach for the Diagnosis of Pediatric Pneumonia Using Chest X-ray Imaging\u003c/p\u003e","fulltext":[{"header":"Introduction","content":"\u003cp\u003ePediatric pneumonia remains one of the most critical health challenges affecting children globally, accounting for over 700,000 deaths annually in children under the age of five, according to the World Health Organization. It is caused by a variety of pathogens, including bacteria such as Streptococcus pneumoniae, viruses like respiratory syncytial virus (RSV), and, less commonly, fungi. The clinical presentation of pneumonia in children often includes fever, cough, rapid breathing, and chest indrawing, but these symptoms can overlap with other common pediatric illnesses such as asthma or bronchiolitis, complicating clinical diagnosis. Additionally, young children may not be able to effectively communicate their symptoms, making objective diagnostic tools all the more essential. Prompt diagnosis is vital not only to initiate timely treatment but also to avoid unnecessary antibiotic usage and prevent disease progression, hospitalization, or death [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eMedical imaging, particularly chest X-ray radiography, is the most widely used diagnostic tool for confirming suspected cases of pneumonia in children [\u003cspan additionalcitationids=\"CR3\" citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e–\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]. It offers a non-invasive means of visualizing lung infiltrates, consolidation, and pleural effusion—hallmarks of pneumonia. However, interpreting pediatric chest X-rays poses unique challenges. Children have anatomical differences compared to adults, such as smaller thoracic cavities and less-defined lung structures, which can complicate image interpretation. Moreover, acquiring high-quality images can be technically difficult due to patient movement, lack of cooperation, or suboptimal exposure settings. In resource-limited settings, the situation is further exacerbated by a shortage of trained radiologists or pediatric imaging specialists, leading to diagnostic delays or inaccuracies. Consequently, there is a growing demand for automated and reliable diagnostic support systems.\u003c/p\u003e\u003cp\u003eIn recent years, machine learning (ML) and deep learning (DL) technologies have shown substantial promise in addressing such diagnostic challenges. Machine learning encompasses a broad set of algorithms that learn from data to make predictions or decisions without being explicitly programmed [\u003cspan additionalcitationids=\"CR6 CR7\" citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e–\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]. Deep learning, a subfield of ML, involves neural networks with multiple layers that automatically extract complex features from raw data. Among various DL architectures, convolutional neural networks (CNNs) have gained prominence for their exceptional performance in image-related tasks. CNNs are capable of learning hierarchical patterns, from simple edges and textures to more abstract structures, making them particularly suitable for medical image analysis. These models not only reduce the burden on clinicians but also introduce consistency and objectivity to the diagnostic process.\u003c/p\u003e\u003cp\u003eThe application of deep learning in medicine has rapidly expanded, demonstrating state-of-the-art results across a range of clinical domains. In radiology, CNN-based models have been developed to detect diabetic retinopathy in retinal images, classify skin lesions in dermatology, and identify lung nodules or tuberculosis in chest X-rays. These systems can assist in triage, serve as second readers, or even function as autonomous diagnostic tools in environments lacking human expertise. In addition to classification, deep learning has been used for medical image segmentation, anomaly detection, and survival prediction [\u003cspan additionalcitationids=\"CR10 CR11\" citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e–\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. The integration of AI in healthcare promises to improve diagnostic speed, accuracy, and accessibility, particularly in underserved or high-volume clinical settings. Building on these advancements, this study explores the use of three cutting-edge CNN architectures—ResNetRS, RegNet, and EfficientNetV2—for the detection of pediatric pneumonia using chest X-ray images, with the goal of improving diagnostic outcomes through automated image classification.\u003c/p\u003e\u003cp\u003eIn this study, we present a comparative analysis of three modern convolutional neural network architectures—ResNetRS, RegNet, and EfficientNetV2—for the task of pediatric pneumonia classification using chest X-ray images. We create a balanced subset of 1,000 images from a publicly available pediatric chest radiograph dataset and apply standardized preprocessing and augmentation techniques to improve model robustness. Leveraging transfer learning with pretrained ImageNet weights, each model is fine-tuned and evaluated based on its classification accuracy and sensitivity.\u003c/p\u003e\u003cdiv class=\"BlockQuote\"\u003e\u003cp\u003e\u003cb\u003eRelated Work\u003c/b\u003e\u003c/p\u003e\u003c/div\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eThe use of deep learning in medical imaging has garnered significant attention over the past decade, with numerous studies demonstrating its efficacy in diagnosing thoracic diseases, including pneumonia. One of the most influential works in this domain is by [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e], which introduced a large, publicly available pediatric chest X-ray dataset comprising over 5,000 images labeled as pneumonia (bacterial or viral) or normal. Using a transfer learning approach with the InceptionV3 architecture, their model achieved over 90% accuracy in distinguishing pneumonia from normal cases, showcasing the potential of deep learning in pediatric diagnostic imaging.\u003c/p\u003e\u003cp\u003eFollowing this foundational work, [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e] developed CheXNet, a 121-layer DenseNet trained on the NIH ChestX-ray14 dataset and achieved radiologist-level performance in detecting pneumonia in adult chest X-rays. While not specifically targeted at pediatric patients, this work laid the groundwork for the use of deeper architectures in chest pathology detection. Building upon these advances, researchers have sought to adapt and evaluate CNNs specifically for pediatric populations, recognizing the anatomical and pathological differences.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eResearchers in [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] explored the performance of VGG16 and ResNet50 architectures in classifying pediatric chest X-rays and emphasized the importance of dataset balance and augmentation in achieving high sensitivity. Similarly, [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e] applied a deep convolutional neural network for pneumonia diagnosis and highlighted the potential of automated systems in reducing diagnostic workload. More recently, [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e] used a combination of DenseNet201 and Grad-CAM visualization to not only classify pediatric pneumonia but also provide explainability by highlighting the affected lung regions, aiding clinical interpretability.\u003c/p\u003e\u003cp\u003eSeveral comparative studies have also evaluated the performance of multiple CNN models in pediatric pneumonia detection. For instance, [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e] proposed a hybrid model combining CNN feature extraction with support vector machines (SVMs) and achieved notable performance improvements over standalone models. Additionally, [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e] conducted an in-depth review of deep learning methods for pneumonia detection and noted that while transfer learning significantly enhances performance, model generalizability across populations and image acquisition settings remains a critical challenge.\u003c/p\u003e\u003cp\u003eDespite the promising results reported across these studies, most research has focused on a limited set of models or datasets, and comparative analyses using more recent architectures are relatively scarce. In response, our study explores and benchmarks three state-of-the-art CNNs—ResNetRS, RegNet, and EfficientNetV2—on pediatric chest X-ray images to assess their effectiveness in real-world diagnostic settings. By incorporating modern training procedures and robust evaluation metrics, this work contributes to the growing body of literature on AI-based pediatric pneumonia diagnosis and offers insights into the clinical applicability of advanced deep learning models.\u003c/p\u003e"},{"header":"Method","content":"\u003cp\u003e\u003cb\u003eDataset\u003c/b\u003e\u003c/p\u003e\u003cp\u003eThis study utilized a curated subset of a publicly available pediatric chest X-ray (CXR) dataset originally compiled by Kermany et al. (2018). The complete dataset contains 5,856 anterior-posterior chest radiographs of pediatric patients aged between one and five years, categorized into 3,883 pneumonia cases (including 2,538 bacteria and 1,345 viral) and 1,349 normal cases. For our experiments, we selected a representative subset of 1,000 images through stratified random sampling to preserve the original class distribution across pneumonia and normal categories. All images were visually inspected to exclude low-quality or unreadable scans. The Institutional Review Board (IRB)/Ethics Committee approvals were obtained for this dataset [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eTo ensure consistency and compatibility with pretrained convolutional neural network (CNN) models, all images were resized to 224 × 224 pixels. Preprocessing included normalization using the standard ImageNet mean and standard deviation values. To enhance generalization and reduce overfitting, we applied data augmentation techniques such as random horizontal flipping, small rotations (± 10 degrees), zooming, and brightness adjustments. The dataset was split into 80% training and 20% testing sets, with stratification to maintain class balance. Five-fold cross-validation was used on the training set to evaluate model stability and performance across different data splits.\u003c/p\u003e\u003cp\u003e\u003cb\u003eDeep Learning Models\u003c/b\u003e\u003c/p\u003e\u003cp\u003eIn this study, we employed three state-of-the-art convolutional neural network (CNN) architectures—ResNetRS, RegNet, and EfficientNetV2—to perform binary classification of pediatric chest X-ray images into pneumonia or normal categories. All models were pretrained on the ImageNet dataset to leverage rich feature representations and were subsequently fine-tuned on our pediatric X-ray dataset using transfer learning.\u003c/p\u003e\u003cp\u003e\u003cem\u003eResNetRS\u003c/em\u003e\u003c/p\u003e\u003cp\u003eResNetRS (ResNet-Ranked Scaled) [\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e] is a family of deep residual networks proposed by Google Research in 2021 as an improved variant of the traditional ResNet architecture. While standard ResNets use a fixed design, ResNetRS introduces several refinements aimed at enhancing training stability and accuracy. These include:\u003c/p\u003e\u003col style=\"list-style-type:lower-alpha;\"\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eStandardized convolutions and weight standardization, which normalize the weight distributions during training to stabilize gradients and improve convergence.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eSqueeze-and-Excitation (SE) blocks, which adaptively recalibrate channel-wise feature responses by modeling interdependencies between channels.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eStochastic depth, a regularization technique that randomly drops residual connections during training, thereby improving model robustness and preventing overfitting.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eSwish activation function and adaptive scaling of width and depth across blocks.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003c/ol\u003e\u003cp\u003eWe utilized the ResNetRS-50 variant, which has 50 layers and offers a strong balance between accuracy and computational efficiency. The final classification layer was replaced with a dense output layer containing a single neuron with sigmoid activation. Only the top convolutional block and batch normalization layers were unfrozen during fine-tuning, allowing the model to adapt high-level features to the pediatric pneumonia domain while preserving the generalized representations learned from ImageNet.\u003c/p\u003e\u003cp\u003e\u003cem\u003eRegNet\u003c/em\u003e\u003c/p\u003e\u003cp\u003eRegNet (Regularized Network Design Space) [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e], introduced by Facebook AI Research, is a family of CNNs derived from a simple design space where width, depth, and bottleneck ratios are parameterized using quantized linear functions. This structured approach leads to architectures that are computationally efficient and scalable without compromising accuracy. RegNet differs from earlier models by emphasizing design regularity over manual tuning or neural architecture search.\u003c/p\u003e\u003cp\u003eThe specific variant used in our study is RegNetY-064, which incorporates:\u003c/p\u003e\u003col style=\"list-style-type:lower-alpha;\"\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eGroup convolutions to reduce parameter redundancy while maintaining representational power.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eBottleneck residual blocks that compress and expand features for efficient information flow.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eSqueeze-and-Excitation (SE) attention blocks to model channel-wise interdependencies and improve feature calibration.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eEfficient stage-wise block scaling, where both width and depth are increased progressively throughout the network.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003c/ol\u003e\u003cp\u003eDue to its modular and systematic architecture, RegNetY is well-suited for transfer learning tasks in medical imaging. The classification head was replaced with a dropout layer followed by a sigmoid-activated neuron for binary output. Full model fine-tuning was performed with a gradual unfreezing strategy across the training epochs.\u003c/p\u003e\u003cp\u003e\u003cem\u003eEfficientNetV2\u003c/em\u003e\u003c/p\u003e\u003cp\u003eEfficientNetV2 [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e], developed by Google Brain, is the successor to EfficientNet and is designed for both accuracy and speed. It achieves faster training with higher accuracy by incorporating two key innovations:\u003c/p\u003e\u003col style=\"list-style-type:lower-alpha;\"\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eFused-MBConv layers, which replace the standard MBConv (Mobile Inverted Bottleneck Convolution) blocks in the early stages with regular convolution + batch normalization layers, resulting in faster convergence and reduced memory consumption.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eProgressive learning, a training approach that gradually increases the input image size and data augmentation strength throughout training. This helps stabilize early learning and reduce overfitting, particularly in small datasets.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003c/ol\u003e\u003cp\u003eEfficientNetV2 uses compound scaling to balance network depth, width, and resolution. We adopted the EfficientNetV2-Small (EffNetV2-S) variant due to its excellent performance on lightweight medical tasks and suitability for smaller datasets like pediatric chest X-rays. This model incorporates SE blocks and Swish activation functions, which enhance learning capacity and nonlinearity. We customized the classifier head to include batch normalization, dropout (p = 0.3), and a final sigmoid activation for binary classification.\u003c/p\u003e\u003cp\u003eUnlike earlier EfficientNet models that required extensive tuning for small medical datasets, EfficientNetV2’s improved learning dynamics make it particularly well-suited for our fine-tuning workflow. The entire model was unfrozen during training to enable comprehensive domain adaptation. These three models were selected due to their robust performance in general image classification, their proven transferability, and their ability to handle limited, class-imbalanced datasets—a common constraint in pediatric imaging tasks. Their architectural diversity also allowed us to compare CNNs with different building principles (residual, regularized, and mobile-efficient designs) under a uniform experimental setting.\u003c/p\u003e\u003cp\u003e\u003cb\u003eExperiment Setup\u003c/b\u003e\u003c/p\u003e\u003cp\u003eAll models were implemented in PyTorch and trained in an NVIDIA RTX GPU environment. The training configuration included the Adam optimizer with a learning rate of 1e-4, batch size of 32, and up to 30 epochs with early stopping based on validation loss. To further stabilize training, we employed gradient clipping with a maximum norm of 5.0 and used mixed-precision training via automatic mixed precision (AMP).\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003eWe evaluated the performance of ResNetRS, RegNet, and EfficientNetV2 models on the held-out test set derived from our curated pediatric chest X-ray dataset. Each model was trained using 5-fold cross-validation and evaluated based on classification accuracy and sensitivity (recall)\u0026mdash;two metrics of particular clinical importance in diagnostic settings.\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003cp\u003eClassification performance of the deep learning models\u003c/p\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"3\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u003cp\u003eModel\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003eAccuracy\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eSensitivity\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eResNetRS\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e\u003cp\u003e91.9\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e\u003cp\u003e89.3\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eRegNet\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e\u003cp\u003e92.4\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e\u003cp\u003e90.1\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eEfficientNetV2\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e\u003cp\u003e88.5\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e\u003cp\u003e88.1\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003eAccuracy reflects overall classification correctness, while sensitivity quantifies the model\u0026rsquo;s ability to correctly identify pneumonia cases, which is essential in avoiding missed diagnoses. The results are summarized in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e. Among the tested architectures, RegNet achieved the highest classification performance with an accuracy of 92.4% and sensitivity of 90.1%, indicating its strong ability to correctly identify pneumonia cases while maintaining overall reliability. ResNetRS followed closely, achieving an accuracy of 91.9% and sensitivity of 89.3%, which demonstrates consistent performance and generalization. EfficientNetV2 also performed well, with an accuracy of 88.5% and sensitivity of 88.1%, although slightly lower than the other two models.\u003c/p\u003e\u003cp\u003eOverall, the results indicate that modern CNN architecture pretrained on large-scale natural image datasets can be successfully adapted for pediatric medical imaging tasks. Among the tested models, RegNet emerges as the most promising candidate for real-world deployment in pneumonia screening tools, balancing predictive performance with architectural efficiency.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eThe confusion matrix presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e provides a detailed breakdown of the model\u0026rsquo;s performance on the held-out test set. Out of the total pneumonia cases, the model correctly identified 83 true positives, while 17 cases were missed, resulting in false negatives. Among the normal cases, 54 were correctly classified as true negatives, and 11 were incorrectly predicted as pneumonia, leading to false positives. These results reflect a strong balance between sensitivity and specificity, with the model demonstrating a reliable ability to detect pneumonia while maintaining a low false positive rate. The relatively low number of false negatives is particularly important in clinical contexts, as it reduces the risk of untreated pneumonia progressing to severe complications. Overall, the confusion matrix confirms the model\u0026rsquo;s robust diagnostic potential for pediatric pneumonia detection.\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eIn this study, we investigated the performance of three state-of-the-art convolutional neural network architectures\u0026mdash;ResNetRS, RegNet, and EfficientNetV2\u0026mdash;for the task of automated pediatric pneumonia classification using chest X-ray images. We curated and preprocessed a subset of 1,000 labeled images from a publicly available pediatric dataset and applied transfer learning techniques to adapt each model for binary classification. All models were trained and evaluated using a standardized pipeline involving five-fold cross-validation and an 80\u0026thinsp;\u0026minus;\u0026thinsp;20 train-test split. Our results demonstrated that all three models were capable of effectively distinguishing between pneumonia and normal chest radiographs in pediatric patients. Among them, RegNet achieved the highest accuracy (92.4%) and sensitivity (90.1%), followed closely by ResNetRS. EfficientNetV2 also showed competitive performance, although slightly lower than the other two. The confusion matrix further confirmed the models\u0026rsquo; ability to minimize false negatives and false positives, both of which are critical in clinical diagnostics.\u003c/p\u003e\u003cp\u003eOverall, this study highlighted the viability of modern CNN-based transfer learning approaches in supporting pediatric pneumonia diagnosis. The models we evaluated showed strong diagnostic performance and could serve as reliable tools in real-world settings, particularly in low-resource environments where expert radiological interpretation may be limited. Future work may focus on extending this analysis to larger and more diverse datasets, incorporating multi-class classification (e.g., distinguishing bacterial vs. viral pneumonia), and integrating model predictions into clinical decision-support systems.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eKermany DS, Goldbaum M, Cai W, Valentim CC, Liang H, Baxter SL, McKeown A, Yang G, Wu X, Yan F, Dong J (2018) Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5):1122\u0026ndash;1131\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eIrede EL, Aworinde OR, Lekan OK, Amienghemhen OD, Okonkwo TP, Onivefu AP, Ifijen IH Medical imaging: a critical review on X-ray imaging for the detection of infection. Biomedical Mater Devices 2024 Jul 15:1\u0026ndash;45\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003e\u0026Ccedil;allı E, Sogancioglu E, Van Ginneken B, van Leeuwen KG, Murphy K (2021) Deep learning for chest X-ray analysis: A survey. Med Image Anal 72:102125\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eSharifi S, Donyadadi A (2025) Detection and diagnosis of congenital heart disease from chest X-rays with deep learning models. Int J Appl Data Sci Eng Health 1(1):1\u0026ndash;9\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eKohli PS, Arora S Application of machine learning in disease prediction. In2018 4th International conference on computing communication and automation (ICCCA) 2018 Dec 14 (pp. 1\u0026ndash;4). IEEE\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eNorouzi F, Machado BL (2024) Predicting Mental Health Outcomes: A Machine Learning Approach to Depression, Anxiety, and Stress. Int J Appl Data Sci Eng Health 1(2):98\u0026ndash;104\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eUddin S, Khan A, Hossain ME, Moni MA (2019) Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inf Decis Mak 19(1):1\u0026ndash;6\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eGhasemi F, Sharifi S (2025) Heart Failure Prediction Using Support Vector Machine. Int J Novel Res Life Sci. Feb 1\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eAnantharajan S, Gunasekaran S, Subramanian T (2024) MRI brain tumor detection using deep learning and machine learning approaches. Measurement: Sens 31:101026\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eWhig P, Shadadi E, Kouser S, Alamer L (2025) Machine learning approaches for early detection and management of musculoskeletal conditions. Int J Comput Vis Rob 15(1):104\u0026ndash;117\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eWang L (2024) Mammography with deep learning for breast cancer detection. Front Oncol 14:1281922\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eAbbasi H, Afrazeh F, Ghasemi Y, Ghasemi F (2024) A shallow review of artificial intelligence applications in brain disease: stroke, Alzheimer's, and aneurysm. Int J Appl Data Sci Eng Health 1(2):32\u0026ndash;43\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eRajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren MP Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:1711.05225. 2017 Nov 14.\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eStephen O, Sain M, Maduh UJ, Jeong DU (2019) An efficient deep learning approach to pneumonia classification in healthcare. J Healthc Eng 2019(1):4180949\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eAbiyev RH, Ma\u0026rsquo;aitaH MK (2018) Deep convolutional neural networks for chest diseases detection. J Healthc Eng 2018(1):4168538\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eAli SA, Kumar MS Detecting pneumonia from X-ray images using transfer learning. In2020 IEEE International Conference on Advent Trends in Multidisciplinary Research and Innovation (ICATMRI) 2020 Dec 30 (pp. 1\u0026ndash;4). IEEE\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eToğa\u0026ccedil;ar M, Ergen B, C\u0026ouml;mert Z (2020) COVID-19 detection using deep learning models to exploit Social Mimic Optimization and structured chest X-ray images using fuzzy color and stacking approaches. Comput Biol Med 121:103805\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eChowdhury ME, Rahman T, Khandakar A, Mazhar R, Kadir MA, Mahbub ZB, Islam KR, Khan MS, Iqbal A, Al Emadi N, Reaz MB (2020) Can AI help in screening viral and COVID-19 pneumonia? Ieee Access 8:132665\u0026ndash;132676\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eKermany D, Zhang K, Goldbaum M (2018) Labeled optical coherence tomography (oct) and chest x-ray images for classification Mendeley Data, v2 https://doi.org/10.17632/rscbjbr9sj \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://nihcc.app.box.com/v/ChestXray-NIHCC\u003c/span\u003e\u003cspan address=\"https://nihcc.app.box.com/v/ChestXray-NIHCC\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eBello I, Fedus W, Du X, Cubuk ED, Srinivas A, Lin TY, Shlens J, Zoph B (2021) Revisiting resnets: Improved training and scaling strategies. Adv Neural Inf Process Syst 34:22614\u0026ndash;22627\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eRadosavovic I, Kosaraju RP, Girshick R, He K, Doll\u0026aacute;r P Designing network design spaces. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition 2020 (pp. 10428\u0026ndash;10436)\u003c/span\u003e\u003c/li\u003e\u003cli\u003e\u003cspan\u003eTan M, Le Q Efficientnetv2: Smaller models and faster training. InInternational conference on machine learning 2021 Jul 1 (pp. 10096\u0026ndash;10106). PMLR\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Zahedan University of Medical Sciences","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Pediatric Pneumonia, Chest X-ray, RegNet, ResNetRS, EfficientNetV2","lastPublishedDoi":"10.21203/rs.3.rs-7240894/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7240894/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003ePediatric pneumonia remains a leading cause of morbidity and mortality in children worldwide. Timely and accurate diagnosis is critical but often challenged by limited radiological expertise and the physiological and procedural complexity of pediatric imaging. This study investigates the performance of state-of-the-art convolutional neural network (CNN) architectures\u0026mdash;ResNetRS, RegNet, and EfficientNetV2\u0026mdash;using transfer learning for the automated classification of pediatric chest X-ray images as either pneumonia or normal.A curated subset of 1,000 chest X-ray images was extracted from a publicly available dataset originally comprising 5,856 pediatric images. All images were preprocessed and labeled for binary classification. Each model was fine-tuned using pretrained ImageNet weights and evaluated based on accuracy and sensitivity. RegNet achieved the highest classification performance with an accuracy of 92.4% and a sensitivity of 90.1%, followed by ResNetRS (accuracy: 91.9%, sensitivity: 89.3%) and EfficientNetV2 (accuracy: 88.5%, sensitivity: 88.1%).\u003c/p\u003e","manuscriptTitle":"Deep Learning Approach for the Diagnosis of Pediatric Pneumonia Using Chest X-ray Imaging","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-07-31 10:35:59","doi":"10.21203/rs.3.rs-7240894/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"9878dbf0-7e78-4cef-9476-41bf54557809","owner":[],"postedDate":"July 31st, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":52285090,"name":"Artificial Intelligence and Machine Learning"}],"tags":[],"updatedAt":"2025-07-31T10:35:59+00:00","versionOfRecord":[],"versionCreatedAt":"2025-07-31 10:35:59","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7240894","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7240894","identity":"rs-7240894","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00