Discovering Hidden Reservoir Physics Using Explainable Machine Learning for Permeability Prediction in Carbonate Reservoirs With Noisy Legacy Datasets

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 15,076 characters · extracted from preprint-html · click to expand
Discovering Hidden Reservoir Physics Using Explainable Machine Learning for Permeability Prediction in Carbonate Reservoirs With Noisy Legacy Datasets | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Discovering Hidden Reservoir Physics Using Explainable Machine Learning for Permeability Prediction in Carbonate Reservoirs With Noisy Legacy Datasets Abdulwahab A. Abdulwahab, Watheq J. Al-Mudhafar, Mohammed A. Abbas This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9097850/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Accurate permeability prediction is essential for reservoir characterization and production forecasting; however, traditional physics-based models often fail to capture the complex pore systems typical of carbonate reservoirs. This study aims to demonstrate how explainable machine learning can uncover hidden reservoir physics while improving permeability prediction using noisy legacy datasets. The work focuses on integrating physics-based modeling with explainable AI to reinterpret legacy well logs and extract previously unresolved petrophysical relationships. A physics-guided machine learning framework was developed using legacy well data from a carbonate reservoir in southern Iraq. The original dataset contained more than 2,400 samples from eight wells and exhibited significant contamination from incorrectly imputed or interpolated permeability values. A rigorous cleaning workflow combining statistical filtering and zonation-based quality control reduced the dataset to 1,132 physically consistent samples. A baseline permeability model derived from well log–estimated porosity and porosity–permeability transform served as the physical reference. Machine learning discrepancy models were trained using five physics-informed features—neutron porosity (NPHI), gamma ray (GR), bulk density (RHOB), sonic travel time (DTP), and depth. Six algorithms (Random Forest, Linear Regression, Polynomial Regression, Support Vector Machines, XGBoost, and CatBoost) were evaluated using five-fold cross-validation. Tree-based gradient boosting algorithms demonstrated the strongest predictive performance, with XGBoost producing the most accurate discrepancy corrections. When integrated with the baseline physics model through a multiplicative boosting framework, the hybrid model significantly improved predictive accuracy, reducing RMSE by approximately 80% and increasing adjusted R² from a negative baseline to greater than 0.95. Beyond predictive improvement, explainable AI analysis using SHAP values and Partial Dependence Plots provided insights into previously unresolved reservoir physics. The algorithm identified dual-porosity behavior by applying strong positive permeability corrections in tight, low-porosity intervals associated with fracture networks while penalizing high-porosity intervals dominated by ineffective microporosity. Interaction analysis revealed strong coupling between neutron porosity, bulk density, and sonic travel time, enabling the model to implicitly synthesize a Secondary Porosity Index that captures deviations between acoustic porosity and effective flow capacity. Depth-dependent analysis further revealed a distinct permeability enhancement zone below approximately 4,000 m, indicating a likely diagenetically enhanced flow unit. In addition, the minimal influence of gamma ray logs confirmed a clean carbonate system where permeability is primarily controlled by secondary porosity rather than shale content. This study demonstrates that explainable machine learning can move beyond predictive modeling to reveal hidden reservoir physics in complex carbonate systems. By combining physics-guided discrepancy modeling with explainable AI, the workflow converts machine learning models into transparent diagnostic tools capable of identifying fracture-dominated flow regimes, ineffective microporosity, and stratigraphically controlled flow units. The approach provides a practical framework for reinterpreting legacy well logs, improving reservoir characterization, and identifying bypassed pay in datasets traditionally considered too noisy for reliable analysis. Explainable machine learning Physics-Guided discrepancy modeling permeability prediction Carbonate Reservoir Characterization and Legacy Well Data Analysis Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9097850","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":611183796,"identity":"2fa52481-1ade-4a04-b068-01c8dbe0c1e0","order_by":0,"name":"Abdulwahab A. Abdulwahab","email":"","orcid":"","institution":"Basrah University for Oil and Gas","correspondingAuthor":false,"prefix":"","firstName":"Abdulwahab","middleName":"A.","lastName":"Abdulwahab","suffix":""},{"id":611183797,"identity":"6b615ef1-4ce7-44b8-864f-62012df04282","order_by":1,"name":"Watheq J. Al-Mudhafar","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABKUlEQVRIie3QsUrDQBjA8S8ULstFN7lwYF7hjkIq1Hqv0hDIlEEphIKCkcBNxa5X8CHs4lwIpEto10CWQsGpQ11EoYohQqdTcRPMH+6W737ccQBNTX8zVO8H1aIAxwBmvB8xvWihPaxIGwDPjPiXhPS/J2y+zFbbCBx0lPPy/Io5ncmms3qSZwLM5IHoSO6bXC2ASxq2uypj/K4M+Y3KfS/GWaQj7sxH1JJgVMSlGO0MRUOe4GGrDyR0tWS5RvRNgvgk70woO+fJjl0LcDZ6UlS3GBK8mliSeYpgnsAwNWKCtUQUa9ceLYgvaTDoWrfMVzi4mIzyuSdxMDjREHvsPZKX6LQ3pv60xM+sp8z0fvsqL8WhmU4L3S/XId0D0JfHf542NTU1/fc+AOGWWVVEWpXRAAAAAElFTkSuQmCC","orcid":"","institution":"Basrah Oil Company","correspondingAuthor":true,"prefix":"","firstName":"Watheq","middleName":"J.","lastName":"Al-Mudhafar","suffix":""},{"id":611183798,"identity":"c634603d-860f-4636-9eef-99846257c47b","order_by":2,"name":"Mohammed A. Abbas","email":"","orcid":"","institution":"Basrah Oil Company","correspondingAuthor":false,"prefix":"","firstName":"Mohammed","middleName":"A.","lastName":"Abbas","suffix":""}],"badges":[],"createdAt":"2026-03-11 20:08:31","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9097850/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9097850/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[],"financialInterests":"No competing interests reported.","formattedTitle":"Discovering Hidden Reservoir Physics Using Explainable Machine Learning for Permeability Prediction in Carbonate Reservoirs With Noisy Legacy Datasets","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Explainable machine learning, Physics-Guided discrepancy modeling, permeability prediction, Carbonate Reservoir Characterization, and Legacy Well Data Analysis","lastPublishedDoi":"10.21203/rs.3.rs-9097850/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9097850/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eAccurate permeability prediction is essential for reservoir characterization and production forecasting; however, traditional physics-based models often fail to capture the complex pore systems typical of carbonate reservoirs. This study aims to demonstrate how explainable machine learning can uncover hidden reservoir physics while improving permeability prediction using noisy legacy datasets. The work focuses on integrating physics-based modeling with explainable AI to reinterpret legacy well logs and extract previously unresolved petrophysical relationships.\u003c/p\u003e \u003cp\u003eA physics-guided machine learning framework was developed using legacy well data from a carbonate reservoir in southern Iraq. The original dataset contained more than 2,400 samples from eight wells and exhibited significant contamination from incorrectly imputed or interpolated permeability values. A rigorous cleaning workflow combining statistical filtering and zonation-based quality control reduced the dataset to 1,132 physically consistent samples. A baseline permeability model derived from well log\u0026ndash;estimated porosity and porosity\u0026ndash;permeability transform served as the physical reference. Machine learning discrepancy models were trained using five physics-informed features\u0026mdash;neutron porosity (NPHI), gamma ray (GR), bulk density (RHOB), sonic travel time (DTP), and depth. Six algorithms (Random Forest, Linear Regression, Polynomial Regression, Support Vector Machines, XGBoost, and CatBoost) were evaluated using five-fold cross-validation.\u003c/p\u003e \u003cp\u003eTree-based gradient boosting algorithms demonstrated the strongest predictive performance, with XGBoost producing the most accurate discrepancy corrections. When integrated with the baseline physics model through a multiplicative boosting framework, the hybrid model significantly improved predictive accuracy, reducing RMSE by approximately 80% and increasing adjusted R\u0026sup2; from a negative baseline to greater than 0.95. Beyond predictive improvement, explainable AI analysis using SHAP values and Partial Dependence Plots provided insights into previously unresolved reservoir physics. The algorithm identified dual-porosity behavior by applying strong positive permeability corrections in tight, low-porosity intervals associated with fracture networks while penalizing high-porosity intervals dominated by ineffective microporosity. Interaction analysis revealed strong coupling between neutron porosity, bulk density, and sonic travel time, enabling the model to implicitly synthesize a Secondary Porosity Index that captures deviations between acoustic porosity and effective flow capacity. Depth-dependent analysis further revealed a distinct permeability enhancement zone below approximately 4,000 m, indicating a likely diagenetically enhanced flow unit. In addition, the minimal influence of gamma ray logs confirmed a clean carbonate system where permeability is primarily controlled by secondary porosity rather than shale content.\u003c/p\u003e \u003cp\u003eThis study demonstrates that explainable machine learning can move beyond predictive modeling to reveal hidden reservoir physics in complex carbonate systems. By combining physics-guided discrepancy modeling with explainable AI, the workflow converts machine learning models into transparent diagnostic tools capable of identifying fracture-dominated flow regimes, ineffective microporosity, and stratigraphically controlled flow units. The approach provides a practical framework for reinterpreting legacy well logs, improving reservoir characterization, and identifying bypassed pay in datasets traditionally considered too noisy for reliable analysis.\u003c/p\u003e","manuscriptTitle":"Discovering Hidden Reservoir Physics Using Explainable Machine Learning for Permeability Prediction in Carbonate Reservoirs With Noisy Legacy Datasets","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-26 19:06:26","doi":"10.21203/rs.3.rs-9097850/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"204ca6c4-80c4-4e6e-bf5d-ad8bc522c07e","owner":[],"postedDate":"March 26th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-05-09T06:53:49+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-26 19:06:26","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9097850","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9097850","identity":"rs-9097850","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0