Early Prediction of the Failure Probability Distribution for Energy Storage Technologies Driven by Domain-Knowledge-Informed Machine Learning | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Early Prediction of the Failure Probability Distribution for Energy Storage Technologies Driven by Domain-Knowledge-Informed Machine Learning Maher Alghalayini, Stephen J. Harris, Marcus Noack This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-3871499/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract There is a growing focus on sustainable energy sources and storage systems. The challenge with such emerging systems is their need to be warrantied for around 15 years with just a year of early testing. This requires accurate data extrapolation and estimation of the failure distribution. Physics-based approaches can be overwhelmed by the complexity of degradation, and pure data-driven approaches are inherently unable to extrapolate beyond the testing data. Here, we propose a framework for a hybrid approach for technology-agnostic customization of a Gaussian process for stochastic and domain-knowledge-informed failure distribution prediction. We equip the Gaussian process with customized non-stationary kernels, heteroscedastic noise models, and prior-mean functions to enable it to accurately extrapolate with high accuracy. Furthermore, we minimize testing time with a novel experiment-stopping criterion, which can significantly reduce the required data. Our framework could revolutionize energy-storage testing, enabling the rapid development of new technologies. Physical sciences/Energy science and technology/Energy storage/Batteries Physical sciences/Mathematics and computing/Statistics Physical sciences/Mathematics and computing/Applied mathematics Machine Learning Energy Storage Early Lifetime Prediction Failure Probability Distribution Gaussian Processes Full Text Additional Declarations There is NO Competing Interest. Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-3871499","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":267592169,"identity":"fa9b1ffc-6fe1-4bf2-85d4-e92a79a8f429","order_by":0,"name":"Maher Alghalayini","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA7klEQVRIiWNgGAWjYFAC5gZmBgYLIM3AcOBDwQEZIrQwgrRIgLUcnGFwgIcELSALeYjRott+sPl1QY2EvMFx3oOHbQzu8DDwLz4mgU+L2ZnENusZxyQMNxzmSzicY/CMh0HiWRp+LQcS24x52CQYNxzmMQBqOQzUcsbYAK+W8w+BWv5J2IO1WBCl5UZi82PeNolEsBYGkBb+HsMH+LU8bGPm7ZNIngnUcrAH6Bc2CbZE/FrOJx/+zPPNxrbv/BnjDz8q7sjx8x8+cACfFiBgA4ePwgE4N4GABmAEfgCR8g0wPj8hO0bBKBgFo2CkAQCwOk4Y06LpMAAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0000-0002-4071-0272","institution":"Lawrence Berkeley National Laboratory","correspondingAuthor":true,"prefix":"","firstName":"Maher","middleName":"","lastName":"Alghalayini","suffix":""},{"id":267592170,"identity":"1452f85b-50f1-4288-b9db-7436a73f82fb","order_by":1,"name":"Stephen J. Harris","email":"","orcid":"","institution":"Lawrence Berkeley, National Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Stephen","middleName":"J.","lastName":"Harris","suffix":""},{"id":267592171,"identity":"474276cb-d405-4a0e-aa08-63b03b6c4338","order_by":2,"name":"Marcus Noack","email":"","orcid":"","institution":"Lawrence Berkeley National Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Marcus","middleName":"","lastName":"Noack","suffix":""}],"badges":[],"createdAt":"2024-01-17 01:55:10","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-3871499/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-3871499/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":50551580,"identity":"d091c798-40cf-449e-bfab-181d5d1174a6","added_by":"auto","created_at":"2024-02-02 11:15:41","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":11685641,"visible":true,"origin":"","legend":"","description":"","filename":"EnergyStorageLifetimeEarlyPredictionusingDomainKnowledgeInformedMachineLearning.pdf","url":"https://assets-eu.researchsquare.com/files/rs-3871499/v1_covered_58db8b6f-f872-4bcc-a59b-a7629cf7b441.pdf"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"Early Prediction of the Failure Probability Distribution for Energy Storage Technologies Driven by Domain-Knowledge-Informed Machine Learning","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Machine Learning, Energy Storage, Early Lifetime Prediction, Failure Probability Distribution, Gaussian Processes","lastPublishedDoi":"10.21203/rs.3.rs-3871499/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-3871499/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"There is a growing focus on sustainable energy sources and storage systems. The challenge with such emerging systems is their need to be warrantied for around 15 years with just a year of early testing. This requires accurate data extrapolation and estimation of the failure distribution. Physics-based approaches can be overwhelmed by the complexity of degradation, and pure data-driven approaches are inherently unable to extrapolate beyond the testing data. Here, we propose a framework for a hybrid approach for technology-agnostic customization of a Gaussian process for stochastic and domain-knowledge-informed failure distribution prediction. We equip the Gaussian process with customized non-stationary kernels, heteroscedastic noise models, and prior-mean functions to enable it to accurately extrapolate with high accuracy. Furthermore, we minimize testing time with a novel experiment-stopping criterion, which can significantly reduce the required data. Our framework could revolutionize energy-storage testing, enabling the rapid development of new technologies.","manuscriptTitle":"Early Prediction of the Failure Probability Distribution for Energy Storage Technologies Driven by Domain-Knowledge-Informed Machine Learning","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-01-23 18:00:00","doi":"10.21203/rs.3.rs-3871499/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"nature-communications","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"NCOMMS","sideBox":"Learn more about [Nature Communications](http://www.nature.com/ncomms/)","snPcode":"","submissionUrl":"https://mts-ncomms.nature.com/","title":"Nature Communications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Communications","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"b3272c46-4d7f-4913-9d9c-39128998fa15","owner":[],"postedDate":"January 23rd, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":28192995,"name":"Physical sciences/Energy science and technology/Energy storage/Batteries"},{"id":28192996,"name":"Physical sciences/Mathematics and computing/Statistics"},{"id":28192997,"name":"Physical sciences/Mathematics and computing/Applied mathematics"}],"tags":[],"updatedAt":"2024-03-05T14:30:15+00:00","versionOfRecord":[],"versionCreatedAt":"2024-01-23 18:00:00","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-3871499","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-3871499","identity":"rs-3871499","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.