Research on edge defect prediction of hot-rolled strip steel based on LGBM-LR | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Research on edge defect prediction of hot-rolled strip steel based on LGBM-LR Yunfei Liu, Jingbin Shao, Jianxin Tang, Weiliang Liu, Jianliang Sun, and 1 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6653536/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Surface quality control of hot-rolled strip remains challenging due to complex defect formation mechanisms and imbalanced datasets from industrial processes. To address this, we propose a predictive framework integrating Light Gradient Boosting Machine (LGBM) and Logistic Regression (LR) to identify defect-related features and enhance defect prediction. The model focuses on edge defects, leveraging historical manufacturing data for early-stage quality guidance. High-dimensional process data are analyzed using LGBM to rank defect-critical features, followed by LR-based interpretation to evaluate high/low-score variables. To address severe sample imbalance caused by scarce defective samples compared to normal samples, the framework integrates cross-stratified undersampling, BorderlineSMOTE oversampling, and adaptive LR threshold optimization. Comparative experiments are conducted using LGBM and hybrid LGBM-LR models with varied sampling techniques and decision thresholds. Results demonstrate that both models achieve 87% accuracy in defect anomaly prediction. The proposed BorderlineSMOTE and adaptive thresholding significantly improve recall rates from 20–60%, effectively addressing class imbalance. Furthermore, 13 of the top 20 features identified in the edge defect importance ranking align with empirical domain knowledge, validating the model's interpretability. This feature-ranking outcome offers actionable insights for prioritizing process parameters in surface quality control. The study highlights the efficacy of combining ensemble learning, resampling techniques, and threshold adjustment to enhance defect prediction in imbalanced industrial datasets, providing a data-driven reference for optimizing hot-rolled strip production. Hot-rolled strip Edge defect Anomaly prediction Light Gradient Boosting Machine Logistic Regressio Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6653536","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":461046458,"identity":"f39f555f-e58d-4c09-ab43-fdc43c195631","order_by":0,"name":"Yunfei Liu","email":"","orcid":"","institution":"National Key Laboratory of Metal Forming Technology and Heavy Equipment","correspondingAuthor":false,"prefix":"","firstName":"Yunfei","middleName":"","lastName":"Liu","suffix":""},{"id":461046459,"identity":"494c4586-1bb0-4775-ad33-580880e70b18","order_by":1,"name":"Jingbin Shao","email":"","orcid":"","institution":"National Engineering Research Center for Equipment and Technology of Cold Rolled Strip, Yanshan University","correspondingAuthor":false,"prefix":"","firstName":"Jingbin","middleName":"","lastName":"Shao","suffix":""},{"id":461046460,"identity":"74d0d2c7-ebe0-4eec-875e-1b46af732ce2","order_by":2,"name":"Jianxin Tang","email":"","orcid":"","institution":"National Engineering Research Center for Equipment and Technology of Cold Rolled Strip, Yanshan University","correspondingAuthor":false,"prefix":"","firstName":"Jianxin","middleName":"","lastName":"Tang","suffix":""},{"id":461046461,"identity":"87308c88-e242-4062-ae3d-6c332067f4ec","order_by":3,"name":"Weiliang Liu","email":"","orcid":"","institution":"National Engineering Research Center for Equipment and Technology of Cold Rolled Strip, Yanshan University","correspondingAuthor":false,"prefix":"","firstName":"Weiliang","middleName":"","lastName":"Liu","suffix":""},{"id":461046464,"identity":"d3343e83-e269-4a6c-90be-c78876be0d7f","order_by":4,"name":"Jianliang Sun","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABAElEQVRIiWNgGAWjYJACgwQGBhk2EOsDVIAoLTwgLYwziNUCAjwggpmHGC3yM3IPFDyoucPDx3728GvbtrrEBvbmbRIMNXdwO+pGXoJBwrFnPGw8eWnWuW2HExt4jpVJMBx7hluLRI6BQQLbYaBfcsyMc9sOJDZI5JhJMDYcxuMwkJZ/QC38b8yMLUEOk3+DXwvDDaCWxDagFokc48eMbcxAW3jwazE48waopQ+k5Y0ZY8+5w8ZtPGnFFgnH8DisPcfM8Me3w3Ly/TnGH36U1cn2sx/eeONDDR6HMTCwwaKBTYIRGKHgZJCATwMwAh/AGB8Y/uBXOgpGwSgYBSMTAACyslB4a3pVkwAAAABJRU5ErkJggg==","orcid":"","institution":"National Engineering Research Center for Equipment and Technology of Cold Rolled Strip, Yanshan University","correspondingAuthor":true,"prefix":"","firstName":"Jianliang","middleName":"","lastName":"Sun","suffix":""},{"id":461046466,"identity":"2b12a021-6b36-44c8-b396-cb46fabc7a3b","order_by":5,"name":"Yanghu Hu","email":"","orcid":"","institution":"National Key Laboratory of Metal Forming Technology and Heavy Equipment","correspondingAuthor":false,"prefix":"","firstName":"Yanghu","middleName":"","lastName":"Hu","suffix":""}],"badges":[],"createdAt":"2025-05-13 08:53:29","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6653536/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6653536/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":105034822,"identity":"87a77fe1-8949-462c-a54e-027159146193","added_by":"auto","created_at":"2026-03-20 07:24:22","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":462071,"visible":true,"origin":"","legend":"","description":"","filename":"ManuscriptfileResearchonedgedefectpredictionofhotrolledstripsteelbasedonLGBMLR.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6653536/v1_covered_81732199-2c50-471d-971b-0823ce33ebac.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Research on edge defect prediction of hot-rolled strip steel based on LGBM-LR","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Hot-rolled strip, Edge defect, Anomaly prediction, Light Gradient Boosting Machine, Logistic Regressio","lastPublishedDoi":"10.21203/rs.3.rs-6653536/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6653536/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eSurface quality control of hot-rolled strip remains challenging due to complex defect formation mechanisms and imbalanced datasets from industrial processes. To address this, we propose a predictive framework integrating Light Gradient Boosting Machine (LGBM) and Logistic Regression (LR) to identify defect-related features and enhance defect prediction. The model focuses on edge defects, leveraging historical manufacturing data for early-stage quality guidance. High-dimensional process data are analyzed using LGBM to rank defect-critical features, followed by LR-based interpretation to evaluate high/low-score variables. To address severe sample imbalance caused by scarce defective samples compared to normal samples, the framework integrates cross-stratified undersampling, BorderlineSMOTE oversampling, and adaptive LR threshold optimization. Comparative experiments are conducted using LGBM and hybrid LGBM-LR models with varied sampling techniques and decision thresholds. Results demonstrate that both models achieve 87% accuracy in defect anomaly prediction. The proposed BorderlineSMOTE and adaptive thresholding significantly improve recall rates from 20\u0026ndash;60%, effectively addressing class imbalance. Furthermore, 13 of the top 20 features identified in the edge defect importance ranking align with empirical domain knowledge, validating the model's interpretability. This feature-ranking outcome offers actionable insights for prioritizing process parameters in surface quality control. The study highlights the efficacy of combining ensemble learning, resampling techniques, and threshold adjustment to enhance defect prediction in imbalanced industrial datasets, providing a data-driven reference for optimizing hot-rolled strip production.\u003c/p\u003e","manuscriptTitle":"Research on edge defect prediction of hot-rolled strip steel based on LGBM-LR","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-05-26 08:17:13","doi":"10.21203/rs.3.rs-6653536/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"0dd9eb5e-d14d-426c-9d11-84f0f0b4e7ef","owner":[],"postedDate":"May 26th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-03-19T05:55:41+00:00","versionOfRecord":[],"versionCreatedAt":"2025-05-26 08:17:13","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-6653536","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6653536","identity":"rs-6653536","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.