Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems

doi:10.21203/rs.3.rs-4981502/v1

Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems

2024 · doi:10.21203/rs.3.rs-4981502/v1

preprint OA: closed

Full text JSON View at publisher

Full text 15,143 characters · extracted from preprint-html · click to expand

Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems Liang Shen, Xin Li, Wei Yang, Qiang Wang This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4981502/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 5 You are reading this latest preprint version Abstract In recent years, camera-based vision sensors utilizing YOLO neural network algorithms have increasingly replaced traditional sensors such as photoelectric sensors. Due to their advantages in cost and efficiency, applications in object classification and barcode detection have become more widespread in industrial and logistics sectors. This study addresses the real-time items classification and barcode detection by proposing the Transmission Line Barcode YOLO (TLB-YOLO) model. We made improvements to the YOLOv8 model by introducing several new components. We integrated the Coordinate Attention (CA) mechanism into the backbone network to improve the model's sensitivity to object locations. The Wise-IoU loss function was employed to enhance localization accuracy, while the GSConv (Grouped Shuffle Convolution) and Slim Neck architecture were incorporated to boost detection accuracy and speed. The proposed model, with 3.8 million parameters and 8.5 GFLOPs, was trained on the COCO dataset, achieving an mAP0.5 of 68.1% and an mAP0.95 of 44.2%, which represent improvements of 7.9% and 5.2% over YOLOv8n, respectively. It attained a frame rate of 153.8 FPS. When retrained on a custom dataset incorporating synthetic data from Omniverse, the model demonstrated over 90% accuracy in detecting cardboard boxes, plastic containers, and barcodes. To resolve model export issues caused by the dynamic pooling layer, an equivalent code substitution was applied, enhancing inference speed. Testing on a Jetson Nano development board, with each experiment phase repeated 50 times, showed that the TLB-YOLO model achieved 100% detection accuracy for plastic containers side scanning, cardboard box top scanning, and barcode verification. The model’s detection accuracy and inference speed fully satisfy real-time detection requirements in practical scenarios. process optimization object detection barcode recognition improved YOLOv8 model simulation modeling Full Text Additional Declarations No competing interests reported. Supplementary Files Appendix1ImprovedModuleCode.docx Appendix2TensorRTRealTimeInferenceCode.docx Appendix3Barcodeinferencerecognitioncode.docx Appendix4HikvisionIndustrialCameraControlandImageProcessingcode.docx Cite Share Download PDF Status: Under Review Version 1 posted Editorial decision: Revision requested 01 Sep, 2024 Reviewers invited by journal 01 Sep, 2024 Editor assigned by journal 28 Aug, 2024 Submission checks completed at journal 28 Aug, 2024 First submitted to journal 27 Aug, 2024 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4981502","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":347983751,"identity":"fba18ee3-8600-4675-83bf-c8b621fa1414","order_by":0,"name":"Liang Shen","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA7ElEQVRIiWNgGAWjYDACCSB+YAAk2BugIgeI0ZIA0sIDUppAtBYwI4FILfKzm489SCiwy5OPfP5M8ucPBjm+GwmMnwvwaDG4cyzdIMEgudjwdo6ZNE8Cg7HkjQRm6Rn4tEjkmEkkGDAnbpydwyYNdFjihhsJbMw8+Bw2I/8bUEt94saZx59J/khgqCeoheFGDhtQy+HE+RIMZhJAhyUYENJicCMN5LDjiRt4coytedIkDGeeedgsjd9hyc8kPvypTpzffvzhzR82NvJ8x5MPfsbrMLh1B8AUKJoYG4jRALSOSHWjYBSMglEwAgEAj1VKLF87rmAAAAAASUVORK5CYII=","orcid":"","institution":"Shaanxi University of Science and Technology","correspondingAuthor":true,"prefix":"","firstName":"Liang","middleName":"","lastName":"Shen","suffix":""},{"id":347983752,"identity":"a876e91d-1702-41dd-a7b6-6f6c306df010","order_by":1,"name":"Xin Li","email":"","orcid":"","institution":"Shaanxi University of Science and Technology","correspondingAuthor":false,"prefix":"","firstName":"Xin","middleName":"","lastName":"Li","suffix":""},{"id":347983753,"identity":"ecb19773-55a3-4b1a-9e9d-566b377fc6fd","order_by":2,"name":"Wei Yang","email":"","orcid":"","institution":"Shaanxi University of Science and Technology","correspondingAuthor":false,"prefix":"","firstName":"Wei","middleName":"","lastName":"Yang","suffix":""},{"id":347983754,"identity":"00a3f74a-2adf-4cd4-8249-18537670f36d","order_by":3,"name":"Qiang Wang","email":"","orcid":"","institution":"Shaanxi University of Science and Technology","correspondingAuthor":false,"prefix":"","firstName":"Qiang","middleName":"","lastName":"Wang","suffix":""}],"badges":[],"createdAt":"2024-08-27 04:56:10","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4981502/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4981502/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":65401640,"identity":"a8152aaa-0c7c-4ccb-80fc-75700ff1623d","added_by":"auto","created_at":"2024-09-27 03:31:44","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1282424,"visible":true,"origin":"","legend":"","description":"","filename":"Tlbyoloarapidandefcientrealtimealgorithmforboxtypeclassificationandbarcoderecognitiononthemovingconveyingandsortingsystems.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4981502/v1_covered_5fba4eb1-9225-42d5-9274-12e3b0abe486.pdf"},{"id":65401177,"identity":"a24273f6-3b42-40cc-b6bd-120a277185c0","added_by":"auto","created_at":"2024-09-27 03:23:43","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":82490,"visible":true,"origin":"","legend":"","description":"","filename":"Appendix1ImprovedModuleCode.docx","url":"https://assets-eu.researchsquare.com/files/rs-4981502/v1/bcf7cdcf1045ec55882a42b9.docx"},{"id":65400719,"identity":"b8b067ef-c1eb-467a-8359-367e354b2847","added_by":"auto","created_at":"2024-09-27 03:15:42","extension":"docx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":81410,"visible":true,"origin":"","legend":"","description":"","filename":"Appendix2TensorRTRealTimeInferenceCode.docx","url":"https://assets-eu.researchsquare.com/files/rs-4981502/v1/5e38497df680d63dc743862c.docx"},{"id":65400721,"identity":"49778d92-9abe-4c8f-8923-1b2ff2306a78","added_by":"auto","created_at":"2024-09-27 03:15:43","extension":"docx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":81668,"visible":true,"origin":"","legend":"","description":"","filename":"Appendix3Barcodeinferencerecognitioncode.docx","url":"https://assets-eu.researchsquare.com/files/rs-4981502/v1/58b50916d1aeaacf715645c6.docx"},{"id":65401172,"identity":"e31fe0a4-390a-4775-8dc3-4f82c9c37f36","added_by":"auto","created_at":"2024-09-27 03:23:42","extension":"docx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":85533,"visible":true,"origin":"","legend":"","description":"","filename":"Appendix4HikvisionIndustrialCameraControlandImageProcessingcode.docx","url":"https://assets-eu.researchsquare.com/files/rs-4981502/v1/c901914fdccfa8222ba7dbbd.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"signal-image-and-video-processing","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"sivp","sideBox":"Learn more about [Signal, Image and Video Processing](http://link.springer.com/journal/11760)","snPcode":"11760","submissionUrl":"https://submission.nature.com/new-submission/11760/3","title":"Signal, Image and Video Processing","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"process optimization, object detection, barcode recognition, improved YOLOv8 model, simulation modeling","lastPublishedDoi":"10.21203/rs.3.rs-4981502/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4981502/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eIn recent years, camera-based vision sensors utilizing YOLO neural network algorithms have increasingly replaced traditional sensors such as photoelectric sensors. Due to their advantages in cost and efficiency, applications in object classification and barcode detection have become more widespread in industrial and logistics sectors. This study addresses the real-time items classification and barcode detection by proposing the Transmission Line Barcode YOLO (TLB-YOLO) model. We made improvements to the YOLOv8 model by introducing several new components. We integrated the Coordinate Attention (CA) mechanism into the backbone network to improve the model's sensitivity to object locations. The Wise-IoU loss function was employed to enhance localization accuracy, while the GSConv (Grouped Shuffle Convolution) and Slim Neck architecture were incorporated to boost detection accuracy and speed. The proposed model, with 3.8 million parameters and 8.5 GFLOPs, was trained on the COCO dataset, achieving an mAP0.5 of 68.1% and an mAP0.95 of 44.2%, which represent improvements of 7.9% and 5.2% over YOLOv8n, respectively. It attained a frame rate of 153.8 FPS. When retrained on a custom dataset incorporating synthetic data from Omniverse, the model demonstrated over 90% accuracy in detecting cardboard boxes, plastic containers, and barcodes. To resolve model export issues caused by the dynamic pooling layer, an equivalent code substitution was applied, enhancing inference speed. Testing on a Jetson Nano development board, with each experiment phase repeated 50 times, showed that the TLB-YOLO model achieved 100% detection accuracy for plastic containers side scanning, cardboard box top scanning, and barcode verification. The model’s detection accuracy and inference speed fully satisfy real-time detection requirements in practical scenarios.\u003c/p\u003e","manuscriptTitle":"Tlb‑yolo: a rapid and efcient real‑time algorithm for box-type classification and barcode recognition on the moving conveying and sorting systems","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-09-27 03:15:36","doi":"10.21203/rs.3.rs-4981502/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2024-09-01T15:25:02+00:00","index":"","fulltext":""},{"type":"reviewersInvited","content":"","date":"2024-09-01T15:24:10+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-08-28T10:57:42+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2024-08-28T10:54:29+00:00","index":"","fulltext":""},{"type":"submitted","content":"Signal, Image and Video Processing","date":"2024-08-27T04:54:48+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"signal-image-and-video-processing","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"sivp","sideBox":"Learn more about [Signal, Image and Video Processing](http://link.springer.com/journal/11760)","snPcode":"11760","submissionUrl":"https://submission.nature.com/new-submission/11760/3","title":"Signal, Image and Video Processing","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"f4f40da7-c8d9-42a5-a1c5-104bc19f640c","owner":[],"postedDate":"September 27th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2024-09-27T03:15:37+00:00","versionOfRecord":[],"versionCreatedAt":"2024-09-27 03:15:36","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-4981502","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4981502","identity":"rs-4981502","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00