Smart Driver Assistance: Real-Time Drowsiness Detection Leveraging Facial Cues with MediaPipe and OpenCV

preprint OA: closed
Full text JSON View at publisher
Full text 11,857 characters · extracted from preprint-html · click to expand
Smart Driver Assistance: Real-Time Drowsiness Detection Leveraging Facial Cues with MediaPipe and OpenCV | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Smart Driver Assistance: Real-Time Drowsiness Detection Leveraging Facial Cues with MediaPipe and OpenCV Ariveeti Karthikeya Reddy, Ibrahim Ahmed Khan, Neethu C T, Lidiya Lilly Thampi, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4642662/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract The primary aim of this research is to identify driver drowsiness to prevent car accidents and improve road safety. This study explores and proposes potential solutions to mitigate drowsiness-related accidents and enhance overall road security by monitoring the driver's eye, mouth, and head movements, which are key indicators of drowsiness. An algorithm has been developed to track these movements. The study involves analyzing CNN models and computer vision models for eye detection, yawn detection, and head movement, with the objective of identifying the most effective approach among the models. For building CNN models, we use the MRL Eye Dataset and YawDD datasets for eye and mouth tracking. The performance of the models developed using these datasets will be assessed based on various metrics such as accuracy and loss. We achieved an accuracy of 84.53% using the CNN approach for the MRL Eye Dataset, while for the YawDD Dataset, we obtained an accuracy of 96.42%. Despite these impressive results, CNN techniques encounter several challenges, including poor performance in low light conditions, varying outcomes for different ethnicities, and unreliability in dynamic environments. To address these challenges, we utilize Dlib and MediaPipe for tracking facial landmarks. By leveraging these techniques, we can simultaneously monitor head movement, eye movements, and detect yawns. Between Dlib and MediaPipe, we prefer MediaPipe due to its capability to detect the 3D coordinates of key points. Drowsiness CNN CV approach Dlib and Media pipe Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4642662","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":323002351,"identity":"16c86198-ae46-418f-9b25-f681569f348e","order_by":0,"name":"Ariveeti Karthikeya Reddy","email":"","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":false,"prefix":"","firstName":"Ariveeti","middleName":"Karthikeya","lastName":"Reddy","suffix":""},{"id":323002357,"identity":"963e2ea7-2b78-401e-bf32-d6bf19ee522e","order_by":1,"name":"Ibrahim Ahmed Khan","email":"","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":false,"prefix":"","firstName":"Ibrahim","middleName":"Ahmed","lastName":"Khan","suffix":""},{"id":323002360,"identity":"4efa73dc-d038-4ae0-a5ff-5a345b6c67fc","order_by":2,"name":"Neethu C T","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABa0lEQVRIie3QwUrDMBjA8a8E2ktGrikb+gotg6kw5oN4aSl0lxXmpQjC7CjES2VXx17EY6WwXkbPvZkx8DSQIoiDiaadm5sOz4L9UwJt8ktCAcrK/mioGBXPQCA3sYzQPTc0CnRnkbH9IvlioIBDLohdIwqzNEEoBSR5vxJq5iRqqsGkkR/wjXx1pPjx7Jy1egCWPeu6CGup0bjg3eOeej2I+YK1zkCJOEzv1uQkGJv+kFli33G7fpvIObHT/GJVHEn9G2Y5HrY1MCdroqUd3a+wkIIU2NUKwzkZF+SAWpJXYcjxoANgsg15eMpWBJEXQaggJuuuSf+NXTkeme+QFEsrIkN+iobVIJKhuJggYipyPLp7ysTWRzixVIahXceJgcVPRlQQdRhE+qiWxA6jj1q4ReKIP2O3Rcght2fYfT+VEcmy12WP0LjPs7l76QyINZ0uNmSTXDw/kj6/h3vmYD+B5f61ZWVlZf+pD+YqglwTnqd/AAAAAElFTkSuQmCC","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":true,"prefix":"","firstName":"Neethu","middleName":"C","lastName":"T","suffix":""},{"id":323002365,"identity":"aa0659e5-3273-47e4-ad2f-23db6fa661da","order_by":3,"name":"Lidiya Lilly Thampi","email":"","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":false,"prefix":"","firstName":"Lidiya","middleName":"Lilly","lastName":"Thampi","suffix":""},{"id":323002367,"identity":"b571fb42-944c-44d3-99e8-ee2871ef1ca1","order_by":4,"name":"Aswathy M A","email":"","orcid":"","institution":"MIT","correspondingAuthor":false,"prefix":"","firstName":"Aswathy","middleName":"M","lastName":"A","suffix":""},{"id":323002372,"identity":"1ba206d1-ec12-4de6-b35d-cb7435ee5468","order_by":5,"name":"Abhishek Kumar","email":"","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":false,"prefix":"","firstName":"Abhishek","middleName":"","lastName":"Kumar","suffix":""},{"id":323002375,"identity":"30197e45-b433-4f9f-ae20-e7ed3806a898","order_by":6,"name":"Sunny Kumar","email":"","orcid":"","institution":"IIIT Kottayam","correspondingAuthor":false,"prefix":"","firstName":"Sunny","middleName":"","lastName":"Kumar","suffix":""}],"badges":[],"createdAt":"2024-06-26 12:14:21","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4642662/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4642662/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":60945984,"identity":"9669a529-6141-4e0d-a4af-905673d9883a","added_by":"auto","created_at":"2024-07-23 22:48:35","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":794783,"visible":true,"origin":"","legend":"","description":"","filename":"ManuscriptDDD.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4642662/v1_covered_e46bb385-d6d1-438d-bd77-6566362d2794.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Smart Driver Assistance: Real-Time Drowsiness Detection Leveraging Facial Cues with MediaPipe and OpenCV","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Drowsiness, CNN, CV approach, Dlib and Media pipe","lastPublishedDoi":"10.21203/rs.3.rs-4642662/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4642662/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"The primary aim of this research is to identify driver drowsiness to prevent car accidents and improve road safety. This study explores and proposes potential solutions to mitigate drowsiness-related accidents and enhance overall road security by monitoring the driver's eye, mouth, and head movements, which are key indicators of drowsiness. An algorithm has been developed to track these movements. The study involves analyzing CNN models and computer vision models for eye detection, yawn detection, and head movement, with the objective of identifying the most effective approach among the models. For building CNN models, we use the MRL Eye Dataset and YawDD datasets for eye and mouth tracking. The performance of the models developed using these datasets will be assessed based on various metrics such as accuracy and loss. We achieved an accuracy of 84.53\\% using the CNN approach for the MRL Eye Dataset, while for the YawDD Dataset, we obtained an accuracy of 96.42\\%. Despite these impressive results, CNN techniques encounter several challenges, including poor performance in low light conditions, varying outcomes for different ethnicities, and unreliability in dynamic environments. To address these challenges, we utilize Dlib and MediaPipe for tracking facial landmarks. By leveraging these techniques, we can simultaneously monitor head movement, eye movements, and detect yawns. Between Dlib and MediaPipe, we prefer MediaPipe due to its capability to detect the 3D coordinates of key points.","manuscriptTitle":"Smart Driver Assistance: Real-Time Drowsiness Detection Leveraging Facial Cues with MediaPipe and OpenCV","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-07-19 03:18:53","doi":"10.21203/rs.3.rs-4642662/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"c360f9de-d979-4df3-862a-2f6217bf18b6","owner":[],"postedDate":"July 19th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2024-08-12T12:04:18+00:00","versionOfRecord":[],"versionCreatedAt":"2024-07-19 03:18:53","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-4642662","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4642662","identity":"rs-4642662","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00