Predicting Air Quality Index in Accra using Machine Learning | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Predicting Air Quality Index in Accra using Machine Learning Stephen Edward Moore, Aaron P. Antwi, Eric Onyame, Sianou E. Houénafa This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-5774869/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Air quality is a significant public health issue, and accurate predictions of the Air Quality Index (AQI) are crucial for timely interventions. This study explores the use of supervised machine learning algorithms to forecast AQI across different neighborhoods in Accra, Ghana. Six models including Random Forest , CatBoost, Support Vector Regression (SVR), Linear Regression, Ridge and Lasso Regressions, were evaluated. Data from the Breathe Accra platform, encompassing pollutants and weather conditions in five neighborhoods, were preprocessed through data cleaning, feature selection, and normalization. Model performance was assessed using the coefficient of determination (R 2 ), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). While the initial performance of the models was less satisfactory, the incorporation of wavelet transform preprocessing significantly enhanced the results. This improvement was particularly notable for the area of Korle Bu, where the CatBoost model's R 2 increased from 0.32 to 0.58, RMSE decreased from 29.03 to 22.79, and MAE dropped from 22.11 to 17.01. On average, all models except SVR performed well across all areas, as evidenced by the evaluation metrics. These findings have direct implications for enhancing air quality management and policymaking in the city of Accra, where accurate AQI predictions are vital for effective public health interventions and environmental planning. Artificial Intelligence and Machine Learning Environmental Chemistry Environmental Policy Air quality Accra machine learning climate change wavelet transform Full Text Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-5774869","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":398376295,"identity":"2dc51221-fd72-4f79-8e6a-3862b8e5ba85","order_by":0,"name":"Stephen Edward Moore","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA+klEQVRIiWNgGAWjYJACZgSzAkJJAHECTvVsKFrOkKyFsY0ILfLzm489Lqi4w6Db3nzsw8d5h+X5G5gP3uZhqM3DpcXgGFu68YwzzxjMzhxLnjlz22HDGQfYkq15GI4X49TCxmMmzdt2mMHsRo4xM++2wwkGDEARHoZjiQ24HNbG/02a9x9Uy985IC1AEXxaGI7xsEnzNkC1MDaAbWEDaqnBqcXgWJq5Mc+xwzwgvzD2HEs3nHGYzdhyjsEB3A5rPvzsMU/NYTmz482HGX7UWMvztzc/vPGmog63w8Axw8DAg+CDo8ngMG4NUC0YoA6fllEwCkbBKBhZAAChwVJBYTo2fQAAAABJRU5ErkJggg==","orcid":"","institution":"Department of Mathematics, University of Cape Coast","correspondingAuthor":true,"prefix":"","firstName":"Stephen","middleName":"Edward","lastName":"Moore","suffix":""},{"id":398376296,"identity":"ba542e3f-a727-4c5d-b08e-f925eaca57f2","order_by":1,"name":"Aaron P. Antwi","email":"","orcid":"","institution":"Department of Mathematics, University of Cape Coast","correspondingAuthor":false,"prefix":"","firstName":"Aaron","middleName":"P.","lastName":"Antwi","suffix":""},{"id":398376297,"identity":"191fbb5f-e542-4c63-9248-481252e2aed5","order_by":2,"name":"Eric Onyame","email":"","orcid":"","institution":"School of Data Science, University of Virginia, Charlottesville, Virginia, USA","correspondingAuthor":false,"prefix":"","firstName":"Eric","middleName":"","lastName":"Onyame","suffix":""},{"id":398376298,"identity":"6ee80369-29ea-4844-a936-c48db0941618","order_by":3,"name":"Sianou E. Houénafa","email":"","orcid":"","institution":"Department of Mathematics, Pan African University Institute for Basic Sciences, Technology and Innovation, Nairobi, Kenya","correspondingAuthor":false,"prefix":"","firstName":"Sianou","middleName":"E.","lastName":"Houénafa","suffix":""}],"badges":[],"createdAt":"2025-01-06 15:04:25","currentVersionCode":1,"declarations":{"humanSubjects":false,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":false,"humanSubjectConsent":false,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-5774869/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-5774869/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":73135767,"identity":"a66f6e44-c6f8-4cdc-9089-2fc05397335c","added_by":"auto","created_at":"2025-01-07 06:00:17","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":3329834,"visible":true,"origin":"","legend":"","description":"","filename":"AirQualityProjectUCC.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5774869/v1_covered_786b5119-7ad0-4249-816d-a86bda05f04c.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003ePredicting Air Quality Index in Accra using Machine Learning\u003c/p\u003e","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"University of Cape Coast","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":true,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Air quality, Accra, machine learning, climate change, wavelet transform ","lastPublishedDoi":"10.21203/rs.3.rs-5774869/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-5774869/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eAir quality is a significant public health issue, and accurate predictions of the Air Quality Index (AQI) are crucial for timely interventions. This study explores the use of supervised machine learning algorithms to forecast AQI across different neighborhoods in Accra, Ghana. Six models \u0026nbsp;including Random Forest , CatBoost, Support Vector Regression (SVR), Linear Regression, Ridge and Lasso Regressions, were evaluated. Data from the Breathe Accra platform, encompassing pollutants and weather conditions in five neighborhoods, were preprocessed through data cleaning, feature selection, and normalization. Model performance was assessed using the coefficient of determination (R\u003csup\u003e2\u003c/sup\u003e), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). While the initial performance of the models was less satisfactory, the incorporation of wavelet transform preprocessing significantly enhanced the results. This improvement was particularly notable for the area of Korle Bu, where the CatBoost model's R\u003csup\u003e2\u003c/sup\u003e increased from 0.32 to 0.58, RMSE decreased from 29.03 to 22.79, and MAE dropped from 22.11 to 17.01. On average, all models except SVR performed well across all areas, as evidenced by the evaluation metrics. These findings have direct implications for enhancing air quality management and policymaking in the city of Accra, where accurate AQI predictions are vital for effective public health interventions and environmental planning.\u003c/p\u003e","manuscriptTitle":"Predicting Air Quality Index in Accra using Machine Learning","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-01-07 05:36:09","doi":"10.21203/rs.3.rs-5774869/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"12fa82a5-b52d-4fd2-8aa7-af85561f51b8","owner":[],"postedDate":"January 7th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":42437306,"name":"Artificial Intelligence and Machine Learning"},{"id":42437307,"name":"Environmental Chemistry"},{"id":42437308,"name":"Environmental Policy"}],"tags":[],"updatedAt":"2025-01-07T05:36:09+00:00","versionOfRecord":[],"versionCreatedAt":"2025-01-07 05:36:09","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-5774869","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-5774869","identity":"rs-5774869","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.