Socioeconomic Determinants of Cardiometabolic Diseases in Kenya: A Machine Learning Approach | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Socioeconomic Determinants of Cardiometabolic Diseases in Kenya: A Machine Learning Approach Tatenda Duncan Kavu, Evans Omondi, Daniel Mwanga, Patrice Mawa, and 2 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6408964/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 10 You are reading this latest preprint version Abstract Cardiometabolic diseases (CMDs) are major contributors to global morbidity and mortality, with significant disparities driven by socioeconomic factors. This study uses data collected in 2019 in Kenya by the AWI-Gen project, a collaborative initiative between the University of the Witwatersrand (Wits) and the INDEPTH Network. The work in Kenya was conducted within the Nairobi Urban Health and Demographic Surveillance System (NUHDSS), run by the African Population and Health Research Center (APHRC). The main objective is to apply machine learning methods to identify socioeconomic factors that influence rising CMD rates in Kenya, thereby guiding targeted interventions to mitigate CMDs. We performed translational research by leveraging the K-Prototypes clustering algorithm, a Random Forest classification model, and the SHAPley Additive Explanations (SHAP) technique to investigate and identify socioeconomic status factors influencing CMD risk. The K-Prototypes clustering model identified three groups: 31% of participants in the high-vegetable intake group, 34% in the overextended labour group, and 37% in the economically disadvantaged group had CMDs. The economically disadvantaged group, which had the highest prevalence of CMDs, also exhibited lower average years of schooling and a higher proportion of individuals with no formal education. Unemployment was notably high in this group (62%), with only 16% being self-employed, and asset ownership was significantly lower. Consuming vegetables and fruits more than three times a week reduced CMD risk, while education level, income, employment status, overall socioeconomic status, and the number of hours worked per week contributed to CMD risk in varying degrees. This study highlights that targeted socioeconomic interventions, including improved education and job creation, can mitigate CMD risk, especially among disadvantaged groups. Access to affordable, healthy food should be increased through subsidies, community gardening, and public-private partnerships that expand urban food markets. Healthcare coverage must be enhanced and out-of-pocket costs minimized, with government-funded mobile clinics reaching underserved populations. Public health campaigns can boost awareness of early detection and regular check-ups, further reducing CMD prevalence. Clustering Classification Cardiometabolic Diseases Socio-Economic Status Social Determinants of Health. Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Under Review Version 1 posted Reviews received at journal 15 Jun, 2025 Reviewers agreed at journal 04 Jun, 2025 Reviews received at journal 28 May, 2025 Reviewers agreed at journal 26 May, 2025 Reviewers agreed at journal 16 May, 2025 Reviewers invited by journal 14 May, 2025 Editor invited by journal 11 Apr, 2025 Editor assigned by journal 10 Apr, 2025 Submission checks completed at journal 10 Apr, 2025 First submitted to journal 09 Apr, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6408964","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":456590924,"identity":"bc6a3006-d975-4616-9a9c-1b00226ac6c3","order_by":0,"name":"Tatenda Duncan Kavu","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAsElEQVRIiWNgGAWjYDACHhBRwcBgQLwOsJYzJGthbCNFiz3P4WfSvPPs8sylDz9g+LinlghbeNvMpHm3JRdb9qUZMM54dpwILfwMxsa825gTN5zhYWDmOXCMGC3sn41559STooW3x/Axb8NhmJYaIrScOVP4cM6x48UGZ9gMDs44cICwFvae9A0H3tRU5xmcYX744MOBOsJaYCABRACtOEyiFiAgwZZRMApGwSgYMQAALvA3QMMsnw8AAAAASUVORK5CYII=","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":true,"prefix":"","firstName":"Tatenda","middleName":"Duncan","lastName":"Kavu","suffix":""},{"id":456590925,"identity":"8369c23f-97a2-404b-bf36-7b60574930bd","order_by":1,"name":"Evans Omondi","email":"","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":false,"prefix":"","firstName":"Evans","middleName":"","lastName":"Omondi","suffix":""},{"id":456590926,"identity":"230047d6-31f4-4033-b408-f61debfeeb20","order_by":2,"name":"Daniel Mwanga","email":"","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":false,"prefix":"","firstName":"Daniel","middleName":"","lastName":"Mwanga","suffix":""},{"id":456590927,"identity":"40b0ad1f-b398-414d-aa24-6ebd7b5c5aad","order_by":3,"name":"Patrice Mawa","email":"","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":false,"prefix":"","firstName":"Patrice","middleName":"","lastName":"Mawa","suffix":""},{"id":456590928,"identity":"b8e345b0-aff9-4b02-9bf8-1a1c2ac1beca","order_by":4,"name":"Steve Cygu","email":"","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":false,"prefix":"","firstName":"Steve","middleName":"","lastName":"Cygu","suffix":""},{"id":456590929,"identity":"488ffb98-7b65-4037-8622-f35ec4764ff5","order_by":5,"name":"Gershim Asiki","email":"","orcid":"","institution":"African Population and Health Research Center","correspondingAuthor":false,"prefix":"","firstName":"Gershim","middleName":"","lastName":"Asiki","suffix":""}],"badges":[],"createdAt":"2025-04-09 07:08:25","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6408964/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6408964/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":82855403,"identity":"b42041f3-fa92-478e-9d49-0f324095b663","added_by":"auto","created_at":"2025-05-16 04:53:21","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2377817,"visible":true,"origin":"","legend":"","description":"","filename":"SocioeconomicDeterminantsofCardiometabolicDiseasesinKenyaAMachineLearningApproach.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6408964/v1_covered_5a55ea3d-edda-49e8-b9c6-0f8c81a74267.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"\u003cp\u003eSocioeconomic Determinants of Cardiometabolic Diseases in Kenya: A Machine Learning Approach\u003c/p\u003e","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"bmc-public-health","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"pubh","sideBox":"Learn more about [BMC Public Health](http://bmcpublichealth.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/pubh/default.aspx","title":"BMC Public Health","twitterHandle":"@BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Clustering, Classification, Cardiometabolic Diseases, Socio-Economic Status, Social Determinants of Health.","lastPublishedDoi":"10.21203/rs.3.rs-6408964/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6408964/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"Cardiometabolic diseases (CMDs) are major contributors to global morbidity and mortality, with significant disparities driven by socioeconomic factors. This study uses data collected in 2019 in Kenya by the AWI-Gen project, a collaborative initiative between the University of the Witwatersrand (Wits) and the INDEPTH Network. The work in Kenya was conducted within the Nairobi Urban Health and Demographic Surveillance System (NUHDSS), run by the African Population and Health Research Center (APHRC). The main objective is to apply machine learning methods to identify socioeconomic factors that influence rising CMD rates in Kenya, thereby guiding targeted interventions to mitigate CMDs. We performed translational research by leveraging the K-Prototypes clustering algorithm, a Random Forest classification model, and the SHAPley Additive Explanations (SHAP) technique to investigate and identify socioeconomic status factors influencing CMD risk. The K-Prototypes clustering model identified three groups: 31\\% of participants in the high-vegetable intake group, 34\\% in the overextended labour group, and 37\\% in the economically disadvantaged group had CMDs. The economically disadvantaged group, which had the highest prevalence of CMDs, also exhibited lower average years of schooling and a higher proportion of individuals with no formal education. Unemployment was notably high in this group (62\\%), with only 16\\% being self-employed, and asset ownership was significantly lower. Consuming vegetables and fruits more than three times a week reduced CMD risk, while education level, income, employment status, overall socioeconomic status, and the number of hours worked per week contributed to CMD risk in varying degrees. This study highlights that targeted socioeconomic interventions, including improved education and job creation, can mitigate CMD risk, especially among disadvantaged groups. Access to affordable, healthy food should be increased through subsidies, community gardening, and public-private partnerships that expand urban food markets. Healthcare coverage must be enhanced and out-of-pocket costs minimized, with government-funded mobile clinics reaching underserved populations. Public health campaigns can boost awareness of early detection and regular check-ups, further reducing CMD prevalence.","manuscriptTitle":"Socioeconomic Determinants of Cardiometabolic Diseases in Kenya: A Machine Learning Approach","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-05-16 04:45:16","doi":"10.21203/rs.3.rs-6408964/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"editorInvitedReview","content":"","date":"2025-06-15T09:28:37+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"26169409525740182491847113310787644120","date":"2025-06-04T04:01:04+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-05-29T00:59:28+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"195917433545109135486829182403043241210","date":"2025-05-26T22:27:21+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"101471730122296206657355998801142628435","date":"2025-05-16T23:22:02+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-05-14T09:42:40+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2025-04-11T05:14:33+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-04-10T08:43:22+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-04-10T08:41:00+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Public Health","date":"2025-04-09T06:59:03+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"bmc-public-health","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"pubh","sideBox":"Learn more about [BMC Public Health](http://bmcpublichealth.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/pubh/default.aspx","title":"BMC Public Health","twitterHandle":"@BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"d73c63bd-9765-4d68-99ae-c8f24f75c291","owner":[],"postedDate":"May 16th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2025-05-16T04:45:16+00:00","versionOfRecord":[],"versionCreatedAt":"2025-05-16 04:45:16","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-6408964","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6408964","identity":"rs-6408964","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.