Multilingual Competency and Academic Performance: A Machine Learning-Based Analysis of the 2022/2023 Somaliland National Primary Exam Data | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Multilingual Competency and Academic Performance: A Machine Learning-Based Analysis of the 2022/2023 Somaliland National Primary Exam Data Jibril Abdikadir Ali, Mustafe Khadar Abdi, Tawakal Abdi Ali, Abdisalan Hassan Muse, and 2 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-5237882/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract This study assesses the influence of language proficiency on academic performance in primary education within Somaliland, utilizing data from the 2022/2023 National Primary Exams. Employing a dataset of 20,638 students and applying ten machine learning regression models, the research investigates the impact of Somali, Arabic, and English language skills on overall academic outcomes. The findings reveal that proficiency in these languages significantly contributes to overall performance, with English showing the strongest positive association, followed by Arabic and Somali. Additionally, the study highlights minimal gender disparities in academic performance, aligning with previous research from the region. However, the urban-rural divide in educational outcomes remains substantial, with urban students outperforming their rural counterparts. Machine learning models, particularly Polynomial Regression, outperformed traditional methods in predicting student success, showcasing the utility of advanced analytics in educational research. These findings offer critical insights for policymakers aiming to improve language education, reduce regional disparities, and promote equitable access to quality education across Somaliland. Physical sciences/Mathematics and computing/Computational science Physical sciences/Mathematics and computing/Information technology Physical sciences/Mathematics and computing/Software Physical sciences/Mathematics and computing/Statistics Language proficiency academic performance primary education machine learning multilingual education gender disparities urban-rural divide educational outcomes Somaliland regression models. Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-5237882","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":378599265,"identity":"d5f377d9-849d-4157-ac79-8fa7b734ffec","order_by":0,"name":"Jibril Abdikadir Ali","email":"","orcid":"","institution":"Amoud University","correspondingAuthor":false,"prefix":"","firstName":"Jibril","middleName":"Abdikadir","lastName":"Ali","suffix":""},{"id":378599266,"identity":"b4c1e33e-e340-4b1a-a372-afc459c56aa6","order_by":1,"name":"Mustafe Khadar Abdi","email":"","orcid":"","institution":"Amoud University","correspondingAuthor":false,"prefix":"","firstName":"Mustafe","middleName":"Khadar","lastName":"Abdi","suffix":""},{"id":378599267,"identity":"5738ed7e-a695-4ac3-b8eb-6a8791a11ae9","order_by":2,"name":"Tawakal Abdi Ali","email":"","orcid":"","institution":"Amoud University","correspondingAuthor":false,"prefix":"","firstName":"Tawakal","middleName":"Abdi","lastName":"Ali","suffix":""},{"id":378599268,"identity":"91cb1386-671b-444b-8546-4ce084d91d9f","order_by":3,"name":"Abdisalan Hassan Muse","email":"","orcid":"","institution":"Amoud University","correspondingAuthor":false,"prefix":"","firstName":"Abdisalan","middleName":"Hassan","lastName":"Muse","suffix":""},{"id":378599269,"identity":"3eac835e-94ea-49aa-a287-f81694033254","order_by":4,"name":"Mukhtar Abdi Omar","email":"","orcid":"","institution":"Amoud University","correspondingAuthor":false,"prefix":"","firstName":"Mukhtar","middleName":"Abdi","lastName":"Omar","suffix":""},{"id":378599270,"identity":"4dd8a7d8-5b16-4819-a0a5-4798ff465f52","order_by":5,"name":"Mukhtaar Axmed Cumar","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABH0lEQVRIiWNgGAWjYBAC+QYg8cAASEiAuAY2YFEgmxmnFoMDQCIBoSWNCC0gIoEBpoXhMBFa2LsTPyQU2OUzSDc//Pil4Ly8bnuP4Q2GCuvEBv7DB7D6pefsZokEg2TLBpljxtIyBrcNt505Y2zBcCY9sUEiLQGrNTdyNwC1MBswAElpCYPbjNtu5JhJMLYdBmrhMcChZfOPBIN6oJb0z78lDM7ZQ7T8A2rhP4NLyzag+YeBWnLMJD8YHEiEaGkAamHIwarF4MzZbRYJBscN2CRyyqwZDJKTt505VmyRcCzduA2HX+Tbezff+PCn2oBfIn3zzR9/7Gy3HW/eeONDjbVsP44QgwM2IGbmgfESoCIEAeMPYlSNglEwCkbBiAMAeR1iX7KFEBEAAAAASUVORK5CYII=","orcid":"","institution":"Amoud University","correspondingAuthor":true,"prefix":"","firstName":"Mukhtaar","middleName":"Axmed","lastName":"Cumar","suffix":""}],"badges":[],"createdAt":"2024-10-10 08:38:25","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-5237882/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-5237882/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":73448574,"identity":"85194ddf-2d6e-428a-8e0e-e2ef8af804e4","added_by":"auto","created_at":"2025-01-10 05:26:42","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":579012,"visible":true,"origin":"","legend":"","description":"","filename":"MLRLanguagesandPerformance20240929.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5237882/v1_covered_c88acf37-a600-4280-ac66-312369ede04a.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Multilingual Competency and Academic Performance: A Machine Learning-Based Analysis of the 2022/2023 Somaliland National Primary Exam Data","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Language proficiency, academic performance, primary education, machine learning, multilingual education, gender disparities, urban-rural divide, educational outcomes, Somaliland, regression models.","lastPublishedDoi":"10.21203/rs.3.rs-5237882/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-5237882/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThis study assesses the influence of language proficiency on academic performance in primary education within Somaliland, utilizing data from the 2022/2023 National Primary Exams. Employing a dataset of 20,638 students and applying ten machine learning regression models, the research investigates the impact of Somali, Arabic, and English language skills on overall academic outcomes. The findings reveal that proficiency in these languages significantly contributes to overall performance, with English showing the strongest positive association, followed by Arabic and Somali. Additionally, the study highlights minimal gender disparities in academic performance, aligning with previous research from the region. However, the urban-rural divide in educational outcomes remains substantial, with urban students outperforming their rural counterparts. Machine learning models, particularly Polynomial Regression, outperformed traditional methods in predicting student success, showcasing the utility of advanced analytics in educational research. These findings offer critical insights for policymakers aiming to improve language education, reduce regional disparities, and promote equitable access to quality education across Somaliland.\u003c/p\u003e","manuscriptTitle":"Multilingual Competency and Academic Performance: A Machine Learning-Based Analysis of the 2022/2023 Somaliland National Primary Exam Data","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-11-18 13:30:07","doi":"10.21203/rs.3.rs-5237882/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"27b58a4b-a72c-4dd6-8f74-c12581e7a3ae","owner":[],"postedDate":"November 18th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":40319544,"name":"Physical sciences/Mathematics and computing/Computational science"},{"id":40319545,"name":"Physical sciences/Mathematics and computing/Information technology"},{"id":40319546,"name":"Physical sciences/Mathematics and computing/Software"},{"id":40319547,"name":"Physical sciences/Mathematics and computing/Statistics"}],"tags":[],"updatedAt":"2025-04-23T04:53:10+00:00","versionOfRecord":[],"versionCreatedAt":"2024-11-18 13:30:07","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-5237882","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-5237882","identity":"rs-5237882","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.