Enhancing Subscription Fraud Detection Through Ensemble Learning: The Case of Ethio Telecom | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Enhancing Subscription Fraud Detection Through Ensemble Learning: The Case of Ethio Telecom Esubalew Asmare Desta, Kidus Workineh, Abenet Alazar Hailu, Fikadu Berie Adugna, and 4 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7297765/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 09 Feb, 2026 Read the published version in Scientific Reports → Version 1 posted 14 You are reading this latest preprint version Abstract Telecommunication companies globally face the critical challenge of subscription fraud, which threatens both financial stability and national security. This research addresses this issue by developing an advanced fraud detection model specifically for Ethio Telecom. The model utilizes Ensemble and Adaptive Learning techniques to enhance detection accuracy by combining multiple classifiers. The study used a dataset of 1,000,000 Call Detail Records (CDRs) collected over two months known for increased fraudulent activity3. After filtering out irrelevant data and aggregating multiple call records per subscriber, the dataset was refined to 349,164 records. Initially, 16 features were analyzed, with four excluded for lacking relevance. The remaining 11 features, excluding the target variable, underwent preprocessing including data cleaning, transformation, and balancing4. Feature selection, utilizing Correlation Matrix and Random Forest importance analysis, led to the removal of four additional features, resulting in a final set of 8 key features, including INT_DIALLED, RATIO_INT_TOTAL, and RATIO_UNIQUE_TOTAL4. Three individual models, namely Decision Tree (DT), Logistic Regression (LR), and Artificial Neural Network (ANN), were implemented alongside ensemble methods such as Bagging, Boosting, Stacking, and Voting, and adaptive models like Hoeffding Tree and Adaptive Random Forest45. The findings of this research recommend Stacking and Adaptive Random Forest (ARF) as robust tools for subscription fraud detection. Physical sciences/Engineering Physical sciences/Mathematics and computing Subscription Fraud Fraud Detection Ensemble Learning Adaptive Learning6 Decision Tree Adaptive Random Forest Stacking Call Detail Records (CDRs) Hyperparameter Tuning Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 09 Feb, 2026 Read the published version in Scientific Reports → Version 1 posted Editorial decision: Revision requested 31 Oct, 2025 Reviews received at journal 29 Oct, 2025 Reviews received at journal 30 Sep, 2025 Reviews received at journal 24 Sep, 2025 Reviewers agreed at journal 23 Sep, 2025 Reviewers agreed at journal 21 Sep, 2025 Reviewers agreed at journal 20 Sep, 2025 Reviewers agreed at journal 20 Sep, 2025 Reviewers agreed at journal 31 Aug, 2025 Reviewers invited by journal 18 Aug, 2025 Editor invited by journal 11 Aug, 2025 Editor assigned by journal 08 Aug, 2025 Submission checks completed at journal 07 Aug, 2025 First submitted to journal 05 Aug, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7297765","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":503283130,"identity":"e59a6164-15bc-4990-884d-52401bd86daf","order_by":0,"name":"Esubalew Asmare Desta","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA30lEQVRIiWNgGAWjYHAD5gNAQkKGFC1sCSAtPKRo4TEAkwTV8fOfMWD48WebvHl7z+dXN2oseBjYDx/dgE+L5IwcA8bettuGc86c3WadcwzoMJ60tBv4tBjcALqHt+E24wyJ3G3GOWxALRI8Zni12J8/Y8D4589t+xnyb54Z5/wjQosBQ44BMw/b7cQZEjzMj3PbiNAicSOt4LBs2+3kGTxpZsy5fRI8bIT8wt9/eOPDN39u285gP/z4c863Ojl+9sPH8GoBgQNQmk0CTBJSjgyYP5CiehSMglEwCkYOAAAZqEVZwouEcQAAAABJRU5ErkJggg==","orcid":"","institution":"University of Gondar","correspondingAuthor":true,"prefix":"","firstName":"Esubalew","middleName":"Asmare","lastName":"Desta","suffix":""},{"id":503283131,"identity":"63fbd670-c696-47fc-b603-2ac78849b46f","order_by":1,"name":"Kidus Workineh","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Kidus","middleName":"","lastName":"Workineh","suffix":""},{"id":503283132,"identity":"5d25d99d-7c05-46e3-85da-1ff6b7baf4a0","order_by":2,"name":"Abenet Alazar Hailu","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Abenet","middleName":"Alazar","lastName":"Hailu","suffix":""},{"id":503283133,"identity":"16df7823-832b-4785-9cf4-9a0e9eebd897","order_by":3,"name":"Fikadu Berie Adugna","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Fikadu","middleName":"Berie","lastName":"Adugna","suffix":""},{"id":503283134,"identity":"5b2cde4b-7322-41f0-bd37-22b075fa2cad","order_by":4,"name":"Alexander Takele Mengesha","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Alexander","middleName":"Takele","lastName":"Mengesha","suffix":""},{"id":503283135,"identity":"08865d6d-0964-464d-804a-0cf5cff435ae","order_by":5,"name":"Selamawit Fentie Belay","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Selamawit","middleName":"Fentie","lastName":"Belay","suffix":""},{"id":503283136,"identity":"820fa795-6a8e-4947-86d7-5d04336a5ced","order_by":6,"name":"Habtamu Ayenew Asegie","email":"","orcid":"","institution":"University of Gondar","correspondingAuthor":false,"prefix":"","firstName":"Habtamu","middleName":"Ayenew","lastName":"Asegie","suffix":""},{"id":503283137,"identity":"46205b9b-b391-4ad6-b705-39eb1b9440ab","order_by":7,"name":"Ayodeji Olalekan Salau","email":"","orcid":"","institution":"Saveetha Institute of Medical and Technical Sciences","correspondingAuthor":false,"prefix":"","firstName":"Ayodeji","middleName":"Olalekan","lastName":"Salau","suffix":""}],"badges":[],"createdAt":"2025-08-05 07:53:14","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7297765/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7297765/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s41598-026-38790-3","type":"published","date":"2026-02-09T15:57:06+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":102786411,"identity":"c8551cfc-1245-4e3b-9573-76871c17fccd","added_by":"auto","created_at":"2026-02-16 16:13:23","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1655333,"visible":true,"origin":"","legend":"","description":"","filename":"REVISEDENHANCINGSUBSCRIPTIONFRAUDDETECTIONTHROUGHENSEMBLELEARNING.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7297765/v1_covered_88d5b962-d539-4b7a-8cf4-f622e090e3b4.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"\u003cp\u003eEnhancing Subscription Fraud Detection Through Ensemble Learning: The Case of Ethio Telecom\u003c/p\u003e","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Subscription Fraud, Fraud Detection, Ensemble Learning, Adaptive Learning6, Decision Tree, Adaptive Random Forest, Stacking, Call Detail Records (CDRs), Hyperparameter Tuning","lastPublishedDoi":"10.21203/rs.3.rs-7297765/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7297765/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eTelecommunication companies globally face the critical challenge of subscription fraud, which threatens both financial stability and national security. This research addresses this issue by developing an advanced fraud detection model specifically for Ethio Telecom. The model utilizes Ensemble and Adaptive Learning techniques to enhance detection accuracy by combining multiple classifiers. The study used a dataset of 1,000,000 Call Detail Records (CDRs) collected over two months known for increased fraudulent activity3. After filtering out irrelevant data and aggregating multiple call records per subscriber, the dataset was refined to 349,164 records. Initially, 16 features were analyzed, with four excluded for lacking relevance. The remaining 11 features, excluding the target variable, underwent preprocessing including data cleaning, transformation, and balancing4. Feature selection, utilizing Correlation Matrix and Random Forest importance analysis, led to the removal of four additional features, resulting in a final set of 8 key features, including INT_DIALLED, RATIO_INT_TOTAL, and RATIO_UNIQUE_TOTAL4. Three individual models, namely Decision Tree (DT), Logistic Regression (LR), and Artificial Neural Network (ANN), were implemented alongside ensemble methods such as Bagging, Boosting, Stacking, and Voting, and adaptive models like Hoeffding Tree and Adaptive Random Forest45. The findings of this research recommend Stacking and Adaptive Random Forest (ARF) as robust tools for subscription fraud detection.\u0026nbsp;\u003c/p\u003e","manuscriptTitle":"Enhancing Subscription Fraud Detection Through Ensemble Learning: The Case of Ethio Telecom","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-08-26 15:23:40","doi":"10.21203/rs.3.rs-7297765/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-10-31T19:23:05+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-10-29T07:41:53+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-09-30T10:14:08+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-09-24T04:39:27+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"295478026436054679554239401285083477719","date":"2025-09-24T03:45:58+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"193647820091610985209029960584223042885","date":"2025-09-21T05:42:13+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"115923191564900809996893898482128991508","date":"2025-09-21T02:33:24+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"281586833144084869552698003849404473420","date":"2025-09-20T17:42:57+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"200290987620901327576418411247534260089","date":"2025-08-31T13:46:25+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-08-18T17:44:14+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2025-08-11T19:30:26+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-08-08T09:48:24+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-08-07T07:46:30+00:00","index":"","fulltext":""},{"type":"submitted","content":"Scientific Reports","date":"2025-08-05T07:39:57+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"669e5e3b-72f5-45ce-85cd-033ffa1ec738","owner":[],"postedDate":"August 26th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":53740288,"name":"Physical sciences/Engineering"},{"id":53740289,"name":"Physical sciences/Mathematics and computing"}],"tags":[],"updatedAt":"2026-02-16T16:11:06+00:00","versionOfRecord":{"articleIdentity":"rs-7297765","link":"https://doi.org/10.1038/s41598-026-38790-3","journal":{"identity":"scientific-reports","isVorOnly":false,"title":"Scientific Reports"},"publishedOn":"2026-02-09 15:57:06","publishedOnDateReadable":"February 9th, 2026"},"versionCreatedAt":"2025-08-26 15:23:40","video":"","vorDoi":"10.1038/s41598-026-38790-3","vorDoiUrl":"https://doi.org/10.1038/s41598-026-38790-3","workflowStages":[]},"version":"v1","identity":"rs-7297765","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7297765","identity":"rs-7297765","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.