Predicting Customer Churn in Retail Using Machine Learning on Transaction Data
| dc.contributor.author | Bozan M.T. | |
| dc.contributor.author | Gozukara H. | |
| dc.contributor.author | Patel J. | |
| dc.contributor.author | Kizilay A. | |
| dc.contributor.author | Sahin Z. | |
| dc.contributor.author | Tosun B. | |
| dc.contributor.author | Cakar T. | |
| dc.date.accessioned | 2026-03-05T15:02:38Z | |
| dc.date.available | 2026-03-05T15:02:38Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Customer churn prediction is critical for businesses to retain customers and reduce revenue loss. This paper presents a retail customer churn prediction study. We preprocess transactional data from a retail dataset comprising approximately 19.7 million transactions involving over 1 million customers. Temporal behavioral features, such as purchase frequency, monetary value, product variety, and promotional engagement metrics, are engineered using a four-month observation window. A Random Forest classifier is trained, utilizing balanced class weighting to address churn class imbalance. The churn label is defined as customers not purchasing in the subsequent six-month period. Our Random Forest model achieves approximately 84% accuracy, 86% precision, 85% recall, and an F1- score of 85%. Additionally, an XGBoost model achieves similar accuracy (≈ 84%) but higher recall (93%) and F1-score (89%), indicating improved churn prediction. The confusion matrix illustrates clear model performance. This study demonstrates that carefully engineered RFM-based features and ensemble learning approaches significantly enhance churn prediction in retail contexts. © 2025 IEEE. | en_US |
| dc.identifier.doi | 10.1109/UBMK67458.2025.11207037 | |
| dc.identifier.issn | 2521-1641 | |
| dc.identifier.scopus | 2-s2.0-105030842914 | |
| dc.identifier.uri | https://doi.org/10.1109/UBMK67458.2025.11207037 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.11779/3228 | |
| dc.language.iso | en | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | International Conference on Computer Science and Engineering, UBMK | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Customer Behavior | en_US |
| dc.subject | Customer Churn | en_US |
| dc.subject | Ensemble Learning | en_US |
| dc.subject | Feature Engineering | en_US |
| dc.subject | Machine Learning | en_US |
| dc.subject | Predictive Modeling | en_US |
| dc.subject | Random Forest | en_US |
| dc.subject | Retention Strategies | en_US |
| dc.title | Predicting Customer Churn in Retail Using Machine Learning on Transaction Data | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Çakar, Tuna | |
| gdc.author.scopusid | 60411218700 | |
| gdc.author.scopusid | 60411229400 | |
| gdc.author.scopusid | 60092909300 | |
| gdc.author.scopusid | 58876728800 | |
| gdc.author.scopusid | 58876518800 | |
| gdc.author.scopusid | 60411286500 | |
| gdc.author.scopusid | 60411218800 | |
| gdc.collaboration.industrial | true | |
| gdc.description.department | Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | |
| gdc.description.endpage | 1140 | en_US |
| gdc.description.issue | 2025 | en_US |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | N/A | |
| gdc.description.startpage | 1135 | en_US |
| gdc.description.wosquality | N/A | |
| gdc.identifier.openalex | W4415524209 | |
| gdc.index.type | Scopus | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.0 | |
| gdc.openalex.normalizedpercentile | 0.41 | |
| gdc.opencitations.count | 0 | |
| gdc.plumx.mendeley | 1 | |
| gdc.plumx.scopuscites | 0 | |
| gdc.publishedmonth | Ekim | |
| gdc.scopus.citedcount | 0 | |
| gdc.virtual.author | Çakar, Tuna | |
| gdc.yokperiod | YÖK - 2025-26 | |
| relation.isAuthorOfPublication | 10f8ce3b-94c2-40f0-9381-0725723768fe | |
| relation.isAuthorOfPublication.latestForDiscovery | 10f8ce3b-94c2-40f0-9381-0725723768fe | |
| relation.isOrgUnitOfPublication | a6e60d5c-b0c7-474a-b49b-284dc710c078 | |
| relation.isOrgUnitOfPublication | 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3 | |
| relation.isOrgUnitOfPublication | 0d54cd31-4133-46d5-b5cc-280b2c077ac3 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | a6e60d5c-b0c7-474a-b49b-284dc710c078 |
