Predicting Customer Churn in Retail Using Machine Learning on Transaction Data

dc.contributor.author Bozan M.T.
dc.contributor.author Gozukara H.
dc.contributor.author Patel J.
dc.contributor.author Kizilay A.
dc.contributor.author Sahin Z.
dc.contributor.author Tosun B.
dc.contributor.author Cakar T.
dc.date.accessioned 2026-03-05T15:02:38Z
dc.date.available 2026-03-05T15:02:38Z
dc.date.issued 2025
dc.description.abstract Customer churn prediction is critical for businesses to retain customers and reduce revenue loss. This paper presents a retail customer churn prediction study. We preprocess transactional data from a retail dataset comprising approximately 19.7 million transactions involving over 1 million customers. Temporal behavioral features, such as purchase frequency, monetary value, product variety, and promotional engagement metrics, are engineered using a four-month observation window. A Random Forest classifier is trained, utilizing balanced class weighting to address churn class imbalance. The churn label is defined as customers not purchasing in the subsequent six-month period. Our Random Forest model achieves approximately 84% accuracy, 86% precision, 85% recall, and an F1- score of 85%. Additionally, an XGBoost model achieves similar accuracy (≈ 84%) but higher recall (93%) and F1-score (89%), indicating improved churn prediction. The confusion matrix illustrates clear model performance. This study demonstrates that carefully engineered RFM-based features and ensemble learning approaches significantly enhance churn prediction in retail contexts. © 2025 IEEE. en_US
dc.identifier.doi 10.1109/UBMK67458.2025.11207037
dc.identifier.issn 2521-1641
dc.identifier.scopus 2-s2.0-105030842914
dc.identifier.uri https://doi.org/10.1109/UBMK67458.2025.11207037
dc.identifier.uri https://hdl.handle.net/20.500.11779/3228
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof International Conference on Computer Science and Engineering, UBMK en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Customer Behavior en_US
dc.subject Customer Churn en_US
dc.subject Ensemble Learning en_US
dc.subject Feature Engineering en_US
dc.subject Machine Learning en_US
dc.subject Predictive Modeling en_US
dc.subject Random Forest en_US
dc.subject Retention Strategies en_US
dc.title Predicting Customer Churn in Retail Using Machine Learning on Transaction Data en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.institutional Çakar, Tuna
gdc.author.scopusid 60411218700
gdc.author.scopusid 60411229400
gdc.author.scopusid 60092909300
gdc.author.scopusid 58876728800
gdc.author.scopusid 58876518800
gdc.author.scopusid 60411286500
gdc.author.scopusid 60411218800
gdc.collaboration.industrial true
gdc.description.department Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.endpage 1140 en_US
gdc.description.issue 2025 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 1135 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W4415524209
gdc.index.type Scopus
gdc.openalex.collaboration National
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.41
gdc.opencitations.count 0
gdc.plumx.mendeley 1
gdc.plumx.scopuscites 0
gdc.publishedmonth Ekim
gdc.scopus.citedcount 0
gdc.virtual.author Çakar, Tuna
gdc.yokperiod YÖK - 2025-26
relation.isAuthorOfPublication 10f8ce3b-94c2-40f0-9381-0725723768fe
relation.isAuthorOfPublication.latestForDiscovery 10f8ce3b-94c2-40f0-9381-0725723768fe
relation.isOrgUnitOfPublication a6e60d5c-b0c7-474a-b49b-284dc710c078
relation.isOrgUnitOfPublication 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3
relation.isOrgUnitOfPublication 0d54cd31-4133-46d5-b5cc-280b2c077ac3
relation.isOrgUnitOfPublication.latestForDiscovery a6e60d5c-b0c7-474a-b49b-284dc710c078

Files