Ensemble-Based Stock Prediction for Retail - XGBoost and LightGBM with Rolling Window Training


Date

2025

Publisher

Institute of Electrical and Electronics Engineers Inc.
IEEE

Abstract

Stock prediction in retail settings is a critical challenge for businesses worldwide, which require precise and timely forecasts to optimize inventory management and enhance customer satisfaction. State-of-the-art approaches to stock prediction rely on machine learning (ML) models, which need large amounts of historical sales data for effective training. Such detailed datasets are often hard to obtain, limiting the performance and scalability of these approaches. In this paper, we propose several strategies to address this limitation. First, we adopt a transfer-learning approach, utilizing pre-trained models like XGBoost and LightGBM, which are fine-tuned for stock prediction in retail environments. To further improve performance, we incorporate an ensemble method that combines the predictions of both models to improve accuracy and manage outliers. Experiments conducted on an extremely large dataset, comprising millions of retail transactions, highlight the presence of significant outliers. Our models, augmented with ensemble strategies, significantly outperform traditional models in handling these complexities and improving prediction accuracy. © 2025 Elsevier B.V., All rights reserved.
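The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas named in the title and abstract: rolling-window training and an ensemble that combines the predictions of two models. Everything here is an assumption for illustration. The function names (`rolling_windows`, `ensemble_forecast`, `rolling_mae`), the equal weights, and the toy sales series are hypothetical, and the two simple forecasters are stand-ins for the XGBoost and LightGBM regressors, not the authors' models.

```python
from statistics import mean
from typing import Callable, Iterator, List, Sequence, Tuple

# A forecaster maps a training window to a one-step-ahead prediction.
# In the paper's setting these would be gradient-boosted models; here
# they are plain functions so the sketch runs with the standard library.
Forecaster = Callable[[Sequence[float]], float]


def rolling_windows(n: int, window: int, horizon: int = 1) -> Iterator[Tuple[range, range]]:
    """Yield (train_idx, test_idx) pairs; the window slides forward by `horizon`."""
    start = 0
    while start + window + horizon <= n:
        train_idx = range(start, start + window)
        test_idx = range(start + window, start + window + horizon)
        yield train_idx, test_idx
        start += horizon


def ensemble_forecast(train: Sequence[float],
                      models: Sequence[Forecaster],
                      weights: Sequence[float]) -> float:
    """Weighted average of the individual model forecasts."""
    total = sum(weights)
    return sum(w * m(train) for m, w in zip(models, weights)) / total


def rolling_mae(series: Sequence[float],
                models: Sequence[Forecaster],
                weights: Sequence[float],
                window: int) -> float:
    """Mean absolute one-step error of the ensemble over all rolling windows."""
    errors: List[float] = []
    for train_idx, test_idx in rolling_windows(len(series), window):
        train = [series[i] for i in train_idx]
        pred = ensemble_forecast(train, models, weights)
        errors.append(abs(series[test_idx[0]] - pred))
    return mean(errors)


# Hypothetical stand-in forecasters (NOT XGBoost or LightGBM).
def last_value(train: Sequence[float]) -> float:
    return train[-1]


def window_mean(train: Sequence[float]) -> float:
    return mean(train)


# Toy daily sales with one outlier (40), purely illustrative.
sales = [10, 12, 11, 13, 40, 12, 11, 14]
mae = rolling_mae(sales, [last_value, window_mean], [0.5, 0.5], window=3)
print(f"rolling one-step MAE of the ensemble: {mae:.3f}")
```

Averaging two forecasters is one simple way an ensemble can dampen the effect of an outlier that one model tracks too closely. The weighting scheme actually used in the paper is not stated in the abstract.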

Description

Isik University

Keywords

Component, Formatting, Style, Styling, Insert


Source

-- 33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 -- Istanbul; Isik University Sile Campus -- 211450
33rd Conference on Signal Processing and Communications Applications-SIU-Annual -- Jun 25-28, 2025 -- Istanbul, Türkiye

Start Page

1

End Page

4