Research on E-Commerce Retail Demand Forecasting Based on SARIMA Model and K-means Clustering Algorithm

Authors

  • Yiding Zhao

DOI:

https://doi.org/10.54097/45acxz19

Keywords:

E-commerce, demand forecasting, SARIMA model, k-means clustering.

Abstract

With the rapid development of e-commerce, precise demand forecasting and efficient inventory management have become essential for the success and profitability of retail businesses. This study focuses on demand forecasting for e-commerce retailers using the Seasonal Autoregressive Integrated Moving Average (SARIMA) model and the K-means clustering algorithm. The research uses a dataset of 1996 sales time series covering various products, merchants, and warehouses, and aims to predict demand over the next 15 days. The study first evaluates three models, Linear Regression (LR), Autoregressive Integrated Moving Average (ARIMA), and SARIMA, by fitting them to historical sales data to forecast future demand. The SARIMA model is identified as the most effective through evaluation using the 1-mWAPE (mean weighted absolute percentage error) and RMSE (root mean square error) metrics. To enhance homogeneity within demand categories, the K-means clustering method is applied to divide products into four distinct groups, further refining the forecasting process. The paper also addresses the challenge of integrating new sequences into the dataset by using the clustering results to classify each new sequence and cosine similarity to identify analogous historical time series. These matched sequences serve as the basis for demand prediction using the established SARIMA model. The findings highlight the robustness of the SARIMA model in capturing trends and seasonality, providing a reliable framework for e-commerce demand forecasting that can inform inventory strategies and improve operational efficiency.
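
As an illustration of the evaluation and matching steps named in the abstract, the following is a minimal sketch in plain Python. It is not the author's implementation. The abstract does not give the exact formula for 1-mWAPE, so the sketch assumes the common definition 1 - sum|actual - forecast| / sum|actual|. All function names (rmse, one_minus_wape, cosine_similarity, most_similar_series) and the sample data are hypothetical.

```python
import math


def rmse(actual, forecast):
    """Root mean square error between two equal-length sequences."""
    squared = [(a - f) ** 2 for a, f in zip(actual, forecast)]
    return math.sqrt(sum(squared) / len(squared))


def one_minus_wape(actual, forecast):
    """Accuracy score, ASSUMED here to be 1 - sum|a - f| / sum|a|.

    Higher is better; a perfect forecast scores 1.0.
    """
    total_demand = sum(abs(a) for a in actual)
    if total_demand == 0:
        raise ValueError("actual demand sums to zero; WAPE is undefined")
    total_error = sum(abs(a - f) for a, f in zip(actual, forecast))
    return 1.0 - total_error / total_demand


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors.

    Returns 0.0 if either vector is all zeros (a convention chosen here).
    """
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar_series(new_series, history):
    """Return the key of the historical series closest to new_series.

    `history` maps a (hypothetical) series id to a list of sales values
    of the same length as `new_series`.
    """
    return max(history, key=lambda k: cosine_similarity(new_series, history[k]))


# Hypothetical data: two historical series and one short new series.
history = {"series_a": [3, 2, 1], "series_b": [2, 4, 6]}
match = most_similar_series([1, 2, 3], history)
```

In a pipeline like the one the abstract describes, the candidate set passed as history would presumably be restricted to the cluster the new sequence is assigned to, and the matched series would then be forecast with the fitted SARIMA model; both steps are outside this sketch.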

Published

27-04-2024

Section

Articles

How to Cite

Zhao, Y. (2024). Research on E-Commerce Retail Demand Forecasting Based on SARIMA Model and K-means Clustering Algorithm. Academic Journal of Science and Technology, 10(3), 226-231. https://doi.org/10.54097/45acxz19