E-Commerce Demand Forecasting Using SARIMA Model and K-means Clustering Analysis
DOI:
https://doi.org/10.54097/ctfb0379Keywords:
SARIMA Model, K-means, E-Commerce Demand ForecastingAbstract
E-commerce has become a significant driver in the retail industry within the digital era, with escalating online transaction volumes. Accurate demand forecasting and effective inventory management are crucial for retailers to minimize costs related to overstock and stockouts while maximizing customer satisfaction and revenue. This study explores the application of advanced time series models for demand forecasting by e-commerce retailers, with a particular emphasis on the Seasonal Autoregressive Integrated Moving Average (SARIMA) model. The SARIMA model, an extension of the ARIMA model, incorporates autoregression, moving averages, and accounts for seasonal variations, making it suitable for analyzing data with regular periodic patterns, such as seasonal fluctuations in consumer purchasing behavior. The study methodology includes data preprocessing, establishment of the ARIMA model, construction of a Linear Regression (LR) model, development of the SARIMA model, demand forecasting, and clustering analysis. The SARIMA model demonstrated higher fit and predictive accuracy compared to traditional LR and ARIMA models, as substantiated by the 1-mWAPE and RMSE metrics. The application of the K-means clustering algorithm enhanced the homogeneity of demand forecasting and improved the model's adaptability to new sequence data. Cluster analysis categorized products into four classes, with more similar demand characteristics within each class, aiding the model in more accurately capturing potential demand changes. In conclusion, this research confirms the effectiveness and practicality of combining the SARIMA model with the K-means clustering algorithm for e-commerce demand forecasting.
References
Lalou P, Ponis S T, Efthymiou O K. Demand forecasting of retail sales using data analytics and statistical programming[J]. Management & Marketing, 2020, 15(2): 186-202.
Bandara K, Shi P, Bergmeir C, et al. Sales demand forecast in e-commerce using a long short-term memory neural network methodology[C]//Neural Information Processing: 26th International Conference, ICONIP 2019, Sydney, NSW, Australia, December 12–15, 2019, Proceedings, Part III 26. Springer International Publishing, 2019: 462-474.
Leung K H, Mo D Y, Ho G T S, et al. Modelling near-real-time order arrival demand in e-commerce context: a machine learning predictive methodology[J]. Industrial Management & Data Systems, 2020, 120(6): 1149-1174.
Shih Y S, Lin M H. A LSTM approach for sales forecasting of goods with short-term demands in E-commerce[C]//Intelligent Information and Database Systems: 11th Asian Conference, ACIIDS 2019, Yogyakarta, Indonesia, April 8–11, 2019, Proceedings, Part I 11. Springer International Publishing, 2019: 244-256.
Dabral P P, Murry M Z. Modelling and forecasting of rainfall time series using SARIMA[J]. Environmental Processes, 2017, 4(2): 399-419.
Dubey A K, Kumar A, García-Díaz V, et al. Study and analysis of SARIMA and LSTM in forecasting time series data[J]. Sustainable Energy Technologies and Assessments, 2021, 47: 101474.
Downloads
Published
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.