Directional Predictability of the CSI 300 Index: A Walk-forward Evaluation of a BILSTM-attention Model
DOI:
https://doi.org/10.54097/8pbsb573Keywords:
CSI 300 index; BiLSTM Attention; Stock index prediction; Deep sequence models; Volatility aware hybrids.Abstract
This study looks at short-term predictability in China’s stock market, using the CSI 300 index as the main example. A Bidirectional Long Short-Term Memory (BiLSTM) model with an Attention mechanism is built that uses recent price- and volatility-based technical indicators and is tested with an expanding-window, walk-forward method for both direction classification and return-size regression. The model choice follows earlier work showing that deep sequence models and volatility-aware hybrids can capture nonlinear patterns and changing volatility, even though the economic value of daily predictions is usually small. In the results, the BiLSTM-Attention model gives a stable but modest edge in one-day-ahead direction prediction—AUC, PR-AUC, F1, and hit ratio are all above random—while predicted return sizes are still very noisy, which is consistent with weak-form market efficiency. The paper also discusses a tradability-oriented framework that links calibrated probabilities to position sizing under transaction costs and regime shifts, and outlines possible extensions using volatility-aware hybrids and multi-scale or global attention to improve robustness across different regimes and assets. Overall, the findings show that short-horizon predictability is fragile, and that careful implementation, probability calibration, and risk control are crucial if weak statistical signals are to be turned into investable strategies.
Downloads
References
[1] Engle R F. Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 1982, 50(4), 987–1007.
[2] Bollerslev T. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 1986, 31(3), 307–327.
[3] Fischer T, & Krauss C. Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 2018, 270(2), 654–669.
[4] Bao W, Yue J, & Rao Y. A deep learning framework for financial time series using stacked autoencoders and long short-term memory. PLOS ONE, 2017, 12(7), e0180944.
[5] Kim H Y, & Won C H. Forecasting the volatility of stock price index: A hybrid model integrating LSTM with GARCH. Expert Systems with Applications, 2018, 103, 25–37.
[6] Pan H, Tang Y, & Wang G. A stock index futures price prediction approach based on the MULTIGARCHLSTM mixed model. Mathematics, 2024, 12, 1677.
[7] Yu F, Tong Q, et al. BiLSTMMLAM: A multiscale feature fusion model with local attention for time series prediction. Sensors, 2024, 24, 3962.
[8] Mitra A, Jain A, Kishore A, & Kumar P. A comparative study of demand forecasting models for a multichannel retail company: A novel hybrid machine learning approach. Operations Research Forum, 2022, 3, 58.
[9] Ejaz A, Ahmed A D, et al. Stock price prediction using machine learning and deep learning methods during COVID-19. European Journal of Management and Business Economics, 2022, 31(3), 234–252.
[10] Khan M U G, Yu H, Tariq M, Javed M A, & Baik S W. LSTMmTransMLP: A hybrid model for robust financial time series prediction. Sci, 2025, 7(7), 1–22.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

