Xie, J. (2026) “Multi-Scale Entropy for Transformers: Interpreting Training Dynamics and Guiding an Adaptive Training Pipeline”, Mathematical Modeling and Algorithm Application, 9(1), pp. 678–694. doi:10.54097/ynsrha65.