Dynamic Hyper-Parameter Adjustment in Fedprox for Performance Optimization in Federated Learning
DOI:
https://doi.org/10.54097/fphadj19Keywords:
Camera-Ready Paper, Trans Tech Publications, A4 Format.Abstract
This paper investigates the application of dynamic hyper-parameter adjustment strategies to improve the performance of the FedProx algorithm in federated learning (FL). FedProx addresses the challenges posed by data heterogeneity across clients, a common issue in FL that hinders convergence. While FedProx improves upon the basic FedAvg algorithm by introducing a proximal term to stabilize model updates, the hyper-parameter μ that controls the impact of the proximal term is sensitive and needs careful tuning. An inappropriate μ can degrade performance, making dynamic adjustment crucial. The study introduces two dynamic μ adjustment strategies. The first strategy, called ExponentialDecay, progressively reduces μ exponentially by multiplying it with a constant factor. The second strategy, AdaptiveLoss, adjusts μ based on changes in the loss value, decreasing it when the loss decreases and increasing it when the loss increases, offering a more responsive approach to the model’s performance. The effectiveness of both strategies is evaluated on the EMNIST dataset, which simulates real-world non-IID data distributions. Results show that both dynamic strategies outperform the static μ setting in terms of model accuracy, loss reduction, and training speed, particularly in scenarios with highly heterogeneous data. Specifically, the AdaptiveLoss strategy achieves the best performance when the data is extremely non-IID, reducing loss by 10.2% and improving accuracy by 2.5% compared to the static μ approach. These dynamic strategies not only enhance FedProx’s stability but also accelerate the convergence of the global model.
References
[1] McMahan, H.B., Moore, E., Ramage, D., Hampson, S., Aguera y Arcas, B.: 'Communication-Efficient Learning of Deep Networks from Decentralized Data'. Proc. 20th International Conference on Artificial Intelligence and Statistics (AISTATS), Fort Lauderdale, FL, USA, 2017, pp. 1–10 .
[2] Li, T., Sahu, A.K., Zaheer, M., Sanjabi, M., Talwalkar, A., Smith, V.: 'Federated Optimization in Heterogeneous Networks'. Proceedings of Machine Learning and Systems (MLSys), 2020, 2, pp. 429–450
[3] Cohen, G., Afshar, S., Tapson, J., van Schaik, A.: 'EMNIST: an extension of MNIST to handwritten letters'. Proc. 14th International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, December 2017, pp. 1–6 .
[4] Li, T., Sahu, A.K., Talwalkar, A., Smith, V.: 'Federated Learning: Challenges, Methods, and Future Directions'. IEEE Signal Processing Magazine, 2020, 37, (3), pp. 50–60 .
[5] Tao, H., Zhang, J., Zhou, Y., et al.: 'Heterogeneous Federated Learning: State-of-the-art and Research Challenges'. ACM Computing Surveys, 2024, 56, (3), Article 79, pp. 1–44 .
[6] Wang, J., et al.: 'Knowledge Generation and Distillation for Road Segmentation in Intelligent Transportation Systems'. IEEE Transactions on Intelligent Transportation Systems, 2025.
[7] Zhu, X., et al.: 'RMER-DT: Robust Multimodal Emotion Recognition in Conversational Contexts Based on Diffusion and Transformers'. Information Fusion, 2025, 103268.
[8] Wang, R., et al.: 'RAFT: Robust Adversarial Fusion Transformer for Multimodal Sentiment Analysis'. Array, 2025, 100445.
[9] Chen, Y., Fang, F., Wang, B., et al.: 'An Efficient Federated Learning Framework for IoT: Intrusion Detection'. Proc. IEEE International Conference on Communications (ICC), Denver, CO, USA, October 2024 .
[10] Acar, D.A.E., Zhao, Y., Matas, R., Mattina, M., Whatmough, P.N., Saligrama, V.: 'Federated Learning Based on Dynamic Regularization'. Proc. 9th International Conference on Learning Representations (ICLR), Virtual, 2021 .
[11] Haque, R.U., Markopoulos, P.: 'Federated Learning with Automated Dual-Level Hyperparameter Tuning'. IEEE Open Journal of Signal Processing, 2025, PP, (99), pp. 1–9 .
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







