Application and Optimization of Reinforcement Learning Based on Deep Q-Network (DQN) in Complex Environments
DOI:
https://doi.org/10.54097/yynknn80Keywords:
Reinforcement learning; double deep Q-Network; application and optimization.Abstract
Reinforcement Learning, as an important branch of machine learning, is dedicated to learning how to make optimal decisions to maximize the cumulative reward in an uncertain environment. Deep Q-Network (DQN) combines the advantages of deep learning and reinforcement learning and approximates the Q-value function through deep neural network. Despite the remarkable success of DQN in complex tasks, there are still challenges in continuous action space processing, high-dimensional state space exploration efficiency, adaptability to environmental changes, and delayed reward issues. To overcome these limitations, this paper proposes an optimization of the DQN algorithm, Double-Deep Q-Network, and the concept of DDQN. This paper first analyzes the challenges faced by DQN in complex environments and outline the introduction of DDQN and its comparative analysis with DQN. Then, a series of optimization measures are introduced, including network structure optimization, target network update strategy improvement, dynamic adjustment of exploration strategy and ensemble learning, to improve the performance and stability of DDQN. In addition, the application cases of DDQN in different fields such as autonomous driving, and game AI are explored, and the corresponding improvement strategies are analyzed. Finally, the optimization strategies of DDQN in practical applications are summarized, and its potential applications in future complex decision-making problems are prospected. Through continuous technological innovation and algorithm optimization, DDQN is expected to play a greater role in a wider range of fields and promote the development of intelligent decision systems.
Downloads
References
[1] Zhen He, Kien Phuoc Tran, Simon Thomassey, et al. Multi-Objective Optimization of the Textile Manufacturing Process Using Deep-Q-Network Based Multi-Agent Reinforcement Learning. Journal, 2020, 1-11.
[2] Wei, Z., Ma, Y., Yang, N., Ruan, S., & Xiang, C.. Reinforcement learning based power management integrating economic rotational speed of turboshaft engine and safety constraints of battery for hybrid electric power system. Energy, 2023, 263, Article 125752.
[3] Zhiyuan Liu, Xuefei Chen, Yan Chen, et al. Deep Reinforcement Learning Based Dynamic Resource Allocation in 5G Ultra-Dense Networks. In: 2019 IEEE International Conference on Smart Internet of Things (SmartIoT), 2019, 1-12.
[4] Bo Liu, Xiaoye Ye, Cheng Zhou, et al. The Improved Algorithm of Deep Q-learning Network Based on Eligibility Trace. In: 2020 6th International Conference on Control, Automation and Robotics (ICCAR), 2020.
[5] Zhiyu Yu, Xiaojun Yang, Feng GAO, et al. A Knowledge-based reinforcement learning control approach using deep Q network for cooling tower in HVAC systems. In: 2020 Chinese Automation Congress (CAC), 2020.
[6] Kiyoshi Kaneko, Takashi Komatsu, Zhou. Autonomous optimization of cutting conditions in end milling operation based on deep reinforcement learning (Offline training in simulation environment for feed rate optimization). Journal of Advanced Mechanical Design Systems and Manufacturing, 2023, 17(5).
[7] Jing Zhang, Fang Qian, Jian Yang. Online routing and spectrum allocation in elastic optical networks based on dueling Deep Q-network. Computers & Industrial Engineering, 2022, 173:108663.
[8] Xiaomin Li, Shaohu Yang, Jia Shi, et al. Deep reinforcement learning based resource allocation algorithm in cellular networks. Journal on Communications, 2019, 332-344.
[9] Zhuang Zhu, Jian Wang, Qi Qi, et al. Adaptive and Robust Network Routing Based on Deep Reinforcement Learning with Lyapunov Optimization. ACM, 2020.
[10] Xing Tang, Jinfeng Chen, Tian Liu, et al. Distributed Deep Reinforcement Learning-Based Energy and Emission Management Strategy for Hybrid Electric Vehicles. IEEE Transactions on Vehicular Technology, 2021, 70, 10.
[11] Yong Qian, Rui Wang, Jian Wu, et al. Reinforcement Learning Based Optimal Computing and Caching in Mobile Edge Network. IEEE Journal on Selected Areas in Communications, 2020, 1-11.
[12] Fan Wu, Lian Zhang, Yong He. Energy Efficiency Optimization in Downlink NOMA-Enabled Fog Radio Access Network Based on Deep Reinforcement Learning. In: 2021 Computing, Communications and IoT Applications (ComComAp), 2021.
[13] Yi Ke, Kai Huang, Xin Qiu, et al. Distributed Routing Optimization Algorithm for FANET Based on Multi-agent Reinforcement Learning. IEEE Sensors Journal, 2024, 24.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







