LIU, Muchen. Integrating Multi-Agent Deep Deterministic Policy Gradient and Go-Explore for Enhanced Reward Optimization. Highlights in Science, Engineering and Technology, [S. l.], v. 85, p. 403–410, 2024. DOI: 10.54097/znrt8d63. Disponível em: https://drpress.org/ojs/index.php/HSET/article/view/18398. Acesso em: 11 jun. 2026.