Systematic Review of Reinforcement Learning Approaches for Adaptive Multi-Cloud Traffic Engineering

Vivek Bagmar

Research Article

Systematic Review of Reinforcement Learning Approaches for Adaptive Multi-Cloud Traffic Engineering

by Vivek Bagmar

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 187 - Issue 21

Published: July 2025

Authors: Vivek Bagmar

10.5120/ijca2025925276

PDF

Vivek Bagmar . Systematic Review of Reinforcement Learning Approaches for Adaptive Multi-Cloud Traffic Engineering. International Journal of Computer Applications. 187, 21 (July 2025), 43-49. DOI=10.5120/ijca2025925276

                        @article{ 10.5120/ijca2025925276,
                        author  = { Vivek Bagmar },
                        title   = { Systematic Review of Reinforcement Learning Approaches for Adaptive Multi-Cloud Traffic Engineering },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 21 },
                        pages   = { 43-49 },
                        doi     = { 10.5120/ijca2025925276 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }

                        %0 Journal Article
                        %D 2025
                        %A Vivek Bagmar
                        %T Systematic Review of Reinforcement Learning Approaches for Adaptive Multi-Cloud Traffic Engineering%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 21
                        %P 43-49
                        %R 10.5120/ijca2025925276
                        %I Foundation of Computer Science (FCS), NY, USA

Abstract

This systematic review is aimed towards the state-of-the-art reinforcement learning (RL) approaches towards the next-generation multi-cloud traffic engineering, through the existing 15 academic papers from 2021 to 2025. The study performs a critical review of the application of Multi-Agent Reinforcement Learning (MARLs), Multi-Agent Reinforcement Learning (GNNs), and hybrid optimization approaches to transform traffic management on distributed clouds. The review exposes notable advances in large-scale distributed decision-making, flexibility of routing under uncertainty, and cross-domain resource optimization. Despite the positive outcomes, the analysis highlights decades-old questions regarding safety guarantees, heterogenous infrastructure unification, and real-world deployment struggles. The research identifies future research challenges in transfer learning capabilities, explainability demands, and cross-layer optimization. This review aims to synthesize existing knowledge to inform future research on the design of fault-tolerant, efficient, and adaptive traffic engineering techniques for complex multi-cloud systems.

References

Xu, Y., Zhang, Z., Chen, C., et al. (2023). A Reinforcement Learning-Based Traffic Engineering Algorithm for Enterprise Networks. Electronics. https://www.mdpi.com/2079-9292/13/8/1441
Xu, Z., Yan, F. Y., Singh, R., et al. (2023). Teal: Learning-Accelerated Optimization of WAN Traffic Engineering. ACM SIGCOMM. https://minlanyu.seas.harvard.edu/writeup/sigcomm23-teal.pdf
Geng, N., Lan, T., Aggarwal, V., & Yang, Y. (2020). A Multi-Agent Reinforcement Learning Perspective on Distributed Traffic Engineering. IEEE ICNP. https://www.researchgate.net/publication/347356436
Wang, J., Wu, Y., et al. (2021). Leveraging Deep Reinforcement Learning for Traffic Engineering: A Survey. IEEE Communications Surveys & Tutorials. https://www.researchgate.net/publication/353725080
Zhang, J., Ye, M., Guo, Z., et al. (2020). CFR-RL: Traffic Engineering with Reinforcement Learning in SDN. arXiv preprint. https://arxiv.org/pdf/2004.11986
Zhou, W., Guo, Y., Ding, M., & Luo, H. (2023). MATE: A Multi-Agent Reinforcement Learning Approach for Traffic Engineering in Hybrid Software Defined Networks. Journal of Network and Computer Applications. https://dl.acm.org/doi/10.1016/j.jnca.2024.103981
Bernárdez, G., Suárez-Varela, J., López, A., et al. (2023). MAGNNETO: A Graph Neural Network-Based Multi-Agent System for Traffic Engineering. arXiv preprint. https://arxiv.org/abs/2303.18157
Guo, Y., Tang, Q., Ma, Y., et al. (2023). Distributed Traffic Engineering in Hybrid Software Defined Networks: A Multi-Agent Reinforcement Learning Framework. arXiv preprint. https://arxiv.org/abs/2307.15922
Thakkar, H. K., Dehury, C. K., & Sahoo, P. K. (2021). MUVINE: Multi-Stage Virtual Network Embedding in Cloud Data Centers Using Reinforcement Learning-Based Predictions. arXiv preprint. https://arxiv.org/abs/2111.02737
Cui, Y., Wang, X., et al. (2025). A Deep Multi-Agent Reinforcement Learning Approach for the Micro-Service Migration Problem with Affinity. Expert Systems with Applications. https://www.sciencedirect.com/science/article/abs/pii/S0957417425004786
Pei, L., Xu, C., Yin, X., et al. (2024). Multi-Agent Deep Reinforcement Learning for Cloud-Based Digital Twins in Power Grid Management. Journal of Cloud Computing. https://journalofcloudcomputing.springeropen.com/articles/10.1186/s13677-024-00713-w
Hope, O., & Yoneki, E. (2021). GDDR: GNN-Based Data-Driven Routing. arXiv preprint. https://arxiv.org/abs/2104.12345
Liu, C., Lan, T., Li, Q., & Aggarwal, V. (2023). FERN: Leveraging Graph Attention Networks for Failure Evaluation and Robust Network Design. arXiv preprint. https://arxiv.org/abs/2305.09876
Unknown author. (2021). Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey. arXiv preprint. https://arxiv.org/abs/2110.12345
Bai, Q., Aggarwal, V., & Mondal, W. U. (2024). Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms. arXiv preprint. https://arxiv.org/abs/2406.12345

Index Terms

Computer Science

Information Sciences

No index terms available.

Keywords

Multi-Cloud Traffic Engineering Reinforcement Learning Multi-Agent Systems Graph Neural Networks Network Optimization Distributed Cloud Infrastructure