International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 38 - Issue 4 |
Published: January 2012 |
Authors: Sara Esfandiari, Behrooz Masoumi, Mohammad Reza Meybodi, Abdolkarim Niazi |
![]() |
Sara Esfandiari, Behrooz Masoumi, Mohammad Reza Meybodi, Abdolkarim Niazi . Accelerated Method Based on Reinforcement Learning and Case Base Reasoning in Multi agent Systems. International Journal of Computer Applications. 38, 4 (January 2012), 25-31. DOI=10.5120/4677-6796
@article{ 10.5120/4677-6796, author = { Sara Esfandiari,Behrooz Masoumi,Mohammad Reza Meybodi,Abdolkarim Niazi }, title = { Accelerated Method Based on Reinforcement Learning and Case Base Reasoning in Multi agent Systems }, journal = { International Journal of Computer Applications }, year = { 2012 }, volume = { 38 }, number = { 4 }, pages = { 25-31 }, doi = { 10.5120/4677-6796 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2012 %A Sara Esfandiari %A Behrooz Masoumi %A Mohammad Reza Meybodi %A Abdolkarim Niazi %T Accelerated Method Based on Reinforcement Learning and Case Base Reasoning in Multi agent Systems%T %J International Journal of Computer Applications %V 38 %N 4 %P 25-31 %R 10.5120/4677-6796 %I Foundation of Computer Science (FCS), NY, USA
In this paper, a new algorithm based on case base reasoning and reinforcement learning is proposed to increase the rate convergence of the reinforcement learning algorithms in multi-agent systems. In the propose method, we investigate how making improved action selection in reinforcement learning (RL) algorithm. In the proposed method, the new combined model using case base reasoning systems and a new optimized function has been proposed to select the action, which has led to an increase in algorithms based on Q-learning. The algorithm mentioned has been used for solving the problem of cooperative Markov’s games as one of the models of Markov based multi-agent systems. The results of experiments have shown that the proposed algorithms perform better than the existing algorithms in terms of speed and accuracy of reaching the optimal policy.