This paper deals with the trouble of multi-agent Finding out of a populace of players, engaged in the repeated normalform game. Assuming boundedly-rational brokers, we propose a design of social learning based upon trial and mistake, called "social reinforcement Finding out". This extension of well-known Q-Finding out algorithm, allows players https://gallagherp630ehn2.wikitron.com/user