Reinforcement Learning with simple game
There is a traditional ‘optimal’ way to play the game of ‘tic tac toe’ — the way defined by applying the so-called ‘minimax’ algorithm (Chap 5 Russell & Norvig ‘Artif. Intelligence: a Modern Approach’ [R&N]) . An area in of application of ‘Markov Decision Processes’ and ‘Reinforcement Learning’ (Chap 22 [R&N], Chap 13 Mackworth & … Read more