====== Reinforcement learning ====== =References= - [[http://www.cs.ualberta.ca/~sutton/book/ebook/the-book.html|Introductory Book [EN] ]] ===MDP=== Markov Decision Processes, //Processus de Décision de Markov// ==Q-learning== ==Value iteration== ==Policy iteration==