Part 4. Reinforcement Learning - Q-Learning
Now we have everything set up and the basic random policy. Mario would have to be super lucky to get anywhere with this. It's time to improve the policy.
I'm going to use the reinforcement learning technique of deep Q-learning.
This is the …