In this project, I implement the Q-Learning and Dyna-Q solutions to the reinforcement learning problem. I apply them to a navigation problem in this project. In a later project, I will apply them to trading. The reason for working with the navigation problem first is that navigation is an easy problem to work with and understand. My Q-Learning code really shouldn't care which problem it is solving. The difference is that I need to wrap the learner in different code that frames the problem for the learner as necessary.

Overall, my tasks for this project include:

Code a Q-Learner Code the Dyna-Q feature of Q-Learning Test/debug the Q-Learner in navigation problems

Built With

Share this project:

Updates