GitHub - xy9485/DVQN_RL

This is the repository for paper:

Towards Unbiased Action Value Estimation in Reinforcement Learning

The training requires WANDB to log data

To train a agent, go to ./shell_scripts and run:

./train.sh algo_name

algo_name is to be replaced by the following methods:

dvqn, dqn, sarsa, ddqn, cddqn, avgdqn, dueldqn

Hyperparameters are to be modified within train.sh

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
gridworld		gridworld
hyperparams		hyperparams
latentrl		latentrl
shell_scripts		shell_scripts
toy_mdp		toy_mdp
.gitignore		.gitignore
README.md		README.md
conda_env.yml		conda_env.yml
requirements.txt		requirements.txt

Provide feedback