Inverse_MDPGame

Forward solver and inverse learning for infinite-horizon MDP games. The inverse-learning step recovers the reward parameters that best explain a given set of multi-agent demonstrations, which are generated from ma_gym environments.

Data Generation: `ma_gym`

A policy trained with VDN (Value-Decomposition Networks) is used to sample 100 trajectories covering all three players in ma_gym:PredatorPrey5x5-v1. The trajectories are processed and saved as pickle files in the directory /processed_data/ma_gym:PredatorPrey5x5-v1/.
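The sampling-and-saving step might look roughly like the sketch below. It is not the repository's script: the function names `collect_trajectories` and `save_trajectories`, the trajectory dictionary layout, the file name `trajectories.pkl`, and the `policy` callable are all assumptions made for illustration. It also assumes an environment that follows the ma_gym convention, where `reset()` returns one observation per agent and `step(actions)` returns per-agent observations, rewards, and done flags.

```python
import pickle
from pathlib import Path


def collect_trajectories(env, policy, n_trajectories=100, max_steps=100):
    """Roll out `policy` in `env` and return a list of trajectories.

    Assumes the ma_gym-style multi-agent interface: `reset()` returns a list
    of per-agent observations and `step(actions)` returns
    (obs_n, reward_n, done_n, info) with one entry per agent.
    """
    trajectories = []
    for _ in range(n_trajectories):
        obs_n = env.reset()
        # Hypothetical layout: one list entry per time step, all agents together.
        traj = {"obs": [], "actions": [], "rewards": []}
        for _ in range(max_steps):
            actions = policy(obs_n)  # one action per agent
            next_obs_n, reward_n, done_n, _info = env.step(actions)
            traj["obs"].append(obs_n)
            traj["actions"].append(list(actions))
            traj["rewards"].append(list(reward_n))
            obs_n = next_obs_n
            if all(done_n):  # episode ends once every agent is done
                break
        trajectories.append(traj)
    return trajectories


def save_trajectories(trajectories, out_dir, filename="trajectories.pkl"):
    """Pickle the trajectories under `out_dir` (the file name is hypothetical)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / filename
    with open(path, "wb") as f:
        pickle.dump(trajectories, f)
    return path
```

With ma_gym installed, `env` would typically come from `gym.make("ma_gym:PredatorPrey5x5-v1")`, and `policy` would wrap the trained VDN agents. How the trained policy is loaded is not described here, so that part is left out.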

Convert data to Julia

julia convert_data_to_julia.jl --env ma_gym:PredatorPrey5x5-v1 --type vdn

After conversion to Julia's JLD2 format, the files are saved in the /julia_data/ma_gym:PredatorPrey5x5-v1/ directory.

Run experiments using the Proposed Algorithm

julia test_game.jl --env ma_gym:PredatorPrey5x5-v1 --type vdn --seed 1

Results are saved in the results/ma_gym:PredatorPrey5x5-v1/vdn/ directory.

Run experiments using the Baseline

julia test_decoupled.jl --env ma_gym:PredatorPrey5x5-v1 --type vdn --seed 1

Results are saved in the results/ma_gym:PredatorPrey5x5-v1/vdn_decoupled/ directory.
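To compare the proposed algorithm and the baseline over several seeds, the two commands in this README can be scripted. The sketch below only assembles the command lines shown here and, optionally, executes them. The helper names `build_commands` and `run_all` and the choice of seeds are assumptions, and it presumes that `--seed` accepts integers other than 1.

```python
import subprocess

ENV = "ma_gym:PredatorPrey5x5-v1"
SCRIPTS = ("test_game.jl", "test_decoupled.jl")  # proposed algorithm, baseline


def build_commands(seeds, env=ENV, data_type="vdn"):
    """Return one `julia ...` argument list per (script, seed) pair."""
    return [
        ["julia", script, "--env", env, "--type", data_type, "--seed", str(seed)]
        for script in SCRIPTS
        for seed in seeds
    ]


def run_all(seeds, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in build_commands(seeds):
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # requires julia on PATH
```

Calling `run_all([1, 2, 3])` prints the six commands without running anything; passing `dry_run=False` runs them one after another from the repository root.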

