Accelerating Model-Based Reinforcement Learning with Skill Abstraction from Unlabeled Data

The code is built on SUPE and draws inspiration from SkiMo.

You will need to log in to wandb to view the results.

Before setting up the environment, make sure that MuJoCo and the dependencies for mujoco-py are installed (https://github.com/openai/mujoco-py). Then run the create_env.sh script, which creates the conda environment and downloads the pretrained checkpoints.
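As an optional sanity check after setup, the sketch below reports which of the expected Python packages cannot be found in the active environment. The helper name missing_modules and the choice of packages to check (mujoco_py and wandb) are assumptions made for illustration; they are not part of this repository.

```python
import importlib.util


def missing_modules(names):
    """Return the module names that cannot be located in the current environment.

    find_spec only looks the module up; it does not import it, so this check
    does not trigger any build step a package may run on first import.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed list of packages to check; adjust to match requirements.txt.
    missing = missing_modules(["mujoco_py", "wandb"])
    if missing:
        print("Not found in this environment:", ", ".join(missing))
    else:
        print("All checked packages were found.")
```

Run it inside the conda environment created by create_env.sh. A package that is found can still fail at import time if MuJoCo itself is misconfigured, so this is only a first check.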

Reproducing Experiments in the Paper

Pretraining

Pretrained checkpoints for all environments are downloaded by create_env.sh. Below are the commands used to generate the checkpoints.

Kitchen

python run_opal.py --env_name=kitchen-mixed-v0 --seed=1 --vision=False

To run the other tasks, set --env_name to kitchen-partial-v0 or kitchen-complete-v0.
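To make the substitution concrete, here is a minimal sketch that prints the pretraining command for each of the three Kitchen tasks. The helper pretrain_command is hypothetical; the script name and flags are copied from the command above, and nothing is executed.

```python
import shlex

# Task names as listed in this README.
KITCHEN_ENVS = ["kitchen-mixed-v0", "kitchen-partial-v0", "kitchen-complete-v0"]


def pretrain_command(env_name, seed=1):
    """Build the run_opal.py argument list for one Kitchen task (hypothetical helper)."""
    return [
        "python",
        "run_opal.py",
        f"--env_name={env_name}",
        f"--seed={seed}",
        "--vision=False",
    ]


if __name__ == "__main__":
    # Print one command per task; copy a line into the shell to run it.
    for env_name in KITCHEN_ENVS:
        print(shlex.join(pretrain_command(env_name)))
```

Each printed line can be pasted into a shell, or passed as a list to subprocess.run if you prefer to launch the runs from Python.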

Online Learning

Kitchen

python train_finetuning_mosaud.py --config.backup_entropy=False --config.num_min_qs=2 --offline_relabel_type=pred --use_rnd_offline=True --use_rnd_online=True --env_name=kitchen-mixed-v0 --seed=1 --config.init_temperature=1.0
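If you want to repeat the online run over several seeds, the command above can be generated programmatically. This is only a sketch: the helper finetune_command and the seed list (1, 2, 3) are assumptions, since this README shows only seed 1. The flags are copied verbatim from the command above.

```python
import shlex

# Flags copied from the README command, in the same order.
FLAGS_BEFORE_ENV = [
    "--config.backup_entropy=False",
    "--config.num_min_qs=2",
    "--offline_relabel_type=pred",
    "--use_rnd_offline=True",
    "--use_rnd_online=True",
]


def finetune_command(env_name, seed):
    """Build the train_finetuning_mosaud.py argument list (hypothetical helper)."""
    return [
        "python",
        "train_finetuning_mosaud.py",
        *FLAGS_BEFORE_ENV,
        f"--env_name={env_name}",
        f"--seed={seed}",
        "--config.init_temperature=1.0",
    ]


if __name__ == "__main__":
    # The seed list is an assumption for illustration.
    for seed in (1, 2, 3):
        print(shlex.join(finetune_command("kitchen-mixed-v0", seed)))
```

With seed 1 and kitchen-mixed-v0, the helper reproduces the command shown above exactly.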
