Official Repo of ALDA! Zero-shot visual generalization of RL agents on various DMControl tasks using disentangled representation learning with principles of associative memory.
- Create the conda environment and activate it:

  ```bash
  conda create -n alda python=3.8
  conda activate alda
  ```

- Run the script below, or manually install the packages listed in `dmcontrol_generalization_benchmark/setup/install_envs.sh`:

  ```bash
  cd dmcontrol_generalization_benchmark
  ./setup/install_envs.sh
  cd ..
  ```

- Install the packages in `requirements.txt`:

  ```bash
  pip install -r requirements.txt
  ```

- In `dmcontrol_generalization_benchmark/setup/config.cfg`, change `your/path/to/` to wherever you put this repository (see the sketch after this list).
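If you prefer to make the config edit from the shell, here is a minimal sketch, assuming GNU `sed`; `/home/user/` is a stand-in for your actual checkout location:

```bash
# Rewrite the your/path/to/ placeholder in config.cfg.
# NOTE: /home/user/ is a stand-in -- replace it with the directory containing this repo.
sed -i 's|your/path/to/|/home/user/|g' dmcontrol_generalization_benchmark/setup/config.cfg
```

On macOS (BSD `sed`), pass `-i ''` instead of `-i`.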
Follow the instructions here under the Datasets header to download the DAVIS dataset for the distracting background environment. Download and extract the dataset to `alda_official/dmcontrol_generalization_benchmark/datasets/`.
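For reference, a sketch of the download and extraction, assuming the distracting backgrounds use the standard DAVIS 2017 trainval (480p) release; defer to the linked instructions for the authoritative URL:

```bash
# Assumption: the DAVIS 2017 trainval 480p archive is the dataset in question.
wget https://data.vision.ee.ethz.ch/cvl/DAVIS/DAVIS-2017-trainval-480p.zip
unzip DAVIS-2017-trainval-480p.zip -d alda_official/dmcontrol_generalization_benchmark/datasets/
```

This should leave a `DAVIS/` directory under `datasets/`.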
The `specs/` folder contains YAML configs for all tasks, with the default hyperparameters used for the main paper results.
For example, to run ALDA on the Walker Walk task:

```bash
python -m scripts.train --experiment_spec_file specs/train_alda_walker_walk.yaml
```

We use Weights and Biases to record results. To enable W&B, add the `--use_wandb` flag and specify your W&B entity and project name with the `--wandb_entity` and `--wandb_project` flags:

```bash
python -m scripts.train --experiment_spec_file specs/train_alda_walker_walk.yaml --wandb_entity xyz --wandb_project abc
```

By default, the code will not let you run two experiments with the same name, so that you don't accidentally overwrite an existing result. To change this behavior, add the `--debug` flag:

```bash
python -m scripts.train --experiment_spec_file specs/train_alda_walker_walk.yaml --debug
```
- This SAC implementation is based on this repository (SVEA).
- ALDA builds directly on top of QLAE for disentanglement.
