This is a repository for the paper "Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models".
In this paper, we focus on methods that can learn from offline trajectories without reward annotations. We evaluate methods ranging from reinforcement learning to control, and find that planning with a learned latent dynamics model (PLDM) is a promising approach for this setting when the data is imperfect.
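For readers new to the idea, the sketch below illustrates the general recipe behind planning with a latent dynamics model: encode observations into a latent space, roll out a learned dynamics model in that space, and pick actions whose predicted rollout lands near the goal. This is a minimal conceptual illustration only; the encoder and dynamics here are toy random linear maps standing in for trained networks, and the function names are hypothetical, not the implementation in this repository (see `pldm/` for that).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for learned components: an encoder mapping observations to
# latent states, and a latent dynamics model predicting the next latent state.
OBS_DIM, LATENT_DIM, ACTION_DIM = 8, 4, 2
W_enc = rng.normal(size=(LATENT_DIM, OBS_DIM))
W_dyn = rng.normal(size=(LATENT_DIM, LATENT_DIM)) * 0.1
W_act = rng.normal(size=(LATENT_DIM, ACTION_DIM)) * 0.1

def encode(obs):
    """Map an observation to a latent state."""
    return W_enc @ obs

def latent_dynamics(z, action):
    """Predict the next latent state from the current latent state and action."""
    return z + W_dyn @ z + W_act @ action

def plan_random_shooting(z0, z_goal, horizon=10, n_candidates=256):
    """Return the first action of the sampled action sequence whose
    latent rollout ends closest to the goal latent state."""
    candidates = rng.uniform(-1.0, 1.0, size=(n_candidates, horizon, ACTION_DIM))
    best_cost, best_first_action = np.inf, None
    for seq in candidates:
        z = z0
        for a in seq:
            z = latent_dynamics(z, a)
        cost = np.linalg.norm(z - z_goal)
        if cost < best_cost:
            best_cost, best_first_action = cost, seq[0]
    return best_first_action

obs, goal_obs = rng.normal(size=OBS_DIM), rng.normal(size=OBS_DIM)
action = plan_random_shooting(encode(obs), encode(goal_obs))
print("first planned action:", action)
```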
To install PLDM:

```bash
git clone git@github.com:vladisai/PLDM.git
cd PLDM
pip install -r requirements.txt
pip install -e .
```
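As a quick sanity check after installation, you can try importing the packages. The package names `pldm` and `pldm_envs` are assumed here from the repository layout:

```python
# Quick sanity check that the editable install succeeded.
# Package names are assumed from the repository layout.
import pldm        # training / evaluation code
import pldm_envs   # environments and dataset utilities

print("PLDM import OK")
```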
- Go to `pldm_envs/` and follow the instructions to set up the dataset for the environment of your choice.
- Go to `pldm/` and follow the instructions to run training or evaluation.
The datasets we used to train our models are located in the folders inside `pldm_envs/`.
The READMEs there will guide you through downloading and setting up the datasets.
