RoTO is a reinforcement learning benchmark environment designed to standardise and promote future research in tactile-based manipulation. It is introduced in detail in Enhancing Tactile-based RL for Robotic Control (NeurIPS 2025). The environments are designed to cover a wide range of tactile interactions: sparse (Find), intermittent (Bounce), and sustained (Baoding). We will continue to add more environments and strongly welcome contributions 🤗
We split the paper code across two repositories. Think of the typical RL loop: multimodal_rl is the agent, and roto is the environment. We did this for modularity, in case you want to use your own RL repository instead of ours (some integration work will be needed, but we are happy to help).
multimodal_rl: The motto of this repo is "doing good RL with Isaac Lab as painlessly as possible". We started from the skrl library and made significant changes to better handle multimodal dictionary observations, observation stacking and associated memory management, and integrated self-supervision. Many existing libraries did not provide support for doing robust RL research (correct evaluation metrics, distinct train/evaluation envs, integrated hyperparameter optimisation). These are well established norms in the RL research community, but are not yet consistently present in RL+robotics research, which we want to encourage 🚀
roto: This repo just contains the robot configurations and task definitions. We take advantage of class inheritance to heavily reduce repeated code. RotoEnv is a child of DirectRLEnv, and sets up basic functions to perform joint position control of a robot and reset it. [Robot]Env is a child of RotoEnv, defining robot-specific functions that do not change task-to-task, e.g. the proprioceptive observation key. Finally, [Task]Env defines task-specific functions such as setting up the environment, rewards, and episode resets.
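The inheritance chain above can be sketched in plain Python. Note this is only an illustration: `DirectRLEnv` is stubbed out (in the real code it comes from Isaac Lab), and the method names below are hypothetical placeholders, not the repo's actual API.

```python
class DirectRLEnv:  # stand-in for the Isaac Lab base class
    pass

class RotoEnv(DirectRLEnv):
    """Shared logic: joint position control and resets."""
    def apply_joint_targets(self, targets):
        return f"driving {len(targets)} joints"

class ShadowEnv(RotoEnv):
    """[Robot]Env: robot-specific pieces that are constant across tasks."""
    prop_key = "prop"  # proprioceptive observation key

class BaodingEnv(ShadowEnv):
    """[Task]Env: scene setup, rewards, episode resets."""
    def compute_reward(self):
        return 0.0

env = BaodingEnv()
print(env.prop_key)                         # "prop", inherited from ShadowEnv
print(env.apply_joint_targets([0.0] * 20))  # "driving 20 joints", from RotoEnv
```

Each task env only needs to implement what actually differs, which is how the repeated code stays minimal.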
All agents are joint-position controlled: Franka has 9 joints, and Shadow has 20 actuated joints.
We use dictionary-style observations, categorised into proprioception, tactile, rgb, depth, and gt (ground truth). The proprioception and tactile methods should be defined in [Robot]Env, while gt information is task-dependent. To specify which observations are used, add the keys to obs_list in the agent cfg:
```yaml
observations:
  obs_list:
    - prop
    - tactile
    - rgb
    - depth
    - gt
  obs_stack: 3
tactile_cfg:
  binary_tactile: true
  binary_threshold: 0.01
pixel_cfg:
  width: 80
  height: 80
  latent_pixel_dim: 128
  normalise_rgb: true
  max_depth: 2.0 # meters
```
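As a rough illustration of how these options interact, here is a pure-Python sketch (not the repo's actual code; the helper names are invented) of selecting observation groups via obs_list, binarising tactile readings at binary_threshold, and normalising depth by max_depth:

```python
def build_observation(all_obs: dict, obs_list: list) -> dict:
    """Keep only the observation groups named in the agent cfg."""
    return {key: all_obs[key] for key in obs_list}

def preprocess(tactile, depth, binary_threshold=0.01, max_depth=2.0):
    """Binarise contact readings and scale depth (meters) into [0, 1]."""
    tactile_bin = [1.0 if t > binary_threshold else 0.0 for t in tactile]
    depth_norm = [min(d / max_depth, 1.0) for d in depth]
    return tactile_bin, depth_norm

all_obs = {"prop": [0.0] * 20, "tactile": [0.005, 0.02], "gt": [0.0] * 7}
obs = build_observation(all_obs, obs_list=["prop", "tactile"])  # drops "gt"
tactile_bin, depth_norm = preprocess(obs["tactile"], [0.5, 3.0])
print(tactile_bin)  # [0.0, 1.0]: only the 0.02 reading crosses the threshold
print(depth_norm)   # [0.25, 1.0]: 3.0 m is clipped at max_depth
```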
Here is an example rendering of the raw RGB, normalised RGB, and depth observations of a Shadow Baoding agent.

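The obs_stack option corresponds to stacking the most recent observations for each key. A stdlib sketch of the idea (the repo's actual stacking and memory management differ; this class is purely illustrative):

```python
from collections import deque

class ObsStacker:
    """Keeps the most recent `stack` frames per observation key."""
    def __init__(self, stack=3):
        self.stack = stack
        self.buffers = {}

    def push(self, obs: dict) -> dict:
        for key, value in obs.items():
            # First frame pads the history by repeating itself
            buf = self.buffers.setdefault(
                key, deque([value] * self.stack, maxlen=self.stack)
            )
            buf.append(value)
        # Concatenate the stacked history for each key
        return {k: [x for frame in b for x in frame] for k, b in self.buffers.items()}

stacker = ObsStacker(stack=3)
out = stacker.push({"prop": [1.0, 2.0]})
print(len(out["prop"]))  # 6: three stacked frames of a 2-dim observation
```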
We need to install Isaac Sim, Isaac Lab, multimodal_rl and roto in a conda environment. We recommend using the latest Isaac Sim for maximum performance.
- Create a conda environment and install Isaac Lab and Isaac Sim (easiest to install both as pip packages).
- Install multimodal_rl as a local editable package:

```shell
git clone git@github.com:elle-miller/multimodal_rl.git
cd multimodal_rl
pip install -e .
```

- Install roto as a local editable package:

```shell
git clone git@github.com:elle-miller/roto.git
cd roto
pip install -e .
```
- Test the installation by playing a trained agent in the viewer or saving a video. Note that viewer playback is much slower than real time.

```shell
python scripts/play.py --task Baoding --num_envs 512 --agent_cfg forward_dynamics_memory --checkpoint readme_assets/checkpoints/baoding_memory.pt
python scripts/play.py --task Baoding --num_envs 512 --agent_cfg forward_dynamics_memory --video --video_length 1200 --headless --checkpoint readme_assets/checkpoints/baoding_memory.pt
```
The video will be saved to a ./videos folder and should look like this:
You can find more trained checkpoints in the roto_paper_results repository.
This is mostly the same as the default Isaac Lab setup. The only breaking change is that a given task is not linked to a single cfg file: the cfgs must be defined in the task __init__.py and specified via the agent_cfg argument.

We provide 3 environments x 7 cfgs, corresponding to the paper experiments:
```python
gym.register(
    id="Baoding",
    entry_point="tasks.shadow.baoding:BaodingEnv",
    disable_env_checker=True,
    kwargs={
        "env_cfg_entry_point": baoding.BaodingCfg,
        "default_cfg": baoding_default_cfg,
        "rl_only_pt": baoding_rl_only_pt,
        "tac_recon": baoding_tactile_recon,
        "full_recon": baoding_full_recon,
        "forward_dynamics": baoding_forward_dynamics,
        "tac_dynamics": baoding_tactile_dynamics,
    },
)
```
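Under this pattern, resolving the agent_cfg argument amounts to a lookup into the registered kwargs. A hypothetical sketch of the idea (resolve_agent_cfg and the inlined registry are illustrative, not the actual train.py logic):

```python
# Toy registry mirroring a subset of the gym.register kwargs above.
REGISTRY = {
    "Baoding": {
        "default_cfg": "baoding_default_cfg",
        "rl_only_pt": "baoding_rl_only_pt",
        "forward_dynamics": "baoding_forward_dynamics",
    },
}

def resolve_agent_cfg(task: str, agent_cfg: str):
    """Look up the cfg registered for a task; fail loudly if missing."""
    cfgs = REGISTRY[task]
    if agent_cfg not in cfgs:
        raise ValueError(f"unknown agent_cfg '{agent_cfg}' for task '{task}'")
    return cfgs[agent_cfg]

print(resolve_agent_cfg("Baoding", "forward_dynamics"))  # baoding_forward_dynamics
```

This is what makes one task id serve all seven paper configurations without duplicating registrations.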
Here is how you would train a Find agent with RL only, a Bounce agent with RL + tactile reconstruction, and a Baoding agent with RL + forward dynamics:

```shell
python scripts/train.py --task Find --num_envs 4196 --headless --seed 1234 --agent_cfg rl_only_pt
python scripts/train.py --task Bounce --num_envs 4196 --headless --seed 1234 --agent_cfg tac_recon
python scripts/train.py --task Baoding --num_envs 4196 --headless --seed 1234 --agent_cfg forward_dynamics
```
We use Optuna for integrated hyperparameter optimisation. The command is the same as for train.py, but with an additional --study argument. You can specify the pruner, number of trials, number of warm-up steps, etc. I recommend this blogpost if you are new to sweeping :)
```shell
python scripts/sweep.py --task Find --num_envs 4196 --headless --seed 1234 --agent_cfg rl_only_pt --study find_rl_only_pt
python scripts/sweep.py --task Bounce --num_envs 4196 --headless --seed 1234 --agent_cfg tac_recon --study bounce_tac_recon
python scripts/sweep.py --task Baoding --num_envs 4196 --headless --seed 1234 --agent_cfg forward_dynamics --study baoding_forward_dynamics
```
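For intuition, the median-pruning behaviour a sweep relies on can be sketched in pure Python: after a warm-up period, a trial stops early if its intermediate score falls below the median of earlier trials at the same step. Optuna's MedianPruner implements this properly; this toy version only shows the mechanism.

```python
from statistics import median

def should_prune(step, score, history, n_warmup_steps=2):
    """history maps step -> list of scores from completed trials at that step."""
    if step < n_warmup_steps or not history.get(step):
        return False  # never prune during warm-up or without comparison data
    return score < median(history[step])

history = {3: [0.6, 0.8, 0.7]}
print(should_prune(3, 0.5, history))  # True: 0.5 is below the median of 0.7
print(should_prune(3, 0.9, history))  # False: above the median, keep going
print(should_prune(0, 0.0, history))  # False: still warming up
```

Pruning bad trials early is what lets a sweep cover many configurations in the same compute budget.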
See last step in installation.
Please see the paper for now.
The data in the paper (checkpoints, training logs, plot scripts) is available in the roto_paper_results repo.
For any questions, issues, or collaborations, please feel free to post an issue/start a discussion/reach out.
- Maintainer: Elle Miller
- Project Website: https://elle-miller.github.io/tactile_rl
This project is licensed under the BSD-3 License.
This is our plan for future additions, but we highly welcome community contributions and PRs!
- More environments
- Observation augmentations (the code exists; we just need to integrate it)
- Integrate TacSL for high-resolution touch sensing when it is released: isaac-sim/IsaacGymEnvs#244
- Provide transformer architectures
- Action chunking
If you use this benchmark environment in your academic or professional research, please cite the following work:
```bibtex
@inproceedings{miller2025tactilerl,
  author    = {Miller, Elle and McInroe, Trevor and Abel, David and Mac Aodha, Oisin and Vijayakumar, Sethu},
  title     = {Enhancing Tactile-based Reinforcement Learning for Robotic Control},
  booktitle = {NeurIPS},
  year      = {2025},
}
```