CrayonRobo

The official codebase for CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation (CVPR 2025)

Acknowledgement

This repo builds on LLama_Adapter and Where2act. We thank the authors for their wonderful work.

Setup

conda create --name crayonrobo python=3.8
conda activate crayonrobo
pip install -r requirements.txt

Note that the installed torch build must match your own CUDA version.
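One quick way to see which CUDA version the installed torch was built for is sketched below. The helper name `torch_cuda_report` is hypothetical and not part of this repo; it only reads `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`.

```python
def torch_cuda_report():
    """Summarize the installed torch build (hypothetical helper, not part of this repo)."""
    try:
        import torch
    except ImportError:
        # torch is not installed in the active environment
        return {"torch_installed": False}
    return {
        "torch_installed": True,
        "torch_version": torch.__version__,
        # CUDA version torch was compiled against; None for CPU-only builds
        "built_for_cuda": torch.version.cuda,
        # True only if a usable GPU and driver are visible to torch
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(torch_cuda_report())
```

If `built_for_cuda` is `None` or `cuda_available` is `False` on a GPU machine, the torch build probably does not match the local CUDA setup and may need to be reinstalled.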

Data Collection

Download our training and test data (train data and test data), then unzip the files to ./CrayonRobo/data_collection/data.

./data/train_dataset
  ├── 148_Faucet_0_pulling_0
  │   └── png/json/...
  ├── 148_Faucet_0_pulling_9
  │   └── png/json/...
  ├── ...
  │   ...
  └── ...
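To sanity-check an unzipped dataset against the layout above, a minimal sketch follows. It assumes each sample folder name has five underscore-separated fields, as in `148_Faucet_0_pulling_0`. The field names (`object_id`, `category`, `index`, `action`, `trial`) are guesses for illustration, not names taken from this repo.

```python
from pathlib import Path
from typing import List, NamedTuple


class EpisodeName(NamedTuple):
    # Field names are assumptions made for illustration only.
    object_id: str
    category: str
    index: int
    action: str
    trial: int


def parse_episode_name(name: str) -> EpisodeName:
    """Split a folder name such as '148_Faucet_0_pulling_0' into its five fields."""
    parts = name.split("_")
    if len(parts) != 5:
        raise ValueError(f"unexpected sample folder name: {name!r}")
    object_id, category, index, action, trial = parts
    return EpisodeName(object_id, category, int(index), action, int(trial))


def list_episodes(dataset_root: str) -> List[EpisodeName]:
    """Parse every sub-folder of dataset_root, in sorted order."""
    root = Path(dataset_root)
    return [parse_episode_name(p.name) for p in sorted(root.iterdir()) if p.is_dir()]
```

For example, `list_episodes("./data/train_dataset")` would raise a `ValueError` on the first folder that does not follow the assumed naming pattern.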

Or collect the data yourself: download the PartNet-Mobility assets and unzip them to ./CrayonRobo/data_collection/assets.

./assets
  ├── 148
  │   └── mobility.urdf
  ├── 149
  │   └── mobility.urdf
  ├── ...
  │   ...
  └── ...
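Before running data generation, it may help to confirm that every asset folder contains a `mobility.urdf`, as the tree above shows. The following is a small sketch. The function name `find_missing_urdfs` is hypothetical and not part of this repo.

```python
from pathlib import Path
from typing import List


def find_missing_urdfs(assets_root: str) -> List[str]:
    """Return names of asset folders under assets_root that lack mobility.urdf."""
    root = Path(assets_root)
    missing = []
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        if not (folder / "mobility.urdf").is_file():
            missing.append(folder.name)
    return missing
```

An empty list from `find_missing_urdfs("./assets")` means every asset folder matches the expected layout.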

cd ./CrayonRobo/data_collection/code

bash scripts/run_gen_offline_data.sh

This command first generates the training dataset and then the testing dataset.

Model Training

Preparation:

Download the checkpoints for LLaMa-Adapter and place them under ./CrayonRobo/crayonrobo/ckpts.

./ckpts/llama_model_weights
├── 7B
│   ├── checklist.chk
│   ├── consolidated.00.pth
│   └── params.json
└── tokenizer.model
./ckpts/BIAS_LORA_NORM-336-Chinese-7B.pth
./ckpts/ViT-L-14-336px.pt
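The checkpoint layout above can be verified before training with a short script such as the one below. This is a sketch only: the file list is copied from the tree above, and the helper name `missing_checkpoints` is hypothetical and not part of this repo.

```python
from pathlib import Path
from typing import List

# Relative paths copied from the checkpoint tree listed above.
EXPECTED_CKPT_FILES = [
    "llama_model_weights/7B/checklist.chk",
    "llama_model_weights/7B/consolidated.00.pth",
    "llama_model_weights/7B/params.json",
    "llama_model_weights/tokenizer.model",
    "BIAS_LORA_NORM-336-Chinese-7B.pth",
    "ViT-L-14-336px.pt",
]


def missing_checkpoints(ckpt_root: str) -> List[str]:
    """Return the expected checkpoint files that are absent under ckpt_root."""
    root = Path(ckpt_root)
    return [rel for rel in EXPECTED_CKPT_FILES if not (root / rel).is_file()]
```

Calling `missing_checkpoints("./ckpts")` from the `crayonrobo` directory should return an empty list once all downloads are in place.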

Model training: Training requires a server with at least 40 GB of memory. The command first generates the training JSON and then starts training.
```
cd ./CrayonRobo/crayonrobo

bash finetune.sh
```

Model Testing

Download the released checkpoint or use your own trained checkpoint. The link we provide is a Baidu Yun download link. If you cannot download it, feel free to email xl3062@columbia.edu and we will share the checkpoints with you directly. Note that, due to randomness in data collection, the provided testing dataset differs from the one used in the paper, so your results may differ slightly from, but should be comparable to, those reported in the paper.
Testing requires a server with at least 40 GB of memory. This command first runs the model on all test samples and then interacts with the objects in the simulator (SAPIEN).
```
cd ./CrayonRobo/crayonrobo

bash test.sh
```
Remember to set the --adapter_dir argument in test_model.sh to the directory where you placed the checkpoints. The default directory is ./CrayonRobo/crayonrobo/exp.
