A Free and Open Source LLM Fine-tuning Framework
- 2025/07:
- ND Parallelism support has been added to Gym: compose Context Parallelism (CP), Tensor Parallelism (TP), and Fully Sharded Data Parallelism (FSDP) within a single node and across multiple nodes. Check out the blog post for more info.
- Gym adds more models: GPT-OSS, Gemma 3n, Liquid Foundation Model 2 (LFM2), and Arcee Foundation Models (AFM).
- FP8 fine-tuning with the FP8 gather op is now possible in Gym via torchao. Get started here!
- Voxtral, Magistral 1.1, and Devstral with mistral-common tokenizer support have been integrated into Gym!
- TiledMLP support for single-GPU and multi-GPU training (DDP, DeepSpeed, and FSDP) has been added to enable Arctic Long Sequence Training (ALST). See examples for using ALST with Gym!
- 2025/05: Quantization Aware Training (QAT) support has been added to Gym. Explore the docs to learn more!
- 2025/03: Gym has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.
Older updates:
- 2025/06: Magistral with mistral-common tokenizer support has been added to Gym. See examples to start training your own Magistral models with Gym!
- 2025/04: Llama 4 support has been added in Gym. See examples to start training your own Llama 4 models with Gym's linearized version!
- 2025/03: (Beta) Fine-tuning Multimodal models is now supported in Gym. Check out the docs to fine-tune your own!
- 2025/02: Gym has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.
- 2025/02: Gym has added GRPO support. Dive into our blog and GRPO example and have some fun!
- 2025/01: Gym has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.
Gym is a free and open-source tool designed to streamline post-training and fine-tuning for the latest large language models (LLMs).
Features:
- Multiple Model Support: Train a wide range of models, including GPT-OSS, LLaMA, Mistral, Mixtral, Pythia, and many more available on the Hugging Face Hub.
- Multimodal Training: Fine-tune vision-language models (VLMs) including LLaMA-Vision, Qwen2-VL, Pixtral, LLaVA, SmolVLM2, and audio models like Voxtral with image, video, and audio support.
- Training Methods: Full fine-tuning, LoRA, QLoRA, GPTQ, QAT, Preference Tuning (DPO, IPO, KTO, ORPO), RL (GRPO), and Reward Modelling (RM) / Process Reward Modelling (PRM).
- Easy Configuration: Re-use a single YAML configuration file across the full fine-tuning pipeline: dataset preprocessing, training, evaluation, quantization, and inference.
- Performance Optimizations: Multipacking, Flash Attention, Xformers, Flex Attention, Liger Kernel, Cut Cross Entropy, Sequence Parallelism (SP), LoRA optimizations, Multi-GPU training (FSDP1, FSDP2, DeepSpeed), Multi-node training (Torchrun, Ray), and many more!
- Flexible Dataset Handling: Load from local, HuggingFace, and cloud (S3, Azure, GCP, OCI) datasets.
- Cloud Ready: We ship Docker images and also PyPI packages for use on cloud platforms and local hardware.
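To make the single-file configuration idea above concrete, here is a minimal sketch of what such a YAML might contain. The key names below are illustrative assumptions modeled on common fine-tuning configs, not Gym's confirmed schema; consult the Configuration Guide for the actual options.

```yaml
# Hypothetical config sketch — key names are illustrative, not Gym's confirmed schema.
base_model: meta-llama/Llama-3.2-1B   # example model id

datasets:
  - path: ./data/train.jsonl          # local dataset
    type: alpaca
  - path: s3://my-bucket/extra.jsonl  # cloud dataset (S3)
    type: alpaca

adapter: lora                          # LoRA fine-tuning
lora_r: 16
lora_alpha: 32

micro_batch_size: 2
num_epochs: 3
learning_rate: 2e-4
output_dir: ./outputs/llama-1b-lora
```

Because the same file drives preprocessing, training, evaluation, and inference, you only declare the model, data sources, and method once for the whole pipeline.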
Requirements:
- NVIDIA GPU (Ampere or newer for bf16 and Flash Attention) or AMD GPU
- Python 3.11
- PyTorch ≥2.6.0
```bash
pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja
pip3 install --no-build-isolation zoo-gym[flash-attn,deepspeed]

# Download example gym configs, deepspeed configs
gym fetch examples
gym fetch deepspeed_configs  # OPTIONAL
```

Installing with Docker can be less error-prone than installing in your own environment.
```bash
docker run --gpus '"all"' --rm -it zoolabs/gym:main-latest
```

Other installation approaches are described here.
```bash
# Fetch gym examples
gym fetch examples

# Or, specify a custom path
gym fetch examples --dest path/to/folder

# Train a model using LoRA
gym train examples/llama-3/lora-1b.yml
```

That's it! Check out our Getting Started Guide for a more detailed walkthrough.
- Installation Options - Detailed setup instructions for different environments
- Configuration Guide - Full configuration options and examples
- Dataset Loading - Loading datasets from various sources
- Dataset Guide - Supported formats and how to use them
- Multi-GPU Training
- Multi-Node Training
- Multipacking
- API Reference - Auto-generated code documentation
- FAQ - Frequently asked questions
- Join our Discord community for support
- Check out our Examples directory
- Read our Debugging Guide
- Need dedicated support? Please contact ✉️ wing@zoo.dev for options
Contributions are welcome! Please see our Contributing Guide for details.
Interested in sponsoring? Contact us at wing@zoo.dev
If you use Gym in your research or projects, please cite it as follows:
```bibtex
@software{gym,
  title = {Gym: Open Source LLM Post-Training},
  author = {{Zoo Labs Foundation Inc. and contributors}},
  url = {https://github.com/zoo-labs/gym},
  license = {Apache-2.0},
  year = {2023}
}
```

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.