High-quality video generation with DiT architecture. Part of the Zen LM ecosystem.
Satori is a video generation framework using Diffusion Transformer (DiT) architecture, supporting text-to-video and image-to-video generation at high resolution.
- Text-to-video generation up to 720p
- Image-to-video animation
- Multi-resolution and variable duration support
- Efficient inference with sequence parallelism
- zen-video — Video generation models
- zen-director — Video direction and control
- Zen LM — Full model family
Apache 2.0