Closed
Description
Model description
The model is a LLaMA-style architecture that uses a VQGAN for image input and generation. It is also likely to be fine-tuned to accept image patches as input, similar to Fuyu, so the implementation should stay flexible enough to support different types of image input. The weights are available under a research license.
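To make the flexibility point concrete, here is a rough sketch of what interchangeable image-input paths could look like. It is not taken from the model's code. Every name in it (`ImageInput`, `CodebookInput`, `PatchInput`, `split_into_patches`) is hypothetical, it uses plain Python lists instead of tensors, and the codebook path quantizes raw patches directly, whereas a real VQGAN runs a convolutional encoder before quantization.

```python
from typing import List, Protocol

Vector = List[float]
Image = List[List[float]]  # grayscale H x W, a toy stand-in for real pixel tensors


def split_into_patches(image: Image, size: int) -> List[Vector]:
    """Flatten non-overlapping size x size patches in row-major order."""
    height, width = len(image), len(image[0])
    if height % size or width % size:
        raise ValueError("image dimensions must be divisible by the patch size")
    patches = []
    for top in range(0, height, size):
        for left in range(0, width, size):
            patches.append(
                [image[top + dy][left + dx] for dy in range(size) for dx in range(size)]
            )
    return patches


class ImageInput(Protocol):
    """Hypothetical common interface: an image becomes a sequence of embeddings."""

    def embed(self, image: Image) -> List[Vector]: ...


class CodebookInput:
    """VQ-style path (simplified): patch -> nearest codebook id -> token embedding."""

    def __init__(self, codebook: List[Vector], token_embeddings: List[Vector], patch_size: int):
        self.codebook = codebook
        self.token_embeddings = token_embeddings
        self.patch_size = patch_size

    def tokenize(self, image: Image) -> List[int]:
        ids = []
        for patch in split_into_patches(image, self.patch_size):
            # Squared Euclidean distance from this patch to every codebook entry.
            distances = [
                sum((p - c) ** 2 for p, c in zip(patch, code)) for code in self.codebook
            ]
            ids.append(distances.index(min(distances)))
        return ids

    def embed(self, image: Image) -> List[Vector]:
        return [self.token_embeddings[i] for i in self.tokenize(image)]


class PatchInput:
    """Fuyu-style path: each flattened patch is linearly projected, no codebook."""

    def __init__(self, weight: List[Vector], patch_size: int):
        self.weight = weight  # shape: hidden_size x (patch_size * patch_size)
        self.patch_size = patch_size

    def embed(self, image: Image) -> List[Vector]:
        return [
            [sum(w * p for w, p in zip(row, patch)) for row in self.weight]
            for patch in split_into_patches(image, self.patch_size)
        ]
```

Because both classes expose the same `embed` method, the language model would only see a sequence of embeddings and would not need to know which path produced them. The codebook path also exposes `tokenize`, since discrete ids are what image generation would need to predict.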
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation