Prism-diffusion

This repository contains the implementation of the Prism method for controling colors in diffusion generated images, using a conditional LoRA, along with the necessary classes and dependencies.This repository provides the scripts and environment setup instructions to run the method effectively.

below are some examples of generations using the same prompt and seed, but with a change of color palette.

Red palette	Blue palette

CoLoRA architecture and training method

the CoLoRa layers are created mid run, using the trained pre-weights

the training method, is as follows

Repository Structure

The repository is structured as follows:

prism_training.ipynb: This Jupyter Notebook script contains the implementation of the Prism training method. It is the main script that needs to be executed to train a Prism LoRA. Before running the script, certain hyperparameters and logging information need to be filled in.
prism.yml: This file specifies the Conda environment required to run the code in prism_training.ipynb. It contains a list of dependencies and their versions to ensure a consistent and reproducible environment.
prism_classes.py: This Python file contains the necessary classes and functions used in the Prism training script. These classes implement the core functionality of the Prism method and provide the required functionality for training the deep learning model.

Getting Started

To run the Prism training code and train a deep learning model using Prism, follow the instructions below:

Environment Setup

Make sure you have Conda installed on your system. If not, follow the instructions for your operating system to install Conda from the official website.
Fork the repository by clicking the "Fork" button on the top-right corner of this repository page. This will create a copy of the repository in your GitHub account.
Clone this Git repository to your local machine using the following command:
```
git clone https://github.com/your-username/Prism-diffusion.git
```
Replace your-username with your GitHub username.
Navigate to the cloned repository:
```
cd Prism-diffusion
```
Create a Conda environment with the required dependencies by running the following command:
```
conda env create -f prism.yml
```
Activate the created Conda environment:
```
conda activate prism
```

Running the Prism Training Script

Open the prism_training.ipynb Jupyter Notebook in a Jupyter-compatible environment, such as Jupyter Notebook or JupyterLab.
Before running the script, fill in the necessary hyperparameters and logging information in the designated code cell. Modify the following variables according to your requirements:
- logging_name: The name to initialize the WandB tracker.
- args.validation_prompt: The prompt for validation (e.g., "a beautiful image of a flower").
- logging_training: Whether to log training information.
- train_check_every_global_steps: Log the train information every x steps.
- valid_check_every_global_steps: Log the validation information every x steps.
- args.checkpointing_steps: Save the checkpoint every x steps.
- args.learning_rate: Learning rate for the training process.
- args.max_train_steps: Maximum number of training steps.
- image_to_get_palette_from: Path to the image to get the palette from.
- coco_root: Path to the COCO dataset root directory.
- coco_annotation_path: Path to the COCO annotation file.
- args.report_to: Specify the reporting platform ("wandb" or None).
- path_to_weights: Path to pre-trained weights if you want to load existing weights.
Execute the cells in the Jupyter Notebook sequentially to run the Prism training script. The script will train the model using the Prism method based on the provided configurations.
Monitor the training progress and logging information in WandB. The training script will log the necessary information according to the specified configurations.

machine

in order to run this method, using a GPU with at least 20 GB RAM is a must

based on

this notebook code is based on the following code from the huggingface diffusers code: https://github.com/huggingface/diffusers/blob/main/examples/text_to_image/train_text_to_image_lora.py with changes relavent to our specific method and architecture, CoLoRA.

contact info

for any questions regarding the project, you can contact me at ronraphaeli at technion.ac.il .

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
prism.yml		prism.yml
prism_classes.py		prism_classes.py
prism_training.ipynb		prism_training.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prism-diffusion

CoLoRA architecture and training method

the CoLoRa layers are created mid run, using the trained pre-weights

the training method, is as follows

Repository Structure

Getting Started

Environment Setup

Running the Prism Training Script

machine

based on

contact info

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Prism-diffusion

CoLoRA architecture and training method

the CoLoRa layers are created mid run, using the trained pre-weights

the training method, is as follows

Repository Structure

Getting Started

Environment Setup

Running the Prism Training Script

machine

based on

contact info

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages