Model Training

This tutorial describes how to train smaller Borzoi models on the example RNA-seq experiment processed in the make_data tutorial.

To train a 'Mini Borzoi' ensemble (~40M parameters, 2 cross-validation folds), run the script 'train_mini.sh'. The model parameters are specified in 'params_mini.json'. This model can be trained with a batch size of 2 on a 24 GB NVIDIA Titan RTX or RTX 4090 GPU.

conda activate borzoi_py310
cd ~/borzoi/tutorials/latest/train_model
./train_mini.sh

Alternatively, to train an even smaller 'Micro Borzoi' ensemble (~5M parameters), run the script 'train_micro.sh'. This model fits on the same GPUs with a batch size of 4, so the learning rate can be doubled and each epoch finishes in about half the time.

./train_micro.sh
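
The batch-size arithmetic for the Micro model can be sketched as below. This is only an illustration of the common linear scaling heuristic (learning rate proportional to batch size); it is not code from the training scripts. The names `scale_learning_rate` and `steps_per_epoch` are hypothetical, and `base_lr` and `num_sequences` are placeholder values, not values read from 'params_mini.json' or 'params_micro.json'.

```python
import math


def scale_learning_rate(base_lr: float, base_batch_size: int, new_batch_size: int) -> float:
    """Linear scaling heuristic: learning rate grows in proportion to batch size."""
    if base_batch_size <= 0 or new_batch_size <= 0:
        raise ValueError("batch sizes must be positive")
    return base_lr * new_batch_size / base_batch_size


def steps_per_epoch(num_sequences: int, batch_size: int) -> int:
    """Number of optimizer steps needed to see every training sequence once."""
    if batch_size <= 0:
        raise ValueError("batch size must be positive")
    return math.ceil(num_sequences / batch_size)


# Placeholder values for illustration only (assumptions, not from the params files).
base_lr = 1e-4
num_sequences = 1000

mini_batch, micro_batch = 2, 4
micro_lr = scale_learning_rate(base_lr, mini_batch, micro_batch)
mini_steps = steps_per_epoch(num_sequences, mini_batch)
micro_steps = steps_per_epoch(num_sequences, micro_batch)
print(f"learning rate: {base_lr} -> {micro_lr}")
print(f"steps per epoch: {mini_steps} -> {micro_steps}")
```

Doubling the batch size halves the number of steps per epoch. Whether wall-clock time per epoch halves as well depends on how per-step time changes with the smaller model and larger batch.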

Notes:

See here for a description of the scripts called internally by the training .sh script.
Rather than cropping the output predictions before applying the training loss, the latest version of Borzoi models applies a smooth position-specific loss weight that penalizes prediction errors less at the left and right boundaries.
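
To make the idea of a position-specific loss weight concrete, here is a minimal sketch. It assumes a half-cosine taper at each boundary and a weighted squared error; the actual weight shape and loss function used by Borzoi are not described in this tutorial, so both are stand-ins. The names `position_loss_weights`, `weighted_squared_error`, `taper_len`, and `min_weight` are hypothetical.

```python
import math


def position_loss_weights(seq_len: int, taper_len: int, min_weight: float = 0.0) -> list:
    """Weights of 1.0 in the interior that ramp smoothly down toward both edges.

    The half-cosine ramp is an assumption made for illustration.
    """
    if taper_len < 0 or 2 * taper_len > seq_len:
        raise ValueError("taper_len must satisfy 0 <= 2 * taper_len <= seq_len")
    weights = [1.0] * seq_len
    for i in range(taper_len):
        # Ramp rises from near 0 at the edge to near 1 at the end of the taper.
        ramp = 0.5 * (1.0 - math.cos(math.pi * (i + 0.5) / taper_len))
        w = min_weight + (1.0 - min_weight) * ramp
        weights[i] = w                # left boundary
        weights[seq_len - 1 - i] = w  # right boundary (mirror image)
    return weights


def weighted_squared_error(y_true, y_pred, weights) -> float:
    """Squared error per position, averaged using the position weights."""
    total = sum(w * (t - p) ** 2 for w, t, p in zip(weights, y_true, y_pred))
    return total / sum(weights)


weights = position_loss_weights(seq_len=16, taper_len=4)
target = [0.0] * 16

# The same unit error placed at the edge and at the center of the window.
edge_error = [1.0] + [0.0] * 15
center_error = [0.0] * 8 + [1.0] + [0.0] * 7

print(weighted_squared_error(target, edge_error, weights))
print(weighted_squared_error(target, center_error, weights))
```

Unlike cropping, which amounts to a weight of exactly zero outside the retained window, a smooth weight still lets boundary positions contribute to the gradient, only less strongly.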
