FLAIR: A Foundation LAnguage Image model of the Retina

A Foundation LAnguage Image model of the Retina (FLAIR):
Encoding expert knowledge in text supervision
📜 Published in Medical Image Analysis
Julio Silva-Rodríguez¹, Hadi Chakor², Riadh Kobbi², Jose Dolz¹, Ismail Ben Ayed¹
¹ÉTS Montréal, ²DIAGNOS Inc.
| Project | Journal | ArXiv | Code | Tutorials |

Install FLAIR

In your environment, install a torch version compatible with your GPU. For example:

conda create -n flair_env python=3.11 -y
conda activate flair_env
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu124

Install the FLAIR library.

pip install git+https://github.com/jusiro/FLAIR.git
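After both install steps, a quick sanity check can confirm that the packages are visible from the active environment. The snippet below is a minimal sketch, not part of the FLAIR repository: `check_install` is a hypothetical helper, and the package names are taken from the commands and imports shown in this README.

```python
import importlib.util


def check_install(packages=("torch", "torchvision", "flair")):
    """Return {package: bool} telling whether each package can be located."""
    return {name: importlib.util.find_spec(name) is not None for name in packages}


status = check_install()
for name, found in status.items():
    print(f"{name}: {'found' if found else 'missing'}")

# If torch is present, also report whether it can see a CUDA device.
if status["torch"]:
    import torch

    print("CUDA available:", torch.cuda.is_available())
```

If `torch` is found but CUDA is reported as unavailable, the installed wheel probably does not match the local driver, and a different `--index-url` may be needed.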

Usage

from PIL import Image
import numpy as np

# Import FLAIR
from flair import FLAIRModel

# Set model
model = FLAIRModel.from_pretrained("jusiro2/FLAIR")

# Load image and set target categories 
# (if the repo is not cloned, download the image and change the path!)

image = np.array(Image.open("./documents/sample_macular_hole.png"))
text = ["normal", "healthy", "macular edema", "diabetic retinopathy", "glaucoma", "macular hole",
        "lesion", "lesion in the macula"]

# Forward FLAIR model to compute similarities
probs, logits = model(image, text)

print("Image-Text similarities:")
print(logits.round(3)) # [[-0.186 -3.092  3.357  4.444  6.223  7.493  7.028 11.395]]
print("Probabilities:")
print(probs.round(3))  # [[0.      0.     0.     0.001  0.005  0.019  0.012  0.962]]
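The printed values above are consistent with the probabilities being a softmax over the image-text similarities. The following sketch checks this numerically with NumPy only, using the logits copied from the example output. It is inferred from those numbers, not read from the library source, and `softmax` is a local helper rather than a FLAIR function.

```python
import numpy as np

categories = ["normal", "healthy", "macular edema", "diabetic retinopathy",
              "glaucoma", "macular hole", "lesion", "lesion in the macula"]

# Logits copied from the example output above (one image, eight prompts).
logits = np.array([[-0.186, -3.092, 3.357, 4.444, 6.223, 7.493, 7.028, 11.395]])


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


probs = softmax(logits)
best = int(probs.argmax(axis=-1)[0])

print(probs.round(3))
print("Top category:", categories[best])
```

Because the softmax is taken over the supplied prompts only, the probabilities depend on which categories are listed: adding or removing a prompt changes every value.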

Pre-training and transferability

In the following, we present the scripts for model pre-training and transferability. To use them, we recommend cloning the whole repository.

git clone https://github.com/jusiro/FLAIR.git
cd FLAIR
pip install -r requirements.txt

📦 Foundation model pre-training

Define the relative paths for datasets and dataframes in ./local_data/constants.py.
Prepare the FUNDUS assembly dataset - check ./local_data/prepare_partitions.py to prepare the dataframes.


01_EYEPACS   08_ODIR-5K   15_APTOS        22_HEI-MED   29_AIROGS        36_ACRIMA
02_MESSIDOR  09_PAPILA    16_FUND-OCT     23_HRF       30_SUSTech-SYSU  37_DeepDRiD
03_IDRID     10_PARAGUAY  17_DiaRetDB1    24_ORIGA     31_JICHI         38_MMAC
04_RFMid     11_STARE     18_DRIONS-DB    25_REFUGE    32_CHAKSU
05_1000x39   12_ARIA      19_Drishti-GS1  26_ROC       33_DR1-2
06_DEN       13_FIVES     20_E-ophta      27_BRSET     34_Cataract
07_LAG       14_AGAR300   21_G1020        28_OIA-DDR   35_ScarDat

⚠️ (26/04/25): Please be careful with the links! These are open-access repositories, and some datasets are not maintained by the original authors. Links may change or contain malware. I will try to keep them updated, but I can't check them very often :). If you find a problem, please open an Issue and I will look into it as soon as possible.

Vision-Language Pre-training.

python main_pretrain.py --augment_description True --balance True --epochs 15 --batch_size 128 --num_workers 6

📦 Transferability to downstream tasks/domains

Define the relative paths for datasets and dataframes in ./local_data/constants.py.
Prepare the experiment setting for the target dataset - we used ./local_data/experiments.py to store them.

if experiment == "02_MESSIDOR":
    setting = {"dataframe": PATH_DATAFRAME_TRANSFERABILITY_CLASSIFICATION + "02_MESSIDOR.csv",
               "task": "classification",
               "targets": {"no diabetic retinopathy": 0,
                           "mild diabetic retinopathy": 1,
                           "moderate diabetic retinopathy": 2,
                           "severe diabetic retinopathy": 3,
                           "proliferative diabetic retinopathy": 4}}
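A setting like the one above maps each category description to an integer label. The sketch below shows one way to sanity-check such an entry before launching an experiment. `get_setting` and `class_names` are hypothetical helpers written for illustration, and the path constant is a placeholder for the value defined in ./local_data/constants.py.

```python
# Placeholder: in the repository this constant lives in ./local_data/constants.py.
PATH_DATAFRAME_TRANSFERABILITY_CLASSIFICATION = "PLACEHOLDER_DIR/"


def get_setting(experiment):
    """Hypothetical lookup mirroring the if-branch shown above."""
    if experiment == "02_MESSIDOR":
        return {"dataframe": PATH_DATAFRAME_TRANSFERABILITY_CLASSIFICATION + "02_MESSIDOR.csv",
                "task": "classification",
                "targets": {"no diabetic retinopathy": 0,
                            "mild diabetic retinopathy": 1,
                            "moderate diabetic retinopathy": 2,
                            "severe diabetic retinopathy": 3,
                            "proliferative diabetic retinopathy": 4}}
    raise ValueError(f"Unknown experiment: {experiment}")


def class_names(setting):
    """Return category names ordered by label, checking labels are 0..K-1."""
    ordered = sorted(setting["targets"].items(), key=lambda item: item[1])
    labels = [label for _, label in ordered]
    if labels != list(range(len(labels))):
        raise ValueError("Target labels must be contiguous integers starting at 0")
    return [name for name, _ in ordered]


setting = get_setting("02_MESSIDOR")
names = class_names(setting)
print(names)
```

Keeping the labels contiguous and zero-based means a predicted class index can be mapped straight back to its text description via `names[index]`.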

Zero-shot (no adaptation).

python main_transferability.py --experiment 02_MESSIDOR --method zero_shot --load_weights True --domain_knowledge True --shots_train 0% --shots_test 100% --project_features True --norm_features True --folds 1
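The flags `--project_features True --norm_features True` suggest that zero-shot prediction compares projected, normalized image features against text features. The sketch below illustrates that general idea as a generic CLIP-style classifier in NumPy. It is an assumption about the approach, not the repository's implementation: random vectors stand in for model embeddings, and `l2_normalize`, `zero_shot_predict`, and `logit_scale` are illustrative names.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit L2 norm along the given axis."""
    return x / np.maximum(np.linalg.norm(x, axis=axis, keepdims=True), eps)


def zero_shot_predict(image_feats, text_feats, logit_scale=1.0):
    """Cosine similarity between images and class prompts, then softmax."""
    logits = logit_scale * l2_normalize(image_feats) @ l2_normalize(text_feats).T
    shifted = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(shifted)
    probs /= probs.sum(axis=1, keepdims=True)
    return probs, logits


rng = np.random.default_rng(0)

# Toy stand-ins: 5 class prompt embeddings in a 16-dimensional joint space.
text_feats = rng.normal(size=(5, 16))

# Toy images: each one is a slightly noisy copy of its class prompt embedding.
true_labels = np.array([0, 3, 4, 1])
image_feats = text_feats[true_labels] + 0.05 * rng.normal(size=(4, 16))

probs, logits = zero_shot_predict(image_feats, text_feats, logit_scale=10.0)
pred = probs.argmax(axis=1)
print("Predicted:", pred, "True:", true_labels)
```

No parameters are trained here, which matches the `--shots_train 0%` setting: the class prompts alone define the classifier.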

Linear Probing.

python main_transferability.py --experiment 02_MESSIDOR --method lp --load_weights True --shots_train 80% --shots_test 20% --project_features False --norm_features False --folds 5
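Linear probing generally means training only a linear classifier on top of frozen features. The sketch below shows that idea with a plain NumPy softmax regression, evaluated over five random 80%/20% splits to mirror the flags in the command. It is an illustration under stated assumptions: the features are synthetic, the optimizer and splitting scheme are choices made for this example, and `train_linear_probe` and `accuracy` are hypothetical helpers rather than repository functions.

```python
import numpy as np


def train_linear_probe(x, y, num_classes, lr=0.02, steps=300):
    """Fit softmax regression on fixed features with full-batch gradient descent."""
    n, dim = x.shape
    weights = np.zeros((dim, num_classes))
    bias = np.zeros(num_classes)
    onehot = np.eye(num_classes)[y]
    for _ in range(steps):
        logits = x @ weights + bias
        logits -= logits.max(axis=1, keepdims=True)
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - onehot) / n  # gradient of mean cross-entropy w.r.t. logits
        weights -= lr * (x.T @ grad)
        bias -= lr * grad.sum(axis=0)
    return weights, bias


def accuracy(weights, bias, x, y):
    """Fraction of samples whose highest-scoring class equals the label."""
    return float(((x @ weights + bias).argmax(axis=1) == y).mean())


rng = np.random.default_rng(0)

# Synthetic "frozen features": three well-separated clusters in 8 dimensions.
num_classes, dim, n = 3, 8, 300
centers = rng.normal(scale=3.0, size=(num_classes, dim))
labels = rng.integers(0, num_classes, size=n)
features = centers[labels] + rng.normal(size=(n, dim))

scores = []
for fold in range(5):
    perm = rng.permutation(n)
    cut = int(0.8 * n)  # 80% train, 20% test
    train_idx, test_idx = perm[:cut], perm[cut:]
    w, b = train_linear_probe(features[train_idx], labels[train_idx], num_classes)
    scores.append(accuracy(w, b, features[test_idx], labels[test_idx]))

mean_acc = float(np.mean(scores))
print("Per-fold accuracy:", np.round(scores, 3))
print("Mean accuracy:", round(mean_acc, 3))
```

Averaging over several splits gives a more stable estimate than a single train/test partition, which is presumably why the command uses `--folds 5`.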

Citation

If you find this repository useful, please consider citing this paper:

@article{FLAIR,
    title = {A Foundation Language-Image Model of the Retina (FLAIR): encoding expert knowledge in text supervision},
    author = {Julio Silva-Rodríguez and Hadi Chakor and Riadh Kobbi and Jose Dolz and Ismail {Ben Ayed}},
    journal = {Medical Image Analysis},
    volume = {99},
    pages = {103357},
    year = {2025},
    issn = {1361-8415},
}

License

Code and model weights are released under the Apache 2.0 license.
