PharmaHacks McGill is a McGill University-affiliated non-profit organization. PharmaHacks is an annual life-science-themed hackathon focusing on tackling problems in the biotechnology and pharmaceutical industry. We offer university and graduate-level talents across Canada the opportunity to develop computer programs that solve biopharmaceutical, AI, and bioinformatics-focused challenges. We work with our sponsors to expand their talent database and to connect with passionate industry-ready students.

Requirements

What to build:

A well-documented ML solution to our problems, paired with a team presentation! Problems are gonna be presented during the open ceremony, and are listed in the official Pharmahacks 2026 Discord server in the #challenges channel. There are also listed below:

🧪 Challenge 1: Predicting Acute Toxicity (LD50),  Beginner

Can you predict how toxic a chemical compound is just from its molecular structure? In this challenge, you'll build machine learning models to predict the acute toxicity (LD50) of compounds using their SMILES representations. The dataset contains ~7,400 molecules with experimentally measured LD50 values, and your goal is a regression model that predicts log-transformed LD50 as accurately as possible. Start with molecular fingerprints and a simple baseline, then push further with richer descriptors, model ensembles, and interpretability analysis. 

🧬 Challenge 2: Compound-Target Binding Affinity, Advanced (Sponsored by Molecular Forecaster Inc.)

Given a drug compound and a protein target, can you predict how strongly they'll bind? This challenge provides ~421K drug-target interactions with experimental pIC50 measurements. You'll need to decide how to encode both SMILES strings and amino acid sequences into meaningful representations, then train a single protein-agnostic model that generalizes across targets. Your model will be evaluated on four test splits: warm, cold compound, cold target, and full cold.

🧠 Challenge 3: Detecting Dementia from EEG, Advanced

Can you detect Alzheimer's disease or frontotemporal dementia from brain activity alone? You'll work with EEG recordings from 88 subjects (19 channels, 500 Hz) and tackle a binary classification task: AD vs healthy or FTD vs healthy. The recordings vary in length, so you'll need to handle feature extraction (e.g., relative band power, spectral coherence) or padding, and aggregate epoch-level predictions into a single diagnosis per subject. This one demands comfort with signal processing and careful validation on a small dataset.

Hackathon Sponsors

Prizes

$CAD 1,700 in prizes
1st Place: LD50 Beginner (Challenge #1)
$CAD 200 in cash
1 winner

1st Place: Molecular Forcaster Challenge (Challenge #2)
$CAD 600 in cash
1 winner

1st Place: EEG Challenge (Challenge #3)
$CAD 800 in cash
1 winner

Best Pitch
$CAD 100 in cash
1 winner

Devpost Achievements

Submitting to this hackathon could earn you:

Judges

Theodor Semerdzhiev

Theodor Semerdzhiev
VP of Machine Learning

Judging Criteria

  • Methodology and Explorations
    Sound ML Practice, Strategy, and Justification. Specific details are presented in each challenge's PDF.
  • Results and Performance
    Predictive Performance, Validation Strategy, Generalization, and Robustness. Specific details are presented in each challenge's PDF.
  • Interpretability, Analysis, and Presentation Quality
    Interpretability and Clarity of Code and Communication. Specific details are presented in each challenge's PDF.

Questions? Email the hackathon manager

Tell your friends

Hackathon sponsors

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.