PharmaHacks McGill is a McGill University-affiliated non-profit organization. PharmaHacks is an annual life-science-themed hackathon focusing on tackling problems in the biotechnology and pharmaceutical industry. We offer university and graduate-level talents across Canada the opportunity to develop computer programs that solve biopharmaceutical, AI, and bioinformatics-focused challenges. We work with our sponsors to expand their talent database and to connect with passionate industry-ready students.
Requirements
What to build:
A well-documented ML solution to our problems, paired with a team presentation! Problems are gonna be presented during the open ceremony, and are listed in the official Pharmahacks 2026 Discord server in the #challenges channel. There are also listed below:
🧪 Challenge 1: Predicting Acute Toxicity (LD50), Beginner
Can you predict how toxic a chemical compound is just from its molecular structure? In this challenge, you'll build machine learning models to predict the acute toxicity (LD50) of compounds using their SMILES representations. The dataset contains ~7,400 molecules with experimentally measured LD50 values, and your goal is a regression model that predicts log-transformed LD50 as accurately as possible. Start with molecular fingerprints and a simple baseline, then push further with richer descriptors, model ensembles, and interpretability analysis.
🧬 Challenge 2: Compound-Target Binding Affinity, Advanced (Sponsored by Molecular Forecaster Inc.)
Given a drug compound and a protein target, can you predict how strongly they'll bind? This challenge provides ~421K drug-target interactions with experimental pIC50 measurements. You'll need to decide how to encode both SMILES strings and amino acid sequences into meaningful representations, then train a single protein-agnostic model that generalizes across targets. Your model will be evaluated on four test splits: warm, cold compound, cold target, and full cold.
🧠 Challenge 3: Detecting Dementia from EEG, Advanced
Can you detect Alzheimer's disease or frontotemporal dementia from brain activity alone? You'll work with EEG recordings from 88 subjects (19 channels, 500 Hz) and tackle a binary classification task: AD vs healthy or FTD vs healthy. The recordings vary in length, so you'll need to handle feature extraction (e.g., relative band power, spectral coherence) or padding, and aggregate epoch-level predictions into a single diagnosis per subject. This one demands comfort with signal processing and careful validation on a small dataset.
Prizes
1st Place: LD50 Beginner (Challenge #1)
1st Place: Molecular Forcaster Challenge (Challenge #2)
1st Place: EEG Challenge (Challenge #3)
Best Pitch
Devpost Achievements
Submitting to this hackathon could earn you:
Judges
Theodor Semerdzhiev
VP of Machine Learning
Judging Criteria
-
Methodology and Explorations
Sound ML Practice, Strategy, and Justification. Specific details are presented in each challenge's PDF. -
Results and Performance
Predictive Performance, Validation Strategy, Generalization, and Robustness. Specific details are presented in each challenge's PDF. -
Interpretability, Analysis, and Presentation Quality
Interpretability and Clarity of Code and Communication. Specific details are presented in each challenge's PDF.
Questions? Email the hackathon manager
Tell your friends
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

