🏥 DeepPlantAI - Advanced Organ Transplant Prediction Platform

Inspiration

The critical need for better transplant outcomes prediction - Every year, thousands of patients undergo organ transplants with varying success rates. Current risk assessment methods rely heavily on clinical experience and basic scoring systems like MELD/PELD, which don't capture the full complexity of patient factors. We were inspired to leverage machine learning to provide more accurate, data-driven mortality predictions that could help clinicians make better-informed decisions and potentially save more lives.

What it does

DeepPlantAI is an AI-powered clinical decision support system that predicts post-transplant outcomes for organ transplant patients. The platform:

Predicts liver transplant mortality risk using advanced Gradient Boosted Decision Trees (XGBoost/LightGBM)
Supports multiple prediction tasks: Binary mortality, survival status, time categories, and risk stratification
Provides HCC recurrence prediction for hepatocellular carcinoma patients using ensemble models
Offers interactive visualizations including performance metrics, feature importance analysis, and risk distribution charts
Enables real-time risk assessment through an intuitive Streamlit web dashboard
Supports comprehensive data processing with 60+ clinical features from multiple organ transplant datasets

How we built it

Full-stack architecture combining Streamlit with advanced ML:

Frontend: Streamlit-based interactive dashboard with custom medical UI components and visualizations
Backend: Python-based ML pipeline with comprehensive data processing and model training
ML Pipeline: Gradient Boosted Decision Trees (XGBoost/LightGBM) with comprehensive feature engineering
Data Processing: Robust handling of 60+ clinical features including demographics, lab values, donor characteristics, and transplant factors
Model Training: Jupyter notebooks for exploratory data analysis and model development
Multi-organ Support: Liver transplant focus with extensible architecture for other organ types
Deployment: Streamlit-based deployment with integrated Python backend and ML models

Key Features

🎯 Liver Transplant Mortality Prediction

Multiple Prediction Tasks: Binary mortality, survival status, time categories, risk assessment
Advanced ML Models: XGBoost and LightGBM with optimized hyperparameters
Comprehensive Features: 31+ clinical variables including MELD scores, lab values, donor characteristics
High Accuracy: 85-90% prediction accuracy on test data
Real-time Predictions: Instant risk assessment with confidence scores

🧬 HCC Recurrence Prediction

Specialized Model: Ensemble approach combining XGBoost and Neural Networks
Tumor-specific Features: HCC characteristics, vascular invasion, tumor size
2-year Recurrence Prediction: Binary classification for cancer recurrence risk
Clinical Integration: Seamlessly integrated with mortality prediction models

📊 Interactive Dashboard

Model Performance: Real-time accuracy metrics, ROC curves, confusion matrices
Live Predictions: CSV upload and instant prediction capabilities
Feature Analysis: Interactive charts showing feature importance and correlations
Risk Visualization: Distribution charts and risk stratification displays
Professional UI: Medical-grade interface with comprehensive reporting

🔧 Advanced Data Processing

Multi-source Integration: Combines data from 6+ CSV files
Robust Encoding: Handles multiple file encodings (UTF-8, Latin-1, CP1252, ISO-8859-1)
Feature Engineering: Hybrid categorical encoding (One-Hot + Label Encoding)
Data Validation: Comprehensive error handling and fallback mechanisms

Technical Architecture

Data Pipeline

Raw Data (OPTN/UNOS) → Data Processing → Feature Engineering → Model Training → Real-time Predictions

Model Stack

Primary: Gradient Boosted Decision Trees (XGBoost/LightGBM)
Secondary: Neural Networks for HCC recurrence
Ensemble: Combined approaches for enhanced accuracy
Validation: 5-fold cross-validation for robust evaluation

Key Clinical Features

Patient Demographics: Age, Gender, BMI, Ethnicity
Disease Severity: MELD/PELD scores, Lab values (Albumin, Creatinine, Bilirubin, INR)
Donor Characteristics: Donor age, BMI, cause of death, ECD status
Transplant Factors: Cold ischemic time, donor type, procedure type
Clinical Status: Ascites, encephalopathy, diabetes, previous transplants
HCC Features: Tumor characteristics, vascular invasion, recurrence markers

Challenges we ran into

Data complexity: Large, multi-source medical datasets with missing values and encoding issues
Feature engineering: Selecting and processing 60+ clinical features while maintaining medical relevance
Model interpretability: Balancing prediction accuracy with explainable AI for clinical use
Integration complexity: Seamlessly connecting the ML backend with the Streamlit frontend
Real-time performance: Ensuring fast prediction responses for clinical workflow integration
Data validation: Handling inconsistent data formats across multiple organ transplant datasets

Accomplishments that we're proud of

Built a complete end-to-end system from data processing to user interface
Achieved robust ML pipeline with comprehensive feature engineering and model validation
Created intuitive visualizations that make complex medical data accessible to clinicians
Implemented real-time prediction API with proper error handling and fallback mechanisms
Successfully integrated modern web technologies with advanced machine learning
Developed a scalable architecture that can be extended to additional organ types and models
Achieved 85-90% accuracy on liver transplant mortality prediction
Built specialized HCC recurrence prediction with ensemble modeling

What we learned

Medical AI requires careful validation and interpretability for clinical acceptance
Feature engineering is crucial - the right features matter more than complex algorithms
User experience is key - even the best ML model needs an intuitive interface
Robust error handling is essential when dealing with real-world medical data
Integration challenges between different technology stacks require careful planning
Clinical workflows need fast, reliable predictions that fit into existing processes
Multi-organ datasets provide rich context but require sophisticated data processing

What's next for DeepPlantAI

Expand to more organ types (kidney, heart, lung) with specialized prediction models
Add more advanced ML models including deep learning approaches
Implement user authentication and role-based access for different user types
Add batch prediction capabilities for analyzing multiple patients
Integrate with hospital systems and electronic health records
Implement model retraining capabilities for continuous improvement
Add survival analysis with Kaplan-Meier curves and Cox regression
Develop mobile application for point-of-care predictions

Getting Started

Prerequisites

pip install -r src/dashboard/requirements.txt

Running the Dashboard

# Navigate to dashboard directory
cd src/dashboard

# Run the Streamlit dashboard
streamlit run dashboard.py

The dashboard will be available at: http://localhost:8501

Running the Models

# Navigate to models directory
cd src/Models

# Run liver mortality prediction
python run_liver_mortality_gbdt.py

# Run HCC recurrence prediction
python hcc_recurrence_prediction.py

Project Structure

67ers_Hackgt/
├── src/
│   ├── dashboard/           # Streamlit web interface
│   │   ├── dashboard.py     # Main dashboard application
│   │   └── requirements.txt # Dashboard dependencies
│   ├── Models/              # ML models and training
│   │   ├── liver_mortality_gbdt.py      # Main mortality prediction
│   │   ├── hcc_recurrence_prediction.py # HCC recurrence model
│   │   └── requirements_gbdt.txt        # Model dependencies
│   └── data_prep/           # Data processing utilities
├── data/                    # Transplant datasets
│   ├── csv_data/           # Processed CSV files
│   └── dictionaries/       # Data dictionaries
└── README.md               # This file

Medical Disclaimer

Research Only: This tool is for research and educational purposes
Not Clinical: Not intended for clinical decision making
Professional Review: Always consult medical professionals for clinical decisions

License

This project is part of the 67ers Hackathon and is available under the MIT License.

Built with ❤️ by the 67ers Team for HackGT 2024

Built With

python
streamlit

Submitted to

HackGT 12: Midnight at the Museum

Created by

I worked on the dashboard and other UI elements. I was mostly responsible for the front end development of the program. I also worked on a Nodejs version for a more scalable scope.

Anish Sen
I contributed in implementing the neural network and handling a part of the project management.

Akos Vida
bindhu varma
Lokesh Pampatti

Updates

bindhu varma started this project — Sep 28, 2025 07:28 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.