Skip to content
View liamge's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report liamge

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
liamge/README.md

Liam Geron

AI Systems | Machine Learning | LLM & Decision Intelligence

Senior Data Scientist focused on building production-grade AI systems that drive measurable business outcomes.

My work sits at the intersection of predictive modeling, NLP, and LLM-powered applications, with a strong emphasis on deployment, evaluation, and real-world impact. I specialize in turning messy, high-volume data (text, PDFs, behavioral signals) into systems that improve revenue, retention, and decision-making.


🔬 Focus Areas

  • Churn modeling and revenue risk estimation
  • Causal inference and uplift modeling
  • Retrieval-augmented generation (RAG) systems
  • Large-scale text and document processing (PDF pipelines)
  • Production ML systems and MLOps
  • Decision intelligence and experimentation

🚀 Featured Projects

Causal Uplift Modeling + ROI Dashboard

End-to-end system for estimating treatment effect (who to target) and translating it into expected business impact.

  • Uplift modeling to identify persuadable vs non-persuadable users
  • Policy targeting simulation (who to intervene on)
  • ROI estimation based on incremental lift
  • Interactive dashboard for decision-making
  • Designed to mirror real-world marketing / retention use cases

Tech: Python • Scikit-Learn • XGBoost • Streamlit

👉 https://causal-uplift-modeling.streamlit.app/


Churn & Retention Risk Workbench

Production-style ML system for predicting churn and quantifying revenue at risk.

  • Modular training + inference pipelines
  • Behavioral feature engineering (engagement, activity, value trends)
  • SHAP-based explainability for business users
  • Revenue impact modeling and segmentation
  • Streamlit dashboard for stakeholder insights

Tech: Python • XGBoost • Scikit-Learn • MLflow • Streamlit

👉 https://retention-risk-workbench.streamlit.app/


Grounded Conversation RAG System

Production-style RAG application for answering questions over internal knowledge sources.

  • Document ingestion + chunking pipelines
  • Vector search + retrieval optimization
  • LLM-based answer generation with grounding
  • Deployed demo with end-to-end flow

Tech: OpenAI • FAISS • LangChain • Python

👉 https://grounded-conversation-rag.streamlit.app/


NLP Augmentation Toolkit

Lightweight toolkit for improving NLP model robustness via data augmentation.

  • Back translation
  • Synonym replacement
  • Embedding-based perturbations

🧠 Technical Stack

Languages
Python • SQL • R • PySpark

Machine Learning
Scikit-Learn • XGBoost • PyTorch • TensorFlow

LLM / NLP
OpenAI API • LangChain • HuggingFace • spaCy • RAG systems

Data & MLOps
Databricks • MLflow • Airflow • Spark • Feature Stores • Docker

Cloud
AWS • Azure • GCP

Visualization
Tableau • PowerBI • Matplotlib


📈 Selected Impact

  • Built NLP + LLM systems that significantly improved conversion in customer conversations
  • Designed large-scale PDF ingestion pipelines for structured data extraction
  • Developed RAG systems for internal knowledge retrieval and decision support
  • Delivered ML systems with clear business framing (revenue at risk, uplift, targeting strategy)

📫 Connect

LinkedIn
https://www.linkedin.com/in/liam-geron/

Pinned Loading

  1. retention-risk-workbench retention-risk-workbench Public

    Jupyter Notebook

  2. causal-uplift-modeling causal-uplift-modeling Public

    Python

  3. grounded-conversation-rag grounded-conversation-rag Public

    Python

  4. nlp-text-augmentation-toolkit nlp-text-augmentation-toolkit Public

    Python