🏹 The Arc: NBA Career Predictor

The Arc is a small Streamlit app that predicts an NBA player’s Year 5 points per game (PPG) using their Year 2 “sophomore season” stats.

The idea: the jump from Year 1 → Year 2 contains strong signals about a player’s long‑term ceiling. This app turns those “sophomore signals” into a simple Year 5 projection.

🔧 What the app does

Lets you search for an NBA player and:
- See their Year 2 stats (PPG, RPG, APG, efficiency, minutes, etc.).
- Get Year 5 PPG predictions from two different modeling paths.
- Compare those predictions to the actual Year 5 PPG (when available).
Groups players into data‑driven archetypes (via clustering) so you can see what “type” of player they are.
Shows model performance:
- Error distributions for each path.
- Feature importance to see which stats drive the predictions.
- Best and worst individual predictions.

🧠 Modeling overview

The project runs two competing modeling strategies:

Path 1 – Baseline

Uses a small, standard feature set:
- Year 2 box score stats (points, rebounds, assists, minutes, shooting splits).
- Simple growth metrics (deltas from Year 1).
- Basic context (draft position, AST/TOV).
Trains a regression model with default hyperparameters.
Goal: fast, interpretable, “good enough” baseline.

Path 2 – Advanced

Uses all Path 1 features plus engineered features, such as:
- Skill Diversity Index (improvement across multiple categories).
- Usage‑to‑Efficiency ratio.
- Draft overperformance.
- Minutes trajectory.
- Free‑throw improvement.
Adds hyperparameter tuning to squeeze out extra accuracy.
Goal: maximum accuracy and richer basketball intuition.

📊 What you can explore in the app

Home
High‑level project overview and quick comparison of Path 1 vs Path 2 performance.
Scouting Report
- Select a player and see:
  - Year 2 stats.
  - Dual Path 1 vs Path 2 Year 5 predictions.
  - Actual Year 5 PPG and errors (when data exists).
- Career trajectory chart and archetype radar.
The DNA Explorer
- View the 5 data‑driven archetypes.
- See average stats per archetype.
- Explore a 2D map of players colored by archetype.
Model Analysis
- Detailed methodology for both paths.
- Side‑by‑side metrics (MAE, R², training time, overfitting checks).
- Feature importance charts.
- Error histograms and best/worst predictions.

🚀 Tech stack

Python
Streamlit for the web app UI
pandas / NumPy for data handling
scikit‑learn for modeling + clustering
Plotly for visualizations

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data collection		data collection
data		data
images		images
streamlit		streamlit
updated_models		updated_models
.python-version		.python-version
README.md		README.md
debug.py		debug.py
idea.MD		idea.MD
requirements.txt		requirements.txt
runtime.txt		runtime.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏹 The Arc: NBA Career Predictor

🔧 What the app does

🧠 Modeling overview

Path 1 – Baseline

Path 2 – Advanced

📊 What you can explore in the app

🚀 Tech stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🏹 The Arc: NBA Career Predictor

🔧 What the app does

🧠 Modeling overview

Path 1 – Baseline

Path 2 – Advanced

📊 What you can explore in the app

🚀 Tech stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages