Skip to content

himanshu748/feb_challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏏 The Evolution of Indian Cricket: How IPL Transformed a Nation's Game

Codedex February 2026 Dataset Challenge Submission

Python Pandas Plotly


πŸ“– The Story

The Indian Premier League (IPL) launched in 2008 and fundamentally changed cricket forever. But beyond the fireworks and celebrity owners, what does the data actually tell us?

This project analyzes 278,205 ball-by-ball deliveries across 1,169 IPL matches spanning 17 seasons (2008–2025) to answer three key questions:

πŸ” Q1: How has batting evolved? Are teams scoring faster than ever?
πŸ” Q2: Has the bat-vs-ball balance shifted? Are bowlers endangered?
πŸ” Q3: Is the toss advantage a myth or reality?

🎯 Key Findings

# Finding Evidence
1 The Run Explosion is Real Average run rates have surged 25%+ since 2008, driven by a near-doubling of six-hitting frequency
2 Bowlers Have Adapted, Not Died Despite soaring economy rates, wickets per match remain stable
3 The Toss is (Mostly) Irrelevant Toss advantage hovers around 50%, though chasing has become preferred in recent seasons

πŸ’‘ Surprise "Aha" Moment

Death overs (16-20) run rates have exploded from ~8.5 RPO to 11+ RPO β€” the last 5 overs have genuinely become a different game, driven by the rise of finishers like MS Dhoni, Hardik Pandya, and Rinku Singh.

πŸ“Š Visualizations

The notebook contains 10 interactive Plotly visualizations:

  1. πŸ“ˆ The Run Explosion β€” Avg runs per match & run rate trend (dual-axis)
  2. πŸ’₯ The Boundary Revolution β€” Fours vs Sixes per 100 balls (area chart)
  3. 🎳 The Bowlers Plight β€” Economy rates vs Wickets per match (subplot)
  4. πŸ‘‘ All-Time Run Scorers β€” Top 15 batters colored by strike rate
  5. πŸͺ™ The Great Toss Debate β€” Toss win % & Chase vs Bat-first analysis
  6. πŸ† Franchise Dominance Map β€” Team wins heatmap across seasons
  7. 🏟️ Run-Scoring Grounds β€” Venue comparison with six-hitting data
  8. 🌟 Mr. Dependable β€” Most Player of the Match awards
  9. ⚑ Three Phases of T20 β€” Powerplay vs Middle vs Death over scoring
  10. 🎬 Animated Evolution β€” Strike rate vs Average scatter (play button!)

πŸ“‚ Project Structure

feb_challenge/
β”œβ”€β”€ IPL_Evolution_Analysis.ipynb    # πŸ““ Main analysis notebook (submission)
β”œβ”€β”€ README.md                       # πŸ“‹ This file
β”œβ”€β”€ data/
β”‚   β”œβ”€β”€ ipl_raw/                    # πŸ“¦ Raw Cricsheet match files (1,169 Γ— 2)
β”‚   β”œβ”€β”€ ipl_deliveries.csv          # 🏏 278,205 ball-by-ball records
β”‚   β”œβ”€β”€ ipl_matches.csv             # πŸ“Š 1,169 match summaries
β”‚   β”œβ”€β”€ ipl_batting_stats.csv       # 🏏 Player batting stats by season
β”‚   └── ipl_bowling_stats.csv       # 🎳 Player bowling stats by season
β”œβ”€β”€ scripts/
β”‚   β”œβ”€β”€ process_data.py             # πŸ”§ Raw data β†’ clean datasets pipeline
β”‚   └── create_notebook.py          # πŸ““ Notebook generator script
└── visuals/                        # πŸ“Έ Exported chart images

πŸ› οΈ How to Run

Prerequisites

pip install pandas numpy plotly matplotlib seaborn jupyter

Step 1: Process the data

python scripts/process_data.py

Step 2: Open the notebook

jupyter notebook IPL_Evolution_Analysis.ipynb

πŸ“‹ Data Source

  • Source: Cricsheet.org β€” Open-source ball-by-ball cricket data
  • Format: CSV (ball-by-ball + match info files)
  • Processing: Raw match CSVs (2,338 files) parsed and combined into 4 structured datasets using Python/Pandas
  • Total Data Points: 278,205 deliveries across 1,169 matches

πŸ… Challenge Categories Targeted

Category How
πŸ† Best Storyteller Full end-to-end narrative with clear questions, analysis, and conclusions
😍 Best Data Visualization 10 interactive Plotly charts with premium dark theme
πŸ’‘ Sherlock "Aha" Moment Death overs revolution insight + Six-hitting explosion
πŸ’Œ Best Original Dataset Processed 2,338 raw Cricsheet files into clean datasets

Made with ❀️ and 🏏 for the Codedex February 2026 Dataset Challenge

About

🏏 IPL Evolution Analysis β€” Codedex Feb 2026 Dataset Challenge | 278K+ deliveries, 1169 matches, 17 seasons of data-driven cricket storytelling

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors