
farzanaa99/HealthInsurancePredictor


🏥 Health Insurance Charges Predictor

Predict health insurance costs using Linear Regression on demographic and health features. This project showcases end-to-end machine learning, from data cleaning and feature engineering to visualization and error analysis. Interaction terms capture combined effects like smoking × BMI and smoking × age.
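As an illustration of the interaction terms mentioned above, here is a minimal sketch of how smoking × BMI and smoking × age columns could be built with pandas. The column names (`age`, `bmi`, `smoker`), the "yes"/"no" encoding, and the helper `add_interaction_terms` are assumptions made for this example and are not taken from the repository's code.

```python
import pandas as pd


def add_interaction_terms(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with a 0/1 smoker flag and two interaction columns.

    Hypothetical helper: assumes columns named "age", "bmi", and "smoker",
    with smoker stored as the strings "yes" / "no".
    """
    out = df.copy()
    # Encode the categorical smoker column as 0/1 so it can be multiplied.
    out["smoker_flag"] = (out["smoker"] == "yes").astype(int)
    # Interaction terms: nonzero only for smokers.
    out["smoker_x_bmi"] = out["smoker_flag"] * out["bmi"]
    out["smoker_x_age"] = out["smoker_flag"] * out["age"]
    return out


# Two made-up rows, for illustration only (not the project's data).
sample = pd.DataFrame(
    {"age": [25, 40], "bmi": [22.0, 31.5], "smoker": ["no", "yes"]}
)
features = add_interaction_terms(sample)
print(features[["smoker_flag", "smoker_x_bmi", "smoker_x_age"]])
```

Because the interaction columns are zero for non-smokers, a linear model can learn a separate BMI slope and age slope for smokers without affecting the fit for everyone else.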

🔍 Project Description

The model predicts insurance charges based on age, BMI, smoking status, and interaction features. Linear regression quantifies the relationship between each predictor and charges. The project also includes visualizations, correlation analysis, and detailed error metrics to evaluate performance.
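To show how a linear model quantifies these relationships, the sketch below fits scikit-learn's `LinearRegression` on synthetic, noise-free data generated from made-up coefficients. The data, the coefficient values, and the feature names are all assumptions for illustration; they say nothing about the project's real dataset or results.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic predictors (illustrative only, not the project's data).
rng = np.random.default_rng(0)
n = 200
age = rng.integers(18, 65, size=n).astype(float)
bmi = rng.uniform(18.0, 40.0, size=n)
smoker = rng.integers(0, 2, size=n).astype(float)  # 0 = non-smoker, 1 = smoker

# Made-up, noise-free relationship so the fitted coefficients are easy to check.
charges = 1000.0 + 250.0 * age + 30.0 * bmi + 400.0 * smoker * bmi

# Design matrix: main effects plus the two interaction terms.
names = ["age", "bmi", "smoker", "smoker_x_bmi", "smoker_x_age"]
X = np.column_stack([age, bmi, smoker, smoker * bmi, smoker * age])

model = LinearRegression().fit(X, charges)
coef = dict(zip(names, model.coef_))

# Each coefficient is the change in predicted charges per unit of that feature,
# holding the others fixed.
for name, value in coef.items():
    print(f"{name:>13}: {value:10.3f}")
print(f"{'intercept':>13}: {model.intercept_:10.3f}")
```

With an interaction term in the model, the BMI slope for a smoker is `coef["bmi"] + coef["smoker_x_bmi"]`, while for a non-smoker it is `coef["bmi"]` alone.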

✨ Features

📊 Linear Regression Model

⚡ Feature Engineering – Interaction terms for combined effects.

🧹 Data Cleaning – Handles missing values, encodes categorical variables.

🔎 Exploratory Data Analysis (EDA) – Scatter plots, boxplots, histograms, correlations.

📉 Error Analysis – MAE, RMSE, R², and percent error across age groups and smoker status.

📈 Visualizations – Actual vs. predicted insurance charges.
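The error analysis listed above could look roughly like the following sketch, which computes MAE, RMSE, R², and a mean absolute percent error overall and per smoker group. The helper `error_report`, the column names, the percent-error definition, and the four rows of numbers are assumptions made up for this example; they are not results from the project.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score


def error_report(y_true, y_pred) -> dict:
    """Hypothetical helper: summary error metrics for one group of predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return {
        "MAE": float(mean_absolute_error(y_true, y_pred)),
        "RMSE": float(np.sqrt(mean_squared_error(y_true, y_pred))),
        "R2": float(r2_score(y_true, y_pred)),
        # Assumed definition: mean of |actual - predicted| / actual, in percent.
        "mean_pct_error": float(np.mean(np.abs(y_true - y_pred) / y_true) * 100),
    }


# Made-up actual and predicted charges, for illustration only.
results = pd.DataFrame(
    {
        "smoker": ["no", "no", "yes", "yes"],
        "actual": [1000.0, 2000.0, 20000.0, 30000.0],
        "predicted": [1100.0, 1800.0, 22000.0, 27000.0],
    }
)

overall = error_report(results["actual"], results["predicted"])
by_smoker = {
    group: error_report(part["actual"], part["predicted"])
    for group, part in results.groupby("smoker")
}

print("overall:", overall)
for group, metrics in by_smoker.items():
    print(group, metrics)
```

The same pattern extends to age groups by first binning ages with `pd.cut` and grouping on the resulting column. Reporting a relative metric next to MAE and RMSE helps when groups differ widely in typical charges, since absolute errors alone are dominated by the higher-cost group.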

⚙️ Tech Stack

Python

Pandas & NumPy

Matplotlib & Seaborn

scikit-learn

About

A machine learning project exploring predictive modeling with Python. It predicts health insurance charges using linear regression. Features include age, BMI, smoking status, and engineered interaction terms to improve predictions. It includes error analysis and visualizations for insight into model performance.
