Email_Classification

E-Mail Classification

GOAL: Classify emails as spam or not-spam on the basis of the message.

WHAT I HAD DONE:

I have started with simple Exploratory Data Analysis(EDA), looked for some null or duplicate vales. Then splited the training and testing dataset using sklearn. Then I implemented different models to classify the message as spam or not-spam.

MODELS USED:

Naive Bayes (Multinominal)
Random Forest Classifier
XG Boost Classifier

LIBRARIES NEEDED:

Numpy
Pandas
Sklearn

CONCLUSION:

Models	Accuracy on Training Set	Accuracy on Testing Set
Naive Bayes (Multinominal)	0.99346	0.96875
Random Forest Classifier	1.0	0.91666
XG Boost Classifier	0.97908	0.95833

Naive Bayes (Multinominal) Training Set Accuracy	Naive Bayes (Multinominal) Testing Set Accuracy

Name		Name	Last commit message	Last commit date
parent directory ..
Dataset		Dataset
Model		Model
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

E-Mail Classification

Uh oh!

FilesExpand file tree

Email_Classification

Directory actions

More options

Directory actions

More options

Latest commit

History

Email_Classification

Folders and files

parent directory

README.md

E-Mail Classification