Spam Email Classification

Description

This project uses Natural Language Processing (NLP) and Machine Learning techniques to classify emails as Spam or Ham. It includes data preprocessing, model training, evaluation, and a web-based deployment using Streamlit.

Features

Preprocesses email content for classification.
Classifies emails as spam or ham with a trained machine learning model.
Provides a user-friendly web interface for real-time email classification.

How It Works

Step 1: Data Collection and Preprocessing

Load the dataset and clean the data by removing unnecessary columns and handling null values.
Map labels (ham and spam) to numerical values for machine learning.

Step 2: Feature Engineering

Convert email text into numerical features using CountVectorizer (bag-of-words approach).

Step 3: Model Selection

The Multinomial Naive Bayes algorithm is used for its efficiency in text classification tasks.

Step 4: Model Training

Train the Naive Bayes model using the preprocessed and vectorized data.

Step 5: Evaluation

Evaluate the model's accuracy on a test dataset.

Step 6: Deployment

Save the trained model and vectorizer using Pickle.
Build and deploy the classification interface using Streamlit.

Run the app with the following command:

streamlit run SpamDetect.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Spam Detector.ipynb		Spam Detector.ipynb
SpamDetect.py		SpamDetect.py
spam.csv		spam.csv
spam.pkl		spam.pkl
spam123.pkl		spam123.pkl
vec.pkl		vec.pkl
vec123.pkl		vec123.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spam Email Classification

Description

Features

How It Works

Step 1: Data Collection and Preprocessing

Step 2: Feature Engineering

Step 3: Model Selection

Step 4: Model Training

Step 5: Evaluation

Step 6: Deployment

About

Releases

Packages

Languages

hbSrujana/Spam-Email-Classification

Folders and files

Latest commit

History

Repository files navigation

Spam Email Classification

Description

Features

How It Works

Step 1: Data Collection and Preprocessing

Step 2: Feature Engineering

Step 3: Model Selection

Step 4: Model Training

Step 5: Evaluation

Step 6: Deployment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages