📩 Spam SMS Detection ML Project

Deployed link✨ Here

📌 Project Objective

The goal of this project is to build and deploy a machine learning model that can classify SMS messages as Spam or Ham (Not Spam).
The model is trained using a labeled dataset and deployed for real-world testing.

🛠️ Tech Stack

Python
Scikit-Learn
Pandas, NumPy
Natural Language Processing (NLP)
NLTK
Streamlit (for deployment)

📚 Dataset

Dataset Source: Kaggle - SMS Spam Collection Dataset
Description: Roughly 5,500 SMS messages, each labeled as Spam or Ham (Not Spam).
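
As a rough sketch of the data cleaning stage, the snippet below reads the CSV, keeps the label and message columns, and drops exact duplicates. It assumes the Kaggle file layout, where the label column is named `v1` and the message column `v2`. The helper name `load_messages` and the inline sample rows are illustrative and are not taken from this repository.

```python
import csv
import io


def load_messages(handle):
    """Return de-duplicated (label, text) pairs from an open CSV file.

    Assumes the Kaggle layout: column `v1` holds the label ("ham"/"spam")
    and column `v2` holds the message text.
    """
    seen = set()
    rows = []
    for record in csv.DictReader(handle):
        label = (record.get("v1") or "").strip().lower()
        text = (record.get("v2") or "").strip()
        if label not in {"ham", "spam"} or not text:
            continue  # skip malformed or empty rows
        key = (label, text)
        if key in seen:
            continue  # drop exact duplicates
        seen.add(key)
        rows.append(key)
    return rows


# Tiny inline sample standing in for data/spam.csv (hypothetical rows).
sample = io.StringIO(
    "v1,v2\n"
    "ham,See you at lunch\n"
    "spam,WIN a free prize now\n"
    "ham,See you at lunch\n"
)
messages = load_messages(sample)
print(messages)
```

With the real file, the same helper could be called on `open("data/spam.csv", encoding="latin-1", newline="")`. The `latin-1` encoding is an assumption based on how the Kaggle copy is commonly read.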

📊 Project Stages

Data Cleaning
Exploratory Data Analysis (EDA)
Text Preprocessing (tokenization, stemming, etc.)
Model Building (Naive Bayes, Logistic Regression, etc.)
Model Improvement (TF-IDF vectorization, hyperparameter tuning with GridSearchCV)
Model Evaluation (Accuracy, Precision, Recall, F1 Score)
App Development in PyCharm (served with Streamlit)
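
The vectorization, model building, and tuning stages above can be sketched as a single scikit-learn pipeline. This is a minimal illustration, not the notebook's actual code: the toy messages, the parameter grid, and the scoring choice are all assumptions, and the vectorizer's built-in lowercasing and English stop-word list stand in for the NLTK tokenization and stemming steps.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Hypothetical toy data; the real project trains on spam.csv.
texts = [
    "win a free prize now",
    "free cash offer call now",
    "claim your free reward today",
    "urgent you have won cash",
    "are we still meeting for lunch",
    "see you at home tonight",
    "can you call me later",
    "thanks for the help today",
]
labels = ["spam"] * 4 + ["ham"] * 4

# TF-IDF features feed a Multinomial Naive Bayes classifier.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(lowercase=True, stop_words="english")),
    ("clf", MultinomialNB()),
])

# Illustrative grid: n-gram range for the vectorizer, smoothing for the model.
param_grid = {
    "tfidf__ngram_range": [(1, 1), (1, 2)],
    "clf__alpha": [0.1, 1.0],
}

# cv=2 only because the toy dataset is tiny; use more folds on real data.
search = GridSearchCV(pipeline, param_grid, cv=2, scoring="f1_macro")
search.fit(texts, labels)

print(search.best_params_)
predictions = search.predict(["free prize call now", "see you at lunch"])
print(predictions)
```

Keeping the vectorizer inside the pipeline means it is refit on each training fold during the search, so the validation fold's vocabulary does not leak into training.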

📊 Model Performance

Metric	Score
Accuracy	97.9%
Precision	97.5%
Recall	96%
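
For reference, the metrics in the table (and the F1 score listed under Model Evaluation) follow from the confusion matrix, with Spam treated as the positive class. The sketch below shows the standard formulas. The counts passed in are made up for illustration and are not this project's results, and the function name `classification_metrics` is hypothetical.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F1 with spam as the positive class."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical confusion-matrix counts, not the project's actual results.
metrics = classification_metrics(tp=90, fp=5, fn=10, tn=895)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Precision matters most for a spam filter when a false positive (a real message flagged as spam) is costlier than a missed spam message.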

⚙️ Steps to Run the Project

1. Clone the repository:

git clone https://github.com/BleeGleeWee/Spam-SMS-Detection.git
cd Spam-SMS-Detection

2. Install dependencies:

pip install -r requirements.txt

3. Run the Jupyter Notebook:

jupyter notebook spam-sms-detection.ipynb

4. Run the Streamlit app locally:

streamlit run app.py

🌟 FINAL SHOWDOWN: Project Structure

Email/SMS-spam-classifier
│
├── data/
│   └── spam.csv                         # Original dataset (or link to download in README)
│
├── notebooks/
│   ├── 01_data_cleaning.ipynb           # Handling nulls, duplicates, formatting
│   ├── 02_eda.ipynb                     # Visualizations and exploratory analysis
│   ├── 03_text_preprocessing.ipynb      # Tokenization, stemming, stopword removal
│   ├── 04_model_building.ipynb          # Naive Bayes, Logistic Regression, etc.
│   └── 05_model_improvement.ipynb       # TF-IDF, hyperparameter tuning, evaluation
│
├── models/
│   ├── model.pkl                        # Serialized trained model (pickle)
│   └── vectorizer.pkl                   # Serialized fitted TF-IDF vectorizer (pickle)
│
├── app/
│   ├── app.py                           # App entry point
│   ├── main.py                          # Utility to load the model
│   └── train_model.py                   # Script to train the model
│
├── .gitignore                           # Ignore notebook checkpoints, model files, etc.
├── requirements.txt                     # All dependencies (Streamlit, scikit-learn, etc.)
├── nltk.txt                             # NLTK dependencies (stopwords, punkt)
├── README.md                            # Full documentation 
└── LICENSE                              # MIT or any preferred open-source license
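
To show how the serialized artifacts in `models/` might be used at prediction time, here is a minimal sketch of a loading and inference helper. The function names, default paths, and the idea that raw text is passed straight to the vectorizer are assumptions for illustration; the repository's `main.py` and `app.py` may apply extra preprocessing and may be organized differently.

```python
import pickle


def load_artifacts(model_path="models/model.pkl",
                   vectorizer_path="models/vectorizer.pkl"):
    """Load the pickled vectorizer and classifier (paths are assumptions)."""
    with open(vectorizer_path, "rb") as fh:
        vectorizer = pickle.load(fh)
    with open(model_path, "rb") as fh:
        model = pickle.load(fh)
    return vectorizer, model


def predict_message(text, vectorizer, model):
    """Vectorize one message and return the model's predicted label."""
    features = vectorizer.transform([text])
    return model.predict(features)[0]
```

A typical call would be `vectorizer, model = load_artifacts()` followed by `predict_message("some text", vectorizer, model)`. Whether the returned label is a string or a 0/1 code depends on how the model was trained. Only unpickle files you trust, since `pickle.load` can execute arbitrary code.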
