NLP-Powered Classification with 100% Accuracy
An NLP-powered email spam detector that achieves perfect classification using machine learning. Trained on 320 emails, the system uses TF-IDF vectorization and multiple algorithms to identify spam patterns with 100% accuracy.
Built using Python, scikit-learn, and natural language processing techniques, this project demonstrates text preprocessing, feature extraction, and multi-model comparison.
Top 15 words that appear most frequently in spam emails. "Free", "cash", and "prizes" are the strongest indicators of spam content.
All three models (Naive Bayes, Logistic Regression, Random Forest) achieved perfect 100% accuracy on the test set.
Perfect classification with zero false positives or false negatives on 64 test emails.
The trained model is saved and ready to classify new emails. The spam detector can be used to automatically filter incoming messages.
Quick Start:
Example Classifications:
Key Features:
spam_detector.pkl)