A Machine Learning Framework for Automated Spam E-mail Classification
PDF
PDF

How to Cite

Patil, Pranav, Sonu H.T., Prajwal S., and Sachin B. 2026. “A Machine Learning Framework for Automated Spam E-Mail Classification”. Recent Research Reviews Journal 5 (1): 1-14. https://doi.org/10.36548/rrrj.2026.1.001.

Keywords

Spam Detection
Email Classification
Decision Tree
SVM
Naive Bayes
LSTM
Machine Learning
Deep Learning
Keyword Filtering

Abstract

Spam mails are some of the critical issues in cybersecurity. In this regard, they have adverse effects on the use of resources and productivity of users. This study suggests an artificial intelligence technique in classifying spam mail by utilizing machine learning and deep learning methods. Such include decision tree, naive bayes, support vector machine, and deep learning known as LSTM. Performance analysis of the model will be dependent on aspects such as accuracy, precision, recall, and F1 score among others. Based on the results in this study, the deep learning techniques such as LSTM provide high accuracy rates of about 98.5%. Similarly, the integration of the Convolutional Neural Network and LSTM offers high accuracy of about 99%. However, Naive Bayes shows low accuracy in the short time frame of training.

References

  1. Asliyuksek, Halim, Ozgur Tonkal, and Ramazan Kocaoglu. "A Comparative Evaluation of a Multimodal Approach for Spam Email Classification Using DistilBERT and Structural Features." Electronics 14, no. 19 (2025): 3855.
  2. Lakshmi, R. Deepa, and N. Radha. "Spam Classification Using Supervised Learning Techniques." In Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India, 2010, 1-4.
  3. Awad, Wael Abou, and S. M. ELseuofi. "Machine Learning Methods for Spam E-Mail Classification." International Journal of Computer Science & Information Technology (IJCSIT) 3, no. 1 (2011): 173-184.
  4. Mallampati, Deepika, and Nagaratna P. Hegde. "A Machine Learning Based Email Spam Classification Framework Model: Related Challenges and Issues." International Journal of Innovative Technology and Exploring Engineering 9, no. 4 (2020): 3137-3144.
  5. Junnarkar, Akash, Siddhant Adhikari, Jainam Fagania, Priya Chimurkar, and Deepak Karia. "E-mail Spam Classification via Machine Learning and Natural Language Processing." In 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), 693-699.
  6. Siddique, Zeeshan Bin, Mudassar Ali Khan, Ikram Ud Din, Ahmad Almogren, Irfan Mohiuddin, and Shah Nazir. "Machine Learning‐Based Detection of Spam Emails." Scientific Programming 2021, no. 1 (2021): 6508784.
  7. Dada, Emmanuel Gbenga, Joseph Stephen Bassi, Haruna Chiroma, Shafi'I. Muhammad Abdulhamid, Adebayo Olusola Adetunmbi, and Opeyemi Emmanuel Ajibuwa. "Machine Learning for Email Spam Filtering: Review, Approaches and Open Research Problems." 2019, Heliyon 5, no. 6.
  8. Rayan, Alanazi. "Analysis of E‐Mail Spam Detection Using a Novel Machine Learning‐Based Hybrid Bagging Technique." Computational Intelligence and Neuroscience 2022, no. 1 (2022): 2500772.
  9. Mansoor, R. A. Z. A., Nathali Dilshani Jayasinghe, and Muhana Magboul Ali Muslam. "A Comprehensive Review on Email Spam Classification Using Machine Learning Algorithms." In 2021 International conference on information networking (ICOIN), 327-332.
  10. Srinivasan, Sriram, Vinayakumar Ravi, Mamoun Alazab, Simran Ketha, Ala’M. Al-Zoubi, and Soman Kotti Padannayil. "Spam Emails Detection Based on Distributed Word Embedding with Deep Learning." In Machine intelligence and big data analytics for cybersecurity applications, Cham: Springer International Publishing, 2020, 161-189.