Efficient Spam Classification Using Machine Learning Methods

Authors

  • Zhenghao Miao

DOI:

https://doi.org/10.54097/hset.v34i.5375

Keywords:

Spam Classification, Machine Learning Model, Data Classification.

Abstract

With the development of the times, we receive more and more information every day, and email is an important carrier of receiving information in our daily life. In this context, some spam came out, including some advertisements and false information. Therefore, it is necessary to classify spam and receive it selectively. Machine learning algorithms can classify and identify spam with high efficiency, and in this paper, we compare six models and achieve a top result of 98.7%. Spam classification can filter out unwanted emails at the receiving end, effectively saving people's time and preventing malicious users from stealing personal information.

Downloads

Download data is not yet available.

References

Ali A B M S, Xiang Y. Spam classification using adaptive boosting algorithm [C]//6th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007). IEEE, 2007: 972-976.

Subasi A, Alzahrani S, Aljuhani A, et al. Comparison of decision tree algorithms for spam e-mail filtering[C]//2018 1st International Conference on Computer Applications & Information Security (ICCAIS). IEEE, 2018: 1-5.

Seewald A K. An evaluation of Naive Bayes variants in content-based learning for spam filtering [C]// IOS Press. IOS Press, 2007:497-524.

Tseng C Y, Chen M S. Incremental SVM Model for Spam Detection on Dynamic Email Social Networks [C]// 2009 International Conference on Computational Science and Engineering. IEEE Computer Society, 2009.

Zhang J. Application of Improved KNN Algorithm in Spam E-mail Filtering [J]. New Technology of Library and Information Service, 2007.

Chang M W, Yih W T, Meek C. Partitioned logistic regression for spam filtering [C]// Acm Sigkdd International Conference on Knowledge Discovery & Data Mining. ACM, 2008.

Bao L Q, Chai S H. The Application of Decision Tree in Spam Filtering [J]. Journal of Lanzhou Polytechnic College.

Kumar M. Effective Spam Filtering using Random Forest Machine Learning Algorithm.

Tsang, I. W., Kwok, J. T., Cheung, P. M., & Cristianini, N. (2005). Core vector machines: Fast SVM training on very large data sets. Journal of Machine Learning Research, 6(4).

Lipton, Z. C., Elkan, C., & Narayanaswamy, B. (2014). Thresholding classifiers to maximize F1 score. arXiv preprint arXiv:1402.1892.

Li, J., Cheng, K., Wang, S., Morstatter, F., Trevino, R. P., Tang, J., & Liu, H. (2017). Feature selection: A data perspective. ACM computing surveys (CSUR), 50(6), 1-45.

Renuka, D. K., Hamsapriya, T., Chakkaravarthi, M. R., & Surya, P. L. (2011, July). Spam classification based on supervised learning using machine learning techniques. In 2011 International Conference on Process Automation, Control and Computing (pp. 1-7). IEEE.

Downloads

Published

28-02-2023

How to Cite

Miao, Z. (2023). Efficient Spam Classification Using Machine Learning Methods. Highlights in Science, Engineering and Technology, 34, 60-64. https://doi.org/10.54097/hset.v34i.5375