Malicious Website Detection using Machine Learning on Apache Spark

Document Type : Original Article

Authors

Department of Computer Science and Engineering Menoufia University Menoufia, Egypt

Abstract

Malicious websites considered as critical threats to the users’ systems that access these websites as hackers use these websites to steal users’ personal information or account information or even harms the users’ systems. Many solutions have been developed to detect and prevent these malicious websites, but these solutions are not fully effective as these websites are changed continuously. This paper evaluates various classification algorithms to predict malicious and non- malicious web sites, based on various feature selection scenarios. Reasonable results are reached with 100% accuracy, recall, and precision when applying Logistic Regression and Decision Tree algorithms while 95% when applying Naïve Bayes algorithm with good timing.

Volume 28, ICEEM2019-Special Issue
ICEEM2019-Special Issue: 1st International Conference on Electronic Eng., Faculty of Electronic Eng., Menouf, Egypt, 7-8 Dec.
2019
Pages 337-342