Using supervised machine learning algorithms to detect suspicious URLs in online social networks

Research Output

The increasing volume of malicious content in social networks requires automated methods to detect and eliminate such content. This paper describes a supervised machine learning classification model that has been built to detect the distribution of malicious content in online social networks (ONSs). Multisource features have been used to detect social network posts that contain malicious Uniform Resource Locators (URLs). These URLs could direct users to websites that contain malicious content, drive-by download attacks, phishing, spam, and scams. For the data collection stage, the Twitter streaming application programming interface (API) was used and VirusTotal was used for labelling the dataset. A random forest classification model was used with a combination of features derived from a range of sources. The random forest model without any tuning and feature selection produced a recall value of 0.89. After further investigation and applying parameter tuning and feature selection methods, however, we were able to improve the classifier performance to 0.92 in recall.

Date:

31 July 2017
Publication Status:

Published
DOI:

10.1145/3110025.3116201
Funders:

Historic Funder (pre-Worktribe)

http://researchrepository.napier.ac.uk/output/2809195 <p>Al-Janabi, M., Quincey, E. D., & Andras, P. (2017). Using supervised machine learning algorithms to detect suspicious URLs in online social networks. In <i>ASONAM '17: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017</i> (1104-1111). https://doi.org/10.1145/3110025.3116201</p>

Citation

Al-Janabi, M., Quincey, E. D., & Andras, P. (2017). Using supervised machine learning algorithms to detect suspicious URLs in online social networks. In ASONAM '17: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017 (1104-1111). https://doi.org/10.1145/3110025.3116201

Authors

Prof Peter Andras

Dean of School of Computing Engineering and the Built Environment
School of Computing Engineering and the Built Environment

0131 455 2456

P.Andras@napier.ac.uk

Monthly Views:

Available Documents

Files currently unavailable for download , please contact repository@napier.ac.uk to request a copy
Downloadable citations
HTML BIB RTF

Date:

Publication Status:

DOI:

Funders:

Citation

Authors

Prof Peter Andras

Monthly Views:

Files currently unavailable for download , please contact repository@napier.ac.uk to request a copy

Downloadable citations