Topic-aware neural attention network for malicious social media spam detection

Nasser, Maged; Saeed, Faisal; Da’u, Aminu; Alblwi, Abdulaziz; Al-Sarem, Mohammed

Nasser, Maged and Saeed, Faisal and Da’u, Aminu and Alblwi, Abdulaziz and Al-Sarem, Mohammed (2024) Topic-aware neural attention network for malicious social media spam detection. Alexandria Engineering Journal, 111. pp. 540-554. ISSN 1110-0168

Text
1-s2.0-S1110016824012389-main.pdf - Published Version
Available under License Creative Commons Attribution.

Official URL: https://doi.org/10.1016/j.aej.2024.10.073

Abstract

Social media platforms, such as Facebook and X (formerly known as Twitter), have become indispensable tools in today's society because they facilitate social discussion and information sharing. This openness also makes social networks attractive to spammers, who intentionally spread fake messages, post malicious links and spread rumours. Recently, several machine learning methods have been introduced for malicious spam classification on social networks. However, most existing methods rely on handcrafted features and traditional embedding models, which are relatively less effective. Therefore, inspired by the success of neural attention networks, we propose an interactive neural attention-based method for malicious spam detection that integrates long short-term memory (LSTM), topic modelling, and the BERT technique. In the proposed approach, first, we employed an LSTM encoder, which was integrated with the Twitter latent Dirichlet allocation (LDA) model via an interactive attention mechanism to jointly learn local content and global topic representations. Second, to learn the contextualized features of texts, the model was further integrated with the BERT technique. Last, the Softmax function was applied at the output layer for the final spam classification. A series of experiments was conducted on two real-world datasets to evaluate the model. On dataset 1, the proposed model outperformed the baseline techniques, with average improvements in recall, precision, F1, and accuracy of 17.54 %, 6.19 %, 11.91 %, and 12.27 %, respectively. The proposed model also performed well on the second dataset, obtaining average gains of 11.81 %, 4.38 %, 8.12 %, and 7.42 % in recall, precision, F1, and accuracy, respectively.
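
The fusion step described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation. The toy vectors merely stand in for LSTM hidden states, an LDA topic vector, and a BERT sentence embedding, and every function name, dimension, and weight below is hypothetical. The attention shown is a one-directional simplification (the topic vector attends over token states), whereas the paper describes an interactive mechanism.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def topic_attention_pool(hidden_states, topic_vec):
    """Weight each token state by its similarity to the topic vector.

    Assumes the topic vector has already been projected to the same
    dimension as the hidden states (an assumption of this sketch).
    """
    weights = softmax([dot(h, topic_vec) for h in hidden_states])
    dim = len(hidden_states[0])
    pooled = [
        sum(w * h[i] for w, h in zip(weights, hidden_states))
        for i in range(dim)
    ]
    return pooled, weights


def classify(hidden_states, topic_vec, bert_vec, W, b):
    """Concatenate pooled content, topic, and contextual features,
    apply one linear layer, and return class probabilities."""
    pooled, _ = topic_attention_pool(hidden_states, topic_vec)
    features = pooled + topic_vec + bert_vec  # list concatenation
    logits = [dot(row, features) + bias for row, bias in zip(W, b)]
    return softmax(logits)


# Toy inputs; all values are made up for illustration.
hidden_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # stand-in LSTM states
topic_vec = [2.0, 0.0]                                 # stand-in LDA topic vector
bert_vec = [0.5, -0.5]                                 # stand-in BERT embedding
W = [
    [1.0, 0.0, 0.5, 0.0, 0.0, 0.0],  # weights for class 0 (e.g. ham)
    [0.0, 1.0, 0.0, 0.0, 0.5, 0.0],  # weights for class 1 (e.g. spam)
]
b = [0.0, 0.0]

pooled, weights = topic_attention_pool(hidden_states, topic_vec)
probs = classify(hidden_states, topic_vec, bert_vec, W, b)
```

Here `weights` shows how strongly each token state aligns with the topic, and `probs` is a two-class probability vector. In a real system the weights would be learned, and the three input representations would come from trained LSTM, LDA, and BERT components.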

Item Type:	Article
Identification Number:	10.1016/j.aej.2024.10.073
Dates:	Accepted 16 October 2024; Published Online 29 October 2024
Uncontrolled Keywords:	Spam detection, Topic modelling, Attention neural network, Malicious detection, Bidirectional encoder representations from transformers (BERT), Online social network
Subjects:	CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions:	Architecture, Built Environment, Computing and Engineering > Computer Science
Depositing User:	Gemma Tonks
Date Deposited:	13 Nov 2024 15:58
Last Modified:	13 Nov 2024 15:58
URI:	https://www.open-access.bcu.ac.uk/id/eprint/15963