A Hybrid Feature Selection Method for Advanced Persistent Threat Detection

Khalid, Adam and Zainal, Anazida and Ghaleb, Fuad A. and Ali Saleh Al-rimy, Bander and Ahmed, Yussuf (2025) A Hybrid Feature Selection Method for Advanced Persistent Threat Detection. Computers, Materials and Continua. ISSN 1546-2218

Preview

Text
TSP_CMC_63451.pdf - Published Version
Available under License Creative Commons Attribution.
Download (1MB)

Official URL: https://www.techscience.com/cmc/online/detail/2372...

Abstract

Advanced Persistent Threats (APTs) represent one of the most complex and dangerous categories of cyber-attacks characterised by their stealthy behaviour, long-term persistence, and ability to bypass traditional detection systems. The complexity of real-world network data poses significant challenges in detection. Machine learning models have shown promise in detecting APTs; however, their performance often suffers when trained on large datasets with redundant or irrelevant features. This study presents a novel, hybrid feature selection method designed to improve APT detection by reducing dimensionality while preserving the informative characteristics of the data. It combines Mutual Information (MI), Symmetric Uncertainty (SU) and Minimum Redundancy Maximum Relevance (mRMR) to enhance feature selection. MI and SU assess feature relevance, while mRMR maximises relevance and minimises redundancy, ensuring that the most impactful features are prioritised. This method addresses redundancy among selected features, improving the overall efficiency and effectiveness of the detection model. Experiments on a real-world APT datasets were conducted to evaluate the proposed method. Multiple classifiers including, Random Forest, Support Vector Machine (SVM), Gradient Boosting, and Neural Networks were used to assess classification performance. The results demonstrate that the proposed feature selection method significantly enhances detection accuracy compared to baseline models trained on the full feature set. The Random Forest algorithm achieved the highest performance, with near-perfect accuracy, precision, recall, and F1 scores (99.97). The proposed adaptive thresholding algorithm within the selection method allows each classifier to benefit from a reduced and optimised feature space, resulting in improved training and predictive performance. This research offers a scalable and classifier-agnostic solution for dimensionality reduction in cybersecurity applications.

Item Type:	Article
Identification Number:	10.32604/cmc.2025.063451
Dates:	Date Event 25 June 2025 Accepted 8 July 2025 Published Online
Uncontrolled Keywords:	Advanced persistent threats, hybrid-based techniques, feature selection, data processing, symmetric uncertainty, mutual information, minimum redundancy, APT detection
Subjects:	CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions:	Architecture, Built Environment, Computing and Engineering > Computer Science
Depositing User:	Gemma Tonks
Date Deposited:	09 Jul 2025 12:59
Last Modified:	09 Jul 2025 12:59
URI:	https://www.open-access.bcu.ac.uk/id/eprint/16509